Computational Biology of Noncoding Genomes
Publications
3888256
CBNG
1
chicago-author-date
50
date
desc
year
54195
https://www.i2bc.paris-saclay.fr/wp-content/plugins/zotpress/
%7B%22status%22%3A%22success%22%2C%22updateneeded%22%3Afalse%2C%22instance%22%3Afalse%2C%22meta%22%3A%7B%22request_last%22%3A0%2C%22request_next%22%3A0%2C%22used_cache%22%3Atrue%7D%2C%22data%22%3A%5B%7B%22key%22%3A%22IEDFJYAN%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Hak%20et%20al.%22%2C%22parsedDate%22%3A%222026-05-03%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BHak%2C%20Fiona%2C%20Camille%20Marchet%2C%20Daniel%20Gautheret%2C%20and%20M%26%23xE9%3Blina%20Gallopin.%202026.%20%26%23x201C%3BMetappuccino%3A%20Large%20Language%20Model-Driven%20Reconstruction%20of%20Sequence%20Read%20Archive%20Metadata%20for%20Cancer%20Research.%26%23x201D%3B%20%26lt%3Bi%26gt%3BBioinformatics%26lt%3B%5C%2Fi%26gt%3B%20%28Oxford%2C%20England%29%2042%20%285%29%3A%20btag166.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fbioinformatics%5C%2Fbtag166%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fbioinformatics%5C%2Fbtag166%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Metappuccino%3A%20large%20language%20model-driven%20reconstruction%20of%20sequence%20read%20archive%20metadata%20for%20cancer%20research%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Fiona%22%2C%22lastName%22%3A%22Hak%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Camille%22%2C%22lastName%22%3A%22Marchet%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Gautheret%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M%5Cu00e9lina%22%2C%22lastName%22%3A%22Gallopin%22%7D%5D%2C%22abstractNote%22%3A%22MOTIVATION%3A%20High-throughput%20RNA%20sequencing%20has%20significantly%20advanced%20transcriptomic%20profiling%20in%20oncology.%20Millions%20of%20RNA-seq%20datasets%20have%20accumulated%20in%20public%20databases%20such%20as%20the%20Sequence%20Read%20Archive%20%28SRA%29.%20However%2C%20fragmented%2C%20ambiguous%2C%20or%20missing%20metadata%20can%20severely%20limit%20accurate%20cohort%20selection%2C%20introduce%20bias%2C%20and%20delay%20discoveries.%5CnRESULTS%3A%20To%20address%20these%20issues%2C%20we%20introduce%20%26%23039%3BMetappuccino%26%23039%3B%2C%20a%20hybrid%20metadata%20enrichment%20tool%20built%20on%20Mistral-7B-Instruct%20and%20specialized%20via%20low-rank%20adaptation%20%28LoRA%29.%20Metappuccino%20reconstructs%2019%20metadata%20classes%20%28e.g.%20organ%2C%20disease%2C%20cell%20type%29%20by%20combining%20deterministic%20extraction%5C%2Fnormalization%20with%20model-based%20completion%3A%204%20submission-mandatory%20fields%20are%20read%20directly%20from%20SRA%5C%2FAPI%20records%2C%20while%20the%20remaining%2015%20classes%20are%20obtained%20through%20validated%20rule-based%20extraction%20when%20explicitly%20supported%20by%20the%20context%20and%20otherwise%20predicted%20by%20the%20LoRA-specialized%20model%20when%20information%20is%20missing%20or%20ambiguous.%20To%20promote%20robust%2C%20context-aware%20inference%20rather%20than%20memorization%2C%20we%20designed%20training%20and%20data%20partitioning%20to%20minimize%20leakage%20and%20preserve%20generalization.%20When%20applicable%2C%20predicted%20values%20are%20mapped%20to%20standardized%20ontologies%20to%20ensure%20consistent%2C%20interoperable%20annotations.%20Across%20our%20benchmarks%2C%20Metappuccino%20substantially%20improves%20accuracy%20over%20the%20base%20model%2C%20matches%20or%20exceeds%20recent%20larger%20open-source%20LLMs%2C%20and%20reduces%20inference%20time%20by%20up%20to%20two-fold%20relative%20to%20these%20baselines.%20By%20enriching%20under-annotated%20public%20RNA-seq%20records%2C%20Metappuccino%20increases%20the%20usability%20of%20SRA%20datasets%20for%20large-scale%20reuse%2C%20with%20applications%20that%20extend%20beyond%20oncology%20transcriptomics.%5CnAVAILABILITY%20AND%20IMPLEMENTATION%3A%20Metappuccino%20source%20code%20is%20available%20on%3A%20github.com%5C%2Fchumphati%5C%2FMetappuccino.%20The%20fine-tuned%20LLM%2C%20MetappuccinoLLModel%2C%20is%20available%20on%3A%20huggingface.co%5C%2Fchumphati%5C%2FMetappuccinoLLModel.%20Both%20repositories%20are%20released%20under%20Apache-2.0%20license.%22%2C%22date%22%3A%222026-05-03%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1093%5C%2Fbioinformatics%5C%2Fbtag166%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2242057294%22%2C%22PMCID%22%3A%22PMC13148957%22%2C%22ISSN%22%3A%221367-4811%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A55%3A23Z%22%7D%7D%2C%7B%22key%22%3A%22IGUDWQEA%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Khamvongsa-Charbonnier%20et%20al.%22%2C%22parsedDate%22%3A%222026-04%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BKhamvongsa-Charbonnier%2C%20Lucie%2C%20Robert%20Aboukhalil%2C%20H%26%23xE9%3Bl%26%23xE8%3Bne%20Chiapello%2C%20et%20al.%202026.%20%26%23x201C%3BTraining%20Biologists%20in%20Unix%20Command-Line%20Skills%3A%20From%20Curriculum%20to%20Interactive%20Online%20Tutorials.%26%23x201D%3B%20%26lt%3Bi%26gt%3BPLoS%20Computational%20Biology%26lt%3B%5C%2Fi%26gt%3B%2022%20%284%29%3A%20e1014133.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pcbi.1014133%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pcbi.1014133%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Training%20biologists%20in%20Unix%20command-line%20skills%3A%20From%20curriculum%20to%20interactive%20online%20tutorials%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Lucie%22%2C%22lastName%22%3A%22Khamvongsa-Charbonnier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Robert%22%2C%22lastName%22%3A%22Aboukhalil%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22H%5Cu00e9l%5Cu00e8ne%22%2C%22lastName%22%3A%22Chiapello%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Thomas%22%2C%22lastName%22%3A%22Denecker%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Pierre%22%2C%22lastName%22%3A%22Poulain%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Denis%22%2C%22lastName%22%3A%22Puthier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Olivier%22%2C%22lastName%22%3A%22Sand%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Morgane%22%2C%22lastName%22%3A%22Thomas-Chollier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Claire%22%2C%22lastName%22%3A%22Toffano-Nioche%22%7D%5D%2C%22abstractNote%22%3A%22As%20the%20generation%20of%20data%20in%20the%20life%20and%20health%20sciences%20expands%20rapidly%2C%20there%20is%20a%20growing%20need%20for%20professionals%20and%20students%20in%20these%20fields%20to%20master%20core%20bioinformatics%20skills%2C%20particularly%20those%20relating%20to%20Unix-like%20systems%2C%20most%20commonly%20used%20in%20bioinformatics.%20This%20paper%20introduces%20two%20key%20contributions%20to%20address%20this%20need%3A%20%281%29%20A%20Unix%20curriculum%20for%20life%20scientists%20with%20little%20or%20no%20command-line%20experience%2C%20based%20on%20progressive%20Unix%20skill%20levels%20for%20bioinformatics%20and%20%282%29%20An%20implementation%20of%20this%20curriculum%20into%20a%20series%20of%20interactive%20online%20tutorials%20deployed%20through%20Sandbox.bio-an%20open-source%20platform%20for%20learning%20bioinformatics%20that%20embeds%20a%20command%20line%20in%20the%20browser%2C%20which%20removes%20barriers%20related%20to%20software%20installation%20and%20access.%20We%20performed%20an%20overall%20evaluation%20of%20this%20teaching%20framework%20in%20different%20contexts.%20This%20inclusive%2C%20sustainable%20approach%20provides%20widespread%20access%20to%20essential%20bioinformatics%20skills%20for%20life%20science%20students%20and%20professionals%20alike.%22%2C%22date%22%3A%222026-04%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1371%5C%2Fjournal.pcbi.1014133%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2241931477%22%2C%22PMCID%22%3A%22PMC13048368%22%2C%22ISSN%22%3A%221553-7358%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A56%3A11Z%22%7D%7D%2C%7B%22key%22%3A%22JKR87EQN%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Zrafi%20et%20al.%22%2C%22parsedDate%22%3A%222026-03-13%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BZrafi%2C%20Wael%20S.%2C%20V%26%23xED%3Bctor%20Albarr%26%23xE1%3Bn-Artahona%2C%20Filippo%20G.%20Dall%26%23x2019%3BOlio%2C%20et%20al.%202026.%20%26%23x201C%3BTumor%20Purity%20as%20a%20Prognostic%20and%20Predictive%20Biomarker%20of%20Postoperative%20Radiotherapy%20Outcomes%20in%20Stage%20IIIA-N2%20Non-Small-Cell%20Lung%20Cancer%3A%20A%20Transcriptomic%20Analysis%20from%20the%20Lung%20ART%20Trial.%26%23x201D%3B%20%26lt%3Bi%26gt%3BInternational%20Journal%20of%20Radiation%20Oncology%2C%20Biology%2C%20Physics%26lt%3B%5C%2Fi%26gt%3B%2C%20March%2013%2C%20S0360-3016%2826%2900491-8.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.ijrobp.2026.03.006%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.ijrobp.2026.03.006%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Tumor%20purity%20as%20a%20prognostic%20and%20predictive%20biomarker%20of%20postoperative%20radiotherapy%20outcomes%20in%20stage%20IIIA-N2%20non-small-cell%20lung%20cancer%3A%20a%20transcriptomic%20analysis%20from%20the%20Lung%20ART%20trial%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Wael%20S.%22%2C%22lastName%22%3A%22Zrafi%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22V%5Cu00edctor%22%2C%22lastName%22%3A%22Albarr%5Cu00e1n-Artahona%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Filippo%20G.%22%2C%22lastName%22%3A%22Dall%27Olio%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Maria%20R.%22%2C%22lastName%22%3A%22Ghigna%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nicolas%22%2C%22lastName%22%3A%22Signolle%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nathalie%22%2C%22lastName%22%3A%22Cozic%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Julien%22%2C%22lastName%22%3A%22Adam%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ludovic%22%2C%22lastName%22%3A%22Lacroix%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22C%5Cu00e9cile%20Le%22%2C%22lastName%22%3A%22Pechoux%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Gautheret%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Benjamin%22%2C%22lastName%22%3A%22Besse%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Antonin%22%2C%22lastName%22%3A%22Levy%22%7D%5D%2C%22abstractNote%22%3A%22PURPOSE%3A%20Tumor%20purity%20%28TP%29%2C%20the%20proportion%20of%20malignant%20cells%20within%20a%20tumor%20sample%2C%20is%20an%20important%20feature%20of%20the%20tumor%20microenvironment%20%28TME%29.%20Using%20transcriptomic%20data%20from%20the%20Lung%20ART-IFCT%200503%20trial%2C%20we%20investigated%20the%20relevance%20of%20TP%20and%20its%20potential%20to%20predict%20benefit%20from%20postoperative%20radiotherapy%20%28PORT%29.%5CnMATERIALS%20AND%20METHODS%3A%20RNA%20sequencing%20was%20successfully%20performed%20on%20285%20samples.%20TP%20was%20inferred%20using%20the%20ESTIMATE%20algorithm.%20Associations%20with%20overall%20survival%20%28OS%29%20and%20disease-free%20survival%20%28DFS%29%20were%20assessed%20using%20Kaplan-Meier%20and%20multivariable%20Cox%20models.%5CnRESULTS%3A%20Among%20285%20patients%20with%20resected%20stage%20IIIA-N2%20NSCLC%2C%20144%20received%20PORT.%20The%20median%20age%20was%2061%20years%2C%2031%25%20were%20female%2C%20and%2080%25%20had%20non-squamous%20histology.%20Baseline%20characteristics%20were%20well%20balanced%20between%20arms.%20The%20median%20TP%20was%200.64%20%28range%200.41-0.92%29%20and%20was%20slightly%20higher%20in%20the%20PORT%20arm%20%280.65%20vs%200.63%3B%20p%5Cu202f%3D%5Cu202f0.006%29.%20TP%20correlated%20with%20H%26amp%3BE%20pathologist-estimated%20cellularity%20%28r%5Cu202f%3D%5Cu202f0.23%2C%20p%20%26lt%3B%200.001%29%2C%20was%20higher%20in%20squamous%20tumors%20%280.68%20vs%200.63%2C%20p%20%26lt%3B%200.001%29%2C%20and%20increased%20with%20necrosis%20%28r%5Cu202f%3D%5Cu202f0.31%2C%20p%20%26lt%3B%2010%5Cu207b%5Cu2076%29.%20Transcriptomic%20analysis%20confirmed%20associations%20with%20proliferation-related%20pathways%20and%20reduced%20hypoxia%20signatures%20%28false%20discovery%20rate%20%26lt%3B%200.001%29.%20TP%20inversely%20correlated%20with%20T-cell%20immune%20infiltration%20by%20IHC%20%28CD3%5Cu207a%20r%5Cu202f%3D%5Cu202f-0.52%3B%20CD8%5Cu207a%20r%5Cu202f%3D%5Cu202f-0.45%29.%20High%20TP%20was%20associated%20with%20worse%20OS%20%2848.5%20vs%20106.5%20months%2C%20p%20%26lt%3B%200.001%29%20and%20DFS%20%2818.4%20vs%2048.0%20months%2C%20p%5Cu202f%3D%5Cu202f0.017%29.%20A%20TP%5Cu202f%5Cu00d7%5Cu202fPORT%20interaction%20was%20observed%20for%20OS%20%28p%5Cu202f%3D%5Cu202f0.049%29%20and%20a%20trend%20for%20DFS%20%28p%5Cu202f%3D%5Cu202f0.07%29.%5CnCONCLUSION%3A%20TP%20reflects%20proliferative%20and%20TME%20features%2C%20a%20lower%20TP%20is%20independently%20associated%20with%20improved%20prognosis%20in%20resected%20stage%20IIIA-N2%20NSCLC.%20PORT%20may%20be%20more%20effective%20in%20tumors%20with%20lower%20TP%3B%20however%2C%20this%20finding%20is%20exploratory%20and%20requires%20independent%20validation.%22%2C%22date%22%3A%222026-03-13%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1016%5C%2Fj.ijrobp.2026.03.006%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2241833910%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221879-355X%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A56%3A30Z%22%7D%7D%2C%7B%22key%22%3A%2299JG3L5L%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Roginski%20et%20al.%22%2C%22parsedDate%22%3A%222026-01-06%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BRoginski%2C%20Paul%2C%20Chris%20Papadopoulos%2C%20Simon%20Herman%2C%20Ambre%20Baumann%2C%20Antoine%20Grislain%2C%20and%20Anne%20Lopes.%202026.%20%26%23x201C%3BImpact%20of%20GC%20Content%20on%20de%20Novo%20Gene%20Birth.%26%23x201D%3B%20%26lt%3Bi%26gt%3BNature%20Communications%26lt%3B%5C%2Fi%26gt%3B%2C%20ahead%20of%20print%2C%20January%206.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1038%5C%2Fs41467-025-68022-7%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1038%5C%2Fs41467-025-68022-7%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Impact%20of%20GC%20content%20on%20de%20novo%20gene%20birth%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Paul%22%2C%22lastName%22%3A%22Roginski%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chris%22%2C%22lastName%22%3A%22Papadopoulos%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Simon%22%2C%22lastName%22%3A%22Herman%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ambre%22%2C%22lastName%22%3A%22Baumann%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Antoine%22%2C%22lastName%22%3A%22Grislain%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne%22%2C%22lastName%22%3A%22Lopes%22%7D%5D%2C%22abstractNote%22%3A%22Noncoding%20regions%20in%20eukaryotes%20are%20extensively%20expressed%20and%20represent%20a%20significant%20source%20of%20novel%20microproteins%2C%20some%20of%20which%20become%20fixed%20as%20de%20novo%20genes.%20However%2C%20the%20structural%20properties%20of%20these%20unevolved%20products%20and%20the%20features%20driving%20their%20fixation%20remain%20poorly%20understood.%20Particularly%2C%20the%20influence%20of%20nucleotide%20composition%20%28GC%20content%29%20on%20their%20structural%20properties%20and%20evolutionary%20trajectories%20is%20still%20unclear.%20Here%2C%20we%20predict%20the%20foldability%20and%20sequence%20properties%20of%20millions%20of%20microproteins%20potentially%20encoded%20in%20the%20noncoding%20open%20reading%20frames%20%28ORFs%29%20of%203%2C379%20eukaryotic%20genomes%20with%20GC%20contents%20ranging%20from%2018%25%20to%2079%25.%20Depending%20on%20GC%20content%2C%20these%20microproteins%20exhibit%20distinct%20structural%20properties%2C%20suggesting%20different%20cellular%20impacts%20if%20non-genic%20regions%20are%20pervasively%20expressed.%20Using%20phylostratigraphy%2C%20de%20novo%20gene%20search%2C%20and%20ancestral%20sequence%20reconstruction%2C%20we%20trace%20the%20evolution%20of%20several%20hundred%20de%20novo%20proteins%20across%2022%20organisms%20with%20varying%20GC%20contents.%20We%20show%20that%20de%20novo%20genes%20preferentially%20emerge%20from%20GC-rich%20ORFs%20with%20folding%20potential%2C%20revealing%20that%20the%20interplay%20between%20GC%20content%20and%20foldability%20-%20rooted%20in%20the%20structure%20of%20the%20genetic%20code%20-%20shapes%20the%20emergence%20of%20novel%20genes.%22%2C%22date%22%3A%222026-01-06%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1038%5C%2Fs41467-025-68022-7%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2241495041%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%222041-1723%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A54%3A48Z%22%7D%7D%2C%7B%22key%22%3A%22T4BLJSSL%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Mariotte%20et%20al.%22%2C%22parsedDate%22%3A%222026-01%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BMariotte%2C%20T.%2C%20R.%20Coudray%2C%20C.%20Toffano-Nioche%2C%20F.%20Guyot%2C%20and%20A.%20Gorlas.%202026.%20%26%23x201C%3BIron%20Sulfides%20Produced%20by%20Thermococcales%3A%20An%20Iron%20Detoxification%20Mechanism.%26%23x201D%3B%20%26lt%3Bi%26gt%3BEnvironmental%20Microbiology%26lt%3B%5C%2Fi%26gt%3B%2028%20%281%29%3A%20e70242.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1111%5C%2F1462-2920.70242%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1111%5C%2F1462-2920.70242%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Iron%20Sulfides%20Produced%20by%20Thermococcales%3A%20An%20Iron%20Detoxification%20Mechanism%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22T.%22%2C%22lastName%22%3A%22Mariotte%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22R.%22%2C%22lastName%22%3A%22Coudray%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22C.%22%2C%22lastName%22%3A%22Toffano-Nioche%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22F.%22%2C%22lastName%22%3A%22Guyot%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22A.%22%2C%22lastName%22%3A%22Gorlas%22%7D%5D%2C%22abstractNote%22%3A%22Thermococcales%2C%20sulfur-reducing%20archaea%20inhabiting%20the%20hottest%20parts%20of%20hydrothermal%20vents%2C%20have%20evolved%20to%20thrive%20in%20environments%20rich%20in%20iron%20and%20sulfide%20species.%20In%20this%20study%2C%20using%20experimental%20analogues%20of%20sulfur-rich%20hydrothermal%20chimneys%2C%20we%20confirm%20previous%20suggestions%20that%20the%20precipitation%20of%20iron%20sulfide%20minerals%20promoted%20by%20Thermococcales%20contributes%20to%20a%20population-wide%20adaptation%20to%20reactive%20species%20induced%20by%20the%20presence%20of%20high%20levels%20of%20iron.%20In%20parallel%20with%20mineral%20phases%20identification%2C%20cellular%20metabolic%20activity%20was%20monitored%20during%20mineralization%2C%20revealing%20a%20mechanism%20in%20which%20a%20subpopulation%20of%20cells%20does%20not%20survive%20mineralization%20and%20becomes%20encrusted%20in%20pyrite%2C%20while%20the%20remaining%20living%20cells%20exhibit%20a%20gene%20expression%20profile%20focused%20on%20DNA%20repair%20and%20metal%20excess%20associated%20detoxification.%20Compared%20to%20abiotic%20conditions%2C%20Thermococcales%20induce%20a%20faster%20precipitation%20of%20dissolved%20iron%2C%20immobilising%20excess%20metal.%20Our%20results%20clarify%20the%20role%20of%20mineralizing%20cells%20in%20this%20survival%20mechanism%2C%20suggesting%20that%20this%20biomineralization%20process%20allows%20resilience%20to%20extreme%20chemical%20stress.%20Upon%20drastic%20levels%20of%20toxic%20dissolved%20iron%2C%20thanks%20to%20a%20population%20of%20mineralizing%20cells%2C%20the%20surviving%20Thermococcales%20are%20thus%20more%20likely%20to%20endure%20those%20still%20harsh%20environments.%20This%20complex%20mechanism%20is%20likely%20a%20key%20factor%20in%20the%20adaptation%20of%20microorganisms%20to%20the%20hottest%20environments%20of%20hydrothermal%20vents.%22%2C%22date%22%3A%222026-01%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1111%5C%2F1462-2920.70242%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2241571618%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221462-2920%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A55%3A04Z%22%7D%7D%2C%7B%22key%22%3A%228VBDKQYQ%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Rossier%20et%20al.%22%2C%22parsedDate%22%3A%222026%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BRossier%2C%20Ombeline%2C%20Florence%20Constantinesco-Becker%2C%20Anne%20Lopes%2C%20et%20al.%202026.%20%26%23x201C%3BGenome%20Sequence%20of%20Corynebacterium%20Glutamicum%20Phage%20MicyPS.%26%23x201D%3B%20%26lt%3Bi%26gt%3BmicroPublication%20Biology%26lt%3B%5C%2Fi%26gt%3B%202026.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.17912%5C%2Fmicropub.biology.001936%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.17912%5C%2Fmicropub.biology.001936%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Genome%20Sequence%20of%20Corynebacterium%20glutamicum%20Phage%20MicyPS%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ombeline%22%2C%22lastName%22%3A%22Rossier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Florence%22%2C%22lastName%22%3A%22Constantinesco-Becker%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne%22%2C%22lastName%22%3A%22Lopes%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Delaruelle%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ana%20A.%22%2C%22lastName%22%3A%22Arteni%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Lydia%22%2C%22lastName%22%3A%22Hassissene%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Malika%22%2C%22lastName%22%3A%22Ouldali%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Laura%22%2C%22lastName%22%3A%22Pieri%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Avril%22%2C%22lastName%22%3A%22Zappini%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Katia%22%2C%22lastName%22%3A%22Zaidi%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Armand%22%2C%22lastName%22%3A%22Tomasella%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Perla%22%2C%22lastName%22%3A%22Tannous%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Michel%22%2C%22lastName%22%3A%22Nouhra%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mickael%22%2C%22lastName%22%3A%22Marques%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Cindy%22%2C%22lastName%22%3A%22Goodur%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Louis%22%2C%22lastName%22%3A%22Gachot%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Erine%22%2C%22lastName%22%3A%22Dumond%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Aliz%5Cu00e9e%22%2C%22lastName%22%3A%22Dias%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Heather%22%2C%22lastName%22%3A%22Desolle%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Meissan%22%2C%22lastName%22%3A%22Chikhi%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chahine%22%2C%22lastName%22%3A%22Belhachem%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Adel%22%2C%22lastName%22%3A%22Amriche%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christophe%22%2C%22lastName%22%3A%22Regeard%22%7D%5D%2C%22abstractNote%22%3A%22MicyPS%20is%20a%20bacteriophage%20with%20siphovirus%20morphology%20infecting%20Corynebacterium%20glutamicum%20strain%20MB001.%20It%20was%20isolated%20from%20soil%20near%20a%20henhouse%20in%20Villiers-sur-Marne%2C%20France.%20Its%2078%2C208-bp%20genome%20encodes%20115%20predicted%20protein-encoding%20genes%20and%205%20tRNAs.%20Based%20on%20gene-content%20similarity%20with%20actinobacteriophage%20PSonyx%2C%20MicyPS%20was%20assigned%20to%20the%20new%20cluster%20EQ.%22%2C%22date%22%3A%222026%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.17912%5C%2Fmicropub.biology.001936%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2241728345%22%2C%22PMCID%22%3A%22PMC12921444%22%2C%22ISSN%22%3A%222578-9430%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A54%3A32Z%22%7D%7D%2C%7B%22key%22%3A%227NE9FD5R%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Torossian%20et%20al.%22%2C%22parsedDate%22%3A%222025-12-01%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BTorossian%2C%20Nouritza%2C%20Marc%20Gabriel%2C%20Panagiotis%20Papoutsoglou%2C%20et%20al.%202025.%20%26%23x201C%3BReference-Free%20RNA%20Profiling%20Predicts%20Triple%20Negative%20Breast%20Cancer%20Chemoresistance%20to%20Neoadjuvant%20Treatment.%26%23x201D%3B%20%26lt%3Bi%26gt%3BNAR%20Cancer%26lt%3B%5C%2Fi%26gt%3B%207%20%284%29%3A%20zcaf036.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fnarcan%5C%2Fzcaf036%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fnarcan%5C%2Fzcaf036%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Reference-free%20RNA%20profiling%20predicts%20triple%20negative%20breast%20cancer%20chemoresistance%20to%20neoadjuvant%20treatment%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nouritza%22%2C%22lastName%22%3A%22Torossian%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marc%22%2C%22lastName%22%3A%22Gabriel%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Panagiotis%22%2C%22lastName%22%3A%22Papoutsoglou%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Dominika%22%2C%22lastName%22%3A%22Foretek%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Camille%22%2C%22lastName%22%3A%22Brochard%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Maud%22%2C%22lastName%22%3A%22Kamal%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Linda%22%2C%22lastName%22%3A%22Ramdani%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Constance%22%2C%22lastName%22%3A%22Lamy%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Charlotte%22%2C%22lastName%22%3A%22Lecerf%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Maral%22%2C%22lastName%22%3A%22Halladjian%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Celia%22%2C%22lastName%22%3A%22Dupain%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Josiane%22%2C%22lastName%22%3A%22Lafleur%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Adriana%22%2C%22lastName%22%3A%22Aguilar-Mahecha%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mark%22%2C%22lastName%22%3A%22Basik%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne%22%2C%22lastName%22%3A%22Vincent-Salomon%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christophe%5Cu00a0L%20E%22%2C%22lastName%22%3A%22Tourneau%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sergio%22%2C%22lastName%22%3A%22Roman-Roman%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Gautheret%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Antonin%22%2C%22lastName%22%3A%22Morillon%22%7D%5D%2C%22abstractNote%22%3A%22Triple%20negative%20breast%20cancer%20%28TNBC%29%20is%20the%20most%20aggressive%20breast%20cancer%20%28BC%29%20and%20often%20affects%20young%20women.%20TNBCs%20are%20highly%20heterogeneous%20and%20do%20not%20benefit%20from%20personalized%20medicine%20at%20localized%20stages.%20Most%20TNBC%20patients%20undergo%20neoadjuvant%20chemotherapy%20%28NAC%29%20before%20surgery.%20In%20case%20of%20chemoresistance%20with%20residual%20tumor%20after%20NAC%2C%20survival%20is%20poor%20despite%20execution%20of%20complete%20tumor%20resection.%20There%20is%20currently%20no%20clinically%20useful%20biomarker%20to%20predict%20TNBC%20chemoresistance%20to%20NAC%20that%5Cu00a0would%20enable%20targeted%20therapeutic%20intensification.%20We%20analyzed%20here%20a%20unique%20cohort%20of%20106%20TNBC%20tumors%20before%20NAC%2C%20including%2058%20chemoresistant%20and%2048%20chemosensitive%20cases%2C%20from%202%5Cu00a0independent%20hospitals.%20Using%20machine%20learning%20under%20a%20nested%20cross-validation%20design%2C%20we%20obtained%20two%20transcriptomic%20signatures%20respectively%20generated%20from%20standard%20differential%20gene%20expression%20analysis%20and%20reference-free%20analysis%20of%20differential%20fragments%20of%20transcripts%2C%20without%20any%20annotation%20bias.%20This%20approach%20resulted%20in%20accurate%20signatures%20of%20TNBC%20chemoresistance%20to%20NAC.%20Gene%20ontology%20analyses%20of%20reference-free%20signatures%20highlighted%20DNA%20repair%2C%20replication%2C%20and%20metabolism%2C%20in%20agreement%20with%20current%20knowledge%20of%20TNBC%20resistance%20biology.%20In%20summary%2C%20these%20results%20show%20the%20potential%20of%20a%20reference-free%20generated%20transcriptomic%20signature%20as%20predictive%20biomarker%20of%20early%20TNBC%20chemoresistance.%22%2C%22date%22%3A%222025-12-01%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1093%5C%2Fnarcan%5C%2Fzcaf036%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fnarcan%5C%2Fzcaf036%22%2C%22PMID%22%3A%22%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%222632-8674%22%2C%22language%22%3A%22%22%2C%22collections%22%3A%5B%22GXCB8CXY%22%5D%2C%22dateModified%22%3A%222026-06-02T05%3A55%3A42Z%22%7D%7D%2C%7B%22key%22%3A%22XUG9URTW%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Vergnaud%20et%20al.%22%2C%22parsedDate%22%3A%222025-10-14%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BVergnaud%2C%20Gilles%2C%20Markus%20H.%20Antwerpen%2C%20and%20Gregor%20Grass.%202025.%20%26%23x201C%3BBacillus%20Anthracis%20Phylogeography%3A%20Origin%20of%20the%20East%20Asian%20Polytomy%20and%20Impact%20of%20International%20Trade%20for%20Its%20near%20Global%20Dispersal.%26%23x201D%3B%20%26lt%3Bi%26gt%3BPathogens%20%28Basel%2C%20Switzerland%29%26lt%3B%5C%2Fi%26gt%3B%2014%20%2810%29%3A%201041.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3390%5C%2Fpathogens14101041%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3390%5C%2Fpathogens14101041%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Bacillus%20anthracis%20Phylogeography%3A%20Origin%20of%20the%20East%20Asian%20Polytomy%20and%20Impact%20of%20International%20Trade%20for%20Its%20near%20Global%20Dispersal%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gilles%22%2C%22lastName%22%3A%22Vergnaud%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Markus%20H.%22%2C%22lastName%22%3A%22Antwerpen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gregor%22%2C%22lastName%22%3A%22Grass%22%7D%5D%2C%22abstractNote%22%3A%22Bacillus%20anthracis%20is%20the%20etiological%20agent%20of%20the%20zoonotic%20disease%20anthrax.%20The%20pathogen%20has%20colonized%20many%20regions%20of%20all%20inhabited%20continents.%20Increasing%20evidence%20points%20to%20a%20strong%20contribution%20of%20anthropogenic%20activities%20%28trade%29%20in%20this%20almost%20global%20spread.%20This%20article%20contributes%20further%20genomic%20data%20from%2021%20B.%20anthracis%20strains%2C%20including%2019%20isolated%20in%20Germany%2C%20aiming%20to%20support%20and%20detail%20the%20human%20role%20in%20anthrax%20dispersal.%20The%20newly%20sequenced%20genomes%20belong%20to%20the%20B.%20anthracis%20lineage%20predominant%20in%20China.%20This%20lineage%20is%20remarkable%20because%20of%20its%20phylogenetic%20structure.%20A%20polytomy%20with%20nine%20branches%20radiating%20from%20a%20central%20node%20was%20identified%20by%20whole-genome%20single-nucleotide%20polymorphism%20%28wgSNP%29%20analysis.%20Strains%20from%20Germany%20populate%20two%20among%20the%20nine%20branches.%20Detailed%20analysis%20of%20the%20polytomy%20indicates%20that%20it%20most%20likely%20emerged%20in%20China.%20We%20propose%20that%20the%20polytomy%20is%20the%20result%20of%20the%20import%20of%20contaminated%20animal%20products%20in%20a%20limited%20spatiotemporal%20frame%2C%20followed%20by%20the%20distribution%20of%20these%20products%20to%20different%20locations%20within%20China%2C%20where%20new%20B.%20anthracis%20lineages%20then%20became%20independently%20established.%20Currently%20available%20data%20point%20to%20Bengal%20as%20a%20likely%20geographic%20source%20of%20the%20original%20contamination%2C%20and%20the%20history%20of%20trade%20exchanges%20between%20Bengal%20and%20China%20agrees%20with%20the%20early%20fifteenth%20century%20as%20a%20likely%20time%20period.%20The%20subsequent%20exports%20to%20Germany%20would%20have%20occurred%20during%20the%2019th%20century%20according%20to%20German%20trade%20history.%20Notably%2C%20Germany%20has%20been%20experiencing%20localized%20anthrax%20outbreaks%20from%20this%20trade%20heritage%20up%20into%20the%2021st%20century.%22%2C%22date%22%3A%222025-10-14%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.3390%5C%2Fpathogens14101041%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2241156652%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%222076-0817%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A53%3A57Z%22%7D%7D%2C%7B%22key%22%3A%22CKY9ZMTR%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Saunier%20et%20al.%22%2C%22parsedDate%22%3A%222025-08%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BSaunier%2C%20Marion%2C%20Adeline%20Humbert%2C%20Victor%20Kreis%2C%20et%20al.%202025.%20%26%23x201C%3BDeciphering%20the%20RNA-Based%20Regulation%20Mechanism%20of%20the%20Phage-Encoded%20AbiF%20System%20in%20Clostridioides%20Difficile.%26%23x201D%3B%20%26lt%3Bi%26gt%3BPLoS%20Genetics%26lt%3B%5C%2Fi%26gt%3B%2021%20%288%29%3A%20e1011831.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pgen.1011831%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pgen.1011831%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Deciphering%20the%20RNA-based%20regulation%20mechanism%20of%20the%20phage-encoded%20AbiF%20system%20in%20Clostridioides%20difficile%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marion%22%2C%22lastName%22%3A%22Saunier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Adeline%22%2C%22lastName%22%3A%22Humbert%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Victor%22%2C%22lastName%22%3A%22Kreis%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Johann%22%2C%22lastName%22%3A%22Peltier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Arianna%22%2C%22lastName%22%3A%22Tisba%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sylvie%22%2C%22lastName%22%3A%22Auxilien%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marion%22%2C%22lastName%22%3A%22Blum%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Isabelle%22%2C%22lastName%22%3A%22Caldelari%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jean-Fran%5Cu00e7ois%22%2C%22lastName%22%3A%22Lucier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Joe%22%2C%22lastName%22%3A%22Ueda%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Gautheret%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Claire%22%2C%22lastName%22%3A%22Toffano-Nioche%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jessica%22%2C%22lastName%22%3A%22Andreani%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Louis-Charles%22%2C%22lastName%22%3A%22Fortier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Olga%22%2C%22lastName%22%3A%22Soutourina%22%7D%5D%2C%22abstractNote%22%3A%22Clostridioides%20difficile%20is%20the%20major%20cause%20of%20nosocomial%20infections%20associated%20with%20antibiotic%20therapy.%20The%20severity%20of%20C.%20difficile%20infections%20increased%20worldwide%20with%20the%20emergence%20of%20hypervirulent%20strains%2C%20including%20027%20ribotype%20epidemic%20strains.%20Many%20aspects%20of%20C.%20difficile%20adaptation%20strategies%20during%20pathogenesis%20remain%20poorly%20understood.%20This%20pathogen%20thrives%20in%20gut%20communities%20that%20are%20rich%20in%20microbes%20and%20phages.%20To%20regulate%20horizontal%20transfer%20of%20genetic%20material%20during%20its%20infection%20cycle%2C%20C.%20difficile%20relies%20on%20diverse%20mechanisms.%20More%20specifically%2C%20CRISPR%20%28clustered%20regularly%20interspaced%20short%20palindromic%20repeats%29-Cas%20and%20Toxin-Antitoxin%20%28TA%29%20systems%20contribute%20to%20prophage%20maintenance%2C%20prevention%20of%20phage%20infection%2C%20and%20stress%20response.%20Abortive%20infection%20%28Abi%29%20systems%20can%20provide%20additional%20lines%20of%20anti-phage%20defense.%20RNAs%20have%20emerged%20as%20key%20components%20of%20these%20systems%20including%20CRISPR%20RNAs%20and%20antitoxin%20RNAs%20within%20type%20I%20and%20type%20III%20TA.%20We%20report%20here%20the%20identification%20of%20a%20new%20AbiF-like%20system%20within%20a%20prophage%20of%20the%20hypervirulent%20C.%20difficile%20strain%20R20291.%20It%20is%20associated%20with%20an%20Abi_2%5C%2FAbiD%5C%2FF%20protein%20family%20largely%20distributed%20in%20Bacillota%20and%20Pseudomonadota%20with%20structural%20links%20to%20ancestral%20Cas13%20proteins%20at%20the%20origin%20of%20the%20RNA-targeting%20CRISPR-Cas13%20systems.%20We%20demonstrated%20toxic%20activity%20of%20the%20AbiFCd%20protein%20in%20C.%20difficile%20and%20in%20Escherichia%20coli%20and%20negative%20regulation%20of%20the%20abiFCd%20expression%20by%20an%20associated%20non-coding%20RNA%20RCd22.%20RCd22%20contains%20two%20conserved%20abiF%20motifs%20and%20is%20active%20both%20in%20cis%20and%20in%20trans%20to%20neutralize%20the%20toxin%20by%20direct%20RNA-protein%20interaction%2C%20similar%20to%20RNA%20antitoxin%20in%20type%20III%20TA.%20A%20mass%20spectrometry%20interactomics%20analysis%20of%20protein%20fractions%20from%20MS2-Affinity%20Purification%20coupled%20with%20RNA%20sequencing%20%28MAPS%29%20revealed%20the%20AbiFCd%20protein%20among%20the%20most%20enriched%20RCd22%20partners%20in%20C.%20difficile.%20Structural%20modeling%20of%20the%20RNA-protein%20complex%20and%20mutagenesis%20analysis%20revealed%20key%20positions%20on%20both%20protein%20and%20RNA%20partners%20for%20this%20interaction%20and%20toxic%20activity.%20In%20summary%2C%20these%20findings%20provide%20valuable%20insights%20into%20the%20mechanisms%20of%20interaction%20between%20bacteria%20and%20phages%2C%20which%20are%20pertinent%20to%20the%20advancement%20of%20phage%20therapy%2C%20genome%20editing%2C%20epidemiological%20surveillance%2C%20and%20the%20formulation%20of%20novel%20therapeutic%20approaches.%22%2C%22date%22%3A%222025-08%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1371%5C%2Fjournal.pgen.1011831%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2240828859%22%2C%22PMCID%22%3A%22PMC12373285%22%2C%22ISSN%22%3A%221553-7404%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A54%3A17Z%22%7D%7D%2C%7B%22key%22%3A%22CGNU9ICU%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Papadopoulos%20et%20al.%22%2C%22parsedDate%22%3A%222024-10-14%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BPapadopoulos%2C%20Chris%2C%20Hugo%20Arbes%2C%20David%20Cornu%2C%20et%20al.%202024.%20%26%23x201C%3BThe%20Ribosome%20Profiling%20Landscape%20of%20Yeast%20Reveals%20a%20High%20Diversity%20in%20Pervasive%20Translation.%26%23x201D%3B%20%26lt%3Bi%26gt%3BGenome%20Biology%26lt%3B%5C%2Fi%26gt%3B%2025%20%281%29%3A%20268.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1186%5C%2Fs13059-024-03403-7%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1186%5C%2Fs13059-024-03403-7%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22The%20ribosome%20profiling%20landscape%20of%20yeast%20reveals%20a%20high%20diversity%20in%20pervasive%20translation%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chris%22%2C%22lastName%22%3A%22Papadopoulos%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Hugo%22%2C%22lastName%22%3A%22Arbes%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Cornu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nicolas%22%2C%22lastName%22%3A%22Chevrollier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sandra%22%2C%22lastName%22%3A%22Blanchet%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Paul%22%2C%22lastName%22%3A%22Roginski%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Camille%22%2C%22lastName%22%3A%22Rabier%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Safiya%22%2C%22lastName%22%3A%22Atia%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Olivier%22%2C%22lastName%22%3A%22Lespinet%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Olivier%22%2C%22lastName%22%3A%22Namy%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne%22%2C%22lastName%22%3A%22Lopes%22%7D%5D%2C%22abstractNote%22%3A%22Pervasive%20translation%20is%20a%20widespread%20phenomenon%20that%20plays%20a%20critical%20role%20in%20the%20emergence%20of%20novel%20microproteins%2C%20but%20the%20diversity%20of%20translation%20patterns%20contributing%20to%20their%20generation%20remains%20unclear.%20Based%20on%2054%20ribosome%20profiling%20%28Ribo-Seq%29%20datasets%2C%20we%20investigated%20the%20yeast%20Ribo-Seq%20landscape%20using%20a%20representation%20framework%20that%20allows%20the%20comprehensive%20inventory%20and%20classification%20of%20the%20entire%20diversity%20of%20Ribo-Seq%20signals%2C%20including%20non-canonical%20ones.%22%2C%22date%22%3A%222024-10-14%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1186%5C%2Fs13059-024-03403-7%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1186%5C%2Fs13059-024-03403-7%22%2C%22PMID%22%3A%22%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221474-760X%22%2C%22language%22%3A%22en%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T06%3A48%3A57Z%22%7D%7D%2C%7B%22key%22%3A%225DIMAKYQ%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Bessi%5Cu00e8re%20et%20al.%22%2C%22parsedDate%22%3A%222024-10-10%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BBessi%26%23xE8%3Bre%2C%20Chlo%26%23xE9%3B%2C%20Haoliang%20Xue%2C%20Benoit%20Guibert%2C%20et%20al.%202024.%20%26%23x201C%3BTransipedia.Org%3A%20K-Mer-Based%20Exploration%20of%20Large%20RNA%20Sequencing%20Datasets%20and%20Application%20to%20Cancer%20Data.%26%23x201D%3B%20%26lt%3Bi%26gt%3BGenome%20Biology%26lt%3B%5C%2Fi%26gt%3B%2025%20%281%29%3A%20266.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1186%5C%2Fs13059-024-03413-5%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1186%5C%2Fs13059-024-03413-5%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Transipedia.org%3A%20k-mer-based%20exploration%20of%20large%20RNA%20sequencing%20datasets%20and%20application%20to%20cancer%20data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chlo%5Cu00e9%22%2C%22lastName%22%3A%22Bessi%5Cu00e8re%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Haoliang%22%2C%22lastName%22%3A%22Xue%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Benoit%22%2C%22lastName%22%3A%22Guibert%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anthony%22%2C%22lastName%22%3A%22Boureux%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Florence%22%2C%22lastName%22%3A%22Ruffl%5Cu00e9%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Julien%22%2C%22lastName%22%3A%22Viot%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Rayan%22%2C%22lastName%22%3A%22Chikhi%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mika%5Cu00ebl%22%2C%22lastName%22%3A%22Salson%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Camille%22%2C%22lastName%22%3A%22Marchet%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Th%5Cu00e9r%5Cu00e8se%22%2C%22lastName%22%3A%22Commes%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Gautheret%22%7D%5D%2C%22abstractNote%22%3A%22Indexing%20techniques%20relying%20on%20k-mers%20have%20proven%20effective%20in%20searching%20for%20RNA%20sequences%20across%20thousands%20of%20RNA-seq%20libraries%2C%20but%20without%20enabling%20direct%20RNA%20quantification.%20We%20show%20here%20that%20arbitrary%20RNA%20sequences%20can%20be%20quantified%20in%20seconds%20through%20their%20decomposition%20into%20k-mers%2C%20with%20a%20precision%20akin%20to%20that%20of%20conventional%20RNA%20quantification%20methods.%20Using%20an%20index%20of%20the%20Cancer%20Cell%20Line%20Encyclopedia%20%28CCLE%29%20collection%20consisting%20of%201019%20RNA-seq%20samples%2C%20we%20show%20that%20k-mer%20indexing%20offers%20a%20powerful%20means%20to%20reveal%20non-reference%20sequences%2C%20and%20variant%20RNAs%20induced%20by%20specific%20gene%20alterations%2C%20for%20instance%20in%20splicing%20factors.%22%2C%22date%22%3A%222024-10-10%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1186%5C%2Fs13059-024-03413-5%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2239390592%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221474-760X%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%22VH2H4I9M%22%5D%2C%22dateModified%22%3A%222026-06-02T05%3A56%3A20Z%22%7D%7D%2C%7B%22key%22%3A%22VZN4YYEQ%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Lu%20et%20al.%22%2C%22parsedDate%22%3A%222024-08-12%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BLu%2C%20Xiaocen%2C%20Luiz%20F.%20M.%20Passalacqua%2C%20Matthew%20Nodwell%2C%20et%20al.%202024.%20%26%23x201C%3BSymmetry%20Breaking%20of%20Fluorophore%20Binding%20to%20a%20G-Quadruplex%20Generates%20an%20RNA%20Aptamer%20with%20Picomolar%20KD.%26%23x201D%3B%20%26lt%3Bi%26gt%3BNucleic%20Acids%20Research%26lt%3B%5C%2Fi%26gt%3B%2052%20%2814%29%3A%208039%26%23x2013%3B51.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fnar%5C%2Fgkae493%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fnar%5C%2Fgkae493%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Symmetry%20breaking%20of%20fluorophore%20binding%20to%20a%20G-quadruplex%20generates%20an%20RNA%20aptamer%20with%20picomolar%20KD%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Xiaocen%22%2C%22lastName%22%3A%22Lu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Luiz%5Cu00a0F%20M%22%2C%22lastName%22%3A%22Passalacqua%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Matthew%22%2C%22lastName%22%3A%22Nodwell%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Kristen%5Cu00a0Y%20S%22%2C%22lastName%22%3A%22Kong%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Guillermo%22%2C%22lastName%22%3A%22Caballero-Garc%5Cu00eda%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Elena%5Cu00a0V%22%2C%22lastName%22%3A%22Dolgosheina%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Adrian%5Cu00a0R%22%2C%22lastName%22%3A%22Ferr%5Cu00e9-D%5Cu2019Amar%5Cu00e9%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Robert%22%2C%22lastName%22%3A%22Britton%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Peter%5Cu00a0J%22%2C%22lastName%22%3A%22Unrau%22%7D%5D%2C%22abstractNote%22%3A%22Fluorogenic%20RNA%20aptamer%20tags%20with%20high%20affinity%20enable%20RNA%20purification%20and%20imaging.%20The%20G-quadruplex%20%28G4%29%20based%20Mango%20%28M%29%20series%20of%20aptamers%20were%20selected%20to%20bind%20a%20thiazole%20orange%20based%20%28TO1-Biotin%29%20ligand.%20Using%20a%20chemical%20biology%20and%20reselection%20approach%2C%20we%20have%20produced%20a%20MII.2%20aptamer%5Cu2013ligand%20complex%20with%20a%20remarkable%20set%20of%20properties%3A%20Its%20unprecedented%20KD%20of%2045%20pM%2C%20formaldehyde%20resistance%20%288%25%20v%5C%2Fv%29%2C%20temperature%20stability%20and%20ligand%20photo-recycling%20properties%20are%20all%20unusual%20to%20find%20simultaneously%20within%20a%20small%20RNA%20tag.%20Crystal%20structures%20demonstrate%20how%20MII.2%2C%20which%20differs%20from%20MII%20by%20a%20single%20A23U%20mutation%2C%20and%20modification%20of%20the%20TO1-Biotin%20ligand%20to%20TO1-6A-Biotin%20achieves%20these%20results.%20MII%20binds%20TO1-Biotin%20heterogeneously%20via%20a%20G4%20surface%20that%20is%20surrounded%20by%20a%20stadium%20of%20five%20adenosines.%20Breaking%20this%20pseudo-rotational%20symmetry%20results%20in%20a%20highly%20cooperative%20and%20homogeneous%20ligand%20binding%20pocket%3A%20A22%20of%20the%20G4%20stadium%20stacks%20on%20the%20G4%20binding%20surface%20while%20the%20TO1-6A-Biotin%20ligand%20completely%20fills%20the%20remaining%20three%20quadrants%20of%20the%20G4%20ligand%20binding%20face.%20Similar%20optimization%20attempts%20with%20MIII.1%2C%20which%20already%20binds%20TO1-Biotin%20in%20a%20homogeneous%20manner%2C%20did%20not%20produce%20such%20marked%20improvements.%20We%20use%20the%20novel%20features%20of%20the%20MII.2%20complex%20to%20demonstrate%20a%20powerful%20optically-based%20RNA%20purification%20system.Artificial%20RNA%20tags%20that%20tightly%20bind%20fluorogenic%20ligands%20have%20many%20RNA%20imaging%20and%20RNA-protein%20biomolecular%20purification%20applications.%20Here%2C%20we%20report%20and%20structurally%20characterize%20a%20very%20small%20%2820-nt%29%20biologically%20compatible%20G-quadruplex%20based%20aptamer%20that%20can%20be%20inserted%20into%20commonly%20found%20GNRA%20tetraloops.%20This%20aptamer%20binds%20its%20fluorogenic%20ligand%20with%20an%20unprecedented%20picomolar%20binding%20affinity%20and%20is%20very%20stable%20against%20thermal%20and%20chemical%20insults.%20As%20the%20ligand%20can%20be%20modified%20to%20include%20biotin%2C%20this%20RNA%20tag%20can%20also%20be%20bound%20to%20streptavidin%20magnetic%20beads.%20After%20washing%2C%20tagged%20RNA%20can%20be%20cleanly%20eluted%20by%20exposing%20the%20beads%20to%20intense%20green%20light%2C%20which%20photobleaches%20the%20bound%20fluorogenic%20ligand%2C%20triggering%20the%20release%20of%20the%20bound%20RNA%20complex.%22%2C%22date%22%3A%222024-08-12%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1093%5C%2Fnar%5C%2Fgkae493%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fnar%5C%2Fgkae493%22%2C%22PMID%22%3A%22%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%220305-1048%22%2C%22language%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A55%3A53Z%22%7D%7D%2C%7B%22key%22%3A%225ZAFSG2C%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Roginski%20et%20al.%22%2C%22parsedDate%22%3A%222024-08-01%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BRoginski%2C%20Paul%2C%20Anna%20Grandchamp%2C%20Chlo%26%23xE9%3B%20Quignot%2C%20and%20Anne%20Lopes.%202024.%20%26%23x201C%3BDe%20Novo%20Emerged%20Gene%20Search%20in%20Eukaryotes%20with%20DENSE.%26%23x201D%3B%20%26lt%3Bi%26gt%3BGenome%20Biology%20and%20Evolution%26lt%3B%5C%2Fi%26gt%3B%2016%20%288%29%3A%20evae159.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fgbe%5C%2Fevae159%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fgbe%5C%2Fevae159%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22De%20Novo%20Emerged%20Gene%20Search%20in%20Eukaryotes%20with%20DENSE%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Paul%22%2C%22lastName%22%3A%22Roginski%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anna%22%2C%22lastName%22%3A%22Grandchamp%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chlo%5Cu00e9%22%2C%22lastName%22%3A%22Quignot%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne%22%2C%22lastName%22%3A%22Lopes%22%7D%5D%2C%22abstractNote%22%3A%22The%20discovery%20of%20de%20novo%20emerged%20genes%2C%20originating%20from%20previously%20noncoding%20DNA%20regions%2C%20challenges%20traditional%20views%20of%20species%20evolution.%20Indeed%2C%20the%20hypothesis%20of%20neutrally%20evolving%20sequences%20giving%20rise%20to%20functional%20proteins%20is%20highly%20unlikely.%20This%20conundrum%20has%20sparked%20numerous%20studies%20to%20quantify%20and%20characterize%20these%20genes%2C%20aiming%20to%20understand%20their%20functional%20roles%20and%20contributions%20to%20genome%20evolution.%20Yet%2C%20no%20fully%20automated%20pipeline%20for%20their%20identification%20is%20available.%20Therefore%2C%20we%20introduce%20DENSE%20%28DE%20Novo%20emerged%20gene%20SEarch%29%2C%20an%20automated%20Nextflow%20pipeline%20based%20on%20two%20distinct%20steps%3A%20detection%20of%20taxonomically%20restricted%20genes%20%28TRGs%29%20through%20phylostratigraphy%2C%20and%20filtering%20of%20TRGs%20for%20de%20novo%20emerged%20genes%20via%20genome%20comparisons%20and%20synteny%20search.%20DENSE%20is%20available%20as%20a%20user-friendly%20command-line%20tool%2C%20while%20the%20second%20step%20is%20accessible%20through%20a%20web%20server%20upon%20providing%20a%20list%20of%20TRGs.%20Highly%20flexible%2C%20DENSE%20provides%20various%20strategy%20and%20parameter%20combinations%2C%20enabling%20users%20to%20adapt%20to%20specific%20configurations%20or%20define%20their%20own%20strategy%20through%20a%20rational%20framework%2C%20facilitating%20protocol%20communication%2C%20and%20study%20interoperability.%20We%20apply%20DENSE%20to%20seven%20model%20organisms%2C%20exploring%20the%20impact%20of%20its%20strategies%20and%20parameters%20on%20de%20novo%20gene%20predictions.%20This%20thorough%20analysis%20across%20species%20with%20different%20evolutionary%20rates%20reveals%20useful%20metrics%20for%20users%20to%20define%20input%20datasets%2C%20identify%20favorable%5C%2Funfavorable%20conditions%20for%20de%20novo%20gene%20detection%2C%20and%20control%20potential%20biases%20in%20genome%20annotations.%20Additionally%2C%20predictions%20made%20for%20the%20seven%20model%20organisms%20are%20compiled%20into%20a%20requestable%20database%2C%20which%20we%20hope%20will%20serve%20as%20a%20reference%20for%20de%20novo%20emerged%20gene%20lists%20generated%20with%20specific%20criteria%20combinations.%22%2C%22date%22%3A%222024-08-01%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1093%5C%2Fgbe%5C%2Fevae159%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fgbe%5C%2Fevae159%22%2C%22PMID%22%3A%22%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221759-6653%22%2C%22language%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A54%3A08Z%22%7D%7D%2C%7B%22key%22%3A%22L9VA9JE2%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Shevtsov%20et%20al.%22%2C%22parsedDate%22%3A%222024-07-12%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BShevtsov%2C%20Alexandr%2C%20Uinkul%20Izbanova%2C%20Asylulan%20Amirgazin%2C%20et%20al.%202024.%20%26%23x201C%3BGenetic%20Homogeneity%20of%20Francisella%20Tularensis%20Subsp.%20Mediasiatica%20Strains%20in%20Kazakhstan.%26%23x201D%3B%20%26lt%3Bi%26gt%3BPathogens%20%28Basel%2C%20Switzerland%29%26lt%3B%5C%2Fi%26gt%3B%2013%20%287%29%3A%20581.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3390%5C%2Fpathogens13070581%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3390%5C%2Fpathogens13070581%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Genetic%20Homogeneity%20of%20Francisella%20tularensis%20subsp.%20mediasiatica%20Strains%20in%20Kazakhstan%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alexandr%22%2C%22lastName%22%3A%22Shevtsov%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Uinkul%22%2C%22lastName%22%3A%22Izbanova%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Asylulan%22%2C%22lastName%22%3A%22Amirgazin%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alma%22%2C%22lastName%22%3A%22Kairzhanova%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ayan%22%2C%22lastName%22%3A%22Dauletov%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Vladimir%22%2C%22lastName%22%3A%22Kiyan%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gilles%22%2C%22lastName%22%3A%22Vergnaud%22%7D%5D%2C%22abstractNote%22%3A%22Tularemia%20is%20an%20acute%20febrile%20disease%20caused%20by%20the%20Gram-negative%20bacillus%20Francisella%20tularensis.%20Based%20on%20genetic%20and%20phenotypic%20characteristics%2C%20three%20subspecies%20are%20distinguished%3A%20tularensis%2C%20holarctica%2C%20and%20mediasiatica.%20F.%20tularensis%20subsp.%20mediasiatica%20remains%20the%20least%20studied%20subspecies.%20Over%20the%20past%20decade%2C%20new%20foci%20of%20distribution%20of%20F.%20tularensis%20subsp.%20mediasiatica%20have%20been%20discovered%20in%20Russia%20%28Siberia%29%2C%20expanding%20the%20possible%20distribution%20area%20by%20thousands%20of%20kilometers.%20This%20article%20provides%20whole%20genome%20single%20nucleotide%20polymorphism%20%28wgSNP%29%20and%20polymorphic%20tandem%20repeats%20%28MLVA%29%20analyses%20of%2028%20mediasiatica%20strains%20isolated%20between%201965%20and%202004%20in%20Kazakhstan.%20Despite%20high%20genetic%20homogeneity%2C%20MLVA%20with%20eleven%20loci%20%28MLVA11%29%20demonstrates%20a%20high%20discriminatory%20ability%20%28diversity%20index%2C%200.9497%29.%20The%20topological%20structure%20of%20the%20trees%20based%20on%20wgSNP%20and%20MLVA%20is%20not%20comparable%3B%20however%2C%20clustering%20remains%20congruent%20for%20most%20outbreaks%2C%20with%20the%20exception%20of%20two%20strains%20from%20one%20outbreak%20that%20are%20identical%20in%20terms%20of%20wgSNP%20but%20differ%20at%20three%20tandem%20repeat%20loci.%20Based%20on%20wgSNP%2C%20the%20strains%20are%20assigned%20to%20one%20of%20the%20three%20currently%20known%20mediasiatica%20sublineages%2C%20lineage%20M.I%2C%20together%20with%20other%20historical%20strains%20maintained%20in%20collections%20in%20Russia%20and%20Sweden.%20wgSNP%20shows%20limited%20previously%20unknown%20genetic%20diversity%2C%20with%20the%20M.I%20lineage%20size%20being%20only%20118%20SNPs.%20The%20wgSNP%20genotype%20is%20not%20strongly%20correlated%20with%20year%20and%20place%20of%20isolation.%22%2C%22date%22%3A%222024-07-12%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.3390%5C%2Fpathogens13070581%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2239057808%22%2C%22PMCID%22%3A%22PMC11279412%22%2C%22ISSN%22%3A%222076-0817%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%22VH2H4I9M%22%5D%2C%22dateModified%22%3A%222026-06-02T05%3A54%3A24Z%22%7D%7D%2C%7B%22key%22%3A%225QHK6HGS%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Blein-Nicolas%20et%20al.%22%2C%22parsedDate%22%3A%222024-06-01%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BBlein-Nicolas%2C%20M%26%23xE9%3Blisande%2C%20Emilie%20Devijver%2C%20M%26%23xE9%3Blina%20Gallopin%2C%20and%20Emeline%20Perthame.%202024.%20%26%23x201C%3BNonlinear%20Network-Based%20Quantitative%20Trait%20Prediction%20from%20Biological%20Data.%26%23x201D%3B%20%26lt%3Bi%26gt%3BJournal%20of%20the%20Royal%20Statistical%20Society%20Series%20C%3A%20Applied%20Statistics%26lt%3B%5C%2Fi%26gt%3B%2073%20%283%29%3A%20796%26%23x2013%3B815.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fjrsssc%5C%2Fqlae012%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fjrsssc%5C%2Fqlae012%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Nonlinear%20network-based%20quantitative%20trait%20prediction%20from%20biological%20data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M%5Cu00e9lisande%22%2C%22lastName%22%3A%22Blein-Nicolas%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Emilie%22%2C%22lastName%22%3A%22Devijver%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M%5Cu00e9lina%22%2C%22lastName%22%3A%22Gallopin%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Emeline%22%2C%22lastName%22%3A%22Perthame%22%7D%5D%2C%22abstractNote%22%3A%22Quantitatively%20predicting%20phenotypic%20variables%20using%20biomarkers%20is%20a%20challenging%20task%20for%20several%20reasons.%20First%2C%20the%20collected%20biological%20observations%20might%20be%20heterogeneous%20and%20correspond%20to%20different%20biological%20mechanisms.%20Second%2C%20the%20biomarkers%20used%20to%20predict%20the%20phenotype%20are%20potentially%20highly%20correlated%20since%20biological%20entities%20%28genes%2C%20proteins%2C%20and%20metabolites%29%20interact%20through%20unknown%20regulatory%20networks.%20In%20this%20paper%2C%20we%20present%20a%20novel%20approach%20designed%20to%20predict%20multivariate%20quantitative%20traits%20from%20biological%20data%20which%20address%20the%202%20issues.%20The%20proposed%20model%20performs%20well%20on%20prediction%20but%20it%20is%20also%20fully%20parametric%2C%20with%20clusters%20of%20individuals%20and%20regulatory%20networks%2C%20which%20facilitates%20the%20downstream%20biological%20interpretation.%22%2C%22date%22%3A%222024-06-01%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1093%5C%2Fjrsssc%5C%2Fqlae012%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fjrsssc%5C%2Fqlae012%22%2C%22PMID%22%3A%22%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%220035-9254%22%2C%22language%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-02T05%3A55%3A34Z%22%7D%7D%2C%7B%22key%22%3A%222XH8LDAJ%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Xue%20et%20al.%22%2C%22parsedDate%22%3A%222024-03-05%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BXue%2C%20Haoliang%2C%20M%26%23xE9%3Blina%20Gallopin%2C%20Camille%20Marchet%2C%20et%20al.%202024.%20%26%23x201C%3BKaMRaT%3A%20A%20C%26%23x2009%3B%2B%2B%20Toolkit%20for%20k-Mer%20Count%20Matrix%20Dimension%20Reduction.%26%23x201D%3B%20%26lt%3Bi%26gt%3BBioinformatics%20%28Oxford%2C%20England%29%26lt%3B%5C%2Fi%26gt%3B%2C%20March%205%2C%20btae090.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fbioinformatics%5C%2Fbtae090%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fbioinformatics%5C%2Fbtae090%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22KaMRaT%3A%20a%20C%5Cu2009%2B%2B%20toolkit%20for%20k-mer%20count%20matrix%20dimension%20reduction%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Haoliang%22%2C%22lastName%22%3A%22Xue%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M%5Cu00e9lina%22%2C%22lastName%22%3A%22Gallopin%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Camille%22%2C%22lastName%22%3A%22Marchet%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ha%20N.%22%2C%22lastName%22%3A%22Nguyen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yunfeng%22%2C%22lastName%22%3A%22Wang%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Antoine%22%2C%22lastName%22%3A%22Lain%5Cu00e9%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chlo%5Cu00e9%22%2C%22lastName%22%3A%22Bessiere%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Gautheret%22%7D%5D%2C%22abstractNote%22%3A%22MOTIVATION%3A%20KaMRaT%20is%20designed%20for%20processing%20large%20k-mer%20count%20tables%20derived%20from%20multi-sample%2C%20RNA-seq%20data.%20Its%20primary%20objective%20is%20to%20identify%20condition-specific%20or%20differentially%20expressed%20sequences%2C%20regardless%20of%20gene%20or%20transcript%20annotation.%5CnRESULTS%3A%20KaMRaT%20is%20implemented%20in%20C%5Cu2009%2B%2B.%20Major%20functions%20include%20scoring%20k-mers%20based%20on%20count%20statistics%2C%20merging%20overlapping%20k-mers%20into%20contigs%20and%20selecting%20k-mers%20based%20on%20their%20occurrence%20across%20specific%20samples.%5CnAVAILABILITY%3A%20Source%20code%20and%20documentation%20are%20available%20via%20https%3A%5C%2F%5C%2Fgithub.com%5C%2FTransipedia%5C%2FKaMRaT.%5CnSUPPLEMENTARY%20INFORMATION%3A%20Supplementary%20data%20are%20available%20at%20Bioinformatics%20online.%22%2C%22date%22%3A%222024-03-05%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1093%5C%2Fbioinformatics%5C%2Fbtae090%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2238444086%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221367-4811%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%22VH2H4I9M%22%5D%2C%22dateModified%22%3A%222026-06-02T05%3A55%3A14Z%22%7D%7D%2C%7B%22key%22%3A%22SF7KU5QR%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Vergnaud%20et%20al.%22%2C%22parsedDate%22%3A%222024-01%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BVergnaud%2C%20Gilles%2C%20Michel%20S.%20Zygmunt%2C%20Roland%20T.%20Ashford%2C%20Adrian%20M.%20Whatmore%2C%20and%20Axel%20Cloeckaert.%202024.%20%26%23x201C%3BGenomic%20Diversity%20and%20Zoonotic%20Potential%20of%20Brucella%20Neotomae.%26%23x201D%3B%20%26lt%3Bi%26gt%3BEmerging%20Infectious%20Diseases%26lt%3B%5C%2Fi%26gt%3B%2030%20%281%29%3A%20155%26%23x2013%3B58.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3201%5C%2Feid3001.221783%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3201%5C%2Feid3001.221783%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Genomic%20Diversity%20and%20Zoonotic%20Potential%20of%20Brucella%20neotomae%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gilles%22%2C%22lastName%22%3A%22Vergnaud%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Michel%20S.%22%2C%22lastName%22%3A%22Zygmunt%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Roland%20T.%22%2C%22lastName%22%3A%22Ashford%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Adrian%20M.%22%2C%22lastName%22%3A%22Whatmore%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Axel%22%2C%22lastName%22%3A%22Cloeckaert%22%7D%5D%2C%22abstractNote%22%3A%22After%20reports%20in%202017%20of%20Brucella%20neotomae%20infections%20among%20humans%20in%20Costa%20Rica%2C%20we%20sequenced%2012%20strains%20isolated%20from%20rodents%20during%201955-1964%20from%20Utah%2C%20USA.%20We%20observed%20an%20exact%20strain%20match%20between%20the%20human%20isolates%20and%201%20Utah%20isolate.%20Independent%20confirmation%20is%20required%20to%20clarify%20B.%20neotomae%20zoonotic%20potential.%22%2C%22date%22%3A%222024-01%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.3201%5C%2Feid3001.221783%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2238147057%22%2C%22PMCID%22%3A%22PMC10756370%22%2C%22ISSN%22%3A%221080-6059%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%22VH2H4I9M%22%5D%2C%22dateModified%22%3A%222026-06-02T05%3A54%3A41Z%22%7D%7D%2C%7B%22key%22%3A%22XX3FH85G%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Timofeev%20et%20al.%22%2C%22parsedDate%22%3A%222024%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BTimofeev%2C%20Vitalii%2C%20Irina%20Bakhteeva%2C%20Galina%20Titareva%2C%20et%20al.%202024.%20%26%23x201C%3BAvirulence%20of%20a%20Spontaneous%20Francisella%20Tularensis%20Subsp.%20Mediasiatica%20prmA%20Mutant.%26%23x201D%3B%20%26lt%3Bi%26gt%3BPloS%20One%26lt%3B%5C%2Fi%26gt%3B%2019%20%286%29%3A%20e0305569.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pone.0305569%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pone.0305569%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Avirulence%20of%20a%20spontaneous%20Francisella%20tularensis%20subsp.%20mediasiatica%20prmA%20mutant%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Vitalii%22%2C%22lastName%22%3A%22Timofeev%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Irina%22%2C%22lastName%22%3A%22Bakhteeva%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Galina%22%2C%22lastName%22%3A%22Titareva%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Raisa%22%2C%22lastName%22%3A%22Mironova%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Vera%22%2C%22lastName%22%3A%22Evseeva%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Tatiana%22%2C%22lastName%22%3A%22Kravchenko%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Angelika%22%2C%22lastName%22%3A%22Sizova%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alexander%22%2C%22lastName%22%3A%22Borzilov%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Natalia%22%2C%22lastName%22%3A%22Pavlovich%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alexander%22%2C%22lastName%22%3A%22Mokrievich%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ivan%22%2C%22lastName%22%3A%22Dyatlov%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gilles%22%2C%22lastName%22%3A%22Vergnaud%22%7D%5D%2C%22abstractNote%22%3A%22Francisella%20tularensis%2C%20the%20causative%20agent%20of%20tularemia%2C%20is%20divided%20into%20three%20subspecies.%20Two%20of%20these%2C%20subspecies%20holarctica%20and%20tularensis%2C%20are%20highly%20pathogenic%20to%20humans%20and%20consequently%20relatively%20well%20studied.%20The%20third%20subspecies%2C%20mediasiatica%2C%20is%20rarely%20isolated%20and%20remains%20poorly%20studied.%20It%20is%20distributed%20in%20the%20sparsely%20populated%20regions%20of%20Central%20Asia%20and%20Siberia.%20Curently%20this%20subspecies%20is%20not%20known%20to%20have%20been%20responsible%20for%20human%20infections%20in%20spite%20of%20its%20high%20virulence%20in%20laboratory%20animals.%20Subspecies%20mediasiatica%20is%20currently%20divided%20into%20three%20subgroups-MI%2C%20present%20in%20Central%20Asia%2C%20MII%2C%20present%20in%20southern%20Siberia%2C%20and%20MIII%20represented%20by%20a%20unique%20strain%2C%2060%28B%2957%2C%20isolated%20in%20Uzbekistan%20in%201960.%20We%20describe%20here%20the%20unexpected%20observation%20that%20MIII%20strain%2060%28B%2957%20is%20avirulent%20and%20immunogenic.%20We%20observed%20that%20infection%20with%20this%20strain%20protected%20mice%20from%20challenge%2021%20days%20later%20with%20a%20virulent%20subsp.%20mediasiatica%20strain.%20With%20an%20increase%20of%20this%20interval%2C%20the%20protection%20for%20mice%20was%20significantly%20reduced.%20In%20contrast%2C%20guinea%20pigs%20were%20protected%20from%20challenge%20with%20strains%20of%20the%20subspecies%20holarctica%20and%20mediasiatica%20%28but%20not%20subsp.%20tularensis%29%2090%20days%20after%20infection%20with%2060%28B%2957.%20We%20performed%20genome%20assembly%20based%20on%20whole%20genome%20sequencing%20data%20obtained%20using%20the%20Nanopore%20MinION%20for%20strain%2060%28B%2957%20and%20two%20subsp.%20mediasiatica%20strains%20representing%20the%20Central%20Asian%20MI%20and%20Siberian%20MII%20phylogenetic%20subgroups.%20The%20prmA%20gene%20is%20truncated%20due%20to%20a%20nonsense%20mutation%20in%20strain%2060%28B%2957.%20The%20deletion%20of%20gene%20prmA%20has%20previously%20been%20shown%20to%20induce%20a%20loss%20of%20virulence%20in%20Francisella%20novicida%20the%20closest%20model%20organism%20suggesting%20that%20the%20observed%20mutation%20might%20the%20cause%20of%20the%20avirulence%20of%20strain%2060%28B%2957.%22%2C%22date%22%3A%222024%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1371%5C%2Fjournal.pone.0305569%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2238889158%22%2C%22PMCID%22%3A%22PMC11185464%22%2C%22ISSN%22%3A%221932-6203%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%22VH2H4I9M%22%5D%2C%22dateModified%22%3A%222026-06-02T05%3A53%3A48Z%22%7D%7D%2C%7B%22key%22%3A%228NZAAZJC%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Bastide%20et%20al.%22%2C%22parsedDate%22%3A%222022-12-12%22%2C%22numChildren%22%3A4%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BBastide%2C%20Paul%2C%20Charlotte%20Soneson%2C%20David%20B.%20Stern%2C%20Olivier%20Lespinet%2C%20and%20M%26%23xE9%3Blina%20Gallopin.%202022.%20%26%23x201C%3BA%20Phylogenetic%20Framework%20to%20Simulate%20Synthetic%20Inter-Species%20RNA-Seq%20Data.%26%23x201D%3B%20%26lt%3Bi%26gt%3BMolecular%20Biology%20and%20Evolution%26lt%3B%5C%2Fi%26gt%3B%2C%20December%2012%2C%20msac269.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fmolbev%5C%2Fmsac269%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fmolbev%5C%2Fmsac269%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22A%20Phylogenetic%20Framework%20to%20Simulate%20Synthetic%20Inter-species%20RNA-Seq%20Data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Paul%22%2C%22lastName%22%3A%22Bastide%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Charlotte%22%2C%22lastName%22%3A%22Soneson%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%20B%22%2C%22lastName%22%3A%22Stern%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Olivier%22%2C%22lastName%22%3A%22Lespinet%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M%5Cu00e9lina%22%2C%22lastName%22%3A%22Gallopin%22%7D%5D%2C%22abstractNote%22%3A%22Inter-species%20RNA-Seq%20datasets%20are%20increasingly%20common%2C%20and%20have%20the%20potential%20to%20answer%20new%20questions%20about%20the%20evolution%20of%20gene%20expression.%20Single%20species%20differential%20expression%20analysis%20is%20now%20a%20well%20studied%20problem%20that%20benefits%20from%20sound%20statistical%20methods.%20Extensive%20reviews%20on%20biological%20or%20synthetic%20datasets%20have%20provided%20the%20community%20with%20a%20clear%20picture%20on%20the%20relative%20performances%20of%20the%20available%20methods%20in%20various%20settings.%20However%2C%20synthetic%20dataset%20simulation%20tools%20are%20still%20missing%20in%20the%20inter-species%20gene%20expression%20context.%20In%20this%20work%2C%20we%20develop%20and%20implement%20a%20new%20simulation%20framework.%20This%20tool%20builds%20on%20both%20the%20RNA-Seq%20and%20the%20Phylogenetic%20Comparative%20Methods%20literatures%20to%20generate%20realistic%20count%20datasets%2C%20while%20taking%20into%20account%20the%20phylogenetic%20relationships%20between%20the%20samples.%20We%20illustrate%20the%20usefulness%20of%20this%20new%20framework%20through%20a%20targeted%20simulation%20study%2C%20that%20reproduces%20the%20features%20of%20a%20recently%20published%20dataset%2C%20containing%20gene%20expression%20data%20in%20adult%20eye%20tissue%20across%20blind%20and%20sighted%20freshwater%20crayfish%20species.%20Using%20our%20simulated%20datasets%2C%20we%20perform%20a%20fair%20comparison%20of%20several%20approaches%20used%20for%20differential%20expression%20analysis.%20This%20benchmark%20reveals%20some%20of%20the%20strengths%20and%20weaknesses%20of%20both%20the%20classical%20and%20phylogenetic%20approaches%20for%20inter-species%20differential%20expression%20analysis%2C%20and%20allows%20for%20a%20reanalysis%20of%20the%20crayfish%20dataset.%20The%20tool%20has%20been%20integrated%20in%20the%20R%20package%20compcodeR%2C%20freely%20available%20on%20Bioconductor.%22%2C%22date%22%3A%222022-12-12%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1093%5C%2Fmolbev%5C%2Fmsac269%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1093%5C%2Fmolbev%5C%2Fmsac269%22%2C%22PMID%22%3A%22%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221537-1719%22%2C%22language%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222026-06-01T09%3A30%3A31Z%22%7D%7D%2C%7B%22key%22%3A%22FM48KKR4%22%2C%22library%22%3A%7B%22id%22%3A3888256%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Papadopoulos%20et%20al.%22%2C%22parsedDate%22%3A%222021-11-22%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BPapadopoulos%2C%20Chris%2C%20Isabelle%20Callebaut%2C%20Jean-Christophe%20Gelly%2C%20et%20al.%202021.%20%26%23x201C%3BIntergenic%20ORFs%20as%20Elementary%20Structural%20Modules%20of%20de%20Novo%20Gene%20Birth%20and%20Protein%20Evolution.%26%23x201D%3B%20%26lt%3Bi%26gt%3BGenome%20Research%26lt%3B%5C%2Fi%26gt%3B%2C%20ahead%20of%20print%2C%20November%2022.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1101%5C%2Fgr.275638.121%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1101%5C%2Fgr.275638.121%26lt%3B%5C%2Fa%26gt%3B.%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Intergenic%20ORFs%20as%20elementary%20structural%20modules%20of%20de%20novo%20gene%20birth%20and%20protein%20evolution%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chris%22%2C%22lastName%22%3A%22Papadopoulos%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Isabelle%22%2C%22lastName%22%3A%22Callebaut%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jean-Christophe%22%2C%22lastName%22%3A%22Gelly%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Isabelle%22%2C%22lastName%22%3A%22Hatin%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Olivier%22%2C%22lastName%22%3A%22Namy%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Maxime%22%2C%22lastName%22%3A%22Renard%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Olivier%22%2C%22lastName%22%3A%22Lespinet%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anne%22%2C%22lastName%22%3A%22Lopes%22%7D%5D%2C%22abstractNote%22%3A%22The%20noncoding%20genome%20plays%20an%20important%20role%20in%20de%20novo%20gene%20birth%20and%20in%20the%20emergence%20of%20genetic%20novelty.%20Nevertheless%2C%20how%20noncoding%20sequences%26%23039%3B%20properties%20could%20promote%20the%20birth%20of%20novel%20genes%20and%20shape%20the%20evolution%20and%20the%20structural%20diversity%20of%20proteins%20remains%20unclear.%20Therefore%2C%20by%20combining%20different%20bioinformatic%20approaches%2C%20we%20characterized%20the%20fold%20potential%20diversity%20of%20the%20amino%20acid%20sequences%20encoded%20by%20all%20intergenic%20open%20reading%20frames%20%28ORFs%29%20of%20S.%20cerevisiae%20with%20the%20aim%20of%20%281%29%20exploring%20whether%20the%20structural%20states%26%23039%3B%20diversity%20of%20proteomes%20is%20already%20present%20in%20noncoding%20sequences%2C%20and%20%282%29%20estimating%20the%20potential%20of%20the%20noncoding%20genome%20to%20produce%20novel%20protein%20bricks%20that%20could%20either%20give%20rise%20to%20novel%20genes%20or%20be%20integrated%20into%20pre-existing%20proteins%2C%20thus%20participating%20in%20protein%20structure%20diversity%20and%20evolution.%20We%20showed%20that%20amino%20acid%20sequences%20encoded%20by%20most%20yeast%20intergenic%20ORFs%20contain%20the%20elementary%20building%20blocks%20of%20protein%20structures.%20Moreover%2C%20they%20encompass%20the%20large%20structural%20state%20diversity%20of%20canonical%20proteins%2C%20with%20the%20majority%20predicted%20as%20foldable.%20Then%2C%20we%20investigated%20the%20early%20stages%20of%20de%20novo%20gene%20birth%20by%20reconstructing%20the%20ancestral%20sequences%20of%2070%20yeast%20de%20novo%20genes%20and%20characterized%20the%20sequence%20and%20structural%20properties%20of%20intergenic%20ORFs%20with%20a%20strong%20translation%20signal.%20This%20enabled%20us%20to%20highlight%20sequence%20and%20structural%20factors%20determining%20de%20novo%20gene%20emergence.%20Finally%2C%20we%20showed%20a%20strong%20correlation%20between%20the%20fold%20potential%20of%20de%20novo%20proteins%20and%20one%20of%20their%20ancestral%20amino%20acid%20sequences%2C%20reflecting%20the%20relationship%20between%20the%20noncoding%20genome%20and%20the%20protein%20structure%20universe.%22%2C%22date%22%3A%222021-11-22%22%2C%22section%22%3A%22%22%2C%22partNumber%22%3A%22%22%2C%22partTitle%22%3A%22%22%2C%22DOI%22%3A%2210.1101%5C%2Fgr.275638.121%22%2C%22citationKey%22%3A%22%22%2C%22url%22%3A%22%22%2C%22PMID%22%3A%2234810219%22%2C%22PMCID%22%3A%22%22%2C%22ISSN%22%3A%221549-5469%22%2C%22language%22%3A%22eng%22%2C%22collections%22%3A%5B%222FBUFWW8%22%2C%22R7I3GKDL%22%5D%2C%22dateModified%22%3A%222026-06-02T05%3A54%3A56Z%22%7D%7D%5D%7D
Hak, Fiona, Camille Marchet, Daniel Gautheret, and Mélina Gallopin. 2026. “Metappuccino: Large Language Model-Driven Reconstruction of Sequence Read Archive Metadata for Cancer Research.” Bioinformatics (Oxford, England) 42 (5): btag166. https://doi.org/10.1093/bioinformatics/btag166.
Khamvongsa-Charbonnier, Lucie, Robert Aboukhalil, Hélène Chiapello, et al. 2026. “Training Biologists in Unix Command-Line Skills: From Curriculum to Interactive Online Tutorials.” PLoS Computational Biology 22 (4): e1014133. https://doi.org/10.1371/journal.pcbi.1014133.
Zrafi, Wael S., Víctor Albarrán-Artahona, Filippo G. Dall’Olio, et al. 2026. “Tumor Purity as a Prognostic and Predictive Biomarker of Postoperative Radiotherapy Outcomes in Stage IIIA-N2 Non-Small-Cell Lung Cancer: A Transcriptomic Analysis from the Lung ART Trial.” International Journal of Radiation Oncology, Biology, Physics, March 13, S0360-3016(26)00491-8. https://doi.org/10.1016/j.ijrobp.2026.03.006.
Roginski, Paul, Chris Papadopoulos, Simon Herman, Ambre Baumann, Antoine Grislain, and Anne Lopes. 2026. “Impact of GC Content on de Novo Gene Birth.” Nature Communications, ahead of print, January 6. https://doi.org/10.1038/s41467-025-68022-7.
Mariotte, T., R. Coudray, C. Toffano-Nioche, F. Guyot, and A. Gorlas. 2026. “Iron Sulfides Produced by Thermococcales: An Iron Detoxification Mechanism.” Environmental Microbiology 28 (1): e70242. https://doi.org/10.1111/1462-2920.70242.
Rossier, Ombeline, Florence Constantinesco-Becker, Anne Lopes, et al. 2026. “Genome Sequence of Corynebacterium Glutamicum Phage MicyPS.” microPublication Biology 2026. https://doi.org/10.17912/micropub.biology.001936.
Torossian, Nouritza, Marc Gabriel, Panagiotis Papoutsoglou, et al. 2025. “Reference-Free RNA Profiling Predicts Triple Negative Breast Cancer Chemoresistance to Neoadjuvant Treatment.” NAR Cancer 7 (4): zcaf036. https://doi.org/10.1093/narcan/zcaf036.
Vergnaud, Gilles, Markus H. Antwerpen, and Gregor Grass. 2025. “Bacillus Anthracis Phylogeography: Origin of the East Asian Polytomy and Impact of International Trade for Its near Global Dispersal.” Pathogens (Basel, Switzerland) 14 (10): 1041. https://doi.org/10.3390/pathogens14101041.
Saunier, Marion, Adeline Humbert, Victor Kreis, et al. 2025. “Deciphering the RNA-Based Regulation Mechanism of the Phage-Encoded AbiF System in Clostridioides Difficile.” PLoS Genetics 21 (8): e1011831. https://doi.org/10.1371/journal.pgen.1011831.
Papadopoulos, Chris, Hugo Arbes, David Cornu, et al. 2024. “The Ribosome Profiling Landscape of Yeast Reveals a High Diversity in Pervasive Translation.” Genome Biology 25 (1): 268. https://doi.org/10.1186/s13059-024-03403-7.
Bessière, Chloé, Haoliang Xue, Benoit Guibert, et al. 2024. “Transipedia.Org: K-Mer-Based Exploration of Large RNA Sequencing Datasets and Application to Cancer Data.” Genome Biology 25 (1): 266. https://doi.org/10.1186/s13059-024-03413-5.
Lu, Xiaocen, Luiz F. M. Passalacqua, Matthew Nodwell, et al. 2024. “Symmetry Breaking of Fluorophore Binding to a G-Quadruplex Generates an RNA Aptamer with Picomolar KD.” Nucleic Acids Research 52 (14): 8039–51. https://doi.org/10.1093/nar/gkae493.
Roginski, Paul, Anna Grandchamp, Chloé Quignot, and Anne Lopes. 2024. “De Novo Emerged Gene Search in Eukaryotes with DENSE.” Genome Biology and Evolution 16 (8): evae159. https://doi.org/10.1093/gbe/evae159.
Shevtsov, Alexandr, Uinkul Izbanova, Asylulan Amirgazin, et al. 2024. “Genetic Homogeneity of Francisella Tularensis Subsp. Mediasiatica Strains in Kazakhstan.” Pathogens (Basel, Switzerland) 13 (7): 581. https://doi.org/10.3390/pathogens13070581.
Blein-Nicolas, Mélisande, Emilie Devijver, Mélina Gallopin, and Emeline Perthame. 2024. “Nonlinear Network-Based Quantitative Trait Prediction from Biological Data.” Journal of the Royal Statistical Society Series C: Applied Statistics 73 (3): 796–815. https://doi.org/10.1093/jrsssc/qlae012.
Xue, Haoliang, Mélina Gallopin, Camille Marchet, et al. 2024. “KaMRaT: A C ++ Toolkit for k-Mer Count Matrix Dimension Reduction.” Bioinformatics (Oxford, England), March 5, btae090. https://doi.org/10.1093/bioinformatics/btae090.
Vergnaud, Gilles, Michel S. Zygmunt, Roland T. Ashford, Adrian M. Whatmore, and Axel Cloeckaert. 2024. “Genomic Diversity and Zoonotic Potential of Brucella Neotomae.” Emerging Infectious Diseases 30 (1): 155–58. https://doi.org/10.3201/eid3001.221783.
Timofeev, Vitalii, Irina Bakhteeva, Galina Titareva, et al. 2024. “Avirulence of a Spontaneous Francisella Tularensis Subsp. Mediasiatica prmA Mutant.” PloS One 19 (6): e0305569. https://doi.org/10.1371/journal.pone.0305569.
Bastide, Paul, Charlotte Soneson, David B. Stern, Olivier Lespinet, and Mélina Gallopin. 2022. “A Phylogenetic Framework to Simulate Synthetic Inter-Species RNA-Seq Data.” Molecular Biology and Evolution, December 12, msac269. https://doi.org/10.1093/molbev/msac269.
Papadopoulos, Chris, Isabelle Callebaut, Jean-Christophe Gelly, et al. 2021. “Intergenic ORFs as Elementary Structural Modules of de Novo Gene Birth and Protein Evolution.” Genome Research, ahead of print, November 22. https://doi.org/10.1101/gr.275638.121.
