Abstract
Gut microbiome dysbiosis is considered to play a critical role in the development of colorectal cancer. While numerous studies have identified specific microbial species enriched in cancer patients, knowledge about their genomic variants remains limited. Here, we systematically investigated the compositional and genomic alterations in the gut metagenomic sequencing data using 1723 samples collected from 10 different CRC datasets. Our combined analysis revealed a core set of 250 microbial genomes, 4365 single nucleotide variants and 21 structural variants differed between CRC and healthy control groups. Utilizing the microbial SNV and SV profiles, we identified strain level phylogenetic diversity among 27 CRC related species, such as Faecalibacterium prausnitzii, Coprococcus_B comes, Fusicatenibacter saccharivorans, and Ruminococcus_D bicirculans. We also constructed metagenomic classification models based on these multi-dimensional alterations and observed superior diagnostic capability of the multi-dimensional model, achieving an average AUC of 0.82 and 0.80 in the discovery stage and validation stage, respectively. Compared with fecal occult blood test (FOBT), we identified a combination of 4 species, could increase the AUC of FOBT testing by 13.5 our integrated analysis uncovers how microbial genomic variations, in conjunction with compositional changes, influence cancer progression and suggests combining microbial features with cost-effective FOBT testing may be a promising approach for the mass screening of CRC.
Original language | English |
---|---|
Title of host publication | Proceedings of the 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics |
Place of Publication | New York |
Publisher | Association for Computing Machinery (ACM) |
Number of pages | 1 |
ISBN (Print) | 9798400713026 |
DOIs | |
Publication status | Published - 16 Dec 2024 |
Event | 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2024 - Shenzhen, China Duration: 22 Nov 2024 → 25 Nov 2024 https://dl.acm.org/doi/proceedings/10.1145/3698587 (Conference proceeding) https://acm-bcb.org/ (Conference website) |
Publication series
Name | Proceedings of ACM International Conference on Bioinformatics, Computational Biology and Health Informatics |
---|---|
Publisher | Association for Computing Machinery |
Conference
Conference | 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2024 |
---|---|
Country/Territory | China |
City | Shenzhen |
Period | 22/11/24 → 25/11/24 |
Internet address |
|
User-Defined Keywords
- Colorectal cancer
- Interpretable machine learning
- Metagenome
- Single nucleotide variant
- Structural variant