Abstract
MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective approaches to reconstruct multi-megabase haplotypes (phase blocks) from sequence data of a single sample. These technologies provide an optimal sequencing platform to characterize SVs, though few computational algorithms can utilize them. Thus, we developed Aquila_stLFR, an approach that resolves SVs through haplotype-based assembly of stLFR linked-reads.
RESULTS: Aquila_stLFR first partitions long fragment reads into two haplotype-specific blocks with the assistance of the high-quality reference genome, by taking advantage of the potential phasing ability of the linked-read itself. Each haplotype is then assembled independently, to achieve a complete diploid assembly to finally reconstruct the genome-wide SVs. We benchmarked Aquila_stLFR on a well-studied sample, NA24385, and showed Aquila_stLFR can detect medium to large size deletions (50 bp-10 kb) with high sensitivity and medium-size insertions (50 bp-1 kb) with high specificity.
AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available on https://github.com/maiziex/Aquila_stLFR.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
| Original language | English |
|---|---|
| Article number | vbab007 |
| Number of pages | 3 |
| Journal | Bioinformatics advances |
| Volume | 1 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 16 Jun 2021 |
Fingerprint
Dive into the research topics of 'Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver