Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads

Yichen Henry Liu, Griffin L Grubbs, Lu Zhang, Xiaodong Fang, David L Dill, Arend Sidow, Xin Zhou*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

5 Citations (Scopus)

Abstract

MOTIVATION: Identifying structural variants (SVs) is critical in health and disease, however, detecting them remains a challenge. Several linked-read sequencing technologies, including 10X Genomics, TELL-Seq and single tube long fragment read (stLFR), have been recently developed as cost-effective approaches to reconstruct multi-megabase haplotypes (phase blocks) from sequence data of a single sample. These technologies provide an optimal sequencing platform to characterize SVs, though few computational algorithms can utilize them. Thus, we developed Aquila_stLFR, an approach that resolves SVs through haplotype-based assembly of stLFR linked-reads.

RESULTS: Aquila_stLFR first partitions long fragment reads into two haplotype-specific blocks with the assistance of the high-quality reference genome, by taking advantage of the potential phasing ability of the linked-read itself. Each haplotype is then assembled independently, to achieve a complete diploid assembly to finally reconstruct the genome-wide SVs. We benchmarked Aquila_stLFR on a well-studied sample, NA24385, and showed Aquila_stLFR can detect medium to large size deletions (50 bp-10 kb) with high sensitivity and medium-size insertions (50 bp-1 kb) with high specificity.

AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available on https://github.com/maiziex/Aquila_stLFR.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.

Original languageEnglish
Article numbervbab007
Number of pages3
JournalBioinformatics advances
Volume1
Issue number1
DOIs
Publication statusPublished - 16 Jun 2021

Fingerprint

Dive into the research topics of 'Aquila_stLFR: diploid genome assembly based structural variant calling package for stLFR linked-reads'. Together they form a unique fingerprint.

Cite this