TY - JOUR
T1 - New insights into Arabidopsis transcriptome complexity revealed by direct sequencing of native RNAs
AU - Zhang, Shoudong
AU - Li, Runsheng
AU - Zhang, Li
AU - Chen, Shengjie
AU - Xie, Min
AU - Yang, Liu
AU - Xia, Yiji
AU - Foyer, Christine H.
AU - Zhao, Zhongying
AU - Lam, Hon Ming
N1 - Hong Kong Research Grants Council Area of Excellence Scheme [AoE/M-403/16 to H. M. L and Y. X], Lo Kwee Seong Biomedical Research Fund [to H. M. L], Hong Kong Research Grant Council General Research Funds [12100717 to S. Z and Y. X, 12100917, 12123716, 12100118 to Z. Z], HKBU Research Committee and Interdisciplinary Research Clusters Matching Scheme [2018/19 to Z.Z.], CUHK direct grant [CUHK4053383 to S.Z], Funding for open access charge: Hong Kong Research Grants Council Area of Excellence Scheme [AoE/M-403/16].
PY - 2020/8/20
Y1 - 2020/8/20
N2 - Arabidopsis thaliana transcriptomes have been extensively studied and characterized under different conditions. However, most of the current 'RNA-sequencing' technologies produce a relatively short read length and demand a reverse-transcription step, preventing effective characterization of transcriptome complexity. Here, we performed Direct RNA Sequencing (DRS) using the latest Oxford Nanopore Technology (ONT) with exceptional read length. We demonstrate that the complexity of the A. thaliana transcriptomes has been substantially under-estimated. The ONT direct RNA sequencing identified novel transcript isoforms at both the vegetative (14-day old seedlings, stage 1.04) and reproductive stages (stage 6.00-6.10) of development. Using in-house software called TrackCluster, we determined alternative transcription initiation (ATI), alternative polyadenylation (APA), alternative splicing (AS), and fusion transcripts. More than 38 500 novel transcript isoforms were identified, including six categories of fusion-transcripts that may result from differential RNA processing mechanisms. Aided by the Tombo algorithm, we found an enrichment of m5C modifications in the mobile mRNAs, consistent with a recent finding that m5C modification in mRNAs is crucial for their long-distance movement. In summary, ONT DRS offers an advantage in the identification and functional characterization of novel RNA isoforms and RNA base modifications, significantly improving annotation of the A. thaliana genome.
AB - Arabidopsis thaliana transcriptomes have been extensively studied and characterized under different conditions. However, most of the current 'RNA-sequencing' technologies produce a relatively short read length and demand a reverse-transcription step, preventing effective characterization of transcriptome complexity. Here, we performed Direct RNA Sequencing (DRS) using the latest Oxford Nanopore Technology (ONT) with exceptional read length. We demonstrate that the complexity of the A. thaliana transcriptomes has been substantially under-estimated. The ONT direct RNA sequencing identified novel transcript isoforms at both the vegetative (14-day old seedlings, stage 1.04) and reproductive stages (stage 6.00-6.10) of development. Using in-house software called TrackCluster, we determined alternative transcription initiation (ATI), alternative polyadenylation (APA), alternative splicing (AS), and fusion transcripts. More than 38 500 novel transcript isoforms were identified, including six categories of fusion-transcripts that may result from differential RNA processing mechanisms. Aided by the Tombo algorithm, we found an enrichment of m5C modifications in the mobile mRNAs, consistent with a recent finding that m5C modification in mRNAs is crucial for their long-distance movement. In summary, ONT DRS offers an advantage in the identification and functional characterization of novel RNA isoforms and RNA base modifications, significantly improving annotation of the A. thaliana genome.
UR - https://www.scopus.com/pages/publications/85089709767
U2 - 10.1093/nar/gkaa588
DO - 10.1093/nar/gkaa588
M3 - Journal article
C2 - 32652016
AN - SCOPUS:85089709767
SN - 0305-1048
VL - 48
SP - 7700
EP - 7711
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 14
ER -