Full-Length Transcript-Based Proteogenomics of Rice Improves Its Genome and Proteome Annotation

Mo Xian Chen, Fu Yuan Zhu, Bei Gao, Kai Long Ma, Youjun Zhang, Alisdair R. Fernie, Xi Chen, Lei Dai, I. Neng Hui Ye, Xue Zhang, Yuan Tian, Di Zhang, Shi Xiao, Jianhua Zhang, Ying Gao Liua*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

44 Citations (Scopus)


Rice (Oryza sativa) molecular breeding has gained considerable attention in recent years, but inaccurate genome annotation hampers its progress and functional studies of the rice genome. In this study, we applied single-molecule long-read RNA sequencing (lrRNA seq)-based proteogenomics to reveal the complexity of the rice transcriptome and its coding abilities. Surprisingly, approximately 60% of loci identified by lrRNA seq are associated with natural antisense transcripts (NATs). The high-density genomic arrangement of NAT genes suggests their potential roles in the multifaceted control of gene expression. In addition, a large number of fusion and intergenic transcripts have been observed. Furthermore, 906,456 transcript isoforms were identified, and 72.9% of the genes can generate splicing isoforms. A total of 706,075 posttranscriptional events were subsequently categorized into 10 subtypes, demonstrating the interdependence of posttranscriptional mechanisms that contribute to transcriptome diversity. Parallel short-read RNA sequencing indicated that lrRNA seq has a superior capacity for the identification of longer transcripts. In addition, over 190,000 unique peptides belonging to 9,706 proteoforms/protein groups were identified, expanding the diversity of the rice proteome. Our findings indicate that the genome organization, transcriptome diversity, and coding potential of the rice transcriptome are far more complex than previously anticipated.

Original languageEnglish
Pages (from-to)1510-1526
Number of pages17
JournalPlant Physiology
Issue number3
Publication statusPublished - Mar 2020

Scopus Subject Areas

  • Physiology
  • Genetics
  • Plant Science


Dive into the research topics of 'Full-Length Transcript-Based Proteogenomics of Rice Improves Its Genome and Proteome Annotation'. Together they form a unique fingerprint.

Cite this