Reliable Inference of Phylogenomic Relationship via Assembly-Based Strategy Accommodating Raw Reads and Proteins

  • Yunlong Li*
  • Xu Liu
  • Chong Chen
  • Jian Wen Qiu
  • Kevin M. Kocot
  • Jin Sun*

*Corresponding author for this work

Research output: Contribution to journal › Journal article › peer-review

Abstract

Phylogenomics is a transformative approach in systematics, conservation biology, and biomedical research, enabling the inference of evolutionary relationships by leveraging hundreds to thousands of genes from genomic or transcriptomic data. However, acquiring high-quality genomes and transcriptomes necessitates samples with intact DNA and RNA, substantial sequencing investments, and extensive bioinformatic processing, such as genome/transcriptome assembly and annotation. This challenge is particularly pronounced for rare or difficult-to-collect species, such as those inhabiting the deep sea, where often only fragmented DNA reads are available due to environmental degradation or suboptimal preservation conditions. To address these limitations, we developed VEHoP (Versatile, Easy-to-use Homology-based Phylogenomic pipeline), a tool designed to infer protein-coding regions from diverse inputs, including raw reads (short and long), draft genomes, transcriptomes, and annotated genomes. VEHoP automates the generation of orthologous sequence alignments, concatenated matrices, and phylogenetic trees, streamlining phylogenomic analyses for researchers across disciplines. The tool expands taxonomic sampling by accommodating a wide range of input data types and simplifies phylogenomic workflows, making them accessible to researchers with varying levels of bioinformatic expertise. We validated VEHoP using datasets from oysters, catfish, and insects, demonstrating its ability to produce robust phylogenetic trees with strong bootstrap support, outperforming assembly-free methods. Additionally, we applied VEHoP to reconstruct the phylogeny of the enigmatic deep-sea gastropod order Neomphalida, resolving a well-supported phylogenetic backbone for this poorly understood group. VEHoP is freely available on GitHub (https://github.com/ylify/VEHoP) and can be installed via Bioconda or run from a preconfigured container image using Docker, Singularity, or Apptainer.

Original language: English
Article number: e70116
Number of pages: 13
Journal: Molecular Ecology Resources
Volume: 26
Issue number: 3
Publication status: Published - Apr 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 14 - Life Below Water

User-Defined Keywords

  • deep sea
  • evolution
  • phylogenomics
  • phylogeny
  • pipeline
  • reads
