Resolution-Adaptive Binning Enhances Machine Learning Modeling by Interbatch and Multiplatform Orbitrap-Based Shotgun Mass Spectrometry Data Integration

  • Hiu-Lok Ngan
  • , Jialing Zhang
  • , Kenneth Kin-Leung Kwan
  • , Jacinth Wing-Sum Cheu
  • , Li Zhong
  • , Yike Guo
  • , Xian Yang
  • , Carmen Chak-Lui Wong*
  • , Hong Yan*
  • , Zongwei Cai*
  • *Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Machine learning (ML) modeling on mass spectrometry (MS)-based shotgun data facilitates feature selection and disease modeling. However, batch-specific models often struggle with limited transferability and generalizability, necessitating data integration from multiple batches and platforms. Traditional binning methods can either disintegrate or aggregate m/z features, making data combination unreliable. In this study, we introduce a mass resolution-adaptive binning and integration strategy to overcome these challenges. This approach recovers 88-99% of ground truth features in a low mass region (70-434 m/z) from 49 mixed standard solutions at 250, 500, and 1000 ppb. Compared to conventional methods, it demonstrates stable binning and integration across low (100-450 m/z), mid (450-900 m/z), and high (900-1500 m/z) mass regions, resulting in superior predictive models. Using a mouse model of hepatocellular carcinoma as a proof-of-concept study, we identify 10 generic metabolites that showcase advancements in using ambient MS imaging (MSI) data for modeling and deploy the attained model to shotgun data. This facilitates disease detection via various sample introduction methods, including MSI on liver cryosections (F1 score = 0.87) and glass smears (F1 score = 0.80), as well as rapid direct infusion analysis (recall = 0.89 and precision = 0.63). This novel mass resolution-adaptive binning and integration strategy offers a promising approach for integrating different data sets, potentially improving disease detection accuracy in MS applications.

Original languageEnglish
Pages (from-to)26877-26885
Number of pages9
JournalAnalytical Chemistry
Volume97
Issue number48
Early online date25 Nov 2025
DOIs
Publication statusPublished - 9 Dec 2025

Fingerprint

Dive into the research topics of 'Resolution-Adaptive Binning Enhances Machine Learning Modeling by Interbatch and Multiplatform Orbitrap-Based Shotgun Mass Spectrometry Data Integration'. Together they form a unique fingerprint.

Cite this