TY - JOUR
T1 - Unraveling the significance of cuproptosis in hepatocellular carcinoma heterogeneity and tumor microenvironment through integrated single-cell sequencing and machine learning approaches
AU - Liu, Wang
AU - Xia, Liangjing
AU - peng, Yuan
AU - Cao, Qiang
AU - Xu, Ke
AU - Luo, Huiyan
AU - Peng, Yongjun
AU - Zhang, Yanping
N1 - This study is supported by the funding from the Anhui Provincial Health Commission (No.AHWJ2023BAa20028), the Chengdu 2024 Medical Research Project (No. 2024530), the Medical Research Foundation of Chongqing General Hospital (No. Y2023YXYJMSXM04), and the Science and Technology Research Program of Chongqing Municipal Education Commission (No. KJQN202400116).
Publisher Copyright:
© The Author(s) 2025.
PY - 2025/5/24
Y1 - 2025/5/24
N2 - Background: Hepatocellular carcinoma (HCC) exhibits pronounced heterogeneity, which significantly limits the effectiveness of precision therapies. A comprehensive understanding of the biological characteristics and molecular mechanisms underlying HCC cell subpopulations is crucial for improving prognostic predictions and refining treatment strategies. Methods: Single-cell RNA sequencing data were obtained from the GEO database and processed using the Seurat R package for quality control, including data filtering, batch effect correction, and dimensionality reduction via PCA and UMAP to visualize cell distribution and identify distinct subpopulations. Cell types were annotated using established marker genes and literature references. The GSVA method was applied to evaluate the activity of 18 programmed cell death pathways. Cell developmental trajectories were reconstructed using Monocle 2 and validated with cytoTRACE to assess differentiation potential. Metabolic pathway activity was analyzed using the scMetabolism package. Bulk RNA sequencing data from the TCGA cohort were integrated to identify prognosis-associated genes through univariate Cox regression. The malignant potential of tumor subpopulations was quantified using GSVA scoring. Weighted gene co-expression network analysis (WGCNA) was employed to identify cuproptosis-related genes. A risk scoring model was constructed using LASSO regression and multivariate Cox regression based on cuproptosis-related genes and marker genes of cuproptosis-characterized tumor cells. The model's performance was validated across TCGA, GEO, and ICGC datasets. Additionally, the relationships between risk scores, clinical characteristics, key signaling pathways, and immunotherapy responses were explored. Finally, a prognostic nomogram was developed to support clinical decision-making. Results: 12 programmed cell death pathways were enriched in tumors, with cuproptosis defining HCC, particularly in the C2 subpopulation. GSVA highlighted high-risk patient enrichment in proliferation, DNA repair, and metabolism, reflecting aggressive malignancy. Developmental trajectory and metabolic analyses confirmed greater stemness and metabolic activity in C2. TCGA linked cuproptosis-related subpopulations to poor prognosis. The risk model stratified patients (validated in TCGA/GEO/ICGC), correlating with clinical grade, T-stage, survival (HR = 2.597, 95%CI 2.051–3.289, P < 0.05). The nomogram showed strong predictive power (C-index = 0.716), aiding clinical decisions. Conclusion: The C2 subpopulation represents the most malignant subset of HCC cells, with cuproptosis serving as a defining characteristic of this subgroup. The risk scoring and nomogram models based on cuproptosis-related genes offer novel insights and a robust scientific foundation for prognostic prediction and personalized treatment in HCC patients. These findings highlight the potential of targeting cuproptosis and tumor microenvironment interactions to improve therapeutic outcomes in HCC.
AB - Background: Hepatocellular carcinoma (HCC) exhibits pronounced heterogeneity, which significantly limits the effectiveness of precision therapies. A comprehensive understanding of the biological characteristics and molecular mechanisms underlying HCC cell subpopulations is crucial for improving prognostic predictions and refining treatment strategies. Methods: Single-cell RNA sequencing data were obtained from the GEO database and processed using the Seurat R package for quality control, including data filtering, batch effect correction, and dimensionality reduction via PCA and UMAP to visualize cell distribution and identify distinct subpopulations. Cell types were annotated using established marker genes and literature references. The GSVA method was applied to evaluate the activity of 18 programmed cell death pathways. Cell developmental trajectories were reconstructed using Monocle 2 and validated with cytoTRACE to assess differentiation potential. Metabolic pathway activity was analyzed using the scMetabolism package. Bulk RNA sequencing data from the TCGA cohort were integrated to identify prognosis-associated genes through univariate Cox regression. The malignant potential of tumor subpopulations was quantified using GSVA scoring. Weighted gene co-expression network analysis (WGCNA) was employed to identify cuproptosis-related genes. A risk scoring model was constructed using LASSO regression and multivariate Cox regression based on cuproptosis-related genes and marker genes of cuproptosis-characterized tumor cells. The model's performance was validated across TCGA, GEO, and ICGC datasets. Additionally, the relationships between risk scores, clinical characteristics, key signaling pathways, and immunotherapy responses were explored. Finally, a prognostic nomogram was developed to support clinical decision-making. Results: 12 programmed cell death pathways were enriched in tumors, with cuproptosis defining HCC, particularly in the C2 subpopulation. GSVA highlighted high-risk patient enrichment in proliferation, DNA repair, and metabolism, reflecting aggressive malignancy. Developmental trajectory and metabolic analyses confirmed greater stemness and metabolic activity in C2. TCGA linked cuproptosis-related subpopulations to poor prognosis. The risk model stratified patients (validated in TCGA/GEO/ICGC), correlating with clinical grade, T-stage, survival (HR = 2.597, 95%CI 2.051–3.289, P < 0.05). The nomogram showed strong predictive power (C-index = 0.716), aiding clinical decisions. Conclusion: The C2 subpopulation represents the most malignant subset of HCC cells, with cuproptosis serving as a defining characteristic of this subgroup. The risk scoring and nomogram models based on cuproptosis-related genes offer novel insights and a robust scientific foundation for prognostic prediction and personalized treatment in HCC patients. These findings highlight the potential of targeting cuproptosis and tumor microenvironment interactions to improve therapeutic outcomes in HCC.
KW - Cuproptosis
KW - HCC
KW - Machine Learning
KW - Single-Cell Sequencing
KW - Tumor microenvironment
UR - http://www.scopus.com/inward/record.url?scp=105006839458&partnerID=8YFLogxK
U2 - 10.1007/s12672-025-02696-9
DO - 10.1007/s12672-025-02696-9
M3 - Journal article
AN - SCOPUS:105006839458
SN - 1868-8497
VL - 16
JO - Discover Oncology
JF - Discover Oncology
IS - 1
M1 - 900
ER -