TY - JOUR
T1 - Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning
AU - Cai, Hongmin
AU - Huang, Weitian
AU - Yang, Sirui
AU - Ding, Siqi
AU - Zhang, Yue
AU - Hu, Bin
AU - Zhang, Fa
AU - Cheung, Yiu-ming
N1 - Funding information:
This work was supported in part by the National Key Research and Development Program of China under Grant 2022YFE0112200, in part by the Science and Technology Project of Guangdong Province under Grant 2022A0505050014, in part by the Key-Area Research and Development Program of Guangzhou City under Grant 202206030009, in part by the National Natural Science Foundation of China (NSFC) under Grants U21A20520, 62325204, and 62172112, in part by NSFC / Research Grants Council (RGC) Joint Research Scheme under Grant N_HKBU214/21, and in part by the General Research Fund of RGC under Grants 12201321 and 12202622.
Publisher copyright:
© 2023 The Authors.
PY - 2024/5
Y1 - 2024/5
N2 - In multi-view environment, it would yield missing observations due to
the limitation of the observation process. The most current
representation learning methods struggle to explore complete information
by lacking either cross-generative via simply filling in missing view
data, or solidative via inferring a consistent representation among the
existing views. To address this problem, we propose a deep generative
model to learn a complete generative latent representation, namely
Complete Multi-view Variational Auto-Encoders (CMVAE), which models the
generation of the multiple views from a complete latent variable
represented by a mixture of Gaussian distributions. Thus, the missing
view can be fully characterized by the latent variables and is resolved
by estimating its posterior distribution. Accordingly, a novel
variational lower bound is introduced to integrate view-invariant
information into posterior inference to enhance the solidative of the
learned latent representation. The intrinsic correlations between views
are mined to seek cross-view generality, and information leading to
missing views is fused by view weights to reach solidity. Benchmark
experimental results in clustering, classification, and cross-view image
generation tasks demonstrate the superiority of CMVAE, while time
complexity and parameter sensitivity analyses illustrate the efficiency
and robustness. Additionally, application to bioinformatics data
exemplifies its practical significance.
AB - In multi-view environment, it would yield missing observations due to
the limitation of the observation process. The most current
representation learning methods struggle to explore complete information
by lacking either cross-generative via simply filling in missing view
data, or solidative via inferring a consistent representation among the
existing views. To address this problem, we propose a deep generative
model to learn a complete generative latent representation, namely
Complete Multi-view Variational Auto-Encoders (CMVAE), which models the
generation of the multiple views from a complete latent variable
represented by a mixture of Gaussian distributions. Thus, the missing
view can be fully characterized by the latent variables and is resolved
by estimating its posterior distribution. Accordingly, a novel
variational lower bound is introduced to integrate view-invariant
information into posterior inference to enhance the solidative of the
learned latent representation. The intrinsic correlations between views
are mined to seek cross-view generality, and information leading to
missing views is fused by view weights to reach solidity. Benchmark
experimental results in clustering, classification, and cross-view image
generation tasks demonstrate the superiority of CMVAE, while time
complexity and parameter sensitivity analyses illustrate the efficiency
and robustness. Additionally, application to bioinformatics data
exemplifies its practical significance.
KW - deep generative models
KW - incomplete multi-view problem
KW - multi-view learning
KW - representation learning
KW - Deep generative models
UR - http://www.scopus.com/inward/record.url?scp=85181573597&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2023.3346869
DO - 10.1109/TPAMI.2023.3346869
M3 - Journal article
C2 - 38145535
SN - 0162-8828
VL - 46
SP - 3637
EP - 3652
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 5
M1 - 10373887
ER -