TY - JOUR
T1 - Model-Induced Generalization Error Bound for Information-Theoretic Representation Learning in Source-Data-Free Unsupervised Domain Adaptation
AU - Yang, Baoyao
AU - Yeh, Hao-Wei
AU - Harada, Tatsuya
AU - Yuen, Pong Chi
N1 - This work was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 62102098, in part by the Hong Kong Research Grants Council General Research Fund under Grant RGC/HKBU12200518, in part by the Japan Science and Technology Agency (JST) Advanced Intelligence Project (AIP)
Acceleration Research under Grant JPMJCR20U3, in part by the Japan Society
for the Promotion of Science (JSPS) KAKENHI under Grant JP19H01115, in part by the Basic Research Grant (Super AI) of the Institute for AI and Beyond, The University of Tokyo, as well as the Science and Technology Planning Project of Guangdong under Grant 2019A050510041, in part by the Science and Technology Planning Project of Guangzhou under Grant 202103000034 and Grant 202002020090, and in part by the scholarship organized by Japan-Taiwan Exchange Association.
Publisher Copyright:
© 1992-2012 IEEE.
PY - 2022/1
Y1 - 2022/1
N2 - Many unsupervised domain adaptation (UDA) methods have been developed and have achieved promising results in various pattern recognition tasks. However, most existing methods assume that raw source data are available in the target domain when transferring knowledge from the source to the target domain. Due to the emerging regulations on data privacy, the availability of source data cannot be guaranteed when applying UDA methods in a new domain. The lack of source data makes UDA more challenging, and most existing methods are no longer applicable. To handle this issue, this paper analyzes the cross-domain representations in source-data-free unsupervised domain adaptation (SF-UDA). A new theorem is derived to bound the target-domain prediction error using the trained source model instead of the source data. On the basis of the proposed theorem, information bottleneck theory is introduced to minimize the generalization upper bound of the target-domain prediction error, thereby achieving domain adaptation. The minimization is implemented in a variational inference framework using a newly developed latent alignment variational autoencoder (LA-VAE). The experimental results show good performance of the proposed method in several cross-dataset classification tasks without using source data. Ablation studies and feature visualization also validate the effectiveness of our method in SF-UDA.
AB - Many unsupervised domain adaptation (UDA) methods have been developed and have achieved promising results in various pattern recognition tasks. However, most existing methods assume that raw source data are available in the target domain when transferring knowledge from the source to the target domain. Due to the emerging regulations on data privacy, the availability of source data cannot be guaranteed when applying UDA methods in a new domain. The lack of source data makes UDA more challenging, and most existing methods are no longer applicable. To handle this issue, this paper analyzes the cross-domain representations in source-data-free unsupervised domain adaptation (SF-UDA). A new theorem is derived to bound the target-domain prediction error using the trained source model instead of the source data. On the basis of the proposed theorem, information bottleneck theory is introduced to minimize the generalization upper bound of the target-domain prediction error, thereby achieving domain adaptation. The minimization is implemented in a variational inference framework using a newly developed latent alignment variational autoencoder (LA-VAE). The experimental results show good performance of the proposed method in several cross-dataset classification tasks without using source data. Ablation studies and feature visualization also validate the effectiveness of our method in SF-UDA.
KW - information theory
KW - representation learning
KW - Unsupervised domain adaptation (UDA)
UR - http://www.scopus.com/inward/record.url?scp=85121353119&partnerID=8YFLogxK
U2 - 10.1109/TIP.2021.3130530
DO - 10.1109/TIP.2021.3130530
M3 - Journal article
SN - 1057-7149
VL - 31
SP - 419
EP - 432
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
ER -