Trustworthy Machine Learning: From Data to Models

Bo Han, Jiangchao Yao, Tongliang Liu, Bo Li, Sanmi Koyejo, Feng Liu

Research output: Book/ReportBook or reportpeer-review

Abstract

The success of machine learning algorithms relies not only on achieving good performance but also on ensuring trustworthiness across diverse applications and scenarios. Trustworthy machine learning seeks to handle critical problems in addressing the issues of robustness, privacy, security, reliability, and other desirable properties. The broad research area has achieved remarkable advancement and brings various emerging topics along with the progress. We present this survey to provide a systematic overview of the research problems under trustworthy machine learning covering the perspectives from data to model. Starting with fundamental data-centric learning, the survey reviews learning with noisy data, long-tailed distribution, out-of-distribution data, and adversarial examples to achieve robustness. Delving into private and secured learning, the survey elaborates on core methodologies differential privacy, different attacking threats, and learning paradigms, to realize privacy protection and enhance security. Finally, it introduces several trendy issues related to the foundation models, including jailbreak prompts, watermarking, and hallucination, as well as causal learning and reasoning. The survey integrates commonly isolated research problems in a unified manner, which provides general problem setups, detailed sub-directions, and further discussion on its challenges or future developments. We hope the comprehensive investigation presented in this survey can serve as a clear introduction for the problem evolution from data to models and also bring new insight for developing trustworthy machine learning.
Original languageEnglish
PublisherNow Publishers Inc
Number of pages173
ISBN (Electronic)9781638285496
ISBN (Print)9781638285489
DOIs
Publication statusPublished - 29 Apr 2025

Publication series

NameFoundations and Trends® in Privacy and Security
No.2-3
Volume7
ISSN (Print)2474-1558
ISSN (Electronic)2474-1566

Fingerprint

Dive into the research topics of 'Trustworthy Machine Learning: From Data to Models'. Together they form a unique fingerprint.

Cite this