Trustworthy AI under Imperfect Web Data

Jiangchao Yao, Feng Liu, Bo Han

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Trustworthy AI is crucial for web applications, since it ensures user data privacy, enhances security, and fosters user confidence. As web applications increasingly rely on AI for personalization and decision-making, maintaining transparency and accountability becomes essential to prevent bias, misinformation, and unethical practices. By building trust, developers can create safer and more reliable experiences, ultimately promoting user engagement and satisfaction. However, when dataset sizes grow bigger with the rapid web data collection, it is laborious and expensive to obtain perfect data (e.g., clean, safe, and balanced data). As a result, the volume of imperfect data becomes enormous, e.g., web-scale image and speech data with noisy labels, images with specific noise, and long-tail-distributed data. However, standard learning methods assumes that the supervised information is fully correct and intact. Therefore, imperfect data harms the performance of most of the standard learning algorithms, and sometimes even makes existing algorithms break down. In this tutorial, we focus on the algorithmic design of trustworthy AI when facing three types of imperfect data: noisy data, adversarial data, and long-tailed data in the real-world web applications.
Original languageEnglish
Title of host publicationWWW '25: Companion Proceedings of the ACM on Web Conference 2025
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages65-68
Number of pages4
ISBN (Electronic)9798400713316
ISBN (Print)9798400713316
DOIs
Publication statusPublished - 23 May 2025
EventThe ACM Web Conference, WWW 2025 - International Convention & Exhibition Centre, Sydney, Australia
Duration: 28 Apr 20252 May 2025
https://www2025.thewebconf.org/ (Conference website)
https://dl.acm.org/doi/proceedings/10.1145/3696410 (Conference proceedings)

Publication series

NameCompanion Proceedings of the ACM on Web Conference
PublisherAssociation for Computing Machinery

Conference

ConferenceThe ACM Web Conference, WWW 2025
Abbreviated titleWWW '25
Country/TerritoryAustralia
CitySydney
Period28/04/252/05/25
Internet address

User-Defined Keywords

  • imperfect web data
  • trustworthy learning
  • trustworthy web AI
  • Imperfect Web Data
  • Trustworthy Learning
  • Trustworthy Web AI

Fingerprint

Dive into the research topics of 'Trustworthy AI under Imperfect Web Data'. Together they form a unique fingerprint.

Cite this