TY - JOUR
T1 - Monitoring street-level improper dumpsites via a multi-modal and LLM-based framework
AU - Zhang, Siwei
AU - Ma, Jun
AU - Jiang, Feifeng
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
Funding Information:
The authors would like to acknowledge the support from the University of Hong Kong Seed Fund for PI Research - Basic Research (No. 2302101581) and the Research Postgraduate Student Innovation Award. Additionally, the authors extend their sincere thanks to the editors and the anonymous reviewers whose constructive feedback significantly enhanced the quality of this manuscript.
PY - 2025/5/15
Y1 - 2025/5/15
N2 - Effective monitoring and management of urban improper dumpsites have become increasingly critical due to the rising volumes of solid waste and their adverse environmental and public health impacts. Identifying the locations and types of street-level dumpsites is a necessary first step for waste management; however, existing studies lack automated and accurate methods for detecting and categorizing these sites. As a result, governments face substantial labor and financial burdens in managing illegal dumping. To address these gaps, this study presents MultiSense DumpSpotter, a novel cascade model framework that integrates a multimodal deep learning architecture with Large Language Models (LLMs) to identify, classify, and analyze improper dumpsites with greater accuracy than traditional unimodal vision models. To support this framework, we developed UrbanDumpSight, the first annotated street-level urban dumpsite dataset, consisting of over 4000 street view images with metadata that includes geospatial and demographic information. This study contribute to the literature by demonstrating the effectiveness of multimodal data fusion in urban studies and the potential of LLMs in interpreting urban semantics. From a practical standpoint, it introduces a deployable, user-friendly system designed to meet the needs of urban managers, enabling efficient monitoring of improper dumping hotspots, uncovering root causes, and facilitating the implementation of effective governance actions. Overall, this research provides a novel and scalable solution for addressing urban waste challenges, offering insights to support sustainable waste management and policy-making.
AB - Effective monitoring and management of urban improper dumpsites have become increasingly critical due to the rising volumes of solid waste and their adverse environmental and public health impacts. Identifying the locations and types of street-level dumpsites is a necessary first step for waste management; however, existing studies lack automated and accurate methods for detecting and categorizing these sites. As a result, governments face substantial labor and financial burdens in managing illegal dumping. To address these gaps, this study presents MultiSense DumpSpotter, a novel cascade model framework that integrates a multimodal deep learning architecture with Large Language Models (LLMs) to identify, classify, and analyze improper dumpsites with greater accuracy than traditional unimodal vision models. To support this framework, we developed UrbanDumpSight, the first annotated street-level urban dumpsite dataset, consisting of over 4000 street view images with metadata that includes geospatial and demographic information. This study contribute to the literature by demonstrating the effectiveness of multimodal data fusion in urban studies and the potential of LLMs in interpreting urban semantics. From a practical standpoint, it introduces a deployable, user-friendly system designed to meet the needs of urban managers, enabling efficient monitoring of improper dumping hotspots, uncovering root causes, and facilitating the implementation of effective governance actions. Overall, this research provides a novel and scalable solution for addressing urban waste challenges, offering insights to support sustainable waste management and policy-making.
KW - Deep learning
KW - Multi-modality
KW - Street view images
KW - Street-level dumpsites
KW - Urban waste management
KW - Waste monitoring
UR - https://www.scopus.com/pages/publications/86000338301
U2 - 10.1016/j.resconrec.2025.108227
DO - 10.1016/j.resconrec.2025.108227
M3 - Journal article
SN - 0921-3449
VL - 218
JO - Resources, Conservation and Recycling
JF - Resources, Conservation and Recycling
M1 - 108227
ER -