Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs

Changrong Xiao*, Wenxing Ma, Qingping Song, Sean Xin Xu, Kunpeng Zhang, Yufang Wang, Qi Fu

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Receiving timely and personalized feedback is essential for second-language learners, especially when human instructors are unavailable. This study explores the effectiveness of Large Language Models (LLMs), including both proprietary and open-source models, for Automated Essay Scoring (AES). Through extensive experiments with public and private datasets, we find that while LLMs do not surpass conventional state-of-the-art (SOTA) grading models in performance, they exhibit notable consistency, generalizability, and explainability. We propose an open-source LLM-based AES system, inspired by the dual-process theory. Our system offers accurate grading and high-quality feedback, at least comparable to that of fine-tuned proprietary LLMs, in addition to its ability to alleviate misgrading. Furthermore, we conduct human-AI co-grading experiments with both novice and expert graders. We find that our system not only automates the grading process but also enhances the performance and efficiency of human graders, particularly for essays where the model has lower confidence. These results highlight the potential of LLMs to facilitate effective human-AI collaboration in the educational context, potentially transforming learning experiences through AI-generated feedback.

Original languageEnglish
Title of host publication Proceedings of the 15th International Learning Analytics and Knowledge Conference, LAK 2025
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages293-305
Number of pages13
ISBN (Electronic)9798400707018
DOIs
Publication statusPublished - 3 Mar 2025
Event15th International Conference on Learning Analytics and Knowledge, LAK 2025 - Dublin, Ireland
Duration: 3 Mar 20257 Mar 2025
https://dl.acm.org/doi/proceedings/10.1145/3706468 (Conference Proceedings)

Publication series

NameProceedings of the International Learning Analytics and Knowledge Conference, LAK

Conference

Conference15th International Conference on Learning Analytics and Knowledge, LAK 2025
Country/TerritoryIreland
CityDublin
Period3/03/257/03/25
Internet address

User-Defined Keywords

  • LLM Application
  • Automatic Essay Scoring
  • AI-assisted Learning

Fingerprint

Dive into the research topics of 'Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs'. Together they form a unique fingerprint.

Cite this