Comparative Study of GenAI (ChatGPT) vs. Human in Generating Multiple Choice Questions Based on the PIRLS Reading Assessment Framework

  • Yu Yan Lam*
  • , Samuel Kai Wah Chu
  • , Elsie Li Chen Ong
  • , Winnie Wing Lam Suen
  • , Lingran Xu
  • , Lavender Chin Lui Lam
  • , Scarlett Man Yu Wong
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

1 Citation (Scopus)

Abstract

Human-generated multiple-choice questions (MCQs) are commonly used to ensure objective evaluation in education. However, generating high-quality questions is difficult and time-consuming. Generative artificial intelligence (GenAI) has emerged as an automated approach for question generation, but challenges remain in terms of biases and diversity in training data. This study aims to compare the quality of GenAI-generated MCQs with humans-created ones. In Part 1 of this study, 16 MCQs were created by humans and GenAI individually with alignment to the Progress in International Reading Literacy Study (PIRLS) assessment framework. In Part 2, the quality of MCQs generated was assessed based on the clarity, appropriateness, suitability, and alignment to PIRLS by four assessors. Wilcoxon rank sum tests were conducted to compare GenAI versus humans generated MCQs. The findings highlight GenAI's potential as it was difficult to differentiate from human created questions and offer recommendations for integrating AI technology for the future.

Original languageEnglish
Title of host publicationAssociation for Information Science & Technology 2024 87th Annual Meeting, assis&t 2024
EditorsHeather O’Brien, June Abbas
PublisherWiley
Pages537-540
Number of pages4
DOIs
Publication statusPublished - Oct 2024
Event87th Annual Meeting of the Association for Information Science and Technology - Calgary, Canada
Duration: 25 Oct 202429 Oct 2024
https://www.asist.org/meetings-events/am/am24/ (Conference website)
https://asistdl.onlinelibrary.wiley.com/toc/23739231/2024/61/1 (Conference proceeding)

Publication series

NameProceedings of the Association for Information Science and Technology
PublisherJohn Wiley & Sons Ltd
Number1
Volume61
ISSN (Print)2373-9231
ISSN (Electronic)2373-9231

Conference

Conference87th Annual Meeting of the Association for Information Science and Technology
Abbreviated titleASIS&T 2024
Country/TerritoryCanada
CityCalgary
Period25/10/2429/10/24
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education

User-Defined Keywords

  • GenAI
  • PIRLS
  • question assessment
  • question creation
  • Reading

Fingerprint

Dive into the research topics of 'Comparative Study of GenAI (ChatGPT) vs. Human in Generating Multiple Choice Questions Based on the PIRLS Reading Assessment Framework'. Together they form a unique fingerprint.

Cite this