PIRLS Category-specific Question Generation for Reading Comprehension

Loading...
Thumbnail Image

Journal Title

Journal ISSN

Volume Title

Publisher

University of Tartu Library

Abstract

According to the internationally recognized PIRLS (Progress in International Reading Literacy Study) assessment standards, reading comprehension questions should encompass all four comprehension processes: retrieval, inferencing, integrating and evaluation. This paper investigates whether Large Language Models can produce high-quality questions for each of these categories. Human assessment on a Chinese dataset shows that GPT-4o can generate usable and category-specific questions, ranging from 74% to 90% accuracy depending on the category.

Description

Keywords

Citation