Abstract
Understanding item difficulty in educational assessment is crucial for test development, teaching, and learning. While prior research has used text features and machine learning to model the difficulty of reading comprehension items (Štěpánek et al., 2023), listening comprehension (LC) presents additional complexity: beyond textual factors, LC item difficulty can be influenced by auditory factors such as speech rate and voice characteristics.
In this study, we analyze data from the English, German, and French listening sections of the Czech Matura exams. We extract text features commonly used in reading comprehension difficulty modeling, including readability indices and similarity measures. Additionally, we incorporate audio-specific features, such as speech rate, confidence scores from automatic speech recognition (ASR) models, and voice distinguishability derived from a speaker diarization model. This feature set allows us to capture both linguistic and auditory influences on LC item difficulty.
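The abstract does not specify tooling, so the following is a minimal sketch of how such a combined text/audio feature extractor might look, assuming the item transcript is available as text, using textstat for a readability index and openai-whisper segment log-probabilities as an ASR confidence proxy. All function and feature names are illustrative, not the authors' implementation; the diarization-based voice-distinguishability feature (e.g. via pyannote.audio) is noted in a comment but omitted to keep the sketch self-contained.

```python
import textstat  # readability indices computed on the transcript text
import whisper   # openai-whisper; ASR confidence via segment log-probabilities

def extract_features(transcript: str, audio_path: str) -> dict:
    """Illustrative text + audio feature extraction for one LC item."""
    # Note: textstat defaults to English; textstat.set_lang("de") / ("fr")
    # would be needed for the German and French sections.
    model = whisper.load_model("base")
    segments = model.transcribe(audio_path)["segments"]

    # Speech rate: recognized words per second over the spoken span.
    n_words = sum(len(seg["text"].split()) for seg in segments)
    duration = segments[-1]["end"] - segments[0]["start"] if segments else 0.0
    speech_rate = n_words / duration if duration > 0 else 0.0

    # ASR confidence proxy: mean average log-probability across segments.
    asr_conf = (sum(seg["avg_logprob"] for seg in segments) / len(segments)
                if segments else 0.0)

    return {
        "flesch_reading_ease": textstat.flesch_reading_ease(transcript),
        "speech_rate_wps": speech_rate,
        "asr_mean_avg_logprob": asr_conf,
        # Voice distinguishability would be derived from a speaker
        # diarization model (e.g. pyannote.audio); omitted here.
    }
```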
Using these features, we train a regression model to predict final LC item difficulty as estimated from student response data. The best-performing model is selected by comparing accuracy across candidate models and hyperparameter settings, and its predictions are compared with those of content experts to assess the alignment between computational modeling and human evaluation.
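The abstract leaves the regressor family and accuracy measure unspecified; a minimal sketch of such a selection loop, assuming scikit-learn candidates and cross-validated RMSE (the candidate list and grids are illustrative only):

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV

def select_difficulty_model(X: np.ndarray, y: np.ndarray):
    """Pick the best regressor/hyperparameter combination by CV RMSE."""
    candidates = [
        (Ridge(), {"alpha": [0.1, 1.0, 10.0]}),
        (RandomForestRegressor(random_state=0),
         {"n_estimators": [100, 300], "max_depth": [None, 5, 10]}),
    ]
    best_score, best_model = -np.inf, None
    for estimator, grid in candidates:
        search = GridSearchCV(estimator, grid, cv=5,
                              scoring="neg_root_mean_squared_error")
        search.fit(X, y)
        if search.best_score_ > best_score:
            best_score, best_model = search.best_score_, search.best_estimator_
    # Return the winning configuration and its cross-validated RMSE.
    return best_model, -best_score
```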
The model is integrated into an interactive application to aid exam development. This tool aims to give educators and test developers insight into LC item difficulty, supporting more informed test design.
Keywords: Item Difficulty Modeling, NLP, ML