Automatic Scoring of an Open-Response Measure of Advanced Mind-Reading Using Large Language Models

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

A rigorous psychometric approach is crucial for the accurate measurement of mind-reading abilities. Traditional scoring methods for such tests, which involve lengthy free-text responses, require considerable time and human effort. This study investigates the use of large language models (LLMs) to automate the scoring of psychometric tests. Data were collected from participants aged 13 to 30 years and scored by trained human coders to establish a benchmark. We evaluated multiple LLMs against human assessments, exploring various prompting strategies to optimize performance and fine-tuning the models using a subset of the collected data to enhance accuracy. Our results demonstrate that LLMs can assess advanced mind-reading abilities with over 90% accuracy on average. Notably, on most test items, the LLMs achieved higher kappa agreement with the lead coder than two trained human coders did, highlighting their potential to reliably score open-response psychometric tests.
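The two figures reported in the abstract, accuracy and kappa agreement with the lead coder, can be illustrated with a minimal sketch. It assumes unweighted Cohen's kappa, since the abstract says only "Kappa agreement" and the paper may use a different variant. The function names and the toy scores are hypothetical and are not taken from the study.

```python
from collections import Counter


def accuracy(reference, predicted):
    """Fraction of responses where the two raters give the same score."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("score lists must be non-empty and of equal length")
    matches = sum(r == p for r, p in zip(reference, predicted))
    return matches / len(reference)


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa (an assumption; the abstract does not name the variant)."""
    n = len(rater_a)
    observed = accuracy(rater_a, rater_b)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Agreement expected by chance, from each rater's marginal score frequencies.
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    if expected == 1.0:
        # Both raters used one identical category throughout; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical item scores (0, 1 or 2) for ten responses; not data from the study.
lead_coder = [2, 1, 0, 2, 1, 0, 2, 2, 1, 0]
llm_scores = [2, 1, 0, 2, 1, 1, 2, 2, 1, 0]

print("accuracy:", accuracy(lead_coder, llm_scores))
print("kappa:", cohen_kappa(lead_coder, llm_scores))
```

Kappa corrects raw agreement for the agreement expected by chance, which is why the two numbers can diverge when some scores are much more common than others.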
Original language: English
Title of host publication: Proceedings of the 10th Workshop on Computational Linguistics and Clinical Psychology (CLPsych 2025)
Publisher: Association for Computational Linguistics, ACL
Pages: 79–89
Number of pages: 10
ISBN (Electronic): 9798891762268
Publication status: Published - May 2025
Event: The Workshop on Computational Linguistics and Clinical Psychology - Albuquerque, United States
Duration: 3 May 2025 to 3 May 2025
Conference number: 10
https://clpsych.org/call-for-papers/

Conference

Conference: The Workshop on Computational Linguistics and Clinical Psychology
Abbreviated title: CLPsych 2025
Country/Territory: United States
City: Albuquerque
Period: 3/05/25 to 3/05/25

Keywords

  • mind-reading
  • large language models
