Speaker
Description
The rapid advancement of large language models (LLMs) has enabled automated psychological scale development, yet questions remain about how well in silico validation corresponds to validation with human-gathered data. This study examines whether structural validity metrics computed during automated item development match empirical validation results. Using AI-GENIE (Automatic Item Generation and Validation via Network-Integrated Evaluation), we generated Big Five personality items with five LLMs (Mixtral, Gemma 2, Llama 3, GPT-3.5, GPT-4). AI-GENIE performed in silico structural validation during item generation and selection. The resulting items were then administered to independent U.S. samples (N = 1,000 per model). Comparing the in silico and empirical structural validity metrics revealed strong correspondence across all models (average correlation r = .89, RMSE = 0.08). Network invariance tests between the in silico and human-gathered data indicated configural (NCT = 0.12, p > .05) and metric invariance (NCT = 0.15, p > .05). These findings suggest that AI-GENIE's in silico structural validation effectively predicts empirical structural validity, supporting its use in automated scale development.