Jagged competencies: Measuring the reliability of generative AI in academic research

Thomas, Llewellyn; Romasanta, Angelo Kenneth; Pujol Priego, Laia; Thomas, Llewellyn; Romasanta, Angelo Kenneth; Pujol Priego, Laia

doi:https://doi.org/10.1016/j.jbusres.2025.115804

Jagged competencies: Measuring the reliability of generative AI in academic research

Para acceder a los documentos con el texto completo, por favor, siga el siguiente enlace: https://hdl.handle.net/20.500.14342/6069

Autor/a

Thomas, Llewellyn

Romasanta, Angelo Kenneth

Pujol Priego, Laia

Otros/as autores/as

Universitat Ramon Llull. Esade

Fecha de publicación

2026-01

Resumen

Large Language Models (LLMs) are increasingly viewed as a valuable tool for academic research. While LLMs have some benefits, a ‘crisis of replicability’ in management scholarship mitigates against unrestrained use. In this paper we investigate the reproducibility of LLM analyses. We analyze three LLMs—ChatGPT, Claude and Mistral—over fifteen weeks, testing the consistency, accuracy and their interaction using the same prompts on the same data corpus. While our results demonstrate significant variations in reliability and consistency across the three LLMs, we also show that LLMs can exhibit deterministic and reliable behavior under specific, well-defined constraints. We argue that replicable LLM-based research will rely on understanding and validating the task-specific operational boundaries of the LLM. To ensure the responsible integration of LLMs into management research, we highlight a need for robust frameworks, transparency, ethical guidelines, and ongoing evaluation. We conclude with actionable guidance for management researchers.

Tipo de documento

Artículo

Versión del documento

Versión publicada

Lengua

Inglés

Materias y palabras clave

Generative AI; LLM; Replication; Reproducibility; Consistency; Accuracy

Páginas

14 p.

Publicado por

Elsevier Inc.

Publicado en

Journal of Business Research, Vol. 203, 115804

Citación recomendada

Esta citación se ha generado automáticamente.

Exportar

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Derechos

Attribution-NonCommercial-NoDerivatives 4.0 International

Este ítem aparece en la(s) siguiente(s) colección(ones)

Esade [299]

Jagged competencies: Measuring the reliability of generative AI in academic research

Autor/a

Otros/as autores/as

Fecha de publicación

Compartir

Resumen

Tipo de documento

Versión del documento

Lengua

Materias y palabras clave

Páginas

Publicado por

Publicado en

Citación recomendada

Exportar

Derechos

Este ítem aparece en la(s) siguiente(s) colección(ones)