Evaluation of ChatGPT-4 for the detection of surgical site infections from electronic health records after colorectal surgery: a pilot diagnostic accuracy study

dc.contributor.author
Badia, Josep M.
dc.contributor.author
Casanova-Portoles, Daniel
dc.contributor.author
Membrilla Fernández, Estela
dc.contributor.author
Rubiés, Carles
dc.contributor.author
Pujol, Miquel
dc.contributor.author
Sancho, Joan J.
dc.date.accessioned
2026-01-22T01:38:05Z
dc.date.available
2026-01-22T01:38:05Z
dc.date.issued
2026-01-20T14:48:24Z
dc.date.issued
2026-01-20T14:48:24Z
dc.date.issued
2025
dc.date.issued
2026-01-20T14:48:24Z
dc.identifier
Badia JM, Casanova-Portoles D, Membrilla E, Rubiés C, Pujol M, Sancho J. Evaluation of ChatGPT-4 for the detection of surgical site infections from electronic health records after colorectal surgery: a pilot diagnostic accuracy study. J Infect Public Health. 2025;18(2):102627. DOI: 10.1016/j.jiph.2024.102627
dc.identifier
1876-0341
dc.identifier
https://hdl.handle.net/10230/72296
dc.identifier
http://dx.doi.org/10.1016/j.jiph.2024.102627
dc.identifier.uri
http://hdl.handle.net/10230/72296
dc.description.abstract
Background: Surveillance of surgical site infection (SSI) relies on manual methods that are time-consuming and prone to subjectivity. This study evaluates the diagnostic accuracy of ChatGPT for detecting SSI from electronic health records after colorectal surgery via comparison with the results of a nationwide surveillance programme. Methods: This pilot, retrospective, multicentre analysis included 122 patients who underwent colorectal surgery. Patient records were reviewed by both manual surveillance and ChatGPT, which was tasked with identifying SSI and categorizing them as superficial, deep, or organ-space infections. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Receiver operating characteristic (ROC) curve analysis determined the model's diagnostic performance. Results: ChatGPT achieved a sensitivity of 100 %, correctly identifying all SSIs detected by manual methods. The specificity was 54 %, indicating the presence of false positives. The PPV was 67 %, and the NPV was 100 %. The area under the ROC curve was 0.77, indicating good overall accuracy for distinguishing between SSI and non-SSI cases. Minor differences in outcomes were observed between colon and rectal surgeries, as well as between the hospitals participating in the study. Conclusions: ChatGPT shows high sensitivity and good overall accuracy for detecting SSI. It appears to be a useful tool for initial screenings and for reducing manual review workload. The moderate specificity suggests a need for further refinement to reduce the rate of false positives. The integration of ChatGPT alongside electronic medical records, antibiotic consumption and imaging data results for real-time analysis may further improve the surveillance of SSI. Clinicaltrials: gov Identifier: NCT06556017.
dc.format
application/pdf
dc.format
application/pdf
dc.language
eng
dc.publisher
Elsevier
dc.relation
Journal of Infection and Public Health. 2025;18(2):102627
dc.rights
© 2024 The Author(s). Published by Elsevier Ltd on behalf of King Saud Bin Abdulaziz University for Health Sciences. This is an open access article under the CC BY-NCND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
dc.rights
http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
Accuracy
dc.subject
Artificial intelligence
dc.subject
ChatGPT
dc.subject
Diagnosis
dc.subject
LLM
dc.subject
Large Language Model
dc.subject
NLP
dc.subject
Natural language processing
dc.subject
OpenAI
dc.subject
Sensitivity and specificity
dc.subject
Surgical site infection
dc.title
Evaluation of ChatGPT-4 for the detection of surgical site infections from electronic health records after colorectal surgery: a pilot diagnostic accuracy study
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)