The role of large language models (LLMs) in providing triage for maxillofacial trauma cases: a preliminary study

Frosolini, Andrea; Catarzi, Lisa; Benedetti, Simone; Latini, Linda; Chisci, Glauco; Franz, Leonardo; Gennaro, Paolo; Gabriele, Guido

doi:10.3390/diagnostics14080839

SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.

RSS Feed

HELP: Tutorials | FAQ

CONTACT US: Contact info

Search Results

Journal Article

The role of large language models (LLMs) in providing triage for maxillofacial trauma cases: a preliminary study
Citation	Frosolini A, Catarzi L, Benedetti S, Latini L, Chisci G, Franz L, Gennaro P, Gabriele G. Diagnostics (Basel) 2024; 14(8).
Copyright	(Copyright © 2024, MDPI: Multidisciplinary Digital Publishing Institute)
DOI	10.3390/diagnostics14080839
PMID	38667484
PMCID	PMC11048758
Abstract	BACKGROUND: In the evolving field of maxillofacial surgery, integrating advanced technologies like Large Language Models (LLMs) into medical practices, especially for trauma triage, presents a promising yet largely unexplored potential. This study aimed to evaluate the feasibility of using LLMs for triaging complex maxillofacial trauma cases by comparing their performance against the expertise of a tertiary referral center. METHODS: Utilizing a comprehensive review of patient records in a tertiary referral center over a year-long period, standardized prompts detailing patient demographics, injury characteristics, and medical histories were created. These prompts were used to assess the triage suggestions of ChatGPT 4.0 and Google GEMINI against the center's recommendations, supplemented by evaluating the AI's performance using the QAMAI and AIPI questionnaires. RESULTS: The results in 10 cases of major maxillofacial trauma indicated moderate agreement rates between LLM recommendations and the referral center, with some variances in the suggestion of appropriate examinations (70% ChatGPT and 50% GEMINI) and treatment plans (60% ChatGPT and 45% GEMINI). Notably, the study found no statistically significant differences in several areas of the questionnaires, except in the diagnosis accuracy (GEMINI: 3.30, ChatGPT: 2.30; p = 0.032) and relevance of the recommendations (GEMINI: 2.90, ChatGPT: 3.50; p = 0.021). A Spearman correlation analysis highlighted significant correlations within the two questionnaires, specifically between the QAMAI total score and AIPI treatment scores (rho = 0.767, p = 0.010). CONCLUSIONS: This exploratory investigation underscores the potential of LLMs in enhancing clinical decision making for maxillofacial trauma cases, indicating a need for further research to refine their application in healthcare settings. Language: en
Keywords	AIPI; ChatGPT; GEMINI; Large Language Models (LLM); maxillofacial; maxillofacial surgery; QAMAI; trauma; triage

BACK TO RESULTS

NEW SEARCH
Download this record to:
RIS | BibTeX | EndNote

All SafetyLit records are available for automatic download to Zotero & Mendeley

Print
Email

Find full text at...

- Direct link (DOI)
- Publisher website
PubMed Central
- Google Scholar
- Inter-Library Document Request Form (pdf)