Gelişmiş Arama

Basit öğe kaydını göster

dc.contributor.authorYilmaz, Busra Nur Gokkurt
dc.contributor.authorOzbey, Furkan
dc.contributor.authorYilmaz, Birkan Eyup
dc.date.accessioned2025-12-28T16:39:53Z
dc.date.available2025-12-28T16:39:53Z
dc.date.issued2025
dc.identifier.issn0930-1038
dc.identifier.issn1279-8517
dc.identifier.urihttps://doi.org/10.1007/s00276-025-03723-8
dc.identifier.urihttps://hdl.handle.net/20.500.12933/2188
dc.description.abstractPurpose The aim of this study was to assess the performance of various Large Language Models (LLMs) in addressing head and neck anatomy questions from the Dental Specialization Exam (DUS), conducted between 2012 and 2021. Methods A total of 103 multiple-choice questions were selected from DUS examinations over a decade. These questions covered major topics: Musculoskeletal System, Nervous System and Sensory Organs, Dental Anatomy, and Veins, Arteries, Lymphatic System and the Glandular System. Eight of the LLMs Gemini 1.5, Gemini 2, Copilot, Deepseek, Claude, ChatGPT 4o, ChatGPT 4, and ChatGPT o1 were employed using their most updated versions. Each model's accuracy was calculated by comparing the number of correct and incorrect responses. Results The ChatGPT o1 demonstrated the highest accuracy rate among all tested models, while Gemini 1.5 showed the lowest accuracy. These differences were found to be statistically significant (p = 0.027). Post-hoc analysis revealed that the only statistically significant difference among the LLMs was between ChatGPT o1 and Gemini 1.5 (p < 0.0031). When questions were analyzed by topic, no significant accuracy differences emerged in the Musculoskeletal System section. However, the ChatGPT o1 performed best in the Nervous System and Sensory Organs category. For Dental Anatomy questions, both ChatGPT o1 and Copilot achieved top results, and for Veins, Arteries, Lymphatic System and Glandular System section, ChatGPT o1 again excelled. Conclusion Overall, the findings show that LLMs effectively answer DUS head and neck anatomy questions with comparable performance. These insights support future exam-related model development and suggest that LLMs can serve as valuable educational tools.
dc.language.isoen
dc.publisherSpringer France
dc.relation.ispartofSurgical And Radiologic Anatomy
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectLarge language models
dc.subjectDentistry specialization exam
dc.subjectHead and neck anatomy
dc.titleEvaluation of the performance of different large language models on head and neck anatomy questions in the dentistry specialization exam in Turkey
dc.typeArticle
dc.departmentAfyonkarahisar Sağlık Bilimleri Üniversitesi
dc.identifier.doi10.1007/s00276-025-03723-8
dc.identifier.volume47
dc.identifier.issue1
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.department-temp[Yilmaz, Busra Nur Gokkurt] Giresun Oral & Dent Hlth Ctr, Dept Dentomaxillofacial Radiol, Giresun, Turkiye; [Ozbey, Furkan] Afyonkarahisar Hlth Sci Univ, Fac Dent, Dept Dentomaxillofacial Radiol, Afyonkarahisar, Turkiye; [Yilmaz, Birkan Eyup] Giresun Univ, Fac Dent, Dept Oral & Maxillofacial Surg, Giresun, Turkiye
dc.identifier.pmid40983800
dc.identifier.scopus2-s2.0-105016768608
dc.identifier.scopusqualityQ2
dc.identifier.wosWOS:001577790500001
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.indekslendigikaynakPubMed
dc.snmzKA_WoS_20251227


Bu öğenin dosyaları:

DosyalarBoyutBiçimGöster

Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster