Research Article

Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics

Volume: 7 Number: 5 September 15, 2025
TR EN

Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics

Abstract

Aims: Large language models (LLMs) such as ChatGPT are increasingly used in academic and clinical writing. While these tools can generate coherent and domain-specific text, concerns persist regarding the accuracy of their automatically generated references. In musculoskeletal rehabilitation—a field heavily reliant on current evidence—the reliability of citations is especially critical. Yet, systematic evaluations of citation accuracy in AI-generated scientific content are lacking. To evaluate the reference accuracy of scientific texts generated by ChatGPT (GPT-4) in response to musculoskeletal rehabilitation prompts, and to determine whether reference accuracy improves following structured post-generation verification. Methods: ChatGPT was prompted to generate four scientific paragraphs on musculoskeletal rehabilitation topics (manual therapy, ACL reconstruction, low back pain, and rotator cuff repair), each including 10 references with DOIs. A total of 40 references were analyzed using a 3-point scoring system (0=fabricated, 1=partially correct, 2=fully accurate), which was used to assess citation quality. After initial evaluation, ChatGPT was asked to verify and revise its references. Scores before and after this step were compared descriptively and with Wilcoxon signed-rank tests to assess statistical significance, and effect sizes (r) were calculated to estimate the magnitude of improvement. Results: Only 7.5% of references were fully accurate in the initial generation, while 42.5% were completely fabricated. The remaining 50% were partially correct. After verification, the proportion of fully accurate references rose to 77.5%. Wilcoxon signed-rank testing confirmed a statistically significant improvement in accuracy across all prompts (W=561.0, p<0.001, r=0.60). The most common errors included invalid DOIs, fabricated article titles, and mismatched metadata. Conclusion: ChatGPT can generate coherent scientific content, but its initial references are frequently inaccurate or fabricated. Structured post-generation verification significantly improves reference accuracy, as confirmed by statistical testing. These findings suggest that LLMs may be integrated as drafting tools in academic and clinical musculoskeletal contexts, but only when accompanied by strict human-led verification of citations.

Keywords

References

  1. Mondal H, Mondal S. ChatGPT in academic writing: maximizing its benefits and minimizing the risks. Indian J Ophthalmol. 2023;71(12): 3600-3606. doi:10.4103/IJO.IJO_718_23
  2. Bom H-SH. Exploring the opportunities and challenges of ChatGPT in academic writing: a roundtable discussion. Nucl Med Mol Imaging. 2023;57(4):165-1677. doi:10.1007/s13139-023-00809-2
  3. Jarrah AM, Wardat Y, Fidalgo P. Using ChatGPT in academic writing is (not) a form of plagiarism: what does the literature say. Online J Commun Media Technol. 2023;13(4):e202346. doi:10.30935/ojcmt/13572
  4. Gruda D. Three ways ChatGPT helps me in my academic writing. Nature. 2024;10:1-6. doi:10.1038/d41586-024-01042-3
  5. Švab I, Klemenc-Ketiš Z, Zupanič S. New challenges in scientific publications: referencing, artificial intelligence and ChatGPT. Slov J Public Health. 2023;62(3):109-112. doi:10.2478/sjph-2023-0015
  6. Jan R. Examining the reliability of ChatGPT: identifying retracted scientific literature and ensuring accurate citations and references. In: Impacts of generative ai on the future of research and education. Hershey, PA: IGI Global; 2025.
  7. Frosolini A, Gennaro P, Cascino F, Gabriele G. In reference to “role of Chat GPT in public health”, to highlight the AI’s incorrect reference generation. Ann Biomed Eng. 2023;51(10):2120-2122. doi:10.1007/s 10439-023-03248-4
  8. Cohen F, Vallimont J, Gelfand AA. Caution regarding fabricated citations from artificial intelligence. Headache. 2024;64(1):133-135. doi: 10.1111/head.14649

Details

Primary Language

English

Subjects

Physiotherapy

Journal Section

Research Article

Publication Date

September 15, 2025

Submission Date

July 19, 2025

Acceptance Date

September 10, 2025

Published in Issue

Year 2025 Volume: 7 Number: 5

APA
Safran, E., & Çalı, A. (2025). Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics. Anatolian Current Medical Journal, 7(5), 695-702. https://doi.org/10.38053/acmj.1746227
AMA
1.Safran E, Çalı A. Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics. Anatolian Curr Med J / ACMJ / acmj. 2025;7(5):695-702. doi:10.38053/acmj.1746227
Chicago
Safran, Ertuğrul, and Adem Çalı. 2025. “Fabricated or Accurate? Ethical Concerns and Citation Hallucination in AI-Generated Scientific Writing on Musculoskeletal Topics”. Anatolian Current Medical Journal 7 (5): 695-702. https://doi.org/10.38053/acmj.1746227.
EndNote
Safran E, Çalı A (September 1, 2025) Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics. Anatolian Current Medical Journal 7 5 695–702.
IEEE
[1]E. Safran and A. Çalı, “Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics”, Anatolian Curr Med J / ACMJ / acmj, vol. 7, no. 5, pp. 695–702, Sept. 2025, doi: 10.38053/acmj.1746227.
ISNAD
Safran, Ertuğrul - Çalı, Adem. “Fabricated or Accurate? Ethical Concerns and Citation Hallucination in AI-Generated Scientific Writing on Musculoskeletal Topics”. Anatolian Current Medical Journal 7/5 (September 1, 2025): 695-702. https://doi.org/10.38053/acmj.1746227.
JAMA
1.Safran E, Çalı A. Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics. Anatolian Curr Med J / ACMJ / acmj. 2025;7:695–702.
MLA
Safran, Ertuğrul, and Adem Çalı. “Fabricated or Accurate? Ethical Concerns and Citation Hallucination in AI-Generated Scientific Writing on Musculoskeletal Topics”. Anatolian Current Medical Journal, vol. 7, no. 5, Sept. 2025, pp. 695-02, doi:10.38053/acmj.1746227.
Vancouver
1.Ertuğrul Safran, Adem Çalı. Fabricated or accurate? Ethical concerns and citation hallucination in aI-generated scientific writing on musculoskeletal topics. Anatolian Curr Med J / ACMJ / acmj. 2025 Sep. 1;7(5):695-702. doi:10.38053/acmj.1746227

 

TR DİZİN ULAKBİM and International Indexes (1b)
 

Interuniversity Board (UAK) Equivalency:  Article published in Ulakbim TR Index journal [10 POINTS], and Article published in other (excuding 1a, b, c) international indexed journal (1d) [5 POINTS]

Note: Our journal is not WOS indexed and therefore is not classified as Q.

You can download Council of Higher Education (CoHG) [Yüksek Öğretim Kurumu (YÖK)] Criteria) decisions about predatory/questionable journals and the author's clarification text and journal charge policy from your browser. https://dergipark.org.tr/tr/journal/3449/file/4924/show

 

Journal Indexes and Platforms: 

TR Dizin ULAKBİM, Google Scholar, Crossref, Worldcat (OCLC), DRJI, EuroPub, OpenAIRE, Turkiye Citation Index, Turk Medline, ROAD, ICI World of Journal's, Index Copernicus, ASOS Index, General Impact Factor, Scilit.


 

The indexes of the journal's are;


 

download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJqb3VybmFsIiwib3JpZ2luYWxuYW1lIjoiVHJfSW5kZXhfbG9nby5wbmciLCJwYXRoIjoiMDFiOS82MmZhLzA3MzMvNjlkZjNlNTdhMmI4ZjkuODYxMzMxMjQucG5nIiwiZXhwIjoxNzc2MjQxNzY3LCJub25jZSI6ImQyMTQ4MjdiNTg1ZjVmMGQwYzAzZTMxNzMwM2QwMThmIn0.RmnGvwR536HdIoKpGO-ApytZ5aRPRT_BFXE2EpGSIqc

asos-index.png
 
f9ab67f.png
 
WorldCat_Logo_H_Color.png
 

 

18596download?token=eyJhdXRoX3JvbGVzIjpbXSwiZW5kcG9pbnQiOiJqb3VybmFsIiwib3JpZ2luYWxuYW1lIjoiT3BlbkFpcmUuanBnIiwicGF0aCI6IjUyMWYvZjljYy8wMDk3LzY5ZGYzZDNiYmVkZGU0LjQzNDM2OTU3LmpwZyIsImV4cCI6MTc3NjI0MTQ4NCwibm9uY2UiOiIwYjgxZDE2NzRiNzhjMWQyOGVmMDM1OTA1MzI5NjdjZiJ9.xeFppR1ubA4i-dHG-u07ht9bQNogFheXQjLyEaP9GgAimages?q=tbn:ANd9GcQgDnBwx0yUPRKuetgIurtELxYERFv20CPAUcPe4jYrrJiwXzac8rGXlzd57gl8iikb1Tk&usqp=CAU

 

84039476_619085835534619_7808805634291269632_n.jpg

 

 

 

The platforms of the journal's are;
 

COPE.jpg
 
images?q=tbn:ANd9GcTbq2FM8NTdXECzlOUCeKQ1dvrISFL-LhxhC7zy1ZQeJk-GGKSx2XkWQvrsHxcfhtfHWxM&usqp=CAUicmje_1_orig.png
 
 
ncbi.png
 
ORCID_logo.pngimages?q=tbn:ANd9GcQlwX77nfpy3Bu9mpMBZa0miWT2sRt2zjAPJKg2V69ODTrjZM1nT1BbhWzTVPsTNKJMZzQ&usqp=CAU
 

 

images?q=tbn:ANd9GcTaWSousoprPWGwE-qxwxGH2y0ByZ_zdLMN-Oq93MsZpBVFOTfxi9uXV7tdr39qvyE-U0I&usqp=CAU
 


 


 

 


 


The indexes/platforms of the journal are;
 

TR Dizin Ulakbim, Crossref (DOI), Google Scholar, EuroPub, Directory of Research Journal İndexing (DRJI), Worldcat (OCLC), OpenAIRE, ASOS Index, ROAD, Turkiye Citation Index, ICI World of Journal's, Index Copernicus, Turk Medline, General Impact Factor, Scilit 
 


Journal articles are evaluated as "Double-Blind Peer Review"

 

All articles published in this journal are licensed under a Creative Commons Attribution 4.0 International License (CC BY NC ND)