Research Article
BibTex RIS Cite

Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform

Year 2026, Volume: 15 Issue: 1 , 188 - 200 , 24.03.2026
https://doi.org/10.17798/bitlisfen.1787094
https://izlik.org/JA44CN52XP

Abstract

Exploratory Data Analysis (EDA), recognized as the initial and most critical phase of the data science workflow, plays a fundamental role in understanding the structure of datasets, performing data cleaning, and preparing data for subsequent modeling tasks. This study introduces an interactive EDA platform developed with the R programming language and the Shiny framework. The platform allows users to upload datasets and conduct essential statistical analyses and visualizations, while additionally incorporating large language models (LLMs), such as the OpenAI GPT-4-turbo model, to automatically generate explanatory insights and interpretative commentary regarding the data. By complementing traditional statistical evaluations with language model–driven perspectives, the proposed approach enriches the analytical process by enhancing user intuition and interpretive depth. The system was evaluated using sample datasets, through which both conventional EDA outputs and LLM-assisted interpretations were demonstrated. The findings suggest that the integration of LLMs within Shiny applications holds considerable potential to advance data science education, decision support systems, and automated reporting practices.

Ethical Statement

The study is complied with research and publication ethics.

Supporting Institution

TÜBİTAK

Thanks

This study was supported by the TÜBİTAK 2209/A Undergraduate Research Projects Support Program. We sincerely thank TÜBİTAK for their valuable contributions and support to our project titled "Package Development for Exploratory Data Analysis: RSEDA." The financial support provided by TÜBİTAK played a crucial role in the successful completion of this study. We hope that the results obtained will contribute to the effective use of machine learning methods in areas such as water consumption forecasting. We also extend our gratitude to the project team members, Büşra İLGEN, Gülnur PARLAK, and Miraç Can YILMAZ, for their valuable contributions throughout the project.

References

  • C. J. M. Van Steenderen, G. F. Sutton, C. A. Owen, G. D. Martin, and J. A. Coetzee, “Sample size assessments for thermal physiology studies: An R package and R Shiny application,” Physiological Entomology, vol. 48, no. 4, pp. 141–149, 2023.
  • E. Gefenas, J. Lekstutiene, V. Lukaseviciene, et al., “Controversies between regulations of research ethics and protection of personal data: Informed consent at a cross-road,” Medicine, Health Care and Philosophy, vol. 25, pp. 23–30, 2022.
  • P. Hendricks, Anonymizer: Anonymize Data Containing Personally Identifiable Information, R package version 0.2.0, 2015. [Online]. Available: https://github.com/paulhendricks/anonymizer (accessed Oct. 2023).
  • N. Kaur and S. Sodhi, “Data encryption standard algorithm (DES) for secure data transmission,” in Proc. Int. Conf. Advances in Emerging Technology (ICAET), 2016.
  • L. Jia, W. Yao, Y. Jiang, Y. Li, Z. Wang, H. Li, et al., “Development of interactive biological web applications with R/Shiny,” Briefings in Bioinformatics, vol. 23, no. 1, Art. no. bbab415, 2022.
  • M. Boukhlif, M. Hanine, and N. Kharmoum, “A decade of intelligent software testing research: A bibliometric analysis,” Electronics, vol. 12, no. 9, Art. no. 2109, 2023.
  • Y. Chang, X. Wang, J. Wang, Y. Wu, L. Yang, K. Zhu, et al., “A survey on evaluation of large language models,” ACM Transactions on Intelligent Systems and Technology, vol. 15, no. 3, 2024. [Online]. Available: https://doi.org/10.1145/3641289 (accessed 2024).
  • Z. Chen, L. Xu, H. Zheng, L. Chen, A. Tolba, L. Zhao, et al., “Evolution and prospects of foundation models: From large language models to large multimodal models,” Computers, Materials & Continua, vol. 80, no. 2, pp. 1753–1808, 2024. [Online]. Available: https://doi.org/10.32604/cmc.2024.052618 (accessed 2024).
  • J. Cheng, “Applications of large language models in pathology,” Bioengineering, vol. 11, no. 4, 2024. [Online]. Available: https://doi.org/10.3390/bioengineering11040342 (accessed 2024).
  • X. Hou, Y. Zhao, Y. Liu, Z. Yang, K. Wang, L. Li, et al., “Large language models for software engineering: A systematic literature review,” arXiv preprint, arXiv:2308.10620, 2023. [Online]. Available: http://arxiv.org/abs/2308.10620 (accessed 2023).
  • D. Huang, C. Yan, Q. Li, and X. Peng, “From large language models to large multimodal models: A literature review,” Applied Sciences, vol. 14, no. 12, 2024. [Online]. Available: https://doi.org/10.3390/app14125068 (accessed 2024).
  • R. A. Husein, H. Aburajouh, and C. Catal, “Large language models for code completion: A systematic literature review,” Computer Standards & Interfaces, vol. 92, Art. no. 103917, 2025. [Online]. Available: https://doi.org/10.1016/j.csi.2024.103917 (accessed 2025).
  • S. Yin, C. Fu, S. Zhao, K. Li, X. Sun, T. Xu, and E. Chen, “A survey on multimodal large language models,” National Science Review, 2023. [Online]. Available: https://doi.org/10.1093/nsr/nwae403 (accessed 2023).
  • V. Sorin, D. Brin, Y. Barash, E. Konen, A. Charney, G. Nadkarni, and E. Klang, “Large language models and empathy: Systematic review,” Journal of Medical Internet Research, vol. 26, 2024. [Online]. Available: https://doi.org/10.2196/52597 (accessed 2024).
  • A. Telenti, M. Auli, B. L. Hie, C. Maher, S. Saria, and J. P. A. Ioannidis, “Large language models for science and medicine,” European Journal of Clinical Investigation, vol. 54, no. 6, 2024. [Online]. Available: https://doi.org/10.1111/eci.14183 (accessed 2024).
  • O. A. Sarumi and D. Heider, “Large language models and their applications in bioinformatics,” Computational and Structural Biotechnology Journal, vol. 23, pp. 3498–3505, 2024. [Online]. Available: https://doi.org/10.1016/j.csbj.2024.09.031 (accessed 2024).
  • W. Zhang and J. Zhang, “Hallucination mitigation for retrieval-augmented large language models: A review,” Mathematics, vol. 13, no. 5, 2025. [Online]. Available: https://doi.org/10.3390/math13050856 (accessed 2025).
  • H. Y. I. Lam, X. E. Ong, and M. Mutwil, “Large language models in plant biology,” Trends in Plant Science, 2024. [Online]. Available: https://doi.org/10.1016/j.tplants.2024.04.013 (accessed 2024).
  • Z. Zheng, N. Rampal, T. J. Inizan, C. Borgs, J. T. Chayes, and O. M. Yaghi, “Large language models for reticular chemistry,” Nature Reviews Materials, 2025. [Online]. Available: https://doi.org/10.1038/s41578-025-00772-8 (accessed 2025).
  • D. Xu, W. Chen, W. Peng, C. Zhang, T. Xu, X. Zhao, et al., “Large language models for generative information extraction: A survey,” Frontiers of Computer Science, vol. 18, no. 6, Art. no. 186357, 2024. [Online]. Available: https://doi.org/10.1007/s11704-024-40555-y (accessed 2024).
  • S. Nerella, S. Bandyopadhyay, J. Zhang, M. Contreras, S. Siegel, A. Bumin, et al., “Transformers and large language models in healthcare: A review,” Artificial Intelligence in Medicine, vol. 154, Art. no. 102900, 2024. [Online]. Available: https://doi.org/10.1016/j.artmed.2024.102900 (accessed 2024).
There are 21 citations in total.

Details

Primary Language English
Subjects Natural Language Processing, Artificial Intelligence (Other)
Journal Section Research Article
Authors

Ahmet Albayrak 0000-0002-2166-1102

Muammer Albayrak 0000-0002-5946-6310

Metin Kaynaklı 0000-0001-8372-1345

Submission Date September 19, 2025
Acceptance Date December 12, 2025
Publication Date March 24, 2026
DOI https://doi.org/10.17798/bitlisfen.1787094
IZ https://izlik.org/JA44CN52XP
Published in Issue Year 2026 Volume: 15 Issue: 1

Cite

IEEE [1]A. Albayrak, M. Albayrak, and M. Kaynaklı, “Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform”, Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, vol. 15, no. 1, pp. 188–200, Mar. 2026, doi: 10.17798/bitlisfen.1787094.

Bitlis Eren University
Journal of Science Editor
Bitlis Eren University Graduate Institute
Bes Minare Mah. Ahmet Eren Bulvari, Merkez Kampus, 13000 BITLIS