Research Article

Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform

Volume: 15 Number: 1 March 24, 2026

Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform

Abstract

Exploratory Data Analysis (EDA), recognized as the initial and most critical phase of the data science workflow, plays a fundamental role in understanding the structure of datasets, performing data cleaning, and preparing data for subsequent modeling tasks. This study introduces an interactive EDA platform developed with the R programming language and the Shiny framework. The platform allows users to upload datasets and conduct essential statistical analyses and visualizations, while additionally incorporating large language models (LLMs), such as the OpenAI GPT-4-turbo model, to automatically generate explanatory insights and interpretative commentary regarding the data. By complementing traditional statistical evaluations with language model–driven perspectives, the proposed approach enriches the analytical process by enhancing user intuition and interpretive depth. The system was evaluated using sample datasets, through which both conventional EDA outputs and LLM-assisted interpretations were demonstrated. The findings suggest that the integration of LLMs within Shiny applications holds considerable potential to advance data science education, decision support systems, and automated reporting practices.

Keywords

Supporting Institution

TÜBİTAK

Ethical Statement

The study is complied with research and publication ethics.

Thanks

This study was supported by the TÜBİTAK 2209/A Undergraduate Research Projects Support Program. We sincerely thank TÜBİTAK for their valuable contributions and support to our project titled "Package Development for Exploratory Data Analysis: RSEDA." The financial support provided by TÜBİTAK played a crucial role in the successful completion of this study. We hope that the results obtained will contribute to the effective use of machine learning methods in areas such as water consumption forecasting. We also extend our gratitude to the project team members, Büşra İLGEN, Gülnur PARLAK, and Miraç Can YILMAZ, for their valuable contributions throughout the project.

References

  1. C. J. M. Van Steenderen, G. F. Sutton, C. A. Owen, G. D. Martin, and J. A. Coetzee, “Sample size assessments for thermal physiology studies: An R package and R Shiny application,” Physiological Entomology, vol. 48, no. 4, pp. 141–149, 2023.
  2. E. Gefenas, J. Lekstutiene, V. Lukaseviciene, et al., “Controversies between regulations of research ethics and protection of personal data: Informed consent at a cross-road,” Medicine, Health Care and Philosophy, vol. 25, pp. 23–30, 2022.
  3. P. Hendricks, Anonymizer: Anonymize Data Containing Personally Identifiable Information, R package version 0.2.0, 2015. [Online]. Available: https://github.com/paulhendricks/anonymizer (accessed Oct. 2023).
  4. N. Kaur and S. Sodhi, “Data encryption standard algorithm (DES) for secure data transmission,” in Proc. Int. Conf. Advances in Emerging Technology (ICAET), 2016.
  5. L. Jia, W. Yao, Y. Jiang, Y. Li, Z. Wang, H. Li, et al., “Development of interactive biological web applications with R/Shiny,” Briefings in Bioinformatics, vol. 23, no. 1, Art. no. bbab415, 2022.
  6. M. Boukhlif, M. Hanine, and N. Kharmoum, “A decade of intelligent software testing research: A bibliometric analysis,” Electronics, vol. 12, no. 9, Art. no. 2109, 2023.
  7. Y. Chang, X. Wang, J. Wang, Y. Wu, L. Yang, K. Zhu, et al., “A survey on evaluation of large language models,” ACM Transactions on Intelligent Systems and Technology, vol. 15, no. 3, 2024. [Online]. Available: https://doi.org/10.1145/3641289 (accessed 2024).
  8. Z. Chen, L. Xu, H. Zheng, L. Chen, A. Tolba, L. Zhao, et al., “Evolution and prospects of foundation models: From large language models to large multimodal models,” Computers, Materials & Continua, vol. 80, no. 2, pp. 1753–1808, 2024. [Online]. Available: https://doi.org/10.32604/cmc.2024.052618 (accessed 2024).

Details

Primary Language

English

Subjects

Natural Language Processing, Artificial Intelligence (Other)

Journal Section

Research Article

Publication Date

March 24, 2026

Submission Date

September 19, 2025

Acceptance Date

December 12, 2025

Published in Issue

Year 2026 Volume: 15 Number: 1

APA
Albayrak, A., Albayrak, M., & Kaynaklı, M. (2026). Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, 15(1), 188-200. https://doi.org/10.17798/bitlisfen.1787094
AMA
1.Albayrak A, Albayrak M, Kaynaklı M. Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi. 2026;15(1):188-200. doi:10.17798/bitlisfen.1787094
Chicago
Albayrak, Ahmet, Muammer Albayrak, and Metin Kaynaklı. 2026. “Interactive Exploratory Data Analysis With R and Shiny: An LLM-Supported Explanation and Prediction Platform”. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi 15 (1): 188-200. https://doi.org/10.17798/bitlisfen.1787094.
EndNote
Albayrak A, Albayrak M, Kaynaklı M (March 1, 2026) Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi 15 1 188–200.
IEEE
[1]A. Albayrak, M. Albayrak, and M. Kaynaklı, “Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform”, Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, vol. 15, no. 1, pp. 188–200, Mar. 2026, doi: 10.17798/bitlisfen.1787094.
ISNAD
Albayrak, Ahmet - Albayrak, Muammer - Kaynaklı, Metin. “Interactive Exploratory Data Analysis With R and Shiny: An LLM-Supported Explanation and Prediction Platform”. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi 15/1 (March 1, 2026): 188-200. https://doi.org/10.17798/bitlisfen.1787094.
JAMA
1.Albayrak A, Albayrak M, Kaynaklı M. Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi. 2026;15:188–200.
MLA
Albayrak, Ahmet, et al. “Interactive Exploratory Data Analysis With R and Shiny: An LLM-Supported Explanation and Prediction Platform”. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, vol. 15, no. 1, Mar. 2026, pp. 188-00, doi:10.17798/bitlisfen.1787094.
Vancouver
1.Ahmet Albayrak, Muammer Albayrak, Metin Kaynaklı. Interactive Exploratory Data Analysis with R and Shiny: An LLM-Supported Explanation and Prediction Platform. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi. 2026 Mar. 1;15(1):188-200. doi:10.17798/bitlisfen.1787094

Bitlis Eren University

Journal of Science Editor

Bitlis Eren University Graduate Institute

Bes Minare Mah. Ahmet Eren Bulvari, Merkez Kampus, 13000 BITLIS

E-mail: fbe@beu.edu.tr