Data science has been among the fastest-developing fields in recent years, and it has become essential to the development of institutions. It is the process of analyzing and interpreting data and making decisions based on it. The analysis of data involves various methods, and among them statistics plays an important role. Without statistics, data cannot be analyzed; the arrangement and visualization of data also rely on it. This paper explains the basic statistical methods used to analyze data in data science. The basic terminology is explained first, followed by a discussion of more advanced tools such as hypothesis testing, analysis of variance, the t-test, the F-test, and the chi-square test. The interconnection between data science and statistics is then illustrated with calculations for two tests, the Tukey test and the Dunnett test. Finally, the future development and impact of statistics in data science are discussed.
Primary Language | English |
---|---|
Subjects | Artificial Life and Complex Adaptive Systems |
Journal Section | Reviews |
Authors | |
Publication Date | December 22, 2020 |
Published in Issue | Year 2020 Volume: 3 Issue: 1 |
AI Research and Application Center, Sakarya University of Applied Sciences, Sakarya, Türkiye.