In recent times, astronomy has entered a new era with rapidly growing data sources and advanced observation techniques. The construction of powerful telescopes has enabled the collection of spectral data from millions of celestial objects. However, the increasing number and variety of data have made it challenging to categorize these celestial objects. This study employs machine learning methods to address the fundamental problem of classifying stars, galaxies, and quasars in astronomy. The dataset underwent detailed preprocessing to identify effective features for classification. KNIME Analytics Platform was used for data analysis and visualization, facilitating rapid and efficient data analysis through its drag-and-drop interface. Among the machine learning methods used in our study—Decision Trees, Random Forest, and Naive Bayes—the highest accuracy rate of 97.86% was achieved with the Random Forest model. Notably, despite its lower overall performance compared to other models, the Naive Bayes classifier exhibited superior performance in distinguishing the STAR class, which is one of the study's interesting findings. Future studies aim to enhance model accuracy by using larger and more diverse datasets and exploring different machine learning algorithms. Additionally, the impact of deep learning methods on classification performance will be investigated.
In recent times, astronomy has entered a new era with rapidly growing data sources and advanced observation techniques. The construction of powerful telescopes has enabled the collection of spectral data from millions of celestial objects. However, the increasing number and variety of data have made it challenging to categorize these celestial objects. This study employs machine learning methods to address the fundamental problem of classifying stars, galaxies, and quasars in astronomy. The dataset underwent detailed preprocessing to identify effective features for classification. KNIME Analytics Platform was used for data analysis and visualization, facilitating rapid and efficient data analysis through its drag-and-drop interface. Among the machine learning methods used in our study—Decision Trees, Random Forest, and Naive Bayes—the highest accuracy rate of 97.86% was achieved with the Random Forest model. Notably, despite its lower overall performance compared to other models, the Naive Bayes classifier exhibited superior performance in distinguishing the STAR class, which is one of the study's interesting findings. Future studies aim to enhance model accuracy by using larger and more diverse datasets and exploring different machine learning algorithms. Additionally, the impact of deep learning methods on classification performance will be investigated.
Primary Language | English |
---|---|
Subjects | Communications Engineering (Other) |
Journal Section | Research Articles |
Authors | |
Early Pub Date | September 4, 2024 |
Publication Date | September 15, 2024 |
Submission Date | July 18, 2024 |
Acceptance Date | September 3, 2024 |
Published in Issue | Year 2024 |