Clustering refers to algorithms to uncover such clusters in unlabeled data. Data points belonging to the same cluster exhibit similar features, whereas data points from different clusters are dissimilar to each other. The identification of such clusters leads to segmentation of data points into a number of distinct groups. In this study it was aimed to classify the 492 Holstein Friesian dairy cattle with determining the optimum number of clusters using the genomic breeding values (GBVs) calculated with 13250 SNPs using GBLUP for milk yield (kg), milk fat (%), milk protein (%), milk lactose (%), and milk dry matter (%). Results showed that the optimum number cluster was determined as two for the genomic breeding values. Determining the most appropriate number of clusters, it provides great convenience in the selection of breeding animals after determining the animals that can provide optimum efficiency in the herd or the animals that need to be eliminated from the existing herd. As a result, it can be said that the k-means method can be used successfully in clustering animals for genomic breeding values, but for this, at first, the optimum number of clusters must be determined.
Ethics committee approval was not required for this study due to there is no experimental study on research material.
Clustering refers to algorithms to uncover such clusters in unlabeled data. Data points belonging to the same cluster exhibit similar features, whereas data points from different clusters are dissimilar to each other. The identification of such clusters leads to segmentation of data points into a number of distinct groups. In this study it was aimed to classify the 492 Holstein Friesian dairy cattle with determining the optimum number of clusters using the genomic breeding values (GBVs) calculated with 13250 SNPs using GBLUP for milk yield (kg), milk fat (%), milk protein (%), milk lactose (%), and milk dry matter (%). Results showed that the optimum number cluster was determined as two for the genomic breeding values. Determining the most appropriate number of clusters, it provides great convenience in the selection of breeding animals after determining the animals that can provide optimum efficiency in the herd or the animals that need to be eliminated from the existing herd. As a result, it can be said that the k-means method can be used successfully in clustering animals for genomic breeding values, but for this, at first, the optimum number of clusters must be determined.
Ethics committee approval was not required for this study due to there is no experimental study on research material.
Primary Language | English |
---|---|
Subjects | Biostatistics |
Journal Section | Research Articles |
Authors | |
Publication Date | January 15, 2025 |
Submission Date | December 15, 2024 |
Acceptance Date | January 8, 2025 |
Published in Issue | Year 2025 Volume: 8 Issue: 1 |