Research Article
BibTex RIS Cite

K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values

Year 2025, Volume: 8 Issue: 1, 263 - 267, 15.01.2025
https://doi.org/10.34248/bsengineering.1601851

Abstract

Clustering refers to algorithms to uncover such clusters in unlabeled data. Data points belonging to the same cluster exhibit similar features, whereas data points from different clusters are dissimilar to each other. The identification of such clusters leads to segmentation of data points into a number of distinct groups. In this study it was aimed to classify the 492 Holstein Friesian dairy cattle with determining the optimum number of clusters using the genomic breeding values (GBVs) calculated with 13250 SNPs using GBLUP for milk yield (kg), milk fat (%), milk protein (%), milk lactose (%), and milk dry matter (%). Results showed that the optimum number cluster was determined as two for the genomic breeding values. Determining the most appropriate number of clusters, it provides great convenience in the selection of breeding animals after determining the animals that can provide optimum efficiency in the herd or the animals that need to be eliminated from the existing herd. As a result, it can be said that the k-means method can be used successfully in clustering animals for genomic breeding values, but for this, at first, the optimum number of clusters must be determined.

Ethical Statement

Ethics committee approval was not required for this study due to there is no experimental study on research material.

References

  • Cebeci Z, Yıldız F, Kayaalp GT. 2015. Choosing an optimal k in k-means clustering. 2. Ulusal Yönetim Bilişim Sistemleri Kongresi, October 8-10, Erzurum, Türkiye, pp: 231-242.
  • Çolak B, Durdağ Z, Erdoğmuş P. 2015. Automatic clustering with k-means. El-Cezeri J Sci Eng, 3(2): 315-323.
  • Doğan İ. 2002. Selection by Cluster Analysis. Turk J Vet Anim Sci, 26: 47-53.
  • Frades I, Matthiesen R. 2010. Overview on techniques in cluster analysis. In: Matthiesen R (eds) Bioinformatics Methods in Clinical Research. Methods in Molecular Biology, vol 593. Humana Press. https://doi.org/10.1007/978-1-60327-194-3_5
  • Janos T, Natasa F, Marton S. 2021. Determining the type of Limousin candidate bulls by cluster analysis. Nat Resour Sust Devel, 11(1): 113-120.
  • Immink KAS, Cai K, Weber JH. 2018. Dynamic threshold detection based on Pearson distance detection. IEEE Transact Commun, 66(7): 2958-2965.
  • Kodinariya TM, Makwana PR. 2013. Review on determining number of cluster in k-means clustering. Int J Adv Res Comput Sci Manag Stud, 1(6): 90-95.
  • Kurnaz B, Önder H. 2021. Distance based regression models. II. International Applied Statistics Conference, June 29 – July 2, Tokat, Türkiye, pp: 120-126.
  • Na S, Xumin L, Yong G. 2010. Research on k-means clustering algorithm: An improved k-means clustering algorithm. Third International Symposium on Intelligent Information Technology and Security Informatics, April 22, Jian, China, pp: 63-67.
  • Önder H, Sitskowska B, Kurnaz B, Piwczynski D, Kolenda M, Sen U, Tırınk C, Çanga Boğa D. 2023. Multi-trait single-step genomic prediction for milk yield and milk components for Polish Holstein population. Animals, 13: 3070. https://doi.org/10.3390/ani13193070

K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values

Year 2025, Volume: 8 Issue: 1, 263 - 267, 15.01.2025
https://doi.org/10.34248/bsengineering.1601851

Abstract

Clustering refers to algorithms to uncover such clusters in unlabeled data. Data points belonging to the same cluster exhibit similar features, whereas data points from different clusters are dissimilar to each other. The identification of such clusters leads to segmentation of data points into a number of distinct groups. In this study it was aimed to classify the 492 Holstein Friesian dairy cattle with determining the optimum number of clusters using the genomic breeding values (GBVs) calculated with 13250 SNPs using GBLUP for milk yield (kg), milk fat (%), milk protein (%), milk lactose (%), and milk dry matter (%). Results showed that the optimum number cluster was determined as two for the genomic breeding values. Determining the most appropriate number of clusters, it provides great convenience in the selection of breeding animals after determining the animals that can provide optimum efficiency in the herd or the animals that need to be eliminated from the existing herd. As a result, it can be said that the k-means method can be used successfully in clustering animals for genomic breeding values, but for this, at first, the optimum number of clusters must be determined.

Ethical Statement

Ethics committee approval was not required for this study due to there is no experimental study on research material.

References

  • Cebeci Z, Yıldız F, Kayaalp GT. 2015. Choosing an optimal k in k-means clustering. 2. Ulusal Yönetim Bilişim Sistemleri Kongresi, October 8-10, Erzurum, Türkiye, pp: 231-242.
  • Çolak B, Durdağ Z, Erdoğmuş P. 2015. Automatic clustering with k-means. El-Cezeri J Sci Eng, 3(2): 315-323.
  • Doğan İ. 2002. Selection by Cluster Analysis. Turk J Vet Anim Sci, 26: 47-53.
  • Frades I, Matthiesen R. 2010. Overview on techniques in cluster analysis. In: Matthiesen R (eds) Bioinformatics Methods in Clinical Research. Methods in Molecular Biology, vol 593. Humana Press. https://doi.org/10.1007/978-1-60327-194-3_5
  • Janos T, Natasa F, Marton S. 2021. Determining the type of Limousin candidate bulls by cluster analysis. Nat Resour Sust Devel, 11(1): 113-120.
  • Immink KAS, Cai K, Weber JH. 2018. Dynamic threshold detection based on Pearson distance detection. IEEE Transact Commun, 66(7): 2958-2965.
  • Kodinariya TM, Makwana PR. 2013. Review on determining number of cluster in k-means clustering. Int J Adv Res Comput Sci Manag Stud, 1(6): 90-95.
  • Kurnaz B, Önder H. 2021. Distance based regression models. II. International Applied Statistics Conference, June 29 – July 2, Tokat, Türkiye, pp: 120-126.
  • Na S, Xumin L, Yong G. 2010. Research on k-means clustering algorithm: An improved k-means clustering algorithm. Third International Symposium on Intelligent Information Technology and Security Informatics, April 22, Jian, China, pp: 63-67.
  • Önder H, Sitskowska B, Kurnaz B, Piwczynski D, Kolenda M, Sen U, Tırınk C, Çanga Boğa D. 2023. Multi-trait single-step genomic prediction for milk yield and milk components for Polish Holstein population. Animals, 13: 3070. https://doi.org/10.3390/ani13193070
There are 10 citations in total.

Details

Primary Language English
Subjects Biostatistics
Journal Section Research Articles
Authors

Buğra Hoşgönül 0009-0002-9548-3457

Hasan Önder 0000-0002-8404-8700

Publication Date January 15, 2025
Submission Date December 15, 2024
Acceptance Date January 8, 2025
Published in Issue Year 2025 Volume: 8 Issue: 1

Cite

APA Hoşgönül, B., & Önder, H. (2025). K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values. Black Sea Journal of Engineering and Science, 8(1), 263-267. https://doi.org/10.34248/bsengineering.1601851
AMA Hoşgönül B, Önder H. K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values. BSJ Eng. Sci. January 2025;8(1):263-267. doi:10.34248/bsengineering.1601851
Chicago Hoşgönül, Buğra, and Hasan Önder. “K-Mean Clustering of Holstein Friesian Dairy Cattle Using Genomic Breeding Values”. Black Sea Journal of Engineering and Science 8, no. 1 (January 2025): 263-67. https://doi.org/10.34248/bsengineering.1601851.
EndNote Hoşgönül B, Önder H (January 1, 2025) K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values. Black Sea Journal of Engineering and Science 8 1 263–267.
IEEE B. Hoşgönül and H. Önder, “K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values”, BSJ Eng. Sci., vol. 8, no. 1, pp. 263–267, 2025, doi: 10.34248/bsengineering.1601851.
ISNAD Hoşgönül, Buğra - Önder, Hasan. “K-Mean Clustering of Holstein Friesian Dairy Cattle Using Genomic Breeding Values”. Black Sea Journal of Engineering and Science 8/1 (January 2025), 263-267. https://doi.org/10.34248/bsengineering.1601851.
JAMA Hoşgönül B, Önder H. K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values. BSJ Eng. Sci. 2025;8:263–267.
MLA Hoşgönül, Buğra and Hasan Önder. “K-Mean Clustering of Holstein Friesian Dairy Cattle Using Genomic Breeding Values”. Black Sea Journal of Engineering and Science, vol. 8, no. 1, 2025, pp. 263-7, doi:10.34248/bsengineering.1601851.
Vancouver Hoşgönül B, Önder H. K-Mean Clustering of Holstein Friesian Dairy Cattle using Genomic Breeding Values. BSJ Eng. Sci. 2025;8(1):263-7.

                                                24890