microarray breast cancer metastasis machine learning feature selection maximum relevance - minimum redundancy (MRMR)
Aim: We aim to predict metastasis in breast cancer patients with tree-based conventional machine learning algorithms and to observe which feature selection methods is more effective in machine learning methods related to microarray breast cancer data reducing the number of features.
Material and Methods: Feature selection methods, least squares absolute shrinkage (LASSO), Boruta and maximum relevance-minimum redundancy (MRMR) and statistical preprocessing steps were first applied before the tree-based learning conventional machine learning methods like Decision-tree, Extremely randomized trees and Gradient Boosting Tree applied on the microarray breast cancer data.
Results: Microarray data with 54675 features (202 (101/101 breast cancer patients with/without metastases)) was first reduced to 235 features, then the feature selection algorithms were applied and the most important features were found with tree-based machine learning algorithms. It was observed that the highest recall and F-measure values were obtained from the XGBoost method and the highest precision value was received from the Extra-tree method. The 10 arrays out of 54675 with the highest variable importance were listed.
Conclusion: The most accurate results were obtained from the statistical preprocessed data for the XGBoost and Extra-trees machine learning algorithms. Statistical and microarray preprocessing steps would be enough in machine learning analysis of microarray data in breast cancer metastases predictions.
Microarray breast cancer metastasis machine learning feature selection
Birincil Dil | İngilizce |
---|---|
Konular | İç Hastalıkları |
Bölüm | Özgün Makaleler |
Yazarlar | |
Erken Görünüm Tarihi | 15 Mayıs 2023 |
Yayımlanma Tarihi | 15 Mayıs 2023 |
Kabul Tarihi | 4 Ocak 2023 |
Yayımlandığı Sayı | Yıl 2023 Cilt: 5 Sayı: 2 |
Chief Editors
Assoc. Prof. Zülal Öner
Address: İzmir Bakırçay University, Department of Anatomy, İzmir, Turkey
Assoc. Prof. Deniz Şenol
Address: Düzce University, Department of Anatomy, Düzce, Turkey
E-mail: medrecsjournal@gmail.com
Publisher:
Medical Records Association (Tıbbi Kayıtlar Derneği)
Address: Orhangazi Neighborhood, 440th Street,
Green Life Complex, Block B, Floor 3, No. 69
Düzce, Türkiye
Web: www.tibbikayitlar.org.tr
Publication Support:
Effect Publishing & Agency
Phone: + 90 (540) 035 44 35
E-mail: info@effectpublishing.com
Address: Akdeniz Neighborhood, Şehit Fethi Bey Street,
No: 66/B, Ground floor, 35210 Konak/İzmir, Türkiye
web: www.effectpublishing.com