Review

Learning molecular machines by machine learning

Volume: 6 Number: 2 July 30, 2025

Abstract

Proteins, often referred to as molecular machines, are essential biomolecules that perform a wide range of cellular functions, typically by forming complexes. Understanding their three-dimensional (3D) structures is key to deciphering their functions. However, a significant gap exists between the vast number of known protein sequences and the relatively limited number of experimentally determined protein structures. Unraveling the mechanisms of protein folding remains a central challenge in understanding the sequence-structure/dynamics-function relationship. In recent years, machine learning (ML) has become a transformative tool across many scientific fields, and structural biology is no exception. Protein science has benefited substantially from advances in artificial intelligence (AI), as numerous ML-based methods have emerged for modeling the structures of both individual proteins and their complexes. Recent breakthroughs in ML have marked a major leap forward in tackling the protein folding problem. ML-based AI algorithms for protein structure prediction, most notably AlphaFold, use protein sequence information to predict the 3D structures of monomers and multimeric protein complexes with unprecedented accuracy. Following the success of AlphaFold, recognized with the 2024 Nobel Prize in Chemistry, researchers worldwide have intensified efforts to leverage AI for complex biological challenges ranging from drug discovery to protein-protein interactions. This review highlights ML-based approaches, with a primary focus on AlphaFold and its derivatives, while also covering other notable methods such as the hybrid deep-learning-based RoseTTAFold and the protein language model-based ESMFold. These tools have diverse applications in protein structure modeling and significantly advance our understanding of the intricate relationships between sequence, structure, dynamics, and function.
While ML-based methods still face limitations in certain cases (such as membrane proteins, which are underrepresented in experimental structural databases, or antibody–antigen interactions, which involve highly diverse and difficult-to-model hypervariable regions), advances in computational techniques and the incorporation of new experimental data are steadily improving the accuracy of these algorithms in tackling such challenges. Overall, the implementation of ML in the study of molecular machines represents a promising direction, with the potential to bridge the sequence-structure gap and address longstanding questions in structural biology and medicine.

Details

Primary Language

English

Subjects

Protein Engineering, Bioengineering (Other)

Journal Section

Review

Publication Date

July 30, 2025

Submission Date

January 15, 2025

Acceptance Date

June 13, 2025

Published in Issue

Year 2025 Volume: 6 Number: 2

APA
Çelik, R. H., İşcil, H. A. O., Bulut, E., & Acuner, S. E. (2025). Learning molecular machines by machine learning. Eurasian Journal of Science Engineering and Technology, 6(2), 100-120. https://doi.org/10.55696/ejset.1620495
Chicago
Çelik, Rumeysa Hilal, Hacı Aslan Onur İşcil, Ecem Bulut, and Saliha Ece Acuner. 2025. “Learning Molecular Machines by Machine Learning”. Eurasian Journal of Science Engineering and Technology 6 (2): 100-120. https://doi.org/10.55696/ejset.1620495.