Araştırma Makalesi

Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation

Cilt: 37 Sayı: 1 25 Mart 2025
PDF İndir
EN TR

Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation

Öz

This research uses deep reinforcement learning techniques, notably the AlphaZero algorithm, to construct an artificial intelligence system that can play Pawn Dama at a level that surpasses human players. Pawn dama, a simplified variant of Dama, is a perfect platform to explore AI's ability to think strategically and make decisions. The primary goal is to develop an AI that can use self-play to develop sophisticated strategies and comprehend the game's dynamics and regulations. The project incorporates MCTS to improve decision-making during games and uses a Convolutional Neural Network (CNN) to enhance the AI's learning capabilities. Creating an intuitive graphical user interface, putting the reinforcement learning algorithm into practice, and testing the system against real players are steps in the development process. The accomplishment of this project will contribute to the field of strategic game AI research by providing insights that may be applied to other domains and spurring further advancements in AI-driven game strategies.

Anahtar Kelimeler

Teşekkür

araştırmalarımız sırasında bize bir çatı sağlayan marmara ünv. bilg müh. bölümüne teşekkür ederiz

Kaynakça

  1. Shannon, C.E. (1950). Programming a Computer for Playing Chess. Philosophical Magazine, 41(314), 256-275.
  2. Knuth, D.E., & Moore, R.W. (1975). An Analysis of Alpha-Beta Pruning. Artificial Intelligence, 6(4), 293-326.
  3. Newborn, M. (1997). Kasparov versus Deep Blue: Computer Chess Comes of Age. Springer.
  4. Schaeffer, J. (1997). One Jump Ahead: Challenging Human Supremacy in Checkers. Springer-Verlag.
  5. Tesauro, G. (1995). Temporal Difference Learning and TD-Gammon. Communications of the ACM, 38(3), 58-68.
  6. Samuel, A.L. (1959). Some studies in machine learning using the game of checkers. IBM Journal of Research and Development, 3(3), 210-229.
  7. Coulom, R. (2006). Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search. In Proceedings of the 5th International Conference on Computers and Games (pp. 72-83).
  8. Kocsis, L., & Szepesvári, C. (2006). Bandit Based Monte-Carlo Planning. Machine Learning, 282, 282-293.

Ayrıntılar

Birincil Dil

İngilizce

Konular

Yapay Zeka (Diğer)

Bölüm

Araştırma Makalesi

Erken Görünüm Tarihi

19 Mart 2025

Yayımlanma Tarihi

25 Mart 2025

Gönderilme Tarihi

15 Ocak 2025

Kabul Tarihi

20 Şubat 2025

Yayımlandığı Sayı

Yıl 2025 Cilt: 37 Sayı: 1

Kaynak Göster

APA
Baran, M. K., Pehlivanlar, E., Güleç, C., Gönül, A., & Şeramet, M. (2025). Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation. International Journal of Advances in Engineering and Pure Sciences, 37(1), 27-35. https://doi.org/10.7240/jeps.1620319
AMA
1.Baran MK, Pehlivanlar E, Güleç C, Gönül A, Şeramet M. Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation. JEPS. 2025;37(1):27-35. doi:10.7240/jeps.1620319
Chicago
Baran, Mehmet Kadir, Erdem Pehlivanlar, Cem Güleç, Alperen Gönül, ve Muhammet Şeramet. 2025. “Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation”. International Journal of Advances in Engineering and Pure Sciences 37 (1): 27-35. https://doi.org/10.7240/jeps.1620319.
EndNote
Baran MK, Pehlivanlar E, Güleç C, Gönül A, Şeramet M (01 Mart 2025) Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation. International Journal of Advances in Engineering and Pure Sciences 37 1 27–35.
IEEE
[1]M. K. Baran, E. Pehlivanlar, C. Güleç, A. Gönül, ve M. Şeramet, “Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation”, JEPS, c. 37, sy 1, ss. 27–35, Mar. 2025, doi: 10.7240/jeps.1620319.
ISNAD
Baran, Mehmet Kadir - Pehlivanlar, Erdem - Güleç, Cem - Gönül, Alperen - Şeramet, Muhammet. “Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation”. International Journal of Advances in Engineering and Pure Sciences 37/1 (01 Mart 2025): 27-35. https://doi.org/10.7240/jeps.1620319.
JAMA
1.Baran MK, Pehlivanlar E, Güleç C, Gönül A, Şeramet M. Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation. JEPS. 2025;37:27–35.
MLA
Baran, Mehmet Kadir, vd. “Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation”. International Journal of Advances in Engineering and Pure Sciences, c. 37, sy 1, Mart 2025, ss. 27-35, doi:10.7240/jeps.1620319.
Vancouver
1.Mehmet Kadir Baran, Erdem Pehlivanlar, Cem Güleç, Alperen Gönül, Muhammet Şeramet. Adapting the AlphaZero Algorithm to Pawn Dama: Implementation, Training, and Performance Evaluation. JEPS. 01 Mart 2025;37(1):27-35. doi:10.7240/jeps.1620319