Dynamic Portfolio Optimization with Deep Reinforcement Learning: Evidence from Borsa Istanbul
Abstract
In this study, portfolio optimization has been conducted using the reinforcement learning approach, one of the artificial intelligence algorithms. The data is considered for constituents of the BIST30 index, which is the blue-chip index of Borsa Istanbul. The performance of Deep Deterministic Policy Gradient (DDPG), a deep learning algorithm of reinforcement learning, has been tested against the Markowitz mean-variance and equal-weighted portfolios as benchmark models; the BIST30 index itself has also been taken as a benchmark portfolio. This study contributes to the relevant literature in terms of Türkiye as an example of a developing country and the method employed. The study demonstrates the potential of RL approaches that are becoming widespread for portfolio optimization. The obtained results reveal that the portfolio formed with the DDPG approach shows a superior Sharpe ratio portfolio over portfolios obtained with other classical approaches. These findings, while highlighting the potential of RL approaches in practice, emerge as an alternative option for fund managers, especially in a volatile market environment.
Keywords
References
- Aboussalah, A.M. and Lee, C.G. (2020). Continuous control with stacked deep dynamic recurrent reinforcement learning for portfolio optimization. Expert Systems with Applications, 140, 112891. https://doi.org/10.1016/j.eswa.2019.112891
- Almahdi, S. and Yang, S.Y. (2017). An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown. Expert Systems with Applications, 87, 267–279. https://doi.org/10.1016/j.eswa.2017.06.023
- Bai, Y., Gao, Y., Wan, R., Zhang, S. and Song, R. (2025). A review of reinforcement learning in financial applications. Annual Review of Statistics and Its Application, 12(1), 209–232. https://doi.org/10.48550/arXiv.2411.12746
- Bekaert, G. and Harvey, C.R. (2003). Emerging markets finance. Journal of Empirical Finance, 10(1–2), 3–55. https://doi.org/10.1016/S0927-5398(02)00054-3
- Black, F. and Litterman, R. (1990). Asset allocation: Combining investor views with market equilibrium. Journal of Fixed Income, 1(2), 7–18. https://doi.org/10.3905/jfi.1991.408013
- Black, F. and Litterman, R. (1992). Global portfolio optimization. Financial Analysts Journal, 48(5), 28–43. https://doi.org/10.2469/faj.v48.n5.28
- Brockman, G., Cheung, V., Pettersson, L., Schneider, J., Schulman, J., Tang, J. and Zaremba, W. (2016). OpenAI gym. arXiv preprint arXiv:1606.01540. https://doi.org/10.48550/arXiv.1606.01540
- De Prado, M.L. (2016). Building diversified portfolios that outperform out-of-sample. Journal of Portfolio Management, 42(4), 59–69. https://doi.org/10.3905/jpm.2016.42.4.059
Details
Primary Language
English
Subjects
Investment and Portfolio Management
Journal Section
Research Article
Publication Date
March 31, 2026
Submission Date
October 27, 2025
Acceptance Date
March 17, 2026
Published in Issue
Year 2026 Volume: 11 Number: 1