IMPLEMENTATION AND COMPARISON OF REINFORCEMENT LEARNING ALGORITHMS FOR SOLVING THE PROBLEM OF FINDING A PATH IN A 2D MATRIX

Authors

  • Yersultan Amanbay Al-Farabi Kazakh National University
  • Nurtugan Azatbekuly Al-Farabi Kazakh National University
  • Aksultan Mukhanbet Al-Farabi Kazakh National University

DOI:

https://doi.org/10.26577/jpcsit2023v1i4a8

Keywords:

SARSA, Q-Leaning, Reinforcement learning

Abstract

The topic of this paper is the study of two reinforcement learning algorithms, SARSA and Q-Learning. Reinforcement learning is generating significant interest due to its potential applications in various domains such as robotics, gaming, optimization, etc. In addition, reinforcement learning is an interesting object of research from the perspective of theory and practice, as it is related to concepts such as exploration and use, learning and planning, consistency, and stabilization, etc. SARSA and Q-Learning are two of the most well-known and widely used reinforcement learning algorithms, which are based on the evaluation of the value function of states and actions. The aim of this paper is to study the learning characteristics of these algorithms in different scenarios of agent's interaction with the environment. To this end, experiments were conducted in which the agent had to find an optimal path in a 2D matrix containing walls to reach the final position safely. The results showed that SARSA was on average 28.3% faster than Q-Learning.

Downloads

Download data is not yet available.
        150 31

Downloads

How to Cite

Amanbay, Y., Azatbekuly, N., & Mukhanbet, A. (2023). IMPLEMENTATION AND COMPARISON OF REINFORCEMENT LEARNING ALGORITHMS FOR SOLVING THE PROBLEM OF FINDING A PATH IN A 2D MATRIX. Journal of Problems in Computer Science and Information Technologies, 1(4). https://doi.org/10.26577/jpcsit2023v1i4a8