A Deep Reinforcement Learning Agent for Snake Game

Md Meem Hossain; Akinwumi Fakokunde; Omololu Isaac Olaolu

doi:10.36079/lamintang.ijai-01002.565

A Deep Reinforcement Learning Agent for Snake Game

Md Meem Hossain
Akinwumi Fakokunde
Omololu Isaac Olaolu

DOI: https://doi.org/10.36079/lamintang.ijai-01002.565

Keywords: Deep Reinforcement Learning, Neural Network, Snake Game

Abstract

After watching AlphaGo a Netflix documentary which presents how AlphaGo is an AI computer game developed by deep-mind technologies based on deep reinforcement learning (DRL). Since then, my interest in reinforcement learning has been growing. In this project, I will apply reinforcement learning to develop an agent to play snake game. Where Deep learning will implement a neural Network to help the agent (snake) to learn what action must take to get a state. If we describe deep reinforcement learning (DRL) model where agent interacts with an environment and chooses an action. Based on action, agents receive feedback from the environment as states (or perceives) and rewards. A state = an array with 11 input values, each input values represent a neural network that provides an output of 3 values, each one represents three possible actions the agent (snake) can take (Straight, Right Turn and Left Turn).

Downloads

Download data is not yet available.

Author Biographies

Md Meem Hossain

School of Computing, Engineering & Digital Technologies, Teesside University. Middleesbrough, Tees Valley, United Kingdom.

Akinwumi Fakokunde

School of Computing, Engineering & Digital Technologies, Teesside University. Middleesbrough, Tees Valley, United Kingdom.

Omololu Isaac Olaolu

School of Computing, Engineering & Digital Technologies, Teesside University. Middleesbrough, Tees Valley, United Kingdom.

This is an open access article, licensed under CC-BY-SA

Published

2023-12-27

Downloads : 293

How to Cite

[1]

M. M. Hossain, A. Fakokunde, and O. I. Olaolu, “A Deep Reinforcement Learning Agent for Snake Game”, International Journal of Artificial Intelligence, vol. 10, no. 2, pp. 92-102, Dec. 2023.

Download Citation

Issue

Vol 10 No 2: December 2023

Section

Articles

References

S. Kuutti, R. Bowden, Y. Jin, P. Barber and S. Fallah, “A survey of deep learning applications to autonomous vehicle control,” IEEE Transactions on Intelligent Transportation Systems, vol. 22, no. 2, pp. 712-733, 2020.

R. S. Sutton and A.G. Barto. (2018). Reinforcement learning: An introduction. MIT press.

K. Arulkumaran, M.P. Deisenroth, M. Brundage and A.A. Bharath, “Deep reinforcement learning: A brief survey,” IEEE Signal Processing Magazine, vol. 34, no. 6, pp. 26-38, 2017.

D.P. Bertsekas, “Feature-based aggregation and deep reinforcement learning: A survey and some new implementations,” IEEE/CAA Journal of Automatica Sinica, vol. 6, no. 1, pp. 1-31, 2018.

V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra and M. Riedmiller, “Playing atari with deep reinforcement learning,” arXiv preprint arXiv:1312.5602, 2013.

P. Henderson, R. Islam, P. Bachman, J. Pineau, D. Precup and D. Meger, “Deep reinforcement learning that matters,” In Proceedings of the AAAI conference on artificial intelligence, vol. 32, no. 1, April 2018

A. Gnanasekaran, J. F. Faba and J. An. Reinforcement learning in Pacman. 2017.

C. Szepesvári, “Algorithms for reinforcement learning,” Synthesis lectures on artificial intelligence and machine learning, vol. 4, no. 1, pp. 1-103, 2010.

Y. Patil, Snake Game Using Reinforcement Learning.

T.P. Lillicrap, J.J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver and D. Wierstra, “Continuous control with deep reinforcement learning,” arXiv preprint arXiv:1509.02971, 2015.

C. Wu, B. Ju, Y. Wu, X. Lin, N. Xiong, G. Xu, H. Li, and X. Liang, “UAV autonomous target search based on deep reinforcement learning in complex disaster scenes,” IEEE Access, vol. 7, pp. 117227-117245, 2019.

B. O'Donoghue, R. Munos, K. Kavukcuoglu and V. Mnih, “Combining policy gradient and Q-learning,” arXiv preprint arXiv:1611.01626, 2016.

D.J. Soemers, C.F. Sironi, T. Schuster and M.H. Winands, “Enhancements for real-time Monte-Carlo tree search in general video game playing,” In 2016 IEEE Conference on Computational Intelligence and Games (CIG), pp. 1-8, September 2016.

L. Xia, J. Xu, Y. Lan, J. Guo, W. Zeng and X. Cheng, “Adapting Markov decision process for search result diversification,” In Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval, pp. 535-544, August 2017.

R. Berthon, M. Randour and J.F. Raskin, “Threshold constraints with guarantees for parity objectives in Markov decision processes,” arXiv preprint arXiv:1702.05472, 2017.

R. Cai and C. Zhang. Train a snake with reinforcement learning algorithms. 2020.

V. Mnih, K. Kavukcuoglu, D. Silver, A.A. Rusu, J. Veness, M.G. Bellemare, A. Graves, M. Riedmiller, A.K. Fidjeland, G. Ostrovski and S. Petersen, “Human-level control through deep reinforcement learning,” nature, vol. 518, no. 7540, pp. 529-533, 2015.

Z. Wei, D. Wang, M. Zhang, A.H. Tan, C. Miao and Y. Zhou, “Autonomous agents in snake game via deep reinforcement learning,” In 2018 IEEE International conference on Agents (ICA), pp. 20-25, July 2018.