Research and implementation of intelligent decision based on a priori knowledge and DQN algorithms in wargame environment

Journal article


Sun, Yuxiang, Yuan, Bo, Zhang, Tao, Tang, Bojian, Zheng, Wanwen and Zhou, Xianzhong 2020. Research and implementation of intelligent decision based on a priori knowledge and DQN algorithms in wargame environment. Electronics. 9 (10), p. 1668. https://doi.org/10.3390/electronics9101668
Authors: Sun, Yuxiang, Yuan, Bo, Zhang, Tao, Tang, Bojian, Zheng, Wanwen and Zhou, Xianzhong
Abstract

Reinforcement learning for complex action control in multi-player wargames has been an active research topic in recent years. In this paper, a turn-based confrontation game system is designed and implemented with state-of-the-art deep reinforcement learning models. Specifically, we first design a Q-learning algorithm based on the DQN (Deep Q Network) to model complex game behaviors and achieve intelligent decision-making. Then, an a priori knowledge-based algorithm, PK-DQN (Prior Knowledge-Deep Q Network), is introduced to improve the DQN algorithm; it accelerates convergence and improves the stability of the algorithm. Experiments validate the PK-DQN algorithm and show that its performance surpasses that of the conventional DQN algorithm. Furthermore, the PK-DQN algorithm is effective in defeating high-level rule-based opponents, which provides promising results for the exploration of smart chess and intelligent game deduction.
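The abstract does not say how PK-DQN injects prior knowledge, so the following is only a minimal sketch of one common way to combine the two ideas. It shows the standard Q-learning/DQN target and an epsilon-greedy action choice restricted by a rule-based mask. The names `td_target`, `select_action`, and `prior_mask` are hypothetical and do not come from the paper, and plain lists of Q-values stand in for the output of a neural network.

```python
import random


def td_target(reward, next_q_values, gamma=0.9, done=False):
    """Standard Q-learning / DQN target: r + gamma * max_a' Q(s', a')."""
    if done:
        # No future return is added after a terminal state.
        return reward
    return reward + gamma * max(next_q_values)


def select_action(q_values, prior_mask, epsilon, rng):
    """Epsilon-greedy choice limited to actions the prior allows.

    ASSUMPTION: prior_mask[i] is True when hypothetical rule-based
    prior knowledge considers action i sensible in the current state.
    This masking scheme is illustrative, not the paper's method.
    """
    allowed = [i for i, ok in enumerate(prior_mask) if ok]
    if not allowed:
        # Fall back to every action if the prior rules out all of them.
        allowed = list(range(len(q_values)))
    if rng.random() < epsilon:
        # Explore, but only among the allowed actions.
        return rng.choice(allowed)
    # Exploit: pick the allowed action with the highest Q-value.
    return max(allowed, key=lambda i: q_values[i])
```

For example, calling `select_action(q_values, prior_mask, epsilon=0.1, rng=random.Random(0))` explores 10% of the time while never choosing an action the mask rejects. Restricting exploration in this way is one plausible reason a prior could speed up convergence, though the paper's own mechanism may differ.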

Keywords: DQN algorithm; policy modeling; prior knowledge; intelligent decision
Year: 2020
Journal: Electronics
Journal citation: 9 (10), p. 1668
Publisher: MDPI AG
ISSN: 2079-9292
Digital Object Identifier (DOI): https://doi.org/10.3390/electronics9101668
Web address (URL): http://hdl.handle.net/10545/625346
http://creativecommons.org/licenses/by-nc-sa/4.0/
hdl:10545/625346
Publication dates: 13 Oct 2020
Publication process dates
Deposited: 06 Nov 2020, 11:34
Accepted: 06 Oct 2020
Rights

Attribution-NonCommercial-ShareAlike 4.0 International

Contributors: University of Derby and Nanjing University, China
File
File Access Level
Open
Permalink: https://repository.derby.ac.uk/item/93vz5/research-and-implementation-of-intelligent-decision-based-on-a-priori-knowledge-and-dqn-algorithms-in-wargame-environment

