[PYTHON] I want DQN Puniki to hit a home run

Deep Q-Network (DQN)

I found Deep Q-Network, which uses Deep Learning + reinforcement learning to learn behavior patterns, interesting, so I implemented it. I will publish it because I got a little result.

The source code is available below. https://github.com/dsanno/chainer-dqn

The following is detailed about DQN. History of DQN + Deep Q-Network written in Chainer

Learning target

The game that DQN will learn this time is Winnie the Pooh's Homelander Bee! . (Note that a sound will be heard at the link destination) The goal is to make Aniki, also known as Puniki, hit many home runs.

The reasons for choosing this game are as follows

There was also a reason that it was very difficult for humans, but I couldn't reach the point where it became difficult. (Reference: [Nico Nico Pedia](http://dic.nicovideo.jp/a/%E3%81%8F%E3%81%BE%E3%81%AE%E3%83%97%E3%83% BC% E3% 81% 95% E3% 82% 93% E3% 81% AE% E3% 83% 9B% E3% 83% BC% E3% 83% A0% E3% 83% A9% E3% 83% B3% E3% 83% 80% E3% 83% BC% E3% 83% 93% E3% 83% BC!))

Development environment

Neural network configuration

About play

About learning

Setting

Learning results

After continuing to study in Stage 1 for about 10 hours, I was able to almost clear Stage 1. I uploaded the play video below. I try not to take random actions when shooting videos. https://youtu.be/J4V6ZveYFUM

After learning including other stages, I was able to confirm that stage 3 was cleared by a fluke.

References

Recommended Posts

I want DQN Puniki to hit a home run
I want to run a quantum computer with Python
I want to print in a comprehension
I want to build a Python environment
I want to make matplotlib a dark theme
I want to easily create a Noise Model
I want to INSERT a DataFrame into MSSQL
I want to create a window in Python
I want to make a game with Python
I don't want to take a coding test
I want to create a plug-in type implementation
I want to easily find a delicious restaurant
I want to write to a file with Python
I want to upload a Django app to heroku
I want to embed a variable in a Python string
I want to easily implement a timeout in python
100 image processing knocks !! (021-030) I want to take a break ...
I want to give a group_id to a pandas data frame
I want to generate a UUID quickly (memorandum) ~ Python ~
I want to transition with a button in flask
I want to climb a mountain with reinforcement learning
I want to write in Python! (2) Let's write a test
I want to find a popular package on PyPi
I want to randomly sample a file in Python
I want to easily build a model-based development environment
I want to work with a robot in python.
I want to split a character string with hiragana
I want to install a package of Php Redis
[Python] I want to make a nested list a tuple
I want to manually create a legend with matplotlib
I want to send a business start email automatically
I'm a windows user but want to run tensorflow
I want to bind a local variable with lambda
I want a mox generator
I want to solve Sudoku (Sudoku)
I want a mox generator (2)
I want to make a blog editor with django admin
I want to start a jupyter environment with one command
[Python] I want to get a common set between numpy
I want to start a lot of processes from python
I want to make a click macro with pyautogui (desire)
I want to automatically generate a modern metal band name
NikuGan ~ I want to see a lot of delicious meat! !!
I want to make a click macro with pyautogui (outlook)
I want to use a virtual environment with jupyter notebook!
I want to install a package from requirements.txt with poetry
I want to send a message from Python to LINE Bot
[Visualization] I want to draw a beautiful graph with Plotly
I want to be able to run Python in VS Code
I want to make input () a nice complement in python
When you want to hit a UNIX command on Python
I want to create a Dockerfile for the time being.
I want to record the execution time and keep a log.
I want to use a wildcard that I want to shell with Python remove
I want to understand systemd roughly
MacBookPro Setup After all I want to do a clean installation
numpy: I want to convert a single type ndarray to a structured array
I want to run Rails with rails s even in vagrant environment
Qiskit: I want to create a circuit that creates arbitrary states! !!
I want to run the Python GUI when starting Raspberry Pi
I want to do a full text search with elasticsearch + python