[PYTHON] Reinforcement learning 28 colaboratory + OpenAI + chainerRL

(The chokozainer RL was updated on December 8, 2019.)

It is intended for AI beginners from junior high school students to university students. I have written up to 27 reinforcement learning series. It was about one month because it was one pace a day. I will write a summary that makes it easier to start from here. There is nothing new.

If you want to start machine learning using GPU easily, colaboratory is free, so I recommend it. No need for troublesome installation. However, that alone will not work, so let's summarize the procedure. I chose chainerRL as the framework. I like tensorflow, but I haven't used it, so ... I think I'll try using tensorflow soon. I will write it in the middle, but let's look at the source code as much as possible. It is published on github. The function name is easy to understand, so I think it's easy to understand. Chainer is easy to read in English, probably because it is made by Japanese people. Or rather, if you translate it into Japanese with chrome, it will be in proper Japanese. What about tensorflow? ?? ?? is. We have released chokozainerRL, which is a wrapper for chainerRL. I haven't done much, but I hope it will be useful for "human learning" in reinforcement learning.

1 Get a Google account

Please create from here.

Create a google account https://support.google.com/accounts/answer/27441?hl=ja

2 Open the Colaboratory page

Open from here. https://colab.research.google.com/notebooks/welcome.ipynb?hl=ja Bookmark it.

3 Open the chokozainer sample page and save it in your own folder

3-1 Open the notebook

open_toolbar.png

File-Open Notebook select_abc.png

Select the GitHub tab. Search with chokozainer, ipynbs/abc.ipynb Choose.

The opened abc.ipynb cannot be used as it is, so make a copy on the drive. select_save.png

Then rename the copied file. change.png

4 Run the notebook.

Running a notebook is explained in detail on various sites, so please do it yourself. Before learning, you could only do a few steps, but after learning, you can see that you can balance up to 200 steps of setting. You can make a video like this. videoimage.png

The learning execution result looks like this. result.png

Since elapsed is the execution time (seconds), learning will be completed in about 15 minutes.

Recommended Posts

Reinforcement learning 28 colaboratory + OpenAI + chainerRL
Reinforcement learning 18 Colaboratory + Acrobat + ChainerRL
Reinforcement learning 17 Colaboratory + CartPole + ChainerRL
Reinforcement learning 19 Colaboratory + Mountain_car + ChainerRL
Reinforcement learning 20 Colaboratory + Pendulum + ChainerRL
Reinforcement learning 21 Colaboratory + Pendulum + ChainerRL + A2C
Reinforcement learning 22 Colaboratory + CartPole + ChainerRL + A3C
Reinforcement learning 24 Colaboratory + CartPole + ChainerRL + ACER
Reinforcement learning 11 Try OpenAI acrobot with ChainerRL.
Reinforcement learning 27 colaboratory 90-minute rule measures chainerRL (+ chokozainerRL)
Reinforcement learning 2 Installation of chainerrl
Reinforcement learning 9 ChainerRL magic remodeling
Reinforcement learning 13 Try Mountain_car with ChainerRL.
[Introduction] Reinforcement learning
Future reinforcement learning_2
Future reinforcement learning_1
Reinforcement learning 14 Pendulum was done at ChainerRL.
Reinforcement learning 12 ChainerRL quick start guide windows version
[Mac] I tried reinforcement learning with OpenAI Baselines
Reinforcement learning for tic-tac-toe
[Reinforcement learning] Bandit task
Python + Unity Reinforcement Learning (Learning)
Reinforcement learning 1 introductory edition
Reinforcement learning 7 Learning data log output
Play with reinforcement learning with MuZero
Reinforcement learning in the shortest time with Keras with OpenAI Gym
[Reinforcement learning] Tracking by multi-agent
Reinforcement learning 6 First Chainer RL
Reinforcement learning starting with Python
Reinforcement learning 5 Try programming CartPole?
Reinforcement learning Learn from today
Reinforcement learning 4 CartPole first step
Deep Reinforcement Learning 1 Introduction to Reinforcement Learning
Reinforcement learning 23 Create and use your own module with Colaboratory
Deep reinforcement learning 2 Implementation of reinforcement learning
DeepMind Reinforcement Learning Framework Acme
Reinforcement learning: Accelerate Value Iteration
I tried deep reinforcement learning (Double DQN) for tic-tac-toe with ChainerRL
[Reinforcement learning] How to draw OpenAI Gym on Google Corab (2020.6 version)
TF2RL: Reinforcement learning library for TensorFlow2.x
Reinforcement learning 34 Make continuous Agent videos
Python + Unity Reinforcement learning environment construction
Explore the maze with reinforcement learning
Reinforcement learning 8 Try using Chainer UI
Reinforcement learning 3 Dynamic programming / TD method
Deep Reinforcement Learning 3 Practical Edition: Breakout
I tried reinforcement learning using PyBrain
Learn while making! Deep reinforcement learning_1