Python learning memo for machine learning by Chainer Chapter 13 Neural network training ~ Chainer completed

What This is an article that summarizes what I noticed and researched when learning machine learning using Chainer. This time, we will study neural network training.

It is written based on my understanding, so it may be incorrect. I will correct any mistakes, please forgive me.

Content

Neural network training

To put it simply, improve the accuracy of the model, and make it smarter for the user.

Objective function

If we dig deeper into the neural network, that is, we will optimize the objective function. The following two typical objective functions are introduced.

Mean squared error is a method for finding the optimum solution for model parameters, whereas one solution is found at once. I understand that the method of predicting that this is more probabilistically possible is cross entropy.

Objective function optimization

Gradient descent method: As the name suggests, a method of updating parameters from the gradient Mini-batch learning method: Derivation of each objective function by making multiple sets of data sets. And how to update the parameters by taking the average value of the objective function (I'm not sure)

Activation function

If the value of the gradient of the activation function is small, the parameters of each layer will also be small. This is called gradient disappearance. Is there any restriction on the activation function? (Does not diverge, converges ...) Since it is output as a probability, should it be 1 or less? ?? ?? The ReLU function is introduced there, but who and how did you find it ...? I will update it when I know the details.

I wonder if deep learning has become possible by solving the problem of gradient disappearance.

Comment For the time being, I got an overview of machine learning. Next, I would like to make a concrete program. image.png

That's why I bought this book. To be honest, I didn't know what to buy because I was too inexperienced, but when I looked at the index of the table of contents, ** Because it uses the library learned by Chainer, it is suitable for actual battle ** ** Introducing web application creation, which may help to make the program publicly available ** So I decided to buy it.

So study this book ** STEP.1 Machine learning battle STEP.2 Master Pyhton to the application release level **

I will do my best to the next goal.

Recommended Posts

Python learning memo for machine learning by Chainer Chapter 13 Neural network training ~ Chainer completed
Python learning memo for machine learning by Chainer from Chapter 2
Python learning memo for machine learning by Chainer Chapter 13 Basics of neural networks
Python learning memo for machine learning by Chainer Chapter 7 Regression analysis
Python learning memo for machine learning by Chainer Chapter 8 Introduction to Numpy
Python learning memo for machine learning by Chainer Chapter 10 Introduction to Cupy
Python learning memo for machine learning by Chainer Chapter 9 Introduction to scikit-learn
Python learning memo for machine learning by Chainer Chapters 1 and 2
Python learning memo for machine learning by Chainer until the end of Chapter 2
Python & Machine Learning Study Memo ③: Neural Network
Python & Machine Learning Study Memo ④: Machine Learning by Backpropagation
[Python / Machine Learning] Why Deep Learning # 1 Perceptron Neural Network
Chapter 7 [Neural Network Deep Learning] P252 ~ 275 (first half) [Learn by moving with Python! New machine learning textbook]
Memo for building a machine learning environment using Python
Rank learning using neural network (Implementation of RankNet by Chainer)
Python Machine Learning Programming Chapter 2 Classification Problems-Machine Learning Algorithm Training Summary
Machine learning summary by Python beginners
PRML Chapter 5 Neural Network Python Implementation
<For beginners> python library <For machine learning>
Interval scheduling learning memo ~ by python ~
"Scraping & machine learning with Python" Learning memo
A memorandum of scraping & machine learning [development technique] by Python (Chapter 4)
A memorandum of scraping & machine learning [development technique] by Python (Chapter 5)
Python & Machine Learning Study Memo: Environment Preparation
[Learning memo] Basics of class by python
Amplify images for machine learning with python
Why Python is chosen for machine learning
[Shakyo] Encounter with Python for machine learning
[Python] Web application design for machine learning
An introduction to Python for machine learning
Python & Machine Learning Study Memo ⑥: Number Recognition
[Chainer] Document classification by convolutional neural network
Introduction to Deep Learning for the first time (Chainer) Japanese character recognition Chapter 2 [Model generation by machine learning]
Python vs Ruby "Deep Learning from scratch" Chapter 3 Implementation of 3-layer neural network
Python & Machine Learning Study Memo ⑤: Classification of irises
Upgrade the Azure Machine Learning SDK for Python
Python & Machine Learning Study Memo ②: Introduction of Library
[Python] Collect images with Icrawler for machine learning [1000 images]
Chapter 6 Supervised Learning: Classification pg212 ~ [Learn by moving with Python! New machine learning textbook]
[Memo] Machine learning
A memo for creating a python environment by a beginner
Python & Machine Learning Study Memo ⑦: Stock Price Forecast
Python learning notes for machine learning with Chainer Chapters 11 and 12 Introduction to Pandas Matplotlib
Image collection Python script for creating datasets for machine learning
Build an interactive environment for machine learning in Python
Preparing to start "Python machine learning programming" (for macOS)
Use scikit-learn training dataset with chainer (for learning / prediction)
[Python] I made a classifier for irises [Machine learning]
Deep Learning Experienced in Python Chapter 2 (Materials for Journals)
Python class (Python learning memo ⑦)
Visualization memo by Python
Machine learning course memo
Build an environment for machine learning using Python on MacOSX
Non-information graduate student studied machine learning from scratch # 2: Neural network
How to use machine learning for work? 03_Python coding procedure
PRML Chapter 7 Related Vector Machine Python Implementation for Regression Problems
Chapter 7 [Error back propagation method] P275 ~ (Middle) [Learn by moving with Python! New machine learning textbook]
Data set for machine learning
Japanese preprocessing for machine learning
Python memo (for myself): Array
Python exception handling (Python learning memo ⑥)