[PYTHON] Full disclosure of methods used in machine learning

Introduction

There are more opportunities for anyone to learn machine learning, such as kaggle and learning sites. On the other hand, I feel that there are many people who do not get a sense of the whole because the amount of information is too much.

Therefore, I would like to publish ** the methods used in machine learning introduced in various reference books and articles **. Please refer to the link for detailed coding.

We also publish various information on SNS, so if you feel good reading the article I would be grateful if you could follow Twitter account "Saku731".

A series of machine learning steps

First, the skills required to learn machine learning are as follows. In practice, it requires a lot of detailed work, but at first it is enough to study the following.

―― 1) Data visualization: Grasp the overall feeling of the data and decide the preprocessing policy -2) Data preprocessing: Clean the data so that the prediction accuracy is high. ―― 3) Algorithm selection: Determine the appropriate algorithm for the data -4) Model learning: Let the computer learn the rules of data -5) Model verification: Confirm the prediction accuracy of the completed model

1) Data visualization

-Confirm basic statistics

2) Data preprocessing

-Handling Missing Values

3) Algorithm selection

Link is here

--Regression --Linear regression (simple regression, multiple regression) --Regression tree --Random forest regression --Classification --Logistic regression --Decision tree --Random forest --Support Vector Machine (SVM) --Can be used for both --Neural network (deep learning) - XGBoost - LightGBM

4) Model learning

-Validation --Holdout method --Cross validation -Parameter tuning --Grid search --Random search -(Application) Bayesian optimization

5) Model verification

-Evaluation index (regression) - RMSE ( Root Mean Squared Error ) - RMSLE ( Root Mean Squared Logarithmic Error ) - MAE ( Mean Absolute Error ) --Coefficient of determination -Evaluation index (2 class classification) --Confusion matrix (TP, TN, FP, FN) - Accuracy - Precision - Recall --F value --LogLoss (cross entropy error) --AUC (ROC curve area) -Evaluation index (multi-class classification) - multi-class accuracy - multi-class logloss - mean-F1, macro-F1, micro-F1

at the end

I will increase the information from time to time. If you have any requests for additions or corrections, please contact us and it will be greatly appreciated.

I also wrote an article in "A series that uses machine learning for work". Please use all means.

-Part 1: Understanding the purpose of machine learning -Part 2: Overview of AI Development Project -Part 3: Python coding procedure

We also publish various information on SNS, so if you feel good reading the article I would be grateful if you could follow Twitter account "Saku731".

~~ Also, at the end of the sentence, we are doing "** Team Development Experience Project **" for a limited time. ~~ ~~ If you are interested, please check [Application Sheet] for details. ~~ (Addition) The deadline has been closed because it is full. The next time is scheduled for March 2019, so if you would like to be informed, please fill in [Reservation Form].

Recommended Posts

Full disclosure of methods used in machine learning
Summary of evaluation functions used in machine learning
Used in machine learning EDA
Summary of methods often used in pandas
[python] Frequently used techniques in machine learning
[Machine learning] List of frequently used packages
Get a glimpse of machine learning in Python
Causal reasoning using machine learning (organization of causal reasoning methods)
List of main probability distributions used in machine learning and statistics and code in python
A memorandum of method often used in machine learning using scikit-learn (for beginners)
About testing in the implementation of machine learning models
Machine learning in Delemas (practice)
Basics of Machine Learning (Notes)
Importance of machine learning datasets
The result of Java engineers learning machine learning in Python www
Survey on the use of machine learning in real services
Significance of machine learning and mini-batch learning
Machine learning ③ Summary of decision tree
Classification and regression in machine learning
Machine learning in Delemas (data acquisition)
Python: Preprocessing in Machine Learning: Overview
Preprocessing in machine learning 2 Data acquisition
Random seed research in machine learning
Preprocessing in machine learning 4 Data conversion
Coursera Machine Learning Challenges in Python: ex5 (Adjustment of Regularization Parameters)
Machine learning algorithm (generalization of linear regression)
Summary of frequently used commands in matplotlib
[Linux] List of Linux commands used in practice
2020 Recommended 20 selections of introductory machine learning books
Python: Preprocessing in machine learning: Data acquisition
Machine learning
Machine learning algorithm (implementation of multi-class classification)
Summary of built-in methods in Python list
[Python] Saving learning results (models) in machine learning
Python: Preprocessing in machine learning: Data conversion
Python: Preprocessing in machine learning: Handling of missing, outlier, and imbalanced data
Paper: Machine learning mimics the learning methods of multilingual children! (Visual Grounding in Video for Unsupervised Word Translation)
Preprocessing in machine learning 1 Data analysis process
Judgment of igneous rock by machine learning ②
I tried to organize the evaluation indexes used in machine learning (regression model)
Machine learning memo of a fledgling engineer Part 1
Data supply tricks using deques in machine learning
Beginning of machine learning (recommended teaching materials / information)
Machine learning of sports-Analysis of J-League as an example-②
Python & Machine Learning Study Memo ⑤: Classification of irises
List of frequently used built-in functions and methods
Numerai Tournament-Fusion of Traditional Quants and Machine Learning-
Summary of what was used in 100 Pandas knocks (# 1 ~ # 32)
Pattern recognition learning in video Part 1 Field of Pattern Recognition
Python & Machine Learning Study Memo ②: Introduction of Library
Utilization of recursive functions used in competition pros
Summary of tools used in Command Line vol.8
Summary of tools used in Command Line vol.5
List of links that machine learning beginners are learning
Fix the argument of the function used in map
Overview of machine learning techniques learned from scikit-learn
About the development contents of machine learning (Example)
Analysis of shared space usage by machine learning
[Translation] scikit-learn 0.18 Tutorial Introduction of machine learning by scikit-learn
Frequently used methods of Selenium and Beautiful Soup
Machine learning memo of a fledgling engineer Part 2