[PYTHON] You will be an engineer in 100 days ――Day 83 ――Programming ――About machine learning 8

Click here until yesterday

You will become an engineer in 100 days --Day 76 --Programming --About machine learning

You will become an engineer in 100 days-Day 70-Programming-About scraping

You will become an engineer in 100 days --Day 66 --Programming --About natural language processing

You will become an engineer in 100 days --Day 63 --Programming --Probability 1

You will become an engineer in 100 days-Day 59-Programming-Algorithms

You will become an engineer in 100 days --- Day 53 --Git --About Git

You will become an engineer in 100 days --Day 42 --Cloud --About cloud services

You will become an engineer in 100 days --Day 36 --Database --About the database

You will be an engineer in 100 days-Day 24-Python-Basics of Python language 1

You will become an engineer in 100 days --Day 18 --Javascript --JavaScript basics 1

You will become an engineer in 100 days --Day 14 --CSS --CSS Basics 1

You will become an engineer in 100 days --Day 6 --HTML --HTML basics 1

This time is a continuation of the story about machine learning.

About clustering

I will explain what you can do with machine learning for the first time, but what you can do with machine learning There are basically three.

・ Regression ・ Classification ・ Clustering

Roughly speaking, it becomes prediction, but the part of what to predict changes.

・ Regression: Predict numerical values ・ Classification: Predict categories ・ Clustering: Make it feel good

Clustering becomes unsupervised learning I don't know the answer, but divide it into something nice You can do that.

The data used this time is the data of digits (numbers) attached to scikit-learn.

Data reading

First, let's read the numerical data. You can load the data with load_digits.

from sklearn.datasets import load_digits
import matplotlib.pyplot as plt
%matplotlib inline

digits = load_digits()
print(digits.data.shape)

plt.gray() 
plt.matshow(digits.images[0]) 
plt.show()

(1797, 64)

	2	3	4	5	9	...	54	58	59	60	61	62
0	5	13	9	1	0	...	0	6	13	10	0	0
1	0	12	13	5	0	...	0	0	11	16	10	0
2	0	4	15	12	0	...	5	0	3	11	16	9
3	7	15	13	1	8	...	9	7	13	13	9	0
4	0	1	11	0	0	...	0	0	2	16	4	0

[PYTHON] You will be an engineer in 100 days ――Day 83 ――Programming ――About machine learning 8

About clustering

Data reading

Perform clustering

Let's see the result

Summary

Author information

	2	3	4	5	9	...	54	58	59	60	61	62
0	5	13	9	1	0	...	0	6	13	10	0	0
1	0	12	13	5	0	...	0	0	11	16	10	0
2	0	4	15	12	0	...	5	0	3	11	16	9
3	7	15	13	1	8	...	9	7	13	13	9	0
4	0	1	11	0	0	...	0	0	2	16	4	0

	2	3	4	5	9	...	54	58	59	60	61	62
0	5	13	9	1	0	...	0	6	13	10	0	0
1	0	12	13	5	0	...	0	0	11	16	10	0
2	0	4	15	12	0	...	5	0	3	11	16	9
3	7	15	13	1	8	...	9	7	13	13	9	0
4	0	1	11	0	0	...	0	0	2	16	4	0

	2	3	4	5	9	...	54	58	59	60	61	62
0	5	13	9	1	0	...	0	6	13	10	0	0
1	0	12	13	5	0	...	0	0	11	16	10	0
2	0	4	15	12	0	...	5	0	3	11	16	9
3	7	15	13	1	8	...	9	7	13	13	9	0
4	0	1	11	0	0	...	0	0	2	16	4	0