[PYTHON] Numerical summary of data

This post covers numerical summaries, the most basic way to summarize data in data analysis.

Summary of one-dimensional data

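import numpy as np

# x is a one-dimensional array of numeric data; its definition is not shown here.

average = np.mean(x)  # mean, using the mean function
(Out  5.6)

med = np.median(x)  # median, using the median function
(Out  5.75)

var_p = np.var(x)  # variance, using the var function (divides by n by default, ddof=0)
(Out  8.19)

std = np.std(x)  # standard deviation, using the std function (also ddof=0 by default)
(Out  2.86)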
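
As a self-contained sketch of the calls above, the following uses a small made-up array (the values are hypothetical and are not the data behind the outputs shown above). It also shows the ddof parameter, which switches np.var and np.std from dividing by n (the default) to dividing by n - 1. The name var_u is introduced here only for illustration.

```python
import numpy as np

# Hypothetical data, chosen only for illustration.
x = np.array([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])

average = np.mean(x)       # arithmetic mean
med = np.median(x)         # middle value of the sorted data
var_p = np.var(x)          # variance dividing by n (ddof=0, the default)
var_u = np.var(x, ddof=1)  # unbiased variance dividing by n - 1
std_p = np.std(x)          # standard deviation, the square root of var_p

print(average, med, var_p, var_u, std_p)
```

If the data is a sample and you want to estimate the variance of the population it came from, ddof=1 is usually the safer choice. For large n the two values are nearly identical.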

For the meaning of each term, see https://note.com/karaage_love/n/n6f617d38c528

Summary of 2D data

import numpy as np
import matplotlib.pyplot as plt

# example.csv contains two columns of data; array holds them as a 2D array (the loading step is not shown here).

array_y = array[:, 1]  # slice: take the second column
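
The loading step is not shown, so here is a minimal sketch under stated assumptions: it supposes the file is comma-separated with no header row and reads it with np.loadtxt. An in-memory string with made-up values stands in for example.csv so that the snippet runs on its own, and array_x is a name introduced here for the first column.

```python
import io
import numpy as np

# Made-up stand-in for the contents of example.csv (two comma-separated columns).
csv_text = "1.0,2.0\n2.0,1.5\n3.0,4.0\n4.0,3.5\n"

# With a real file, pass the path instead: np.loadtxt("example.csv", delimiter=",")
array = np.loadtxt(io.StringIO(csv_text), delimiter=",")

array_x = array[:, 0]  # first column: every row, column index 0
array_y = array[:, 1]  # second column: every row, column index 1

print(array.shape)
print(array_x)
print(array_y)
```

If the real file has a header row, np.loadtxt accepts skiprows=1 to skip it.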


# Create a scatter plot with plt.scatter: s is the marker size, c is the marker color, and alpha is the transparency.
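
A scatter plot call along these lines might look like the sketch below. The data, the specific values passed for s, c, and alpha, and the output file name scatter.png are all assumptions made for illustration, not the settings used for the original figure.

```python
import numpy as np
import matplotlib.pyplot as plt

# Made-up data for illustration only.
array_x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
array_y = np.array([2.1, 1.9, 3.6, 3.8, 5.2])

fig, ax = plt.subplots()
# s: marker size (in points squared), c: marker color,
# alpha: opacity from 0 (fully transparent) to 1 (fully opaque)
points = ax.scatter(array_x, array_y, s=40, c="tab:blue", alpha=0.6)
ax.set_xlabel("x")
ax.set_ylabel("y")
fig.savefig("scatter.png")  # in an interactive session, plt.show() works as well
```

A lower alpha helps when many points overlap, since dense regions then appear darker.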

(Out   [[6.72727273 3.54545455]
        [3.54545455 6.        ]])
# The covariance result is a 2 × 2 matrix. The diagonal entries are the variances of x and y, respectively; the off-diagonal entries are the covariance of x and y.
(Out   [[1.         0.55805471]
        [0.55805471 1.        ]])
# Correlation coefficient: the result is again a 2 × 2 matrix, and the off-diagonal entries are the correlation coefficient of x and y.
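
Matrices of this shape are what np.cov and np.corrcoef return for two variables. The sketch below uses made-up data (not the contents of example.csv) and also checks by hand that the correlation coefficient is the covariance divided by the product of the two standard deviations. The name r_manual is introduced here for illustration.

```python
import numpy as np

# Made-up data for illustration only.
array_x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
array_y = np.array([2.0, 1.0, 4.0, 3.0, 5.0])

cov = np.cov(array_x, array_y)        # 2 x 2 covariance matrix (divides by n - 1 by default)
corr = np.corrcoef(array_x, array_y)  # 2 x 2 correlation matrix

# Correlation coefficient = covariance / (std of x * std of y)
r_manual = cov[0, 1] / np.sqrt(cov[0, 0] * cov[1, 1])

print(cov)
print(corr)
print(r_manual)
```

Note that np.cov divides by n - 1 by default, unlike np.var, which divides by n. The matrices printed above are consistent with the same relation: 3.54545455 / sqrt(6.72727273 × 6.0) ≈ 0.558.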

For a more detailed treatment of summarizing 2D data, see https://note.com/karaage_love/n/n992a7fdf9b1f

