Python practice data analysis Summary of learning that I hit about 10 with 100 knocks

How much you use pandas is important in this era

I am trying every day to knock 100 Python practical data analysis that is selling well these days. This time, I will share the lessons I learned from the simple to the necessary ones. (Updated from time to time)

This book contains 100 practical examples. (Book link (Amazon))

As a future trend ** "Python is much simpler than VBA, and you can create materials much faster!" ** I think that ** business people who can combine Excel and Python ** will be popular in the company.

As a remark, ** the machine learning book of Shuwa System is lighter, more practical, cheaper and recommended ** than the O'Reilly book (I also buy the O'Reilly book when I have money).

I will introduce some examples with samples that you can copy and paste with Jupyter immediately. (It's a little different from the contents of the book. I can't introduce everything. The cost performance is outstanding, so please buy it.)

(2019/12/21) @konandoiruasa pointed out that the CSV of the code cannot be read from the URL. If that happens, do the following: Thank you @konandoiruasa.

import ssl
ssl._create_default_https_context = ssl._create_unverified_context

Read the data

Read the data simply.

import pandas as pd
customer_data= pd.read_csv('https://microlearning.site/pydata/ch1/customer_master.csv')
customer_data.head()

Read various data

import pandas as pd
transaction_1 = pd.read_csv('https://microlearning.site/pydata/ch1/transaction_1.csv')
transaction_1.head()
import pandas as pd
transaction_2 = pd.read_csv('https://microlearning.site/pydata/ch1/transaction_1.csv')
transaction_2.head()

Join data

transaction = pd.concat([transaction_1,transaction_2],ignore_index=True)
transaction.head()

Comparing the lengths, I think that ** transaction ** is the total number of columns of ** transaction_1 ** and ** transaction_2 **.

print(len(transaction_1))
print(len(transaction_2)
print(len(transaction))

Change log

--2019/12/21 Newly created

Recommended Posts

Python practice data analysis Summary of learning that I hit about 10 with 100 knocks
Data analysis starting with python (data preprocessing-machine learning)
A summary of Python e-books that are useful for free-to-read data analysis
[Python] Data analysis, machine learning practice (Kaggle) -Data preprocessing-
I started machine learning with Python Data preprocessing
Data analysis with python 2
Data analysis with Python
About data preprocessing of systems that use machine learning
[OpenCV / Python] I tried image analysis of cells with OpenCV
Challenge principal component analysis of text data with Python
Summary of the basic flow of machine learning with Python
Summary of statistical data analysis methods using Python that can be used in business
Python data analysis learning notes
20200329_Introduction to Data Analysis with Python Second Edition Personal Summary
Basic summary of data manipulation with Python Pandas-First half: Data creation & manipulation
Data analysis starting with python (data visualization 1)
Data analysis starting with python (data visualization 2)
I studied four libraries of Python 3 engineer certified data analysis exams
Practical exercise of data analysis with Python ~ 2016 New Coder Survey Edition ~
Practice of data analysis by Python and pandas (Tokyo COVID-19 data edition)
I read "Reinforcement Learning with Python: From Introduction to Practice" Chapter 1
About learning method with original data of CenterNet (Objects as Points)
Image processing with Python 100 knocks # 4 Binarization of Otsu (discriminant analysis method)
I read "Reinforcement Learning with Python: From Introduction to Practice" Chapter 2
[Examples of improving Python] Learning Python with Codecademy
I'm tired of Python, so I analyzed the data with nehan (corona related, is that word now?)
Basic summary of scraping with Requests that beginners can absolutely understand [Python]
Machine learning with python (2) Simple regression analysis
I tried factor analysis with Titanic data!
Source code of sound source separation (machine learning practice series) learned with Python
[Python] First data analysis / machine learning (Kaggle)
Sentiment analysis of tweets with deep learning
I have 0 years of programming experience and challenge data processing with python
Summary of probability distributions that often appear in statistics and data analysis
Recommendation of Altair! Data visualization with Python
I tried fMRI data analysis with python (Introduction to brain information decoding)
I did Python data analysis training remotely
I started machine learning with Python (I also started posting to Qiita) Data preparation
Data analysis in Python Summary of sources to look at first for beginners
Summary of how to read numerical data with python [CSV, NetCDF, Fortran binary]
Summary of scikit-learn data sources that can be used when writing analysis articles
Align the number of samples between classes of data for machine learning with Python
Practice of creating a data analysis platform with BigQuery and Cloud DataFlow (data processing)
I tried hundreds of millions of SQLite with python
Create test data like that with Python (Part 1)
Analysis for Data Scientists: Qiita Self-Article Summary 2020 (Practice)
Python 3 Engineer Certification Data Analysis Exam Pre-Exam Learning
I tried principal component analysis with Titanic data!
I tried to get CloudWatch data with Python
Data analysis in Python: A note about line_profiler
[Python] Summary of S3 file operations with boto3
I / O related summary of python and fortran
Static analysis of Python code with GitLab CI
One-liner that outputs 10000 digits of pi with Python
A well-prepared record of data analysis in Python
A story about data analysis by machine learning
Two things I was happy about with Python 3.9
Data analysis python
[Python] I examined the practice of asynchronous processing that can be executed in parallel with the main thread (multiprocessing, asyncio).
Get a large amount of Starbucks Twitter data with python and try data analysis Part 1
Predicting the goal time of a full marathon with machine learning-③: Visualizing data with Python-