An introduction to statistical modeling for data analysis (Midorimoto) reading notes (in Python and Stan)

Reading memo

We had a reading party for Introduction to Statistical Modeling for Data Analysis, which is famous as "Green Book". At that time, I made a reading memo and share it. I uploaded it as Jupyter Notebook on GitHub. The R and WinBUGS codes in the book are written in Python and Stan as much as possible.

Create a statistical model to understand Chap1 data Chap2 Probability Distribution and Statistical Model Maximum Likelihood Estimate Chap3 Generalized Linear Model (GLM) -Poisson Regression Chap4 GLM Model Selection-AIC and Model Prediction Chap5 GLM Likelihood Ratio Test and Test Asymmetry Expanding the range of applications of Chap6 GLM-logistic regression, etc. Chap7 Generalized Linear Mixed Models (GLMM) -Modeling of Individual Differences Chap8 Markov Chain Monte Carlo (MCMC) Method and Bayesian Statistical Model Bayesian modeling of Chap9 GLM and estimation of posterior distribution Chap10 Hierarchical Bayesian Model-GLMM Bayesian Modeling Chap11 Hierarchical Bayesian model with spatial structure

What is a green book?

Official name, Introduction to Statistical Modeling for Data Analysis. The cover is green, so it's called a green book. An introductory book on statistical modeling (a method of creating a model and applying it to observation data to understand the phenomenon). I think that it is often introduced in TJO's blog article. According to the preface, the reader is supposed to be "a person who has not received basic training to" express and explain mathematical models with phenomena "."

Those who want to recommend green books

I usually touch the data, scatter it in Excel and draw an approximate curve, but I don't know what I'm doing, so I thought it would be good to read it. Personally, I felt a difficult atmosphere from the cover and the title of the book, so I hesitated to read it, but as I read it, as the name "Introduction" suggests, it was written in a very easy-to-understand manner. .. Rather than starting studying from Introduction to Statistics at the University of Tokyo (Akamoto), I feel that it may be better to read after understanding the flow of statistical modeling in the green book. I am.

Even those who use Python are okay

In the book, the explanation is advanced using R, but there are many codes written in Python on the net, and I think that it is possible to run it in Python at hand by referring to that. (Thank you for referring to many sites.) If the sample data is in RData format, I think it is necessary to convert it to CSV on R. You can also see code written in Stan instead of WinBUGS code.

It may get difficult from around chapters 5 and 6, but I think it's a good book, so please read it!

Recommended Posts

An introduction to statistical modeling for data analysis (Midorimoto) reading notes (in Python and Stan)
An introduction to statistical modeling for data analysis
"Introduction to data analysis by Bayesian statistical modeling starting with R and Stan" implemented in Python
Reading Note: An Introduction to Data Analysis with Python
Introduction to Statistical Modeling for Data Analysis GLM Model Selection
Introduction to Statistical Modeling for Data Analysis Generalized Linear Models (GLM)
Introduction to Statistical Modeling for Data Analysis Expanding the range of applications of GLM
An introduction to Python for non-engineers
Solving AOJ's Algorithm and Introduction to Data Structures in Python -Part1-
Solving AOJ's Algorithm and Introduction to Data Structures in Python -Part2-
Solving AOJ's Algorithm and Introduction to Data Structures in Python -Part4-
Solving AOJ's Algorithm and Introduction to Data Structures in Python -Part3-
An introduction to Python for machine learning
An introduction to Python for C programmers
Duck book implemented in Python "Bayesian statistical modeling with Stan and R"
Data analysis: Easily apply descriptive and inference statistics to CSV data in Python
An introduction to voice analysis for music apps
Data analysis in Python Summary of sources to look at first for beginners
Introduction to Time Series Analysis ~ Seasonal Adjustment Model ~ Implemented in R and Python
Python learning notes for machine learning with Chainer Chapters 11 and 12 Introduction to Pandas Matplotlib
Organizing basic procedures for data analysis and statistical processing (4)
[For beginners] How to study Python3 data analysis exam
Organizing basic procedures for data analysis and statistical processing (2)
[Introduction for beginners] Reading and writing Python CSV files
[Technical book] Introduction to data analysis using Python -1 Chapter Introduction-
Reading, summarizing, visualizing, and exporting time series data to an Excel file with Python
[Impression] [Data analysis starting from zero] Introduction to Python data science learned in business cases
[Introduction to cx_Oracle] (Part 6) DB and Python data type mapping
[Introduction to Data Scientists] Descriptive Statistics and Simple Regression Analysis ♬
Python for Data Analysis Chapter 4
How to swap elements in an array in Python, and how to reverse an array.
[Introduction to Udemy Python3 + Application] 42. for statement, break statement, and continue statement
[Introduction to Udemy Python 3 + Application] 36. How to use In and Not
20200329_Introduction to Data Analysis with Python Second Edition Personal Summary
Tips for coding short and easy to read in Python
[Understand in the shortest time] Python basics for data analysis
Python data analysis learning notes
[Introduction to Data Scientists] Basics of Python ♬ Functions and classes
Introduction to Effectiveness Verification Chapters 4 and 5 are written in Python
Python for Data Analysis Chapter 2
An introduction to Python Programming
[Introduction to Python] Combine Nikkei 225 and NY Dow csv data
[Python] Introduction to graph creation using coronavirus data [For beginners]
Introduction to Python For, While
Tips for data analysis ・ Notes
Notes on reading and writing float32 TIFF images in python
Python for Data Analysis Chapter 3
An introduction to type annotations and Pyright for more robust Python code with rich input completion
processing to use notMNIST data in Python (and tried to classify it)
[Python] PCA scratch in the example of "Introduction to multivariate analysis"
Introduction to Structural Equation Modeling (SEM), Covariance Structure Analysis with Python
Introduction to Data Analysis with Python P32-P43 [ch02 3.US Baby Names 1880-2010]
Introduction to Data Analysis with Python P17-P26 [ch02 1.usa.gov data from bit.ly]
[Introduction to Python] How to use the in operator in a for statement?
Introduction to Bayesian Statistical Modeling with python ~ Trying Linear Regression with MCMC ~
[Introduction to Data Scientists] Basics of Python ♬ Conditional branching and loops
Play with Statistical Modeling: Quantify J-League Team Strength with Stan and Python
[Introduction to Data Scientists] Basics of Python ♬ Functions and anonymous functions, etc.
An introduction to Mercurial for non-engineers
[Introduction to Python3 Day 1] Programming and Python
Hashing data in R and Python