[PYTHON] Save & load data with joblib, pickle

Save and load data with joblib, pickle

I will use it often, so I will leave it for notes.

joblib and pickle are ** libraries that can store various data in a nice way **. It can be used not only for text and csv, but also for saving trained models. Also, I feel that the reading and writing speeds are fast. (It seems that memory is used quite a bit)

Basically, it seems that using joblib is better in terms of memory than pickle.

Example

import pandas as pd

arr = ['a','b','c','d','e']
df = pd.DataFrame({'data':arr})
df.head(5)

# data
#0	a
#1	b
#2	c
#3	d
#4	e

Save & load data with joblib

import joblib

#Data storage
joblib.dump(df,'test_jb.pkl', compress=3)
#Data reading
load_df = joblib.load('test_jb.pkl')
load_df.head()

# data
#0	a
#1	b
#2	c
#3	d
#4	e

Save & load data with pickle

import pandas as pd

#Data storage
df.to_pickle('test_pk.pkl')
#Data reading
load_df2 = pd.read_pickle('test_pk.pkl')
load_df2.head()

# data
#0	a
#1	b
#2	c
#3	d
#4	e

If you change the "data" part to "learned model", you can save the model as it is.

Recommended Posts

Save & load data with joblib, pickle
LOAD DATA with PyMysql
Save tweet data with Django
Save / load in-memory DB with python sqlite3
I tried to save the data with discord
Save data to flash with STM32 Nucleo Board
Save the object to a file with pickle
Data analysis with python 2
Save memory with `` __slots__``
Visualize data with Streamlit
Reading data with TensorFlow
Data visualization with pandas
Data manipulation with Pandas!
Shuffle data with pandas
Data Augmentation with openCV
Normarize data with Scipy
Data analysis with Python
Embed audio data with Jupyter
Graph Excel data with matplotlib (1)
Load nested json with pandas
Artificial data generation with numpy
Extract Twitter data with CSV
Save images with web scraping
Get Youtube data with python
Clustering ID-POS data with LDA
Learn new data with PaintsChainer
Binarize photo data with OpenCV
Graph Excel data with matplotlib (2)
Load test Websocket with Locust
Data processing tips with Pandas
Interpolate 2D data with scipy.interpolate.griddata
Read json data with python
How to enable Keras Regressor to be saved with pickle or joblib
Save the results of crawling with Scrapy to the Google Data Store
Let's create a tic-tac-toe AI with Pylearn 2-Save and load models-