[PYTHON] I made a site because I wanted JSON data that can be used freely in demos and prototypes

Why I thought of it

When I come up with an idea for a new web application or library, I sometimes make a prototype to explain it to others.

It is easy to keep the browser's default styles without writing any CSS, make every image a gray square, and fill the text with "○○○" or "dummy dummy dummy", but the result is not very interesting.

In such cases, using Bootstrap or something similar for the design and picking images from free stock material lets you make a good prototype without spending too much time.

However, when I wanted 100 short sentences, 100 names, 100 addresses, and so on, I had to handle everything myself, either by generating dummy data automatically or by finding data somewhere. Even when I found data, it was often hard to use as-is and needed processing first.

As this continued, I came to think that it would be nice if there were free data materials.

I tried making it

I searched again just in case, but I could not find any free data material that looked easy to use, so I decided to try making it myself.

I thought it would be better to unify the data format, so I chose JSON, which is easy to read from any language. I wanted to use an existing service as the place to publish the data, but I could not find a suitable one, so I set up my own site.

FreeJSON.net - Free and freely usable JSON data http://freejson.net

As a starting point, I registered two kinds of data. One is data on the active students of Takarazuka (in Takarazuka, the performers are called students), which I thought might be a little interesting as personal-name data. The other is a collection of the first 400 characters of each work in Aozora Bunko, intended as text data for something like a posting site.
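Because the data is plain JSON, using it in a prototype takes only a few lines. The following is a minimal sketch in Python, not the site's actual format: the sample records and the field names `title` and `text` are assumptions made for illustration, and the real files on FreeJSON.net may be structured differently.

```python
import json
import random

# Hypothetical sample standing in for a downloaded JSON file.
# The field names ("title", "text") are assumptions for illustration,
# not the actual schema of any FreeJSON.net dataset.
SAMPLE_JSON = """
[
  {"title": "Sample work A", "text": "First dummy passage."},
  {"title": "Sample work B", "text": "Second dummy passage."},
  {"title": "Sample work C", "text": "Third dummy passage."}
]
"""


def load_records(raw):
    """Parse a JSON array of objects into a list of dicts."""
    records = json.loads(raw)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array at the top level")
    return records


def pick_for_prototype(records, count, seed=0):
    """Return up to `count` records in a reproducible random order."""
    rng = random.Random(seed)
    return rng.sample(records, min(count, len(records)))


if __name__ == "__main__":
    records = load_records(SAMPLE_JSON)
    for item in pick_for_prototype(records, 2):
        print(item["title"], "-", item["text"])
```

Fixing the seed keeps the demo showing the same items on every run, which is usually what you want when explaining a prototype to others.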

Impressions after making it

Actually creating the data is surprisingly interesting. When I am building a prototype, preparing data is beside the point, so it feels like something I am forced to do, but when creating data is the goal itself, my mood changes.

Also, the data of Takarazuka students and the data of Aozora Bunko are interesting just by looking at them. I'm going to collect data steadily for a while. After updating the data, I will tweet with the hashtag #FreeJSON.

Since the amount of data one person can create is limited, it might be good to create a library-like mechanism for collecting this kind of data.

Update information

2016-08-06 Added a list of matcha inscriptions
2016-08-06 Added a drink menu of the kind you might find in a cafe
2016-08-02 Added deep-breathing words, Hirofumi Goto (from the deep-breathing words project http://shin-kokyu.jp)
2016-08-01 Added a list of common kanji (provided by @wakufactory)
2016-08-01 Added information on Japanese museums (with latitude and longitude information)
