[PYTHON] Apache Spark Starter Kits

Target

Those who don't know where to start to do Apache Spark.

Here are some links related to Apache Spark. I mainly speak English. The Edx course is highly recommended. It is very easy to understand because it is explained in the video and you will learn by actually writing the code in Python. I will keep you updated! Please comment if you have any other good resources.

Originator

Overview

Compile and Run Example

This post is 1.4, but 1.5 should be the same.

Edx Introduction to Big Data with Apache Spark https://www.edx.org/course/introduction-big-data-apache-spark-uc-berkeleyx-cs100-1x

Scalable Machine Learning https://www.edx.org/course/scalable-machine-learning-uc-berkeleyx-cs190-1x

Bigdata university

Papers

Slide share of Japanese companies (NTT people are a lot.

Books

If you look for it, you will find various things, but what about it? I haven't read the following yet.

Spark summit

Others

Meetup in Japan

Commit email (Thanks to kou for making it.

Since the difference is colored, it is easier to see than the original commit email. You can subscribe below.

To: [email protected]
Cc: [email protected]
Subject: Subscribe
--
subscribe
        

Report any JIRA bugs here

Those who want to contribution

Recommended Posts

Apache Spark Starter Kits
Use apache Spark with jupyter notebook (IPython notebook)
Apache Spark Document Japanese Translation --Quick Start