[PYTHON] [With screenshots] How a Kaggle beginner can rank up from "Novice" to "Contributor" in 10 minutes

Introduction

I am in charge of day 4 of the Nikkei xTECH Business AI Advent Calendar 2019, "The Road to the AI Dojo 'Kaggle'".

On Kaggle, how much your voice counts depends on your rank

Kaggle ranks its users. The system is called Progression, and it works like a progress level or a title.

Your rank affects how much attention your posted scripts receive and how much weight your comments carry.

When you register, you are given the title **Novice**, and once you meet the prescribed conditions you become a **Contributor**. To climb higher from there, you have to post things that earn upvotes from others, win medals, and so on.

Kaggle is a competition, and if all you want is medals, the best strategy is to hold back your own solution until the last minute, read other people's experiments, improve on their accuracy, and only then submit. But that alone does not help Kaggle thrive. Kaggle is also a place for technical exchange. You contribute to the community by posting your own solutions and by commenting on other people's solutions. Posting useful information earns you upvotes and raises your rank.

Once you have registered, staying a beginner **Novice** is no fun, so let's aim for **Contributor**. This article summarizes, step by step, how to become a **Contributor**.

Register immediately

kaggle Official

image.png

There is a blue Register button in the upper right. I log in with a Google account because it is the easiest way.

Check Progression rank

To check your rank, hover your mouse over "..." in the top tab bar and click Progression.

000.png

As you scroll down, you'll see the conditions for achieving each rank. At first, only Novice should have a check mark. The condition for Novice is registration, so it is checked as soon as you register.

Let's take a look at Contributor, the next rank up and the one we are aiming for this time.

kag1.PNG

The items to complete are:

・ Update your bio
・ Add your location
・ Add your occupation
・ Add your organization
・ Verify your account by SMS
・ Run one script
・ Make one competition submission
・ Make one comment
・ Cast one upvote

Let's work through them one by one.

Update your bio

What is a bio in the first place? It is short for biography, a place to write your background, although some people simply introduce themselves or list their hobbies.

Right after registration, a bird icon (the default avatar) is shown where the login button used to be in the upper right. Hover over the bird and click "My Profile" to open your profile page.

kag20.PNG

The bio is below the medals. Click Edit on the right, add a short self-introduction, and you are done. The bio item on the Progression page should now be checked.

Add your location, occupation, and organization

In the screenshot where we added the bio, there is a blue Edit Profile button in the upper right. Click it.

035.png

A form for entering your information appears. Enter your job title under Occupation, your affiliation under Organization, and your location under City.

That adds three more check marks.

Next, do the SMS verification. Scroll down a little and you will find the SMS verification link. Until you verify, the status reads Not verified.

2033.png

Submit to a competition

・ Run a script
・ Submit a result
・ Add a comment

These are the items we complete here. This is probably the biggest hurdle. This article is for anyone who thinks it looks like a hassle.

First, find the search icon to the left of Competitions in the top tab bar.

kag3.PNG

Enter titanic here and search. At the top you will find a competition called "Titanic: Machine Learning from Disaster", marked with a trophy icon. The entries below it marked "<>" are scripts written by other users.

Let's choose the titanic competition.

kag4.PNG

Click the blue Join Competition button to join.

kag5.PNG

Read and agree to the terms.

kag6.PNG

The Join button has changed to a Submit button. You can now write a script using the data and submit it. Press the button.

kag7.PNG

You will see scripts (notebooks) written by other users. Create a new one with the New Notebook button.

kag8.PNG

Select the language for your analysis, and select Notebook or Script as the format. If you want to add explanations or run the code part by part, choose Notebook. Pick whichever language you like. This time, let's use an R notebook instead of Python.

kag9.PNG

The new notebook comes with demo code that explains things such as loading a library and reading files.

For now, delete all of it and type in the following. As in Jupyter, you run a notebook cell with Shift + Enter.

kag11.PNG

The notebook's working directory is on Kaggle's side. The data files are under "../input", although the layout differs somewhat between competitions. Listing the contents of input shows a folder called "titanic", and inside it there are three files.
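
The article's own code is written in R and survives only in the screenshots, so the following is a rough Python sketch of the same idea rather than the original cell. It walks a directory tree and lists every file under it. The function name `list_input_files` is made up for illustration, and the default path "../input" is an assumption based on the description above.

```python
import os

def list_input_files(root="../input"):
    """Return the sorted paths of all files found under root."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            found.append(os.path.join(dirpath, name))
    return sorted(found)

# In a Kaggle notebook this should print the competition's data files.
# If the directory does not exist, os.walk yields nothing and nothing is printed.
for path in list_input_files():
    print(path)
```

Looking at the file list first is a cheap way to confirm the folder name before reading any data, since it varies from competition to competition.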

You train a model on the train data and use it to predict the test data.

kag12.PNG

The test data lacks the column for the target variable, whether the passenger survived. Let's build a simple model to predict it.
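
The model in the screenshot is not reproduced here. As a stand-in, this is a minimal Python sketch of one very simple baseline: learn the survival rate for each value of one column from the training rows, then predict survival for test rows whose group rate is above 0.5. The toy rows are invented for illustration and are not real competition data. Using `Sex` as the feature is an assumption of this sketch, not necessarily what the article's R code does.

```python
from collections import defaultdict

# Hypothetical toy rows, made up for illustration; NOT real Titanic data.
# The column names PassengerId, Survived and Sex follow the competition's files.
train_rows = [
    {"PassengerId": "1", "Survived": "0", "Sex": "male"},
    {"PassengerId": "2", "Survived": "1", "Sex": "female"},
    {"PassengerId": "3", "Survived": "1", "Sex": "female"},
    {"PassengerId": "4", "Survived": "0", "Sex": "male"},
    {"PassengerId": "5", "Survived": "1", "Sex": "male"},
]
test_rows = [
    {"PassengerId": "6", "Sex": "female"},
    {"PassengerId": "7", "Sex": "male"},
]

def fit_survival_rates(rows, feature="Sex", target="Survived"):
    """Learn the mean of the target for each value of the feature."""
    survived = defaultdict(int)
    seen = defaultdict(int)
    for row in rows:
        survived[row[feature]] += int(row[target])
        seen[row[feature]] += 1
    return {value: survived[value] / seen[value] for value in seen}

def predict(rates, rows, feature="Sex"):
    """Predict 1 if the group's training survival rate exceeds 0.5, else 0."""
    return [int(rates.get(row[feature], 0.0) > 0.5) for row in rows]

rates = fit_survival_rates(train_rows)
predictions = predict(rates, test_rows)
print(predictions)
```

With the toy rows above, the female test passenger is predicted to survive and the male one is not. A group that never appears in the training rows falls back to a prediction of 0.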

kag13.PNG

Write out the predictions with the write.csv function. The submission file must contain certain required columns; in this competition that includes the passenger ID. Also, write.csv writes row names by default, which causes an error on submission, so turn that off (row.names = FALSE). When the notebook is finished, press the Commit button in the upper right.
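
For comparison, here is a hedged Python sketch of writing the same kind of file with the standard `csv` module. Unlike R's write.csv, `csv.writer` never adds a row-name column, so the file contains only the columns you write. The header `PassengerId,Survived` is assumed from the Titanic competition's submission format, and the function name `write_submission` and the sample IDs are hypothetical.

```python
import csv
import io

def write_submission(fileobj, passenger_ids, predictions):
    """Write a two-column submission CSV with a header and no row-name column."""
    writer = csv.writer(fileobj, lineterminator="\n")
    writer.writerow(["PassengerId", "Survived"])
    for passenger_id, survived in zip(passenger_ids, predictions):
        writer.writerow([passenger_id, survived])

# Hypothetical IDs and predictions, for illustration only.
buffer = io.StringIO()
write_submission(buffer, [892, 893], [0, 1])
print(buffer.getvalue())
```

To produce a real file, pass a file opened with `open("submission.csv", "w", newline="")` instead of the in-memory buffer; the file name here is only an example.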

999.png

If no errors appear, the script has been registered. Press the Overview button and move on to submitting.

Scroll down your script on the Overview page and you should find the submission CSV you created in the Output Files section.

kag17.PNG

Now, at last, submit it.

kag18.PNG

When the status reaches Complete, the submission is done. You can also check your score.

Then add a comment, such as an explanation, on your own script.

98.png

The comment item is now checked.

Finally, cast an upvote

For example, click Discussion in the top tab bar to find discussions about scripts and techniques that others have posted. Browse them and upvote something you found interesting.

kag30.PNG

In the upper right of the screenshot you can see the number 111. If you like the post, press the △ button to upvote it.

Congratulations

image.png

You have now ranked up. The rank goes up because you have gone through the basics of using Kaggle. It shows that you understand the whole process: not only registering and reading other people's scripts, but also joining a competition and submitting results yourself.

Good work. This is where it really begins. Enjoy Kaggle.
