[PYTHON] [High School Information Department] Information I / Information II: Supplementary teaching materials for Reiwa 3 (2021), exercise examples

Class outline

Applicable chapter: Chapter 3, Society Supported by Information Systems
Applicable sections: Network crimes; Information gathering and research
Aim: To think about the advantages and disadvantages of the technology called "web scraping" from the user's point of view

Grade evaluation criteria

In this class, each lesson is graded on a scale from A to C.

Requirements for an A and support for C students
Requirements for an A: You can explain what each line of code used in this class does, and you can apply it to scrape other sites. You can also use web scraping in accordance with each site's rules and the correct usage of its API.
Support for C students: The teacher helps you understand the code used in this class line by line, and supports you until you understand the dangers of web scraping and can explain how to use it properly according to each website's rules.

What is web scraping?

[Practice] (Investigate and summarize in 5 minutes)

What kind of technology is web scraping? Let's also list the advantages and disadvantages.

======= (shared in class) ========

Let's try web scraping

Let's run web scraping code written in Python and see what values it returns.
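The code used in the lesson is not reproduced in this article. As a stand-in, here is a minimal sketch that uses only Python's standard-library `html.parser`. The sample HTML, the `TagCollector` class, and the `extract_tags` function are hypothetical names made up for illustration, and the sketch parses a fixed string instead of fetching a live site.

```python
from html.parser import HTMLParser

# Hypothetical sample page (an assumption for illustration, not a real site).
SAMPLE_HTML = """
<html><body>
  <ul>
    <li>Apple</li>
    <li>Banana</li>
  </ul>
  <img src="/images/fruit.png" alt="fruit">
</body></html>
"""


class TagCollector(HTMLParser):
    """Collects the text of every <li> tag and the src of every <img> tag."""

    def __init__(self):
        super().__init__()
        self.li_texts = []   # text found inside <li> ... </li>
        self.img_srcs = []   # values of the src attribute on <img>
        self._in_li = False  # True while the parser is inside an <li>

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self._in_li = True
            self.li_texts.append("")
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.img_srcs.append(src)

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_li = False

    def handle_data(self, data):
        # Only keep text that appears inside an <li> tag.
        if self._in_li:
            self.li_texts[-1] += data.strip()


def extract_tags(html):
    """Return (list of li texts, list of img src values) for the given HTML."""
    collector = TagCollector()
    collector.feed(html)
    return collector.li_texts, collector.img_srcs


li_texts, img_srcs = extract_tags(SAMPLE_HTML)
print(li_texts)   # ['Apple', 'Banana']
print(img_srcs)   # ['/images/fruit.png']
```

To try this on a real page, the HTML string would first have to be downloaded, and only from a site whose rules permit automated access.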

With this web scraping, **from what site** would you like to extract **what kind of tag information** (li tags, img tags, etc.)?

Scraping according to the terms of the website.

- I wish I could scrape Instagram and extract photos of individual users ...
- I wish I could scrape Twitter and Facebook to extract individual users' posts ...
- I want to scrape a company's tech blog, extract the titles, and deepen my knowledge from the wording ...

There are more and more things people want to do with web scraping.

However, because scraping makes it easy to access personal information, social networking services (SNS) such as Instagram have established rules on scraping and API usage.
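One machine-readable place where many sites publish part of their crawling rules is a `robots.txt` file. The sketch below shows how such a file could be checked with the standard-library `urllib.robotparser` before scraping. The robots.txt content, the `classroom-bot` user agent, and the `example.com` URLs are made-up assumptions, and `is_allowed` is a hypothetical helper. Passing this check does not replace reading a site's terms of service or API rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content (an assumption, not taken from a real site).
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() accepts the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


print(is_allowed(SAMPLE_ROBOTS_TXT, "classroom-bot", "https://example.com/blog/post1"))    # True
print(is_allowed(SAMPLE_ROBOTS_TXT, "classroom-bot", "https://example.com/private/data"))  # False
```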

[Practice] (Investigate and summarize in 15 minutes)

Choose three large social networking sites, such as Instagram and Twitter, and look up the wording of their rules on scraping. Based on what you find, first discuss with your neighbor how to use scraping responsibly. Once you have shared with each other, exchange ideas with the other members at your table. Also, as confirmation of attendance, **please leave a comment on this article with what you learned today and your impressions of the class**.
