Twi2Wiki is a web application developed by Kosuke Fujita, Toshio Tanaka, and Kojiro Yamamoto in 2020. It has a function to generate article page-like sentences of Wikipedia from the profile information of Twitter . It is compatible with Smartphone and PC. Support for Dial Pulse Black Phone has ended in 2020. ** Click the image to go to the app **
Twi2Wiki is a web application developed for the purpose of allowing individuals to easily create Wikipedia-like profiles. Text information is required to generate the article page (see the system for details), but with an emphasis on ease of use, the method using Twitter's profile text, which was already popular at the time, was adopted. When Kana Nishino's "Torisetsu" became popular in 2015, there was an active movement to convey her detailed profile when building relationships with the other party (Note 1). On the other hand, detailed profiles have resulted in excessive creation costs, and there has been a growing demand for simplification (Torisetsu's digital transformation). Against this background of the times, it was developed in 2020 by a volunteer development team (Fujita, Tanaka, Yamamoto) as an app that anyone can easily create a Wikipedia-style profile. The back end is roughly divided into three functions: Twitter cooperation, occupation judgment, and biography generation, and the generated sentences are displayed as a Wikipedia -like Web page. The application part uses Flask, which is a web application framework of Python, and is operated on a free server of Heroku.
I noticed that it was mid-December, and I couldn't secure enough development resources, so I don't have the function to share on Twitter. As an alternative, screenshot sharing is recommended. The occupation is judged based on the account name, profile text, and the latest tweets of Twitter, and the occupation with the highest possibility is selected from the top 100 occupations that elementary school students want to become. As a judgment method, prepare 1,000 words with similar Distributed expression for one type of occupation, and select the occupation in which the most target words appear in the Twitter sentence. Assigned. Biography is generated using seq2seq (English: sequence to sequence, abbreviation: seq2seq), which is a type of neural network. seq2seq is one of the natural language processing technologies consisting of two models, Encoder and Decoder, based on LSTM (English: Long Short-Term Memory, Japanese: Long Short-Term Memory).
For learning, 130,000 Wikipedia person pages are used. The introductory part of the article page is used as input data, and the bio is used as output data. The conversion from Twitter profile to Wikipedia-style biography uses the same algorithm as the translation from Japanese to English (Note 2).
If you describe your Twitter profile according to Wikipedia's Style Manual, it has the characteristic that a biography is easily generated (Note 3). The training data also includes Donald Trump, Crystal Noda, Minami Hamabe and others.
In creating it, the developer Kojiro Yamamoto wrote two articles, "Sentence generation using Seq2Seq" and "I implemented Attention Seq2Seq with PyTorch". I was referring to it. ** World share ** In December 2020, the number of users exceeded 0 in about one month from the start of Service, and exceeded 3 in January of the following year. If the number of users increases at this rate, it will reach the world population (Note 4) in about 200 million years. This is because of the simplicity of linking with Twitter at the start of use and the user's lead that the result can be posted on Twitter, it is said that new things and things that are likely to be news items will be tweeted Twitter It is said that this is because it matches the user's habits.
** Delayed spread to young people ** There is a view that the spread to young people is delayed. According to the Internet survey conducted independently by the development team, the usage rate of 0 to 3 years old is 0%, which is far below the number of users of 20 to 120 years old, which is the main target group. It was. This is attributed to the delay in responding to Babbling such as "da" and "au" used by young people. Some users have pointed out that the background is significantly different from the actual background. The development team acknowledged this and attributed it to the user's previous life (Note 5) background. The above phenomenon is "Your name. ”(Director: Makoto Shinkai) has been reported to be seen by many users.
Recommended Posts