NLP4J is a Java library that provides the following functions required for text mining and natural language processing.
--Data collection function (crawler) --Natural language processing call function (Annotator)
The basic concept and structure of the three-pronged crawler, Annotator, and Indexer is with IBM Watson Explorer (formerly TAKMI, IBM Content Analytics), the most used enterprise (= business) text mining software in Japan. It is similar.
NLP4J will start providing basic functions and will take time to provide them as a do-it-yourselfer.
NLP4J [001a] Morphological analysis in Java (using Yahoo! Developer Network Japanese morphological analysis) NLP4J [001b] Morphological analysis in Java (using kuromoji) NLP4J [002] Try parsing Japanese using Yahoo! Developer Network Japanese Parsing (V1) in Java NLP4J [003] Try text analysis using natural language processing and part-speech statistical processing in Java NLP4J [004] Try text analysis using natural language processing and parsing statistical processing in Java NLP4J [005-1] Try Twitter analysis with Twitter4J and NLP4J (data collection) NLP4J [005-2] NLP4J + Twitter4J (Analysis 1)
NLP4J [006-030] 100 language processing knocks with NLP4J # 30 Reading morphological analysis results NLP4J [006-031] 100 language processing with NLP4J knock # 31 verb NLP4J [006-032] 100 language processing with NLP4J Knock # 32 Prototype of verb NLP4J [006-033] 100 language processing with NLP4J knock # 33 sa-hen noun NLP4J [006-034] 100 language processing knocks with NLP4J # 34 "A B" NLP4J [006-034b] Try to make an Annotator of 100 language processing knock # 34 "A's B" with NLP4J
Create an Annotator that uses Kuromoji with NLP4J [007]
Save the morphologically parsed document Try using Word2Vec Download "Recall / Bug Information" from the Ministry of Land, Infrastructure, Transport and Tourism in a clean format http://carinf.mlit.go.jp/jidosha/carinf/opn/index.html
Try to associate with emoji
Use COTOHA API with NLP4J https://api.ce-cotoha.com/home https://api.ce-cotoha.com/contents/reference.html
NLP4J [005-3] NLP4J + Twitter4J (data conversion)
NLP4J [00N] What to do with morphological and parsing in Java
NLP4J [00N] Describes the Keyword class
NLP4J [00N] Javadoc
NLP4J [00N] Try Twitter analysis with Twitter4J and NLP4J
NLP4J [00N] Let's make a text analysis system with Azure and NLP4J NLP4J [00N] Let's make a text analysis system with Solr and NLP4J
Get NLP4J [-001] domain name nlp4j.org
NLP4J [-002] Register with Maven Central Repository
Negative story
Natural language solution case study
There is no perfect dictionary
What is the most used Japanese morphological analysis engine in Japan?
Yahoo! Developer Network Text Analysis: About Japanese Morphological Analysis
How to create a hierarchy of corporate dictionaries
https://www.nlp4j.org/
Work has progressed during Corona Fuss & Golden Week, so this post will be updated soon. (5/11)
Recommended Posts