[PYTHON] About LangID

LangID is a library that performs Language Identification. If you give a string as input, it will indicate which language the string belongs to.

The simple usage is as follows.

# -*- coding: utf-8 -*-
import langid

result = langid.classify('This is Japanese')
print(result)  #=> ('ja', -197.7628321647644)

The algorithms in this library are made from publicly known research, and references can be found at here.

The point to be worried about is the difficulty in speed. Since the above simple test takes nearly 3 seconds, it seems that it can not be used very much in the world of the Web where real-time performance is important.

Recommended Posts

About LangID
About CAGR
About virtiofs
About python-apt
About Permission
About sklearn.preprocessing.Imputer
About gunicorn
About requirements.txt
About locale
About permissions
About Opencv ②
About axis = 0, axis = 1
About Opencv ③
About import
About numpy
About pip
About Linux
About numpy.newaxis
About endian
About Linux
About import
About Opencv ①
About Linux
About Linux
About Linux ①
About cv2.imread
About _ and __
About wxPython
Notepad about TecoGAN
Briefly about __name__
About python comprehension
About Docker Volume
[Linux] About export
About reference type
About Twitter scraping
About the test
Learn about programming
About Flask customization
About variable scope. .. ..
About Python tqdm.
About python yield
Notes about with
About python, class
About Linear Models
About Go functions
About pandas describe
About Kivy root
About Firestore timeout
About python inheritance
About python, range ()
About Confusion Matrix
[Linux] About PATH
About python decorators
Linux (about groups)
Note about awk
About python reference
About Bitnami Autostart
About Python decorators
About Milkcocoa SDK
Notes about pytorch
[Python] About multi-process