[PYTHON] R: Use Japanese in scripts without writing Japanese characters

In R, having Japanese characters in a script can sometimes cause problems, so we work around it as follows: build the string from its Unicode code points.

# R
intToUtf8(c(12371, 12435, 12395, 12385, 12399))
## [1] "こんにちは"

To find out which number each character corresponds to, use utf8ToInt.

# R
utf8ToInt("こんにちは")
## [1] 12371 12435 12395 12385 12399

Look the numbers up once, then write them in the script in place of the Japanese text.

You can also look it up in Python.

# python3
[ord(s) for s in "こんにちは"]
## [12371, 12435, 12395, 12385, 12399]

In Python 2, the u"" prefix is required.

# python2
[ord(s) for s in u"こんにちは"]
## [12371, 12435, 12395, 12385, 12399]
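The lookup above can be wrapped in a small helper that prints the R expression to paste into a script. This is only a minimal sketch for Python 3; the function name to_r_int_to_utf8 is hypothetical and not part of any library.

```python
def to_r_int_to_utf8(text):
    """Build an R intToUtf8() call that reproduces `text` (hypothetical helper)."""
    # ord() gives the Unicode code point of each character as a decimal integer.
    code_points = ", ".join(str(ord(ch)) for ch in text)
    return "intToUtf8(c({}))".format(code_points)


# The greeting is written with escapes so this file itself stays ASCII-only.
greeting = "\u3053\u3093\u306b\u3061\u306f"
r_expression = to_r_int_to_utf8(greeting)
print(r_expression)
# intToUtf8(c(12371, 12435, 12395, 12385, 12399))
```

The printed line can be pasted into the R script as is.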

Postscript (thanks: @shiracamus)

It seems that R also accepts Unicode escapes directly in string literals.

"\u3053\u3093\u306b\u3061\u306f"
## [1] "こんにちは"

Here the code points are given in hexadecimal. There are several ways to get the hexadecimal values.

In R, it looks like this.

# R
sprintf("%x", utf8ToInt("こんにちは"))
## [1] "3053" "3093" "306b" "3061" "306f"

In Python, you can use hex().

# python3
[hex(ord(s)) for s in "こんにちは"]
## ['0x3053', '0x3093', '0x306b', '0x3061', '0x306f']
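Going one step further, the hexadecimal values can be assembled into the escaped form directly. The following is a rough sketch, assuming Python 3; to_unicode_escapes is a hypothetical name. It leaves ASCII characters untouched, so quotes and backslashes in the input would still need escaping by hand before pasting into an R string.

```python
def to_unicode_escapes(text):
    """Replace each non-ASCII character with a \\u or \\U escape (hypothetical helper)."""
    parts = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            parts.append(ch)  # ASCII is kept as is (no quote/backslash escaping)
        elif cp <= 0xFFFF:
            parts.append("\\u{:04x}".format(cp))  # four hex digits
        else:
            parts.append("\\U{:08x}".format(cp))  # eight hex digits beyond U+FFFF
    return "".join(parts)


escaped = to_unicode_escapes("\u3053\u3093\u306b\u3061\u306f")
print('"' + escaped + '"')
# "\u3053\u3093\u306b\u3061\u306f"
```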

Postscript

By the way, when building this into an R package, using a string in the "\u..." format inside a function definition seems to produce the following warning.

plotat.Rd: non-ASCII input and no declared encoding

It seems that using non-ASCII (multibyte) characters in R help files is not recommended.
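If the warning comes from literal non-ASCII characters ending up in a file (an assumption; the cause was not confirmed here), one way to track them down is to scan the text and report their positions. This is a minimal Python 3 sketch; find_non_ascii and the sample text are hypothetical.

```python
def find_non_ascii(text):
    """Return (line, column, code point) for every non-ASCII character (hypothetical helper)."""
    hits = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for col_no, ch in enumerate(line, start=1):
            if ord(ch) > 127:  # anything outside 7-bit ASCII
                hits.append((line_no, col_no, "U+{:04X}".format(ord(ch))))
    return hits


# Hypothetical sample: the second line contains one Japanese character.
sample = 'x <- 1\nlabel <- "\u3053"\n'
hits = find_non_ascii(sample)
print(hits)
# [(2, 11, 'U+3053')]
```

To check a real file, read it with open(path, encoding="utf-8").read() and pass the result to the function.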
