Python Resources for Adding Furigana above Japanese Sentence

六月 08, 2017

Text Segmentation
------
[natto-py](https://github.com/buruzaemon/natto-py): MeCab
[tinysegmenter3](https://github.com/SamuraiT/tinysegmenter): extremely compact Japanese tokenizer. Or [tinysegmenter](https://pypi.python.org/pypi/tinysegmenter)
```
# update: not perfect.
全部食べなさい。 >>> 全/部食/べ/なさい。
```
[JapaneseTokenizers](https://github.com/Kensuke-Mitsuzawa/JapaneseTokenizers): Supports `Mecab` `Juman++` `Kytea`
[Comparison](http://taku910.github.io/mecab/#diff)
Romanize
------
[pykakasi](https://github.com/miurahr/pykakasi): For Kanji >>> romaji conversion.
Method
------
[AutoFurigana](https://github.com/prometheus42/AutoFurigana): Way to combine `tinysegmenter` and `pykakasi` in python.
Other
------
[nihongoparserd](https://github.com/Tatoeba/nihongoparserd) C++ hosted on local port. [Web version](https://tatoeba.org/eng/tools/furigana).

[Furigana](https://github.com/michaelvobrien/furigana): Ruby command line tool:
```bash
$ echo "私は日本語を勉強している。" | furigana --html

<ruby><rb>私</rb><rp>【</rp><rt>わたし</rt><rp>】</rp></ruby>は<ruby><rb>日本語</rb><rp>【</rp><rt>にほんご</rt><rp>】</rp></ruby>を<ruby><rb>勉強</rb><rp>【</rp><rt>べんきょう</rt><rp>】</rp></ruby>している。
```

搜索此博客

Cryptogun Wikipedia

Python Resources for Adding Furigana above Japanese Sentence

评论

发表评论

此博客中的热门博文

Windows下ShadowSocks客户端安装和配置顺带KCP

How User Friendly is a MacOS

Python Resources for Adding Furigana above Japanese Sentence

评论

发表评论

此博客中的热门博文

Windows下ShadowSocks客户端安装和配置 顺带KCP

How User Friendly is a MacOS

Windows下ShadowSocks客户端安装和配置顺带KCP