
Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [tlug] oneliners, Was: Moving on from xterm
Thank you for introdution of kakasi, I am a kakasi maintainer.
At Mon, 22 Aug 2016 13:41:01 +0200,
Christian Horn wrote:
> I use kakasi for a quick attempt to convert Kanji into Hiragana:
>
> [chris@спутник ~]$ echo 例文文章です。 | kakasi -JH -i utf8 -f -w
> 例文[れいぶん] 文章[ぶんしょう] です 。
> [chris@спутник ~]$
> [chris@спутник ~]$ qq 例文文章です。
> 例文文章です。
> 例文[れいぶん] 文章[ぶんしょう] です 。
> [chris@спутник ~]$
kakasi has some problem.
* the dictionary is too old
* not good for complex sentence
MeCab is also useful for such situation.
% echo 例文文章です。|mecab -Oyomi
レイブンブンショウデス。
% echo 例文文章です。|mecab --node-format='%pS%m[%f[7]]' --eos-format='\n'
例文[レイブン]文章[ブンショウ]です[デス]。[。]
mecab-ipadic-neologd is a good dictionary for MeCab.
https://github.com/neologd/mecab-ipadic-neologd
To append the following to dicrc, you can use -Oyomi2 option with MeCab:
; yomi2
node-format-yomi2 = %pS%m[%f[7]]
unk-format-yomi2 = %M
eos-format-yomi2 = \n
Home |
Main Index |
Thread Index