Mailing List Archive


Re: [tlug] Generating Furigana in documents


On Sat, Mar 30, 2013 at 06:35:43PM +0900, Curt Sampson wrote:
> On 2013-03-30 10:01 +0100 (Sat), Christian Horn wrote:
> Actually, JIS x0208 is a character set; you also need to worry about
> the encoding of that character set. (A character set is merely a list
> of characters, so 僕 might for example be number 1728. How that number
> 1728 is encoded in a stream of bytes can vary. Note that Unicode is a
> character set and UTF-8, UTF-16, et al. are encodings of Unicode.)
> [...]

Thanks a lot for the explanation.
As with looking up information on switching input methods,
I have not seen this nicely documented so far.
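
To make the character set versus encoding distinction concrete, here is a
minimal sketch that prints the bytes a single character occupies in several
encodings. The helper name hexbytes is made up for illustration; the sketch
assumes the text is stored as UTF-8 and that the local iconv recognizes the
encoding names used (glibc iconv does).

```shell
# Print the bytes that one character occupies in a given encoding.
# hexbytes is a hypothetical helper; input is assumed to be UTF-8.
hexbytes() {
    printf '%s' "$1" | iconv -f UTF-8 -t "$2" | od -An -tx1 | tr -d ' \n'
    echo
}

hexbytes 僕 UTF-8       # e58395
hexbytes 僕 UTF-16BE    # 50d5, the Unicode code point U+50D5 written as two bytes
hexbytes 僕 SHIFT_JIS   # one encoding of the JIS X 0208 character
hexbytes 僕 EUC-JP      # another encoding of the same JIS X 0208 character
```

The first two lines show one Unicode character as two different byte
sequences; the last two show the same for JIS X 0208, which is why a tool
needs to be told the encoding and not just the character set.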

>     $ echo '私は馬鹿です' \
>         | iconv -f utf8 -t sjis | kakasi -JK | iconv -f sjis -t utf8
>     ワタシはバカです

This works; for utf8 texts I also directly get the file in

