Mailing List Archive

Support open source code!


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Command to tell what character encoding a file has



>> A while back I used a command which would tell me the character encoding
>> of a file, ie in shift_jis or euc.  Now I cant remember the name of the
>> command.  Can someone please enlighten me on this commands name and also
>> where I can find c source for it.

I have always used Ken Lunde's "jconv" for both checking & converting.
The source is on ftp://ftp.cc.monash.edu.au/pub/nihongo/00INDEX.html
It does useful things like converting hanakaku kana to zenkaku.

Of course it fails on files with utf8 in them - most simple
code-detectors think utf8 is Shit-JIS. I have a better one somewhere
around which tells them apart.

Jim
-- 
Jim Breen  [jwb@example.com  http://www.csse.monash.edu.au/~jwb/]
Visiting Professor, Institute for the Study of Languages and Cultures of 
Asia and Africa, Tokyo University of Foreign Studies, Japan
+81 3 5974 3880         [$B%8%`!&%V%j!<%s(B@$BEl5~30Bg(B]


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links