Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- Date: Wed, 27 Jun 2018 14:46:52 +1000
- From: Jim Breen <jimbreen@example.com>
- Subject: Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- References: <23345.41167.951877.900876@turnbull.sk.tsukuba.ac.jp> <23345.44414.330392.350450@turnbull.sk.tsukuba.ac.jp> <CAKXLc7c-LzgY5AtE8XrZzKUrr206nXtmxdtKQC0q8PkcMjiF7A@mail.gmail.com> <CABHGxq6CkEeQVHy7rjjbTP72mOm_QQ5xtthPuPR-QASYQoS_ag@mail.gmail.com> <07A05935-BBD8-4C13-AEF6-667D653EBE45@brightblack.net> <23346.65438.401753.15741@turnbull.sk.tsukuba.ac.jp>
On 27 June 2018 at 13:08, Stephen J. Turnbull <turnbull.stephen.fw@example.com> wrote: > ....... and there are > just too many moving parts if you decide to change encodings. "Don't > fix it if the users have workarounds!" ¯\(ツ)/¯ Amen, brother, amen. I have this issue with the wwwjdic spaghetti code (mea culpa). Parts of it date from before Unicode and UTF-8 existed, and all the internal data structures are built around the assumption that kana and kanji take two bytes. I'd love to move it over to using UTF-8 internally, but I'm not inspired to do it because: (a) it'll be a hellava lot of work. One big issue is that in EUC the difference between equivalent katakana and hiragana characters is a single bit, so relaxed kana matching is trivial. In Unicode they are in the same plane with an irregular offset, so it all gets much harder. (b) very few users would know the difference. The interface defaults to UTF-8 these days and everything goes in and out via iconv(). (c) as I progress through my 8th decade I find I have more interesting (and urgent) things on my bucket list. Jim -- Jim Breen Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University http://www.jimbreen.org/ http://nihongo.monash.edu/
- Follow-Ups:
- Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- From: Stephen J. Turnbull
- References:
- [tlug] Kudos to Jim Breen
- From: Stephen J. Turnbull
- [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- From: Stephen J. Turnbull
- Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- From: Kalin KOZHUHAROV
- Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- From: Jim Breen
- Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- From: grb
- Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- From: Stephen J. Turnbull
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- Next by Date: Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- Previous by thread: Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- Next by thread: Re: [tlug] Bogus Japanese zipfiles [was: Kudos to Jim Breen]
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links