Mailing List Archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [tlug] Re: font/char set question



On 29/07/07, steven smith <sjs@example.com> wrote:
> Jim Breen wrote:
> > (BTW, font is not the issue here. It's all to dowith character
> > sets.)

> Not all fonts implement all the gliphs for all the
> characters in a character set, right?

Partly. Font files are invariably associated with a character set.
The ones for Japanese are associated with either JIS X 0208 (with or
without the JIS X 213 extensions) or Unicode/ISO 10646. Most such font
files implement all (in the case of JIS X 0208) or most (in the case of
Unicode) codepoints. (A glyph is associated with the shape of a particular
characater, e.g. whether the kusakanmuri radical in a kanji has 3 strokes
or four.)

> or am I just
> confusing myself here.

I think you are.

>   The dot or in this case the small
> box -- where does that come from?  Is it output from the OS
> or the application or just the character representation for
> an unimplemented character.  This is the character
> substitution you are talking about, right?

Yes. If you see a dot/lozenge/?/etc. in a field when you paste
some text, the substitution will have been done locally, e.g. by
the browser. If the dot/lozenge/?/etc. is in something sent out
by the server, it could be a result of either browser substitution
before the form contents were sent in by HTTP, or by the server.

I know this smacks of hypocracy, as my servers use EUC-JP (albeit
with UTF-8 available), but things would be much better if everything
changed over to Unicode/UTF-8 ASAP.

Jim

-- 
Jim Breen
Honorary Senior Research Fellow
Clayton School of Information Technology,
Monash University, VIC 3800, Australia
http://www.csse.monash.edu.au/~jwb/


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links