Mailing List Archive
Re: font/char set question: Chinese Amazon charset . . . . . . . [tlug]
- Date: Mon, 30 Jul 2007 11:38:01 +0900
- From: "Stephen J. Turnbull" <stephen@example.com>
- Subject: Re: font/char set question: Chinese Amazon charset . . . . . . . [tlug]
- References: <46AC42CB.1050202@sonic.net> <20070729101324.11e6f1ad.jep200404@columbus.rr.com> <d8fcc0800707291712w1f7fd51dnfed9b26bd49b9ce4@mail.gmail.com>
Josh Glover writes:

 > That does strike me as odd. Could it be that there is one Chinese
 > encoding that is assumed to be used?

Not by standard. From RFC 2616, Section 3.7.1:

    The "charset" parameter is used with some media types to define
    the character set (section 3.4) of the data. When no explicit
    charset parameter is provided by the sender, media subtypes of the
    "text" type are defined to have a default charset value of
    "ISO-8859-1" when received via HTTP. Data in character sets other
    than "ISO-8859-1" or its subsets MUST be labeled with an
    appropriate charset value. See section 3.4.1 for compatibility
    problems.

In China, you can in practice assume GB 2312, which the Chinese
government is somewhat fanatical about. They even have their own
16-bit version of "Unicode" which grandfathers GB 2312 in a similar
way to ASCII in UTF-8. (Unicode in quotes because the mapping
obviously can't be fully preserved.)

 > Seems like it attempts to auto-detect charset on a per-UserAgent
 > basis, would you concur? Is this in an RFC somewhere?

The Vary header is an instruction to caching proxies, and not relevant
here. Op cit, section 14.44.
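
The charset rule quoted from RFC 2616 above can be sketched in a few lines of Python. This is only an illustration, not anything from the original thread: the function name effective_charset and the regional_guess parameter are made up for the example, and the "assume GB 2312" fallback is a local heuristic that the standard does not sanction.

```python
from email.message import Message

# Default for unlabeled "text/*" bodies received via HTTP (RFC 2616, 3.7.1).
HTTP_TEXT_DEFAULT = "ISO-8859-1"


def effective_charset(content_type, regional_guess=None):
    """Pick a charset for an HTTP body from its Content-Type value.

    regional_guess is a hypothetical, non-standard override for unlabeled
    text (for example "gb2312" for pages served from China).
    """
    msg = Message()
    msg["Content-Type"] = content_type

    # An explicit charset parameter always wins; the stdlib returns it
    # lowercased, or None when the parameter is absent.
    declared = msg.get_content_charset()
    if declared:
        return declared

    # The RFC default applies only to subtypes of the "text" type.
    if msg.get_content_maintype() != "text":
        return None

    # Unlabeled text: use the caller's heuristic if given, else the default.
    return regional_guess or HTTP_TEXT_DEFAULT
```

For instance, effective_charset("text/html") falls back to the ISO-8859-1 default, while effective_charset("text/html", regional_guess="gb2312") applies the in-practice assumption for Chinese sites. The Vary header plays no part in either decision.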