Mailing List Archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [tlug] unable to create local copy utf8 encoded Japanese MySQLdata



Josh Glover wrote:
> On 14/02/06, Dave Gutteridge <dave@example.com> wrote:
>
>   
>> Most of the Japanese text shows up readable. Some of it, however, shows up as coded
>> numbers (I'm not sure what the term is when utf displays this way):
>> &#12513;&#12540;&#12531;&#12539; I think this might be "legacy" data,
>> held over from the days when MySQL did not have utf8 support.
>>     
>
> I believe these are known as Unicode entities.
>   
In xml/html they are referred to as Character entities.  They are a
method of encoding Unicode characters when using encodings that don't
support the full Unicode character range.  Most common browsers support
them.  This might give you a better idea.

http://www.w3.org/TR/html401/charset.html#h-5.3

    * The syntax "&#/D/;", where /D/ is a decimal number, refers to the
      ISO 10646 decimal character number /D/.
    * The syntax "&#x/H/;" or "&#X/H/;", where /H/ is a hexadecimal
      number, refers to the ISO 10646 hexadecimal character number /H/.
      Hexadecimal numbers in numeric character references are
      case-insensitive.


Edward


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links