Mailing List Archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UTF-8: each character is one byte . . . . . . (was: Re: Learn a Variety of Languages) [tlug]





On 1/20/07, Ian MacLean <imaclean@example.com> wrote:


sure - in fact all bytes in a utf-8 sequence except those in the ascii range have their high bit set - this is what allows ascii only tools to work with utf-8 data.

And by work I mean "not break horribly" :)



Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links