Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]UTF-8: each character is one byte . . . . . . (was: Re: Learn a Variety of Languages) [tlug]
- Date: Fri, 19 Jan 2007 23:03:46 -0500
- From: Jim <jep200404@example.com>
- Subject: UTF-8: each character is one byte . . . . . . (was: Re: Learn a Variety of Languages) [tlug]
- References: <45AAFDA9.90504@example.com> <157AA731-9AC1-4FEF-ABAD-23A5BE8F0C05@example.com> <Pine.NEB.4.64.0701161628390.12325@example.com> <78d7dd350701160014h40155a75n345183640cbccfc5@example.com> <19dd68ba0701160122i1b813c10jf34c0210d53fbbdd@example.com> <op.tl8roo02rtshzt@example.com> <19dd68ba0701160412y2eb95062r6235fed92b752784@example.com> <Pine.NEB.4.64.0701162139360.10912@example.com> <3156339d0701161820lb684aeubcd51914b19a87bf@example.com> <Pine.NEB.4.64.0701171657080.1515@example.com> <3156339d0701180035k2a4f2b70o3bbf00612501470@example.com> <Pine.NEB.4.64.0701201123230.1314@example.com>
Curt wrote: > In UTF-8, all characters contain exactly one byte without the high bit set. Oh dear. When Ian started questioning Curt's code, it would have been a good time to check one's assumptions. > You can easily look up on the web how the encoding works. Indeed. Especially since on Tue, 16 Jan 2007 09:05:44 -0500 I wrote about which web page to read: > http://en.wikipedia.org/wiki/UTF-8 almost a day before Wed, 17 Jan 2007 17:08:03 +0900 (JST) when Curt wrote: > class String > def utf8_char_count > split('').select { |c| c[0] < 128 }.length > end > end
- Follow-Ups:
- References:
- [tlug] What is the most appropriate scripting language
- From: Dave M G
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Jean-Christophe Helary
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Curt Sampson
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Nguyen Vu Hung
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Guillaume Proux
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Zev Blut
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Guillaume Proux
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Curt Sampson
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Ian MacLean
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Curt Sampson
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Ian MacLean
- Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- From: Curt Sampson
Home | Main Index | Thread Index
- Prev by Date: [tlug] Host Blocking and Logfile Parsing
- Next by Date: Advantage of Having or Not Having Header Files . . . . . . . (was: Re: To package or not to package) [tlug]
- Previous by thread: Re: Learn a Variety of Languages . . . . . . . (was: Re: [tlug] Re: Bourne Shell is the most appropriate scripting language)
- Next by thread: Re: UTF-8: each character is one byte . . . . . . (was: Re: Learn a Variety of Languages) [tlug]
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links