Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] UTF-8 makes multi-byte ignorant UNIX tools play nicemulti-byte characters
- Date: Fri, 13 Jan 2006 21:30:26 -0500
- From: Jim <jep200404@example.com>
- Subject: Re: [tlug] UTF-8 makes multi-byte ignorant UNIX tools play nicemulti-byte characters
- References: <200601130511.k0D5BxWg015897@example.com><43C84B5A.7000703@example.com><d8fcc0800601131819p4c76af1ek@example.com>
Josh wrote: > I don't know about this, but Perl's regexp engine handles Unicode and > multi-line strings. Give Perl a whirl. (Sorry.) Use UTF-8, so that the classic byte-at-a-time UNIX tools that are ignorant of multi-byte character handling, can handle multi-byte characters without knowing or caring how to handle multi-byte characters.
- Follow-Ups:
- References:
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] [tlug-digest] searching for kanji strings, ignore punctuation and end of lines. Text indexing and retrival in unicode.
- Next by Date: Re: [tlug] [tlug-digest] Regex Efficiency
- Previous by thread: Re: [tlug] [tlug-digest] searching for kanji strings, ignore punctuation and end of lines. Text indexing and retrival in unicode.
- Next by thread: Re: [tlug] UTF-8 makes multi-byte ignorant UNIX tools play nice multi-byte characters
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links