Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] Is removal of whitespace appropriate for searching for kanji strings?
- Date: Sun, 15 Jan 2006 21:36:37 +0900
- From: Ian Wells <ijw@example.com>
- Subject: Re: [tlug] Is removal of whitespace appropriate for searching for kanji strings?
- References: <200601130511.k0D5BxWg015897@example.com> <43C84B5A.7000703@example.com> <d8fcc0800601131819p4c76af1ek@example.com> <30ce84360601141544y40a0ce05g28ba58c29be99e64@example.com> <20060114194030.6fc64172.jep200404@example.com>
On 1/15/06, Jim <jep200404@example.com> wrote:Ian Wells <ijw@example.com> wrote:
> $line=~s/\b//g; # Remove whitespace; add to this to remove punctuation;
It would not surprise me if it is correct to remove whitespace,
but Riggs did not mention doing so. Riggs only mentioned
punctuation, newlines and line numbers. The line numbers would
likely be trailed by whitespace, but we should hear from Riggs
about the appropriateness of removing whitespace.
Some sample short strings and some sample text to search through
would help show the problem that has not been completely been
defined.
True 'nuff, and maybe I should have read the question better, but it was the place for a regexp to do the requested dirty work and a regexp was present. I wasn't expecting the program to be perfect, only that someone might pick it up and run with it...
Alternatively, you get what you pay for ;-)
--
Ian.
- References:
- [tlug] [tlug-digest] searching for kanji strings, ignore punctuation and end of lines. Text indexing and retrival in unicode.
- From: David Riggs
- Re: [tlug] [tlug-digest] searching for kanji strings, ignore punctuation and end of lines. Text indexing and retrival in unicode.
- From: Josh Glover
- Re: [tlug] [tlug-digest] searching for kanji strings, ignore punctuation and end of lines. Text indexing and retrival in unicode.
- From: Ian Wells
- Re: [tlug] Is removal of whitespace appropriate for searching forkanji strings?
- From: Jim
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] Nasty Problem: searching for strings that span newlines
- Next by Date: Re: [tlug] Mic Problems
- Previous by thread: Re: [tlug] Is removal of whitespace appropriate for searching forkanji strings?
- Next by thread: Re: [tlug] [tlug-digest] Regex Efficiency
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links