Mailing List Archive
Re: [tlug] [tlug-digest] Regex Efficiency
- Date: Fri, 13 Jan 2006 21:36:43 -0500
- From: Jim <jep200404@example.com>
- Subject: Re: [tlug] [tlug-digest] Regex Efficiency
- References: <200601130511.k0D5BxWg015897@example.com><43C84B5A.7000703@example.com>
David Riggs <dariggs@example.com> wrote:

> I need to find short ... strings in a giant haystack of texts.
> Is there a way to do this, preferably a fast way to do this?
> My haystack is hundreds of megabytes and I have to do it a lot.

If this is something that you are going to do much,
then regex efficiency is a serious concern and you must read:

    Mastering Regular Expressions
    by Jeffrey E.F. Friedl
    http://www.oreilly.com/catalog/regex/

especially chapters 4 and 5. Chapter 4 is free:

    http://www.oreilly.com/catalog/regex/chapter/ch04.html

-------------------------------------------------------------------

newline spanning nastiness still to follow
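
For a concrete starting point, here is a minimal Python sketch of the kind of search being asked about. It is not taken from the book or from this thread. It assumes the needles are literal byte strings (for example, UTF-8 encoded kanji) and that the haystack is a single non-empty file. The function name count_matches is made up for illustration. The pattern is compiled once and run over a memory-mapped file, so the whole haystack does not have to be read into a Python string first. It does nothing about matches that span newlines.

```python
import mmap
import re


def count_matches(path, needles):
    """Count non-overlapping occurrences of each literal needle in a file.

    Hypothetical helper for illustration only. `needles` is a list of
    bytes objects; the file at `path` must not be empty, because mmap
    refuses to map a zero-length file.
    """
    # Escape every needle so it is matched literally, and put longer
    # needles first so a short needle cannot shadow a longer one that
    # starts with the same bytes.
    ordered = sorted(needles, key=len, reverse=True)
    pattern = re.compile(b"|".join(re.escape(n) for n in ordered))

    counts = {n: 0 for n in needles}
    with open(path, "rb") as f:
        # Map the file read-only; the OS pages it in as the regex scans.
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as haystack:
            for m in pattern.finditer(haystack):
                counts[m.group()] += 1
    return counts
```

Called as count_matches("haystack.txt", [b"foo", "漢字".encode("utf-8")]), it returns a dict mapping each needle to its count. Whether a single alternation like this is fast enough for hundreds of megabytes depends on the regex engine and the needles, which is exactly the sort of question chapters 4 and 5 deal with.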