
Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [tlug] Re: tlug-digest Digest V2006 #28
- Date: Thu, 19 Jan 2006 22:02:54 +0900
- From: "Stephen J. Turnbull" <stephen@example.com>
- Subject: Re: [tlug] Re: tlug-digest Digest V2006 #28
- References: <200601181740.k0IHegS1024608@example.com><43CEE210.5040006@example.com>
- Organization: The XEmacs Project
- User-agent: Gnus/5.1007 (Gnus v5.10.7) XEmacs/21.5-b24 (dandelion, linux)
>>>>> "David" == David Riggs <dariggs@example.com> writes:
David> It seems likely that there is something to do this with,
David> somewhere? Of course I expect the index to be bigger than
David> the data, but whats a gigabyte or two when you are
David> searching a canon?
Not everybody realizes that, though.
FreeWAIS sounds like the thing. It understands proximity searches
(ie, two strings X words apart). If you tell it that every UTF-8
character is a word ... but you'll end up with indexes 10-20X as big
as your text, I'm afraid.
Frank Bennett <bennett@example.com> (IIRC) is the guy to ask.
--
School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
Ask not how you can "do" free software business;
ask what your business can "do for" free software.
Home |
Main Index |
Thread Index