Mailing List Archive



Re: [tlug] Re: tlug-digest Digest V2006 #28



>>>>> "David" == David Riggs <dariggs@example.com> writes:

    David> It seems likely that there is something to do this with,
    David> somewhere? Of course I expect the index to be bigger than
    David> the data, but what's a gigabyte or two when you are
    David> searching a canon?

Not everybody realizes that, though.

FreeWAIS sounds like the thing.  It understands proximity searches
(i.e., two strings within X words of each other).  If you tell it that
every UTF-8 character is a word ... but you'll end up with indexes 10
to 20 times as big as your text, I'm afraid.
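
To make the "every character is a word" idea concrete, here is a toy
sketch in Python.  It is not freeWAIS's index format or API; the names
build_index and proximity_search are made up for this illustration, and
a real engine would merge sorted position lists rather than compare
every pair.

```python
from collections import defaultdict


def build_index(text):
    """Map each non-whitespace character to the positions where it occurs."""
    index = defaultdict(list)
    for pos, ch in enumerate(text):
        if not ch.isspace():
            index[ch].append(pos)
    return index


def proximity_search(index, a, b, max_gap):
    """Return (pos_a, pos_b) pairs where a and b lie within max_gap characters."""
    hits = []
    for pa in index.get(a, []):
        for pb in index.get(b, []):
            if pa != pb and abs(pa - pb) <= max_gap:
                hits.append((pa, pb))
    return hits


if __name__ == "__main__":
    idx = build_index("色即是空空即是色")
    # Find places where the two characters occur within 3 positions.
    print(proximity_search(idx, "色", "空", 3))
    # One posting (an integer position) is stored per indexed character.
    print(sum(len(v) for v in idx.values()))
```

Note that the index holds one position entry per character of text, and
each entry takes more room than the character it points at, which gives
some feel for why such an index outgrows the data.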

Frank Bennett <bennett@example.com> (IIRC) is the guy to ask.

-- 
School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba                    Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
               Ask not how you can "do" free software business;
              ask what your business can "do for" free software.

