Mailing List Archive

Support open source code!


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: tlug: Udi Manber: Re: Glimpse support for Asian characters



--------------------------------------------------------
tlug note from "Andrew S. Howell" <andy@example.com>
--------------------------------------------------------
>>>>> "Francis" == Francis Brian O'Carroll <ocarroll@example.com> writes:



    Francis> did they say they would accept patches for japanese if we
    Francis> develeped theym? the code is not copyleft, so we couldn't
    Francis> redistribute I think.

I didn't ask if they would accept patches. Their reply was the one
liner I mentioned.

    Francis> Glimpse is basically an index plus grep; if you grep
    Francis> supports japanese you could hack together a prototype
    Francis> jglimpse with a little c programming.

Doesn't it get a bit complicated though, if you have text with various
encodings. I would think that index would have be in some canoncal
format, say EUC. You would have determine the encoding on the
fly, both when creating the index and when searching through the text
again. Actualy, to get the results of the search to display correctly,
wouldn't you have to convert it to whatever your terminal was set for?

I know there are tools to do the identification and conversion, Ken
Lunde's code springs to mind.

Andy

-----------------------------------------------------------------
a word from the sponsor will appear below
-----------------------------------------------------------------
The TLUG mailing list is proudly sponsored by TWICS - Japan's First
Public-Access Internet System.  Now offering 20,000 yen/year flat
rate Internet access with no time charges.  Full line of corporate
Internet and intranet products are available.   info@example.com
Tel: 03-3351-5977   Fax: 03-3353-6096


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links