Mailing List Archive

Support open source code!


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

tlug: Re: Glimpse support for Asian characters



--------------------------------------------------------
tlug note from Dennis McMurchy <denismcm@example.com>
--------------------------------------------------------
On Mon, 12 May 1997, Stephen J. Turnbull wrote:

> And _this_ is the easy part.  Remember, Japanese is an extremely
> highly inflected language and does not use spaces to separate words.
> Unless your indexing program understands Japanese syntax, it is not
> clear how you would go about doing the indexing.

  For those who are interested in this subject of indexing Japanese
text, there is a research group at Kyoto Univ (?), I think it was,
who have a very large indexing program available on the net.

  The program is called juman, and 3.0 was the latest version I know
of.  They can be contacted at juman@example.com according
to the README file that came with juman 3.0.  I haven't used it at all,
so I can't comment on it in any useful way.  I would be very interested
in hearing more about it, though.  It basically relies on an enormous
dictionary to parse Japanese text. 

Dennis McMurchy, 
Tojinmachi, Fukuoka


-----------------------------------------------------------------
a word from the sponsor will appear below
-----------------------------------------------------------------
The TLUG mailing list is proudly sponsored by TWICS - Japan's First
Public-Access Internet System.  Now offering 20,000 yen/year flat
rate Internet access with no time charges.  Full line of corporate
Internet and intranet products are available.   info@example.com
Tel: 03-3351-5977   Fax: 03-3353-6096


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links