Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] web search software
- Date: Tue, 09 Oct 2007 10:09:20 +0900
- From: Darren Cook <darren@example.com>
- Subject: Re: [tlug] web search software
- References: <46E4B1ED.7080904@ldp.jp> <46E4BB4D.4040703@dcook.org> <46E4C006.7040501@ldp.jp> <87fy1m13s8.wl%knok@daionet.gr.jp> <46E5F9A3.2060103@samsara.bebear.net> <87ejh522qb.wl%knok@daionet.gr.jp> <46FC4ED1.6020603@dcook.org> <87lkamnw5u.wl%knok@daionet.gr.jp>
- User-agent: Thunderbird 1.5.0.12 (X11/20070530)
NOKUBI Takatsugu wrote: > * Lucene > > It has a nice architecture because of separation of document > analyzer. NutchDocumentAnalyzer (from Nutch) provides simple > uni-gram search engine, and Japanese support. > OTOH, JapaneseAnalyzer provides word-based-text-chunking search > engine, based on Sen, a clone implementation of ChaSen by Java. > > http://ultimania.org/sen/ > http://tidus.ultimania.org/wiki/index.php?Lucene Thanks. I think next time I'm asked for a search engine this is the one I'll evaluate first, even though Java is not usually my first choice. A bit of discussion in English here: http://www.jguru.com/faq/view.jsp?EID=1168788 (I thought it was interesting that someone was saying the "dumb" CJKAnalyzer gave better results than the JapaneseAnalyzer for their site.) > Senna is also major search engine with Japanese support. Senna > itself is library form, but there are many bindings. Especially, > Toritonn is very useful because it adds Japanese full-text search > support for MySQL. Senna also supports MeCab and bi-gram. > > http://qwik.jp/senna/FrontPage.html (English) > http://qwik.jp/tritonn/ Also useful to know about. Thanks, Darren -- Darren Cook http://dcook.org/mlsn/ (English-Japanese-German-Chinese free dictionary) http://dcook.org/work/ (About me and my work) http://dcook.org/work/charts/ (My flash charting demos)
- References:
- Re: [tlug] web search software
- From: NOKUBI Takatsugu
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] xml to pdf
- Next by Date: Re: [tlug] xml to pdf
- Previous by thread: Re: [tlug] web search software
- Next by thread: RE: [tlug] Getting Audacity to not stutter (was: How to change IRQaddress for a sound card?)
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links