Mailing List Archive

Support open source code!


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

FW: tlug: Two Qs re translation project



I'm forwarding this to the list, since it looks like Michimuko-san's
home address is not known to the list.

Michimuko-san: I've added some replies below, marked with
"FRANK>"

Cheers,
Frank Bennett

-----Original Message-----
$B:9=P?M(B : Shin MICHIMUKO <smitimko@example.com>
$B08@example.com(B : tlug@example.com <tlug@example.com>
CC : bennett@example.com <bennett@example.com>
$BF|;~(B : 2000$BG/(B1$B7n(B29$BF|(B 1:47
$B7oL>(B : Re: tlug: Two Qs re translation project


>Hi Frank,
>
>  This is the second time to send the same e-mail to tlug. First one has
>gone anywhere (lost). Maybe majordomo checks From and Sender lines, I
>guess (first one was sent from my office). This time I send Frank as cc,
>too.
>
>  Anyway,
>
>In message
<000701bf6968$d3c9db80$10210685@example.com> you write:
>> I just bingo'ed.  There is a Japanese-capable version of WAIS-sf
>> (which I remember fighting with when it was in early beta --- *brilliant*
>> stuff once I got it installed, though).  If my hasty glance through the
>> sites leading to this is correct, this is the search engine underlying
>> the "namazu" CGI bundle that is on its way to becoming a default
>> standard for providing search facilities in Japanese intranets and
>> Web search sites.
>
>  I was programming freeWAIS-sf-1.0 Japanese patch on Sony's NEWS-OS,
>And SSE after that. Mr.Baba extended my idea with freeWAIS-sf on
>variety of UNIXes. SSE (now I changed that name to S.CREW), all of it
>was written in perl, uses Mr.Baba's kakasi patch (kakasi is a tool to
>get a word from Japanese sentenses).
>
>   I guess you are confused with kakasi and WAIS engine underlying
>Namazu.  Namazu, yes it is now a de fact standard of Japanese search
>engine on UNIX or Windows, doesn't using any *WAIS* family's engine.
>Namasu uses "kakasi" in it. It has its own hash table of every words
>to search.  Mr.Baba is also contributing Namazu development after he
>made Japanized freeWAIS. You should be careful to use freeWAIS.


FRANK> That is very helpful, thank you.  I had assumed that
FRANK> Kakasi was used to parse out word elements for indexing
FRANK> by a modified WAIS.  I'll be sure to look into it separately.

>> Although it will be like squashing a knat with a pile driver at this
point,
>> I'm really happy about this: I need a reasonably quick return to
>> Tcl of a list of file names that contain a given string or phrase.  WAIS
>> will certainly do the trick, no matter how large our archive
>> eventually grows; so there won't need to be any reengineering
>> later on.  Time to open the champagne.
>
>  Wait. Some of *WAIS* have a limitation that it ignores any words
>which appears too frequently in target files (like over 100 files).
>You must reconfigure the value of the limit in header file. Further
>more, *WAIS* doesn't have a capability of multiple output of result
>pages.


FRANK> I had forgotten this, too.  Oops.  Where we need complete
FRANK> data, we may be better off with simple string searches.
FRANK> You may have saved me from a costly gotcha.  Thanks!

>> Anyway, here's a site that describes the Japanese patch to the
>> WAIS engine (in Japanese):
>>
>>   http://www.kusastro.kyoto-u.ac.jp/~baba/wais/index.html
>
>  Brief description and comparisons of Namazu, freeWAIS, and S.CREW
>are described also by Mr.Baba, at:
>
>http://www.kusastro.kyoto-u.ac.jp/~baba/wais/other-system.html
>
>Check it please.


FRANK> I will.  I just had a quick glance through the documents
FRANK> and sent an ecstatic letter to the list.  Never a good idea,
FRANK> but there it is.  :-)

># I stopped maintainance of S.CREW (SSE) 3 years ago, because Namazu
># is came out by the S.CREW inspiration. S.CREW played a good role
># enough.


FRANK> And many, many thanks for your efforts!  The availability of
FRANK> these tools is going to make a big difference in my field in the
FRANK> next few years.

>== Money is one of the minimum conditions to do anything, but... ======
>    Shin MICHIMUKO <smitimko@example.com>
>    http://www.peanuts.gr.jp/~smitimko/
>=========================================== Freedom is everything. ====
>

--------------------------------------------------------------------
Next Nomikai Meeting: February 18 (Fri) 19:00 Tengu TokyoEkiMae
Next Technical Meeting:  March 11 (Sat) 13:00 Temple University Japan
* Topic: TBD
--------------------------------------------------------------------
more info: http://www.tlug.gr.jp        Sponsor: Global Online Japan

Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links