Mailing List Archive
Re: [tlug] oneliners, Was: Moving on from xterm
- Date: Tue, 23 Aug 2016 13:24:15 +1000
- From: Jim Breen <jimbreen@example.com>
- Subject: Re: [tlug] oneliners, Was: Moving on from xterm
- References: <20160819111442.GA30780@quadratic.cynic.net> <9f9cc5f579c92c3ddf7f29865d5862c2@jp.sometwo.net> <20160822114101.GA3944@fluxcoil.net> <87h9ace7zm.wl-knok@daionet.gr.jp>
On 23 August 2016 at 08:47, NOKUBI Takatsugu <knok@example.com> wrote:

> kakasi has some problem.
> * the dictionary is too old
> * not good for complex sentence

Quite. It's not for serious work.

> MeCab is also useful for such situation.

Indeed. The pick of the bunch.

> mecab-ipadic-neologd is a good dictionary for MeCab.
> https://github.com/neologd/mecab-ipadic-neologd

Hmmm. IPADIC is long in the tooth too. Most serious users of mecab would go for unidic as a morpheme dictionary.

I see that Toshinori Sato, who has compiled the "neologd" extensions (there's one for unidic too), has added a lot of expanded terms which are not really morphemes. For example, if I put ラテン文字で表記される into it, the unidic and ipadic segmentation is ラテン+文字+で+表記+さ+れる, but if I try it with neologd I get ラテン文字+で+表記+さ+れる. In other words, he's added ラテン文字 as a unitary noun. If that's what you want, fine, and his work may well help apps which just want to add furigana to text, but it's getting right away from being a morphological analyzer.

Jim

--
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
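
To make the segmentation difference above concrete, here is a rough toy sketch in Python. It is not MeCab and does not use any of the real dictionaries: MeCab picks a lowest-cost path through a lattice, whereas this is a plain greedy longest-match over a tiny hand-made word list. The names `segment`, `BASE_LEXICON` and `EXTENDED_LEXICON` are invented for the illustration. It only shows how adding one compound entry such as ラテン文字 changes what comes out.

```python
# Toy greedy longest-match segmenter (an assumption for illustration only;
# MeCab itself uses a cost-based lattice search, not this method).

def segment(text, lexicon):
    """Split text by taking the longest lexicon entry at each position."""
    max_len = max(len(word) for word in lexicon)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink.
        for n in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + n]
            if piece in lexicon:
                tokens.append(piece)
                i += n
                break
        else:
            # No entry matched: emit the single character as-is.
            tokens.append(text[i])
            i += 1
    return tokens

# Hypothetical mini word lists standing in for the real dictionaries.
BASE_LEXICON = {"ラテン", "文字", "で", "表記", "さ", "れる"}
EXTENDED_LEXICON = BASE_LEXICON | {"ラテン文字"}  # compound added as one entry

sentence = "ラテン文字で表記される"
base_tokens = segment(sentence, BASE_LEXICON)
extended_tokens = segment(sentence, EXTENDED_LEXICON)

print("+".join(base_tokens))      # ラテン+文字+で+表記+さ+れる
print("+".join(extended_tokens))  # ラテン文字+で+表記+さ+れる
```

With the compound in the word list the two morphemes ラテン and 文字 are no longer visible in the output, which is fine for furigana but loses information a morphological analysis would keep.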
- Follow-Ups:
  - Re: [tlug] oneliners, Was: Moving on from xterm
    - From: NOKUBI Takatsugu
- References:
  - [tlug] Moving on from xterm
    - From: Curt Sampson
  - Re: [tlug] Moving on from xterm
    - From: Furkan Mustafa
  - [tlug] oneliners, Was: Moving on from xterm
    - From: Christian Horn
  - Re: [tlug] oneliners, Was: Moving on from xterm
    - From: NOKUBI Takatsugu