Mailing List Archive
tlug.jp Mailing List tlug archive tlug Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: [tlug] EDICT dictionary on Kindle
- Date: Sat, 22 Dec 2012 15:34:56 +1100
- From: Jim Breen <jimbreen@example.com>
- Subject: Re: [tlug] EDICT dictionary on Kindle
- References: <CA+04B5BRXGfiC1_=aGLZ8PCo-0wPoL3tPVR60myiLEG0i6xpNg@mail.gmail.com> <CADFHC5yk6BGvhTM0wb3nbSP+5TFSq763JodLruFr7a7bc==y3g@mail.gmail.com> <CA+04B5Dt5+SXf30Wt_L3G=zCkRAbjx3NJEcHfgr+2XZptzGsFw@mail.gmail.com> <CAMEtR2K4-tTQNmqKnGXJNWVFpAat4LSt=CNEcZQRe-s3E4RfLw@mail.gmail.com> <CABHGxq5AudMvnjSjiswDSy20V1oJ4CMVfjnRFqOVe656+a7K3Q@mail.gmail.com> <CAMEtR2JO-Z6LCCD1-U0oD1c3eX82inJbCXxY_aZFK3s_u9ATqA@mail.gmail.com> <20121216222647.GA10816@sanma.local> <CABHGxq6tiwS6i2wMMpiMRK2_yWh+9HaemG4TMFnqmPUhmJJpUQ@mail.gmail.com> <20121221124529.GA6829@sanma.local>
On 21 December 2012 23:45, John Mettraux <jmettraux@example.com> wrote: > On Fri, Dec 21, 2012 at 10:19:49PM +1100, Jim Breen wrote: >> I have put my version at: >> http://www.csse.monash.edu.au/~jwb/edict2.mobi >> Can someone with a Kindle try it out and check on >> (a) and (b) above for me? > For (b) I counted 167060 <idx:entry.../>. Yep, I get that too (in the edict2.html file). > Now for the file at > > http://www.csse.monash.edu.au/~jwb/edict2.mobi > > Regarding (a), lookup from the dictionary itself (magnifying glass) doesn't > work, I guess it's an encoding issue (hint, the attached jpeg). > Lookup from a japanese document doesn't work (the kindle falls back to its > own wawa dictionary, disregards edict2.mobi). There doesn't seem to be a magnifying glass in Kindle Previewer. When I created the edict2.html file file (ruby to_opf.rb < edict2.txt > html/edict2.html) I noticed the entries came out looking like this: <idx:entry name="word" scriptable="yes"> <h2>金円【きんえん】</h2> <idx:orth value="\351\207\221\345\206\206"></idx:orth> <idx:orth value="\343\201\215\343\202\223\343\201\210\343\202\223"></idx:orth> <p>(n) money</p> </idx:entry> In other words, the kanji/kana UTF8 indices are in octal literals. Should that not happen? It's being generated by the ruby script you have on github. Perhaps the ruby I have installed behaves differently? Mine is "ruby 1.8.7 (2010-01-10 patchlevel 249) [i486-linux]" and is the latest available for the Ubuntu I am running. > Sorry, I didn't look at (b). It would be interesting to have a look at the > edict2.html output before it gets rolled into the .mobi file. Is the sample above enough? I can send you the whole thing I checked what's showing in Kindle Previewer. The last entry, marked "118523" is the last in EDICT2, so the other 50k may have been dropped internally. Does your Kindle display the number of entries? Cheers Jim -- Jim Breen Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
- Follow-Ups:
- Re: [tlug] EDICT dictionary on Kindle
- From: John Mettraux
- References:
- Re: [tlug] EDICT dictionary on Kindle
- From: John Mettraux
- Re: [tlug] EDICT dictionary on Kindle
- From: Jim Breen
Home | Main Index | Thread Index
- Prev by Date: Re: [tlug] EDICT dictionary on Kindle
- Next by Date: Re: [tlug] EDICT dictionary on Kindle
- Previous by thread: Re: [tlug] EDICT dictionary on Kindle
- Next by thread: Re: [tlug] EDICT dictionary on Kindle
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links