Mailing List Archive


[tlug] How to make a current running kanji compound list from the news

Martin G wrote:

> However, I got to thinking about it, and wondered if with all the
> modern tools and the fact that almost all news is online, surely
> (hopefully) there would be a way to scan news sites for the most
> common compounds and make a spreadsheet of them.

It's been done a few times, and the data is available.

See the page:
and search (Ctrl-F) for the word "Girardi". That takes you to the start
of a collection of word frequency lists, some of which may be useful.
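
For anyone who would rather build a list from current text, here is a
rough sketch in Python. It is not how the published lists were made. It
assumes a "compound" can be approximated as any run of two or more
consecutive kanji, which is crude: adjacent compounds fuse into one
entry, and proper segmentation would need a morphological analyser. The
names count_compounds, to_csv and SAMPLE are made up for illustration,
and SAMPLE is an invented stand-in for downloaded article text.

```python
import csv
import io
import re
from collections import Counter

# Runs of two or more characters from the CJK Unified Ideographs block.
# Assumption: a "compound" is approximated as any such run.
KANJI_RUN = re.compile(r"[\u4e00-\u9fff]{2,}")


def count_compounds(text):
    """Return a Counter of kanji runs (length >= 2) found in text."""
    return Counter(KANJI_RUN.findall(text))


def to_csv(counts):
    """Render the counts as CSV text, most frequent first."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["compound", "count"])
    # Sort by descending count, then by the compound itself so the
    # order is deterministic for ties.
    for word, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        writer.writerow([word, n])
    return buf.getvalue()


# Hypothetical sample text standing in for downloaded news articles.
SAMPLE = "政府は経済対策を発表した。経済の回復が政府の課題だ。"

if __name__ == "__main__":
    print(to_csv(count_compounds(SAMPLE)), end="")
```

The CSV output opens directly in a spreadsheet. Note that in the sample
the run 経済対策 is counted as a single entry, separate from 経済, which
shows the limitation mentioned above.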


Jim Breen
Adjunct Snr Research Fellow, Clayton School of IT, Monash University
Vice-president: Hawthorn Rowing Club, Treasurer: Japanese Studies Centre
Graduate student: Language Technology Group, University of Melbourne
