
Mailing List Archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[tlug] How to make a current running kanji compound list from the news
- Date: Fri, 22 Jul 2011 14:37:12 +1000
- From: Jim Breen <jimbreen@example.com>
- Subject: [tlug] How to make a current running kanji compound list from the news
Martin G wrote:
> However, I got to thinking about it, and wondered if with all the
> modern tools and the fact that almost all news is online, surely
> (hopefully) there would be a way to scan news sites for the most
> common compounds and make a spreadsheet of them.
It's been done a few times, and the data is available.
See the page: http://ftp.monash.edu.au/pub/nihongo/00INDEX.html
and scan down (Ctrl-F) for the word "Girardi". You'll be at the start
of a collection of word frequency lists. Some of them may be useful.
Jim
--
Jim Breen
Adjunct Snr Research Fellow, Clayton School of IT, Monash University
Vice-president: Hawthorn Rowing Club, Treasurer: Japanese Studies Centre
Graduate student: Language Technology Group, University of Melbourne
Home |
Main Index |
Thread Index