Hi Steve,
 
Sorry to respond in between.
I have meagre technical knowledge but I thought I can 
contribute a bit on the Japanese language front.
 
>http://www2s.biglobe.ne.jp/~suzakihp/index40.html
>The biglobe page is beyond my japanese ability but at 
the top of it is a link to  
>国会図書館データベース
>which I think is some kind of data base at the National Diet 
Library.
The database is indeed of the National Diet Library 
(Japan Database Navigation Service). 
 
The biglobe page has following functions in 
Japanese:
1. Surname Search (Can search by Kanji (any Kanji in 
the name) or Katakana)
2. Exceptional Kanji Search (Kanji out of JIS Level 1 
and 2)
3. Surname Questionnaire and 
Results
4. Message Board
5. Kanji Rankings (From No.1 to 
10k)
6. Introduction to Japanese 
Surnames
7. Members/Surname Search
 
Guess you were able to figure out some things 
already.
Just thought of helping a bit.
Hope I am not disturbing the 
thread.
 
Regards,
Prasad
Jim Breen wrote:
Hard data to get too. When I was at Tokyo Gaidai they had access to
a full copy of the NTT directory. It would have been nice to do some
frequency measures on names, and geographical dispersions on
family names, but there was an embargo on any publications
drawing on the data. They said it was because of "privacy".
  
I don't know if this will be of interest or not, but this 
page:
http://www.japanese-name-translation.com/site/japanese_names.html#02
has 
a link to an xls file that has a sorted list of the top 500 japanese names 
ordered with number of households and romaji pronounciation.  The xls file 
attributes it's data to this page:
http://www2s.biglobe.ne.jp/~suzakihp/index40.html
The 
biglobe page is beyond my japanese ability but at the top of it is a link to 
 
国会図書館データベース
which I 
think is some kind of data base at the National Diet Library.
Steve 
S.