Mailing List ArchiveSupport open source code!
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: tlug: msword files
- To: tlug@example.com
- Subject: Re: tlug: msword files
- From: "Stephen J. Turnbull" <turnbull@example.com>
- Date: Fri, 8 Oct 1999 10:17:53 +0900 (JST)
- Content-Transfer-Encoding: 7bit
- Content-Type: text/plain; charset=us-ascii
- In-Reply-To: <37FD3924.EF3A79B@example.com>
- References: <19991007095454N.hbell@example.com><Pine.LNX.4.10.9910071031110.1892-100000@example.com><19991008064806.A1963@example.com><37FD3924.EF3A79B@example.com>
- Reply-To: tlug@example.com
- Sender: owner-tlug@example.com
>>>>> "Hirotaka" == Hirotaka Yoshioka <hyoshiok@example.com> writes: Hirotaka> Shimpei Yamashita wrote: >> But does "strings" work on Japanese characters at all? I >> suspect not. Yes. As long as they're 7-bit JIS. You'll get ESC-less JIS, but that's easily repaired since you _will_ be left with the rest of the kanji in-out sequences. That won't work on Japanese MS Word files, of course. Hirotaka> Can we write a SJIS version of 'strings'? No. strings(1) simply (1) checks for object/library files, and if so uses libbfd to find the data, if not takes the whole file and (2) scans it byte-by-byte using isprint() and spits out sequences of 4 bytes or more that are printable. Even if we had a POSIX locale for Japan, which we don't, really, the byte-oriented-ness would screw this idea. Not to mention the fact that since MS saw fit to pack GR with the half-wit kana, most executables are going to be about 70-80% printable outside of the string data. Ie, a non-ASCII version of strings must know a lot more about file structure; for Shimpei's application you really need to know what a MS Word file looks like, and even MS Word doesn't know that from one version to the next; I doubt you'll find anybody to maintain a version of strings that handles shift JIS. -- University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN Institute of Policy and Planning Sciences Tel/fax: +81 (298) 53-5091 __________________________________________________________________________ __________________________________________________________________________ What are those two straight lines for? "Free software rules." ------------------------------------------------------------------- Next Technical Meeting: October 9 (Sat), 13:30 place: Temple Univ. * Linux Internationalisation Initiative (Li18nux) speaker: Akio Kido * Japanese TrueType Fonts speaker: Adrian Havill Next Technical Meeting: November 13 (Sat), 13:30 place: Temple Univ. * Network Security speaker: Steve Baur Next Nomikai: December 17 (Fri), 19:00 Tengu TokyoEkiMae 03-3275-3691 ------------------------------------------------------------------- more info: http://www.tlug.gr.jp Sponsor: Global Online Japan
- Follow-Ups:
- Re: tlug: msword files
- From: Darren Cook <darren@example.com>
- Re: tlug: msword files
- From: "Hirotaka Yoshioka" <hyoshiok@example.com>
- References:
- Re: tlug: msword files
- From: Tony Laszlo <laszlo@example.com>
- Re: tlug: msword files
- From: Shimpei Yamashita <shimpei@example.com>
- Re: tlug: msword files
- From: "Hirotaka Yoshioka" <hyoshiok@example.com>
Home | Main Index | Thread Index
- Prev by Date: Re: tlug: msword files
- Next by Date: Re: tlug: msword files
- Prev by thread: Re: tlug: msword files
- Next by thread: Re: tlug: msword files
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links