Mailing List ArchiveSupport open source code!
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]Re: location of pdf2txt
- To: tlug@example.com
- Subject: Re: location of pdf2txt
- From: Frank BENNETT <bennett@example.com>
- Date: Wed, 29 Nov 2000 12:29:02 +0900
- Content-Transfer-Encoding: 7bit
- Content-Type: text/plain; charset=iso-2022-jp
- In-Reply-To: <3A242324.D10A8EE8@example.com>; from Drew Poulin on Tue, Nov 28, 2000 at 01:27:00PM -0800
- References: <3A2418FA.1CF81F89@example.com> <3A242324.D10A8EE8@example.com>
- Reply-To: tlug@example.com
- Resent-From: tlug@example.com
- Resent-Message-ID: <a7nVH.A.YBG.KgHJ6@example.com>
- Resent-Sender: tlug-request@example.com
On Tue, Nov 28, 2000 at 01:27:00PM -0800, Drew Poulin wrote: > Selva wrote: > > > Does anyone know the latest location of pdf2txt? > > I'm not sure if this is what you're looking for, but xpdf includes > pdftotext, which does extract Japanese if you set that compile option. > > http://www.foolabs.com/xpdf/ Motion seconded: this is what you want, Selva. Xpdf now ships with the decryption patches that used to be housed at another site. It also supports Jse (you'll get output in EUCJP). If you are working on vertically formatted docs, drop me a note -- last week I wrote a Python script that munges the debugging output of pdftotext as applied to vertical files into reading-order horizontal text. It would really make my day if someone would take the Python script's algorithms and implement them inside pdftotext (and add the CMap entries for vertically formatted text as well). But the script, such as it is, is free for the asking. Cheers, ---- -x80 Frank G Bennett, Jr @@ Faculty of Law, Nagoya Univ () email: bennett@example.com Tel: +81[(0)52]789-2239 ()
- References:
- location of pdf2txt
- From: Selva <snair@example.com>
- Re: location of pdf2txt
- From: Drew Poulin <poulin@example.com>
Home | Main Index | Thread Index
- Prev by Date: Re: Good CJK distro
- Next by Date: Re: Good CJK distro
- Prev by thread: Re: location of pdf2txt
- Next by thread: Re: location of pdf2txt
- Index(es):
Home Page Mailing List Linux and Japan TLUG Members Links