Mailing List Archive

[tlug] Re: Search Tools...

On Thu, Apr 19, 2007 at 05:06:59PM -0700, Frank Jahnke wrote:
> Scott,
> Have you kept up with tools that search for text strings in files?  

Only text files.  I don't work much with pdfs. (See, I actually read the
whole thing before answering.)

So, since I only deal with Excel and Word, I can use antiword and the
other one, catdoc, which has xls2csv as a part of it.  (Both programs
will print .doc on the console, and catdoc's xls2csv will also take an
Excel sheet and turn it to a csv--errm, you might have guessed that from
the program's name.)

So I don't go much beyond grep, locate and find.

> Well, now I do.  My particular need is that I have about 300 PDF files
> that I need to search as a group for a collection of key words.  Right
> now they are on the W2K box, but they can be moved to BSD or XP if
> needed.  Cheap is preferred, but quality is more important.

Is there any way to turn a pdf into txt and use the builtins, e.g. grep?
I'd google it for you, but I'm really tired this week--everyone is, I
think it's the weather. 

I don't know if that's practical but...hrrm, now I'm curious, I'm going
to do a quick google for pdf to text. Hold on.

Ok, there are two that are part of other programs.  xpdf has pdftotext
and ghostscript includes ps2ascii, which also accepts PDF input.
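
A rough sketch of the convert-then-search idea, in case it helps. It
assumes xpdf's pdftotext command is installed and on the PATH; the
directory name "pdfs" and the keywords are placeholders I made up, and
the function names are only for illustration. I haven't tried it on a
pile of 300 files, so treat it as a starting point.

```python
import subprocess
from pathlib import Path


def pdf_to_text(pdf_path):
    """Run pdftotext on one file; "-" sends the extracted text to stdout."""
    result = subprocess.run(
        ["pdftotext", str(pdf_path), "-"],
        capture_output=True,
        encoding="utf-8",
        errors="replace",
        check=True,
    )
    return result.stdout


def find_keywords(text, keywords):
    """Return the keywords that occur in text, ignoring case."""
    lowered = text.lower()
    return [kw for kw in keywords if kw.lower() in lowered]


def search_pdfs(directory, keywords):
    """Map each PDF file name in directory to the keywords found in it."""
    hits = {}
    for pdf in sorted(Path(directory).glob("*.pdf")):
        found = find_keywords(pdf_to_text(pdf), keywords)
        if found:
            hits[pdf.name] = found
    return hits


if __name__ == "__main__":
    # "pdfs", "foo" and "bar" are placeholders, not anything from the thread.
    for name, found in search_pdfs("pdfs", ["foo", "bar"]).items():
        print(name + ": " + ", ".join(found))
```

This does a plain substring match, so it is closer to grep -i -F than to
a real indexer. Scanned PDFs that contain only page images will come out
as empty text and never match.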

I don't know if this makes any sense for you, however, as your needs are
usually very highly sophisticated.


Scott Robbins

PGP keyID EB3467D6
( 1B48 077D 66F6 9DB0 FDC2 A409 FA54 EB34 67D6 )
gpg --keyserver --recv-keys EB3467D6

Harmony: How are you gonna kill her? Think! The second you even 
point that thing at her, you're gonna be all 'Aaagh!' (holding 
her hand to her head in imitation of Spike), and then you'll get 
bitch-slapped up and down Main Street, unless she's finally had 
enough and just stakes you! 
Spike: Sure, it'll hurt like hell for about two hours. But she'll
be dead just a little longer than that. 
