Re: [MacPerl] Reading pdf files

Steve Swantz Fri, 23 Mar 2001 18:57:42 -0800
Bion

No, I haven't even tried yet. I, too, found the docs a little short. Good
examples for getting the meta data out, but I have no idea how to get the
content out - what is the "$pdfstring" to unquote in the first place? A
slurp of the whole file?

After a lot of clicking, I did find the PDF Reference on Adobe's site. I
started learning Perl several years ago to modify a cgi, mission creep set
in, and I now I've learned more than I ever wanted to about a lot of stuff.
Getting another brainache learning pdf is not what I was hoping for.

If getting the text out of a pdf is trivial, I'd love to see an example.

Steve

Bion Pohl wrote:

> Hi Steve,
> 
> Did you get this figured out?  I was wanting to do the same thing,
> reading a PDF as text, but the PODs for the PDF moduals are very shy of
> examples or instruction.  (Probably written by one of those Un*x Gurus
> that seem to think "If you don't know then I'm not going to tell you."
> is an acceptibal form of instruction.  Has anyone else ever noticed
> that?  [Before anyone trashes me, I have been a Un*x user since the 80's
> and a Linux user at work and on my Mac.])
> 
> 
> Steve Swantz wrote:
>> 
>> Umm, disregard....
>> 
>> I already had it, just had to look at it a little harder with Shuck.
>> 
>> use PDF::Core;
>> 
>> and
>> 
>> $string = UnQuoteString($pdfstring);
>> 
>> I'll be back when I can't get it to work. ;-)
>> 
>> Steve
>> 
>>> From: Steve Swantz <[EMAIL PROTECTED]>
>>> Date: Sat, 17 Mar 2001 23:03:03 -0600
>>> To: <[EMAIL PROTECTED]>
>>> Subject: [MacPerl] Reading pdf files
>>> 
>>> I need to extract the text of some pdf files so that I can add the words in
>>> them to a sites search index. The modules I've found on cpan appear to deal
>>> with creating pdf files as opposed to extracting the text from them.
>>> 
>>> Can anyone here point me to an appropriate module?
>>> 
>>> Thanks,
>>> 
>>> Steve Swantz
>>>
Re: [MacPerl] Reading pdf files

Reply via email to