Bion
No, I haven't even tried yet. I, too, found the docs a little short. Good
examples for getting the meta data out, but I have no idea how to get the
content out - what is the "$pdfstring" to unquote in the first place? A
slurp of the whole file?
After a lot of clicking, I did find the PDF Reference on Adobe's site. I
started learning Perl several years ago to modify a cgi, mission creep set
in, and I now I've learned more than I ever wanted to about a lot of stuff.
Getting another brainache learning pdf is not what I was hoping for.
If getting the text out of a pdf is trivial, I'd love to see an example.
Steve
Bion Pohl wrote:
> Hi Steve,
>
> Did you get this figured out? I was wanting to do the same thing,
> reading a PDF as text, but the PODs for the PDF moduals are very shy of
> examples or instruction. (Probably written by one of those Un*x Gurus
> that seem to think "If you don't know then I'm not going to tell you."
> is an acceptibal form of instruction. Has anyone else ever noticed
> that? [Before anyone trashes me, I have been a Un*x user since the 80's
> and a Linux user at work and on my Mac.])
>
>
> Steve Swantz wrote:
>>
>> Umm, disregard....
>>
>> I already had it, just had to look at it a little harder with Shuck.
>>
>> use PDF::Core;
>>
>> and
>>
>> $string = UnQuoteString($pdfstring);
>>
>> I'll be back when I can't get it to work. ;-)
>>
>> Steve
>>
>>> From: Steve Swantz <[EMAIL PROTECTED]>
>>> Date: Sat, 17 Mar 2001 23:03:03 -0600
>>> To: <[EMAIL PROTECTED]>
>>> Subject: [MacPerl] Reading pdf files
>>>
>>> I need to extract the text of some pdf files so that I can add the words in
>>> them to a sites search index. The modules I've found on cpan appear to deal
>>> with creating pdf files as opposed to extracting the text from them.
>>>
>>> Can anyone here point me to an appropriate module?
>>>
>>> Thanks,
>>>
>>> Steve Swantz
>>>