On 27 Jan 01 at 0:00, Heimo Claasen wrote:

>>> Have you tried loading one of those co-processor emulators before the
>>> PDF2TEXT converter program...
>>The announcement said it's a "DOS" prog. So why should it need a math
>>coprocessor ?

To get the benefit of the extra computing "muscle". It might be
insufferably slow otherwise.

>>> I have a 6meg PDF of the manual for my Star Micronics printer
>>> that it converted to a 300K text file. Manually editing out the garbage
>>> and empty space dropped it below 200k.
>>And that's precisely the point: what's needed is a sheer text extractor, not
>>something that leaves you with lots of manual editing. Comparable to this
>>excellent HTML-stripper, HTMSTRIP.

Well, PDF is obviously much more complex than HTML, which is pure
text to begin with. All a stripper program needs to do is identify
the HTML tags and remove them during conversion. Would that it were
that simple with PDF.
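
To make the "identify the tags and remove them" idea concrete, here
is a minimal sketch in Python using the standard html.parser module.
It is only an illustration of the general technique, not how HTMSTRIP
itself works (I have no knowledge of its internals); the names
TagStripper and strip_html are made up for this example.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Keeps the text between tags and discards the tags themselves."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into plain characters.
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip = 0  # greater than zero while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        # Text nodes are the only "payload" we keep.
        if not self._skip:
            self.parts.append(data)


def strip_html(html):
    """Return the text content of an HTML string with whitespace collapsed."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return " ".join("".join(parser.parts).split())


if __name__ == "__main__":
    print(strip_html("<p>Hello <b>world</b></p>"))
```

Nothing this short could work for PDF, where the text is typically
stored in compressed streams and positioned piece by piece on the
page rather than sitting in the file as readable markup.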

>>BTW, the example mentioned is exemplary for the completely superfluous
>>use of that bloatware - hardly three per cent of "payload" in terms of
>>text contained.

I could not agree more, Heimo. It is the grossest example I have
ever personally encountered. I am so glad that PDF2TEXT works as
well as it does. I plan to open the PDF file in the Acrobat reader
and the text file in a good text editor, then scroll through the PDF
and add the truly necessary graphics to the text file using standard
ASCII characters. Then I will be able to reclaim those 6 megs of HD
space.

Regards,
Dale Mentzer

I'm not crazy, I've just been in a very bad mood for 35 years.


    This mail written by a user of Arachne, the DOS Internet Client
                WWWWW World Wide Web Without Windows
          http://home.arachne.cz Arachne DOS Browser Home Page

