It is really very simple.
An empty document is just a picture of a document. Unlike a text document as
say created in Word where each letter is represented by a distinct computer
number which leaves no ambiguity about what that computer number represents,
a picture of a document is a series of darker and lighter smudges on a
screen.
When you print it you form a facsimile of the image. when you scan that with
an OCR programme like K1000 or Open Book or Abbey reader or omnipro the
image is processed and the programme attempts to recognize patterns it can
recognize as letters and spaces.
When you use the so-called virtual printer you are just telling K1000 or
Open Book to use that PDF picture of the document rather than print and
acquire another picture of the printed document from the scanner.
How good the OCR will be depends on the quality of the image. It should be
better direct from the PDF than printing and rescanning the picture as a
certain amount of quality will be lost at each conversion between formats.
Where the original electronic version, that is the file created by the word
processor is not available an image is the only alternative for sending the
document. The sender could print and scan and OCR the document of course
complete with the errors their program might introduce.
Those digitized documents from archives are all images, pictures of the
original complete with doodles in the margins, coffee stains and thumb
prints and all. They are not retyped documents. They might as well be .GIF
or .JPG images.
----- Original Message -----
From: "Marilyn Walker" <[email protected]>
To: <[email protected]>
Sent: Monday, April 13, 2009 4:25 PM
Subject: Re: [JAWS-Users] Adobe saying empty document
Mike, I had this happen to me last week when I tried to access a pdf
document from the University of IllinoisHistorical archives. It seemed
that
the pages, labeled "graphical" were probably scanned document pages that
may
have been digitized. So, as a last resort, I tried the Kurzweil 1000
virtual printer and that opened the document. However, unlike other
pdf's,
this large file reads very slowly with my using the down arrow keys and I
find that normal K1000navigation such as "go to page" scarcely work.
Maybe
there is something I need to know about digitized pages because the time
consumed was really intolerable. marilyn
Visit the JAWS Users List home page at:
http://www.jaws-users.com
Visit the Blind Computing home page at:
http://www.blind-computing.com
Address for the list archives:
http://www.mail-archive.com/[email protected]
To post to this group, send email to
[email protected]
To unsubscribe from this group, send an email to
[email protected]
For help from Mailman with your account Put the word help in the subject
or body of a blank message to:
[email protected]
Use the following address in order to contact the management team
[email protected]
If you wish to join the Blind Computing list send a blank email to the
following address:
[email protected]
Visit the JAWS Users List home page at:
http://www.jaws-users.com
Visit the Blind Computing home page at:
http://www.blind-computing.com
Address for the list archives:
http://www.mail-archive.com/[email protected]
To post to this group, send email to
[email protected]
To unsubscribe from this group, send an email to
[email protected]
For help from Mailman with your account Put the word help in the subject or
body of a blank message to:
[email protected]
Use the following address in order to contact the management team
[email protected]
If you wish to join the Blind Computing list send a blank email to the
following address:
[email protected]