Check out Ghostscript to convert to ps format and then png. Jpeg is not
good for text anyway.
Sven

On Monday, December 17, 2012, davebt wrote:

> Hi. I am not technically minded andhave a question I hope someone can help
> with. I use tesseract 3.1 as i don't have access to a .net wrapper to
> update to 3.2 and use the OCR to read PDF documents from suppliers. I use
> the free open source image printer 2.1 to create the jpeg for the tesseract
> engine to then map and read. However I have come unstuck on one supplier
> where the resolution of the document seems to be probematic and it causes
> the OCR to make too many errors. However if I upload there document to a
> different online JPEG convertor I get results that the OCR can then read
> fine, which means I believe if I can improve the printer driver, I could
> handle the documents. Is anyone using anything that is open source they
> believe is a better printer driver than the imageprinter 2.1?? Suggestions
> for solutions I have overlooked?
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to 
> [email protected]<javascript:_e({}, 'cvml', 
> '[email protected]');>
> To unsubscribe from this group, send email to
> [email protected] <javascript:_e({}, 'cvml',
> 'tesseract-ocr%[email protected]');>
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>


-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to