Check out Ghostscript to convert the PDF to PS format and then to PNG. JPEG is not a good format for text anyway.

Sven
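As a rough illustration of the Ghostscript route, here is a minimal Python sketch that builds the command line for rendering each PDF page to a PNG. Ghostscript can read PDF directly, so this sketch skips the intermediate PS step. The helper names (`build_gs_command`, `pdf_to_png`), the `page-%03d.png` naming pattern, the 300 dpi default, and the grayscale `pnggray` device are all assumptions chosen for the example, not settings taken from this thread. On Windows the console binary is usually called `gswin32c` or `gswin64c` rather than `gs`.

```python
import shutil
import subprocess
from pathlib import Path


def build_gs_command(pdf_path, out_dir, dpi=300, device="pnggray", gs_binary="gs"):
    """Return the Ghostscript argument list that renders each PDF page to a PNG.

    The 300 dpi default and the pnggray device are illustrative assumptions;
    adjust them to whatever gives the OCR the best results.
    """
    # Ghostscript expands %03d to the page number (001, 002, ...).
    out_pattern = Path(out_dir) / "page-%03d.png"
    return [
        gs_binary,
        "-dSAFER",             # restrict file access while interpreting the input
        "-dBATCH",             # exit when the input has been processed
        "-dNOPAUSE",           # do not wait for a keypress between pages
        f"-sDEVICE={device}",  # PNG output device (pnggray = 8-bit grayscale)
        f"-r{dpi}",            # rendering resolution in dots per inch
        f"-sOutputFile={out_pattern}",
        str(pdf_path),
    ]


def pdf_to_png(pdf_path, out_dir, dpi=300, gs_binary="gs"):
    """Run Ghostscript and return the PNG files it produced, in page order."""
    if shutil.which(gs_binary) is None:
        raise RuntimeError(f"Ghostscript binary '{gs_binary}' was not found on PATH")
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_gs_command(pdf_path, out_dir, dpi, gs_binary=gs_binary), check=True)
    return sorted(Path(out_dir).glob("page-*.png"))
```

Calling `pdf_to_png("invoice.pdf", "out")` (the file name is a placeholder) would write `out/page-001.png`, `out/page-002.png`, and so on, which can then be passed to Tesseract one page at a time. PNG is lossless, so it avoids the compression artifacts that JPEG adds around the edges of text.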
On Monday, December 17, 2012, davebt wrote:

> Hi. I am not technically minded and have a question I hope someone can help
> with. I use Tesseract 3.1, as I don't have access to a .NET wrapper to
> update to 3.2, and I use the OCR to read PDF documents from suppliers. I use
> the free, open-source ImagePrinter 2.1 to create the JPEG for the Tesseract
> engine to then map and read. However, I have come unstuck on one supplier,
> where the resolution of the document seems to be problematic and causes
> the OCR to make too many errors. If I upload their document to a
> different online JPEG converter, I get results that the OCR can then read
> fine, so I believe that if I can improve the printer driver, I could
> handle the documents. Is anyone using anything open source that they
> believe is a better printer driver than ImagePrinter 2.1? Any suggestions
> for solutions I have overlooked?
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en

--
"All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king."

