I have a number of PDF files I am trying to OCR....I have a script
which extracts each page into individual .tif files which I then run
through tesseract, and everything works fine
However I am trying to output these pages into one single file and am
having problems
Currently I am using the command:
tesseract image$i output$i
where it is contained with a loop
It has been suggested to me to use the command:
tesseract image$i >> output
But when I try that I get an error message from tesseract
I am using ubuntu 10.04 and tesseract 3
I have tried searching on the net, but can find no mention of appeding
pages..possibly because I am not using the right terminology
Any advice would be most appreciated please
Thanks
Alexis
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en