On Fri, Dec 14, 2012 at 8:01 PM, Alexis <ya...@antonakis.co.uk> wrote:

> I have a number of PDF files I am trying to OCR....I have a script
> which extracts each page into individual .tif files which I then run
> through tesseract, and everything works fine
> However I am trying to output these pages into one single file and am
> having problems
>
> Currently I am using the command:
>     tesseract image$i output$i
> where it is contained with a loop
>
> It has been suggested to me to use the command:
>      tesseract image$i >> output
> But when I try that I get an error message from tesseract
>
> Of course because you did not specified output file. And tesseract ocr
output goes always to file. So redirection of stdout does not make sense.


> I am using ubuntu 10.04 and tesseract 3
>
> I have tried searching on the net, but can find no mention of appeding
> pages..possibly because I am not using the right terminology
>
> Maybe you should search for "most frequently used commands on Unix-like
operating systems"[1] ;-)
[1] http://www.linfo.org/cat.html


> Any advice would be most appreciated please
>
> Thanks
> Alexis
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to tesseract-ocr@googlegroups.com
> To unsubscribe from this group, send email to
> tesseract-ocr+unsubscr...@googlegroups.com
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesseract-ocr@googlegroups.com
To unsubscribe from this group, send email to
tesseract-ocr+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to