On Fri, Dec 14, 2012 at 8:01 PM, Alexis <ya...@antonakis.co.uk> wrote:
> I have a number of PDF files I am trying to OCR....I have a script
> which extracts each page into individual .tif files which I then run
> through tesseract, and everything works fine
> However I am trying to output these pages into one single file and am
> having problems
>
> Currently I am using the command:
> tesseract image$i output$i
> where it is contained within a loop
>
> It has been suggested to me to use the command:
> tesseract image$i >> output
> But when I try that I get an error message from tesseract
>

Of course, because you did not specify an output file. Tesseract always writes its OCR output to a file, so redirecting stdout does not make sense.

> I am using Ubuntu 10.04 and tesseract 3
>
> I have tried searching on the net, but can find no mention of appending
> pages..possibly because I am not using the right terminology
>

Maybe you should search for "most frequently used commands on Unix-like operating systems"[1] ;-)

[1] http://www.linfo.org/cat.html

> Any advice would be most appreciated please
>
> Thanks
> Alexis
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to tesseract-ocr@googlegroups.com
> To unsubscribe from this group, send email to
> tesseract-ocr+unsubscr...@googlegroups.com
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
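
To spell out the cat hint: a minimal sketch, assuming tesseract writes its result to "<outputbase>.txt" and that the pages are passed in reading order. The helper name ocr_pages and the file names in the usage line are only illustrative; they do not come from the original script.

```shell
#!/bin/sh
# Sketch: OCR each page into its own text file, then join the results with cat.
# Assumption: "tesseract IMAGE OUTPUTBASE" writes OUTPUTBASE.txt.

# ocr_pages COMBINED_FILE IMAGE...   (hypothetical helper name)
ocr_pages() {
    combined=$1
    shift
    : > "$combined"                            # start with an empty combined file
    for image in "$@"; do
        base=${image%.*}                       # strip extension: image1.tif -> image1
        tesseract "$image" "$base" || return 1 # writes "$base.txt"; stop on failure
        cat "$base.txt" >> "$combined"         # append this page's text
    done
}
```

Something like "ocr_pages output.txt image1.tif image2.tif image3.tif" would then leave all pages in output.txt. Note that a glob such as image*.tif sorts lexically, so image10.tif comes before image2.tif unless the page numbers are zero-padded.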