[tesseract-ocr] Multiple pages in parallel?

2018-03-10 Thread Matthew Lai
Hello! According to the FAQ[1], if I run tesseract on a multi-page image, it should process the pages in parallel. I am converting a 10-page TIF (in one file) into PDF, and looking at *top*, it seems like tesseract never uses more than about 250% CPU (I have 16 cores / 32 threads on my

Re: [tesseract-ocr] Re: I do not include 'chi_tra' in my tessdata folder . What is it ? I have seen language-specific.sh

2018-03-10 Thread ShreeDevi Kumar
Lang1+lang2 should work. If it does not, please open an issue with an example image. If lang2 is English, you may want to try the script level traineddata, which includes English with the other languages . Please take a look at the readme file in tessdata_fast which explains about script level

[tesseract-ocr] Re: I do not include 'chi_tra' in my tessdata folder . What is it ? I have seen language-specific.sh

2018-03-10 Thread Gonil Rho
2), 3): I'm wondering about using tesseract 4.0 for multiple language, too. After searching & testing a while, I found that it seems not working the old method for tesseract 3. (e.g. running with '-l lang1+lang2' option) Is there any other method that I have to try? Or I have to train

Re: [tesseract-ocr] I do not include 'chi_tra' in my tessdata folder . What is it ? I have seen language-specific.sh

2018-03-10 Thread 이경준
Sorry ... I just want to know tesseract4.0 sorry -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this