On Tue, Mar 8, 2016 at 6:03 AM, Bojan Djuric <[email protected]> wrote:
> > Sorry, I tried the -c tessedit_load_sublangs="" option, which did not > work. > Yes, I said that didn't work. I'd suggest trying the workaround that I said would work, namely, unpacking the config file from srp_latn.traineddata, editing it to remove the offending line, and repacking it. The necessary commands are in my original message below. Tom On Monday, March 7, 2016 at 5:18:36 PM UTC+1, Tom Morris wrote: > >> On Mon, Mar 7, 2016 at 3:39 AM, Bojan Djuric <[email protected]> wrote: >> >>> Tried that, did not work for me either :) >>> >> >> I mentioned two things. Which one(s) did you try? If you tried >> editing/replacing the config file in srp_latn.traineddata and it didn't >> work, you can provide more details on your exact steps and the results? >> >> >>> Workaround could be to copy srp (cyrillic), and osd files to another >>> folder, and use --tessdata-dir parameter. >>> But that would complicate things. >>> >>> On Sunday, March 6, 2016 at 8:27:27 PM UTC+1, Tom Morris wrote: >>>> >>>> >>>> I was hoping you'd be able to override that on the command line, using -c >>>> tessedit_load_sublangs="", but that doesn't seem to work with the >>>> current order of evaluation, at least with my limited testing. >>>> >>>> If you have the training tools installed, you can patch your copy of >>>> the language file by doing the following: >>>> >>>> $ combine_tessdata -e srp_latn.traineddata srp_latn.config >>>> $ cp /dev/null srp_latn.config >>>> >>>> $ combine_tessdata -o srp_latn.traineddata srp_latn.config >>>> >>>> That will remove the problematic line from your config (you might want >>>> to copy srp_latn to srp_latn_only or some other name if you'd like both >>>> behaviors available to you). >>>> >>> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAE9vqEEA7tH-XGSzd3pq059QzE9Sf0tsfw_8m-CXqmy9Tzczjw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

