Bug#658099: tesseract-ocr: unconditionally depends on tesseract-ocr-eng

Jeff Breidenbach Sat, 04 Feb 2012 16:09:17 -0800

First, sorry for closing bug earlier; it must have gotten lost in the shuffle.


I think I'm going to simplify the situation. Upstream has asked every Tesseract
to include dependency on "equ" which is equations, "osd" which is orientation
and script detection. I'm going to add a direct dependency on "eng" as well
and give up on the concept of the tesseract-ocr-language virtual package.

Benefits:

1) Simplification. Smooth package transition from 2.04 -> 3.0.x has taken quite
some effort and this is one less thing to worry about and get wrong.

2) Laziness. I can make this change here, packaging will be "correct"
and I don't
have  to update all 65 language packages.

3) Correctness. Tesseract is quite happy to ignore English training data and
do French-only OCR, for example.

Downsides:

1) Unnecessary download bandwidth + disk use for people with zero interest
in English OCR.

2) Risk of getting accused of English language imperialism. This is
not intentional.
I am all for math equation imperialism if anyone cares about that.



-- 
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]

Bug#658099: tesseract-ocr: unconditionally depends on tesseract-ocr-eng

Reply via email to