https://bugzilla.wikimedia.org/show_bug.cgi?id=42466
Doug <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |[email protected] | |m Severity|critical |major --- Comment #5 from Doug <[email protected]> 2012-12-01 01:37:12 UTC --- (In reply to comment #2) > > Until this bug is corrected, all Wikisource projects will be unable to begin > any new texts from DjVu source files. Well, not to minimize this bug but that's not true, it's only that they won't be able to rely on the text layer. Frequently, the layer is such crap, especially on older texts, that this has no practical effect. Furthermore, we have our own OCR tool that can be used on the fly with a gadget that's implemented by a button above the edit box that is turned on by default (i.e IPs can use it). For example, I just generated https://fr.wikisource.org/wiki/Page:De_la_D%C3%A9monomanie_des_Sorciers_%281587%29.djvu/141 using that tool. Considering that's a 16th C. work, that's about as good as I'd expect from the text layer associated with the djvu. Furthermore the text layer can be copy pasted in or even botted in. The text layer on this particular work is about equal to what the tool generated and presumably the folks at IA were able to optimize ABBYY FineReader 8.0 for the language and type, unlike the built in tool, which I think still uses tesseract. This isn't critical either, there is no internal data loss, which is part of our definition of critical. It's just a loss of function. The text layer is still there in the file on commons. I'm not saying this isn't an important bug, I'm saying, if you're a wikisourcerer, don't feel tied to the text layer that comes with a djvu or pdf. -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug. You are on the CC list for the bug. _______________________________________________ Wikibugs-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
