https://bugzilla.wikimedia.org/show_bug.cgi?id=42466

Doug <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |[email protected]
                   |                            |m
           Severity|critical                    |major

--- Comment #5 from Doug <[email protected]> 2012-12-01 01:37:12 UTC 
---
(In reply to comment #2)
> 
> Until this bug is corrected, all Wikisource projects will be unable to begin
> any new texts from DjVu source files.

Well, not to minimize this bug but that's not true, it's only that they won't
be able to rely on the text layer.  Frequently, the layer is such crap,
especially on older texts, that this has no practical effect.  Furthermore, we
have our own OCR tool that can be used on the fly with a gadget that's
implemented by a button above the edit box that is turned on by default (i.e
IPs can use it).  For example, I just generated
https://fr.wikisource.org/wiki/Page:De_la_D%C3%A9monomanie_des_Sorciers_%281587%29.djvu/141
using that tool.  Considering that's a 16th C. work, that's about as good as
I'd expect from the text layer associated with the djvu.  Furthermore the text
layer can be copy pasted in or even botted in.  The text layer on this
particular work is about equal to what the tool generated and presumably the
folks at IA were able to optimize ABBYY FineReader 8.0 for the language and
type, unlike the built in tool, which I think still uses tesseract.

This isn't critical either, there is no internal data loss, which is part of
our definition of critical.  It's just a loss of function.  The text layer is
still there in the file on commons.

I'm not saying this isn't an important bug, I'm saying, if you're a
wikisourcerer, don't feel tied to the text layer that comes with a djvu or pdf.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to