https://bugzilla.wikimedia.org/show_bug.cgi?id=68638

--- Comment #16 from Bawolff (Brian Wolff) <[email protected]> ---
(In reply to George Orwell III from comment #14)
> 
> That said, it begs the question:   Can one actually generate such an .xml
> file from a .DjVu file under the current state of mediawiki affairs or not?
> 
> Followed up by:  If so, why in blazes are we (Wikisource) still stuck with
> plain text dumps instead of smartly mapped, rich XML-based ones? (See bug
> 57807 here).

This is getting highly off topic, but presumably because nobody has asked for
an XML-formatted dump of the DjVu text layer to be generated for Wikisource.
I'm unclear what such a thing would be used for, but if you have a use case for
Wikisource, I encourage you to file a (separate) bug about it. [No guarantees
anyone would do anything about such a bug, but we fix 0 of the bugs that are
not recorded.]

For reference, the current state of affairs is that when a DjVu (or PDF) file
is uploaded, we collect the text layer of each page and store the text on a
page-by-page basis (you can get it all from the API if so inclined).
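
As a rough illustration of the API route mentioned above, here is a minimal
Python sketch that only builds a request URL for the standard MediaWiki action
API (action=query, prop=imageinfo, iiprop=metadata). The wiki host and the file
title are hypothetical placeholders, and it is an assumption on my part that
the stored per-page text shows up in the metadata returned for a given file;
check the response on your own wiki before relying on it.

```python
from urllib.parse import urlencode

# Assumption: a standard MediaWiki action API endpoint. The host is only an
# illustrative placeholder; substitute the wiki that hosts your file.
API_ENDPOINT = "https://commons.wikimedia.org/w/api.php"


def build_metadata_query(file_title):
    """Return a URL asking the API for the stored metadata of one file."""
    params = {
        "action": "query",
        "prop": "imageinfo",
        "iiprop": "metadata",  # assumption: the text layer is exposed here
        "titles": file_title,
        "format": "json",
    }
    return API_ENDPOINT + "?" + urlencode(params)


# "File:Example.djvu" is a hypothetical title used only for illustration.
url = build_metadata_query("File:Example.djvu")
print(url)
```

The sketch stops at building the URL so that it runs without network access;
fetching and parsing the JSON response is left to whatever HTTP client you
prefer.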

