https://bugzilla.wikimedia.org/show_bug.cgi?id=68638
--- Comment #16 from Bawolff (Brian Wolff) <[email protected]> ---
(In reply to George Orwell III from comment #14)
>
> That said, it begs the question: Can one actually generate such an .xml
> file from a .DjVu file under the current state of mediawiki affairs or not?
>
> Followed up by: If so, why in blazes are we (Wikisource) still stuck with
> plain text dumps instead of smartly mapped, rich XML-based ones? (See bug
> 57807 here).

This is getting highly off topic, but presumably because nobody has asked for an XML-formatted dump of the DjVu text layer to be generated for Wikisource. I'm unclear what such a thing would be used for, but if you have a use case for Wikisource, I encourage you to file a (separate) bug about it [No guarantees anyone would do anything about such a bug, but we fix 0 of the bugs that are not recorded].

For reference, the current state of affairs is that when a DjVu (or PDF) file is uploaded, we collect the text layer of each page and store the text on a page-by-page basis (you can get it all from the API if so inclined).

--
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
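
As a rough illustration of the "get it all from the API" remark in the comment above, here is a minimal Python sketch that builds an imageinfo query and pulls the metadata entries out of a decoded JSON response. Several things here are assumptions, not statements from the comment: the endpoint URL, the example file title, the helper names `build_metadata_query` and `metadata_entries`, and the idea that the stored text layer is exposed through `iiprop=metadata`. The exact field that carries the per-page text may differ between MediaWiki versions, so treat this as a starting point only.

```python
from urllib.parse import urlencode

# Assumption: a standard MediaWiki api.php endpoint. The original comment
# does not name one; substitute the wiki you are actually querying.
API_ENDPOINT = "https://commons.wikimedia.org/w/api.php"


def build_metadata_query(file_title, endpoint=API_ENDPOINT):
    """Build a URL asking the API for the stored metadata of one file.

    Assumption: the extracted text layer is exposed via iiprop=metadata.
    """
    params = {
        "action": "query",
        "prop": "imageinfo",
        "iiprop": "metadata",
        "titles": file_title,
        "format": "json",
    }
    return endpoint + "?" + urlencode(params)


def metadata_entries(response):
    """Collect every metadata entry found in a decoded JSON response.

    Missing keys are tolerated, so an empty or error response yields [].
    """
    entries = []
    pages = response.get("query", {}).get("pages", {})
    for page in pages.values():
        for info in page.get("imageinfo", []):
            # "metadata" may be null for files without any stored metadata.
            entries.extend(info.get("metadata") or [])
    return entries
```

To use it, fetch the URL returned by `build_metadata_query("File:Example.djvu")` (a hypothetical title) with any HTTP client, decode the JSON body, and pass the result to `metadata_entries`.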
