Alex Brollo, 14/06/2013 08:45:
IA gives abbyy xml files too (as .gz files); I opened one of them after
a suggestion of Phe, and I'm dreaming about extracting anything useful
to help proofreading. The only "small" problem is that I barely know
what a xml is and that is similat to html in its (well-formed)
structure, and that something called XLST exists. :-(

Is any of you working about abbyy xml files with a "little bit" of more
skill?

Someone produced something here: <https://groups.google.com/forum/?fromgroups#!topic/abbyy-ocr-for-linux/Ih7no7KwslA> Also, from 2012: a planned "lura2hocr -- convert Luratech Abbyy XML to hOCR" https://code.google.com/p/hocr-tools/wiki/PageName

Nemo

_______________________________________________
Wikisource-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikisource-l

Reply via email to