https://bugzilla.wikimedia.org/show_bug.cgi?id=29564

--- Comment #9 from Marcin Cieślak <[email protected]> ---
Created attachment 11679
  --> https://bugzilla.wikimedia.org/attachment.cgi?id=11679&action=edit
Dump of the text node of page 803932

Attached please find the result of running:

import xmlreader
out = open("803932.txt", "w")
z = xmlreader.XmlDump("huwiki-20130120-pages-meta-current.xml.bz2")
for i in z.parse():
    if i.id in ["803932"]:
       out.write(i.text.encode("utf-8"))
       break
out.close()

What's interesting, this body looks more complete than what is acutally
displayed under the URL of this bug. Is the output prepared for export of
better quality than the rendered wikipage? Interesting.

-- 
You are receiving this mail because:
You are watching all bug changes.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to