https://bugzilla.wikimedia.org/show_bug.cgi?id=22137
Summary: mwdumper dies with "not a name start character: "U+26""
error
Product: mwdumper
Version: unspecified
Platform: PC
OS/Version: Linux
Status: NEW
Severity: critical
Priority: Normal
Component: general
AssignedTo: [email protected]
ReportedBy: [email protected]
$mwdumper --format=sql:1.5 itwiki-20100108-pages-articles.xml.bz2 | lzma -c >
itwiki-20100108-pages-articles.sql.lzma
1000 pages (88,755/sec), 1000 revs (88,755/sec)
2000 pages (65,935/sec), 2000 revs (65,935/sec)
3000 pages (67,621/sec), 3000 revs (67,621/sec)
4000 pages (80,336/sec), 4000 revs (80,336/sec)
5000 pages (80,457/sec), 5000 revs (80,457/sec)
Exception in thread "main" java.io.IOException: not a name start character:
"U+26"
at org.mediawiki.importer.XmlDumpReader.readDump(mwdumper)
at org.mediawiki.dumper.Dumper.main(mwdumper)
Caused by: org.xml.sax.SAXParseException: not a name start character: "U+26"
at gnu.xml.stream.SAXParser.parse(libgcj.so.81)
at javax.xml.parsers.SAXParser.parse(libgcj.so.81)
at javax.xml.parsers.SAXParser.parse(libgcj.so.81)
at org.mediawiki.importer.XmlDumpReader.readDump(mwdumper)
...1 more
Caused by: javax.xml.stream.XMLStreamException: not a name start character:
"U+26"
at gnu.xml.stream.XMLParser.error(libgcj.so.81)
at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.81)
at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.81)
at gnu.xml.stream.XMLParser.readCharData(libgcj.so.81)
at gnu.xml.stream.XMLParser.next(libgcj.so.81)
at gnu.xml.stream.XMLParser.hasNext(libgcj.so.81)
at gnu.xml.stream.SAXParser.parse(libgcj.so.81)
...4 more
--
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l