[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #11 from piotr.jagiel...@op.pl --- The only workaround I came up with is trying a different dump. I was able to import enwiki-20140707-pages-articles.xml. -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 Andre Klapper changed: What|Removed |Added Keywords||upstream Status|UNCONFIRMED |NEW Ever confirmed|0 |1 --- Comment #10 from Andre Klapper --- Oh, thanks for finding out! -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #9 from Chris Padfield --- And definitely confirmed: 649,000 pages (1,281.975/sec), 649,000 revs (1,281.975/sec) Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 2048 at org.apache.xerces.impl.io.UTF8Reader.read(Unknown Source) at org.apache.xerces.impl.XMLEntityScanner.load(Unknown Source) at org.apache.xerces.impl.XMLEntityScanner.scanContent(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanContent(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) at javax.xml.parsers.SAXParser.parse(SAXParser.java:392) at javax.xml.parsers.SAXParser.parse(SAXParser.java:195) at org.mediawiki.importer.XmlDumpReader.readDump(XmlDumpReader.java:88) at org.mediawiki.dumper.Dumper.main(Dumper.java:142) -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 Chris Padfield changed: What|Removed |Added CC||chris.padfi...@deskpro.com --- Comment #8 from Chris Padfield --- This is a Xerces bug, documented at https://issues.apache.org/jira/browse/XERCESJ-1257 The workaround suggested is to use the JVM's UTF-8 reader instead of the Xerces UTF8Reader. -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #7 from mad2on...@yahoo.fr --- piotr did you find out the problem ? -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #6 from mad2on...@yahoo.fr --- 13,200,000 pages (5,538.948/sec), 13,200,000 revs (5,538.948/sec) Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 8192 at com.sun.org.apache.xerces.internal.impl.io.UTF8Reader.read(UTF8Reader.java:546) at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.load(XMLEntityScanner.java:1753) at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.arrangeCapacity(XMLEntityScanner.java:1629) at com.sun.org.apache.xerces.internal.impl.XMLEntityScanner.skipString(XMLEntityScanner.java:1667) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanEndElement(XMLDocumentFragmentScannerImpl.java:1747) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(XMLDocumentFragmentScannerImpl.java:2957) at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(XMLDocumentScannerImpl.java:606) at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(XMLDocumentFragmentScannerImpl.java:510) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:848) at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:777) at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParser.java:141) at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1213) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:649) at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl.parse(SAXParserImpl.java:333) at javax.xml.parsers.SAXParser.parse(SAXParser.java:195) at org.mediawiki.importer.XmlDumpReader.readDump(XmlDumpReader.java:96) at org.mediawiki.dumper.Dumper.main(Dumper.java:142) -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 mad2on...@yahoo.fr changed: What|Removed |Added CC||mad2on...@yahoo.fr --- Comment #5 from mad2on...@yahoo.fr --- I have the same problem with the file enwiki-20140502-pages-articles.xml -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #4 from piotr.jagiel...@op.pl --- Is there anyone here that uses mwdumper to import English Wikipedia XML dump? I tried several ones from past few months and I'm always running into some blocking issue. -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #3 from piotr.jagiel...@op.pl --- Was anyone here able to import the latest dump (20140402) with mwdumper? If there is a chance that it's an issue with my local environment I'd be glad to know. -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #2 from Andre Klapper --- It'll be confirmed when a second person has reproduced it. -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 --- Comment #1 from piotr.jagiel...@op.pl --- Why is it unconfirmed? I run into it into again with the latest dump. Do you need additional information to reproduce it? -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 Andre Klapper changed: What|Removed |Added Priority|Unprioritized |Normal -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 57236] mwdumper fails to import English wikipedia dump: ArrayIndexOutOfBoundsException; error in SQL syntax
https://bugzilla.wikimedia.org/show_bug.cgi?id=57236 Andre Klapper changed: What|Removed |Added Summary|mwdumper fails to import|mwdumper fails to import |English wikipedia dump |English wikipedia dump: ||ArrayIndexOutOfBoundsExcept ||ion; error in SQL syntax Severity|blocker |critical -- You are receiving this mail because: You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l