Re: [Xmldatadumps-l] bz2 tools

2016-01-23 Thread Bernardo Sulzbach
Hi G. Gonter, Apparently there was an issue with the multistream one (although I am quite sure that I had problems with the "other one" myself, twice). You are the second (maybe third) person to report that could decompress it OK, so it possibly was a problem with my machine. I don't even have an

Re: [Xmldatadumps-l] bz2 tools

2016-01-23 Thread Gerhard Gonter
On Wed, Jan 20, 2016 at 7:10 PM, Bernardo Sulzbach wrote: > If you could decompress it correctly (20151201), don't mind my report. > I just would want it removed if it was confirmed to be a problematic > file. The file that I downloaded from dumps.wikimedia.org (208.80.154.11) earlier today was u

Re: [Xmldatadumps-l] bz2 tools

2016-01-20 Thread Bernardo Sulzbach
I strongly believe so, but I didn't kept the files so I can't do it now to confirm. Also, that Linux installation is gone. I have seen this problem [with 20151201] occur three times: once in this list (Richard F.), once on my local machine (Bernardo S.) and once in a JIRA (was it JIRA?) tracker. S

Re: [Xmldatadumps-l] bz2 tools

2016-01-20 Thread Tim Landscheidt
Bernardo Sulzbach wrote: >> I did not have problems unziping the file >> enwiki-20151201-pages-articles.xml.bz2 >> This was done last month, on linux and on my home computer. My program has >> run multiple times on the file and looks at each entry, so no obvious signs >> of corruption. > OK. I

Re: [Xmldatadumps-l] bz2 tools

2016-01-18 Thread Bernardo Sulzbach
On Mon, Jan 18, 2016 at 4:04 AM, Bryan White wrote: > I did not have problems unziping the file > enwiki-20151201-pages-articles.xml.bz2 > > This was done last month, on linux and on my home computer. My program has > run multiple times on the file and looks at each entry, so no obvious signs > o

Re: [Xmldatadumps-l] bz2 tools

2016-01-17 Thread Bryan White
I did not have problems unziping the file enwiki-20151201-pages-articles.xml.bz2 This was done last month, on linux and on my home computer. My program has run multiple times on the file and looks at each entry, so no obvious signs of corruption. Bryan On Sun, Jan 17, 2016 at 5:03 PM, Bernardo

Re: [Xmldatadumps-l] bz2 tools

2016-01-17 Thread Bernardo Sulzbach
Sure, I did suggest September, a dump I never had problems with. However, I did not test latest English myself. However, if _anyone could confirm the issue with English December_, **let's remove it from the page**? This **matters** for developers that rely on these dumps and will reduce server lo

Re: [Xmldatadumps-l] bz2 tools

2016-01-17 Thread Platonides
On 18/01/16 00:36, Bernardo Sulzbach wrote: Platonides, we were both talking about 20151201 and you tested 20160113, am I correct? I said I reproduced the mentioned problem with that file, not that all files were problematic. Whoops. Sorry :( Seems I didn't notice and just clicked to downloa

Re: [Xmldatadumps-l] bz2 tools

2016-01-17 Thread Bernardo Sulzbach
On Fri, Jan 15, 2016 at 11:44 PM, Platonides wrote: > On 16/01/16 02:30, Richard Farmbrough wrote: >> >> I have problems bunzip2ing pages-articles files. WinRAR fails at 37G, >> and bunzip2 fails at some point >> 14g though it "helpfully" cleans up >> after itself. >> >> Bunzip2 v 1.0.6 >> >> >bu

Re: [Xmldatadumps-l] bz2 tools

2016-01-17 Thread Platonides
On 16/01/16 02:44, Platonides wrote: On 16/01/16 02:30, Richard Farmbrough wrote: I have problems bunzip2ing pages-articles files. WinRAR fails at 37G, and bunzip2 fails at some point >> 14g though it "helpfully" cleans up after itself. Bunzip2 v 1.0.6 >bunzip2 enwiki-20151201-pages-articles.x

Re: [Xmldatadumps-l] bz2 tools

2016-01-16 Thread Richard Farmbrough
Thanks for the responses, it's given me a few things to try. Various people wrote... ___ Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l

Re: [Xmldatadumps-l] bz2 tools

2016-01-15 Thread Platonides
On 16/01/16 02:30, Richard Farmbrough wrote: I have problems bunzip2ing pages-articles files. WinRAR fails at 37G, and bunzip2 fails at some point >> 14g though it "helpfully" cleans up after itself. Bunzip2 v 1.0.6 >bunzip2 enwiki-20151201-pages-articles.xml.bz2 bunzip2: I/O or other error,

Re: [Xmldatadumps-l] bz2 tools

2016-01-15 Thread Bernardo Sulzbach
Hello, Richard. I've had some problems with the December dumps myself. I will not guarantee it for you, but if you download and test it, September will work. Good luck. It's not about the tool, the problem seems to be the dump file. ___ Xmldatadumps-l m

Re: [Xmldatadumps-l] bz2 tools

2016-01-15 Thread John
Have you tried 7zip ? On Fri, Jan 15, 2016 at 8:30 PM, Richard Farmbrough < rich...@farmbrough.co.uk> wrote: > I have problems bunzip2ing pages-articles files. WinRAR fails at 37G, and > bunzip2 fails at some point >> 14g though it "helpfully" cleans up after > itself. > > Bunzip2 v 1.0.6 > > >b

[Xmldatadumps-l] bz2 tools

2016-01-15 Thread Richard Farmbrough
I have problems bunzip2ing pages-articles files. WinRAR fails at 37G, and bunzip2 fails at some point >> 14g though it "helpfully" cleans up after itself. Bunzip2 v 1.0.6 >bunzip2 enwiki-20151201-pages-articles.xml.bz2 bunzip2: I/O or other error, bailing out. Possible reason follows. bunzi