Re: [IO] Using file filters with Java NIO file tree walking

2020-10-14 Thread Bruno P. Kinoshita
 Had a look at the commit [1] and found no issues. Really impressive work Gary. 
You've updated the code and also added a lot of examples in the Javadocs. 
Really golden.
I used simple code snippets to understand how the new API works, and quickly 
test it locally. But will try to update commons-imaging and/or Jena to use the 
latest snapshot in the IDE and test it some more.


Thanks!
Bruno


[1] 
https://github.com/apache/commons-io/commit/8b6d4969ffb55bf7301a44a8156f02b0213e6d68
On Wednesday, 14 October 2020, 3:40:01 am NZDT, Gary Gregory 
 wrote:  
 
 On Sat, Oct 10, 2020 at 11:08 PM Bruno P. Kinoshita 
wrote:

>  Thanks for heads up Gary! Looking forward to reading/reviewing the new
> code!
> Bruno
>

Hi Bruno,

It's in there now, feel free to take a look and review, change, or fiddle
;-)

Gary


>
>    On Sunday, 11 October 2020, 3:16:20 am NZDT, Gary Gregory <
> garydgreg...@gmail.com> wrote:
>
>  Hi All:
>
> FYI,
>
> I am working and nearly done adapting our file filters to work with
> java.nio.file.Files.walk APIs.
>
> My goal overall in Commons is to let our code fit in better with the more
> modern NIO and Path based APIs.
>
> Gary
>
  

Re: [VOTE] Release Apache Commons Net 3.7.2 based on RC1

2020-10-14 Thread Bruno P. Kinoshita
   [x] +1 Release these artifacts


Built from tag with Maven 3.5.4 and Java 8. Build passing OK, reports look good.
Checked signatures, found no issues. Inspected some files in binary/source 
folders of the dist area, all looking good.
Thanks for managing this release Gary.


Bruno

On Thursday, 15 October 2020, 5:20:05 am NZDT, Gary Gregory 
 wrote:  
 
 We have fixed a few bugs since Apache Commons Net 3.7.1 was released, so I
would like to release Apache Commons Net 3.7.2.

Apache Commons Net 3.7.2 RC1 is available for review here:
    https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1 (svn
revision 41933)

The Git tag commons-net-3.7.2-RC1 commit for this RC is
deeb863e66598f51aac6410a85b814f721542ffe which you can browse here:

https://gitbox.apache.org/repos/asf?p=commons-net.git;a=commit;h=deeb863e66598f51aac6410a85b814f721542ffe
You may checkout this tag using:
    git clone https://gitbox.apache.org/repos/asf/commons-net.git --branch
commons-net-3.7.2-RC1 commons-net-3.7.2-RC1

Maven artifacts are here:

https://repository.apache.org/content/repositories/orgapachecommons-1533/commons-net/commons-net/3.7.2/

These are the artifacts and their hashes:

#Release SHA-512s
#Wed Oct 14 12:02:40 EDT 2020
commons-net-3.7.2-bin.tar.gz=0ec5c4a60c24871ec6f975be006eb9cb0e14f4a6e1a55a18bb6d005777cc9cb51e4b8dcd949f07cfbfe1568e009d45a587bc3787d84ce44cdf7f48cfe7ec69a1
commons-net-3.7.2-bin.zip=458dea745fea7b31e0813dcdedafebdae9254a30e8c303fad0d7fc5956c4a62be913fd1cfa3db10bebd7b832c033b853538fcecb781af18788af966f3279612d
commons-net-3.7.2-javadoc.jar=fcfde9980a59ebd14f9b9b033489ae286b3ab965e6c13ae1bfc34ef3ecd6ee0943d32f0b8cc83de88de624e5ad22b12bb36504d176968282a287d757856aded5
commons-net-3.7.2-sources.jar=a6f2758ad105fc18658cbb406e64ff08a58335970d34fea9ef28936b0d2c838271f14667edc3c6ea5654d782721199ff10cc60a8b44a0b1c7fd87c717a2bbe99
commons-net-3.7.2-src.tar.gz=501ddb1a1e84a170355377194efb630f28b2708d73fb9d724af8fc9f313055e38d957ceb6d3f03ea43d33bf2b3574908efa3cdad566c8a426fafc6306f0d1306
commons-net-3.7.2-src.zip=4fa1cc282616017785ec28266ba43acbda46f03d4d7288a41db26862bc0c8cda59d4f5235b144465ece0fcc13f95495de52a8ed6f4e2bed7b7b0a81231f9c073
commons-net-3.7.2-test-sources.jar=04f59e88d5349bd836a78e38386dca132f4116c53eefc8d58bb723c859c8323a0bb4d221495ce35f772653b5db8b5945acfc2e927265ce9d06e29f896540d6fe
commons-net-3.7.2-tests.jar=590d06b3d4db07d2e916599b908ca18d24be671fd04daf5bfa500b721b16cf5481c0517be10b1d1b5ed3766009941d1777d8da0a860d24a82b7d895b516b0180
commons-net-examples-3.7.2.jar=227a3584583774502f36740b3533d26041c00b90e8a47d54c23268471444d3ea18c0c3b31211d7ee17a39e1ca684718ef093ecba5b49ae03128af875e43f8107
commons-net-ftp-3.7.2.jar=89d9a227de7c9fc6646ede9deaf177c04bf9e79512be3559e24a659731f2898a2abf9854acd62153f2138e1379daddbf2f9f32fa492aaeb94b6c5ff970c4539d

I have tested this with:

mvn -V -Prelease -Ptest-deploy -P jacoco -P japicmp clean package site
deploy

using:

Apache Maven 3.6.3 (cecedd343002696d0abb50b32b541b8a6ba2883f)
Maven home: C:\Java\apache-maven-3.6.3\bin\..
Java version: 1.8.0_251, vendor: Oracle Corporation, runtime: C:\Program
Files\Java\jdk1.8.0_251\jre
Default locale: en_US, platform encoding: Cp1252
OS name: "windows 10", version: "10.0", arch: "amd64", family: "windows"

Details of changes since 3.7.1 are in the release notes:

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/RELEASE-NOTES.txt

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/changes-report.html

Site:

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/index.html
    (note some *relative* links are broken and the 3.7.2 directories are
not yet created - these will be OK once the site is deployed.)

    CLIRR Report (compared to 3.7.1):

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/clirr-report.html

    JApiCmp Report (compared to 3.7.1):

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/japicmp.html

RAT Report:

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/rat-report.html

KEYS:
  https://www.apache.org/dist/commons/KEYS

Please review the release candidate and vote.
This vote will close no sooner than 72 hours from now.

  [ ] +1 Release these artifacts
  [ ] +0 OK, but...
  [ ] -0 OK, but really should fix...
  [ ] -1 I oppose this release because...

Thank you,

Gary Gregory,
Release Manager (using key 86fdc7e2a11262cb)

For following is intended as a helper and refresher for reviewers.

Validating a release candidate
==

These guidelines are NOT complete.

Requirements: Git, Java, Maven.

You can validate a release from a release candidate (RC) tag as follows.

1) Clone and checkout the RC tag

git clone https://gitbox.apache.org/repos/asf/commons-net.git --branch
commons-net-3.7.2-RC1 commons-net-3.7.2-RC1
cd commons-net-3.7.2-RC1

2) Check Apache licenses

This step is not required if the site includes a RAT report page which you
then must check.

mvn 

Re: JUnit assertAll and assertThrows question

2020-10-14 Thread Gary Gregory
For my money:

- I REALLY like assertThrows
- Might as well go to JUnit 5 if you feel up for it.
- BUT: My biggest pain point ATM is that while Commons VFS is on JUnit 4,
most tests are still JUnit 3 (TestCase) classes that won't run properly
from Eclipse.

So I would really like help getting VFS to at least be off of anything
JUnit 3-like and at least to JUnit 4 but that won't stop a release which I
hope to do within a week or two.

TY!
Gary

On Wed, Oct 14, 2020 at 3:56 PM Romain Manni-Bucau 
wrote:

> Hi John,
>
> Would be +1 from me if:
>
> 1. It does not require to move from junit4 to junit5 (no gain IMHO)
> 2. Test goes further than a Exception rule or @Test(expected)
>
> If both points are matched it makes sense to me, otherwise it will likely
> add more issues from my experience.
>
>
> Le mer. 14 oct. 2020 à 21:40, John Patrick  a
> écrit :
>
> > before i waste time looking at upgrading tests...
> >
> > any objections if i upgrade tests to use assertAll and assertThrows
> > introduced in JUnit jupiter?
> >
> > I see it as less tech debt removal and I'm happy to spend time doing
> > the upgrade which I've done from maybe several projects now. Just
> > don't want to get to raising PR and them being rejected, which I feel
> > happens with everything I try to help out with related to commons
> > projects at the moment...
> >
> > cheers,
> > john
> >
> > -
> > To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
> > For additional commands, e-mail: dev-h...@commons.apache.org
> >
> >
>


Re: JUnit assertAll and assertThrows question

2020-10-14 Thread Romain Manni-Bucau
Hi John,

Would be +1 from me if:

1. It does not require to move from junit4 to junit5 (no gain IMHO)
2. Test goes further than a Exception rule or @Test(expected)

If both points are matched it makes sense to me, otherwise it will likely
add more issues from my experience.


Le mer. 14 oct. 2020 à 21:40, John Patrick  a
écrit :

> before i waste time looking at upgrading tests...
>
> any objections if i upgrade tests to use assertAll and assertThrows
> introduced in JUnit jupiter?
>
> I see it as less tech debt removal and I'm happy to spend time doing
> the upgrade which I've done from maybe several projects now. Just
> don't want to get to raising PR and them being rejected, which I feel
> happens with everything I try to help out with related to commons
> projects at the moment...
>
> cheers,
> john
>
> -
> To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
> For additional commands, e-mail: dev-h...@commons.apache.org
>
>


JUnit assertAll and assertThrows question

2020-10-14 Thread John Patrick
before i waste time looking at upgrading tests...

any objections if i upgrade tests to use assertAll and assertThrows
introduced in JUnit jupiter?

I see it as less tech debt removal and I'm happy to spend time doing
the upgrade which I've done from maybe several projects now. Just
don't want to get to raising PR and them being rejected, which I feel
happens with everything I try to help out with related to commons
projects at the moment...

cheers,
john

-
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org



[VOTE] Release Apache Commons Net 3.7.2 based on RC1

2020-10-14 Thread Gary Gregory
We have fixed a few bugs since Apache Commons Net 3.7.1 was released, so I
would like to release Apache Commons Net 3.7.2.

Apache Commons Net 3.7.2 RC1 is available for review here:
https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1 (svn
revision 41933)

The Git tag commons-net-3.7.2-RC1 commit for this RC is
deeb863e66598f51aac6410a85b814f721542ffe which you can browse here:

https://gitbox.apache.org/repos/asf?p=commons-net.git;a=commit;h=deeb863e66598f51aac6410a85b814f721542ffe
You may checkout this tag using:
git clone https://gitbox.apache.org/repos/asf/commons-net.git --branch
commons-net-3.7.2-RC1 commons-net-3.7.2-RC1

Maven artifacts are here:

https://repository.apache.org/content/repositories/orgapachecommons-1533/commons-net/commons-net/3.7.2/

These are the artifacts and their hashes:

#Release SHA-512s
#Wed Oct 14 12:02:40 EDT 2020
commons-net-3.7.2-bin.tar.gz=0ec5c4a60c24871ec6f975be006eb9cb0e14f4a6e1a55a18bb6d005777cc9cb51e4b8dcd949f07cfbfe1568e009d45a587bc3787d84ce44cdf7f48cfe7ec69a1
commons-net-3.7.2-bin.zip=458dea745fea7b31e0813dcdedafebdae9254a30e8c303fad0d7fc5956c4a62be913fd1cfa3db10bebd7b832c033b853538fcecb781af18788af966f3279612d
commons-net-3.7.2-javadoc.jar=fcfde9980a59ebd14f9b9b033489ae286b3ab965e6c13ae1bfc34ef3ecd6ee0943d32f0b8cc83de88de624e5ad22b12bb36504d176968282a287d757856aded5
commons-net-3.7.2-sources.jar=a6f2758ad105fc18658cbb406e64ff08a58335970d34fea9ef28936b0d2c838271f14667edc3c6ea5654d782721199ff10cc60a8b44a0b1c7fd87c717a2bbe99
commons-net-3.7.2-src.tar.gz=501ddb1a1e84a170355377194efb630f28b2708d73fb9d724af8fc9f313055e38d957ceb6d3f03ea43d33bf2b3574908efa3cdad566c8a426fafc6306f0d1306
commons-net-3.7.2-src.zip=4fa1cc282616017785ec28266ba43acbda46f03d4d7288a41db26862bc0c8cda59d4f5235b144465ece0fcc13f95495de52a8ed6f4e2bed7b7b0a81231f9c073
commons-net-3.7.2-test-sources.jar=04f59e88d5349bd836a78e38386dca132f4116c53eefc8d58bb723c859c8323a0bb4d221495ce35f772653b5db8b5945acfc2e927265ce9d06e29f896540d6fe
commons-net-3.7.2-tests.jar=590d06b3d4db07d2e916599b908ca18d24be671fd04daf5bfa500b721b16cf5481c0517be10b1d1b5ed3766009941d1777d8da0a860d24a82b7d895b516b0180
commons-net-examples-3.7.2.jar=227a3584583774502f36740b3533d26041c00b90e8a47d54c23268471444d3ea18c0c3b31211d7ee17a39e1ca684718ef093ecba5b49ae03128af875e43f8107
commons-net-ftp-3.7.2.jar=89d9a227de7c9fc6646ede9deaf177c04bf9e79512be3559e24a659731f2898a2abf9854acd62153f2138e1379daddbf2f9f32fa492aaeb94b6c5ff970c4539d

I have tested this with:

mvn -V -Prelease -Ptest-deploy -P jacoco -P japicmp clean package site
deploy

using:

Apache Maven 3.6.3 (cecedd343002696d0abb50b32b541b8a6ba2883f)
Maven home: C:\Java\apache-maven-3.6.3\bin\..
Java version: 1.8.0_251, vendor: Oracle Corporation, runtime: C:\Program
Files\Java\jdk1.8.0_251\jre
Default locale: en_US, platform encoding: Cp1252
OS name: "windows 10", version: "10.0", arch: "amd64", family: "windows"

Details of changes since 3.7.1 are in the release notes:

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/RELEASE-NOTES.txt

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/changes-report.html

Site:

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/index.html
(note some *relative* links are broken and the 3.7.2 directories are
not yet created - these will be OK once the site is deployed.)

CLIRR Report (compared to 3.7.1):

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/clirr-report.html

JApiCmp Report (compared to 3.7.1):

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/japicmp.html

RAT Report:

https://dist.apache.org/repos/dist/dev/commons/net/3.7.2-RC1/site/rat-report.html

KEYS:
  https://www.apache.org/dist/commons/KEYS

Please review the release candidate and vote.
This vote will close no sooner than 72 hours from now.

  [ ] +1 Release these artifacts
  [ ] +0 OK, but...
  [ ] -0 OK, but really should fix...
  [ ] -1 I oppose this release because...

Thank you,

Gary Gregory,
Release Manager (using key 86fdc7e2a11262cb)

For following is intended as a helper and refresher for reviewers.

Validating a release candidate
==

These guidelines are NOT complete.

Requirements: Git, Java, Maven.

You can validate a release from a release candidate (RC) tag as follows.

1) Clone and checkout the RC tag

git clone https://gitbox.apache.org/repos/asf/commons-net.git --branch
commons-net-3.7.2-RC1 commons-net-3.7.2-RC1
cd commons-net-3.7.2-RC1

2) Check Apache licenses

This step is not required if the site includes a RAT report page which you
then must check.

mvn apache-rat:check

3) Check binary compatibility

Older components still use Apache Clirr:

This step is not required if the site includes a Clirr report page which
you then must check.

mvn clirr:check

Newer components use JApiCmp with the japicmp Maven Profile:

This step is not required if the site includes a JApiCmp report page which
you then must check.

mvn install 

Dependabot pr's

2020-10-14 Thread John Patrick
to shortcut multiple people telling me not to manually raise pr's to
upgrade dependencies, and dependabot is the preferred option for
commons to be raising these upgrades, and i should raise a pr to
enable dependabot.

so... here are all the pr's to enable dependabot on the repo's which
lack a dependabot.yml file.

https://github.com/apache/commons-bsf/pull/2
https://github.com/apache/commons-chain/pull/6
https://github.com/apache/commons-crypto/pull/108
https://github.com/apache/commons-daemon/pull/20
https://github.com/apache/commons-digester/pull/6
https://github.com/apache/commons-functor/pull/3
https://github.com/apache/commons-geometry/pull/102
https://github.com/apache/commons-jci/pull/3
https://github.com/apache/commons-jcs/pull/16
https://github.com/apache/commons-jelly/pull/7
https://github.com/apache/commons-jexl/pull/27
https://github.com/apache/commons-jxpath/pull/21
https://github.com/apache/commons-math/pull/160
https://github.com/apache/commons-numbers/pull/86
https://github.com/apache/commons-ognl/pull/10
https://github.com/apache/commons-proxy/pull/5
https://github.com/apache/commons-rng/pull/79
https://github.com/apache/commons-scxml/pull/9
https://github.com/apache/commons-statistics/pull/25
https://github.com/apache/commons-weaver/pull/5

They all have the change md5sum for .github/dependabot.yml which
matches the files in the other repos. I don't believe any other change
is required but i might be wrong.

John

-
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org



.gitignore commons-chain

2020-10-14 Thread John Patrick
hi,

is someone able to review this pr please.
https://github.com/apache/commons-chain/pull/3

It's the last commons project that doesn't have a .gitignore file. My
other pr's adding a .gitignore have either been merged or someone
noticed and added one themselves.

cheers,
john

-
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org



Re: [compress] BZip2CompressorInputStream stops working without rhyme or reason ...

2020-10-14 Thread Albretch Mueller
 I don't know what could there apaprently be exactly at byte offset
2848 in some buffer but files reporing to be fine by bzip2 --test
can't be processed by BZip2CompressorInputStream:
~
$ 
_IFL="/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream1.xml-p1p41242.bz2"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 242624781 Sep 22 05:40
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream1.xml-p1p41242.bz2
$ file --brief "${_IFL}"
bzip2 compressed data, block size = 900k
$ time bzip2 --test --verbose "${_IFL}"
  
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream1.xml-p1p41242.bz2:
ok

real2m0.650s
user2m0.076s
sys 0m0.256s

$ 
_IFL="/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 394001572 Sep 22 05:49
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2
$ file --brief "${_IFL}"
bzip2 compressed data, block size = 900k
$ time bzip2 --test --verbose  "${_IFL}"
  
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2:
ok

real3m6.249s
user3m5.192s
sys 0m0.628s

$ 
_IFL="/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 427323881 Sep 22 05:51
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2
$ file --brief "${_IFL}"
bzip2 compressed data, block size = 900k
$ time bzip2 --test --verbose  "${_IFL}"
  
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2:
ok

real3m20.861s
user3m19.296s
sys 0m0.988s

$ 
_IFL="/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 458830618 Sep 22 05:52
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2
$ file --brief "${_IFL}"
bzip2 compressed data, block size = 900k
$ time bzip2 --test --verbose  "${_IFL}"
  
/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2:
ok

real3m34.213s
user3m32.636s
sys 0m1.056s
$


$ 
_IFL="/home/lbrtchx/cmllpz/prjx/kd/java/IO/compress/logs/UnKmprssBZ2_02Test_20201013234903.log"
$ tail -n 10 "${_IFL}"
// __ Files Context of |4| files containing a total of |1522780852| bytes!
// __ [0/4): ...(30.131%)
|/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2|
// __ aOFlNm: 
|/home/lbrtchx/cmllpz/prjx/kd/java/IO/compress/REF/enwiki-20200920-pages-articles-multistream6-p958046p1483661.xml|
// __ |2848|2848|java.io.IOException:
// __ Read bytes and file lenght not the same! lTtlRdByts: |2848|
(lTtlRdByts != lFlL),  lFlL: |458830618|
at UnKmprssBZ2_02Test.main(UnKmprssBZ2_02Test.java:254)

real0m1.759s
user0m2.920s
sys 0m0.196s

$ 
_IFL="/home/lbrtchx/cmllpz/prjx/kd/java/IO/compress/logs/UnKmprssBZ2_02Test_20201013234826.log"
$ tail -n 10 "${_IFL}"
// __ Files Context of |4| files containing a total of |1522780852| bytes!
// __ [0/4): ...(28.062%)
|/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2|
// __ aOFlNm: 
|/home/lbrtchx/cmllpz/prjx/kd/java/IO/compress/REF/enwiki-20200920-pages-articles-multistream5-p558392p958045.xml|
// __ |2848|2848|java.io.IOException:
// __ Read bytes and file lenght not the same! lTtlRdByts: |2848|
(lTtlRdByts != lFlL),  lFlL: |427323881|
at UnKmprssBZ2_02Test.main(UnKmprssBZ2_02Test.java:254)

real0m1.669s
user0m2.720s
sys 0m0.220s

$ 
_IFL="/home/lbrtchx/cmllpz/prjx/kd/java/IO/compress/logs/UnKmprssBZ2_02Test_20201013234708.log"
$ tail -n 10 "${_IFL}"
// __ Files Context of |4| files containing a total of |1522780852| bytes!
// __ [0/4): ...(25.874%)
|/home/lbrtchx/cmllpz/LklWb/org/wikimedia/dumps/enwiki/20200920/enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2|
// __ aOFlNm: 
|/home/lbrtchx/cmllpz/prjx/kd/java/IO/compress/REF/enwiki-20200920-pages-articles-multistream4-p311330p558391.xml|
// __ |2848|2848|java.io.IOException:
// __ Read bytes and file lenght not the same! lTtlRdByts: |2848|
(lTtlRdByts != lFlL),  lFlL: |394001572|
at UnKmprssBZ2_02Test.main(UnKmprssBZ2_02Test.java:254)

real0m1.665s
user0m2.752s
sys 0m0.172s

$ 

Re: [compress] BZip2CompressorInputStream stops working without rhyme or reason ...

2020-10-14 Thread Albretch Mueller
 the files decompress fine using Linux bzip2:

$  time bzip2 --decompress --verbose --keep
"enwiki-20200920-pages-articles-multistream1.xml-p1p41242.bz2"
  enwiki-20200920-pages-articles-multistream1.xml-p1p41242.bz2: done

real2m22.089s
user2m6.664s
sys 0m7.184s

$  time bzip2 --decompress --verbose --keep
"enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2"
  enwiki-20200920-pages-articles-multistream4.xml-p311330p558391.bz2: done

real3m48.505s
user3m21.632s
sys 0m11.884s

$  time bzip2 --decompress --verbose --keep
"enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2"
  enwiki-20200920-pages-articles-multistream5.xml-p558392p958045.bz2: done

real4m24.744s
user3m48.576s
sys 0m14.524s

$  time bzip2 --decompress --verbose --keep
"enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2"
  enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661.bz2: done

real4m41.823s
user4m6.836s
sys 0m15.004s
$


$ _IFL="enwiki-20200920-pages-articles-multistream1.xml-p1p41242"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 885035378 Oct 14 00:05
enwiki-20200920-pages-articles-multistream1.xml-p1p41242
$ file --brief "${_IFL}"
HTML document, UTF-8 Unicode text, with very long lines

$ _IFL="enwiki-20200920-pages-articles-multistream4.xml-p311330p558391"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 1453887824 Oct 14 00:05
enwiki-20200920-pages-articles-multistream4.xml-p311330p558391
$ file --brief "${_IFL}"
HTML document, UTF-8 Unicode text, with very long lines

$ _IFL="enwiki-20200920-pages-articles-multistream5.xml-p558392p958045"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 1615967954 Oct 14 00:05
enwiki-20200920-pages-articles-multistream5.xml-p558392p958045
$ file --brief "${_IFL}"
HTML document, UTF-8 Unicode text, with very long lines

$ _IFL="enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661"
$ ls -l "${_IFL}"
-r--r--r-- 1 lbrtchx lbrtchx 1718742545 Oct 14 00:06
enwiki-20200920-pages-articles-multistream6.xml-p958046p1483661
$ file --brief "${_IFL}"
HTML document, UTF-8 Unicode text, with very long lines

-
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org