[ 
https://issues.apache.org/jira/browse/COMPRESS-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Frédérik Bilhaut updated COMPRESS-325:
--------------------------------------
    Description: 
Sample code:

{code:java}
URL url = new URL("http://downloads.dbpedia.org/current/core-i18n/en/labels_en.nt.bz2");
InputStream input = new BZip2CompressorInputStream(url.openConnection().getInputStream());
BufferedReader reader = new BufferedReader(new InputStreamReader(input, "US-ASCII"));
                        
int count = 0;
for(String line = reader.readLine(); line != null; line = reader.readLine()) {
        if(++count > 10000) break;
        else System.out.println(count + ": " + line);
}
{code}

Reading stops partway through line 7801, where the stream reports EOF:

{code}
7799: <http://dbpedia.org/resource/Gamemaster> <http://www.w3.org/2000/01/rdf-schema#label> "Gamemaster"@en .
7800: <http://dbpedia.org/resource/Genetic_engineering> <http://www.w3.org/2000/01/rdf-schema#label> "Genetic engineering"@en .
7801: <http://dbpedia.org/resource/Gradius_(video_game)> <http://www.w3.org/2000/01/rdf-s
{code}



  was:
Sample code :

{code:java}
URL url = new URL("http://downloads.dbpedia.org/current/core-i18n/en/labels_en.nt.bz2");
InputStream input = new BZip2CompressorInputStream(url.openConnection().getInputStream());
BufferedReader reader = new BufferedReader(new InputStreamReader(input, "US-ASCII"));
                        
int count = 0;
for(String line = reader.readLine(); line != null; line = reader.readLine()) {
        if(++count > 10000) break;
        else System.out.println(count + ": " + line);
}
{code}

It stops at line 7801 (EOF) :

{{
7799: <http://dbpedia.org/resource/Gamemaster> <http://www.w3.org/2000/01/rdf-schema#label> "Gamemaster"@en .
7800: <http://dbpedia.org/resource/Genetic_engineering> <http://www.w3.org/2000/01/rdf-schema#label> "Genetic engineering"@en .
7801: <http://dbpedia.org/resource/Gradius_(video_game)> <http://www.w3.org/2000/01/rdf-s
}}




> Unable to uncompress bzip2 dbPedia files
> ----------------------------------------
>
>                 Key: COMPRESS-325
>                 URL: https://issues.apache.org/jira/browse/COMPRESS-325
>             Project: Commons Compress
>          Issue Type: Bug
>    Affects Versions: 1.10
>            Reporter: Frédérik Bilhaut
>
> Sample code :
> {code:java}
> URL url = new URL("http://downloads.dbpedia.org/current/core-i18n/en/labels_en.nt.bz2");
> InputStream input = new BZip2CompressorInputStream(url.openConnection().getInputStream());
> BufferedReader reader = new BufferedReader(new InputStreamReader(input, "US-ASCII"));
>                       
> int count = 0;
> for(String line = reader.readLine(); line != null; line = reader.readLine()) {
>       if(++count > 10000) break;
>       else System.out.println(count + ": " + line);
> }
> {code}
> It stops at line 7801 (EOF) :
> {code}
> 7799: <http://dbpedia.org/resource/Gamemaster> <http://www.w3.org/2000/01/rdf-schema#label> "Gamemaster"@en .
> 7800: <http://dbpedia.org/resource/Genetic_engineering> <http://www.w3.org/2000/01/rdf-schema#label> "Genetic engineering"@en .
> 7801: <http://dbpedia.org/resource/Gradius_(video_game)> <http://www.w3.org/2000/01/rdf-s
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
