[ 
https://issues.apache.org/jira/browse/JENA-2225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17501270#comment-17501270
 ] 

ASF subversion and git services commented on JENA-2225:
-------------------------------------------------------

Commit c07423a356fa73b41120aaf23c40a900d801c7b3 in jena's branch 
refs/heads/main from Andy Seaborne
[ https://gitbox.apache.org/repos/asf?p=jena.git;h=c07423a ]

JENA-2225: Handler long integers reading stats format


> TDB/TDB2 dataset size stat serialized incorrectly for large datasets
> --------------------------------------------------------------------
>
>                 Key: JENA-2225
>                 URL: https://issues.apache.org/jira/browse/JENA-2225
>             Project: Apache Jena
>          Issue Type: Bug
>          Components: TDB, TDB2
>    Affects Versions: Jena 4.3.1
>            Reporter: Lorenz Bühmann
>            Assignee: Andy Seaborne
>            Priority: Minor
>             Fix For: Jena 4.4.0
>
>         Attachments: stats.opt.gz
>
>
> When computing the TDB/TDB2 stats via CLI the size will be serialized 
> incorrectly for large datasets.
> For example for latest Wikidata Truthy we get
> {noformat}
> (count -1983667112)){noformat}
> This happens because for both the corresponding `Stats.java` class does 
> enforce an Integer type Node though the value is a long type:
> {code:java}
> if ( count >= 0 )
>     addPair(meta.getList(), StatsMatcher.COUNT, 
> NodeFactoryExtra.intToNode((int)count)) ; {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to