Lorenz Bühmann created JENA-2225:
------------------------------------

             Summary: TDB/TDB2 dataset size stat serialized inccorrectly for 
large datasets
                 Key: JENA-2225
                 URL: https://issues.apache.org/jira/browse/JENA-2225
             Project: Apache Jena
          Issue Type: Bug
          Components: TDB, TDB2
    Affects Versions: Jena 4.3.1
            Reporter: Lorenz Bühmann


When computing the TDB/TDB2 stats via CLI the size will be serialized 
incorrectly for large datasets.

For example for latest Wikidata Truthy we get
{noformat}
(count -1983667112)){noformat}
This happens because for both the corresponding `Stats.java` class does enforce 
an Integer type Node though the value is a long type:
{code:java}
if ( count >= 0 )
    addPair(meta.getList(), StatsMatcher.COUNT, 
NodeFactoryExtra.intToNode((int)count)) ; {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to