Lorenz Bühmann created JENA-2225: ------------------------------------ Summary: TDB/TDB2 dataset size stat serialized inccorrectly for large datasets Key: JENA-2225 URL: https://issues.apache.org/jira/browse/JENA-2225 Project: Apache Jena Issue Type: Bug Components: TDB, TDB2 Affects Versions: Jena 4.3.1 Reporter: Lorenz Bühmann
When computing the TDB/TDB2 stats via CLI the size will be serialized incorrectly for large datasets. For example for latest Wikidata Truthy we get {noformat} (count -1983667112)){noformat} This happens because for both the corresponding `Stats.java` class does enforce an Integer type Node though the value is a long type: {code:java} if ( count >= 0 ) addPair(meta.getList(), StatsMatcher.COUNT, NodeFactoryExtra.intToNode((int)count)) ; {code} -- This message was sent by Atlassian Jira (v8.20.1#820001)