[ https://issues.apache.org/jira/browse/JENA-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17453968#comment-17453968 ]
Andy Seaborne commented on JENA-2204: ------------------------------------- The data is broken. There are *13946* named graphs. And *one* subject. Every nquads line starts {{{}<raw>{}}}. {noformat} <raw> <http://www.Department12.University0.edu/GraduateStudent9> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.lehigh.edu/~zhp2/2004/0401/univ-bench.owl#ResearchAssistant> . {noformat} {noformat} tdb2.tdbquery --loc DBX 'SELECT (count(*) AS ?C) { GRAPH ?g { } }' {noformat} So it looks like you meant to use the default graph. Fixed data, it's 16M in the default graph. _x9 smaller_ > Storage required by TDB2 is much higher than TDB1, How to Fix ? > --------------------------------------------------------------- > > Key: JENA-2204 > URL: https://issues.apache.org/jira/browse/JENA-2204 > Project: Apache Jena > Issue Type: Question > Reporter: Hemant Tiwari > Priority: Minor > Attachments: LUBM-TDB2-Data.7z > > > The storage required by TDB2 is much higher than TDB1 > For 100k statements - TDB1 takes about 90 MB, while TDB2 is taking ~ close to > 1 GB. > Why is there such a difference and is there any solution available to reduce > the storage size? -- This message was sent by Atlassian Jira (v8.20.1#820001)