[ 
https://issues.apache.org/jira/browse/JENA-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17453968#comment-17453968
 ] 

Andy Seaborne commented on JENA-2204:
-------------------------------------

The data is broken.

There are *13946* named graphs and only *one* subject.

Every N-Quads line starts with {{<raw>}}.
{noformat}
<raw> <http://www.Department12.University0.edu/GraduateStudent9> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.lehigh.edu/~zhp2/2004/0401/univ-bench.owl#ResearchAssistant> .
{noformat}
This query counts the named graphs:
{noformat}
tdb2.tdbquery --loc DBX 'SELECT (count(*) AS ?C) { GRAPH ?g { } }'
{noformat}
So it looks like you meant to use the default graph.
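
To illustrate why such a line yields a single subject and a named graph per statement, here is a minimal Python sketch of how its four terms map to N-Quads positions. The helper {{quad_positions}} is hypothetical, and the regex is a naive assumption that only handles IRI terms; it fits the sample line but not N-Quads in general (literals and blank nodes are not handled).

```python
import re

# Naive tokenizer: matches only <...> IRI terms.
# Assumption: the line contains no literals or blank nodes.
IRI = re.compile(r"<[^>]*>")


def quad_positions(line):
    """Map the IRI terms of one N-Quads line to their positions."""
    terms = IRI.findall(line)
    names = ["subject", "predicate", "object", "graph"]
    return dict(zip(names, terms))


# The sample line from the attached data, written on one line.
line = (
    "<raw> <http://www.Department12.University0.edu/GraduateStudent9> "
    "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> "
    "<http://www.lehigh.edu/~zhp2/2004/0401/univ-bench.owl#ResearchAssistant> ."
)

for name, term in quad_positions(line).items():
    print(f"{name:9} {term}")
```

The leading {{<raw>}} takes the subject slot, so the intended subject, predicate and object each shift one position to the right, and the intended object ends up as the graph name.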

With the data fixed, it's 16M in the default graph.
_9x smaller_
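
The comment does not say how the data was fixed. One plausible approach, assuming the stray leading {{<raw>}} term is the only problem and each statement sits on a single line, is to drop that term so the remaining three terms form a triple. The function name {{strip_raw_prefix}} is hypothetical; this is a sketch, not the method actually used.

```python
def strip_raw_prefix(line, prefix="<raw>"):
    """Drop a leading `<raw>` term; return other lines unchanged."""
    stripped = line.lstrip()
    if stripped.startswith(prefix):
        # Remove the prefix and the whitespace that followed it.
        return stripped[len(prefix):].lstrip()
    return line


# The sample line from the attached data, written on one line.
bad = (
    "<raw> <http://www.Department12.University0.edu/GraduateStudent9> "
    "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> "
    "<http://www.lehigh.edu/~zhp2/2004/0401/univ-bench.owl#ResearchAssistant> ."
)

fixed = strip_raw_prefix(bad)
print(fixed)
```

The result has three terms per line, so it should load as plain triples into the default graph rather than creating a named graph per statement.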

 

> Storage required by TDB2 is much higher than TDB1, How to Fix ?
> ---------------------------------------------------------------
>
>                 Key: JENA-2204
>                 URL: https://issues.apache.org/jira/browse/JENA-2204
>             Project: Apache Jena
>          Issue Type: Question
>            Reporter: Hemant Tiwari
>            Priority: Minor
>         Attachments: LUBM-TDB2-Data.7z
>
>
> The storage required by TDB2 is much higher than TDB1.
> For 100k statements, TDB1 takes about 90 MB, while TDB2 takes close to 
> 1 GB.
> Why is there such a difference, and is there any solution available to reduce 
> the storage size?



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
