[ 
https://issues.apache.org/jira/browse/JENA-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17453955#comment-17453955
 ] 

Rob Vesse commented on JENA-2204:
---------------------------------

{quote}How can I go 5x to 10x lower on storage consumption ? {quote}

[[email protected]] Playing devil's advocate here, what if you can't?

As Andy has already been explaining the design of TDB2 (or any triple store for 
that matter) inevitably makes some choices around how it stores data.  
Additionally your choice of a particular OS may enforce certain choices on the 
implementation that are unavoidable.

Optimizing for storage space is not necessarily the best choice but we don't 
really have enough information to say if, and whether you should, try to 
optimise for that.  Disk space is generally very cheap these days, whether 
physical or cloud, and we'll assume that you have other performance 
considerations that aren't stated in this ticket e.g. can a system actually 
support your actual query/update workload with sufficient performance.

Reading between the lines it seems like there is an underlying question here 
about whether TDB2 is suitable for your data based on making an extrapolation 
from benchmark datasets (which as already noted are not necessarily indicative 
of real world data).

If your concern is whether TDB2 will work for your actual data then I would 
strongly encourage you to try it with your actual data and workload rather than 
synthetic benchmarks.

> Storage required by TDB2 is much higher than TDB1, How to Fix ?
> ---------------------------------------------------------------
>
>                 Key: JENA-2204
>                 URL: https://issues.apache.org/jira/browse/JENA-2204
>             Project: Apache Jena
>          Issue Type: Question
>            Reporter: Hemant Tiwari
>            Priority: Minor
>         Attachments: LUBM-TDB2-Data.7z
>
>
> The storage required by TDB2 is much higher than TDB1
> For 100k statements - TDB1 takes about 90 MB, while TDB2 is taking ~ close to 
> 1 GB.
> Why is there such a difference and is there any solution available to reduce 
> the storage size?



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to