[
https://issues.apache.org/jira/browse/JENA-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17453955#comment-17453955
]
Rob Vesse commented on JENA-2204:
---------------------------------
{quote}How can I go 5x to 10x lower on storage consumption ? {quote}
[[email protected]] Playing devil's advocate here, what if you can't?
As Andy has already been explaining the design of TDB2 (or any triple store for
that matter) inevitably makes some choices around how it stores data.
Additionally your choice of a particular OS may enforce certain choices on the
implementation that are unavoidable.
Optimizing for storage space is not necessarily the best choice but we don't
really have enough information to say if, and whether you should, try to
optimise for that. Disk space is generally very cheap these days, whether
physical or cloud, and we'll assume that you have other performance
considerations that aren't stated in this ticket e.g. can a system actually
support your actual query/update workload with sufficient performance.
Reading between the lines it seems like there is an underlying question here
about whether TDB2 is suitable for your data based on making an extrapolation
from benchmark datasets (which as already noted are not necessarily indicative
of real world data).
If your concern is whether TDB2 will work for your actual data then I would
strongly encourage you to try it with your actual data and workload rather than
synthetic benchmarks.
> Storage required by TDB2 is much higher than TDB1, How to Fix ?
> ---------------------------------------------------------------
>
> Key: JENA-2204
> URL: https://issues.apache.org/jira/browse/JENA-2204
> Project: Apache Jena
> Issue Type: Question
> Reporter: Hemant Tiwari
> Priority: Minor
> Attachments: LUBM-TDB2-Data.7z
>
>
> The storage required by TDB2 is much higher than TDB1
> For 100k statements - TDB1 takes about 90 MB, while TDB2 is taking ~ close to
> 1 GB.
> Why is there such a difference and is there any solution available to reduce
> the storage size?
--
This message was sent by Atlassian Jira
(v8.20.1#820001)