Another factor is the ratio of triples to terms.
In some data, the same terms are used often, Obviously true for
properties much of the time. If theer are a lot of triples with the same
subject, there are a higher number of triples compared to terms.
Some data is more linked, subject and object are the same term.
TDB stores terms once.
Andy
On 25/11/17 11:42, ajs6f wrote:
Andy may be able to be more precise, but I can tell you right away that it's not a
straightforward function. How many literals are there "per triple"? How big are
the literals, on average? How many unique bnodes and URIs? All of these things will
change the eventual size of the database.
ajs6f
On Nov 25, 2017, at 6:40 AM, Laura Morales <[email protected]> wrote:
Is it possible to estimate the size of a TDB2 store from one of nt/turtle/xml
input file, without actually creating the store? Is there maybe a tool for this?