Another factor is the ratio of triples to terms.

In some data, the same terms are used often, Obviously true for properties much of the time. If theer are a lot of triples with the same subject, there are a higher number of triples compared to terms.

Some data is more linked, subject and object are the same term.

TDB stores terms once.

    Andy

On 25/11/17 11:42, ajs6f wrote:
Andy may be able to be more precise, but I can tell you right away that it's not a 
straightforward function. How many literals are there "per triple"? How big are 
the literals, on average? How many unique bnodes and URIs? All of these things will 
change the eventual size of the database.

ajs6f

On Nov 25, 2017, at 6:40 AM, Laura Morales <[email protected]> wrote:

Is it possible to estimate the size of a TDB2 store from one of nt/turtle/xml 
input file, without actually creating the store? Is there maybe a tool for this?

Reply via email to