On 14/06/2019 18:13, Marco Neumann wrote:
I am collecting Jena loader benchmarks. If you have results, please post
them directly.

http://www.lotico.com/index.php/JENA_Loader_Benchmarks

tdb2.tdbloader has variations controlled by --loader.

--loader=
Loader to use: 'basic', 'phased' (default), 'sequential', 'parallel' or 'light'

"basic" is a super-naive parser-add-triple loop. It is used if a loader can't cope with an already loaded database.

"phased" is a balanced loader that does not saturate the machine. Some parallelism.

"sequential" is the tdbloader algorithm for TDB2, more for reference.

"parallel" uses as much parallelism as it wants (5 for triples, more for quads).

"light" is two-threaded. Slightly lighter than "phased".

See LoaderPlans.
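
For anyone comparing the variants, a run might be set up roughly like this. It is only a sketch: the database directory DB2 and the file data.nt are hypothetical placeholders, and the load step is skipped unless tdb2.tdbloader is on the PATH and the data file exists.

```shell
#!/bin/sh
# Hypothetical locations; adjust to your own setup.
DB_DIR="${DB_DIR:-DB2}"
DATA_FILE="${DATA_FILE:-data.nt}"
# One of: basic, phased (default), sequential, parallel, light.
LOADER="${LOADER:-phased}"

# --loader selects the loader variant; --loc names the TDB2 database directory.
CMD="tdb2.tdbloader --loader=$LOADER --loc $DB_DIR $DATA_FILE"
echo "$CMD"

# Only run the load if the tool and the data file are actually present.
if command -v tdb2.tdbloader >/dev/null 2>&1 && [ -f "$DATA_FILE" ]; then
    $CMD
fi
```

Setting LOADER=parallel (or any of the other names) before running the script switches the variant, so the same script can be reused for each benchmark run.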

On a Linux machine I am using "time" to collect data.

Is there a flag on tdb2.tdbloader to report time and triples per second?
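
Independently of what the loader itself reports, a rate can be worked out by hand from the wall-clock time and a triple count. The sketch below assumes N-Triples input (one triple per line, blank lines and comment lines ignored); the helper names count_triples and triples_per_second, and the paths DB2 and data.nt, are made up for illustration.

```shell
#!/bin/sh
# Count non-blank, non-comment lines; for N-Triples this approximates the triple count.
count_triples() {
    grep -c -v -e '^[[:space:]]*$' -e '^[[:space:]]*#' "$1"
}

# Print triples per second, rounded to a whole number; "n/a" if elapsed time is zero.
triples_per_second() {
    awk -v n="$1" -v s="$2" 'BEGIN { if (s > 0) printf "%.0f\n", n / s; else print "n/a" }'
}

DATA_FILE="${DATA_FILE:-data.nt}"   # hypothetical input file
DB_DIR="${DB_DIR:-DB2}"             # hypothetical database directory

# Time the load with one-second resolution; skipped if the tool or file is missing.
if command -v tdb2.tdbloader >/dev/null 2>&1 && [ -f "$DATA_FILE" ]; then
    start=$(date +%s)
    tdb2.tdbloader --loc "$DB_DIR" "$DATA_FILE"
    end=$(date +%s)
    n=$(count_triples "$DATA_FILE")
    elapsed=$((end - start))
    echo "$n triples in $elapsed s: $(triples_per_second "$n" "$elapsed") triples/s"
fi
```

The one-second resolution of date +%s is coarse for small files, so this is only meaningful for loads that run for minutes or longer.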

I have noticed that storage space use for tdbloader2 is significantly
smaller on disk compared to tdbloader and tdb2.tdbloader. Is there a
straightforward explanation here?
