On 25/04/15 15:05, Daniel Hernández wrote:
El 2015-04-25 10:50, Andy Seaborne escribió:
...

how far did it get?

I was at 12 millions of triples.

$ tail tdb-03-dout.log
12:21:19 INFO  loader               :: Add: 11,950,000 triples (Batch:
2,033 / Avg: 23,940)
12:21:50 INFO  loader               :: Add: 12,000,000 triples (Batch:
1,619 / Avg: 22,640)
12:21:50 INFO  loader               ::   Elapsed: 530.03 seconds
[2015/04/25 12:21:50 UTC]
12:22:14 INFO  loader               :: Add: 12,050,000 triples (Batch:
2,040 / Avg: 21,729)
12:22:48 INFO  loader               :: Add: 12,100,000 triples (Batch:
1,485 / Avg: 20,571)
12:23:24 INFO  loader               :: Add: 12,150,000 triples (Batch:
1,367 / Avg: 19,447)
12:24:03 INFO  loader               :: Add: 12,200,000 triples (Batch:
1,289 / Avg: 18,386)
12:24:56 INFO  loader               :: Add: 12,250,000 triples (Batch:
940 / Avg: 17,092)
12:26:07 INFO  loader               :: Add: 12,300,000 triples (Batch:
708 / Avg: 15,624)
12:27:50 INFO  loader               :: Add: 12,350,000 triples (Batch:
482 / Avg: 13,864)



Daniel,

Than I'm baffled as to where the space is going. While the 15e6 predicates is unusual, a 10G should be wildly too large and I don't see how it affects the loading at the point shown.

For tdbloader2, it seems to be the stats space - that would be affected by 15e6 predciates. A very large heap should merely slow down loading, not still run out of space.(I'll look at adding a "--no-stats" flag anyway)

Could I get a copy of your data to try up in my development setup with a profiler?

        Thanks
        Andy

Reply via email to