On 18/03/14 00:32, Martino Buffolino wrote:
Hi,

I ran tdbloader2 overnight on an amazon box and it seemed to run out of
disk space and ultimately crashed. Is there a way I can start the process
up, beginning at the sort step?

Here is the log:

INFO  Add: 2,647,900,000 Data (Batch: 54,704 / Avg: 24,949)
INFO  Add: 2,647,950,000 Data (Batch: 54,347 / Avg: 24,949)
INFO  Add: 2,648,000,000 Data (Batch: 83,752 / Avg: 24,950)
INFO    Elapsed: 106,130.99 seconds [2014/03/17 22:44:16 UTC]
INFO  Total: 2,648,011,531 tuples : 106,438.23 seconds : 24,878.39 tuples/sec [2014/03/17 22:49:23 UTC]
  22:49:25 Index phase
  22:49:25 Index SPO
sort: write failed: /tmp/sortxRql3B: No space left on device


Thanks for any help,
Martino


Maybe - if the work files are still there.

You can edit the script tdbloader2worker:

1/ Set DATA_TRIPLES and DATA_QUADS to the relevant work files.

2/ Remove the data phase (comment out the first call to java for CmdNodeTableBuilder)

3/ Set KEEPWORKFILES=1
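
Before editing the script, it may be worth confirming that the work files survived the crash and seeing how much disk space is left. This is only a rough sketch: the two paths are hypothetical placeholders (substitute whatever files the failed run actually left behind), and report_workfile is a helper made up for this example, not part of tdbloader2.

```shell
#!/bin/sh
# Hypothetical paths: replace with the work files left by the crashed run.
DATA_TRIPLES="${DATA_TRIPLES:-/tmp/tdb-data-triples.tmp}"
DATA_QUADS="${DATA_QUADS:-/tmp/tdb-data-quads.tmp}"

# Illustrative helper: print the size of a work file, or report it missing.
# Returns non-zero when the file is not there.
report_workfile() {
    if [ -f "$1" ]; then
        echo "present: $1 ($(wc -c < "$1" | tr -d ' ') bytes)"
    else
        echo "missing: $1"
        return 1
    fi
}

for f in "$DATA_TRIPLES" "$DATA_QUADS"; do
    report_workfile "$f" || echo "  (nothing to reuse at this path)"
done

# Free space on the filesystem holding the triples work file.
df -h "$(dirname "$DATA_TRIPLES")"
```

If either file is reported missing, the data phase probably has to be run again.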

By the way, loading is much faster on Amazon if you use one of the instances with a large SSD.

In fact, running plain "tdbloader" on an SSD machine may be the best way. tdbloader does not have large intermediate files.

        Andy
