Hi all,

I'm struggling to import a graph with about 10m nodes and 20m
relationships, where each node has 0 to 10 relationships. Creating the
nodes takes about 10 minutes, but creating the relationships is slower
by several orders of magnitude. I'm using a 2.4 GHz i7 MacBook Pro with
4GB RAM and a conventional HDD.

The graph is stored as an adjacency list in a text file where each line
has this form:

Foo|Bar|Baz
(Node Foo has relationships to Bar and Baz)
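
To make the format concrete, here is a minimal Python sketch of how one
such line can be split into a start node and its end nodes. It is only an
illustration and not the linked code: parse_line is a made-up helper
name, and it assumes that '|' never occurs inside a node name and that a
node without relationships stands alone on its line.

```python
def parse_line(line):
    """Split 'Foo|Bar|Baz' into ('Foo', ['Bar', 'Baz'])."""
    # Assumption: '|' is purely a separator and never part of a name.
    fields = line.rstrip("\n").split("|")
    source, targets = fields[0], fields[1:]
    return source, targets


print(parse_line("Foo|Bar|Baz"))  # ('Foo', ['Bar', 'Baz'])
print(parse_line("Qux"))          # ('Qux', [])
```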

My current approach is to iterate over the whole file twice. In the
first pass, I create a node with the property "name" for the first
entry in each line (Foo in this case) and add it to an index. In the
second pass, I look up the start node and the end nodes in the index by
name and create the relationships.
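
In outline, the two passes look roughly like the Python sketch below. It
is a simplified stand-in, not the linked code: a plain dict takes the
place of the Neo4j index, integer ids take the place of nodes, and
import_graph and open_file are names made up for the example. Skipping
end nodes that never appear as the first entry of a line is also an
assumption of the sketch.

```python
import io


def import_graph(open_file):
    """Read the adjacency list twice; return (index, relationships)."""
    index = {}          # name -> node id (stand-in for the index lookup)
    relationships = []  # (start id, end id) pairs

    # Pass 1: create one node per line and register it under its name.
    with open_file() as f:
        for line in f:
            name = line.rstrip("\n").split("|")[0]
            if name and name not in index:
                index[name] = len(index)

    # Pass 2: resolve start and end nodes by name, then link them.
    with open_file() as f:
        for line in f:
            source, *targets = line.rstrip("\n").split("|")
            if not source:
                continue  # skip blank lines
            for target in targets:
                # Assumption: unknown end nodes are skipped, not created.
                if target in index:
                    relationships.append((index[source], index[target]))

    return index, relationships


sample = "Foo|Bar|Baz\nBar|Foo\nBaz\n"
index, rels = import_graph(lambda: io.StringIO(sample))
print(index)  # {'Foo': 0, 'Bar': 1, 'Baz': 2}
print(rels)   # [(0, 1), (0, 2), (1, 0)]
```

Every relationship in the second pass costs two lookups by name, which
is the step that runs about 20m times for a graph of this size.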

My code can be found here: http://pastie.org/2041801

With my approach, the best I can achieve is 100 created relationships
per second. At that rate, the 20m relationships would take roughly 55
hours. I experimented with the mapped memory settings, but without much
effect.

Is this the speed I can expect? Any advice on how to speed up this
process?

Best regards,
Daniel Hepper
_______________________________________________
Neo4j mailing list
[email protected]
https://lists.neo4j.org/mailman/listinfo/user