Hi,
I have a graph with 96k nodes , 0 relations and 213k properties. I created
the nodes from a CSV file which is in turn an export of a relational table.
Each tuple should become a node, each attribute of the tuple should become
a property of the node.
In the other hand, each tuple of the relational table had at least one
reference to another tuple in the same table (this is the reason why we
thought of switching to Neo4j, we were using many auto-joins).
Using LOAD CSV, I created the Cypher script that allowed me to move data
from the CSV file and create the nodes with only not null properties. Now,
I need to create the relationships from the references attributes.
I wrote a Cypher query that maches every node and check if it has
references to another node, if so, I create a relation from the current
node to the referenced node.
I'm querying from the Console. Here how does my query look like:
MATCH (a)
where (has(a.reference))
with a.reference as reference, a.name as name
MATCH (b {oid: toInt(reference)}),(a {name: name})
create (a)-[:reference]->(b)
return reference;
The problem is that this query is taking a limitless time to execute. Some
times it just doesn't finis or finish with "unknown error" message. The
query turns to a blocking after adding the second MATCH. It worked with a
small database.
The database is of only 41Mb, so I didn't touch the default settings of the
cache.
*Can you please help me figure out what is eating up all that time?*
I'm using Neo4j 2.1.2 in Server mode. I tried the query in two machines (1)
Windows 8 - i7 CPU - 8Gb Ram (2) Windows Server - i5 CPU - 16Gb Ram.
Regards.
--
You received this message because you are subscribed to the Google Groups
"Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
For more options, visit https://groups.google.com/d/optout.