[Neo4j] Neo4J Batch Inserter is slow with big ids

Alberto Jesús Rubio Sánchez Wed, 11 Mar 2015 17:13:30 -0700

Hi,

I'm working on an importer for RDF files, but I have a problem: my data files 
contain duplicate nodes. For this reason I use big ids to insert the nodes 
with the batch inserter, but the process is slow. I have seen this post 
<http://stackoverflow.com/questions/26627394/neo4j-batchimporter-is-slow-with-big-ids>
with the same problem, where Michael recommends using an index for the id, but 
even after adding this index the process remains slow. Another option would be to 
merge the duplicate nodes, but in this post 
<http://neo4j.narkive.com/IN5R143g/merge-2-different-nodes-and-associate-all-the-realtions-to-the-single-merged-node>
Michael says it is not possible to do this automatically.
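
A third possibility is to avoid creating the duplicates in the first place. 
Below is a minimal sketch of that idea, assuming all distinct node URIs fit in 
memory: each URI gets a small sequential id the first time it is seen, and the 
same id is returned for every later occurrence. UriIdMapper and its methods are 
hypothetical names for illustration, not part of Neo4j.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper (not a Neo4j class): maps each distinct RDF URI to a
// small, dense, sequential id so duplicates resolve to the same node.
class UriIdMapper {
    private final Map<String, Long> ids = new HashMap<>();
    private long nextId = 0;

    // Returns true if this URI has already been assigned an id.
    boolean isKnown(String uri) {
        return ids.containsKey(uri);
    }

    // Returns the id for this URI, assigning the next free id on first sight.
    long idFor(String uri) {
        Long existing = ids.get(uri);
        if (existing != null) {
            return existing;
        }
        long id = nextId++;
        ids.put(uri, id);
        return id;
    }

    // Number of distinct URIs seen so far.
    int size() {
        return ids.size();
    }
}
```

With something like this, the importer would check isKnown(uri) before creating 
a node, create the node only for unseen URIs, and use idFor(uri) for both 
endpoints when creating relationships, so the ids stay small and contiguous.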


Can you help me decide which is the best option?

Thanks! :)
