Hi, I'm working on an RDF files importer but I have a problem, my data files have duplicate nodes. For this reason I use big ids to insert the nodes using the batch inserter but the proccess is slow. I have seen this post <http://stackoverflow.com/questions/26627394/neo4j-batchimporter-is-slow-with-big-ids> witch the same problem where Michael recommends use an index for the id but adding this index the process remains slow. Another option would be to merge duplicate nodes but in this post <http://neo4j.narkive.com/IN5R143g/merge-2-different-nodes-and-associate-all-the-realtions-to-the-single-merged-node> Michael says it is not possible to do this automatically.
Can you help me decide which is the best option? Thanks! :) -- You received this message because you are subscribed to the Google Groups "Neo4j" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
