Which batch inserter are you using? I recommend looking into the new neo4j-import tool, which has good support for linking nodes by external id: neo4j.com/docs/milestone/import-tool.html
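As a rough illustration of external id-linking, here is a minimal Python sketch that turns triples into two CSV files and writes each node only once. It rests on several assumptions: the triples are already parsed into (subject, predicate, object) tuples, every object is a resource rather than a literal, and the set of seen URIs fits in memory. The function name `triples_to_csv` and the example URIs are hypothetical. The header fields (`uri:ID`, `:START_ID`, `:END_ID`, `:TYPE`) follow the header format described in the import-tool docs linked above, so please check them against the version you run.

```python
import csv
import io


def triples_to_csv(triples, nodes_out, rels_out):
    """Write deduplicated node rows and relationship rows for (s, p, o) triples.

    Returns the number of distinct node URIs written.
    """
    nodes = csv.writer(nodes_out, lineterminator="\n")
    rels = csv.writer(rels_out, lineterminator="\n")

    # Header rows: the URI itself serves as the external id of each node.
    nodes.writerow(["uri:ID"])
    rels.writerow([":START_ID", ":END_ID", ":TYPE"])

    seen = set()  # URIs already written as nodes (assumed to fit in memory)
    for subj, pred, obj in triples:
        for uri in (subj, obj):
            if uri not in seen:
                seen.add(uri)
                nodes.writerow([uri])
        # Relationships refer to nodes by the same external id.
        rels.writerow([subj, obj, pred])
    return len(seen)


# Hypothetical example data: three triples over three distinct resources.
triples = [
    ("http://example.org/a", "KNOWS", "http://example.org/b"),
    ("http://example.org/a", "KNOWS", "http://example.org/c"),
    ("http://example.org/b", "KNOWS", "http://example.org/c"),
]
nodes_buf, rels_buf = io.StringIO(), io.StringIO()
node_count = triples_to_csv(triples, nodes_buf, rels_buf)
```

With real data you would pass open file handles instead of `io.StringIO` buffers and hand the two resulting files to neo4j-import as its node and relationship inputs.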
In the next release it will also be able to handle duplicates well.

Michael

> On 11.03.2015 at 20:27, Alberto Jesús Rubio Sánchez <[email protected]> wrote:
>
> Hi,
>
> I'm working on an importer for RDF files, but I have a problem: my data files
> contain duplicate nodes. For this reason I use big ids to insert the nodes with
> the batch inserter, but the process is slow. I have seen this post
> <http://stackoverflow.com/questions/26627394/neo4j-batchimporter-is-slow-with-big-ids>
> with the same problem, where Michael recommends using an index for the id, but
> even with this index the process remains slow. Another option would be to merge
> duplicate nodes, but in this post
> <http://neo4j.narkive.com/IN5R143g/merge-2-different-nodes-and-associate-all-the-realtions-to-the-single-merged-node>
> Michael says it is not possible to do this automatically.
>
> Can you help me decide which is the best option?
>
> Thanks! :)
>
> --
> You received this message because you are subscribed to the Google Groups
> "Neo4j" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/d/optout.
