Can you share the structure of your files? I.e. header?
I think it resolves duplicates if you have the same group (or no group)
across node-types.

is geneSymbol your key property?

---------- Forwarded message ----------
From: Sajid Mughal <saj...@gmail.com>
Date: Wed, Oct 12, 2016 at 9:27 AM
Subject: [Neo4j] neo4j-import and skip-duplicate-nodes over multiple files
To: Neo4j <neo4j@googlegroups.com>


Hi,

I'm using neo4j-import and I have an entity that will be loaded from
multiple files. I know the files will have duplicates in them, but I want
to skip them, so I am using "--skip-duplicate-nodes true". However, when I
go to create a Unique Constraint, it has managed to load in duplicates.

Does it only ignore duplicates inside each file, and not across files?

I'm using 3.0.6, here is command I use

bin/neo4j-import   --into graph.db.db --id-type string --bad-tolerance
50000   --skip-duplicate-nodes true \
--nodes:GeneSymbol GeneSymbol-header.csv,GeneSymbol-1.csv,GeneSymbol-2.
csv,GeneSymbol-3.csv

Here is what happens when I try to create the constraint

neo4j-sh (?)$ CREATE CONSTRAINT ON (p:GeneSymbol) ASSERT p.geneSymbol IS
UNIQUE;
982 ms
WARNING: Unable to create CONSTRAINT ON ( genesymbol:GeneSymbol ) ASSERT
genesymbol.geneSymbol IS UNIQUE:
Multiple nodes with label `GeneSymbol` have property `geneSymbol` = 'X':
  node(32619)
  node(40556)


Many thanks,

-- 
You received this message because you are subscribed to the Google Groups
"Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to neo4j+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to neo4j+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to