I need to import ~300 million RDF triples from YAGO2s, a mechanically-
generated ontology.  The Batch Importer (preferably the 2.0 version)
is an obvious candidate for this task, if I can figure out some pesky
usage details.  Help?

-r

Background

  If a triple defines a relation between subject and object URIs, I
  can express it as a Neo4j relationship.

  However, many triples define values (e.g., hasLatitude) for entities.
  I'd like to express these as node properties, but the Batch Importer
  uses TSV syntax, which has a fixed set of properties per node.
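
  To make the idea concrete, here is a rough Python sketch of the
  preprocessing I have in mind.  The function names and the toy triples
  are made up for illustration, and I am assuming (not asserting) that
  empty cells are acceptable to the importer.  It keeps everything in
  memory, so the real 300M-triple run would need the input sorted by
  subject or some other streaming approach.

```python
import csv
import io
from collections import defaultdict


def pivot_triples(triples, literal_predicates):
    """Split (subject, predicate, object) triples into node data and links.

    Predicates listed in literal_predicates become node properties;
    every other triple is kept as a relationship between two nodes.
    """
    props = defaultdict(dict)   # subject -> {predicate: value}
    rels = []                   # entity-to-entity triples
    for s, p, o in triples:
        if p in literal_predicates:
            props[s][p] = o
        else:
            rels.append((s, p, o))
            props[s]            # touch both ends so each gets a node row
            props[o]
    # The TSV header is the union of all property names seen.
    columns = sorted({p for d in props.values() for p in d})
    return props, columns, rels


def nodes_tsv(props, columns):
    """Render one wide row per node, leaving missing properties empty."""
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(["name"] + columns)
    for name in sorted(props):
        writer.writerow([name] + [props[name].get(c, "") for c in columns])
    return out.getvalue()


# Toy data only; not taken from YAGO2s.
triples = [
    ("Michael", "works_on", "neo4j"),
    ("Michael", "works_in", "Java"),
    ("Richard", "works_in", "Ruby"),
    ("Xavier", "knows", "Michael"),
]
props, columns, rels = pivot_triples(triples, {"works_on", "works_in"})
print(nodes_tsv(props, columns))
```

  Every node row ends up with the same columns, and most of them are
  empty for any given node, which is what prompts the first question
  below.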

Questions

  Q:  If I define properties in the TSV header, but leave the data
      fields empty, what will the Batch Importer do?  For example:

        name       works_on    works_in
        Michael    neo4j       Java
        Richard                Ruby
        Xavier             

      Would this create the following nodes?

        Michael:
          works_on:  neo4j
          works_in:  Java
        Richard:
          works_in:  Ruby
        Xavier:

  Q:  If I have already used the Batch Importer to define nodes and
      relationships, can I use it again to simply add properties?

        name       speaks
        Michael    German
        Richard    English

      Given that the nodes file no longer has ID numbers, how do I
      tell the Batch Importer which entities to modify?
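
      If a second pass turns out not to be possible, the fallback I can
      think of is to merge the extra properties into the nodes file by
      name before importing anything.  A minimal sketch, assuming the
      "name" column is unique in both files (merge_by_name is my own
      hypothetical helper, not part of the Batch Importer):

```python
import csv
import io


def merge_by_name(base_tsv, extra_tsv, key="name"):
    """Append the columns of extra_tsv to the matching rows of base_tsv.

    Rows are matched on the key column; base rows without a match get
    empty cells for the new columns.
    """
    base = csv.DictReader(io.StringIO(base_tsv), delimiter="\t")
    extra = csv.DictReader(io.StringIO(extra_tsv), delimiter="\t")
    extra_rows = {row[key]: row for row in extra}
    new_cols = [c for c in extra.fieldnames if c not in base.fieldnames]

    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=base.fieldnames + new_cols,
        delimiter="\t",
        lineterminator="\n",
    )
    writer.writeheader()
    for row in base:
        addition = extra_rows.get(row[key], {})
        for c in new_cols:
            row[c] = addition.get(c, "")
        writer.writerow(row)
    return out.getvalue()


base = "name\tworks_in\nMichael\tJava\nRichard\tRuby\nXavier\t\n"
extra = "name\tspeaks\nMichael\tGerman\nRichard\tEnglish\n"
print(merge_by_name(base, extra))
```

      That avoids the ID question entirely, but it means regenerating
      and reimporting the nodes file whenever a new property shows up,
      so I would rather update in place if the importer allows it.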

-- 
http://www.cfcl.com/rdm           Rich Morin           [email protected]
http://www.cfcl.com/rdm/resume    San Bruno, CA, USA   +1 650-873-7841

Software system design, development, and documentation

