David Allen a data provenance and Neo4j expert wrote neoprofiler for this https://github.com/moxious/neoprofiler
David also did a recorded presentation about data provenance at the baltimore meetup http://www.meetup.com/graphdb-baltimore/events/219260495/ I did a simple meta graph approach that does sth similar http://www.neo4j.org/graphgist?9c4cb9e842ed24a3feae It also references David Pooles interesting use of Neo for sql-server metamodel representation Von meinem iPhone gesendet > Am 02.03.2015 um 22:31 schrieb Daniel Lewis <[email protected]>: > > "Schemaless" databases like Neo4j and MongoDB may not have predefined or > restrictive schemas, but we all know this doesn't absolve us from regular > Data Engineering tasks like Data Profiling and Data Modeling. In fact, the > lack of structure makes it more important to monitor the dynamic schema > developing in your data store so that software engineers and data scientists > can have reasonable expectations of data quality and completeness. > > Are there any techniques, scripts, or softwares people know of or use in > production systems for monitoring changes in schema and the completeness of > graphs we would expect to be complete? > > I've used Talend briefly for data profiling before, is it still one of the > better solutions? > -- > You received this message because you are subscribed to the Google Groups > "Neo4j" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > For more options, visit https://groups.google.com/d/optout. -- You received this message because you are subscribed to the Google Groups "Neo4j" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
