David Allen a data provenance and Neo4j expert wrote neoprofiler for this 

https://github.com/moxious/neoprofiler

David also did a recorded presentation about data provenance at the baltimore 
meetup
http://www.meetup.com/graphdb-baltimore/events/219260495/

I did a simple meta graph approach that does sth similar

http://www.neo4j.org/graphgist?9c4cb9e842ed24a3feae

It also references David Pooles interesting use of Neo for sql-server metamodel 
representation

Von meinem iPhone gesendet

> Am 02.03.2015 um 22:31 schrieb Daniel Lewis <[email protected]>:
> 
> "Schemaless" databases like Neo4j and MongoDB may not have predefined or 
> restrictive schemas, but we all know this doesn't absolve us from regular 
> Data Engineering tasks like Data Profiling and Data Modeling. In fact, the 
> lack of structure makes it more important to monitor the dynamic schema 
> developing in your data store so that software engineers and data scientists 
> can have reasonable expectations of data quality and completeness. 
> 
> Are there any techniques, scripts, or softwares people know of or use in 
> production systems for monitoring changes in schema and the completeness of 
> graphs we would expect to be complete? 
> 
> I've used Talend briefly for data profiling before, is it still one of the 
> better solutions?
> -- 
> You received this message because you are subscribed to the Google Groups 
> "Neo4j" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to [email protected].
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to