I’ve written up the process used by infovore to create :BaseKB from the 
Freebase Quad Dump here:

https://raw.github.com/paulhoule/infovore/master/docs/decoding_the_freebase_quad_dump.txt

I’m doing this now not just to demonstrate the correctness of :BaseKB, but also 
to demonstrate the correctness of the Freebase Quad Dump on which it is based.

I’ve got concerns that other possible export formats might be “valid” RDF but 
may not maintain the properties that make SPARQL queries against :BaseKB 
function almost exactly like MQL queries against graphd.

Infovore, the framework that creates :BaseKB, is available on GitHub:
https://github.com/paulhoule/infovore
This, plus the above documentation, makes it possible for anyone to verify these 
claims. Infovore contains a test suite that runs SPARQL queries against a triple 
store loaded with :BaseKB to confirm correct operation. Infovore passes all 
tests when run against the 2012-11-04 quad dump.
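
To give a rough idea of what a data test of this kind looks like, here is a minimal sketch in Python. It is not infovore's actual test suite: the sample triples, the `ex:` identifiers, and the `match` and `names_of_type` helpers are all invented for illustration, and a toy in-memory set stands in for a real triple store queried with SPARQL.

```python
# Hypothetical sketch: a toy in-memory "triple store" and one data test.
# None of these names come from infovore; the triples are invented sample data.

TRIPLES = {
    ("ex:m.1", "ex:type.object.name", "Example Topic"),
    ("ex:m.1", "ex:type.object.type", "ex:people.person"),
    ("ex:m.2", "ex:type.object.name", "Other Topic"),
    ("ex:m.2", "ex:type.object.type", "ex:location.location"),
}


def match(triples, s=None, p=None, o=None):
    """Return the sorted triples matching a pattern; None acts as a wildcard."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


def names_of_type(triples, type_id):
    """Names of every subject carrying the given type (a two-pattern query)."""
    subjects = [t[0] for t in match(triples, p="ex:type.object.type", o=type_id)]
    return sorted(
        t[2]
        for s in subjects
        for t in match(triples, s=s, p="ex:type.object.name")
    )


def test_person_names():
    # The expected answer is fixed in advance; an export that changes what
    # the query means makes this assertion fail.
    assert names_of_type(TRIPLES, "ex:people.person") == ["Example Topic"]


test_person_names()
```

The point of the sketch is only the shape of the check: a query with a known answer is run against the loaded data, and any export that alters the answer is caught.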

A 1.0 release of infovore is in progress. This is a matter of a single patch to 
locate some temporary files in the right place and a small amount of additional 
documentation. It may take a few more days because a complete test cycle 
including loading into a triple store and running tests takes about 12 hours of 
wallclock time.

People are well aware of the value of testing software, but it has taken 
longer to realize that data products need compatibility testing too, 
particularly in the RDF and semantic space.

I’d like to advise Freebase to resume publication of the quad dump until it can 
demonstrate the correctness of any alternative data export. In fact, with 
infovore available under an Apache License and all of my claims independently 
verifiable, the Freebase quad dump could remain in use indefinitely.

Freebase users should demand a correct export.
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
