Here's a (not so) hypothetical question.

     Suppose I've got some database that was populated from some data 
source like Freebase or DBpedia.  I've got my own internal identifiers,  
so I can say that

dbpedia:George_Washington -> mysystem:88582
fbase:/en/george_washington -> mysystem:88582

     The whole reason I'm doing this is so I can make assertions about 
these entities.  Some of these come from Freebase and DBpedia and some 
of them come from other places.

     If I look at a single moment in time,  I feel like I've got this 
under control.  I can load DBpedia version X or a freebase data dump 
from a particular week and it all works perfectly.

     Of course,  at some point my data gets stale and I need to update 
it.  For instance,  there's a new batch of players who got drafted by 
the NFL I want to know about,  or I want to know about movies that are 
going to be released soon,  or have some vocabulary for the Fukushima 
muclear accident or maybe I just try to extract some facts from the 
latest Freebase dump and discover that a distressing number of mids have 
changed in the last six months.

     So I need some way to keep my knowledge base synced up with changes 
in external knowledge base(s) although I might settle for treating one 
as authoritative.  It's not clear that there's sufficient information in 
DBpedia to track changes (although some renames could be tracked by 
following the wikipedia id's) but I think a good job can be done with 
Freebase:  mid redirect records can be used to follow renames and 
merges,  and at least sometimes with splits there are gardening hints 
that help.

     I can imagine heuristics that would probably work on an ad hoc 
basis,  but it seems to me there ought to be an intellectually 
consistent approach to this problem.  Is anyone familiar with anything 
in the literature on this?

------------------------------------------------------------------------------
Simplify data backup and recovery for your virtual environment with vRanger.
Installation's a snap, and flexible recovery options mean your data is safe,
secure and there when you need it. Discover what all the cheering's about.
Get your free trial download today. 
http://p.sf.net/sfu/quest-dev2dev2 
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to