On 06/08/2016 08:55 AM, Martynas Jusevičius wrote:
So I think it would be
wrong to ignore the "older" description -- or any "other" description
in general.

This gets into the whole area of what data you choose to believe. Some data is just plain wrong, and lots of data is "correct" (i.e. usable) for some uses but wrong for others and will cause inconsistency when merged. Very little data is universally "correct".

I think it is inescapable that when merging data from multiple sources you need to be careful about which data you choose to include. Putting data from each source in its own named graph is one good way to help keep track of where it came from, and this is useful in deciding whether to include it. But that provides only coarse-grained control. You may well need to eliminate only a few triples from some source data in order to make it merge without causing inconsistencies, and it can be tedious to figure out which triples to drop.

Bottom line: I don't think there is any simple answer to the question of which data to include. It requires a judgement call.

David Booth

Reply via email to