Re: blog: semantic dissonance in uniprot

Bijan Parsia Wed, 25 Mar 2009 07:17:37 -0700

On 25 Mar 2009, at 10:41, Phillip Lord wrote:

"Michel_Dumontier" <[email protected]> writes:

And I'm trying to explain that there is no pragmatic reason to make

explicit the distinction between a biomolecule (and what we knowabout

it) and a database record (and what we know about the biomolecule)
unless they are actually different.  It just complicates things in a
wholly unnecessary way.

I've given a clear example. Where two databases exist, with tworecords,

which appear to be referring to the same (class of) molecules.

[snip]

This is the key example.

But there's the other key example, where one record exists whichappear to be referring to multiple entities (either by ambiguity orby composition). This is a generalization of your point about illdefinedness of the very idea of a gene.

To paraphase you (I think), introducing a resource in the latter casetakes you from 1 mapping problem to 2 mapping problems.

This is why the the Boothian line is quite naive. If it's just thecase that you have 1 (or more) records and a clear relationshipbetter the record(s) and the object described by the record, then itmay (or may not!, by often will) make sense to distinguish them andname each, esp. for the purpose of entity reconciliation, recordreconciliation, entity exploration, etc.

However, if you are forced to do so without a clear purpose, then youjust add more noise to the overall system. You are likely to makebrute errors and you are likely to make choices that conflict withthose motivated by different applications.

This is why clear empirical data is important. It's perfectlypossible to do harm (in aggregate) by following a rule intended toproduce good.


Cheers,
Bijan.

Re: blog: semantic dissonance in uniprot

Reply via email to