Re: scientific publishing task force update

William Bug Tue, 13 Jun 2006 06:55:58 -0700


Here, here!

I think Matthias is making a very important point here - one equallyimportant to efforts to define biological reality empirically fromthe ground up (semantic web and/or computational linguistic/NLPapproaches to distilling KR from the literature-base), as it is totop-down, highly ontology-centric approaches.

Ultimately, what both efforts are about is creating a formal,computable description of reality - biomedical reality when it comesto the range of problems being addressed from biomolecularinformatics on through clinically-oriented medical informaticsprojects. This computability includes implementations of what Iwould call the "low hanging fruit" of query resolution and dataintegration (highly intertwined problems), as well as more complexattempts to create a framework for biomedical reality to support wide-field, reasoning systems. I don't consider any problems across thisspectrum either more or less useful - or more or less ambitious, itjust the data integration & query resolution tasks due to theirrequiring slightly less ontological rigor are being addressed now ona large scale, whereas the reasoning applications right now tomaintain a narrow and explicit focus to be effective.

The issue of having a solid ontological foundation to work from andthe related issue of having a well defined, community-wide collectionof ontological relations (e.g., the OBO Relation ontology - http://genomebiology.com/2005/6/5/R46) is quite critical to efforts aimed atconstructing wide-field KR - for whatever purpose.

I completely agree with Philip's statement regarding the effect ofworking with a foundational ontology on biomedical KR efforts:


On Jun 13, 2006, at 5:39 AM, Phillip Lord wrote:

they tend to complicate some stages of ontology development, mostlynotably the first month when you have lots of biologists tearingtheir hair out trying to work out what a perjurant, continuant,sortal, self-standing kind is.

To my mind, I'm not certain it is either necessary or appropriate todrag the research biologist through this process - unless of coursethey want to go through it. I think at this late date, the field -if you will - of biomedical KR has matured to the point where - as inbioinformatic programming it is no longer a de facto expectation theresearcher will need to be an autodidact computer scientist (thereare degree granting programs targeting the mix of C.S. & biologyrequired to be a good bioinformatician) - there is a cadre ofbiologists emerging who have both a penchant and a talent for workingin the KR realm. Those folks do need to drag themselves through thedifficult slog of becoming Natural Philosophers. These folks can actas both effective KR researchers and asemissaries to the larger community of research biologists to ensurethe KR frameworks developed meet the Use Case-defined computationalneeds of the field, as well as helping the researcher "donate" theirknowledge to the growing KR framework.

I also agree with Philip's statement regarding the usefulness offoundational ontologies:


On Jun 13, 2006, at 5:39 AM, Phillip Lord wrote:

they help to ease the design of an ontology; you have more ideawhere concepts should go, so you can spend more time worrying aboutthe details of what ever you are
modeling and less about the big picture.

Here I would agree with the comments made by Matthias, though I wouldtake particular issue with the term "modeling" as it can trick somepeople into thinking the act of creating object models (a la OODsoftware design and/or use of UML formalisms or or XML to express themodels) has anything necessarily to do with KR expressions ofbiological reality. Modeling and KR tasks can share many goals incommon, but models tend to be tied to the application domain forwhich they were intended and have no de facto requirement torepresent reality. It of course is a good idea to be certain theontological development you do stays in sync with/compatible with therelevant data models in the domains you expect to map via ontologies- e.g., in the neuroimaging domain, you would want to stay compatiblewith data models and associated formats such as DICOM & NifTi.However practical a model may intend to be, however, from aphilosophical perspective they are much closer to Kant than they areto Husserl. ;-)


Where I would have to disagree with Philip is when he states:

On Jun 13, 2006, at 5:39 AM, Phillip Lord wrote:

It's not clear that an upper ontology actually brings significantvalue to the table. The claimed advantage of interoperabilitybetween ontologies is, to my mind, somewhat bogus; they only reallyallow interoperability when you are querying over the concepts inthe upper ontology.

Here I would agree with the sentiment I believe both Robert andMatthias have expressed. There are practical aspects of putting a(or THE) foundational ontology (and foundational relations) in placethat can make a very big difference in the computability of theresulting knowledge maps (aka association files, annotations, etc.).On the BIRN project, our work toward creating a KR framework for allof the neuroscience data we need to accommodate it has provenextremely liberating and effective to fix on the BFO. I believe sameis also true for both the Gene Ontology curation effort, as well asthe work on FuGO and PATO (the latter two being two ontologies we areinvesting heavily in on the BIRN KR efforts). I wouldn't want to putwords in the mouths of those much more experienced contributors toGO, FuGO, or PATO (please chime in if you have corrections orqualifications on this issue), and it certainly is true fixing on afoundational ontology can cause a lot of grief to curators,programmers, and researcher-users of the knowledge resource, but I'mpretty certain the long-term gains of doing so will be significant.

Some may think this has little to do with the more bottom-up approachto KR implicit in Semantic Web projects. I really don't believe thisto be the case. Though I completely agree SW approaches to KR willhelp to provide a more dynamic and fine-grained accuraterepresentation of biological knowledge, I would maintain theresulting triplet repositories will have much more longevity andapplicability to the field, if they are constructed in such as waythat promotes a convergence between the top-down and bottom-upapproach. Each approach can - and should - inform the KR frameworkconstructed by the other.

By way of examples relevant to the issue of neuroscience data KR, asI mentioned, we on the BIRN Ontology Task force have begun toconverge on use of BFO (and the OBO Relation ontology) not only foruse in the ontological efforts originating from BIRN, but also in ourselection of external ontologies we draw on (e.g., Neuro-FMA, FuGO,PATO, GO - not that all of these are yet fully BFO compliant).

There are also neuro-oriented KR projects making productive use ofDOLCE and SKOS. Matthias's Semantic Synapse project, for instance,uses both.


Just my $0.02

Cheers,
Bill




On Jun 13, 2006, at 8:29 AM, Matthias Samwald wrote:

 One small, but significant, dislike of the bio-ontology community
 for SUMO (as used by Solditova and King) is that it isn't really
 only an upper level. It strays into, for instance, stating a
 protein is a foodstuff. this, as you might suppose, causes
 biologists to laugh.
That is very true, and I think that the importance of having hugetop-level ontologies like SUMO or maybe Cyc is largely overrated.On the other hand, having very small and basic foundationalontologies (e.g. the most basic ontologies of the DOLCE liteontology, BFO or SKOS) is more important than most developers ofontologies seem to think. It is a great aid to the development ofinteroperable ontologies to have a common, basic framework ofclasses (e.g. physical-object, perdurant, quality) and properties(e.g. part-of, participant-in).These basic ontologies do not need to be large or complicated to beuseful (around 20 classes and properties are sufficient, I guess).Quite to the contrary, making these foundational ontologies toocomplicated would significantly decrease their usefulness.
//Matthias Samwald


Bill Bug
Senior Analyst/Ontological Engineer

Laboratory for Bioimaging  & Anatomical Informatics
www.neuroterrain.org
Department of Neurobiology & Anatomy
Drexel University College of Medicine
2900 Queen Lane
Philadelphia, PA    19129
215 991 8430 (ph)
610 457 0443 (mobile)
215 843 9367 (fax)


Please Note: I now have a new email - [EMAIL PROTECTED]







&lt;p&gt;This email and any accompanying attachments are confidential. This information 
is intended solely for the use of the individual to whom it is addressed. Any review, 
disclosure, copying, distribution, or use of this email communication by others is strictly 
prohibited. If you are not the intended recipient please notify us immediately by returning 
this message to the sender and delete all copies. Thank you for your 
cooperation.&lt;/p&gt;

Re: scientific publishing task force update

Reply via email to