Re: NeuroNames [was: slides for the UMLS presentation]

chris mungall Tue, 06 Jun 2006 11:45:35 -0700


Hi Bill

Just a minor clarification - the neurodevelopment ontology will not be distinct from GO; it will be part of the GO biological process ontology (and thus part of the OBO Foundry) and available as OWL.


Cheers
Chris

On Jun 6, 2006, at 7:57 AM, William Bug wrote:

Oops -

I forgot to mention the following:
There is an upcoming meeting at the Jackson Labs (next Wed - Fri) hosted by Judy Blake of MGI on behalf of the Gene Ontology Consortium. The work will focus on vetting/extending a neurodevelopment ontology they have begun to work on, to be placed in the OBO Foundry.
Hopefully, a file will be available in RDF/OWL format at the OBO site within the next month or so.
Cheers,
Bill

On Jun 6, 2006, at 10:41 AM, William Bug wrote:
Hi All,
Sorry - I'd thought I'd already subscribed to this list, but apparently not - until now.
The need for a mereotopologically-sound, neuroanatomical ontology is quite pressing across the community of neuroscientists involved in neuroinformatics projects, most of which include a neuroimaging component. Generally there is only one thing neuroscientists are interested in when analyzing images at whatever resolution, from the macromolecular (EM) on up to the macroscopic - i.e., identifying biologically relevant shapes. In order for these shapes to have any meaning in a context where one attempts to pool data and perform relevant data reduction operations, the shapes must exist within a shared coordinate space of some sort. For instance, if two separate labs are examining the change in the size of the Substantia Nigra during the course of Parkinsonian neurodegeneration, in order for them to compare their observations, they require several data integration/semantic frameworks:
        - a shared neuroanatomical terminology
        - a shared coordinate space (to place the shapes from their images in a comparable coordinate framework)
        - a shared, well-founded anatomical ontology which encapsulates mereotopological knowledge about shapes in - at least - 3D space.
Other knowledge resources can be helpful in supplementing this array of tools, but, generally, these are the absolute minimum.
[NOTE: Wikipedia has a moderately clear definition of mereotopology (http://en.wikipedia.org/wiki/Mereotopology). Basically, it combines a formal, ontological theory of parts and wholes (mereology) with the mathematics of topology (boundaries and connection), with the goal of providing a computational formalism to support applying logical operations to objects in space. As has been pointed out by others, a great deal of the work in this field of applied biomedical mereotopology derives from related work in the GIS field. Use of mereotopology by geographers has been going on for quite some time and is much more advanced. Work from GIS can be adapted for use in the biomedical domain, but it must be done with great care, as many of the assumptions behind the way researchers represent space and the manner of information being represented can differ significantly across these disciplines.]
The same is true as you scale this problem up to field-wide projects such as BIRN or The NeuroCommons.
As several have mentioned in this thread, there are already existing resources that can begin to fill this need.
1) NeuroNames
Kei, Olivier, Peter Mork, and others have already given sufficient references on NeuroNames in this thread, so that others can dig in deeper to the specifics if they like.
Having worked with Doug Bowden, Mark Dubach, and their colleagues over the last year or so in an advisory capacity on the specific issue of use of NeuroNames for semantically-based, neuroanatomical data set integration, I can add a few important qualifying points:
a) Doug et al. have been working on the extremely difficult task of unifying neuroanatomical terminologies across mammalian species for 20 years now. Embedded in NeuroNames & BrainInfo, there is a wealth of hard won empirical knowledge related to how one achieves this end. I think it would be ill-advised to try to duplicate their effort, as the myriad scientific problems related to this effort would surely present themselves again and only need to be worked out once.
b) Doug et al. are extremely collegial and quite receptive to feedback and collaboration - within the bounds of their limited resources.
c) NeuroNames is a terminological resource - not a well-founded, spatial ontology of brain anatomy capable of supporting mereotopological reasoning. As with most research-based terminologies, there are many semantically-based relations embedded in the NeuroNames graphs, but as the primary goal of NN is to disambiguate and integrate across the neuroanatomical lexicon, the embedded semantic information can often lead to a logical dead end. For instance, many neuroanatomical terms critical to specifying location in the rodent brain have been placed in the NN category "ancillary terms," as they don't fit into the core hierarchy in an unambiguous way. This can make use of NN for annotating mouse brain gene & protein expression patterns (e.g., GENSAT, the Allen Brain Atlas, various BIRN projects) extremely problematic.
d) The NN primary structures (http://braininfo.rprc.washington.edu/indexabout.html) provide the closest thing to an ontology in NN. As Peter Mork pointed out, there has been an effort in the past to unite this core NN hierarchy with the FMA, which does provide a mereotopologically sound framework for anatomy.
Barry Smith (formal ontologist who has worked for over a decade on problems in biomedical ontology - most especially, though hardly exclusively, in the area of mereotopological reasoning) and his colleagues have worked closely with Cornelius Rosse and his colleagues at the FMA project to create, in association with the work started in the FMA, a foundational ontology for biomedicine (the Ontology of Biological Reality) that is becoming increasingly important to all of the ontologies being monitored by NCBO and incorporated into the OBO site and the emerging OBO Foundry (http://obofoundry.org/).
e) Doug and his colleagues have worked closely with Jack Park (a consulting scientist to SRI's AI Center - http://www.ai.sri.com/) to represent NN as a TopicMap (XTM). As many on this list may know, there has been a moderate amount of effort to integrate and/or reconcile XTM with RDF here at the W3C (search on "TopicMaps" at the main RDF page - http://www.w3.org/RDF/). I'm not certain how this effort will ultimately make NN more "semantic web" compliant, but the bottom line is a great deal of effort has already been expended to express NN in a semantically well-grounded formalism.
f) As Don points out, neuroanatomical representations are likely to evolve significantly over the coming decades, as the number of large scale gene & protein expression characterization studies focussed on the brain continues to accumulate. Having said that, the "conventional" view of neuroanatomy will likely remain relevant for a long while to come, not only because it has been used to characterize findings in the literature for the last 125+ years, but also because it did derive from a wealth of empirical observation which is likely to remain valid in many domains of neuroanatomical study. I would also modify Don's well informed comment regarding the derivation of "conventional" views of neuroanatomy.
To a large extent they are related to functional studies of the brain - as well as lesion based studies of functional deficits dating back to the 19th century (think "Broca's Area"), but they are also very much based on a study of the morphology of the brain - both the external surface morphology (sulci, gyri, and lobes), as well as histological examination of internal structures. Many of these studies of structure in space are likely to stay with us for some time to come (and are well-founded in reality), though as Tim Clark & Don have pointed out in this thread, nomenclature is still a very significant problem even in this very "old" field.
g) licensing of NN - Doug et al. formerly had a completely open policy to distributing NN. The only reason a license was instituted was that at some point about 5 years back another group sucked down the entirety of NN, reworked a lot of what was there - probably with very practical goals directed toward making NN more "correct" and effective in their problem domain - then "republished" their product as "NeuroNames". This led to a great deal of confusion. The fact they chose to do this on the sly also meant the work they did was not necessarily compatible with the work done by Doug et al. In order to avoid this happening again, it was decided a license would be established to discourage this sort of behavior. As anyone who has developed a terminology and/or ontology knows, it is absolutely essential there remain a single curating authority, if the value of the resource is to remain intact. The "vetting" performed by the central authority - as is extensively done by the curators of the Gene Ontology, for instance - is absolutely essential to guaranteeing the integrity of the knowledge resource. This is not a "closed" or proprietary process, just a highly controlled one. Unfortunately, Doug Bowden's resources are MUCH MUCH smaller than those available to the curators/developers of GO, so the NN curation effort necessarily moves at a slower pace.
2) Working with the Neuroscience community
As Kei, Don, and others have stated, it would be unwise to proceed in creating an "open source" neuroanatomical ontology without interacting with the researchers who've already put a lot of effort into this problem over the past decade or so. With this in mind, I have several suggestions:
        a) The 5 ways of knowing neuroanatomy:
This is a pitch I've been making which I think helps to sum up the current ways various sub-fields have attempted to identify/label/collate brain morphology:
                i) Terminologies - e.g., NN, BrainLex
                ii) Ontologies - e.g., Neuro-FMA (the project Peter Mork referred to)
                iii) Literature Informatics (CocoMac, BrainMap, NeuroScholar, BAMS, ArrowSmith, etc.):
These are very mature projects. Some include their own mereotopological reasoning systems (e.g., CocoMac and BrainMap) in order to be able to pool and compare the relatedness of structures and connectivity across different studies in the literature. The goal in this category is to perform large-scale semantic mining of the literature to confirm/refute current knowledge and uncover new correlations - very much along the lines of what The NeuroCommons Project expects to achieve via use of semantic web technologies. Some researchers in this category are actually participating in The NeuroCommons Project (i.e., Gully Burns, who developed NeuroScholar).
                iv) voxel/pixel analysis:
This approach applies computer vision algorithms to automatically - or semi-automatically - identify 2D & 3D shapes in digital anatomical images. This field is also extremely mature, though there are many significant caveats to exactly how much of this work can be effectively automated.
                v) parameterized models:
Often these are derived from - or used to drive - the voxel/pixel based analysis described in 'iv' - though the spatial modeling is definitely a distinct approach from the pure voxel/pixel approach.
None of the studies you'd fit into these categories focus exclusively on their own technique/tool without some aspect of the other "ways of knowing neuroanatomy" playing a role in what they do. However, it is clear much fundamental work in this area primarily focuses on one technique over the others.
Having said that, when the neuroscience community makes use of this work to examine a specific biological problem, they will often draw significant tools and resources from more than one of these domains.
        b) NCBO/NCOR sponsored meeting focused on mereotopology in neuroanatomy:
Barry Smith is working to bring together researchers working in the 5 domains described above. There is a very pressing need in large-scale, field-wide neuroinformatics projects such as what is being done in the BIRN project to have these 5 domains converge and work more cooperatively. Right now, a lot of manual effort has to be put out to bring them together. This is something BIRN has been pursuing. In the last 6 months, we have received a great deal of support and guidance on this effort from NCBO. Daniel Rubin interacts directly with the BIRN Ontology Task Force, and the work Barry Smith has been doing with FMA, OBO, FuGO, and PATO has very much begun to create a much more well-founded and computable path toward performing large-scale annotation of neuroimaging data.
This meeting is on the NCBO/NCOR slate for 2007, but in the interim I hope to see more effort invested in the coming year across the 5 communities listed above toward the goal of integrating across these "ways of knowing" now that the need has been recognized.
                        
3) Microarrays:
Just as Don, Kei, Alan R., and others have pointed out, high-throughput assays - microarrays, BAC-based IHC, in situ studies using the Gene Paint technology employed by the Allen Institute for Brain Science to construct the Allen Brain Atlas of gene expression in the brain - are going to transform our understanding of neuroanatomy over the coming decades. This is just a given. There is a pressing need to derive a means to integrate spatially-mapped studies of gene & protein expression into a neuroimaging setting. The spatial resolution may be very coarse - e.g., "whole brain" - but they still provide sufficient spatial information to be usable in the context of a neuroanatomical coordinate system. We are working in the BIRN project to create a means for researchers to integrate these distinct approaches to studying the brain. As Alan R. pointed out, FuGO is working to put description of microarray experiments on a solid, formal footing, and I would expect one aspect of that will be to represent microarray data in RDF/OWL. This is not a trivial problem, given that much of the available data is merely MIAME-compliant - MIAME not even being a data format, but just a collection of minimal data requirements. One need only look at the great complexity of the data submission process at the NCBI GEO site to get an appreciation for how difficult this problem can be. A great deal of effort is being invested in the microarray field to come up with a better means to handle this issue, and the FuGO effort will be a critical clearinghouse for this work. The important thing to remember when it comes to field-wide data pooling and re-analysis is that it may sometimes be necessary to get right back to the microarray primary image files so as to reapply different criteria when performing the statistical tests and reductions on pooled data.
Given this requirement - one we also see in the neuroimaging domain - I believe it is very important to proceed in a well-reasoned manner when seeking to integrate across microarray datasets using semantic web technologies. Alan R. and I - possibly others on this list too - are on the FuGO Coordinators Committee, so hopefully we can help to keep those lines of communication open.
Sorry to go on so, but this is a topic on which I've labored quite intensively over the past year. There is a lot being done on this issue, and I think all efforts will get much further, more quickly - and in a way that will carry more street cred with practicing neuroscientists - if we all try to work together.
Cheers,
Bill

Bill Bug
Senior Analyst/Ontological Engineer

Laboratory for Bioimaging  & Anatomical Informatics
www.neuroterrain.org
Department of Neurobiology & Anatomy
Drexel University College of Medicine
2900 Queen Lane
Philadelphia, PA    19129
215 991 8430 (ph)
610 457 0443 (mobile)
215 843 9367 (fax)


Please Note: I now have a new email - [EMAIL PROTECTED]
This email and any accompanying attachments are confidential. This information is intended solely for the use of the individual to whom it is addressed. Any review, disclosure, copying, distribution, or use of this email communication by others is strictly prohibited. If you are not the intended recipient please notify us immediately by returning this message to the sender and delete all copies. Thank you for your cooperation.
