On Tue, Jun 2, 2015 at 2:53 PM, Rajan Shah <raja...@gmail.com> wrote: > I could setup the demo and look at the results. It's extremely powerful. > One quick question, what's the purpose of these bin files under resources > directory. > > bionlp2004-DNA-en.bin > bionlp2004-RNA-en.bin > bionlp2004-cell_line-en.bin > bionlp2004-cell_type-en.bin > bionlp2004-protein-en.bin
Those are the Open NLP models for Named Entity Extraction. best Rupert > > With best regards, > Rajan > > On Thu, May 28, 2015 at 3:52 AM, Rupert Westenthaler < > rupert.westentha...@gmail.com> wrote: > >> Hi Rajan, >> >> The demo never included any Java code. >> >> The module just provides configurations [1] and datafiles [2]. Those >> will be installed with the bundle using the Sling Installer and >> Stanbol DataFileProvider infrastructure when the bundle is installed. >> Note the <Install-Path> and <Data-Files> instructions configured for >> the maven-bundle-plugin in the pom.xml file. >> >> The demo also provides a shell script [3] the indexes eHealth related >> datasets and of corse the README explaining the demo >> >> best >> Rupert >> >> [1] >> http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/src/main/resources/config/ >> [2] >> http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/src/main/resources/datafiles/ >> [3] >> http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/index.sh >> >> On Wed, May 27, 2015 at 1:27 PM, <raja...@gmail.com> wrote: >> > Hi Rupert, >> > >> > Thanks a lot for the detailed answers. Let me play a little bit further >> before I ask additional follow-up questions. >> > >> > As far as demo is concerned, I am interested in eHealth demo as it >> covers lots of items from my questions. At present, the Java code for it is >> missing. Is it possible to restore Java code for eHealth demo in 0.12 >> branch? >> > >> > With best regards, >> > Rajan >> > >> > Sent from my iPhone >> > >> >> On May 27, 2015, at 6:42 AM, Rupert Westenthaler < >> rupert.westentha...@gmail.com> wrote: >> >> >> >> Hi >> >> >> >>> On Wed, May 27, 2015 at 5:31 AM, Rajan Shah <raja...@gmail.com> wrote: >> >>> Hi, >> >>> >> >>> As I am trying to get my hands around stanbol, I have couple general >> design >> >>> questions. >> >>> >> >>> *1. Enhancement Chain firing and results* >> >>> >> >>> How to find out which enhancement chain detected which entities? One >> way, I >> >>> could see that by adding/removing particular chain. Is it possible to >> just >> >>> enable it via logging within current code? >> >>> >> >>> For ex. >> >>> I have a chain categorized-linking and would like to find out whether >> this >> >>> chain fired and labeled entities properly >> >> >> >> A enhancement chain has 1..* enhancement engines. The engines create >> >> the annotations not the chain. So your question should be what engine >> >> is creating an annotation. This information is provided by the >> >> dc:creator and dc:contributor metadata of the enhancement. See also >> >> the documentation at [1] >> >> >> >>> >> >>> *2. Categorize entities differently* >> >>> >> >>> Is it possible to categorize your detected entities as something else? >> >>> i.e. other than People, Organizations or Places >> >>> >> >>> What steps one need to take in current framework to achieve the same? >> >> >> >> You can use the Custom NER Model Extraction Engine [2]. >> >> The models used in the documentation of this engine can be found at [3] >> >> >> >>> >> >>> *3. Domain specific modeling* >> >>> >> >>> Suppose, I have a small domain and various types of entities. I am >> >>> interested in >> >>> >> >>> a. analyzing various entities >> >>> b. linking them with other entities and find relations from >> dbpedia/freebase >> >>> c. infer interesting aspects using reasoning >> >>> >> >>> Is Stanbol the way to go or Marmotta? or Is it preferred to develop a >> >>> custom engine using Stanbol which uses internal components to perform >> all >> >>> of the above tasks? >> >> >> >> * Entity linking to your custom vocabulary in Stanbol >> >> * If you want to have your custom entities linked with >> >> dbpedia/freebase it is better to do that in the vocabulary. I think >> >> Google refine provided reconciliation to freebase. that could be >> >> definitely an option. >> >> * If you want to find additional entities contained in >> >> freebase/dbpedia configuring an other entity linking in Stanbol makes >> >> complete sense. >> >> >> >> Not sure what you mean with "infer interesting aspects using reasoning". >> >> >> >>> >> >>> *4. Enhance detected entities by annotation* >> >>> >> >>> Suppose, opennlp-ner detected an entity xyz. If I want to annotate this >> >>> entity with additional attributes/fields using different custom >> >>> vocabularies, what are the dev. steps I need to take? >> >>> >> >> >> >> If you just want to link Named Entities with a controlled vocabulary >> >> you can use the FST linking engine [4] with the Linking Mode set to >> >> NER (read the Linking Mode of the engines documentation). In short you >> >> will want to configure a "Apache Stanbol Enhancer Engine: FST Linking: >> >> Named Entities" for the vocabulary you want to link against. >> >> >> >> >> >>> *5. Previous demo project(s)* >> >>> >> >>> At the same time, any luck with restoring demo project(s) within 0.12 >> >>> branch ? I believe, it demonstrates various aspects and it would be >> great >> >>> to have it restored. >> >>> >> >> >> >> I hope those are still functional in the 0.12 branch. No immediate >> >> plans to move them to 1.0.0 (mainly because of lack of time). >> >> Contributions are very welcome. >> >> >> >> Hope this helps >> >> best >> >> Rupert >> >> >> >>> Thanks in advance, >> >>> Rajan >> >> >> >> >> >> [1] >> http://stanbol.apache.org/docs/trunk/components/enhancer/enhancementstructure#fiseenhancement >> >> [2] >> https://stanbol.apache.org/docs/trunk/components/enhancer/engines/opennlpcustomner >> >> [3] >> http://svn.apache.org/repos/asf/stanbol/branches/release-0.12/demos/ehealth/src/main/resources/datafiles/ >> >> [4] >> http://stanbol.apache.org/docs/trunk/components/enhancer/engines/lucenefstlinking >> >> >> >> >> >> -- >> >> | Rupert Westenthaler rupert.westentha...@gmail.com >> >> | Bodenlehenstraße 11 ++43-699-11108907 >> >> | A-5500 Bischofshofen >> >> | REDLINK.CO >> .......................................................................... >> >> | http://redlink.co/ >> >> >> >> -- >> | Rupert Westenthaler rupert.westentha...@gmail.com >> | Bodenlehenstraße 11 ++43-699-11108907 >> | A-5500 Bischofshofen >> | REDLINK.CO >> .......................................................................... >> | http://redlink.co/ >> -- | Rupert Westenthaler rupert.westentha...@gmail.com | Bodenlehenstraße 11 ++43-699-11108907 | A-5500 Bischofshofen | REDLINK.CO .......................................................................... | http://redlink.co/