Hi Sergio

This is exactly i did and i mentioned in my last email

*"What i understand is to enable option "Link ProperNouns only" in
entityhub linking and also to use "opennlp-pos" engine in my weighted chain"
*

I have already checked this option in my own entity hub linking engine

By the way, did you get a chance to look at files i have shared in google
drive folder. Did you notice any problems there ?

I think using custom ontology with stanbol should be a very common use case
and if there are issues getting it working, either i am doing something
terribly wrong or there are some other reasons which i dont know.

But anyways, i am persisting to solve this issue and any help on this from
this dev community will be much appreciated

best regards
tarandeep



On Mon, Jul 15, 2013 at 9:49 PM, Sergio Fernández <
sergio.fernan...@salzburgresearch.at> wrote:

> http://{stanbol}/system/**console/configMgr sorry
>
>
> On 15/07/13 18:15, Sergio Fernández wrote:
>
>> Have you check the
>>
>> 1) go to http://{stanbol}/config/**system/console/configMgr
>>
>> 2) find your EntityHub Linking engine
>>
>> 3) and then "Link ProperNouns only"
>>
>> The documentation in that configuration is quite useful I think:
>>
>> "If activated only ProperNouns will be matched against the Vocabulary.
>> If deactivated any Noun will be matched. NOTE that this parameter
>> requires a tag of the POS TagSet to be mapped against 'olia:PorperNoun'.
>> Otherwise mapping will not work as expected.
>> (enhancer.engines.linking.**properNounsState)"
>>
>> Hope this help. You have to take into account such kind of issues are
>> not easy to solve by email.
>>
>> Cheers,
>>
>> On 15/07/13 16:31, Sawhney, Tarandeep Singh wrote:
>>
>>> Thanks Sergio for your response
>>>
>>> What i understand is to enable option *"Link ProperNouns only"* in
>>> entityhub linking and also to use "opennlp-pos" engine in my weighted
>>> chain
>>>
>>> I did these changes but unable to extract "University of Salzberg"
>>>
>>> Please find below the output RDF/XML from enhancer
>>>
>>> Request you to please let me know if i did not understand your inputs
>>> correctly
>>>
>>> One more thing, in our ontology (yet to be built) we will have entities
>>> which are other than people, places and organisations. For example,
>>> belts,
>>> bags etc
>>>
>>> best regards
>>> tarandeep
>>>
>>> <rdf:RDF
>>>      
>>> xmlns:rdf="http://www.w3.org/**1999/02/22-rdf-syntax-ns#<http://www.w3.org/1999/02/22-rdf-syntax-ns#>
>>> "
>>>      xmlns:j.0="http://purl.org/dc/**terms/ <http://purl.org/dc/terms/>"
>>>      
>>> xmlns:j.1="http://fise.iks-**project.eu/ontology/<http://fise.iks-project.eu/ontology/>"
>>> >
>>>    <rdf:Description
>>> rdf:about="urn:enhancement-**197792bf-f1e8-47bf-626a-**3cdfbdb863b3">
>>>      <j.0:type 
>>> rdf:resource="http://purl.org/**dc/terms/LinguisticSystem<http://purl.org/dc/terms/LinguisticSystem>
>>> "/>
>>>      <j.1:extracted-from
>>> rdf:resource="urn:content-**item-sha1-**3b2998e66582544035454850d2dd81**
>>> 755b747849"/>
>>>
>>>      <j.1:confidence
>>> rdf:datatype="http://www.w3.**org/2001/XMLSchema#double<http://www.w3.org/2001/XMLSchema#double>
>>> ">0.**9999964817340454</j.1:**confidence>
>>>
>>>      <rdf:type
>>> rdf:resource="http://fise.iks-**project.eu/ontology/**Enhancement<http://fise.iks-project.eu/ontology/Enhancement>
>>> "/>
>>>      <rdf:type
>>> rdf:resource="http://fise.iks-**project.eu/ontology/**TextAnnotation<http://fise.iks-project.eu/ontology/TextAnnotation>
>>> "/>
>>>      <j.0:language>en</j.0:**language>
>>>      <j.0:created
>>> rdf:datatype="http://www.w3.**org/2001/XMLSchema#dateTime<http://www.w3.org/2001/XMLSchema#dateTime>
>>> ">**2013-07-15T14:25:43.829Z</j.0:**created>
>>>
>>>      <j.0:creator
>>> rdf:datatype="http://www.w3.**org/2001/XMLSchema#string<http://www.w3.org/2001/XMLSchema#string>
>>> ">**org.apache.stanbol.enhancer.**engines.langdetect.**
>>> LanguageDetectionEnhancementEn**gine</j.0:creator>
>>>
>>>    </rdf:Description>
>>> </rdf:RDF>
>>>
>>>
>>>
>>> On Mon, Jul 15, 2013 at 7:32 PM, Sergio Fernández <
>>> sergio.fernandez@**salzburgresearch.at<sergio.fernan...@salzburgresearch.at>>
>>> wrote:
>>>
>>>  As I said: have you check the proper noun detection and POS tagging in
>>>> your chain?
>>>>
>>>> For instance, enhancing the text "I studied at the University of
>>>> Salzburg,
>>>> which is based in Austria" works at the demo server:
>>>>
>>>> http://dev.iks-project.eu:****8081/enhancer/chain/dbpedia-****
>>>> proper-noun<http://dev.iks-**project.eu:8081/enhancer/**
>>>> chain/dbpedia-proper-noun<http://dev.iks-project.eu:8081/enhancer/chain/dbpedia-proper-noun>
>>>> >
>>>>
>>>>
>>>> Here the details:
>>>>
>>>> http://stanbol.apache.org/****docs/trunk/components/****
>>>> enhancer/engines/**<http://stanbol.apache.org/**docs/trunk/components/**enhancer/engines/**>
>>>> entitylinking#proper-noun-****linking-****
>>>> wzxhzdk14enhancerengineslinkin****
>>>> gpropernounsstatewzxhzdk15<htt**p://stanbol.apache.org/docs/**
>>>> trunk/components/enhancer/**engines/entitylinking#proper-**
>>>> noun-linking-**wzxhzdk14enhancerengineslinkin**
>>>> gpropernounsstatewzxhzdk15<http://stanbol.apache.org/docs/trunk/components/enhancer/engines/entitylinking#proper-noun-linking-wzxhzdk14enhancerengineslinkingpropernounsstatewzxhzdk15>
>>>> >
>>>>
>>>>
>>>> Cheers,
>>>>
>>>>
>>>>
>>>> On 15/07/13 15:27, Sawhney, Tarandeep Singh wrote:
>>>>
>>>>  Just to add to my previous email
>>>>>
>>>>> If i add another individual in my ontology "MyUniversity" under class
>>>>> University
>>>>>
>>>>>
>>>>>
>>>>>       <!--
>>>>> http://www.semanticweb.org/****vi5/ontologies/2013/6/**<http://www.semanticweb.org/**vi5/ontologies/2013/6/**>
>>>>> untitled-ontology-13#****MyUniversity--<http://www.**
>>>>> semanticweb.org/vi5/**ontologies/2013/6/untitled-**
>>>>> ontology-13#MyUniversity--<http://www.semanticweb.org/vi5/ontologies/2013/6/untitled-ontology-13#MyUniversity-->
>>>>> >
>>>>>
>>>>>
>>>>>>
>>>>>       <owl:NamedIndividual rdf:about="
>>>>> http://www.semanticweb.org/****vi5/ontologies/2013/6/**<http://www.semanticweb.org/**vi5/ontologies/2013/6/**>
>>>>> untitled-ontology-13#****MyUniversity<http://www.**
>>>>> semanticweb.org/vi5/**ontologies/2013/6/untitled-**
>>>>> ontology-13#MyUniversity<http://www.semanticweb.org/vi5/ontologies/2013/6/untitled-ontology-13#MyUniversity>
>>>>> >
>>>>>
>>>>> ">
>>>>>           <rdf:type rdf:resource="
>>>>> http://www.semanticweb.org/****vi5/ontologies/2013/6/**<http://www.semanticweb.org/**vi5/ontologies/2013/6/**>
>>>>> untitled-ontology-13#****University<http://www.**semanticweb.org/vi5/*
>>>>> *ontologies/2013/6/untitled-**ontology-13#University<http://www.semanticweb.org/vi5/ontologies/2013/6/untitled-ontology-13#University>
>>>>> >
>>>>>
>>>>> "/>
>>>>>           <rdfs:label>MyUniversity</****rdfs:label>
>>>>>       </owl:NamedIndividual>
>>>>>
>>>>>
>>>>> So with all configurations i have mentioned in the word document (in
>>>>> google
>>>>> drive folder), when i pass text with "MyUniversity" in it, my
>>>>> enhancement
>>>>> chain is able to extract "MyUniversity" and link it with
>>>>> "University" type
>>>>>
>>>>> But same set of configurations doesn't work with individual
>>>>> "University of
>>>>> Salzburg"
>>>>>
>>>>> If anyone of you please provide help on what are we missing to be
>>>>> able to
>>>>> extract custom entities which has space in between, will be a great
>>>>> help
>>>>> to
>>>>> proceed further on our journey with using and contributing to stanbol
>>>>>
>>>>> with best regards,
>>>>> tarandeep
>>>>>
>>>>>
>>>>>
>>>>> On Mon, Jul 15, 2013 at 5:57 PM, Sawhney, Tarandeep Singh <
>>>>> tsawh...@innodata.com> wrote:
>>>>>
>>>>>   Thanks Sergio and Dileepa for your responses
>>>>>
>>>>>>
>>>>>> We haven't been able to resolve the issue. We therefore decided to
>>>>>> keep
>>>>>> just one class and one instance value "University of Salzburg" in our
>>>>>> custom ontology and try to extract this entity and also link it but we
>>>>>> could not get this running. I am sure we are missing some
>>>>>> configurations.
>>>>>>
>>>>>> I am sharing a google drive folder at below link
>>>>>>
>>>>>> https://drive.google.com/****folderview?id=0B-**<https://drive.google.com/**folderview?id=0B-**>
>>>>>> vX9idwHlRtRFFOR000ZnBBOWM&usp=****sharing<https://drive.**
>>>>>> google.com/folderview?id=0B-**vX9idwHlRtRFFOR000ZnBBOWM&usp=**sharing<https://drive.google.com/folderview?id=0B-vX9idwHlRtRFFOR000ZnBBOWM&usp=sharing>
>>>>>> >
>>>>>>
>>>>>>
>>>>>> This folder has 3 files:
>>>>>>
>>>>>> 1) A word document which shows felix snapshots of what all
>>>>>> configurations
>>>>>> we did while configuring Yard, yardsite, entiy linking engine and
>>>>>> weighted
>>>>>> chain
>>>>>> 2) our custom ontology
>>>>>> 3) the result of SPARQL against our graphuri using SPARQL endpoint
>>>>>>
>>>>>> May i request you all to please look at these files and let us know
>>>>>> if we
>>>>>> are missing something in configurations.
>>>>>>
>>>>>> We have referred to below web links in order to configure stanbol for
>>>>>> using our custom ontology for entity extraction and linking
>>>>>>
>>>>>> http://stanbol.apache.org/****docs/trunk/customvocabulary.****html<http://stanbol.apache.org/**docs/trunk/customvocabulary.**html>
>>>>>> <http://stanbol.apache.**org/docs/trunk/**customvocabulary.html<http://stanbol.apache.org/docs/trunk/customvocabulary.html>
>>>>>> >
>>>>>>
>>>>>> http://stanbol.apache.org/****docs/trunk/components/**<http://stanbol.apache.org/**docs/trunk/components/**>
>>>>>> entityhub/managedsite<http://**stanbol.apache.org/docs/trunk/**
>>>>>> components/entityhub/**managedsite<http://stanbol.apache.org/docs/trunk/components/entityhub/managedsite>
>>>>>> >
>>>>>>
>>>>>>
>>>>>> http://stanbol.apache.org/****docs/trunk/components/****
>>>>>> enhancer/engines/**<http://stanbol.apache.org/**docs/trunk/components/**enhancer/engines/**>
>>>>>>
>>>>>> entityhublinking<http://**stanbol.apache.org/docs/trunk/**
>>>>>> components/enhancer/engines/**entityhublinking<http://stanbol.apache.org/docs/trunk/components/enhancer/engines/entityhublinking>
>>>>>> >
>>>>>>
>>>>>>
>>>>>> http://stanbol.apache.org/****docs/trunk/components/**<http://stanbol.apache.org/**docs/trunk/components/**>
>>>>>> enhancer/chains/weightedchain.****html<http://stanbol.apache.**
>>>>>> org/docs/trunk/components/**enhancer/chains/weightedchain.**html<http://stanbol.apache.org/docs/trunk/components/enhancer/chains/weightedchain.html>
>>>>>> >
>>>>>>
>>>>>>
>>>>>> Thanks in advance for your valuable help.
>>>>>>
>>>>>> Best regards
>>>>>> tarandeep
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Sat, Jul 13, 2013 at 5:57 PM, Sergio Fernández <
>>>>>> sergio.fernandez@**salzburgres**earch.at <http://salzburgresearch.at>
>>>>>> <sergio.fernandez@**salzburgresearch.at<sergio.fernan...@salzburgresearch.at>
>>>>>> >>
>>>>>>
>>>>>> wrote:
>>>>>>
>>>>>>   Hi,
>>>>>>
>>>>>>>
>>>>>>> I'm not an expert on entity linking, but from my experience such
>>>>>>> behaviour could be caused by the proper noun detection. Further
>>>>>>> details
>>>>>>> at:
>>>>>>>
>>>>>>> http://stanbol.apache.org/******docs/trunk/components/**<http://stanbol.apache.org/****docs/trunk/components/**>
>>>>>>> <http:**//stanbol.apache.org/**docs/**trunk/components/**<http://stanbol.apache.org/**docs/trunk/components/**>
>>>>>>> >
>>>>>>>
>>>>>>> enhancer/engines/****entitylinking<http://stanbol.****
>>>>>>> apache.org/docs/trunk/****components/enhancer/engines/****
>>>>>>> entitylinking<http://apache.org/docs/trunk/**components/enhancer/engines/**entitylinking>
>>>>>>> <http://stanbol.**apache.org/docs/trunk/**
>>>>>>> components/enhancer/engines/**entitylinking<http://stanbol.apache.org/docs/trunk/components/enhancer/engines/entitylinking>
>>>>>>> >
>>>>>>>
>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>> In addition, I'd like to suggest you to take a look to the
>>>>>>> netiquette in
>>>>>>> mailing lists. This is an open source community; therefore messages
>>>>>>> starting with "URGENT" are not very polite. Specially sending it on
>>>>>>> Friday
>>>>>>> afternoon, when people could be already out for weekend, or even on
>>>>>>> vacations.
>>>>>>>
>>>>>>> Best,
>>>>>>> Sergio
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On 12/07/13 15:54, Sethi, Keval Krishna wrote:
>>>>>>>
>>>>>>>   Hi,
>>>>>>>
>>>>>>>>
>>>>>>>> I am using stanbol to extract entitiies by plugging custom
>>>>>>>> vocabulary
>>>>>>>> as
>>>>>>>> per
>>>>>>>> http://stanbol.apache.org/******docs/trunk/customvocabulary.****
>>>>>>>> **html<http://stanbol.apache.org/****docs/trunk/customvocabulary.****html>
>>>>>>>> <http://stanbol.apache.**org/**docs/trunk/**customvocabulary.**html<http://stanbol.apache.org/**docs/trunk/customvocabulary.**html>
>>>>>>>> >
>>>>>>>>
>>>>>>>> <http://stanbol.apache.**org/**docs/trunk/**customvocabulary.**
>>>>>>>> html<http://stanbol.apache.**org/docs/trunk/**customvocabulary.html<http://stanbol.apache.org/docs/trunk/customvocabulary.html>
>>>>>>>> >
>>>>>>>>
>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>> Following are the steps followed -
>>>>>>>>
>>>>>>>>     Configured Clerezza Yard.
>>>>>>>>     Configured Managed Yard site.
>>>>>>>>     Updated the site by plugging ontology(containing custom
>>>>>>>> entities) .
>>>>>>>>     Configured Entity hub linking Engine(*customLinkingEngine*) with
>>>>>>>> managed
>>>>>>>> site.
>>>>>>>>     Configured a customChain which uses following engine
>>>>>>>>
>>>>>>>>       -  *langdetect*
>>>>>>>>       - *opennlp-sentence*
>>>>>>>>       - *opennlp-token*
>>>>>>>>       - *opennlp-pos*
>>>>>>>>       - *opennlp-chunker*
>>>>>>>>       - *customLinkingEngine*
>>>>>>>>
>>>>>>>> Now, i am able to extract entities like Adidas using *customChain*.
>>>>>>>>
>>>>>>>> However i am facing an issue in extracting entities which has
>>>>>>>> space in
>>>>>>>> between. For example "Tommy Hilfiger".
>>>>>>>>
>>>>>>>> Chain like *dbpedia-disambiguation *(which comes bundeled with
>>>>>>>> stanbol
>>>>>>>> instance) is rightly extracting entities like  "Tommy Hilfiger".
>>>>>>>>
>>>>>>>> I had tried configuring  *customLinkingEngine* same as *
>>>>>>>> dbpedia-disamb-linking *(configured in *dbpedia-disambiguation* )
>>>>>>>> but
>>>>>>>> it
>>>>>>>> didn't work to extract above entity.
>>>>>>>>
>>>>>>>> I have invested more than a week now and running out of options now
>>>>>>>>
>>>>>>>> i request you to please provide help in resolving this issue
>>>>>>>>
>>>>>>>>
>>>>>>>>   --
>>>>>>>>
>>>>>>> Sergio Fernández
>>>>>>> Salzburg Research
>>>>>>> +43 662 2288 318
>>>>>>> Jakob-Haringer Strasse 5/II
>>>>>>> A-5020 Salzburg (Austria)
>>>>>>> http://www.salzburgresearch.at
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>  --
>>>> Sergio Fernández
>>>> Salzburg Research
>>>> +43 662 2288 318
>>>> Jakob-Haringer Strasse 5/II
>>>> A-5020 Salzburg (Austria)
>>>> http://www.salzburgresearch.at
>>>>
>>>>
>>>
>>
> --
> Sergio Fernández
> Salzburg Research
> +43 662 2288 318
> Jakob-Haringer Strasse 5/II
> A-5020 Salzburg (Austria)
> http://www.salzburgresearch.at
>

-- 

"This e-mail and any attachments transmitted with it are for the sole use 
of the intended recipient(s) and may contain confidential , proprietary or 
privileged information. If you are not the intended recipient, please 
contact the sender by reply e-mail and destroy all copies of the original 
message. Any unauthorized review, use, disclosure, dissemination, 
forwarding, printing or copying of this e-mail or any action taken in 
reliance on this e-mail is strictly prohibited and may be unlawful."

Reply via email to