Hi
Those properties are not available in the Felix Webconsole. You can
only configure them by using OSGi config files. The
EntityLinkingEngine simply has too many configuration parameters to
include them all in the form of the Felix Webconsole.
The best approach is to use the default configuration for the dbpedia
EntityhubLinkingEngine [1] as a template and adapt it to your needs,
e.g. by adding
enhancer.engines.linking.minFoundTokens=1
enhancer.engines.linking.minLabelScore=0.33
enhancer.engines.linking.minTextScore=0.33
enhancer.engines.linking.minMatchScore=0.2
You will also need to increase the value of
"enhancer.engines.linking.suggestions".
Note that you do NOT need to use the datatypes (e.g. {key}=I"1" for
Integer). The Engine is implemented in a way that it also supports
string values, as long as it can parse the expected numeric values from
the provided values.
The file name must follow the pattern
"org.apache.stanbol.enhancer.engines.entityhublinking.EntityhubLinkingEngine-{instance_name}.config".
You can use the Sling Fileinstaller to activate your configuration
file. Simply create the {stanbol-working-dir}/stanbol/fileinstall
directory and copy the config file into this directory.
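As a rough sketch of the steps above (not part of Stanbol), the following
Python script builds the file name for an instance, merges the keys from this
mail into the text of a template config, and writes the result into the
fileinstall directory below the Stanbol working directory. The helper names,
the "demo" instance name and the empty template are assumptions made only for
illustration. The value 20 for the suggestions is the one proposed earlier in
this thread. String values are written in double quotes, assuming the usual
Felix .config syntax.

```python
import tempfile
from pathlib import Path

# Component name taken from the file name pattern given in this mail.
ENGINE_PID = "org.apache.stanbol.enhancer.engines.entityhublinking.EntityhubLinkingEngine"

# Keys and values from this mail, kept as plain strings (no datatype prefix).
OVERRIDES = {
    "enhancer.engines.linking.minFoundTokens": "1",
    "enhancer.engines.linking.minLabelScore": "0.33",
    "enhancer.engines.linking.minTextScore": "0.33",
    "enhancer.engines.linking.minMatchScore": "0.2",
    "enhancer.engines.linking.suggestions": "20",
}


def config_file_name(instance_name):
    """Return '<component name>-<instance_name>.config'."""
    return f"{ENGINE_PID}-{instance_name}.config"


def merge_config(template_text, overrides):
    """Drop template lines for overridden keys, then append the overrides.

    Assumption: every overridden key occupies a single line in the template.
    """
    kept = [line for line in template_text.splitlines()
            if line.split("=", 1)[0].strip() not in overrides]
    added = [f'{key}="{value}"' for key, value in overrides.items()]
    return "\n".join(kept + added) + "\n"


def install_config(working_dir, instance_name, template_text, overrides):
    """Write the merged config into <working_dir>/stanbol/fileinstall."""
    target_dir = Path(working_dir) / "stanbol" / "fileinstall"
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / config_file_name(instance_name)
    target.write_text(merge_config(template_text, overrides), encoding="utf-8")
    return target


if __name__ == "__main__":
    # A temporary directory stands in for the real Stanbol working directory.
    # In practice template_text would be the content of the dbpedia default
    # config [1]; it is left empty here to keep the sketch self-contained.
    with tempfile.TemporaryDirectory() as working_dir:
        path = install_config(working_dir, "demo", "", OVERRIDES)
        print(path.name)
        print(path.read_text(encoding="utf-8"), end="")
```

Removing the template lines for the overridden keys avoids ending up with two
entries for "enhancer.engines.linking.suggestions" in the same file.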
best
Rupert
p.s. In my last mail I used outdated keys. The documentation on
the Stanbol website also listed the wrong keys. I have corrected this in
the meantime.
[1]
http://svn.apache.org/repos/asf/stanbol/trunk/data/defaultconfig/src/main/resources/config/org.apache.stanbol.enhancer.engines.entityhublinking.EntityhubLinkingEngine-dbpedia.config
On Tue, Dec 18, 2012 at 4:27 PM, Rafa Haro <[email protected]> wrote:
> Hi Rupert,
>
> In which revision is it possible to configure such parameters? We are
> working with revision 1421282 and I can't see these options in the Engine
> Configuration Dialogue.
>
> Regards
>
> On 18/12/12 16:21, Rupert Westenthaler wrote:
>
>> Hi Rafa
>>
>> To use the disambiguation engine you will need to tweak the parameters
>> for the EntityhubLinkingEngine. The relevant parameters are
>>
>> * Min Label Match Score
>>   "org.apache.stanbol.enhancer.engines.keywordextraction.minLabelMatchFactor"
>> * Min Matched Tokens
>>   "org.apache.stanbol.enhancer.engines.keywordextraction.minFoundTokens"
>>
>> see [1] for the documentation
>>
>> from the Documentation:
>>
>> If used in combination with a disambiguation Engine one might want to
>> consider suggesting Entities where only a single token of a multi-token
>> label matches. In such cases a configuration like Min Matched
>> Tokens=1 and Min Label Match Score <= 0.5 (e.g. 0.4) might be
>> considered. In such scenarios users will also want to considerably
>> increase the value for Max Suggestions (typically values > 10).
>>
>> I would suggest that you start off with "minLabelMatchFactor=0.33" and
>> "minFoundTokens=1". In addition I would set the number of suggestions
>> to ~20.
>>
>> best
>> Rupert
>>
>>
>> [1]
>> http://stanbol.apache.org/docs/trunk/components/enhancer/engines/entitylinking#entity-linker-configuration
>>
>> On Tue, Dec 18, 2012 at 3:46 PM, Rafa Haro <[email protected]> wrote:
>>>
>>> Hi all,
>>>
>>> I have been trying to use the disambiguation-mlt engine with the new
>>> EntityHub Linking Engine for Spanish. My goal is to link and disambiguate
>>> with any kind of entity within the EntityHub, not only with Named
>>> Entities. So, I have configured a new Enhancement Chain including only
>>> language detection, OpenNlpSentenceDetectionEngine,
>>> OpenNlpTokenizerEngine, EntityLinkingEngine and Disambiguation-mlt
>>> (installing the bundle version 0.10). After a few tests, the
>>> disambiguation engine is working but is not able to disambiguate
>>> anything. Removing the disambiguation engine from the Enhancement Chain,
>>> we have found out that only one candidate is given for each detected
>>> entity. Therefore I think that the disambiguation engine may be working
>>> fine but does not actually need to disambiguate anything, because only
>>> one candidate is being passed to it from the EntityHub linking engine.
>>>
>>> What could be happening? Our suggestions parameter is set to 5.
>>>
>>> Thanks. Regards
>>>
>>> This message should be regarded as confidential. If you have received
>>> this
>>> email in error please notify the sender and destroy it immediately.
>>> Statements of intent shall only become binding when confirmed in hard
>>> copy
>>> by an authorised signatory.
>>>
>>> Zaizi Ltd is registered in England and Wales with the registration number
>>> 6440931. The Registered Office is 222 Westbourne Studios, 242 Acklam
>>> Road,
>>> London W10 5JJ, UK.
>>
>>
>>
>
--
| Rupert Westenthaler [email protected]
| Bodenlehenstraße 11 ++43-699-11108907
| A-5500 Bischofshofen