Rupert Westenthaler created STANBOL-1219:
--------------------------------------------
Summary: Add option the Entity Co-Mention engine to adapt the
confidence of existing Suggestions
Key: STANBOL-1219
URL: https://issues.apache.org/jira/browse/STANBOL-1219
Project: Stanbol
Issue Type: Bug
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
Fix For: 0.12.0
The Entity Co-Mention engine detect repeating mentions of already detected
Entities. The typical example is
Barack Obama stated [..] Obama also [..]
But as there exists a City called Obama [1] in Japan an EntityLinking engine
linking against a vocabulary containing both Barack Obama AND Obama will link
the mention of Obama with the City.
The Co-Mention engine will however detect that the mention of Obama is most
likely a co-mention of Barack Obama. Therefore it should also be able to adapt
the confidence values of existing suggestion for Obama.
This issue will introduce a new configuration property for the Entity
Co-Mention Engine
enhancer.engines.comention.adjustExistingConfidence
This property will take values in the range [0..1). Confidence values of
existing suggestions will be multiplied with '1-{value}'. Meaning that
configuring '0.0' will not change existing confidence values (deactivate this
features).
The default value will be set to '0.33'.
This change will be applied to both the trunk and the 0.12 releasing branch
[1] http://en.wikipedia.org/wiki/Obama,_Fukui
--
This message was sent by Atlassian JIRA
(v6.1#6144)