Rupert Westenthaler created STANBOL-1268:
--------------------------------------------
Summary: Add option to the Lucene FST Linking Engine to use FST
models for sub-languages
Key: STANBOL-1268
URL: https://issues.apache.org/jira/browse/STANBOL-1268
Project: Stanbol
Issue Type: New Feature
Components: Enhancement Engines
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
Entities in vocabularies might use country-specific languages (e.g.
"Organisation"@en-GB and "Organization"@en-US).
When enhancing an English-language text that mentions "Organization", the
mention is not linked to the entity, because the language detector reports
"en" but the entity does not provide a label for that language.
This feature will allow the FST linking engine to use FST models for
sub-languages (languages that start with {lang}-*) for linking.
Note: enabling this feature is expected to add some overhead to linking, as
the engine needs to look up entities in additional FST models.
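
The selection of sub-language models described above could look roughly like
the following sketch. It is only an illustration of the {lang}-* matching
rule, not the engine's actual code: the class SubLanguageSelector, the method
selectModelLanguages and the includeSubLanguages flag are hypothetical names,
and the sketch assumes that the available FST models are identified by plain
language tags such as "en" or "en-GB".

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of the Stanbol API.
final class SubLanguageSelector {

    /**
     * Returns the language tags whose FST models should be consulted for a
     * text in the detected language. With includeSubLanguages enabled, tags
     * of the form {lang}-* are selected in addition to {lang} itself.
     */
    static List<String> selectModelLanguages(String detected,
            Iterable<String> available, boolean includeSubLanguages) {
        String base = detected.toLowerCase(Locale.ROOT);
        // Require the '-' separator so that "en" does not match "eng".
        String prefix = base + "-";
        List<String> selected = new ArrayList<>();
        for (String lang : available) {
            String normalized = lang.toLowerCase(Locale.ROOT);
            if (normalized.equals(base)
                    || (includeSubLanguages && normalized.startsWith(prefix))) {
                selected.add(lang); // keep the tag as originally configured
            }
        }
        return selected;
    }
}
```

Every additional tag returned here means one more FST model to search, which
is where the extra linking cost comes from.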