Rupert Westenthaler created STANBOL-1268:
--------------------------------------------

             Summary: Add option to the Lucene FST Linking Engine to use FST 
models for sub-languages
                 Key: STANBOL-1268
                 URL: https://issues.apache.org/jira/browse/STANBOL-1268
             Project: Stanbol
          Issue Type: New Feature
          Components: Enhancement Engines
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler


Entities in vocabularies might use labels in country-specific languages (e.g. 
"Organisation"@en-GB and "Organization"@en-US).

When enhancing an English-language text that mentions "Organization", the 
mention would not get linked to the entity: the language detector reports "en", 
but the entity does not provide a label for that language.

This feature will allow the FST linking engine to use FST models for 
sub-languages (languages that start with {lang}-*) for linking.
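
The selection rule described above can be sketched as follows. This is only an 
illustration in Python, not the engine's actual code or configuration: the 
function name, the include_sub_languages flag and the list of available model 
languages are all hypothetical. It assumes that a sub-language is any tag 
starting with the detected language followed by "-".

```python
def select_fst_languages(detected, available, include_sub_languages=True):
    """Return the language tags whose FST models should be consulted.

    Hypothetical helper: 'available' stands in for the languages the
    engine has FST models for; it is not a Stanbol API.
    """
    detected = detected.lower()
    selected = []
    for tag in available:
        normalized = tag.lower()
        if normalized == detected:
            # Exact match with the detected language (e.g. "en").
            selected.append(tag)
        elif include_sub_languages and normalized.startswith(detected + "-"):
            # Sub-language of the detected language (e.g. "en-GB", "en-US").
            selected.append(tag)
    return selected


models = ["en", "en-GB", "en-US", "de", "de-AT"]
print(select_fst_languages("en", models))
# prints ['en', 'en-GB', 'en-US']
print(select_fst_languages("en", models, include_sub_languages=False))
# prints ['en']
```

With the option enabled, a text detected as "en" would also be matched against 
the en-GB and en-US models, so both "Organisation" and "Organization" labels 
become candidates for linking.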

Note: enabling this feature will have some influence on linking performance, as 
the engine needs to look up entities in additional FST models.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
