[ 
https://issues.apache.org/jira/browse/STANBOL-1116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rupert Westenthaler resolved STANBOL-1116.
------------------------------------------

    Resolution: Fixed

fixed with http://svn.apache.org/r1496359 (as part of STANBOL-1114)

> Filter Literals of suggested Entities based on Languages used for Lookups
> -------------------------------------------------------------------------
>
>                 Key: STANBOL-1116
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1116
>             Project: Stanbol
>          Issue Type: Sub-task
>          Components: Enhancement Engines
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>
> EntityLinking uses two languages to lookup Entities:
> (1) the language of the current document (as detected by language detection)
> (2) the default mapping language (default: null ... labels without language 
> tag)
> In multi-lingual vocabularies (e.g. dbpedia or freebase) entities might 
> define literal values for a lot of languages (for freebase there might be 
> labels for more as 100 languages for some entities)
> Currently the EntityLinkingEngine includes labels of all languages in the 
> EnhancementResults. This has two disadvantages:
> (1) All values need to be provided by the EntitySearcher. This might require 
> to convert all those values to Clerezza RDF (such as in the case of the Solr 
> based EntitySearcher)
> (2) If dereferencing is activated a lot of additional literals (ant therefore 
> triples) are added to the Enhancement results. This has both a negative 
> impact for performance AND also the size of the Enhancement Results.
> This issue will adapt the EntiySearcher interface to allow specifying
> * selected fields
> * selected languages
> with all requests, where the languages used to query will always be included 
> to the parsed selected languages and the label field, type field and redirect 
> field will always be included in the selected fields - as those information 
> are required by the linking process itself.
> EntitySearcher implementation may ignore those configurations and return all 
> values for returned entities instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to