Rupert Westenthaler created STANBOL-1303:
--------------------------------------------

             Summary: Geonames LocationEnhancementEngine confidence values are 
not in the range [0..1]]
                 Key: STANBOL-1303
                 URL: https://issues.apache.org/jira/browse/STANBOL-1303
             Project: Stanbol
          Issue Type: Bug
    Affects Versions: 0.12.0, 1.0.0
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler
             Fix For: 1.0.0, 0.12.1


The Geonames.org service changed the value range of provided scores from 
[0..100] to [0..inv]. Because of that the engine does no longer report 
fise:confidence values in the range of [0..1].

Looking at the reported numbers one can assume that they do represent the 
relative confidence (similar as Solr scores).

For the normalization to [0..1] one could 
1. normalize relative to the result with the highest score
2. use the levenshtein distance between the mention in the text with the best 
matching label.

Until this gets fixed the unit tests for the engine will be deactivated.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to