[ 
https://issues.apache.org/jira/browse/STANBOL-1040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rupert Westenthaler updated STANBOL-1040:
-----------------------------------------

    Description: 
EntityScores are used to represent the importance of an entity within a 
knowledge base.

This provider will rank

Administrative regions
==========

based on the Feature Code

* PC*: 1.0
* ADM1*: 0.5
* ADM2*: 0.25
* ADM3*: 0.125
* ADM4*: 0.062
* ADMD*: 0.062
* ADM5*: 0.031
* others: 0.062

Populated Places
======

based on the population

* population: the population of the current Entity
* maxPopulation: 10.000.000
* minPopulation: 1.000
* base = 10.000
* normPopulation: max(minPopulain,min(maxPopulation,population))/base
* ranking: ln(1+normPopulation)/ln(1+(maxPopulation/base))

Some Examples:

10m: 1.0
5m: 0.9
1m: 0.67
500k: 0.57
100k: 0.35
50k: 0.26
10k: 0.1
5k: 0.059
1k: 0.014

Other Feature Classes
==========

Those are currently not supported and will get an default value of 0.1 what is 
similar to an ADM3 and cities with a population of ~10.000

  was:
EntityScores are used to represent the importance of an entity within a 
knowledge base.

This provider will rank

* Administrative regions: 1.0
* Populated places based on the population
    * places > 1million : 1.0
    * places < 1million in the range [0.2, 1.0] based on the natural log of the 
population
    * places without population : 0.2
* all other feature classes: 0.3

    
> Simple Geonames EntityScoreProvider
> -----------------------------------
>
>                 Key: STANBOL-1040
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1040
>             Project: Stanbol
>          Issue Type: New Feature
>          Components: Entityhub
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>            Priority: Minor
>
> EntityScores are used to represent the importance of an entity within a 
> knowledge base.
> This provider will rank
> Administrative regions
> ==========
> based on the Feature Code
> * PC*: 1.0
> * ADM1*: 0.5
> * ADM2*: 0.25
> * ADM3*: 0.125
> * ADM4*: 0.062
> * ADMD*: 0.062
> * ADM5*: 0.031
> * others: 0.062
> Populated Places
> ======
> based on the population
> * population: the population of the current Entity
> * maxPopulation: 10.000.000
> * minPopulation: 1.000
> * base = 10.000
> * normPopulation: max(minPopulain,min(maxPopulation,population))/base
> * ranking: ln(1+normPopulation)/ln(1+(maxPopulation/base))
> Some Examples:
> 10m: 1.0
> 5m: 0.9
> 1m: 0.67
> 500k: 0.57
> 100k: 0.35
> 50k: 0.26
> 10k: 0.1
> 5k: 0.059
> 1k: 0.014
> Other Feature Classes
> ==========
> Those are currently not supported and will get an default value of 0.1 what 
> is similar to an ADM3 and cities with a population of ~10.000

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to