[ 
https://issues.apache.org/jira/browse/OPENNLP-579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joern Kottmann closed OPENNLP-579.
----------------------------------
    Resolution: Fixed

> Framework to dynamically link N-best matches from external data to named 
> entities by type (EntityLinker framework)
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: OPENNLP-579
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-579
>             Project: OpenNLP
>          Issue Type: Wish
>          Components: Entity Linker
>    Affects Versions: 1.6.0
>         Environment: Any
>            Reporter: Mark Giaconia
>            Assignee: Joern Kottmann
>            Priority: Minor
>              Labels: features
>             Fix For: 1.6.0
>
>         Attachments: entitylinker.properties, 
> opennlp.geoentitylinker.countrycontext.txt
>
>   Original Estimate: 672h
>  Remaining Estimate: 672h
>
> A framework for integrating/linking external data to named entities. For 
> instance, geocoding or georeferencing location entities to geonames gazateers 
> can be implemented as an EntityLinker. Initially created ticket to 
> specifically solve the georeferencing/geolocating/geotagging problem, but the 
> framework should allow linkage of any external data to any entity type. 
> Commercial applications that do this are expensive, and there are many free 
> gazateers one could use to create solutions with OpenNLP. 
> UPDATE: The current implementation of the GeoEntityLinker uses Lucene to 
> store the Gazateers, and provides utils for indexing them. The impl returns 
> lat, long (and other gaz fields) for toponyms extracted with NER.
> All extracted toponyms are scored in four ways: fuzzy string matching, 
> binning by location, context modeling, and country-mention proximity. These 
> scores enable a good means of deciding what's worth keeping from the gaz.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to