2011/11/18 Rupert Westenthaler <[email protected]>:
>
> We were also experimenting with using MoreLikeThis queries to disambiguate 
> extracted Entities during the Semantic / NLP Hackathon at the Berlin 
> Buzzwords [1]. Basically, you use the current context within the enhanced text 
> to perform an MLT query based on all occurrences of an Entity within Wikipedia.
> This would allow Stanbol Enhancement Engines to suggest Entities not only 
> because of the label, type and the ranking, but also because of the context 
> in the enhanced text. Maybe one could even use this to suggest related 
> Entities that are not even mentioned in the text (similar to categories).
>
> As far as I can remember, this will be based on intermediate results of the 
> data set Olivier uses for his "Universal Topic Classification experiment". So 
> if Olivier finishes his work on this, there is a good chance that we will 
> also have the required data to continue work on the Entity disambiguation.

Yes, definitely. A disambiguation engine would really improve the
quality of the named entity tagging.
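
To make the idea concrete, here is a minimal sketch of context-based
disambiguation. It is not Lucene's MoreLikeThis and not Stanbol code: it
swaps in a plain TF-IDF cosine similarity as a stand-in, and it assumes
each candidate Entity is represented by the text of the passages that
mention it. The function names, the entity ids and the sample texts are
all made up for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document."""
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(context, candidates):
    """Rank candidate entities by similarity of their text to the context.

    `candidates` maps a (hypothetical) entity id to the concatenated text
    of the passages that mention this entity.
    """
    ids = list(candidates)
    docs = [tokenize(candidates[i]) for i in ids] + [tokenize(context)]
    vectors = tfidf_vectors(docs)
    query = vectors[-1]  # the context of the mention is the "query"
    scored = [(cosine(query, vec), i) for i, vec in zip(ids, vectors)]
    return sorted(scored, reverse=True)


# Toy data, invented for this example only.
candidates = {
    "Paris_(France)": "Paris is the capital of France on the Seine river "
                      "with the Eiffel Tower",
    "Paris_Hilton": "Paris Hilton is an American media personality "
                    "and hotel heiress",
}
context = "We visited Paris and saw the Eiffel Tower by the Seine"
ranking = rank_candidates(context, candidates)
for score, entity_id in ranking:
    print(f"{entity_id}: {score:.3f}")
```

Both candidates match the label "Paris", so the label alone cannot
separate them; only the surrounding words push the first one to the top.
A real engine would presumably combine this score with the existing
label, type and ranking signals rather than replace them.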

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel
