2011/11/18 Rupert Westenthaler <[email protected]>:
>
> We were also experimenting with using MoreLikeThis queries to disambiguate
> extracted Entities during the Semantic / NLP Hackathon at the Berlin
> Buzzwords [1]. Basically you use the current context within the enhanced text
> to perform a MLT query based on all occurrences of an Entity within Wikipedia.
> This would allow Stanbol Enhancement Engines to suggest Entities not only
> because of the label, type and the ranking, but also because of the context
> in the enhanced text. Maybe one could even use this to suggest related
> Entities that are not even mentioned in the text (similar to categories).
>
> As far as I can remember this will be based on intermediate results of the
> data set Olivier uses for his "Universal Topic Classification experiment". So
> if Olivier finishes his work on this there is a good chance that we will
> also have the required data to continue work on the Entity disambiguation.

Yes, definitely. A disambiguation engine would really improve the quality of
the named entity tagging.

--
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel
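
To make the idea in the quoted message concrete, here is a minimal sketch of context-based disambiguation. It is not Lucene's MoreLikeThis and not Stanbol code: it swaps in a plain TF-IDF cosine similarity as a stand-in for the MLT query, and the function names (`tokenize`, `rank_candidates`) and the shape of the input (one text snippet per candidate entity, standing in for the Wikipedia occurrences of that entity) are assumptions made purely for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def rank_candidates(context, candidate_contexts):
    """Rank candidate entities by similarity between the text surrounding
    a mention (context) and the text known for each candidate.

    candidate_contexts is a hypothetical mapping of entity name to a text
    snippet. Returns (name, score) pairs, best match first.
    """
    docs = {name: Counter(tokenize(text))
            for name, text in candidate_contexts.items()}
    n_docs = len(docs)

    # Document frequency: in how many candidate texts each term occurs.
    df = Counter()
    for counts in docs.values():
        df.update(counts.keys())

    def idf(term):
        # Smoothed inverse document frequency, so unseen terms are safe.
        return math.log((1 + n_docs) / (1 + df[term])) + 1.0

    def weigh(counts):
        return {term: count * idf(term) for term, count in counts.items()}

    def cosine(a, b):
        dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        if norm_a == 0.0 or norm_b == 0.0:
            return 0.0
        return dot / (norm_a * norm_b)

    query = weigh(Counter(tokenize(context)))
    scores = {name: cosine(query, weigh(counts))
              for name, counts in docs.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

For a mention of "Paris" one would pass the sentence around the mention as `context` and one snippet per candidate (for instance the city and a person of that name); the candidate whose text shares the most distinctive terms with the context comes first. A real engine would presumably run the query against an index of all Wikipedia occurrences rather than a single snippet per entity.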
