One more observation. The interfaces might depend on whether the sense source includes proper nouns (entities) or not. For example, WordNet includes some small, but noticeable amount (~8000 if I'm not mistaken) of entities. It might be better to separate the two, it might be not - it depends. But the interfaces might depend on this assumption. And considering entities in WSD the situation becomes close (similar) to NER. It would be great if you take this into account and make assumptions explicit. It would be also great to discuss your findings in the state of the art of interfaces for WSD and sense (entity) sources.
Aliaksandr On 18 February 2015 at 14:39, Anthony Beylerian < anthonybeyler...@hotmail.com> wrote: > > > > Thank you for the feedback, I believe that having separate interfaces as > mentioned for sense provision and disambiguation would be a good idea. > We will try to survey the techniques and study the library further to > propose a first structure when possible. > Best, > > Anthony > > Subject: Re: Word Sense Disambiguation > > From: kottm...@gmail.com > > To: dev@opennlp.apache.org > > Date: Mon, 16 Feb 2015 16:48:48 +0100 > > > > On Mon, 2015-02-16 at 16:29 +0100, Aliaksandr Autayeu wrote: > > > Jörn, to avoid ambiguity in case you addressed me to propose a WSD > > > interface. I'd prefer Anthony to come up with a proposal, because he is > > > closer to the multiple WSD algorithms that would be nice to include in > the > > > analysis. > > > > Sorry, for being unclear, yes I addressed Anthony. But everybody who has > > an opinion is very welcome to join the discussion or propose something. > > > > Jörn > > > > >