Hi Alessio,
On Wed, May 27, 2015 at 1:05 PM, Alessio Palmero Aprosio < ziorufu...@gmail.com> wrote: > Dear all, > I have a couple of questions on DBpedia Spotlight. > > - What means "n-best candidates"? I tried to check it, thinking to get > as an answer a list of entities for each span, but I obtain only one > (with a confidence-like score). > It is the list of candidate entities that will be disambiguated > > - What is the "confidence"? I thought that the value shown with the > "n-best candidates" was the confidence, but I obtain this weird result, > as follows. > 1. > - Leave the text of the demo. > - Leave the default confidence (0.5). > - Check "n-best candidates". > - Annotate > You will see that the first word, "First" is not linked. > 2. > - Leave the text of the demo. > - Set the confidence to 0.1. > - Check "n-best candidates". > - Annotate > You will see that the first word, "First", now is linked to WWI with > confidence 1. > Is it normal? > The linking takes place in two stages : spotting & disambiguating. Spotting matches text to surface forms. Disambiguating chooses one topic among potential candidates (n-best candidates). As it currently stands the confidence parameter refers both to the spotter and the disambiguator. So that parameter is used to prune potential spots as well as potential topics. My guess is that by lowering the confidence you allowed the spotter to get an extra surface form match ( "First"). > Best, > Alessio > > > > ------------------------------------------------------------------------------ > _______________________________________________ > Dbpedia-discussion mailing list > Dbpedia-discussion@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion >
------------------------------------------------------------------------------
_______________________________________________ Dbpedia-discussion mailing list Dbpedia-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion