I would also be extremely interested in an answer to this. Thanks for
asking, Stefano.
What's the best way to express "Spotlight's detection confidence" as a
single number?
Cheers,
Radim
---------- Original message ----------
From: Stefano Bocconi <[email protected]>
To: [email protected] <[email protected]>
Date: 16 June 2014 18:14:26
Subject: [Dbp-spotlight-users] How is the confidence value calculated?
"
Hi,
I am new to this list; I came here from the GitHub Spotlight page about
support and feedback. Questions related to mine have come up a couple of
times on this list, as far as I can see, but the answers do not provide
what I am looking for.
I am using the statistical back-end, and I am basically trying to
reconstruct the confidence value of the entities extracted.
I have extracted entities from tweets, and as a first experiment I did not
ask for any confidence threshold. Now I would like to calculate the
confidence of each result to see how filtering on it influences the
quality of another process I run on the entities.
I am now using the formula:
(1 - .5 * percentageofsecondrank) * similarityscore
This is based on the idea that confidence increases with the similarity
score but decreases if the second candidate is also similar.
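To make the formula above concrete, here is a minimal Python sketch. The
function names, the dictionary keys and the sample values are hypothetical
and chosen only for illustration; this is the heuristic described above,
not necessarily what Spotlight computes internally.

```python
def estimated_confidence(similarity_score, percentage_of_second_rank):
    """Heuristic: (1 - 0.5 * percentageofsecondrank) * similarityscore."""
    return (1.0 - 0.5 * percentage_of_second_rank) * similarity_score


def filter_by_confidence(entities, threshold):
    """Keep entities whose estimated confidence is at least `threshold`."""
    return [
        e for e in entities
        if estimated_confidence(
            e["similarity_score"], e["percentage_of_second_rank"]
        ) >= threshold
    ]


# Made-up values for illustration only, not real Spotlight output.
entities = [
    {"uri": "A", "similarity_score": 1.0, "percentage_of_second_rank": 0.0},
    {"uri": "B", "similarity_score": 0.5, "percentage_of_second_rank": 1.0},
]
kept = filter_by_confidence(entities, 0.5)
```

With these values the unambiguous candidate "A" scores 1.0 and is kept,
while "B", whose runner-up is equally similar, scores 0.25 and is dropped.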
Is this comparable to what Spotlight uses in
http://spotlight.dbpedia.org/rest/annotate? If not, what is the formula?
Does support play a role?
Thanks,
Stefano
------------------------------------------------------------------------------
HPCC Systems Open Source Big Data Platform from LexisNexis Risk Solutions
Find What Matters Most in Your Big Data with HPCC Systems
Open Source. Fast. Scalable. Simple. Ideal for Dirty Data.
Leverages Graph Analysis for Fast Processing & Easy Data Exploration
http://p.sf.net/sfu/hpccsystems
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users"