Hi Maatari, On Mon, Sep 22, 2014 at 8:22 AM, Maatari Daniel Okouya <okouy...@yahoo.fr> wrote: > I’m a bit confused about few concept. Could someone clarify them a bit. > > > When it comes to assigning some topics to a content resource, what would be > the difference between entity linking and categorization ? >
First lets explain the terminology as used by Stanbol. For that I will use a todays headline: "Lewis Hamilton not thinking about title after winning Singapore GP" Named Entity Recognition: Detects mentions of Entity types within the text. Typically Persons, Organizations and Locations * Lewis Hamilton -> person * Singapore -> location Entity Linking: Detects mentions of known Entities within the processed Text * Lewis Hamilton -> http://en.wikipedia.org/wiki/Lewis_Hamilton * Singapore Grand Prix -> http://en.wikipedia.org/wiki/Singapore_Grand_Prix Categorization: Assigns the content to a fixed set of categories. Categories might be hierarchical. A typical example are the IPTC Media Topics [1] which I will use for this example. * sport -> http://cv.iptc.org/newscodes/mediatopic/15000000 * Formula One -> http://cv.iptc.org/newscodes/mediatopic/20000994 Important is that Entity Linking requires an actual mention of the Entity in the text while categories do not depend on such mentions. > What I see as of now, within some tools well established is the > classification part. Usually it makes use of a control vocabulary to classify > the content. Output = resource dc:Subject controledVocabularyTerm > > However, what i also see in the description of content resource online within > some authority website is to link the document to external non skos resource > via for instance the Foaf:Topic. > > In that second case, do we have both an entity linking and a classification ? > or is it that both are the same, it is just that the knowledge base change, > from external source to controlled vocabulary. Which would mean that in the > world of linked data, content classification / categorization include entity > linking? In that case i would say that, the same was happening when linking > to a controlled vocabulary term. > IMO the properties used to represent analysis results do not necessarily indicate if the results express linked entities or categorizations. Based on the definition both dc:subject and foaf:topic they should be both used for categories. > > I'm little confused here. If someone, could clarify these notion i would > appreciate. hope this helps best Rupert [1] http://cv.iptc.org/newscodes/mediatopic -- | Rupert Westenthaler rupert.westentha...@gmail.com | Bodenlehenstraße 11 ++43-699-11108907 | A-5500 Bischofshofen | REDLINK.CO .......................................................................... | http://redlink.co/