Hello,

Bob J wrote:
> Hi all, quick questions for you.
> 
> 1) Is there a way of accessing all the links to a certain page across
> the rest of Wikipedia's pages?
>    e.g., For the following resource
> (http://dbpedia.org/page/Paris_Hilton), I'm trying to find the list of
> other resources that link to it
> 
> 2) As a follow-on to the previous question, is it possible to see the
> text that page used to link to it?
>    e.g., Perhaps the display text of the link was "Paris" or "P.H." or
> "Ms. Hilton"?
> 
> 3) Similar to question 1, but slightly different, is it possible to see
> the list of links a certain page has on it?
> 
> 4) I'm interested in the links to other entities from the fact-boxes and
> abstract? Is seems the dbpedia has this stripped out, any way to get it
> back?
> 
> Appreciate the help!

The pagelink dataset of DBpedia contains triples of the form:
$object1 <http://dbpedia.org/property/wikilink> $object2
Both $object1 and $object2 are DBpedia entities.

This can be used to answer questions 1 and 2 through SPARQL (by fixing
either $object1 or $object2). I believe, the data set is not in the
official SPARQL endpoint currently. If you want to use SPARQL to get the
result, you need to load it into a triple store.

In case you are not meaning "link", but rather Media disambiguation,
then you can use the property http://dbpedia.org/property/disambiguates.

Question 3: This is probably difficult, because there is no guarantee
that each link is extracted by DBpedia. Taking the distinct set of URIs
extracted by DBpedia could be an approximation.

Question 4: For the infoboxes: One way would be to use all properties
starting with http://dbpedia.org/property/ minus a few special ones
(like wikilink). Since all other properties apart from the infoboxes are
fixed, this should work. It is probably hard to query the links
occurring in an abstract. Of course, it made more sense for us to strip
this out, since the abstract should be a plain text description of an
entity.

I guess you need this for a statistical analysis of Wikipedia.(?)

Kind regards,

Jens

-- 
Dipl. Inf. Jens Lehmann
Department of Computer Science, University of Leipzig
Homepage: http://www.jens-lehmann.org
GPG Key: http://jens-lehmann.org/jens_lehmann.asc


------------------------------------------------------------------------------
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to