Hello,

Bob J wrote:
> Hi all, quick questions for you.
>
> 1) Is there a way of accessing all the links to a certain page across
> the rest of Wikipedia's pages?
> e.g., For the following resource
> (http://dbpedia.org/page/Paris_Hilton), I'm trying to find the list of
> other resources that link to it
>
> 2) As a follow-on to the previous question, is it possible to see the
> text that page used to link to it?
> e.g., Perhaps the display text of the link was "Paris" or "P.H." or
> "Ms. Hilton"?
>
> 3) Similar to question 1, but slightly different, is it possible to see
> the list of links a certain page has on it?
>
> 4) I'm interested in the links to other entities from the fact-boxes and
> abstract? Is seems the dbpedia has this stripped out, any way to get it
> back?
>
> Appreciate the help!

The pagelink dataset of DBpedia contains triples of the form:

  $object1 <http://dbpedia.org/property/wikilink> $object2

Both $object1 and $object2 are DBpedia entities. This can be used to
answer questions 1 and 2 through SPARQL (by fixing either $object1 or
$object2). I believe the dataset is currently not loaded into the
official SPARQL endpoint, so if you want to use SPARQL to get the
result, you need to load it into a triple store yourself.

In case you do not mean "link" but rather Media disambiguation, you
can use the property http://dbpedia.org/property/disambiguates.

Question 3: This is probably difficult, because there is no guarantee
that every link on a page is extracted by DBpedia. Taking the distinct
set of URIs extracted by DBpedia for that page could serve as an
approximation.

Question 4: For the infoboxes, one way would be to use all properties
starting with http://dbpedia.org/property/ minus a few special ones
(like wikilink). Since all properties that do not come from the
infoboxes are fixed, this should work. It is probably hard to query the
links occurring in an abstract. Of course, it made more sense for us to
strip them out, since the abstract should be a plain-text description
of an entity. I guess you need this for a statistical analysis of
Wikipedia(?)

Kind regards,

Jens

-- 
Dipl. Inf. Jens Lehmann
Department of Computer Science, University of Leipzig
Homepage: http://www.jens-lehmann.org
GPG Key: http://jens-lehmann.org/jens_lehmann.asc
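
As a rough illustration of the pagelink idea above, here is a minimal Python sketch. It is not part of DBpedia; the function names are made up for this example. It assumes the triple form described above with the wikilink property, and it assumes the entity URI is http://dbpedia.org/resource/Paris_Hilton (the /page/ URL is the HTML view of that resource). The first two functions only build SPARQL query strings for a store into which the pagelink dataset has been loaded; the third scans N-Triples lines directly, which may be enough if you do not want to set up a triple store.

```python
import re

# Property used in the pagelink triples, as described above.
WIKILINK = "http://dbpedia.org/property/wikilink"

# Matches a simple N-Triples line whose subject, predicate and object are all URIs.
TRIPLE_RE = re.compile(r"^<([^>]+)>\s+<([^>]+)>\s+<([^>]+)>\s*\.\s*$")


def incoming_links_query(resource_uri, limit=100):
    """SPARQL for entities that link TO resource_uri (fixes $object2)."""
    return (
        "SELECT DISTINCT ?source WHERE { "
        f"?source <{WIKILINK}> <{resource_uri}> . "
        f"}} LIMIT {int(limit)}"
    )


def outgoing_links_query(resource_uri, limit=100):
    """SPARQL for entities that resource_uri links to (fixes $object1)."""
    return (
        "SELECT DISTINCT ?target WHERE { "
        f"<{resource_uri}> <{WIKILINK}> ?target . "
        f"}} LIMIT {int(limit)}"
    )


def scan_pagelinks(lines, resource_uri):
    """Scan N-Triples lines and return (incoming, outgoing) wikilinks.

    Only triples using the wikilink property are considered; lines that
    do not match the simple URI-only pattern are skipped.
    """
    incoming, outgoing = [], []
    for line in lines:
        match = TRIPLE_RE.match(line.strip())
        if not match:
            continue
        subject, predicate, obj = match.groups()
        if predicate != WIKILINK:
            continue
        if obj == resource_uri:
            incoming.append(subject)
        if subject == resource_uri:
            outgoing.append(obj)
    return incoming, outgoing
```

To use it on a downloaded dump, one could pass an open file object as `lines`, for example `scan_pagelinks(open(path, encoding="utf-8"), uri)`, where `path` points at wherever the pagelink file was saved. Note that this only sees links DBpedia actually extracted, so the outgoing list is an approximation of the links on the page.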
