Hello David,

David Lee schrieb:
> Hi,
> 
> I'm interested in querying data against the "Notes and References" (or
> sometimes just "References") section. As far as I can tell, DBpedia
> doesn't extract this, is that correct?

That's correct imho. There is a reference property in DBpedia already,
but I believe it extracts from "External Links" sections rather than the
"Notes and References". You can view extracted information at
http://dbpedia.org/resource/$wikipedia_article_name, so you can check
whether the information you are looking for has been extracted.

This is how you can extend DBpedia to extract notes and references:

- checkout DBpedia Subversion
- create a file "ReferenceExtractor" in extraction/extractors
- create class ReferenceExtractor implements Extractor in this file and
  implement the extractPage function (watch out for the other
  extractors to see how it works)

For testing your code, a start.php file is included, where you can use
LiveWikipedia as data source. You can specify a small set of Wikipedia
articles to test and develop with, which will be downloaded using a
Wikipedia webservice at runtime. You can do one request per second.

Technically, you should use pattern matching functions on the Wiki
markup. Also try to use a common vocabulary for references.

If you are very fast to implement this, it may go into the next DBpedia
release, which is currently in preparation.

Kind regards,

Jens


-- 
Dipl. Inf. Jens Lehmann
Department of Computer Science, University of Leipzig
Homepage: http://www.jens-lehmann.org
GPG Key: http://jens-lehmann.org/jens_lehmann.asc

-------------------------------------------------------------------------
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to