hi guys,

Recently I have been trying to do some data mining on wikipedia. I wish to
parse infobox properties in <prop, string> pair. However, I meet a problem
that when I want to write a homepage parser, I find there are labels in
different infobox mean the same concept, homepage. I got confused and
reference your project. DBpedia has a very good performance on this. It's
impressive.

So I wish to know how can you *get infobox templates from wiki*? I mean the
dumps of all. I searched wiki and didn't get satisfying answer. In
addition, a small question that how can you map different homepage labels
to one dbpedia ontology label? I checked "Mapping_en.xml" file in your
project. Is the mapping process done manually, or automatically?

Thank you very much in advance!
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to