[Dbpedia-discussion] Writing parser rules for Generic Extractor

Michael Haas Fri, 12 Jun 2009 10:48:24 -0700

Hello,


I'd like to improve the parser for the company infoboxes. In particular, 
the extraction for dbpedia-owl:keyPerson on the BMW article is flakey - 
I'd like to change to parser to ignore anything between <small> tags 
which should improve accuracy.


If I interpret the .xls correctly, then key_people is derived from 
Organisation and the parsing rule for organisation is used.

I assume I have to change rules.xsl to specify a new class for keyPerson 
and then add code for my new rule somewhere in extractors/infobox/.

Can someone give me a an example or some documentation how I would go 
about this? Or have I missed anything in the existing documentation?


Regards,

Michael Haas


------------------------------------------------------------------------------
Crystal Reports - New Free Runtime and 30 Day Trial
Check out the new simplified licensing option that enables unlimited
royalty-free distribution of the report engine for externally facing 
server and web deployment.
http://p.sf.net/sfu/businessobjects
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

[Dbpedia-discussion] Writing parser rules for Generic Extractor

Reply via email to