Hi Michael,
the mapping.xls is for mapping infoboxes to ontology classes and infobox
properties to ontology properties. The rules.xls keeps track of
exceptions to the definitions in the mapping.xls. Both files are
documented in the SVN ontology/docs files. If you make any changes to
the xls files you need to rerun the database import scripts in the SVN
ontology directory.
As defined in the mapping file (.xls) keyPerson on Organisation
has the range Person.
If you want to change the MappingBasedExtractor (former Generic
Extractor) to ignore tags, you might want to use the Util class and its
functions. For the keyPerson case find the corresponding code for object
properties in the MappingBasedExtractor ( "switch($property_type) {" )
and define own replacements. The Util class offers a function called
removeHtmlTags which might be useful.
Hope this helps,
Anja
Michael Haas schrieb:
> Hello,
>
>
> I'd like to improve the parser for the company infoboxes. In particular,
> the extraction for dbpedia-owl:keyPerson on the BMW article is flakey -
> I'd like to change to parser to ignore anything between <small> tags
> which should improve accuracy.
>
>
> If I interpret the .xls correctly, then key_people is derived from
> Organisation and the parsing rule for organisation is used.
>
> I assume I have to change rules.xsl to specify a new class for keyPerson
> and then add code for my new rule somewhere in extractors/infobox/.
>
> Can someone give me a an example or some documentation how I would go
> about this? Or have I missed anything in the existing documentation?
>
>
> Regards,
>
> Michael Haas
>
>
> ------------------------------------------------------------------------------
> Crystal Reports - New Free Runtime and 30 Day Trial
> Check out the new simplified licensing option that enables unlimited
> royalty-free distribution of the report engine for externally facing
> server and web deployment.
> http://p.sf.net/sfu/businessobjects
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
------------------------------------------------------------------------------
Crystal Reports - New Free Runtime and 30 Day Trial
Check out the new simplified licensing option that enables unlimited
royalty-free distribution of the report engine for externally facing
server and web deployment.
http://p.sf.net/sfu/businessobjects
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion