Thanks Pablo. The first thing to do would be fixing the parsing error. There is not much point in adding a property that will contain mostly bad data.
It looks like the infobox property contains a link to a file description page which itself points to some vector or bitmap files. I understand the parser is written in Scala, which I do not know. I can research all this a little more and report back if I find anything doable. But before I do that, what would be the position (interest) of the DBpedia project regarding binary image data? In the thread you provide below, Christopher Sahnwaldt mentions that licenses of image files might be an issue. That's surprising to me. I imagined that dbpedia would license data under the same license as Wikipedia itself. Is that too naive? Thanks, Nic On Tue, Sep 11, 2012 at 10:37 AM, Pablo N. Mendes <[email protected]> wrote: > Hi Nicolas, > You can add a property for logos in http://mappings.dbpedia.org as the > ontology is community-edited. > > As for parsing errors, this thread might be helpful: > http://www.mail-archive.com/[email protected]/msg03381.html > > We'd love to receive patches and unit tests. > > Cheers, > Pablo > > On Tue, Sep 11, 2012 at 10:29 AM, Nicolas Mabon <[email protected]> wrote: >> >> Hi Again, >> >> There seems to be a problem with logos. For example the University and >> Company below have valid logos on Wikipedia but the infobox was >> incorrectly parsed in the first case and the field seems to have been >> ignored in the second case. See below. >> >> Would it be possible to: >> 1. fix the parsing issue >> 2. augment company ontology with a 'logo' field >> >> I am happy to help with this (I would need access/pointers). >> >> Many thanks for running such a great project! >> >> Nic >> >> ----- >> >> WIKIPEDIA: >> >> http://en.wikipedia.org/w/index.php?title=Georgia_Institute_of_Technology&action=edit >> {{Infobox university >> ... >> |logo = [[File:GeorgiaTech logo.svg|200px]] >> ... >> }} >> >> DBPEDIA 'ontology infobox properties': >> <http://dbpedia.org/resource/Georgia_Institute_of_Technology> >> <http://dbpedia.org/property/logo> >> "200"^^<http://www.w3.org/2001/XMLSchema#int> . >> >> >> WIKIPEDIA: >> http://en.wikipedia.org/w/index.php?title=PerkinElmer&action=edit >> {{Infobox company| >> ... >> company_logo = [[Image:PerkinElmer.svg|175px|PerkinElmer logo]] | >> }} >> >> DBPEDIA: >> >> ill-parsed data in the 'raw infobox properties' (same problem as before) >> >> <http://dbpedia.org/resource/PerkinElmer> >> <http://dbpedia.org/property/companyLogo> >> "175"^^<http://www.w3.org/2001/XMLSchema#int> . >> >> no logo field in the 'ontology infobox properties' >> >> >> ------------------------------------------------------------------------------ >> Live Security Virtual Conference >> Exclusive live event will cover all the ways today's security and >> threat landscape has changed and how IT managers can respond. Discussions >> will include endpoint security, mobile security and the latest in malware >> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ >> _______________________________________________ >> Dbpedia-discussion mailing list >> [email protected] >> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion > > > > > -- > --- > Pablo N. Mendes > http://pablomendes.com > Events: http://wole2012.eurecom.fr > ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
