On 9/12/13 5:06 PM, Bernard Vatant wrote:
Hi Tom

Thanks for the pointer, which makes me wonder.
Since Geonames puts a lot of work into building quality links to Wikipedia, why doesn't DBpedia simply leverage this work and use those links? They are available in the RDF; see e.g. http://sws.geonames.org/207258/about.rdf, which contains two (redundant) links:

<gn:wikipediaArticle rdf:resource="http://en.wikipedia.org/wiki/Mount_Stanley"/>
<rdfs:seeAlso rdf:resource="http://dbpedia.org/resource/Mount_Stanley"/>
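
To illustrate how little work reading those two links takes, here is a minimal Python sketch using only the standard library. The SAMPLE document is a hand-written stand-in modelled on the two elements quoted above, not a copy of the real about.rdf; the gn:Feature wrapper, the namespace URIs and the function name extract_links are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Namespace URIs assumed for the rdf:, rdfs: and gn: prefixes.
NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "gn": "http://www.geonames.org/ontology#",
}
RDF_ABOUT = "{%s}about" % NS["rdf"]
RDF_RESOURCE = "{%s}resource" % NS["rdf"]

# Hand-written sample (assumption): only the two link elements come from the thread.
SAMPLE = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
    xmlns:gn="http://www.geonames.org/ontology#">
  <gn:Feature rdf:about="http://sws.geonames.org/207258/">
    <gn:wikipediaArticle rdf:resource="http://en.wikipedia.org/wiki/Mount_Stanley"/>
    <rdfs:seeAlso rdf:resource="http://dbpedia.org/resource/Mount_Stanley"/>
  </gn:Feature>
</rdf:RDF>"""


def extract_links(rdf_xml):
    """Return (feature URI, property, target URI) tuples for the two link properties."""
    root = ET.fromstring(rdf_xml)
    links = []
    for feature in root.findall("gn:Feature", NS):
        subject = feature.get(RDF_ABOUT)
        for prop in ("gn:wikipediaArticle", "rdfs:seeAlso"):
            for element in feature.findall(prop, NS):
                links.append((subject, prop, element.get(RDF_RESOURCE)))
    return links


if __name__ == "__main__":
    for link in extract_links(SAMPLE):
        print(link)
```

A real harvest over the full dump would want a streaming parser rather than loading each description as a string; this only shows where the links sit.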

The RDF dump of Geonames is not optimal, though: it is the concatenation of all individual RDF descriptions, hence heavy and verbose. It is simpler to parse the alternate names file http://download.geonames.org/export/dump/alternateNames.zip, which stores the Wikipedia links as "alternate names" whose language code is "link". Quite ad hoc, but efficient.

Either way, you don't need any fine-tuned matching algorithm; just harvest the 400,000 triples that are already there.
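
The alternate-names route could be sketched roughly as follows in Python. Several things here are assumptions, not taken from the thread: the tab-separated column order (alternateNameId, geonameid, language code, name, ...), the use of owl:sameAs as the linking predicate, the naive mapping from a Wikipedia URL to a DBpedia URI by swapping the prefix, and the function name links_to_ntriples. The sample rows are made up for illustration, apart from the Geonames id 207258 and the Mount Stanley URL.

```python
import csv

WIKI_PREFIX = "http://en.wikipedia.org/wiki/"
DBPEDIA_PREFIX = "http://dbpedia.org/resource/"
# Assumed predicate; the thread does not say which property to use.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
GEONAMES_URI = "http://sws.geonames.org/%s/"

# Made-up sample rows in the assumed column order:
# alternateNameId, geonameid, language code, alternate name, then flag columns.
SAMPLE_ROWS = [
    "1\t207258\tfr\tMont Stanley\t\t\t\t",
    "2\t207258\tlink\thttp://en.wikipedia.org/wiki/Mount_Stanley\t\t\t\t",
]


def links_to_ntriples(lines):
    """Yield one N-Triples line per English Wikipedia link found in the rows."""
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 4:
            continue  # skip malformed or empty lines
        geoname_id, language, value = row[1], row[2], row[3]
        if language != "link" or not value.startswith(WIKI_PREFIX):
            continue  # keep only rows flagged as links to English Wikipedia
        # Naive title mapping (assumption): real titles may need re-encoding.
        title = value[len(WIKI_PREFIX):]
        yield "<%s%s> <%s> <%s> ." % (
            DBPEDIA_PREFIX, title, OWL_SAME_AS, GEONAMES_URI % geoname_id)


if __name__ == "__main__":
    for triple in links_to_ntriples(SAMPLE_ROWS):
        print(triple)
```

For the real file, the same function can be fed an open text file handle instead of the sample list, since it only needs an iterable of lines.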

+1

We should factor this into the next DBpedia update. All we need to do is make a dataset as described above. The upside is worth it.

Kingsley

Best

Bernard



2013/9/12 Tom Morris <tfmor...@gmail.com>

    On Thu, Sep 12, 2013 at 1:47 PM, Bernard Vatant
    <bernard.vat...@mondeca.com> wrote:


        Whatever the method, features which have long been in
        Geonames are not linked from DBpedia, although Geonames has
        had a Wikipedia/DBpedia link for ages in its description.
        Example: http://dbpedia.org/resource/Mount_Stanley vs
        http://sws.geonames.org/207258/


    Those two things have different names, "Mount" vs "Mont" (does
    Geonames have multilingual support?), so it's probably doing
    simplistic name matching.

    The config file for mountains is here if you want to improve it:

    https://sourceforge.net/p/dbpedia/dbpedia/ci/default/tree/external_datasets/Geonames/dbpedia-geonames-mountain.xml

    Tom




--
*Bernard Vatant*
Vocabularies & Data Engineering
Tel : + 33 (0)9 71 48 84 59
Skype : bernard.vatant
Blog : the wheel and the hub <http://bvatant.blogspot.com>
Linked Open Vocabularies : lov.okfn.org <http://lov.okfn.org>
--------------------------------------------------------
*Mondeca*
3 cité Nollez 75018 Paris, France
www.mondeca.com <http://www.mondeca.com/>
Follow us on Twitter : @mondecanews <http://twitter.com/#%21/mondecanews>

----------------------------------------------------------
Get an overview of Mondeca's Smart Content Factory <http://www.mondeca.com/Products/Smart-Content-Factory> at SIIA London <http://www.siia.net/london/2013/schedule.asp>


------------------------------------------------------------------------------
How ServiceNow helps IT people transform IT departments:
1. Consolidate legacy IT systems to a single system of record for IT
2. Standardize and globalize service processes across IT
3. Implement zero-touch automation to replace manual, redundant tasks
http://pubads.g.doubleclick.net/gampad/clk?id=51271111&iu=/4140/ostg.clktrk


_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion


--

Regards,

Kingsley Idehen 
Founder & CEO
OpenLink Software
Company Web: http://www.openlinksw.com
Personal Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca handle: @kidehen
Google+ Profile: https://plus.google.com/112399767740508618350/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen





