Hi Tom

Thanks for the pointer, which makes me wonder.
Since Geonames puts a lot of work at building quality links to Wikipedia,
why does not DBpedia simply leverage this work and use those links?
They are available in the RDF, see e.g.,
http://sws.geonames.org/207258/about.rdf it contains two (redundant) links.
<gn:wikipediaArticle rdf:resource="
http://en.wikipedia.org/wiki/Mount_Stanley"/>
<rdfs:seeAlso rdf:resource="http://dbpedia.org/resource/Mount_Stanley"/>

The RDF dump of Geonames is not optimal, though (it's the concatenation of
all individual RDF descriptions, hence heavy and verbose).
It is simpler to parse the alternate names file
http://download.geonames.org/export/dump/alternateNames.zip
which contains the wikipedia links as "alternate names" with the "language"
link. Quite ad hoc, but efficient.

Whichever way, you don't need any fine-tuned matching algorithm, just
harvest 400,000 triples that are already there.

Best

Bernard



2013/9/12 Tom Morris <tfmor...@gmail.com>

> On Thu, Sep 12, 2013 at 1:47 PM, Bernard Vatant <
> bernard.vat...@mondeca.com> wrote:
>
>>
>> Whatever the method, features which have been for long ago in Geonames
>> are not linked from DBpedia, although Geonames has had a Wikipedia/DBpedia
>> link for ages in its description.
>> Example : http://dbpedia.org/resource/Mount_Stanley vs
>> http://sws.geonames.org/207258/
>>
>
> Those two things have different names "Mount" vs "Mont" (does geonames
> nave multilingual support?) so it's probably doing simplistic name matching.
>
> The config files for mountains is here if you want to improve it:
>
>
> https://sourceforge.net/p/dbpedia/dbpedia/ci/default/tree/external_datasets/Geonames/dbpedia-geonames-mountain.xml
>
> Tom
>
>


-- 
*Bernard Vatant
*
Vocabularies & Data Engineering
Tel :  + 33 (0)9 71 48 84 59
Skype : bernard.vatant
Blog : the wheel and the hub <http://bvatant.blogspot.com>
Linked Open Vocabularies : lov.okfn.org
--------------------------------------------------------
*Mondeca**          **                   *
3 cité Nollez 75018 Paris, France
www.mondeca.com
Follow us on Twitter : @mondecanews <http://twitter.com/#%21/mondecanews>

----------------------------------------------------------
Get an overview of Mondeca's Smart Content
Factory<http://www.mondeca.com/Products/Smart-Content-Factory>
 at SIIA London <http://www.siia.net/london/2013/schedule.asp>
------------------------------------------------------------------------------
How ServiceNow helps IT people transform IT departments:
1. Consolidate legacy IT systems to a single system of record for IT
2. Standardize and globalize service processes across IT
3. Implement zero-touch automation to replace manual, redundant tasks
http://pubads.g.doubleclick.net/gampad/clk?id=51271111&iu=/4140/ostg.clktrk
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to