Hi, good to see people are using the WorldFact dataset.

I also thought about adding the ISO3166-2 Subdivision codes, but ran out of
time for the last release.

> Should we just hack some scripts to parse those Wikipedia pages?

I dont think it's worth the effort to create some sophisticated extractor,
so yes.

> We'll do it for at least for 49 countries (but if possible, for all)

Please do it for all (I can help there as well), since my initial impulse
for this dataset was to provide a complete collection of facts about
countries and not to be content with gaps in the records as you will find
plenty.

Of help might be this dataset as well:
http://data.okfn.org/data/core/language-codes

If anyone has additional ideas what to put in this dataset, please let us
know.

Cheers,

Markus Freudenberg

Release Manager, DBpedia <http://wiki.dbpedia.org>

On Tue, Nov 22, 2016 at 6:17 PM, Vladimir Alexiev <
vladimir.alex...@ontotext.com> wrote:

> Many DBpedia countries don't have basic stuff like country codes.
> DBpediaWorldFacts adds language and country codes:
> https://raw.githubusercontent.com/dbpedia/WorldFacts/master/
> DBpediaWorldFactsOntology.png
>
> We are planning to add ISO3166-2 Subdivision codes. These are top-level
> country regions, e.g. Spain has 67.
> - OMG provides these for US, CA, MX in a rather roundabout ontology (see
> at bottom):
>    http://www.omg.org/spec/LCC/Countries/ISO3166-2-SubdivisionCodes.rdf
> - UN/LOCODE provides tables for all countries:
> http://www.unece.org/cefact/locode/subdivisions.html
> - Wikipedia also provides tables, e.g. https://en.wikipedia.org/wiki/
> ISO_3166-2:ES
>   This may be best since it includes the subdivision URL. We'll do it for
> at least for 49 countries (but if possible, for all)
>
> Should we just hack some scripts to parse those Wikipedia pages?
> Or does someone have better suggestions, e.g. by using the Extraction
> Framework and writing a specific table parser?
>
> --
>
> OMG LCC data:
>
> lcc-3166-2:US-AK a lcc-cr:SubdivisionCode;
>    lcc-cr:hasSubdivisionTag "US-AK";
>    lcc-lr:identifies lcc-3166-2:Alaska;
>    lcc-lr:isMemberOf lcc-3166-2:ISO3166-2-CodeSet.
> lcc-3166-2:Alaska a lcc-cr:CountrySubdivision;
>   rdfs:label                  "Alaska" ;
>   lcc-cr:hasEnglishShortName  "Alaska" ;
>   lcc-cr:hasLocalShortName    "Alaska" ;
>   lcc-cr:isClassifiedBy       lcc-3166-2:State ;
>   lcc-cr:isSubdivisionOf      lcc-3166-1:UnitedStates;
>
>
> ------------------------------------------------------------
> ------------------
> _______________________________________________
> DBpedia-developers mailing list
> DBpedia-developers@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-developers
>
------------------------------------------------------------------------------
_______________________________________________
DBpedia-developers mailing list
DBpedia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-developers

Reply via email to