Because of result count restrictions, these queries don't return all ISO language codes extracted by DBpedia, but I think they give a good impression of the data quality and coverage (or sometimes lack thereof):
http://dbpedia.org/sparql?query=select+distinct+%3Fs%2C+%3Fo+where{%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+.%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.} http://dbpedia.org/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&query=select+distinct+%3Fs%2C+%3Fo+where{%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+.%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.} On Fri, Oct 23, 2009 at 18:20, Andrew Dunbar <[email protected]> wrote: > 2009/10/23 Aryeh Gregor <[email protected]>: >> On Fri, Oct 23, 2009 at 8:27 AM, Andrew Dunbar <[email protected]> wrote: >>> Yes I found how to get it through the API now. It was actually just >>> the Toolserver database that was intractably slow. >> >> There's nothing slow about the TS database here: >> >> mysql> pager true >> PAGER set to 'true' >> mysql> SELECT tl_from FROM templatelinks WHERE tl_namespace=10 AND >> tl_title IN ('Infobox_Language', 'Infobox_language'); >> 3144 rows in set (0.12 sec) >> >> Your query might have been what was slow. > > Yes I didn't specify tl_namespace and when I check for which columns > have keys I could see none: > mysql> describe templatelinks; > +--------------+-----------------+------+-----+---------+-------+ > | Field | Type | Null | Key | Default | Extra | > +--------------+-----------------+------+-----+---------+-------+ > | tl_from | int(8) unsigned | NO | | 0 | | > | tl_namespace | int(11) | NO | | 0 | | > | tl_title | varchar(255) | NO | | | | > +--------------+-----------------+------+-----+---------+-------+ > 3 rows in set (0.01 sec) > > But I don't know much about databases and SQL... > > I have reached an important milestone of extracting all the name value > pairs for language infobox ISO 639 language codes and language family > string by the way. > > But the values still need some work before I can try to match them > against ISO 639-5 language family codes which is my ultimate goal. > > Thanks for all the tips. > > Andrew Dunbar (hippietrail) > >> _______________________________________________ >> Wikitech-l mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wikitech-l >> > > > > -- > http://wiktionarydev.leuksman.com http://linguaphile.sf.net > > _______________________________________________ > Wikitech-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wikitech-l > _______________________________________________ Wikitech-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikitech-l
