Re: Dataset for all ISO639 code sorted by country/territory?

2016-11-10 Thread Andrew West
On 10 November 2016 at 17:56, Doug Ewell wrote: > > Keep in mind that the CLDR table documents 675 of the world's best-known > languages, counting variants such as three different orthographies of > Uzbek. Oddly, it seems that there are over 1.2 billion speakers of Cantonese in

RE: Dataset for all ISO639 code sorted by country/territory?

2016-11-10 Thread Doug Ewell
Mats Blakstad wrote: > For myself I was not actually considering the amount of speakers in > each country, but to map languages with countries/territories where > the language originated or have been spoken traditionally. And that is where I think you'll have disagreement on the details. > So I

Re: Dataset for all ISO639 code sorted by country/territory?

2016-11-10 Thread Mats Blakstad
On 20 September 2016 at 18:34, Doug Ewell wrote: > > Is there any dataset that contains all languages in the world sorted > > by country/territory? > > As others have pointed out, be careful about how slippery this slope can > get. Everyone has his or her own opinion about how

Re: Dataset for all ISO639 code sorted by country/territory?

2016-09-20 Thread Doug Ewell
Mats Blakstad wrote: > Is there any dataset that contains all languages in the world sorted > by country/territory? As others have pointed out, be careful about how slippery this slope can get. Everyone has his or her own opinion about how many speakers of Language X in country Y need to be

Re: Dataset for all ISO639 code sorted by country/territory?

2016-09-17 Thread Mats Blakstad
I manage to find a dataset on the website of Ethnologue, though it doesn't look like open source, need to check with them exactly how I'm allowed to use it: http://www.ethnologue.com/codes/download-code-tables Thanks for the explanation Phillippe. I know it is not an easy issue. Look for

Re: Dataset for all ISO639 code sorted by country/territory?

2016-09-17 Thread Philippe Verdy
Not all languages are sorted, only those for which there are released data in CLDR. And languages frequently belong to several countries/territories at the same time, with different official or recognized status (itself independant of the number of actual speakers, which is very frequently roughly

Re: Dataset for all ISO639 code sorted by country/territory?

2016-09-17 Thread Otto Stolz
Hello, am 2016-09-17 um 11:19 Uhr hat Mats Blakstad geschrieben: Is there any dataset that contains all languages in the world sorted by country/territory? Have you tried , already? Also, and may