Re: [datameet] Re: Shapefiles for complete India

2014-08-07 Thread Sharad Lele [शरच्चंद्र लेले]
I think I have the explanation for why I am seeing a good match and you 
are not:


The problem lies in defining what is the 'sub-district' unit (in 
IND_adm3). Administratively speaking, it is tehsil, below which lies CD 
block. Unfortunately, census gives information by CD block. So there are 
more 'sub-district' units in Census than tehsils in the country. GDAM 
seems to have followed the tehsil concept.


To check: Karnataka is one state in which tehsil and CD block are one 
and the same. That is why the sub-district layer IND_adm3 matches 
perfectly for Karnataka, but not for other states. There might be some 
other states where this holds good, I don't know.


Anyway, so if one really wants CD block level boundaries, we have to 
look at Justin, I guess.


But the GDAM boundaries are not 'wrong'.

Sharad

On 07-Aug-14 9:48 AM, Devdatta Tengshe wrote:
In Continuation of my previous email, here is a CSV file which shows 
just how bad the GDAM dataset is.


Regards,
Devdatta


On Thu, Aug 7, 2014 at 9:06 AM, Devdatta Tengshe devda...@tengshe.in 
mailto:devda...@tengshe.in wrote:


Hi Sharad,

I just download the GDAM data again, to confirm what you have said.

I'm going to have to disagree with you about the quality of the
IND_adm3 data.


Acoording to the 2001 Census, there are 5454 Sub Districts in
India http://www.socialjustice.nic.in/pdf/tab11.pdf. The GDAM
dataset has just 2299 features.

So clearly these taluk features do not correspond to the 2001
Census. I cross checked for some areas I have ground knowledge of,
and I can say that this dataset is not from any specific era. Some
tehsils in the file were created post 2001, while others created
in the 90's were not present.

In my opinion the GDAM data is pretty much unusable.


Regards,
Devdatta


On Wed, Aug 6, 2014 at 11:36 PM, Sharad Lele
sharad.l...@gmail.com mailto:sharad.l...@gmail.com wrote:

I have downloaded and checked the GADM boundaries (my version
is 2011). The taluka boundary layer probably holds good today,
becuase few talukas get split. Districts get split regularly
(every so many years) so the district boundary layer in this
GADM set is quite of date (may apply to 2001 or so). The
spatial registration (positional accuracy is ~1km, and the
spatial detail is of course not as good as the boundaries
given in a Survey of India 50k topo, but then that is an
unfair standard, so by a more generalized standard, the
quality is okay.

Sharad


On Monday, August 4, 2014 7:20:38 PM UTC+5:30, Dilip Damle wrote:

Mr Thakkar,

Please also look at another post (more than one) on this
group  about Taluk Shapefiles by Justin Meyers

So far as I know GADM is the source that has Taluk files.
I am not sure about its completeness and accuracy as on today

http://www.gadm.org/

On Monday, August 4, 2014 6:23:07 PM UTC+5:30, D Thakker
wrote:

thanks Dilip for your hardwork.
I have been on a lookout for all taluka / tehsil shape
file, so how do I be in a loop as I am very keen to
see the repository mail / list.


On Monday, August 4, 2014 9:50:26 AM UTC+5:30, Dilip
Damle wrote:

Sharad,

I am working on some things will revert in about a
week or may be more.

Thejesh,

Go ahead,

Actually there was one more source a Low
Resolution (vertices) District map by VDS
technologies.
I have it as Polylines in Autocad. I seem to have
lost the original file.
If anyone has then please share it. (it does not
seem to be on their site now)

On Sunday, August 3, 2014 11:32:43 PM UTC+5:30,
Thejesh GN wrote:

Actually its not a bad idea to list it on the
wiki. Let me know i will create an account.

--
Thejesh GN ⏚ ತೇಜೇಶ್ ಜಿ.ಎನ್
http://thejeshgn.com
GPG ID :  0xBFFC8DD3C06DD6B0

On Aug 3, 2014 10:15 PM, Sharad Lele
shara...@gmail.com wrote:

Dear Dilip and others:

I have been following this thread with
interest, but to be honest am a bit lost
now. Can someone post a summary of which
maps mentioned so far have what features
(which coverage, pertaining to which year,
what 

Re: [datameet] IT Act in Karnataka

2014-08-07 Thread Nisha Thompson
does anyone have the full text of the law?


On Wed, Aug 6, 2014 at 11:03 PM, srinivas kodali iota.kod...@gmail.com
wrote:

 It seems the new IT Act in Karnataka gives the government enough power to
 arrest you before you even commit an actual crime.
 Is this really true? this would be a huge set back for any open data
 activist. You could be jailed for simple scraping of public website.


 http://www.bangaloremirror.com/Bangalore/Cover-story/We-the-goondas/articleshow/39564603.cms

 Regards,
 Srinivas

 --
 Datameet is a community of Data Science enthusiasts in India. Know more
 about us by visiting http://datameet.org
 ---
 You received this message because you are subscribed to the Google Groups
 datameet group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to datameet+unsubscr...@googlegroups.com.
 For more options, visit https://groups.google.com/d/optout.




-- 
Nisha Thompson
DataMeet.org
ni...@datameet.org
skype: nishaqt
mobile: 962-061-2245

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
datameet group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [datameet] Re: Shapefiles for complete India

2014-08-07 Thread Sharad Lele
If I am right, then Justin may want to rename his layer as CDBlocks_2001...

Sharad

On Thursday, August 7, 2014 4:28:17 PM UTC+5:30, Sharad Lele wrote:

  I think I have the explanation for why I am seeing a good match and you 
 are not:

 The problem lies in defining what is the 'sub-district' unit (in 
 IND_adm3). Administratively speaking, it is tehsil, below which lies CD 
 block. Unfortunately, census gives information by CD block. So there are 
 more 'sub-district' units in Census than tehsils in the country. GDAM seems 
 to have followed the tehsil concept.

 To check: Karnataka is one state in which tehsil and CD block are one and 
 the same. That is why the sub-district layer IND_adm3 matches perfectly for 
 Karnataka, but not for other states. There might be some other states where 
 this holds good, I don't know.

 Anyway, so if one really wants CD block level boundaries, we have to look 
 at Justin, I guess.

 But the GDAM boundaries are not 'wrong'.

 Sharad

 On 07-Aug-14 9:48 AM, Devdatta Tengshe wrote:
  
  In Continuation of my previous email, here is a CSV file which shows 
 just how bad the GDAM dataset is.

  Regards,
  Devdatta
  

  On Thu, Aug 7, 2014 at 9:06 AM, Devdatta Tengshe devda...@tengshe.in 
 wrote:

 Hi Sharad,

 I just download the GDAM data again, to confirm what you have said.

 I'm going to have to disagree with you about the quality of the IND_adm3 
 data.


 Acoording to the 2001 Census, there are 5454 Sub Districts in India 
 http://www.socialjustice.nic.in/pdf/tab11.pdf. The GDAM dataset has 
 just 2299 features.

 So clearly these taluk features do not correspond to the 2001 Census. I 
 cross checked for some areas I have ground knowledge of, and I can say that 
 this dataset is not from any specific era. Some tehsils in the file were 
 created post 2001, while others created in the 90's were not present.

 In my opinion the GDAM data is pretty much unusable.


 Regards,
 Devdatta
   

 On Wed, Aug 6, 2014 at 11:36 PM, Sharad Lele sharad.l...@gmail.com 
 wrote:

 I have downloaded and checked the GADM boundaries (my version is 2011). 
 The taluka boundary layer probably holds good today, becuase few talukas 
 get split. Districts get split regularly (every so many years) so the 
 district boundary layer in this GADM set is quite of date (may apply to 
 2001 or so). The spatial registration (positional accuracy is ~1km, and the 
 spatial detail is of course not as good as the boundaries given in a Survey 
 of India 50k topo, but then that is an unfair standard, so by a more 
 generalized standard, the quality is okay.

 Sharad 


 On Monday, August 4, 2014 7:20:38 PM UTC+5:30, Dilip Damle wrote: 

 Mr Thakkar, 

 Please also look at another post (more than one) on this group  about 
 Taluk Shapefiles by Justin Meyers

 So far as I know GADM is the source that has Taluk files.
 I am not sure about its completeness and accuracy as on today

 http://www.gadm.org/

 On Monday, August 4, 2014 6:23:07 PM UTC+5:30, D Thakker wrote: 

  thanks Dilip for your hardwork.
 I have been on a lookout for all taluka / tehsil shape file, so how do 
 I be in a loop as I am very keen to see the repository mail / list.


 On Monday, August 4, 2014 9:50:26 AM UTC+5:30, Dilip Damle wrote:

 Sharad, 

 I am working on some things will revert in about a week or may be 
 more.

 Thejesh,

  Go ahead, 

 Actually there was one more source a Low Resolution (vertices) 
 District map by VDS technologies. 
 I have it as Polylines in Autocad. I seem to have lost the original 
 file. 
 If anyone has then please share it. (it does not seem to be on their 
 site now)

 On Sunday, August 3, 2014 11:32:43 PM UTC+5:30, Thejesh GN wrote: 

 Actually its not a bad idea to list it on the wiki. Let me know i 
 will create an account. 

 --
 Thejesh GN ⏚ ತೇಜೇಶ್ ಜಿ.ಎನ್
 http://thejeshgn.com
 GPG ID :  0xBFFC8DD3C06DD6B0
 On Aug 3, 2014 10:15 PM, Sharad Lele shara...@gmail.com wrote:

 Dear Dilip and others:

 I have been following this thread with interest, but to be honest 
 am a bit lost now. Can someone post a summary of which maps mentioned 
 so 
 far have what features (which coverage, pertaining to which year, what 
 attributes (such as census codes), etc.)? Would be most helpful.

 Sharad


 On Friday, August 1, 2014 9:03:58 PM UTC+5:30, Dilip Damle wrote: 

 Hello, 

 This is an old post. 
 However this is the appropriate place to add an additional source. 

 I had downloaded the set from Grid Geneva many years ago.
 The original complete source was named as GNV197 which is 24 MB
 Titled as HUMAN POPULATION AND ADMINISTRATIVE BOUNDARIES DATABASE 
 FOR ASIA
 I am attaching the South Central Asia E00 file.

 That set contains The disputed areas under the country name IN1 
 and IN2

 This dataset can not be easily found at present on the GRID Geneva 
 site http://www.grid.unep.ch/index.php?lang=en in the same name.
 may be it is still there somewhere with some other name.

 For copyright check the 

Re: [datameet] Re: Shapefiles for complete India

2014-08-07 Thread Eric Dodge
This is very interesting Sharad.

I've been looking for maps of what I've been calling administrative blocks,
that is, the units overseen by block development officers. MGNREGA data is
aggregated at this level and I've been hoping to use the data to do some
mapping exercises.

The census sub-districts are called differently across states (tahsil,
taluk, mandal, etc). You can see the list here:

http://censusindia.gov.in/Tables_Published/Admin_Units/Admin_links/subdistrict_nomeclature.html

I know that in all the states where census sub-districts are called taluk,
mandal, or CD block (with the exception of TN), the census sub-district is
identical to the administrative block.

I've already completed a mapping exercise for Bihar using the census
sub-district map and the data matched up pretty well. If the IND_adm3 data
is indeed the administrative blocks then I could do a similar exercise with
Madhya Pradesh. I'll take a look to see if the data lines up correctly.

Has anybody dug into this issue any deeper? I've heard that tehsil comes
from the revenue side whereas taluk, mandal, etc comes from the
administrative side but that doesn't explain why the census uses different
sub-district units across states.

Best,
Eric


On Thu, Aug 7, 2014 at 7:39 AM, Sharad Lele sharad.l...@gmail.com wrote:

 If I am right, then Justin may want to rename his layer as CDBlocks_2001...

 Sharad


 On Thursday, August 7, 2014 4:28:17 PM UTC+5:30, Sharad Lele wrote:

  I think I have the explanation for why I am seeing a good match and you
 are not:

 The problem lies in defining what is the 'sub-district' unit (in
 IND_adm3). Administratively speaking, it is tehsil, below which lies CD
 block. Unfortunately, census gives information by CD block. So there are
 more 'sub-district' units in Census than tehsils in the country. GDAM seems
 to have followed the tehsil concept.

 To check: Karnataka is one state in which tehsil and CD block are one and
 the same. That is why the sub-district layer IND_adm3 matches perfectly for
 Karnataka, but not for other states. There might be some other states where
 this holds good, I don't know.

 Anyway, so if one really wants CD block level boundaries, we have to look
 at Justin, I guess.

 But the GDAM boundaries are not 'wrong'.

 Sharad

 On 07-Aug-14 9:48 AM, Devdatta Tengshe wrote:

  In Continuation of my previous email, here is a CSV file which shows
 just how bad the GDAM dataset is.

  Regards,
  Devdatta


  On Thu, Aug 7, 2014 at 9:06 AM, Devdatta Tengshe devda...@tengshe.in
 wrote:

 Hi Sharad,

 I just download the GDAM data again, to confirm what you have said.

 I'm going to have to disagree with you about the quality of the IND_adm3
 data.


 Acoording to the 2001 Census, there are 5454 Sub Districts in India
 http://www.socialjustice.nic.in/pdf/tab11.pdf. The GDAM dataset has
 just 2299 features.

 So clearly these taluk features do not correspond to the 2001 Census. I
 cross checked for some areas I have ground knowledge of, and I can say that
 this dataset is not from any specific era. Some tehsils in the file were
 created post 2001, while others created in the 90's were not present.

 In my opinion the GDAM data is pretty much unusable.


 Regards,
 Devdatta


 On Wed, Aug 6, 2014 at 11:36 PM, Sharad Lele sharad.l...@gmail.com
 wrote:

 I have downloaded and checked the GADM boundaries (my version is 2011).
 The taluka boundary layer probably holds good today, becuase few talukas
 get split. Districts get split regularly (every so many years) so the
 district boundary layer in this GADM set is quite of date (may apply to
 2001 or so). The spatial registration (positional accuracy is ~1km, and the
 spatial detail is of course not as good as the boundaries given in a Survey
 of India 50k topo, but then that is an unfair standard, so by a more
 generalized standard, the quality is okay.

 Sharad


 On Monday, August 4, 2014 7:20:38 PM UTC+5:30, Dilip Damle wrote:

 Mr Thakkar,

 Please also look at another post (more than one) on this group  about
 Taluk Shapefiles by Justin Meyers

 So far as I know GADM is the source that has Taluk files.
 I am not sure about its completeness and accuracy as on today

 http://www.gadm.org/

 On Monday, August 4, 2014 6:23:07 PM UTC+5:30, D Thakker wrote:

  thanks Dilip for your hardwork.
 I have been on a lookout for all taluka / tehsil shape file, so how
 do I be in a loop as I am very keen to see the repository mail / list.


 On Monday, August 4, 2014 9:50:26 AM UTC+5:30, Dilip Damle wrote:

 Sharad,

 I am working on some things will revert in about a week or may be
 more.

 Thejesh,

  Go ahead,

 Actually there was one more source a Low Resolution (vertices)
 District map by VDS technologies.
 I have it as Polylines in Autocad. I seem to have lost the original
 file.
 If anyone has then please share it. (it does not seem to be on their
 site now)

 On Sunday, August 3, 2014 11:32:43 PM UTC+5:30, Thejesh GN wrote:

 Actually its not 

Re: [datameet] Re: Shapefiles for complete India

2014-08-07 Thread Sharad Lele [शरच्चंद्र लेले]

On 07-Aug-14 8:18 PM, Justin Meyers wrote:
As per our conversation yesterday, could you upload your data?  You 
keep criticizing everyone else's data, but you don't share your own!?? 
 Help me help you :)

Dear Justin:

I think you are going too far and too fast. A more moderate tone would 
be appreciated.


1. I never criticized anyone: in fact, i said GDAM was ok, and you were 
the one who used fairly strong words against it.
2. I have only clarified out that your layer reflects CD block 
boundaries while GDAM reflects tehsil boundaries. Both may be 'ok' in 
what they represent.
3. The only reason I have not shared any data of mine is simply because 
I have only two datasets: the GDAM one (which is both already available 
and being criticised by some ;-)) and a dataset of admin boundaries 
(district and tehsil) for Karnataka state alone which we paid for and 
got digitized from District Census Handbooks of 1991. When I check GDAM 
or your layer versus this layer, I see no major differences. So then 
what is the point in uploading that data?


Sharad


--
Democratizing Forest Governance in India
(In press with Oxford University Press India)

--
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups datameet group.

To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [datameet] [Bangalore] Monthly working

2014-08-07 Thread Thejesh GN
I am in. I love small and targeted meetups/hackathons.

Thej
--
Thejesh GN *⏚* ತೇಜೇಶ್ ಜಿ.ಎನ್
http://thejeshgn.com
GPG ID :  0xBFFC8DD3C06DD6B0


On Thu, Aug 7, 2014 at 8:59 AM, Devdatta Tengshe devda...@tengshe.in
wrote:

 Hi Nisha,

 I think that this is an excellent idea. Very often I am doing some
 tinkering along these lines on my own, but a hackathon kind of atmosphere
 is more conducive to doing these things quickly.

 I'm all in.

 Regards,
 Devdatta Tengshe


 On Wed, Aug 6, 2014 at 4:57 PM, Nisha Thompson ni...@datameet.org wrote:

 Hey Everyone,

 Based on some conversations at the last Bangalore Meet up it seems there
 is some interest in getting together to do some mini working sessions
 (hackathons) to get and clean data.

 Something like we did before the Bangalore Open Data Camp around PDFs and
 QGIS.  2 to 4 hours where people can work on helping get data out of
 websites, pdfs, clean up data or learn or trouble shoot.

 We can pick a Saturday or Sunday, have food provided and get some
 specific tasks from people who need help.

 Is this something people would like to try? Also this would be separate
 from the DataMeet ups which we would keep as talks or presentations.


 Nisha

 --
 Nisha Thompson
 DataMeet.org
 ni...@datameet.org
 skype: nishaqt
 mobile: 962-061-2245

 --
 Datameet is a community of Data Science enthusiasts in India. Know more
 about us by visiting http://datameet.org
 ---
 You received this message because you are subscribed to the Google Groups
 datameet group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to datameet+unsubscr...@googlegroups.com.
 For more options, visit https://groups.google.com/d/optout.


  --
 Datameet is a community of Data Science enthusiasts in India. Know more
 about us by visiting http://datameet.org
 ---
 You received this message because you are subscribed to the Google Groups
 datameet group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to datameet+unsubscr...@googlegroups.com.
 For more options, visit https://groups.google.com/d/optout.


-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
datameet group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [datameet] Re: Shapefiles for complete India

2014-08-07 Thread Dilip Damle
Hello all,

Nice enlightening discussion with a little sparks,

A small thumbnail photo of the member may help reduce sparks because then 
you create an image of that person in your mind based on the photo. BTW my 
photo here is about 40 years outdated now I look more ferocious :) 

Any way to add to the confusion there is also the Division 
So it seems to be 

Country  State  Division  District  Tehsil / Taluk  Some Kind of 
 Block  Village 

Let us first get this ans confirm  if each is subset of the other or there 
are overlaps and if there is any other item missing in between then let us 
proceed  

On Thursday, August 7, 2014 10:32:08 PM UTC+5:30, Justin Meyers wrote:

 Sharad,
 Sorry about my tone; I get excited/ pushy when it comes to geospatial 
 data.  In my experience, you have to ask, ask again, and keep asking (like 
 I did with you until you finally responded).  I guess I should only ask 
 once, and if you don't respond, then you never will...

 In my honest opinion, GADM is junk.  If you want me to explain the top ten 
 reasons why, I can.  

 As per you sharing your data; if you can share it, it would be great. 
  Digitized one off data is unique, and could be better than what is on this 
 forum.  If you or your company paid for the data, and cannot share, it 
 makes sense.  I figure since you are here, looking for free data (or 
 whatever), asking questions about it, and have data, why not share what you 
 have.

 Just my 2 cents.  Sorry for coming off as a jerk. I apologize.

 Justin

 On Thursday, August 7, 2014 12:51:39 PM UTC-4, Sharad Lele wrote:

 On 07-Aug-14 8:18 PM, Justin Meyers wrote: 
  As per our conversation yesterday, could you upload your data?  You 
  keep criticizing everyone else's data, but you don't share your own!?? 
   Help me help you :) 
 Dear Justin: 

 I think you are going too far and too fast. A more moderate tone would 
 be appreciated. 

 1. I never criticized anyone: in fact, i said GDAM was ok, and you were 
 the one who used fairly strong words against it. 
 2. I have only clarified out that your layer reflects CD block 
 boundaries while GDAM reflects tehsil boundaries. Both may be 'ok' in 
 what they represent. 
 3. The only reason I have not shared any data of mine is simply because 
 I have only two datasets: the GDAM one (which is both already available 
 and being criticised by some ;-)) and a dataset of admin boundaries 
 (district and tehsil) for Karnataka state alone which we paid for and 
 got digitized from District Census Handbooks of 1991. When I check GDAM 
 or your layer versus this layer, I see no major differences. So then 
 what is the point in uploading that data? 

 Sharad 


 -- 
 Democratizing Forest Governance in India 
 (In press with Oxford University Press India) 



-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
datameet group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.