Devdatta,
Do you have and taluk boundaries from 2001?  I just created some (have 5204 
features as of today).  I am doing a lot of processing to fill in the gaps. 
 I think I might be able to get the 2001 taluks sometime soon, any help 
would be greatly appreciated!

Justin

On Friday, July 18, 2014 9:46:00 AM UTC-4, Justin Meyers wrote:
>
> Devdatta,
> I lined up a map (spent 5 minutes, might be able to do it a little better 
> if I spend more time) for Orissa.  I used a district shapefile from their 
> government (or potentially the SOI), I believe it has a custom projection. 
>  They had it up a few weeks ago, then the website disappeared...  I 
> rectified, then used a unsupervised classification, then vectorized.  I 
> haven't gone in and cleaned up the data, but do you think this would be 
> worthwhile developing?  I would rather have software do all the work than 
> actually tracing lines myself - Let me know your thoughts.
>
> Data is here: https://app.box.com/s/lfeg76yxkcqpyixojorg
>
> Justin
>
> On Friday, July 18, 2014 12:16:31 AM UTC-4, Justin Meyers wrote:
>>
>> Devdatta,
>> Yikes!  I was really hoping there was some dataset(s) out there that 
>> actually made sense...  Even the census tables from the 
>> http://censusindia.gov.in/ have duplicates, and it isn't always clear 
>> what record should be used.   Some of the village level data I have seen 
>> shows the wrong tehsil code in a central town (lets say the town code is 
>> 33333333xxxxxxxx, all the surrounding villages have codes that are 
>> 33444444xxxxxxxx).  I have worked with some wild data in the past, but 
>> India seems like a nightmare.What it will most likely come down to is that 
>> it will make sense that it doesn't make sense... if that makes sense!?!
>>
>> I think I need to collect my thoughts with all this and re-calibrate. I 
>> have a couple ideas, but I tried them and the results didn't make sense (so 
>> maybe they are correct (makes sense that it doesn't make sense...!??)).
>>
>> I'll keep you posted.  If you come up with anything, or additional 
>> resources, please let me know.
>>
>> Cheers!
>> Justin
>>
>> On Thursday, July 17, 2014 11:45:29 PM UTC-4, Devdatta Tengshe wrote:
>>>
>>> Hi Justin,
>>>
>>> It's very hard to look at Survey of India Digital Data and preserve your 
>>> sanity. As you have found out, the boundaries of different Administrative 
>>> levels do not match. There are many reasons for this, and not all of them 
>>> are solvable.
>>>
>>> The boundaries in the PDFs are generalised no doubt, but if one takes 
>>> care while digitizing at the correct scale, one shouldn't have much 
>>> problems. See the district shapefiles on the github repo. They were made 
>>> from a top down procedure. I used the PC boundaries for the country and 
>>> state boundaries. The individual district boundaries were made by referring 
>>> to these very Census maps, as well as tehesil boundaries. I also used an 
>>> custom tool which I have developed, which helps in cutting one polygon 
>>> based on another polygon, which tremendously cut down the time I spent on 
>>> creating these internal boundaries.So while the district boundaries might 
>>> be generalised in some cases, that the best, updated shapefile I know of 
>>> today.
>>>
>>> Having worked with government departments, I have learnt that getting 
>>> data itself is a big task.Any data is a boon. And once I get the data, I 
>>> don't expect it to match anything else. With this paradigm, the Census maps 
>>> are a goldmine for me.
>>>
>>> Regards,
>>> Devdatta
>>>
>>>
>>> On Fri, Jul 18, 2014 at 8:58 AM, Justin Meyers <justinell...@gmail.com> 
>>> wrote:
>>>
>>>> Devdatta,
>>>> Thanks for the quick response. I thought the files originated from the 
>>>> Survey of India, but wasn't certain.  I started to create a villages 
>>>> dataset, but the tehsils do not really align with what the 2001 census 
>>>> villages state their respected tehsil parent is...  So I am assuming all 
>>>> of 
>>>> the data from the gevernment is a mix bag (spelling may be off, codes may 
>>>> be wrong/ outdated, data may be mixed between years).  What a mess!?!?!  
>>>> As 
>>>> per rectifying and creating maps based off the PDFs, I'm not sure I would 
>>>> do that.  The lines they have for boundaries are very, very generalized. 
>>>>  Also, I tried (a few years ago) to line them up with actual vector data, 
>>>> and there is a huge shift (i was using WGS84 vector data, so maybe I 
>>>> should 
>>>> have reprojected).
>>>>
>>>> Maybe it would be best to start top down or bottom up.  So either build 
>>>> a dataset from villages up to states or states down to villages.
>>>>
>>>> Thoughts?  We need some official data though (which seems impossible to 
>>>> find...).  But anything is possible, right!?!
>>>>
>>>> Cheers,
>>>> Justin
>>>>
>>>> On Thursday, July 17, 2014 11:17:33 PM UTC-4, Devdatta Tengshe wrote:
>>>>
>>>>> Hi Justin,
>>>>> I know the euphoria that one has when one has done something new. It's 
>>>>> one of the best things in the world.
>>>>>
>>>>> If the original source you mentioned is Bhuvan, then the files came 
>>>>> directly from Survey of India. I have used those files before, and as you 
>>>>> mentioned there were only some 2000 Odd features in it.
>>>>>
>>>>> There are not from any specific era. Some tehsils in the file were 
>>>>> created post 2001, while others created in the 90's were not present.
>>>>>
>>>>> The only exhaustive source I know, is the Census Administrative Atlas. 
>>>>> They have maps in PDF format, not in shapefiles, and I had used it to 
>>>>> create the district shapefiles which are shared on the datameet github 
>>>>> repos.
>>>>> Sometimes I feel I should get started on digitizing those pdfs. It 
>>>>> shouldn't take more than 40 hours. 
>>>>>
>>>>> Regards,
>>>>> Devdatta Tengshe
>>>>>
>>>>>
>>>>> On Fri, Jul 18, 2014 at 8:27 AM, Justin Meyers <justinell...@gmail.com
>>>>> > wrote:
>>>>>
>>>>>> Devdatta,
>>>>>> Sorry I didn't type that up.  I just finished processing it and was 
>>>>>> excited and posted.  The previous file i posted had 2,693 features.  
>>>>>> This 
>>>>>> file has 2,739 features.  Initially I thought the data was relevant to 
>>>>>> 2001, but maybe it is 1991 (I have no metadata, the Indian government 
>>>>>> does 
>>>>>> not respond to my e-mails (I have sent at least a dozen, but they do not 
>>>>>> respond)).  I am not certain of the exact source, it is hosted by the 
>>>>>> Bhuvan (who do not respond to emails either....).
>>>>>>
>>>>>> As per any processing, I took the data and sorted the attributes (it 
>>>>>> was a long string all attached as one - so i split it and created the 
>>>>>> fields).
>>>>>>
>>>>>> Any other questions?  If you know of a more current dataset please 
>>>>>> post!!
>>>>>>
>>>>>>
>>>>>> On Thursday, July 17, 2014 9:19:41 PM UTC-4, Devdatta Tengshe wrote:
>>>>>>
>>>>>>> Hi Justin,
>>>>>>>
>>>>>>> Can you let us know what was the procedure to create this file, and 
>>>>>>> this is accurate upto which date?
>>>>>>> I'm asking this shapefile has 2739 sub districts, and according to 
>>>>>>> the census, there should be 5564.
>>>>>>>
>>>>>>> Regards,
>>>>>>> Devdatta Tengshe
>>>>>>>
>>>>>>>
>>>>>>> On Thu, Jul 17, 2014 at 11:15 PM, Justin Meyers <
>>>>>>> justinell...@gmail.com> wrote:
>>>>>>>
>>>>>>>> https://app.box.com/s/486rvabh3sjviiynbyu4
>>>>>>>>
>>>>>>>>
>>>>>>>> Cheers!
>>>>>>>>
>>>>>>>> -- 
>>>>>>>> Datameet is a community of Data Science enthusiasts in India. Know 
>>>>>>>> more about us by visiting http://datameet.org
>>>>>>>> --- 
>>>>>>>> You received this message because you are subscribed to the Google 
>>>>>>>> Groups "datameet" group.
>>>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>>>> send an email to datameet+u...@googlegroups.com.
>>>>>>>>
>>>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>>>
>>>>>>>
>>>>>>>  -- 
>>>>>> Datameet is a community of Data Science enthusiasts in India. Know 
>>>>>> more about us by visiting http://datameet.org
>>>>>> --- 
>>>>>> You received this message because you are subscribed to the Google 
>>>>>> Groups "datameet" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>> send an email to datameet+u...@googlegroups.com.
>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>
>>>>>
>>>>>  -- 
>>>> Datameet is a community of Data Science enthusiasts in India. Know more 
>>>> about us by visiting http://datameet.org
>>>> --- 
>>>> You received this message because you are subscribed to the Google 
>>>> Groups "datameet" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>> an email to datameet+u...@googlegroups.com.
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>>

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to