There are over +400,000 records, so the upload will take a very long time 
and will require more RAM than I can currently spare. (I have a rather 
extensive Bayesian measurement model running right now that take priority). 
Do you know Python? I could give you my scrapping code.  

On Tuesday, December 16, 2014 10:26:49 PM UTC-5, Anand Chitipothu wrote:
>
> On Tue, Dec 16, 2014 at 11:56 AM, Rick Morgan <[email protected] 
> <javascript:>> wrote:
>>
>> Hello, 
>>
>> Are you still looking for this data? With a rather complex Python code I 
>> scraped  http://www.censusindia.gov.in/Census_Data_2001/
>> Village_Directory/View_data/Village_Profile.aspx 
>> <http://www.google.com/url?q=http%3A%2F%2Fwww.censusindia.gov.in%2FCensus_Data_2001%2FVillage_Directory%2FView_data%2FVillage_Profile.aspx&sa=D&sntz=1&usg=AFQjCNGcG6qYyc27owTt4u8QxU1XoKEWzA>
>>  and 
>> collected about 90% of all the records over the course of 6 months... The 
>> other 10% are corrupt or otherwise missing. I had to move to other tasks, 
>> so I walked away, but I plan to collect the remaining records within the 
>> next few months.
>>
>> I have 473,514 complete records. Currently, the majority are in *.html 
>> format. The file is rather large, so I will have to send it over in 
>> batches. 
>>
>
> You could upload them to archive.org and share a link here.
>
> Anand
>  

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to