There are over +400,000 records, so the upload will take a very long time and will require more RAM than I can currently spare. (I have a rather extensive Bayesian measurement model running right now that take priority). Do you know Python? I could give you my scrapping code.
On Tuesday, December 16, 2014 10:26:49 PM UTC-5, Anand Chitipothu wrote: > > On Tue, Dec 16, 2014 at 11:56 AM, Rick Morgan <[email protected] > <javascript:>> wrote: >> >> Hello, >> >> Are you still looking for this data? With a rather complex Python code I >> scraped http://www.censusindia.gov.in/Census_Data_2001/ >> Village_Directory/View_data/Village_Profile.aspx >> <http://www.google.com/url?q=http%3A%2F%2Fwww.censusindia.gov.in%2FCensus_Data_2001%2FVillage_Directory%2FView_data%2FVillage_Profile.aspx&sa=D&sntz=1&usg=AFQjCNGcG6qYyc27owTt4u8QxU1XoKEWzA> >> and >> collected about 90% of all the records over the course of 6 months... The >> other 10% are corrupt or otherwise missing. I had to move to other tasks, >> so I walked away, but I plan to collect the remaining records within the >> next few months. >> >> I have 473,514 complete records. Currently, the majority are in *.html >> format. The file is rather large, so I will have to send it over in >> batches. >> > > You could upload them to archive.org and share a link here. > > Anand > -- Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org --- You received this message because you are subscribed to the Google Groups "datameet" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
