Nice.

On Monday, March 28, 2016 at 11:20:41 AM UTC+5:30, Raphael Susewind wrote:
>
> Dear Avinash and all,
>
> I will try to make some time this week to scrape the pincodes from
> electoral rolls for all polling booths in my electoral GIS shapefiles.
>
> Since the pincode is in Latin script, this should not be affected by the
> much discussed PDF scraping issues with electoral rolls.
>
> We could then either go down the Voronoi route, or alternatively use the
> heatmap processing chain that I used to generate AC boundaries. The
> latter would have the advantage of dealing with wrong coordinates in the
> booth point dataset (basically, not all electoral booth coordinates are
> correct; consequently, if we only use Voronoi, we would quite frequently
> have a blip of pincode B within a sea of pincode A. The heatmap stuff
> takes care of this).
>
> Since I am not familiar with postal boundaries: can anyone here confirm
> whether pincode areas are contiguous, and whether each pincode has only
> one area? Or can it be that several non-contiguous areas have the same
> pincode, interspersed with other pincodes? (In which case Voronoi would
> perhaps be the better solution after all.)
>
> In any case, I hope to give you the pincode for each polling booth by
> end of the week or so (based on all-India 2014 electoral rolls).
>
> Best,
> Raphael
>
> On 28.03.2016 06:33, Avinash Celestine wrote:
> >
> > perhaps one way is to avoid using postal data altogether.
> >
> > All header pages in electoral rolls (the first page) contain the name of
> > the polling station related to that roll, the PS number, and importantly
> > the pin code.
> >
> > A site like psleci.nic.in <http://psleci.nic.in> has geographic coordinates
> > of polling stations (though Raphael had collected the data earlier*).
> > Matching the two will give a fairly dense scattering of points - in
> > fact much more dense than if we used some of the methods earlier in this
> > thread.
> >
> > We thus have a way of associating a pin code with a geo coordinate. We
> > can then use the Voronoi method.
> >
> > Electoral rolls are mostly in PDF, which makes them difficult to scrape.
> > But from what I have seen, for any given state, the location of the
> > pincode number on the header page is more or less constant, making it
> > possible to target just that part of the page with any PDF parser.
> >
> > Electoral rolls have become difficult to download in bulk (a good
> > thing!) but I understand different people on this group have the PDFs
> > for different states. Putting this stuff together should give us
> > comprehensive data on header pages for at least some states.
> > Alternatively, we can file RTIs for just the header pages of electoral
> > rolls, though I don't know how successful that would be.
> >
> > * Raphael's data is
> > at https://github.com/raphael-susewind/india-election-data
> >
> > On Sun, Mar 27, 2016 at 12:07 PM, srinivas kodali <[email protected]> wrote:
> >
> >     Well, there were postal delivery zones in the past and the postal
> >     department even used to make maps of these zones. The Delhi postal
> >     delivery zone map
> >     <https://drive.google.com/file/d/0B1RcWLku0ZOWWVBHMldrZWdfZEU/view?usp=sharing>
> >     had boundaries for Delhi. I am not sure if other cities had them or how
> >     long the postal department was doing this, but it certainly can help
> >     with the boundaries for cities.
> >
> >     Regards,
> >     Srinivas Kodali
> >     www.lostprogrammer.com <http://www.lostprogrammer.com>
> >     /"Not everyone who wanders is lost, I am probably a bit"/
> >
> >     On Tue, Mar 22, 2016 at 9:29 PM, Arun Ganesh <[email protected]> wrote:
> >
> >         Shravan, crowdsourcing the boundaries of pincodes is not as
> >         trivial as you think.
> >         To start with, an area does not fall under
> >         a pincode; rather, a street does, based on the post office that
> >         services it. Read
> >         this: http://www.georeference.org/doc/zip_codes_are_not_areas.htm
> >
> >         You may also want to do some background reading of existing
> >         research that has been done by the group
> >         here: https://datameet.hackpad.com/M4hPFJVV2Gm?eid=v4YoXN4tTw5
> >
> >         To sum up, nobody has precise pincode boundaries like how you
> >         imagine them, not even the postal department. Any existing
> >         datasets are an estimate at best, using some data processing on a
> >         large volume of address data.
> >
> >         --
> >         Datameet is a community of Data Science enthusiasts in India.
> >         Know more about us by visiting http://datameet.org
> >         ---
> >         You received this message because you are subscribed to the
> >         Google Groups "datameet" group.
> >         To unsubscribe from this group and stop receiving emails from
> >         it, send an email to [email protected].
> >         For more options, visit https://groups.google.com/d/optout.
> >
>
> --
> Dr Raphael Susewind | Associate, Contemporary South Asia Studies, Oxford
> Snail Mail | Melanchthonstr. 4a, 33615 Bielefeld, Germany
> Web & Twitter | https://www.raphael-susewind.de | @RaphaelSusewind
> Impact | https://impactstory.org/raphael-susewind
>
> Please consider https://www.gnupg.org for encryption (key id 10AEE42F)
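
To make the Voronoi discussion in the quoted messages a bit more concrete, here is a minimal sketch of what assigning a pincode from booth points could look like. Everything in it is an assumption for illustration: the `BOOTHS` sample data, the `pincode_at` helper, and the use of plain Euclidean distance on longitude/latitude (a real run would want projected coordinates). With `k=1` it is a nearest-booth lookup, which gives the same answer as a Voronoi cell lookup. With `k>1` it takes a majority vote among neighbouring booths; that is only a simple stand-in for damping miscoded coordinates, not the heatmap processing chain Raphael mentions.

```python
from collections import Counter
from math import hypot

# Hypothetical sample data: (longitude, latitude, pincode) per polling booth.
BOOTHS = [
    (0.0, 0.0, "110001"),
    (0.1, 0.0, "110001"),
    (0.0, 0.1, "110001"),
    (0.05, 0.05, "110002"),  # simulated miscoded booth inside the 110001 cluster
    (1.0, 1.0, "110002"),
    (1.1, 1.0, "110002"),
    (1.0, 1.1, "110002"),
]


def pincode_at(point, booths, k=1):
    """Return the majority pincode among the k booths nearest to `point`.

    k=1 is a plain nearest-booth lookup, i.e. the pincode of the Voronoi
    cell that contains `point`. Larger k lets neighbouring booths outvote
    a single booth with a wrong coordinate or pincode.
    """
    x, y = point
    # Sort booths by planar distance to the query point and keep the k closest.
    nearest = sorted(booths, key=lambda b: hypot(b[0] - x, b[1] - y))[:k]
    votes = Counter(b[2] for b in nearest)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    query = (0.04, 0.04)
    print("k=1:", pincode_at(query, BOOTHS, k=1))
    print("k=3:", pincode_at(query, BOOTHS, k=3))
```

With the sample data, the pure nearest-booth answer near the miscoded booth is the "blip" pincode, while the three-booth vote returns the surrounding one. If pincodes turn out to be non-contiguous, as asked in the thread, a large `k` would smooth away real enclaves too, so the choice of `k` is a trade-off rather than a fix.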
