Hi Vivek,
Thanks so much for this!
best wishes,
Peter

On Wednesday, June 5, 2024 at 5:46:09 PM UTC+9:30 Vivek Matthew wrote:

> This data with timestamps is great, Nikhil! It will be interesting to 
> compare with roundwise results when ECI puts it up for parliamentary 
> constituencies.
>
> ECI has already put out the roundwise vote counts for yesterday's assembly 
> election results: 
> https://results.eci.gov.in/AcResultGenJune2024/RoundwiseS011.htm?ac=1
>
>
> On Wed, Jun 5, 2024, 12:51 PM Nikhil VJ <[email protected]> wrote:
>
>> Hi all,
>>
>> Nice work Vivek!
>>
>> I was scraping to catch by *timestamp* how lead margins, vote counts 
>> change over time, from statewise results pages like this 
>> <https://results.eci.gov.in/PcResultGenJune2024/statewiseS011.htm>  and 
>> PC-wise 
>> results pages like this 
>> <https://results.eci.gov.in/PcResultGenJune2024/ConstituencywiseS0124.htm>
>>  .
>>
>> I've collated the data and posted it along with scraping and 
>> collating (python) scripts on this github repo:
>> https://github.com/answerquest/india-elections2024-results-timewise
>>
>> *Flaws in this data:*
>> 1. Didn't catch it all from the beginning : leads-margins tally scraping 
>> was started from around *1:50 pm,* per-candidate vote numbers scraping 
>> was started from around *4.30pm.* 
>> 2. There would be some missed time intervals for some constituencies 
>> sometimes some pages didn't load, script errored out due to edge cases
>> 3. I bungled up on applying "U" prefixes for union territories so those 
>> rows were scraped quite late. 
>>
>> But all in all I think it's a pretty good dataset to make time-series 
>> viz's, 
>> to "audit" tallies over time and detect out-of-norm additions, etc for 
>> folks who are interested in settling some ongoing debates using data.
>>
>> *Disclaimer :* I'm only sharing the data as-was at those timestamps, 
>> this is secondary scraped data that is prone to flukes like a html tag 
>> mis-rendering causing a bad number to come in. If you find something odd, 
>> kindly lookup the official sources, file RTIs etc, but leave me out of it 
>> pls.
>>
>>
>> ------------------------------------
>>
>> *My compliments to Election Commission of India, in case anyone from 
>> there is reading:*
>> 1. It was good to have whole integers of absolute vote counts given by 
>> ECI. Hope to see this maintained. This was a lot better than the 
>> rounded-off fractions of vote-shares we were getting during the US 2020 
>> elections counting which had made it impossible to calculate the actual 
>> numbers of votes.
>>
>> 2. Good website work, consistent naming of each constituency / state's 
>> pages and consistent page structures.
>>
>> 3. Page-not-opening cases were there but were rare, and the chinks 
>> disappeared from around evening onwards when the declarations were 
>> happening and I'd expect more site visitors. On my part, I ensured my 
>> scripts were hitting 1-at-a-time only, kept adequate time intervals etc so 
>> that I don't bombard the server (to coders : this was intentional. Don't 
>> suggest "fixing" it by parallel threading etc, that gets you 429'd).
>>
>> 4. Candidates' photos were properly organized and were instantly rendered 
>> on all the PC-wise pages I was checking out. Which means each and every 
>> candidate was properly tracked in the DB and their files were properly 
>> linked and small thumbnails were kept, as opposed to past elections when 
>> there would only be scanned pages listing all the candidates's totals. One 
>> suggestion: converting these to .webp format will shrink the sizes and your 
>> egress loads by around 10x.
>>
>> 5. Even prior to election, voters lists were quite well managed, even the 
>> voter roll pdfs were easy to download, and it was quite easy to find our 
>> part + serial number provided we'd done our homework (which FYI was the 
>> only info we needed in hand apart from photo id on voting day, if you just 
>> shared these with the officer when you entered the booth, they'd locate 
>> your entry in 5 seconds and you would be done voting in under a minute.)
>>
>> 6. All in all, we've come a long way in digitization and making this data 
>> accessible to all, Thank you for all the work done. 
>>
>> 7. It would be great if you published some inside stories of the 
>> technical infrastructure (server specs etc) used on 4th June for serving 
>> the website.
>>
>>
>> --
>> Cheers,
>> Nikhil VJ
>> https://nikhilvj.co.in
>>
>>
>> On Wed, Jun 5, 2024 at 9:25 AM Vivek Matthew <[email protected]> wrote:
>>
>>> Hi all,
>>>
>>> I have scraped the 2024 Lok Sabha election results from the 
>>> results.eci.gov.in website. In case anyone is interested, you can find 
>>> the CSV with the results attached.
>>>
>>> Once constituency-wise turnout numbers are released for phase 7, I will 
>>> include additional columns for turnout and vote share numbers.
>>>
>>> Note that semicolon (;) is used as the column separator.
>>>
>>> Regards,
>>> Vivek
>>>
>>> -- 
>>> Datameet is a community of Data Science enthusiasts in India. Know more 
>>> about us by visiting http://datameet.org
>>> --- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "datameet" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to [email protected].
>>> To view this discussion on the web visit 
>>> https://groups.google.com/d/msgid/datameet/0c95b2a3-27d3-4146-8ce3-44a49ae72f6fn%40googlegroups.com
>>>  
>>> <https://groups.google.com/d/msgid/datameet/0c95b2a3-27d3-4146-8ce3-44a49ae72f6fn%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>> -- 
>> Datameet is a community of Data Science enthusiasts in India. Know more 
>> about us by visiting http://datameet.org
>> --- 
>>
> You received this message because you are subscribed to a topic in the 
>> Google Groups "datameet" group.
>> To unsubscribe from this topic, visit 
>> https://groups.google.com/d/topic/datameet/pmbjGJNGHaM/unsubscribe.
>> To unsubscribe from this group and all its topics, send an email to 
>> [email protected].
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/datameet/CAH7jeuMg6EiA0ZL1x8hjVOQi8BxB10vqme5seQbY0-R%2BSbjK7g%40mail.gmail.com
>>  
>> <https://groups.google.com/d/msgid/datameet/CAH7jeuMg6EiA0ZL1x8hjVOQi8BxB10vqme5seQbY0-R%2BSbjK7g%40mail.gmail.com?utm_medium=email&utm_source=footer>
>> .
>>
>>

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/datameet/6e0e1c19-6b03-4323-97c9-ed6b6ca920a6n%40googlegroups.com.

Reply via email to