Hi Pradeep, The 12k that you're seeing on GitHub seems to be a limitation with the number of lines of a single gist that can render on the web interface. The raw CSV file when downloaded should contain the 22k+ lines you're looking for: https://gist.githubusercontent.com/Vonter/dde3c47dfd3ca11e678cea61821aa099/raw/ba6b45cfcd8f1e7f17ef038ac1a08a70410630e0/villages.csv
Regards, Vivek On Monday, 13 February 2023 at 03:08:09 UTC+5:30 Pradeep Vanga wrote: > Hi Vivek, it looks like the csv file contains only about 12k+ entries. > > (It looks like I replied to the author and not this thread earlier, I have > also scraped the data and uploaded it here :) > https://www.kaggle.com/datasets/vangap/madhya-pradesh-village-list ) > > On Monday, February 6, 2023 at 9:49:48 PM UTC+5:30 Vivek Matthew wrote: > >> Hi Sharad, >> >> Nice catch regarding the switch to Hindi. The choice of English/Hindi >> names returned by the server is based on the cookie sent with the request. >> >> I've scraped the village list and put it as a CSV and JSON here: >> https://gist.github.com/Vonter/dde3c47dfd3ca11e678cea61821aa099 >> >> There are 23170 villages in there, but based on my count it looks like >> there's about a dozen of them without Devanagiri names. >> >> On Saturday, 4 February 2023 at 11:01:58 UTC+5:30 [email protected] >> wrote: >> >>> [image: mndagofkncnplmoa.png] >>> But btw, there is an option on the main PRD website to switch to Hindi, >>> and when I do that, then when I go searching for specific Gram Panchayats, >>> I do get this search menu, which suggests that at the backend somewhere the >>> Hindi lists also exist? Any ideas? >>> On Saturday, February 4, 2023 at 10:32:59 AM UTC+5:30 Sharad Lele wrote: >>> >>>> Dear Nikhil, >>>> >>>> Thanks for your help and yes, I assumed (incorrectly) that if the menus >>>> are in Hindi then the data will also be in Hindi/Devanagari! >>>> Unfortunately, >>>> as you pointed, out, the data are still in English/Roman script. >>>> >>>> Which means I have to expand my request: any one who can find a website >>>> that has village name lists in Hindi/Devanagari (for MP in particular), >>>> please flag. If someone has the data already in Devanagari, great! >>>> >>>> Sreeram pointed out that the list on the govt of India's LGDIR website >>>> has devanagari names for some states, but in the case of MP, the column >>>> for >>>> names in Devanagari is very sporadically filled! >>>> >>>> Sharad >>>> >>>> On 04-Feb-23 10:11, Nikhil VJ wrote: >>>> >>>> Hi Sharad, >>>> >>>> The site you linked is quite easy to scrape with basic GET api calls >>>> (aka you can open the url in browser also) giving the data in a proper >>>> structure that can be directly used by a program. >>>> >>>> *But : the data is all in English only.* >>>> >>>> Anyways, in case you want to scrape, can get someone to do using: >>>> >>>> Districts list: >>>> https://www.prd.mp.gov.in/Handlers/Districts.ashx?DivisionID=0 >>>> >>>> Take district ID from there to get local bodies list: >>>> https://www.prd.mp.gov.in/Handlers/localbodies.ashx?DistrictID=*45* >>>> &Is_Rural=1 >>>> >>>> Take "LBID" from there to get GP Zones: >>>> https://www.prd.mp.gov.in/Handlers/GpZones.ashx?LbId=*24319* >>>> >>>> >>>> Note to freshers in python coding who are looking for real world use >>>> cases to learn and apply their skills: This is a good starting project. >>>> Make 3 nested for loops and append all the results to a dict (json) >>>> array. At end, convert to a pandas dataframe, and output to CSV. >>>> https://www.prd.mp.gov.in/Handlers/Districts.ashx?DivisionID=0 >>>> -- >>>> Cheers, >>>> Nikhil VJ >>>> https://nikhilvj.co.in >>>> >>>> >>>> On Fri, Feb 3, 2023 at 12:10 AM Sharad Lele <[email protected]> wrote: >>>> >>>>> For instance, if someone can scrape the names from this website: >>>>> https://www.prd.mp.gov.in/GramSearch/SearchPanchayat.aspx >>>>> (sequentially, so as to get the district, block and GP tags also) >>>>> >>>>> On Thursday, February 2, 2023 at 9:47:01 PM UTC+5:30 Sharad Lele wrote: >>>>> >>>>>> I am looking for the census village list for Chhattisgarh and Madhya >>>>>> Pradesh (for starters) in Devanagari (Hindi script). Preferably with >>>>>> Census >>>>>> 2011 codes, so that I can quickly match them to the Census dataset, >>>>>> which >>>>>> is in English. But even if no codes attached, an accurate list with >>>>>> tehsil/block and district tags in digital format (not pdf hopefully) >>>>>> will >>>>>> be a big help. >>>>>> >>>>>> Any suggestions, folks? >>>>>> >>>>>> Sharad >>>>>> >>>>> -- >>>>> Datameet is a community of Data Science enthusiasts in India. Know >>>>> more about us by visiting http://datameet.org >>>>> --- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "datameet" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to [email protected]. >>>>> To view this discussion on the web visit >>>>> https://groups.google.com/d/msgid/datameet/4c644c88-1d3b-4e71-81a2-2e48d6c794c3n%40googlegroups.com >>>>> >>>>> <https://groups.google.com/d/msgid/datameet/4c644c88-1d3b-4e71-81a2-2e48d6c794c3n%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>> . >>>>> >>>> -- >>>> Datameet is a community of Data Science enthusiasts in India. Know more >>>> about us by visiting http://datameet.org >>>> --- >>>> >>>> You received this message because you are subscribed to a topic in the >>>> Google Groups "datameet" group. >>>> To unsubscribe from this topic, visit >>>> https://groups.google.com/d/topic/datameet/x2kTZlgPeb4/unsubscribe. >>>> To unsubscribe from this group and all its topics, send an email to >>>> [email protected]. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/datameet/CAH7jeuMKboJCh71j%2Bncfbg%3DSbjpxjUVW4gWHk59VZ-2SKnN_7w%40mail.gmail.com >>>> >>>> <https://groups.google.com/d/msgid/datameet/CAH7jeuMKboJCh71j%2Bncfbg%3DSbjpxjUVW4gWHk59VZ-2SKnN_7w%40mail.gmail.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>>> >>>> -- Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org --- You received this message because you are subscribed to the Google Groups "datameet" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/datameet/df7b7705-1d8c-41c4-8893-96c7ff5781d6n%40googlegroups.com.
