Re: [datameet] Mapping Local Government Directory to WikiData

2021-03-20 Thread Thejesh GN
Bodhisattwa - Thank you. Added a note -
https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_India#Syncing_Local_Government_Data_to_WikiData

Arun - Sure. How do we proceed?


I also have the udise_districts and udise_blocks in the same SQLITE.
udise_districts uses a completely different  *udise_dist_code*. I will try
and map wikiDataId to this as well.
https://india-local-government-directory.glitch.me/india-local-government-directory/udise_districts

udise_blocks are completely different from blocks as a geographical area. I
am not going to pick it up as of now.
https://india-local-government-directory.glitch.me/india-local-government-directory/udise_blocks

My plan to pickup sub-district after this.

Thej
--
Thejesh GN *⏚* ತೇಜೇಶ್ ಜಿ.ಎನ್
http://thejeshgn.com
GPG ID :  0xBFFC8DD3C06DD6B0


On Sun, 21 Mar 2021 at 00:09, Arun Ganesh  wrote:

> Very cool Thejesh! The LGD dataset is definitely super useful to help
> reconcile various other datasets that reference any territory.
>
> Have been maintaining a dump of all the other LGD lookups here
> https://github.com/planemad/india-local-government-directory . Would be
> great to have it merged with the datmeet repo and see how we can maintain
> an easy to access dump of https://lgdirectory.gov.in
>
> On Sat, Mar 20, 2021 at 10:28 AM Bodhisattwa Mandal <
> bodhisattwa.rg...@gmail.com> wrote:
>
>> Hi Thejesh,
>>
>> The best place to discuss this is here -
>> https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_India
>>
>> There are Wikidata contributors who had been working on this, who might
>> respond there.
>>
>> Thanks,
>> Bodhisattwa
>>
>>
>> On Sat, 20 Mar 2021 at 10:46, Thejesh GN  wrote:
>>
>>> LGD publishes some important IDs, that can be useful. I also think
>>> WikiData item Id as a primary key. I just started syncing both of them
>>> locally so, I can update the WikiData with missing Census Location IDs.
>>> States was easy, but districts turned out to be not so easy.
>>>
>>> I have blogged here
>>>
>>>
>>> https://thejeshgn.com/2021/03/20/mapping-local-government-directory-to-wikidata/
>>>
>>> But here are the differences. Let me know what do you guys think.
>>>
>>> WikiDataIdLabelDescriptionComments
>>> Q955977 South Arcot Former district in Tamil Nadu, India Needs to be
>>> marked as dissolved in WikiData
>>> Q1900496 Bangalore Former district in Karnataka, India Needs to be
>>> marked as dissolved in WikiData
>>> Q1606061 Andaman Former district of the Andaman and Nicobar Islands Needs
>>> to be marked as dissolved in WikiData
>>> Q24949801 Shahbazwan District of Bihar in India is this same as
>>> GOPALGANJ district? Marked by mistake in WikiData. Should be removed as a
>>> district.
>>> Q6007135 Imphal Wikimedia disambiguation page is ex-district. Was
>>> split. Needs to be marked as dissolved in WikiData
>>> Q48731903 Noklak District in India, Nagaland New district
>>> . LGD needs update.
>>> January 20, 2021.
>>> Q61746013 Narayanapet District of Telangana, India There seem to be a
>>> duplicate Narayanpet district (Q85787759); but Q61746013 was created
>>> earlier. DataCommons
>>>  also uses the
>>> same. It also has
>>> Q29025081 East Karbi Anglong District of Assam, India When KARBI
>>> ANGLONG was split. The western part became the new "West Karbi Anglong" and
>>> the rest remained part of "Karbi Anglong". There is no "East Karbi Anglong"
>>> as such. Should be removed in WikiData?
>>> Q101088203 Bajali district of Assam India New district
>>>  formed in 12 January
>>> 2021. LGD needs an update
>>> DONT KNOW Vijayanagara district of Karnataka in India New district
>>>  formed in 2020/21
>>> .
>>> Needs an addition to LGD. May be mark Q1611788
>>>  as district in WikiData?
>>> DONT KNOW Chachaura district of mp Missing on LGD, WikiData and OSM. No
>>> gazette yet
>>> DONT KNOW Maihar district of mp Missing on LGD, WikiData and OSM. No
>>> gazette yet
>>> DONT KNOW Nagda district of mp Missing on LGD and WikiData. No gazette
>>> yet.
>>> Q61439260 Pakke-Kessang district of Arunachal Pradesh in India It was
>>> missing from WikiData query results. Because it was not tagged as district.
>>> I updated
>>> 
>>> WikiData.
>>>
>>>
>>> Thej
>>> --
>>> Thejesh GN *⏚* ತೇಜೇಶ್ ಜಿ.ಎನ್
>>> http://thejeshgn.com
>>> GPG ID :  0xBFFC8DD3C06DD6B0
>>>
>>> --
>>> Datameet is a community of Data Science enthusiasts in India. Know more
>>> about us by visiting http://datameet.org
>>> ---
>>> You received this message because you are subscribed to the Google
>>> Groups "datameet" group.
>>> To 

Re: [datameet] Mapping Local Government Directory to WikiData

2021-03-20 Thread Arun Ganesh
Very cool Thejesh! The LGD dataset is definitely super useful to help
reconcile various other datasets that reference any territory.

Have been maintaining a dump of all the other LGD lookups here
https://github.com/planemad/india-local-government-directory . Would be
great to have it merged with the datmeet repo and see how we can maintain
an easy to access dump of https://lgdirectory.gov.in

On Sat, Mar 20, 2021 at 10:28 AM Bodhisattwa Mandal <
bodhisattwa.rg...@gmail.com> wrote:

> Hi Thejesh,
>
> The best place to discuss this is here -
> https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_India
>
> There are Wikidata contributors who had been working on this, who might
> respond there.
>
> Thanks,
> Bodhisattwa
>
>
> On Sat, 20 Mar 2021 at 10:46, Thejesh GN  wrote:
>
>> LGD publishes some important IDs, that can be useful. I also think
>> WikiData item Id as a primary key. I just started syncing both of them
>> locally so, I can update the WikiData with missing Census Location IDs.
>> States was easy, but districts turned out to be not so easy.
>>
>> I have blogged here
>>
>>
>> https://thejeshgn.com/2021/03/20/mapping-local-government-directory-to-wikidata/
>>
>> But here are the differences. Let me know what do you guys think.
>>
>> WikiDataIdLabelDescriptionComments
>> Q955977 South Arcot Former district in Tamil Nadu, India Needs to be
>> marked as dissolved in WikiData
>> Q1900496 Bangalore Former district in Karnataka, India Needs to be
>> marked as dissolved in WikiData
>> Q1606061 Andaman Former district of the Andaman and Nicobar Islands Needs
>> to be marked as dissolved in WikiData
>> Q24949801 Shahbazwan District of Bihar in India is this same as
>> GOPALGANJ district? Marked by mistake in WikiData. Should be removed as a
>> district.
>> Q6007135 Imphal Wikimedia disambiguation page is ex-district. Was split.
>> Needs to be marked as dissolved in WikiData
>> Q48731903 Noklak District in India, Nagaland New district
>> . LGD needs update.
>> January 20, 2021.
>> Q61746013 Narayanapet District of Telangana, India There seem to be a
>> duplicate Narayanpet district (Q85787759); but Q61746013 was created
>> earlier. DataCommons 
>> also uses the same. It also has
>> Q29025081 East Karbi Anglong District of Assam, India When KARBI ANGLONG
>> was split. The western part became the new "West Karbi Anglong" and the
>> rest remained part of "Karbi Anglong". There is no "East Karbi Anglong" as
>> such. Should be removed in WikiData?
>> Q101088203 Bajali district of Assam India New district
>>  formed in 12 January
>> 2021. LGD needs an update
>> DONT KNOW Vijayanagara district of Karnataka in India New district
>>  formed in 2020/21
>> .
>> Needs an addition to LGD. May be mark Q1611788
>>  as district in WikiData?
>> DONT KNOW Chachaura district of mp Missing on LGD, WikiData and OSM. No
>> gazette yet
>> DONT KNOW Maihar district of mp Missing on LGD, WikiData and OSM. No
>> gazette yet
>> DONT KNOW Nagda district of mp Missing on LGD and WikiData. No gazette
>> yet.
>> Q61439260 Pakke-Kessang district of Arunachal Pradesh in India It was
>> missing from WikiData query results. Because it was not tagged as district.
>> I updated
>> 
>> WikiData.
>>
>>
>> Thej
>> --
>> Thejesh GN *⏚* ತೇಜೇಶ್ ಜಿ.ಎನ್
>> http://thejeshgn.com
>> GPG ID :  0xBFFC8DD3C06DD6B0
>>
>> --
>> Datameet is a community of Data Science enthusiasts in India. Know more
>> about us by visiting http://datameet.org
>> ---
>> You received this message because you are subscribed to the Google Groups
>> "datameet" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to datameet+unsubscr...@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/datameet/CAABnYsUTfHZnmisWitBKBAGRBkYQ2OA8%2BuuK46MwRy8uNqiWTg%40mail.gmail.com
>> 
>> .
>>
> --
> Datameet is a community of Data Science enthusiasts in India. Know more
> about us by visiting http://datameet.org
> ---
> You received this message because you are subscribed to the Google Groups
> "datameet" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to datameet+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/datameet/CAHyrfgb4AybH9okfQ2S9-r71agRi5kkMGueTdt_hOQs%2B1e1eDw%40mail.gmail.com
> 

Re: [datameet] Mapping Local Government Directory to WikiData

2021-03-20 Thread Bodhisattwa Mandal
Hi Thejesh,

The best place to discuss this is here -
https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_India

There are Wikidata contributors who had been working on this, who might
respond there.

Thanks,
Bodhisattwa


On Sat, 20 Mar 2021 at 10:46, Thejesh GN  wrote:

> LGD publishes some important IDs, that can be useful. I also think
> WikiData item Id as a primary key. I just started syncing both of them
> locally so, I can update the WikiData with missing Census Location IDs.
> States was easy, but districts turned out to be not so easy.
>
> I have blogged here
>
>
> https://thejeshgn.com/2021/03/20/mapping-local-government-directory-to-wikidata/
>
> But here are the differences. Let me know what do you guys think.
>
> WikiDataIdLabelDescriptionComments
> Q955977 South Arcot Former district in Tamil Nadu, India Needs to be
> marked as dissolved in WikiData
> Q1900496 Bangalore Former district in Karnataka, India Needs to be marked
> as dissolved in WikiData
> Q1606061 Andaman Former district of the Andaman and Nicobar Islands Needs
> to be marked as dissolved in WikiData
> Q24949801 Shahbazwan District of Bihar in India is this same as GOPALGANJ
> district? Marked by mistake in WikiData. Should be removed as a district.
> Q6007135 Imphal Wikimedia disambiguation page is ex-district. Was split.
> Needs to be marked as dissolved in WikiData
> Q48731903 Noklak District in India, Nagaland New district
> . LGD needs update.
> January 20, 2021.
> Q61746013 Narayanapet District of Telangana, India There seem to be a
> duplicate Narayanpet district (Q85787759); but Q61746013 was created
> earlier. DataCommons 
> also uses the same. It also has
> Q29025081 East Karbi Anglong District of Assam, India When KARBI ANGLONG
> was split. The western part became the new "West Karbi Anglong" and the
> rest remained part of "Karbi Anglong". There is no "East Karbi Anglong" as
> such. Should be removed in WikiData?
> Q101088203 Bajali district of Assam India New district
>  formed in 12 January
> 2021. LGD needs an update
> DONT KNOW Vijayanagara district of Karnataka in India New district
>  formed in 2020/21
> .
> Needs an addition to LGD. May be mark Q1611788
>  as district in WikiData?
> DONT KNOW Chachaura district of mp Missing on LGD, WikiData and OSM. No
> gazette yet
> DONT KNOW Maihar district of mp Missing on LGD, WikiData and OSM. No
> gazette yet
> DONT KNOW Nagda district of mp Missing on LGD and WikiData. No gazette
> yet.
> Q61439260 Pakke-Kessang district of Arunachal Pradesh in India It was
> missing from WikiData query results. Because it was not tagged as district.
> I updated
> 
> WikiData.
>
>
> Thej
> --
> Thejesh GN *⏚* ತೇಜೇಶ್ ಜಿ.ಎನ್
> http://thejeshgn.com
> GPG ID :  0xBFFC8DD3C06DD6B0
>
> --
> Datameet is a community of Data Science enthusiasts in India. Know more
> about us by visiting http://datameet.org
> ---
> You received this message because you are subscribed to the Google Groups
> "datameet" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to datameet+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/datameet/CAABnYsUTfHZnmisWitBKBAGRBkYQ2OA8%2BuuK46MwRy8uNqiWTg%40mail.gmail.com
> 
> .
>

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/datameet/CAHyrfgb4AybH9okfQ2S9-r71agRi5kkMGueTdt_hOQs%2B1e1eDw%40mail.gmail.com.