Hi Bibhash,

Short answer: No.
There is no metadata. I took the top 1000 lines and visualized them: the
shapes were spread all over the country, in no discernible order.

With an intern's help I've started importing them into a portable
PostgreSQL DB (dockerized, with a persistent volume) on my webserver.
The import started yesterday and is 10 million shapes in; from size
comparisons the total will probably be around 79 million shapes, so at
this rate it should finish in about a week.
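
For anyone who wants to try the same kind of import, here is a minimal sketch of reading a .geojsonl file one line at a time and grouping the geometries into batches, so the whole file never has to sit in RAM. This is not our actual import script; the helper names and the sample coordinates are made up for illustration.

```python
import io
import json
from itertools import islice


def iter_geometries(lines):
    """Yield each Feature's geometry as a JSON string, one input line at a time."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        feature = json.loads(line)
        yield json.dumps(feature["geometry"])


def batched(iterable, size):
    """Group any iterator into lists of at most `size` items."""
    it = iter(iterable)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def make_line(lon, lat):
    """Build one made-up Feature line, in the same shape as the real file."""
    ring = [[lon, lat], [lon + 0.001, lat], [lon + 0.001, lat + 0.001], [lon, lat]]
    feature = {
        "type": "Feature",
        "properties": {},
        "geometry": {"type": "Polygon", "coordinates": [ring]},
    }
    return json.dumps(feature)


# In-memory stand-in for open("India.geojsonl"); a real file object works the same way.
sample = io.StringIO("\n".join(make_line(83.0 + i, 25.0) for i in range(3)) + "\n")

batches = list(batched(iter_geometries(sample), 2))
print([len(b) for b in batches])
```

Each batch could then go to the DB in one round trip, for example with an INSERT that wraps each string in PostGIS's ST_GeomFromGeoJSON; the table layout for that is up to you.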

I've set an auto-increment primary key column in the DB table, so once
imported at least the data can be pulled out in batches.
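
A rough sketch of what pulling out in batches could look like, walking the primary key instead of using OFFSET. To keep it runnable anywhere it uses Python's built-in sqlite3 as a stand-in for PostgreSQL; the table name `shapes` and its columns are assumptions, not the real schema.

```python
import sqlite3


def iter_batches(conn, batch_size):
    """Yield lists of (id, geojson) rows in primary-key order."""
    last_id = 0
    while True:
        rows = conn.execute(
            "SELECT id, geojson FROM shapes WHERE id > ? ORDER BY id LIMIT ?",
            (last_id, batch_size),
        ).fetchall()
        if not rows:
            return
        yield rows
        last_id = rows[-1][0]  # resume after the last id seen


# Stand-in table with 7 dummy rows (hypothetical schema).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE shapes (id INTEGER PRIMARY KEY AUTOINCREMENT, geojson TEXT)"
)
conn.executemany(
    "INSERT INTO shapes (geojson) VALUES (?)",
    [(f"shape-{i}",) for i in range(7)],
)

batch_sizes = [len(rows) for rows in iter_batches(conn, 3)]
print(batch_sizes)
```

The same `WHERE id > ... ORDER BY id LIMIT ...` query should carry over to PostgreSQL with `%s` placeholders. Each batch is an index lookup on the primary key, so later batches cost about the same as early ones.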

There are 2 paths ahead once I've imported it:
-------

*1. Serving vector tiles*
I want to serve MVT vector tiles from this DB and have a map display
them. That way the data renders directly on a map as one browses it,
fetching only one small region at a time and hopefully not being too slow.
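
As a starting point, here is a sketch of the per-tile query a tile endpoint might run. It assumes PostGIS 3.0 or later (for ST_TileEnvelope) and a hypothetical table `buildings` with an `id` column and a `geom` column in SRID 4326; none of that is confirmed for my setup yet. The Python part only validates the tile address and prepares the parameters.

```python
# Query for one z/x/y tile. Table and column names are assumptions.
TILE_SQL = """
WITH bounds AS (
    SELECT ST_TileEnvelope(%(z)s, %(x)s, %(y)s) AS env
),
mvtgeom AS (
    SELECT b.id,
           ST_AsMVTGeom(ST_Transform(b.geom, 3857), bounds.env) AS geom
    FROM buildings AS b, bounds
    WHERE b.geom && ST_Transform(bounds.env, 4326)
)
SELECT ST_AsMVT(mvtgeom.*, 'buildings') FROM mvtgeom;
"""


def tile_params(z, x, y):
    """Check that z/x/y is a valid XYZ tile address and return query parameters."""
    if z < 0:
        raise ValueError("zoom must be >= 0")
    n = 2 ** z  # number of tiles per axis at this zoom
    if not (0 <= x < n and 0 <= y < n):
        raise ValueError(f"x and y must be in [0, {n}) at zoom {z}")
    return {"z": z, "x": x, "y": y}


params = tile_params(14, 11970, 7000)
print(params)
```

With a driver like psycopg2 this would be `cursor.execute(TILE_SQL, params)`, and the single bytea value returned is the tile body. The `&&` bounding-box filter is there so that a spatial index on `geom` can limit the work to shapes near the tile.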

Some references I've found:
https://blog.jawg.io/how-to-make-mvt-with-postgis/
https://www.crunchydata.com/blog/dynamic-vector-tiles-from-postgis
https://medium.com/@frederic.rodrigo/web-mapping-comparing-vector-tile-servers-from-postgres-postgis-405055e69084

I'm new to this MVT business, so if anybody wants to collaborate, please
feel free to jump in.

------

*2. Tag with region ids*
Add columns to the DB table and tag each shape with state, district etc.
ids, using Within operations either in the DB (ST_Within) or in Python
with geopandas.
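
A sketch of the in-DB variant, run in id ranges so that no single UPDATE touches all the rows at once. The names `buildings`, `districts` and `district_id` are hypothetical, and testing the centroid rather than the whole polygon is my assumption, so that a building straddling a boundary still gets exactly one tag.

```python
# One UPDATE per id range. Table and column names are assumptions.
TAG_SQL = """
UPDATE buildings AS b
SET district_id = d.id
FROM districts AS d
WHERE b.id > %(lo)s AND b.id <= %(hi)s
  AND ST_Within(ST_Centroid(b.geom), d.geom);
"""


def id_ranges(max_id, step):
    """Yield (lo, hi) pairs covering ids 1..max_id, for use as lo < id <= hi."""
    lo = 0
    while lo < max_id:
        hi = min(lo + step, max_id)
        yield lo, hi
        lo = hi


ranges = list(id_ranges(10, 4))
print(ranges)
```

Each pair would be passed as `{"lo": lo, "hi": hi}` to TAG_SQL, committing after every range so progress is kept if the job is interrupted.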


--
Cheers,
Nikhil VJ
https://nikhilvj.co.in


On Tue, May 24, 2022 at 10:14 AM Bibhash Nath <bibhas...@gmail.com> wrote:

> Thanks for sharing.
> Do we have state-wise datasets? It seems quite large to process on my
> computer.
>
> Best,
> Bibhash
>
> On Wed, May 18, 2022 at 4:58 AM Nikhil VJ <nikhil...@gmail.com> wrote:
>
>> Thanks Justin for sharing!
>>
>> I've downloaded and extracted the India.geojsonl on a webserver.
>>
>> ref about .geojsonl : https://www.interline.io/blog/geojsonl-extracts/
>> -> pretty useful! One can loop through a huge file without having to load
>> it all into RAM.
>>
>> top lines look like:
>> {"type": "Feature", "properties": {},"geometry": {"type": "Polygon","coordinates": [[[83.06380515611697, 25.34167404697847], [83.06380909901775, 25.341635591519122], [83.06386494585949, 25.341640268589657], [83.06386100295869, 25.341678724047526], [83.06380515611697, 25.34167404697847]]]}}
>> {"type": "Feature", "properties": {},"geometry": {"type": "Polygon","coordinates": [[[87.87555977691633, 22.397660095199], [87.8754256865811, 22.397658576750985], [87.87542690191553, 22.397566835386712], [87.87556099225075, 22.39756835383578], [87.87555977691633, 22.397660095199]]]}}
>> ...
>> So we have just basic polygons, one for each building, no properties or
>> categorization.
>>
>> Here are some ideas on what to do with this:
>>
>> 1. PostgreSQL DB:
>> - Load all of these into a PostgreSQL DB
>> - Set up an API that will take a lat/lon and give all shapes within a
>> 1 km radius
>> - Next possible API: send a bounding polygon and get all buildings in it
>>
>> 2. Split up by district or lower level admin boundaries:
>> - Load the admin boundaries into a PostgreSQL DB
>> - Loop through each line (ie each building)
>> - Find out which place it's in by ST_Within query
>> - Dump it into a separate .geojsonl for that place
>> - We've now split it into multiple smaller files that are more usable,
>> can be loaded up by OSM mappers etc.
>>
>> --
>> Cheers,
>> Nikhil VJ
>> https://nikhilvj.co.in
>>
>>
>> On Wed, May 18, 2022 at 3:51 AM Justin <justinelliotmey...@gmail.com>
>> wrote:
>>
>>> [image: lg.png]
>>>
>>> https://github.com/microsoft/GlobalMLBuildingFootprints
>>>
>>>
>>> --
>>> Datameet is a community of Data Science enthusiasts in India. Know more
>>> about us by visiting http://datameet.org
>>> ---
>>> You received this message because you are subscribed to the Google
>>> Groups "datameet" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to datameet+unsubscr...@googlegroups.com.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/datameet/68c0ecc4-5b07-4eb9-a69d-ae6e5ad6832bn%40googlegroups.com
>>> <https://groups.google.com/d/msgid/datameet/68c0ecc4-5b07-4eb9-a69d-ae6e5ad6832bn%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>>
>

