Re: [datameet] Tracking Hate Speeches

2024-05-16 Thread Dilawar Singh
https://web.archive.org/web/20240516001222/https://hindutvawatch.org/  best,    Dilawar  On Wed, 15 May 2024 10:53:32 +0530 Shiladitya Ray wrote --- Hi guys, I'm trying to find a reliable source for tracking hate speech notices issued by ECI to politicians during the 2014

[datameet] Meteorological Department (IMD) APIs

2024-04-06 Thread Dilawar Singh
Hi, It looks like the Meteorological Department has some public APIs (https://mausam.imd.gov.in/imd_latest/contents/api.pdf). The document is silent on authorization etc. Has anyone played with them already? If yes, is there a community resource, e.g., library, etc., that can access data via

Re: [datameet] Creating variable names in Devanagari

2023-12-04 Thread Dilawar Singh
4, 2023 at 1:31:28 PM UTC+5:30 Dilawar Singh wrote: The answer is not going to be simple. Make sure that every time you save or open the file, the application uses encoding utf-8. Microsoft uses another weird encoding called utf-16 (https://en.wikipedia.org/wiki/UTF-16).  Stay away from utf

Re: [datameet] Creating variable names in Devanagari

2023-12-04 Thread Dilawar Singh
The answer is not going to be simple. Make sure that every time you save or open the file, the application uses encoding utf-8. Microsoft uses another weird encoding called utf-16 (https://en.wikipedia.org/wiki/UTF-16).  Stay away from utf-16 it if you want to copy-paste anything from the

[datameet] Re: [LWN] Some useful tools for binary formats

2023-02-28 Thread Dilawar Singh
:55 +0530 Dilawar Singh wrote --- https://lwn.net/Articles/924133/ best,    Dilawar  -- Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org --- You received this message because you are subscribed to the Google Groups "dat

[datameet] [LWN] Some useful tools for binary formats

2023-02-28 Thread Dilawar Singh
https://lwn.net/Articles/924133/ best,    Dilawar  -- Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org --- You received this message because you are subscribed to the Google Groups "datameet" group. To unsubscribe from this

Re: [datameet] How I searched for the hottest villages in Kerala with Microsoft's Planetary Computer

2023-02-24 Thread Dilawar Singh
Awesome stuff! Can it be extended to all Panchayats in the country easily? Just a nitpick: why not use "median" as an averaging strategy?  best,    Dilawar  On Fri, 24 Feb 2023 12:00:40 +0530 Shijith Kunhitty wrote --- Just wanted to plug my data story on villages in Kerala and

Re: [datameet] Re: Data Scrape from website

2023-02-07 Thread Dilawar Singh
Did you try their API https://www.gbif.org/developer/summary ? best,    Dilawar  On Tue, 07 Feb 2023 14:35:09 +0530 Aravinth R wrote --- Hello there,I am working on a biodiversity project displaying various locations of wildlife species spotted in India. There is a global

[datameet] A public kroki server with Niolesk frontend (text to image service)

2022-12-14 Thread Dilawar Singh
since the images may be cached on the server. The URL is https://services.subcom.tech (for the frontend) and https://kroki.subcom.tech for kroki APIs. We plan to keep it alive as long as possible. best, Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/>

Re: [datameet] Image storage, browsing and retrieval

2022-11-02 Thread Dilawar Singh
3DY9jdfS_A%40mail.gmail.com?utm_medium=email_source=footer. Dilawar Singh Cofounder & CTO, https://subcom.tech https://www.linkedin.com/in/dilawar-singh/ / https://github.com/dilawar /  https://orcid.org/-0002-4645-3211 Schedule a Meeting: https://subcom.zohobookings.in/#/cu

Re: [datameet] Something like Holiday API (Karnataka/India)

2022-08-08 Thread Dilawar Singh
don't. May be scrape RBI holidays page? https://m.rbi.org.in//Scripts/HolidayMatrixDisplay.aspx And GoI calendar page https://www.india.gov.in/calendar On Tue, 9 Aug 2022, 09:00 Dilawar Singh, <mailto:dilawar.s.raj...@gmail.com> wrote: -- Datameet is a community o

[datameet] Something like Holiday API (Karnataka/India)

2022-08-08 Thread Dilawar Singh
Hi Datameet, Is there a shared resources that can be used for fetching the list of holidays (in JSON/XML/CSV) for the current year? Something like https://holidayapi.com/countries/in/2022. May be a github repo maintained by datameet? I tried holiday api which is great but doesn't allow

[datameet] Re: How to merge first and last value of each name?

2022-02-18 Thread Dilawar Singh
Look at 'difflib.SequenceMatcher' (https://docs.python.org/3/library/difflib.html#difflib.SequenceMatcher) in the Python ecosystem. Your favorite language may have something similar. Essentially, you are need to compute 'similarity' between two strings. If your prefix e.g. AltCoin in

Re: [datameet] Inquiry about the MEITY report on National Strategy on blockchain for govt systems

2021-12-06 Thread Dilawar Singh
Is it the one? https://www.meity.gov.in/writereaddata/files/National_BCT_Strategy.pdf -- Dilawar On Tue, Dec 7, 2021 at 9:26 AM Meera Mehta wrote: > Dear all > > Had a request to share - if anyone has a copy of this report or a link to > it > > MEITY report on "National Strategy on blockchain

Re: [datameet] Measuring BJP propaganda/advertising

2021-11-08 Thread Dilawar Singh
/i_am_sophie_zhang_at_fb_i_worked_in_my_spare_time/). Not sure if this is directly relevant but you can try to reach out to her. -- Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID <https://orcid.org/-0002-4645-3211> Github <https://github.com/dilawar> On Tue, Nov 9, 2021

Re: [datameet] Data extraction from BHUVAN website

2021-08-25 Thread Dilawar Singh
An Introduction to JQ: https://earthly.dev/blog/jq-select/ (worth bookmarking). -- Dilawar On Monday, July 5, 2021 at 11:10:53 AM UTC+5:30 Puneeth Chaganti wrote: > Hi Thej, > > On 04/07/21 10:51 pm, Thejesh GN wrote: > > Puneeth thanks for the script. What one can achieve with cli

[datameet] Developing tool to extract table from images

2021-05-26 Thread Dilawar Singh
uses often and people often need/want to digitize them, I'd like to have some samples. I am thinking of census data, GIS data etc.. There is no plan to support multi-page tables. I can use some advice on the OCR backend (I am using pytesseract from google for now). best, Dilawar -- Dilawar Singh

Re: [datameet] Routing challenge: Covery every path optimally

2021-05-11 Thread Dilawar Singh
places and thousands on possible routes. Maybe a brute force algo with do the job as well. PS: Talk is cheap! If you show me the data, I can show you the code. Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID <https://orcid.org/-0002-

Re: [datameet] Indian States centroids- shape file

2021-03-04 Thread Dilawar Singh
Have a look: https://github.com/dilawar/map-india-center best, Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID <https://orcid.org/-0002-4645-3211> Github <https://github.com/dilawar> On Wed, Mar 3, 2021 at 12:32 PM Rajes

Re: [datameet] Chennai becomes the first Indian city to have a comprehensive automatic groundwater monitoring system.

2021-02-05 Thread Dilawar Singh
Great. The EXCEL format can use some more work. Currently, it is not very machine friendly (probably an HTML table is exported directly to both PDF and EXCEL). A plain text CSV format would be very helpful. -- Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-p

Re: [datameet] Trouble reading Indian Districts Database's files

2021-02-03 Thread Dilawar Singh
I'm > done cleaning them. > > Best, > Eduardo > On Sunday, January 31, 2021 at 10:21:03 PM UTC-6 dilawar@gmail.com > wrote: > >> Found one: >> https://web.archive.org/web/20130314062432/http://www.bsos.umd.edu/socy/vanneman/districts/files/index.html >> . SAS

Re: [datameet] Trouble reading Indian Districts Database's files

2021-01-31 Thread Dilawar Singh
Found one: https://web.archive.org/web/20130314062432/http://www.bsos.umd.edu/socy/vanneman/districts/files/index.html . SAS files are available in this snapshot. -- Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID <https://orcid.org/-

Re: [datameet] Trouble reading Indian Districts Database's files

2021-01-31 Thread Dilawar Singh
and if you are lucky, some old snapshots might have the currently missing SAS files. The last time I tried, their search engine was having issues: https://archive.org/search.php?query=http%3A%2F%2Fvanneman.umd.edu%2Fdistricts%2Ffiles%2Findex.html -- Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.

Re: [datameet] The State of FOSS in India | Report Launch | 12th-15th Janurary 2021

2021-01-13 Thread Dilawar Singh
(roughly 50%). India has the lowest percentage. The number of respondents is pretty low here but it goes well with the anecdotal evidences. -- Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID <https://orcid.org/-0002-4645-3211> G

Re: [datameet] WikiData and simlar platforms

2020-12-24 Thread Dilawar Singh
. Nonetheless, my vote is for WikiData as the official source, and if possible mirror it to DataCommons so people can explore it easily . -- Dilawar Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID <https://orcid.org/-0002-4645-3211> Github <htt

Re: [datameet] Need help to run .mvt files

2020-12-15 Thread Dilawar Singh
Singh, Ph.D. LinkedIn <https://www.linkedin.com/in/dilawar-singh-ph-d-44b81b194/> ORCID <https://orcid.org/-0002-4645-3211> Github <https://github.com/dilawar> On Wed, Dec 16, 2020 at 2:41 AM Dilawar Singh wrote: > Reference: https://gdal.org/drivers/vector/mvt.html >

Re: [datameet] Need help to run .mvt files

2020-12-15 Thread Dilawar Singh
Reference: https://gdal.org/drivers/vector/mvt.html It doesn't look like GDAL is capable of converting MVT with 3d values to GeoJSON. Most of your GIS software uses gdal behind the scene for conversion. [dilawars@chutki Downloads]$ ogr2ogr -f GeoJSON a.geojson censuswardboundary_719_456.mvt

Re: [datameet] Discord

2020-12-05 Thread Dilawar Singh
up). Search works well. > > > -- > Cheers, > Nikhil VJ > https://nikhilvj.co.in > > > On Sat, Dec 5, 2020 at 1:13 AM Dilawar Singh > wrote: > >> Slack is not an ideal platform for collecting open knowledge or data. In >> addition to what Thajesh has al

Re: [datameet] Discord

2020-12-04 Thread Dilawar Singh
less account. Not to mention that there is a non-zero probability they will be around after 5 or 10 years. The same argument applies to any centralized platform. Are there distributed alternatives to emails which has better support of structured knowledge management? best, Dilawar Singh, Ph.D

Re: [datameet] TEXT TO ASCII

2020-09-06 Thread Dilawar Singh
> I have a set of files with X, Y, Z Values (Lidar Data) in .txt files. I need to convert it to ASCII format, This line is very confusing. `.txt` extensions usually means ASCII (or unicode). It is usually a good idea to attach a sample of the file to reduce the guesswork. You may also want to

Re: [datameet] Help with R logic - near similar name

2020-08-25 Thread Dilawar Singh
Not sure what is the equivalent of python difflib (SequenceMatcher) in R. If you have one, it will work. Sent from a handheld device. Pardon the brevity and typos. On Aug 25, 2020, 20:09 +0530, rammano...@gmail.com , wrote: > Hi, > > I have collected hospital data from multiple sources.

[datameet] textricator | Generate structured data from PDFs

2020-07-19 Thread Dilawar Singh
Found this tool today. Can help you getting data from PDFs. *Textricator* is a tool to extract text from documents and generate structured data. If you have a bunch of PDFs with the same format (or one big, consistently formatted PDF) and you want to extract the data to CSV or JSON,

Re: [datameet] Get Smart Route to Avoid Containment Zone

2020-07-07 Thread Dilawar Singh
Thanks for the pointers Abhishek. The token can be obtained using a GET request: " https://covid19.quantela.com/qpa/1.0.0/public/token/bbmp.com/6a4d20c0-87dd-556b-9319-ab7147e388d9 "; I've created an automated pipeline for our institute App. Here is a PHP+CURL solution to fetch the zone

Re: [datameet] Bangalore zone wise data on COVID19 in machine readable format

2020-06-24 Thread Dilawar Singh
BBMP has its own portal https://bbmp.maps.arcgis.com/apps/opsdashboard/index.html#/4f4f20e852744b96b493528aab76777d But I see no change in it for last few days. May be they are updating the location once in a while? Reaching out to them over email so far as been fruitless. If anyone knows

Re: [datameet] Bangalore zone wise data on COVID19 in machine readable format

2020-06-19 Thread Dilawar Singh
tps://kgis.ksrsac.in/covid/ and got list of all patients >in >Karnataka. > >On Thu, Jun 18, 2020 at 5:35 PM Dilawar Singh > >wrote: > >> Hi DataMeet, >> >> On page 8 of this document >> http://covid.bbmpgov.in/reports/report20200617.pdf there is a map >w

[datameet] Bangalore zone wise data on COVID19 in machine readable format

2020-06-18 Thread Dilawar Singh
Hi DataMeet, On page 8 of this document http://covid.bbmpgov.in/reports/report20200617.pdf there is a map which shows location of active cases in Bangalore. Is this data available in machine readable format? -- Dilawar NCBS Bangalore -- Datameet is a community of Data Science enthusiasts in

Re: [datameet] Re: Water level data extraction

2020-06-16 Thread Dilawar Singh
Web-scraping requires knowledge of some programming language. In essence, it is an art of transforming HTML to a format you like. For common tasks, you can find some tools easily on the net e.g., extract all tables from this URL and save them as csv. For example, you can get the html from the

[datameet] Re: Officially Approved India Map Borders?

2020-06-16 Thread Dilawar Singh
http://www.surveyofindia.gov.in/pages/display/122-outline-maps-of-india Example PDF (it is a scan copy): http://www.surveyofindia.gov.in/files/36m_india.pdf . Most maps on this site are not machine readable. best, Dilawar On Tuesday, June 16, 2020 at 10:49:46 PM UTC+5:30, Curran

Re: [datameet] Re: Open Data in India: In a Restrictive Copyright Regime, Voluntary Organisations Pitch in to Make Data Accessible

2020-06-16 Thread Dilawar Singh
Hi Thej, Datameet After reading this, I went to SOI site and tried downloading few maps. I could only find PDFs. And data.gov.in also love PDFs. I wrote a couple of scripts which I used to figure out boundaries and extract data from curves in old PDF paper. It's a tricky business but I had

Re: [datameet] PoK elevation data | India's center of mass

2020-06-11 Thread Dilawar Singh
aster with each pixel representing the > elevation at that point. It seems like that is all the data you would need > for your computation right? > > [1] http://srtm.csi.cgiar.org/srtmdata/ > [2] https://github.com/datameet/maps/tree/master/Country > > On Thu, Jun 11, 2020 at 3:33 P

Re: [datameet] Re: BhuNaksha: source of cadastral maps

2020-05-31 Thread Dilawar Singh
The updated script is available publicly on Gitlab https://gitlab.com/snippets/1982028 . It has to be tweaked for different states. best, Dilawar On Monday, April 27, 2020 at 12:16:25 AM UTC+5:30, Dilawar Singh wrote: > > Hello Sharad > > Here you go: https://gitlab.com/dilaw

Re: [datameet] Re: BhuNaksha: source of cadastral maps

2020-04-26 Thread Dilawar Singh
r so). > Sharad > > On 26-04-2020 13:48, Dilawar Singh wrote: > > Its quite possible. I've used python+selenium to automate browser for a > horribly designed website. You can tweak it for your usage: > https://github.com/dilawar/GreenWatchData > > best, > D

[datameet] Re: BhuNaksha: source of cadastral maps

2020-04-26 Thread Dilawar Singh
Its quite possible. I've used python+selenium to automate browser for a horribly designed website. You can tweak it for your usage: https://github.com/dilawar/GreenWatchData best, Dilawar On Saturday, April 25, 2020 at 1:26:28 PM UTC+5:30, Sharad Lele wrote: > > Dear All, > A followup