[datameet] textricator | Generate structured data from PDFs

2020-07-19 Thread Dilawar Singh
Found this tool today. It can help you get data out of PDFs. *Textricator* is a tool to extract text from documents and generate structured data. If you have a bunch of PDFs with the same format (or one big, consistently formatted PDF) and you want to extract the data to CSV or JSON, *Textricat

Re: [datameet] mumbai district boundary

2020-07-19 Thread Prabhakar Rajagopal
GADM is good. They also follow some standards for the admin units. But the GADM tehsil boundaries are different from the census tehsil boundaries. Any idea how they relate to each other?

Prabhakar

On Wed, Jul 8, 2020 at 8:35 PM Sharad Lele wrote:
> All national, state, district, & taluka bounda