Re: [datameet] Data of Indian Railways

2019-01-04 Thread Srihari Thalla
Thanks for the mention Arun!

I have now updated the crawler - removing tabs, unwanted newlines, leading
and trailing spaces in the data columns.

Here is the latest links to download:
JSON:
https://api.apify.com/v1/execs/7t9roKQ9yp6T8ZnpR/results?format=json=1=1
CSV:
https://api.apify.com/v1/execs/7t9roKQ9yp6T8ZnpR/results?format=csv=1=1

Hope this helps!

@Jasvinder I think one solution to extract locations for the stations is
via Overpass using the station codes and combining them to the spreadsheet.

-- Srihari


On Fri, 4 Jan 2019 at 12:43, Jasvinder Singh 
wrote:

> Dear Arun,
> Exactly the type of simple data sheet that newbies can understand. However
> how location (Coordinates) is linked in this file?
>
> On Fri, Jan 4, 2019 at 12:11 PM Arun Ganesh  wrote:
>
>> Spreadsheet if anyone wants to explore:
>> https://docs.google.com/spreadsheets/d/1AFwl_5cB9qD39VWNox1LoeL3tGaGB22f7p4vc7IyMqY/edit#gid=0
>>
>> There are 16,770 station entries of which 11,660 seem to be currently
>> operational according to the expiry date of 2999.
>>
>> Filtering out goods stations, there are 9835 entries. This still seems to
>> include a few yards and cabins that are not legitimate stations. Also
>> noticed quite a few spelling and formatting issues in the names. The
>> station codes look correct. Some amount of manual cleanup is needed on this
>> list.
>>
>> The official number of stations according to IR is 7349 stations (as of
>> 2017)
>> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/IRSP_2016-17/Facts_Figure/Fact_Figures%20English%202016-17.pdf>
>> and 1817 halts/block huts (2013)
>> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/downloads/Data_Bank.pdf>.
>>
>>
>> On Fri, Jan 4, 2019 at 11:53 AM Arun Ganesh  wrote:
>>
>>> Beauty of the internet, crawler got done by Srihari:
>>> https://twitter.com/sriharithalla/status/1080801313707896837
>>>
>>> JSON data:
>>> https://api.apify.com/v1/execs/TsBwnYutP5u9FCKp5/results?format=json=1
>>>
>>> I'm in the process of doing a little bit of cleanup using openrefine and
>>> will share on a spreadsheet.
>>>
>>> On Fri, Jan 4, 2019 at 10:01 AM Jasvinder Singh <
>>> jasvinsinghre...@gmail.com> wrote:
>>>
>>>> Dear All,
>>>>
>>>> Not all the members are familiar with intricacies of the data
>>>> collection for such projects. Since this seems to be a crowd sourcing
>>>> endeavour, I suggest that the basic data collection protocol be enumerated
>>>> for newbies so that they can also contribute data which can then be put in
>>>> proper format by professionals.
>>>>
>>>> Regards,
>>>>
>>>> Jasvinder Singh
>>>>
>>>> On Tue, Dec 25, 2018 at 10:32 AM Nikhil VJ  wrote:
>>>>
>>>>> Hi folks,
>>>>>
>>>>> There's a project afoot in the OpenStreetMap and Wikidata communities
>>>>> to get together Indian Railways data.
>>>>>
>>>>> One major part of it: Properly mapping all the railway stations of
>>>>> India, and ensuring they have wikidata entries.
>>>>>
>>>>> Here's a wiki page set up for it:
>>>>> https://www.wikidata.org/wiki/Wikidata:WikiProject_Indian_Railways
>>>>>
>>>>> I'm cross-posting from OpenStreetMap India Telegram
>>>>> <https://t.me/OSMIndia> group:
>>>>>
>>>>> (Arun Ganesh): There seems to be around 7000 stations located. There
>>>>> still ~1.5k missing. A lot more need names, refs and wikidata links.
>>>>> Overpass: *http://overpass-turbo.eu/s/EC4
>>>>> <http://overpass-turbo.eu/s/EC4>*
>>>>>
>>>>>
>>>>> (Srihari Thalla) : Last year I created two MapRoulette Challenges to
>>>>> tag station codes and add Wiki tags
>>>>> *https://maproulette.org/mr3/challenge/2403
>>>>> <https://maproulette.org/mr3/challenge/2403>*
>>>>> *https://maproulette.org/mr3/challenge/2404
>>>>> <https://maproulette.org/mr3/challenge/2404>*
>>>>>
>>>>> 
>>>>>
>>>>> The overpass query above queries the whole country and may be slow or
>>>>> timeout. I adapted the query to work only on the map area being
>>>>> viewed, so you can zoom into smaller regions. And changed a few things,
>>>>> included

Re: [datameet] Data of Indian Railways

2019-01-04 Thread Srihari Thalla
Hi Sajjad,

I recently noticed that station codes from Datameet railways is used to
update Wikidata pages for the stations. Would it be possible to update the
repo in reverse as well, with the Arun's spreadsheet?

-- Srihari


On Fri, 4 Jan 2019 at 15:58, Sajjad Anwar  wrote:

> Hi!
>
> There are some ~8900 station coordinates that we scraped a while ago here
> https://github.com/datameet/railways
> Most of these have station codes so if we want to match with the list Arun
> generated we could do it. And then run another round of manual geocoding.
>
> Cheers,
> Sajjad
>
> On Fri, Jan 4, 2019 at 3:51 PM Nikhil VJ  wrote:
>
>> Hi,
>>
>> If there's interest I can set up a mapping interface to crowdsource
>> lat-longs. Along the lines of this: https://fuzzymapper.herokuapp.com/
>> But I would take a week to set up so tell. Though it would be best to get
>> existing lat-long sets with the official station code and import them in,
>> this can help cover the laggards.
>>
>>
>> Also, if collective work on OpenRefine is required then I can help set it
>> up on cloud and keep protected edit access. That won't take more time.
>>
>>
>> Regards
>> Nikhil VJ, Pune
>>
>>
>> On Friday, January 4, 2019 at 12:43:17 PM UTC+5:30, Jasvinder Singh wrote:
>>>
>>> Dear Arun,
>>> Exactly the type of simple data sheet that newbies can understand.
>>> However how location (Coordinates) is linked in this file?
>>>
>>> On Fri, Jan 4, 2019 at 12:11 PM Arun Ganesh  wrote:
>>>
>>>> Spreadsheet if anyone wants to explore:
>>>> https://docs.google.com/spreadsheets/d/1AFwl_5cB9qD39VWNox1LoeL3tGaGB22f7p4vc7IyMqY/edit#gid=0
>>>>
>>>> There are 16,770 station entries of which 11,660 seem to be currently
>>>> operational according to the expiry date of 2999.
>>>>
>>>> Filtering out goods stations, there are 9835 entries. This still seems
>>>> to include a few yards and cabins that are not legitimate stations. Also
>>>> noticed quite a few spelling and formatting issues in the names. The
>>>> station codes look correct. Some amount of manual cleanup is needed on this
>>>> list.
>>>>
>>>> The official number of stations according to IR is 7349 stations (as
>>>> of 2017)
>>>> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/IRSP_2016-17/Facts_Figure/Fact_Figures%20English%202016-17.pdf>
>>>> and 1817 halts/block huts (2013)
>>>> <http://www.indianrailways.gov.in/railwayboard/uploads/directorate/stat_econ/downloads/Data_Bank.pdf>.
>>>>
>>>>
>>>> On Fri, Jan 4, 2019 at 11:53 AM Arun Ganesh  wrote:
>>>>
>>>>> Beauty of the internet, crawler got done by Srihari:
>>>>> https://twitter.com/sriharithalla/status/1080801313707896837
>>>>>
>>>>> JSON data:
>>>>> https://api.apify.com/v1/execs/TsBwnYutP5u9FCKp5/results?format=json=1
>>>>>
>>>>> I'm in the process of doing a little bit of cleanup using openrefine
>>>>> and will share on a spreadsheet.
>>>>>
>>>>> On Fri, Jan 4, 2019 at 10:01 AM Jasvinder Singh 
>>>>> wrote:
>>>>>
>>>>>> Dear All,
>>>>>>
>>>>>> Not all the members are familiar with intricacies of the data
>>>>>> collection for such projects. Since this seems to be a crowd sourcing
>>>>>> endeavour, I suggest that the basic data collection protocol be 
>>>>>> enumerated
>>>>>> for newbies so that they can also contribute data which can then be put 
>>>>>> in
>>>>>> proper format by professionals.
>>>>>>
>>>>>> Regards,
>>>>>>
>>>>>> Jasvinder Singh
>>>>>>
>>>>>> On Tue, Dec 25, 2018 at 10:32 AM Nikhil VJ  wrote:
>>>>>>
>>>>>>> Hi folks,
>>>>>>>
>>>>>>> There's a project afoot in the OpenStreetMap and Wikidata
>>>>>>> communities to get together Indian Railways data.
>>>>>>>
>>>>>>> One major part of it: Properly mapping all the railway stations of
>>>>>>> India, and ensuring they have wikidata entries.
>>>>>>>
>>>>>>> Here's a wiki page set up for it:
>>>>>>> https://www.wikidata.org/w

Re: [datameet] Mapping of Hyderabad Bus Stops and Routes

2018-12-09 Thread Srihari Thalla
How did it go, Nikhil?

On Fri, Dec 7, 2018, 14:24 Nikhil VJ  Meetup link: https://www.meetup.com/swechafsmi/events/257026328/
>
> --
> Datameet is a community of Data Science enthusiasts in India. Know more
> about us by visiting http://datameet.org
> ---
> You received this message because you are subscribed to the Google Groups
> "datameet" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to datameet+unsubscr...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.
>

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [datameet] Re: Telangana district boundaries

2017-08-09 Thread Srihari Thalla
Hi Harsha,

I was not able to collect any sources for the new districts. DataMeet does
not have it either.

Thanks!

On Mon 7 Aug, 2017, 19:21 harsha, <nanohar...@gmail.com> wrote:

> Hi Srihari,
>
> Were you able to get access to these ? I am currently looking at focusedly
> Vikarabad Distritc, which is a new district carved out of Rangareddy
> District and within it mandals & villages. Let me know if we can
> collectively work towards this.
>
> Cheers,
> Harsha
>
>
> On Sunday, June 4, 2017 at 11:10:18 AM UTC+5:30, Srihari Thalla wrote:
>>
>> Hi!
>>
>> Does anyone have the shapefiles for new 31 districts of Telangana? The
>> main intent is to update them in OpenStreetMap!
>>
>> Thanks!
>>
>> cc @PlaneMad
>>
> --
> Datameet is a community of Data Science enthusiasts in India. Know more
> about us by visiting http://datameet.org
> ---
> You received this message because you are subscribed to the Google Groups
> "datameet" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to datameet+unsubscr...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.
>
-- 
Cheers,
Srihari

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[datameet] Telangana district boundaries

2017-06-03 Thread Srihari Thalla
Hi!

Does anyone have the shapefiles for new 31 districts of Telangana? The main 
intent is to update them in OpenStreetMap!

Thanks!

cc @PlaneMad

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[datameet] Re: Government Open Data Licence – India (GODL) has been gazette notified on February 13, 2017.

2017-03-22 Thread Srihari Thalla
Great!

BTW, does this mean the ECI's Polling stations data is now public?
https://github.com/datameet/maps/issues/13

On Friday, 10 March 2017 19:20:15 UTC+5:30, Thejesh GN  wrote:
> Government Open Data Licence – India (GODL) has been gazette notified on 
> February 13, 2017. 
> 
> 
> https://data.gov.in/sites/default/files/Gazette_Notification_OGDL.pdf
> 
> 
> 
> 
> 
> 
> Thej
> --
> Thejesh GN ⏚ ತೇಜೇಶ್ ಜಿ.ಎನ್
> http://thejeshgn.com

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to datameet+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.