[datameet] dgit - git for datasets - alpha release

2016-04-05 Thread Venkata Pingali
Hi! I have been working on an open-source project to manage datasets called dgit. It has reached alpha stage. See the text below for details. dgit's goal is to enable a more structured and predictable data science process where you are able to answer questions like: (a) Lineage/Auditability: Where

Re: [datameet] Tips on cleaning your data for mapping

2016-04-05 Thread Satyarupa Shekhar
Hey Nikhil, Good to hear from you and many thanks for the step-by-step guide. I'll pass it on to my colleague who is working on this and will get back to you on the progress or hiccups. Warmly, Satyarupa On 5 April 2016 at 13:06, Nikhil VJ wrote: > Hi Satyarupa, > >

Re: [datameet] Tips on cleaning your data for mapping

2016-04-05 Thread Nikhil VJ
Hi Satyarupa, Offering a few tips: 1. Bring your data to a flat Excel table. Avoid having any merged cells in the header. Title each column like "Fatalities_2011", "Fatalities_2013" if you have levels of titles. Keep the headers in the first row only; from row 2 onwards your data should start.
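
A minimal sketch of the header-flattening tip above, assuming the sheet has been exported so that a merged header cell appears as one filled cell followed by blanks. The function name `flatten_headers` and the sample columns ("District", "Injuries") are hypothetical and used only for illustration; only the "Fatalities_2011" naming pattern comes from the message.

```python
def flatten_headers(top_row, sub_row, sep="_"):
    """Combine two header rows into a single row of column titles."""
    flat = []
    current_top = ""
    for top, sub in zip(top_row, sub_row):
        top = (top or "").strip()
        sub = (sub or "").strip()
        # A merged cell usually exports as one filled cell followed by
        # blanks, so carry the last non-empty top-level title forward.
        if top:
            current_top = top
        if current_top and sub:
            flat.append(f"{current_top}{sep}{sub}")
        else:
            flat.append(current_top or sub)
    return flat


# Hypothetical two-level header taken from a sheet with merged cells.
top = ["District", "Fatalities", "", "Injuries", ""]
sub = ["", "2011", "2013", "2011", "2013"]

header = flatten_headers(top, sub)
print(header)
```

The resulting list can be written back as row 1, with the data starting from row 2. One limitation of this sketch: a column with no top-level title that follows a merged group would inherit that group's title, so such columns need to be checked by hand.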