Hi!
I have been working on an opensource project to manage
datasets called dgit. It has reached alpha stage. See the text
below for details.
dgit's goal is to enable more structured and predictable data science
process where you are able to answer questions like:
(a) Lineage/Auditability: Where
Hey Nikhil,
Good to hear from you and many thanks for the step-by-step guide. I'll pass
it on to my colleague who is working on this and will get back to you on
the progress or hiccups.
Warmly,
Satyarupa
On 5 April 2016 at 13:06, Nikhil VJ wrote:
> Hi Satyarupa,
>
>
Hi Satyarupa,
Offering a few tips:
1. Bring your data to a flat excel table. Avoid having any merged cells in
the header.. title each column like "Fatalities_2011", "Fatalities_2013" if
you have levels of titles. Keep the headers in the first row only; from row
2 onwards your data should start.