Since this thread was highlighted in yesterday's Carpentry Clippings, I'll bet 
I'm not the last to jump in today, so I'll be brief.

DVC <http://github.com/iterative/dvc.git> was mentioned at the beginning, but I 
gather few here have given it a try. I encourage you to take a look. The tool 
is still in alpha, but developing quickly with a lot of potential. What I like 
about DVC:
 * Works in parallel to git and is similar to git LFS in 
cloning/pushing/pulling references to data files
 * Data files are not tracked by git; your code repository remains just that
 * Supports external data sources (since 0.10.0 
<https://github.com/iterative/dvc/releases/tag/0.10.0>); do you really want a 
copy of your data *within* every repo that reads it?
 * Supports multiple cloud data sources (e.g. Amazon S3)
 * Does not default to "publishing" data on GitHub. GitHub is no Dataverse or 
Figshare (... data discoverability, yada yada)
 * It's a makefile alternative too!

------------------------------------------
The Carpentries: discuss
Permalink: 
https://carpentries.topicbox.com/groups/discuss/Tb776978a905c0bf8-Ma216656f062405087a5f69ae
Delivery options: https://carpentries.topicbox.com/groups/discuss/subscription

Reply via email to