Since this thread was highlighted in yesterday's Carpentry Clippings, I'll bet I'm not the last to jump in today, so I'll be brief.
DVC <http://github.com/iterative/dvc.git> was mentioned at the beginning, but I gather few here have given it a try. I encourage you to take a look. The tool is still in alpha, but developing quickly with a lot of potential. What I like about DVC: * Works in parallel to git and is similar to git LFS in cloning/pushing/pulling references to data files * Data files are not tracked by git; your code repository remains just that * Supports external data sources (since 0.10.0 <https://github.com/iterative/dvc/releases/tag/0.10.0>); do you really want a copy of your data *within* every repo that reads it? * Supports multiple cloud data sources (e.g. Amazon S3) * Does not default to "publishing" data on GitHub. GitHub is no Dataverse or Figshare (... data discoverability, yada yada) * It's a makefile alternative too! ------------------------------------------ The Carpentries: discuss Permalink: https://carpentries.topicbox.com/groups/discuss/Tb776978a905c0bf8-Ma216656f062405087a5f69ae Delivery options: https://carpentries.topicbox.com/groups/discuss/subscription
