Hi, Thank you for the topic feeding my thoughts. And thank you Ricardo for your explanations.
> What I was thinking about, is use guix to distribute data packages just like > we distribute softwares from pypi. The advantage of using guix seems > obvious, > but apparantly it's not desirable or possible and I don't understand why. Are you talking to package a way to fetch the data ? The first Debian example I found: https://packages.debian.org/fr/stretch/astrometry-data-2mass-00 Or to package the dataset itself ? Which does not seem affordable in term of resources (bandwith+disk), is it ? Last, when considering large dataset --say hundred or more samples of GB-- then hashing becomes the bottleneck. All the best, simon