Bug#761117: debsources: file-level deduplication

2014-09-11 Thread Stefano Zacchiroli
On Thu, Sep 11, 2014 at 02:09:35PM +0800, Paul Wise wrote:
> A hash based filesystem layout like we use on snapshot.d.o.
>
> Use a filesystem with deduplication support like btrfs.

I thought about btrfs back in the days, and ruled out the idea because it imposes a fairly important deployment requ

Bug#761117: debsources: file-level deduplication

2014-09-10 Thread Paul Wise
On Thu, Sep 11, 2014 at 4:31 AM, Stefano Zacchiroli wrote:
> We already have all the file checksums in the database. Removing
> (file-level) duplication in the file storage, using hard-links, can be
> safely implemented offline, i.e., as long as no debsources update is
> ongoing.

I missed the tal

Bug#761117: debsources: file-level deduplication

2014-09-10 Thread Stefano Zacchiroli
Package: qa.debian.org
Severity: wishlist

We already have all the file checksums in the database. Removing (file-level) duplication in the file storage, using hard-links, can be safely implemented offline, i.e., as long as no debsources update is ongoing.

Micro-benchmark (from my DebConf14 Debsou
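
For illustration only, here is a minimal sketch of what such an offline hard-link pass could look like. It is not the debsources implementation: the function names (`sha256_of`, `dedup_tree`) are hypothetical, and it recomputes SHA-256 checksums from disk instead of reading them from the database mentioned above. It also assumes the whole tree lives on one filesystem, since hard links cannot cross filesystems, and that no update is running while it executes.

```python
import hashlib
import os


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def dedup_tree(root):
    """Replace duplicate regular files under root with hard links.

    Hypothetical helper: returns the number of bytes no longer
    stored twice. Symlinks and non-regular files are skipped.
    """
    first_seen = {}  # checksum -> path of the copy we keep
    saved = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            digest = sha256_of(path)
            keeper = first_seen.setdefault(digest, path)
            if keeper == path:
                continue  # first file seen with this content
            if os.path.samefile(keeper, path):
                continue  # already hard-linked to the kept copy
            size = os.path.getsize(path)
            # Link under a temporary name in the same directory, then
            # rename over the duplicate so the path never disappears.
            tmp = path + ".dedup-tmp"
            os.link(keeper, tmp)
            os.replace(tmp, path)
            saved += size
    return saved
```

Calling `dedup_tree("/path/to/sources")` (path is a placeholder) would return the number of bytes reclaimed. One design caveat: hard-linked files share a single inode, so permissions and timestamps become common to all names, and an in-place write through one name changes the content seen through every other name.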