On Fri, 15 Apr 2005, David Woodhouse wrote:

> given piece of content. Also because we actually have the developer's
> attention at commit time, and we can get _real_ answers from the user
> about what she was doing, instead of having to guess.

Yes, but it's still hard to get *accurate* information. And developers tend to use very short commit messages already...

> But if it can be done cheaply enough at a later date even though we end
> up repeating ourselves, and if it can be done _well_ enough that we
> shouldn't have just asked the user in the first place, then yes, OK I

I think examining the rsync algorithms should convince you that finding common chunks can be fairly efficient. (See my next message for a more concrete proposal.)
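To make the efficiency point concrete, here is a rough sketch of the rsync-style idea: index fixed-size blocks of the old content by a weak checksum, then slide a window over the new content, updating the checksum in constant time per byte. The function names, the tiny block size, and the direct byte comparison are my own assumptions for illustration; real rsync confirms candidate matches with a strong hash because the two files live on different machines. This is not a proposal for how the tool should actually do it.

```python
def weak_sum(block):
    """rsync-style weak checksum of a block, as two 16-bit halves (a, b)."""
    n = len(block)
    a = sum(block) & 0xFFFF
    b = sum((n - i) * x for i, x in enumerate(block)) & 0xFFFF
    return a, b


def roll(a, b, out_byte, in_byte, n):
    """Slide the n-byte window one byte to the right in O(1)."""
    a = (a - out_byte + in_byte) & 0xFFFF
    b = (b - n * out_byte + a) & 0xFFFF
    return a, b


def find_common_chunks(old, new, n=4):
    """Return (new_offset, old_offset) pairs where an n-byte block of
    `old` (taken at multiples of n) also appears in `new` at any offset."""
    # Index the non-overlapping blocks of the old content by weak checksum.
    table = {}
    for off in range(0, len(old) - n + 1, n):
        table.setdefault(weak_sum(old[off:off + n]), []).append(off)

    matches = []
    if len(new) < n:
        return matches

    a, b = weak_sum(new[:n])
    i = 0
    while True:
        # A weak checksum hit is only a candidate; confirm by comparing bytes
        # (stand-in for the strong hash rsync would use across a network).
        for off in table.get((a, b), ()):
            if old[off:off + n] == new[i:i + n]:
                matches.append((i, off))
                break
        if i + n >= len(new):
            break
        a, b = roll(a, b, new[i], new[i + n], n)
        i += 1
    return matches


old = b"abcdefgh"
new = b"xxabcdyyefgh"
print(find_common_chunks(old, new))  # [(2, 0), (8, 4)]
```

The work is one pass over the old content to build the table plus one pass over the new content with a constant-time checksum update per byte, which is why finding shared chunks after the fact need not be expensive.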

