Re: [darcs-users] darcs-hs/hashed-storage review

Ganesh Sittampalam Thu, 06 Aug 2009 13:44:45 -0700

On Thu, 6 Aug 2009, Petr Rockai wrote:

[treeDiff vs unsafeDiff]
I have read the code again and I take that back. There are just four problematic cases that need checking:
- empty -> text with trailing newline
- empty -> text without trailing newline
- text with trailing newline -> empty
- text without trailing newline -> empty
I have thrown together a small testcase that checks these four cases explicitly. (I must say that darcs creates quite curious hunk patches for the no-newline cases...) It is available in tests/trailing-newlines.sh in current darcs-hs.


OK, cool. I'll try to audit the code to confirm your assessment too.

Also, darcs check should check the index (and darcs repair fix it).
Hm, it is always safe to rm _darcs/index, so repair could just do that. As for check, it should be easy to implement, I'll do that in a bit (all it needs to do is read the index and pristine, check that the file lists match and then recompute the hashes from actual file contents of working files... did I miss something?).


Sounds right to me.
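
The check described above could be sketched roughly as follows. This is only an illustration under assumptions, not the darcs-hs or hashed-storage code: Path, Hash, Problem and checkIndex are hypothetical names, the index is modelled as a plain map from path to recorded hash, and the hashes recomputed from the working files are passed in rather than computed here. A file that is in the index but absent from the working copy is not reported, which is also an assumption.

```haskell
import qualified Data.Map as Map
import qualified Data.Set as Set

-- Hypothetical stand-ins; these are not the hashed-storage types.
type Path = FilePath
type Hash = String

data Problem
  = MissingFromIndex Path     -- listed in pristine, absent from the index
  | MissingFromPristine Path  -- listed in the index, absent from pristine
  | StaleHash Path            -- recorded hash differs from the recomputed one
  deriving (Eq, Show)

-- Compare the file lists of index and pristine, then compare each recorded
-- hash with the hash recomputed from the working file (when one is given).
checkIndex :: Map.Map Path Hash  -- index: path -> recorded hash
           -> Set.Set Path       -- file list of pristine
           -> Map.Map Path Hash  -- hashes recomputed from working files
           -> [Problem]
checkIndex index pristine working =
     map MissingFromIndex (Set.toList (pristine `Set.difference` indexed))
  ++ map MissingFromPristine (Set.toList (indexed `Set.difference` pristine))
  ++ [ StaleHash p
     | (p, recorded) <- Map.toList index
     , Just actual <- [Map.lookup p working]
     , actual /= recorded ]
  where
    indexed = Map.keysSet index
```

An empty result would mean the index passes; anything else could be handled by repair removing _darcs/index, as suggested above.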

Aha. Yes, virtualTreeIO is an exception in this case. I'll work on the Monad a little more, probably over the weekend. (I do use virtualTreeIO in place of apply_to_slurpy in darcs-hs... or more precisely, we use it to implement applyToTreei -- that's basically one of the intended use-cases. The other is to do read-only things -- it may be however better to have a read-only monad for that use-case -- basically a counterpart to ReadableDirectory.)


That sounds like a good idea.

Well, hashedTreeIO is currently not used. Well, it is used by darcs
check/repair in darcs-hs, which does its own cleanup. This is however
something that will need addressing.
OK, I'm now rather lost about what is needed for what. I'd have expected hashedTreeIO to be needed for everything that touches pristine in a hashed repo. Is this just a work in progress?

Eventually, yes. For now, it's a work in progress. (In darcs-hs, I have a patch to port check/repair to hashedTreeIO and it even works, although it's slower than current code by a fair margin.)


Do you expect to be able to sort that out?

More generally, do you have a target in mind for how complete the transition to hashed-storage will be before the patches are merged? I think that anything we merge we should be happy with keeping even if your work stops right there.

This is now done as well. The readIndex function now gives a pair: the Tree object, and an action to "update" it. I have documented the new behaviour [1] in haddock. It is indeed a little different from the usual Tree objects in the sense that it gives you what is in the index file -- which may be out of date. It would however be a waste to update all of the index every time (although this may be what git does, IIUIC). To make it possible to update only a part of the index, you can trim down the index Tree and then update the result. It however wouldn't be much of a problem to also provide a function that will always give you full, up-to-date index. For darcs however, we need (when we say darcs whatsnew subtree) to use the partial update functionality.
[1]: http://repos.mornfall.net/hashed-storage/dist/doc/html/hashed-storage/Storage-Hashed-Index.html#v%253AreadIndex

OK, so now what you've got is an API with lots of constraints on how it should be used and lots of ways to get it wrong. How about an alternate type, e.g. IndexTree, with its own filter operation, and functions to get at either the real Tree or both the real Tree and the hash/mtime map (both of which force it)? Under the covers IndexTree could just be a Tree with the current implementation, but at least the API would be safe.
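
To make the suggestion concrete, here is a rough sketch of the shape I mean. Everything in it is hypothetical: Tree and Meta are simplified stand-ins rather than the hashed-storage types, filterIndex, forceTree and forceBoth are made-up names, and the refresh step is passed in as a parameter where a real implementation would stat and rehash working files itself. The point is only that a possibly stale IndexTree can be filtered, but the only ways to get a Tree out of it go through the update.

```haskell
import qualified Data.Map as Map

-- Hypothetical, simplified stand-ins for the real types.
type Path = FilePath

data Meta = Meta { metaHash :: String, metaMTime :: Integer }
  deriving (Eq, Show)

newtype Tree = Tree (Map.Map Path String)  -- path -> hash
  deriving (Eq, Show)

-- Possibly stale contents of the index file. A real module would not
-- export the constructor, so callers cannot read stale data directly.
newtype IndexTree = IndexTree (Map.Map Path Meta)

-- Trim the index before forcing, so only the kept part gets updated.
filterIndex :: (Path -> Bool) -> IndexTree -> IndexTree
filterIndex keep (IndexTree m) =
  IndexTree (Map.filterWithKey (\p _ -> keep p) m)

-- Force the index: refresh every remaining entry and return both the
-- up-to-date Tree and the hash/mtime map.
forceBoth :: (Path -> Meta -> IO Meta) -> IndexTree
          -> IO (Tree, Map.Map Path Meta)
forceBoth refresh (IndexTree m) = do
  fresh <- Map.traverseWithKey refresh m
  return (Tree (Map.map metaHash fresh), fresh)

-- Force the index and keep only the up-to-date Tree.
forceTree :: (Path -> Meta -> IO Meta) -> IndexTree -> IO Tree
forceTree refresh = fmap fst . forceBoth refresh
```

With something like this, the whatsnew-on-a-subtree case would be filterIndex followed by forceTree, and there is no way to forget the update step.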


Cheers,

Ganesh
_______________________________________________
darcs-users mailing list
[email protected]
http://lists.osuosl.org/mailman/listinfo/darcs-users
