Dear Git Gurus,
I wondered what would be your take on my wishlist request to add
--reset-hard option, which would be very similar to regular "update" which
checks out necessary commit, but I want it to remain in the branch.
Rationale: In DataLad we heavily rely on submodules, and we have established
easy ways to do some manipulations across full hierarchies of them. E.g. a
single command could introduce a good number of commits across deep hierarchy
of submodules, e.g. while committing changes within deep submodule, while also
doing all necessary commits in the repositories leading to that submodule so
the entire tree of them stays in a "clean" state. The difficulty comes when
there is a need to just "forget" some changes. The obvious way is to e.g.
git reset --hard PREVIOUS_STATE
in the top level repository. But that leaves all the submodules now in
the undesired state. If I do
git submodule update --recursive
I would end up in the detached HEADs within submodules.
What I want is to retain current branch they are at (or may be possible
"were in"? reflog records might have that information)
Example:
# Have to use datalad install since git clone --recurse-submodules
# seems to not consider alternative locations for submodules' .git/
# with url being just a relative path, and where submodules aren't
# all residing up under toplevel URL .git/
$> datalad install -r http://datasets.datalad.org/labs/gobbini/.git
[INFO ] Cloning http://datasets.datalad.org/labs/gobbini/.git into
'/tmp/gobbini'
install(ok): /tmp/gobbini (dataset)
[INFO ] Installing <Dataset path=/tmp/gobbini> recursively
[INFO ] Cloning http://datasets.datalad.org/labs/gobbini/famface/.git
into '/tmp/gobbini/famface'
[INFO ] Cloning
http://datasets.datalad.org/labs/gobbini/famface/data/.git into
'/tmp/gobbini/famface/data'
[INFO ] access to dataset sibling "datasets.datalad.org" not
auto-enabled, enable with:
| datalad siblings -d "/tmp/gobbini/famface/data" enable
-s datasets.datalad.org
[INFO ] Cloning
http://datasets.datalad.org/labs/gobbini/famface/data/scripts/mridefacer/.git
[2 other candidates] into '/tmp/gobbini/famface/data/scripts/mridefacer'
action summary:
install (ok: 4)
so I have a hierarchy in a good state and all checked out in master
branch
$> cd gobbini
$> git submodule status --recursive
b9071a6bc9f7665f7c75549c63d29f16d40e8af7 famface (heads/master)
e59ba76b42f219bdf14b6b547dd6d9cc0ed5227f famface/data
(BIDS-v1.0.1-3-ge59ba76b)
5d8036c0aaeebb448a00df6296ddc9f799efdd1f
famface/data/scripts/mridefacer (heads/master)
$> git submodule foreach --recursive cat .git/HEAD
Entering 'famface'
ref: refs/heads/master
Entering 'famface/data'
ref: refs/heads/master
Entering 'famface/data/scripts/mridefacer'
ref: refs/heads/master
and if I do roll back
$> git reset --hard HEAD^^^
HEAD is now at 9b4296d [DATALAD] aggregated meta data
changes on filesystem:
famface | 2 +-
and default update --recursive
$> git submodule update --recursive
Submodule path 'famface': checked out
'2569ab436501a832d35afbbe9cc20ffeb6077eb1'
Submodule path 'famface/data': checked out
'f1e8c9b8b025c311424283b9711efc6bc906ba2b'
Submodule path 'famface/data/scripts/mridefacer': checked out
'49b0fe42696724c2a8492f999736056e51b77358'
I end up in detached HEADs
$> git submodule status --recursive
2569ab436501a832d35afbbe9cc20ffeb6077eb1 famface (2569ab4)
f1e8c9b8b025c311424283b9711efc6bc906ba2b famface/data (BIDS-v1.0.1)
49b0fe42696724c2a8492f999736056e51b77358
famface/data/scripts/mridefacer (49b0fe4)
I do see that there is a "custom command" way to do it via
"submodule.<name>.update" config setting, but that is not easy to use for my
case since all the `<name>` would be different to specify !git reset --hard for
all of them via config option and I could not find any way to "glob" config
(like submodule.*.update). But in effect that is probably what I need:
# restarting from a clean state here
$> git -c submodule.famface.update='!git reset --hard' submodule update
--recursive
HEAD is now at 2569ab4 [DATALAD] aggregated meta data
Submodule path 'famface': 'git reset --hard
2569ab436501a832d35afbbe9cc20ffeb6077eb1'
Submodule path 'famface/data': checked out
'f1e8c9b8b025c311424283b9711efc6bc906ba2b'
Submodule path 'famface/data/scripts/mridefacer': checked out
'49b0fe42696724c2a8492f999736056e51b77358'
$> git submodule status --recursive
2569ab436501a832d35afbbe9cc20ffeb6077eb1 famface (heads/master)
f1e8c9b8b025c311424283b9711efc6bc906ba2b famface/data (BIDS-v1.0.1)
49b0fe42696724c2a8492f999736056e51b77358
famface/data/scripts/mridefacer (49b0fe4)
Corner cases I see which might make it trickier for a full blown
solution (might be relevant to current state as well for other
strategies):
- If between those commits we got an additional submodule added (in
immediate repository or within one of the subdatasets), ideally it
should also be wiped out
--
Yaroslav O. Halchenko
Center for Open Neuroscience http://centerforopenneuroscience.org
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755
Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419
WWW: http://www.linkedin.com/in/yarik