On 12/18, Brandon Williams wrote:
> On 12/17, Thomas Gummerer wrote:
> > repo_read_index calls read_index_from, which takes an path argument for
> > the location of the index file. For the split index however it relies
>
> > on the current working directory to construct the path using git_path.
>
> This line isn't actually true and should probably be fixed. git_path
> doesn't rely on the CWD but rather it relies on the gitdir of the main
> repository (the_repository).
Right, let me fix that. Thanks!
> >
> > repo_read_index calls read_index_from with the full path for the index
> > file, however it doesn't change the cwd, so when split index mode is
> > turned on, read_index_from can't find the file for the split index.
> >
> > For example t3007-ls-files-recurse-submodules.sh was broken with
> > GIT_TEST_SPLIT_INDEX set in 188dce131f ("ls-files: use repository
> > object", 2017-06-22), and t7814-grep-recurse-submodules.sh was also
> > broken in a similar manner, probably by introducing struct repository
> > there, although I didn't track down the exact commit for that.
> >
> > Fix this by introducing a new read_index_for_repo function, which knows
> > about the correct paths for the submodules.
> >
> > The alternative would have been to make the callers pass in the base
> > path for the split index, however that ended up being more complicated,
> > and I think we want to converge towards using struct repository for
> > things like these anyway.
> >
> > Signed-off-by: Thomas Gummerer <[email protected]>
> > ---
> > cache.h | 1 +
> > read-cache.c | 16 ++++++++++++++--
> > repository.c | 2 +-
> > 3 files changed, 16 insertions(+), 3 deletions(-)
> >
> > diff --git a/cache.h b/cache.h
> > index cb5db7bf83..d42bea1ef7 100644
> > --- a/cache.h
> > +++ b/cache.h
> > @@ -614,6 +614,7 @@ extern int read_index_preload(struct index_state *,
> > const struct pathspec *paths
> > extern int do_read_index(struct index_state *istate, const char *path,
> > int must_exist); /* for testting only! */
> > extern int read_index_from(struct index_state *, const char *path);
> > +extern int read_index_for_repo(const struct repository *);
> > extern int is_index_unborn(struct index_state *);
> > extern int read_index_unmerged(struct index_state *);
> >
> > diff --git a/read-cache.c b/read-cache.c
> > index 2eb81a66b9..70357febdc 100644
> > --- a/read-cache.c
> > +++ b/read-cache.c
> > @@ -20,6 +20,7 @@
> > #include "split-index.h"
> > #include "utf8.h"
> > #include "fsmonitor.h"
> > +#include "repository.h"
> >
> > /* Mask for the name length in ce_flags in the on-disk index */
> >
> > @@ -1871,7 +1872,8 @@ static void freshen_shared_index(char *base_sha1_hex,
> > int warn)
> > free(shared_index);
> > }
> >
> > -int read_index_from(struct index_state *istate, const char *path)
> > +static int do_read_index_from(struct index_state *istate, const char *path,
> > + const struct repository *repo)
> > {
> > struct split_index *split_index;
> > int ret;
> > @@ -1896,7 +1898,7 @@ int read_index_from(struct index_state *istate, const
> > char *path)
> > split_index->base = xcalloc(1, sizeof(*split_index->base));
> >
> > base_sha1_hex = sha1_to_hex(split_index->base_sha1);
> > - base_path = git_path("sharedindex.%s", base_sha1_hex);
> > + base_path = repo_git_path(repo, "sharedindex.%s", base_sha1_hex);
> > ret = do_read_index(split_index->base, base_path, 1);
> > if (hashcmp(split_index->base_sha1, split_index->base->sha1))
> > die("broken index, expect %s in %s, got %s",
> > @@ -1909,6 +1911,16 @@ int read_index_from(struct index_state *istate,
> > const char *path)
> > return ret;
> > }
> >
> > +int read_index_for_repo(const struct repository *repo)
> > +{
> > + return do_read_index_from(repo->index, repo->index_file, repo);
> > +}
>
> > +
> > +int read_index_from(struct index_state *istate, const char *path)
> > +{
> > + return do_read_index_from(istate, path, the_repository);
> > +}
>
> This looks fine, though I wonder what the point of passing in the index
> file even was since we end just ended up reading the 'sharedindex' file based
> on the git path. I'm just curious about how this function evolved.
There are some callsites that are using an index different form
$gitdir/index, or even GIT_INDEX_FILE. e.g. see builtin/am.c [*1*],
which uses it's own 'patch-merge-index' in the am state directory for
its internal operations.
The split index mode was later bolted on, and the sharedindex.xxxx
would always go in $gitdir for the repository. Others probably know
quite a bit more about this, while I'm always interested in index
related things as that's how I got started with the git project, I
couldn't follow all the conversations that were going on there.
*1*:
https://github.com/gitster/git/blob/52015aaf9d19c97b52c47c7046058e6d029ff856/builtin/am.c#L1844
> > +
> > int is_index_unborn(struct index_state *istate)
> > {
> > return (!istate->cache_nr && !istate->timestamp.sec);
> > diff --git a/repository.c b/repository.c
> > index bb2fae5446..928b1f553d 100644
> > --- a/repository.c
> > +++ b/repository.c
> > @@ -229,5 +229,5 @@ int repo_read_index(struct repository *repo)
> > if (!repo->index)
> > repo->index = xcalloc(1, sizeof(*repo->index));
> >
> > - return read_index_from(repo->index, repo->index_file);
> > + return read_index_for_repo(repo);
> > }
> > --
> > 2.15.1.620.gb9897f4670
> >
>
> --
> Brandon Williams