On Tue, Aug 16, 2016 at 11:48 AM, Junio C Hamano <[email protected]> wrote:
> Jacob Keller <[email protected]> writes:
>
>> diff --git a/Documentation/diff-config.txt b/Documentation/diff-config.txt
>> index d5a5b17d5088..f5d693afad6c 100644
>> --- a/Documentation/diff-config.txt
>> +++ b/Documentation/diff-config.txt
>> @@ -123,7 +123,8 @@ diff.suppressBlankEmpty::
>> diff.submodule::
>> Specify the format in which differences in submodules are
>> shown. The "log" format lists the commits in the range like
>> - linkgit:git-submodule[1] `summary` does. The "short" format
>> + linkgit:git-submodule[1] `summary` does. The "diff" format shows the
>> + diff between the beginning and end of the range. The "short" format
>> format just shows the names of the commits at the beginning
>> and end of the range. Defaults to short.
>
> It would be much better to describe the default one first and then
> more involved one next, no? That would also match what the change
> to "diff-options" in this patch does (can be seen below).
>
The main thing is that "--submodule" alone means "use the log format"
so I think that's why it went first. I can reword this to make it more
clear how this works.
Thanks,
Jake
>> @@ -2311,6 +2322,15 @@ static void builtin_diff(const char *name_a,
>> two->dirty_submodule,
>> meta, del, add, reset);
>> return;
>> + } else if (o->submodule_format == DIFF_SUBMODULE_DIFF &&
>> + (!one->mode || S_ISGITLINK(one->mode)) &&
>> + (!two->mode || S_ISGITLINK(two->mode))) {
>> + show_submodule_diff(o->file, one->path ? one->path : two->path,
>> + line_prefix,
>> + one->oid.hash, two->oid.hash,
>> + two->dirty_submodule,
>> + meta, a_prefix, b_prefix, reset);
>> + return;
>> }
>
> The "either missing or is submodule" check used here is being
> consistent with the existing "submodule=log" case. Good.
>
>> diff --git a/submodule.c b/submodule.c
>> index 1b5cdfb7e784..b1da68dd49c9 100644
>> --- a/submodule.c
>> +++ b/submodule.c
>> @@ -333,6 +333,136 @@ static void print_submodule_summary(struct rev_info
>> *rev, FILE *f,
>> strbuf_release(&sb);
>> }
>>
>> +void show_submodule_diff(FILE *f, const char *path,
>> + const char *line_prefix,
>> + unsigned char one[20], unsigned char two[20],
>> + unsigned dirty_submodule, const char *meta,
>> + const char *a_prefix, const char *b_prefix,
>> + const char *reset)
>> +{
>> + struct strbuf submodule_git_dir = STRBUF_INIT, sb = STRBUF_INIT;
>> + struct child_process cp = CHILD_PROCESS_INIT;
>> + const char *git_dir;
>> +
>> + if (dirty_submodule & DIRTY_SUBMODULE_UNTRACKED) {
>> + fprintf(f, "%sSubmodule %s contains untracked content\n",
>> + line_prefix, path);
>> + }
>> + if (dirty_submodule & DIRTY_SUBMODULE_MODIFIED) {
>> + fprintf(f, "%sSubmodule %s contains modified content\n",
>> + line_prefix, path);
>> + }
>> +
>> + strbuf_addf(&sb, "%s%sSubmodule %s %s..",
>> + line_prefix, meta, path,
>> + find_unique_abbrev(one, DEFAULT_ABBREV));
>> + strbuf_addf(&sb, "%s:%s",
>> + find_unique_abbrev(two, DEFAULT_ABBREV),
>> + reset);
>> + fwrite(sb.buf, sb.len, 1, f);
>> +
>> + if (is_null_sha1(one))
>> + fprintf(f, " (new submodule)");
>> + if (is_null_sha1(two))
>> + fprintf(f, " (submodule deleted)");
>
> These messages are in sync with show_submodule_summary() that is
> used in --submodule=log codepath. Good.
>
They're not exactly the same due to some ways of splitting up new lines.
>> + /*
>> + * We need to determine the most accurate location to call the sub
>> + * command, and handle the various corner cases involved. We'll first
>> + * attempt to use the path directly if the submodule is checked out.
>> + * Then, if that fails, we'll check the standard module location in
>> + * the git directory. If even this fails, it means we can't do the
>> + * lookup because the module has not been initialized.
>> + */
>
> This is more elaborate than what show_submodule_summary() does,
> isn't it? Would it make the patch series (and the resulting code)
> more understandable if you used the same code by refactoring these
> two? If so, I wonder if it makes sense to split 3/3 into a few
> separate steps:
The show_submodule_summary just uses "add_submodule_odb" which adds
the submodule as an alternate source of objects, if I understand
correctly.
>
> * Update the internal "--submodule=<type>" handling without adding
> the "--submodule=diff" and show_submodule_diff() function.
>
> * Refactor the determination of the submodule status (i.e. does it
> even have a clone? where is its repository? etc.) from existing
> show_submodule_summary() into a separate helper function.
>
> * Make that helper function more elaborate like what you do here,
> and update show_submodule_summary(). I think the state
> show_submodule_summary() calls "not checked out" corresponds to
> what you say "not initialized" below, and they should share the
> same logic to determine that the submodule is in that state, and
> share the same message, for example.
Makes sense, I might squash that with the above if it's easier.
>
> * Introduce "--submodule=diff", and show_submodule_diff() function;
> the latter would use the helper function prepared in the previous
> step.
>
> perhaps?
>
That makes more sense. I'll rework the series some more.
>> +
>> + if (dirty_submodule & DIRTY_SUBMODULE_MODIFIED) {
>> + /*
>> + * If the submodule has modified contents we want to diff
>> + * against the work tree, so don't add a second parameter.
>> + * This is essentially equivalent of using diff-index instead.
>> + * Note that we can't (easily) show the diff of any untracked
>> + * files.
>> + */
>> + } else if (is_null_sha1(two)) {
>
> It is safer to have ';' inside the empty if(){} block to make sure
> that one empty statement exists there. It makes the intention of
> the code clearer, too.
>
Will do.
> I am debating myself if this is a good thing to do, though. The
> submodule is a separate project for a reason, and there is a reason
> why the changes haven't been committed. When asking "what's different
> between these two superproject states?", should the user really see
> these uncommitted changes?
>
> Thanks.
Well, the previous submodule code for "log" prints out "submodule has
untracked content" and "submodule has modified content" so I figured
the diff might want to show that as a diff too? Or maybe we just stick
with the messages and only show the difference of what's actually been
committed.. I think that's probably ok too.
Regards,
Jake
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html