On Mon, May 12, 2014 at 5:56 AM, David Turner <dtur...@twopensource.com> wrote:
>> So without watchman I got
>> 299.871ms read_index_from:1538 if (verify_hdr(hdr, mmap_size) < 0) go
>> 498.205ms cmd_status:1300 refresh_index(&the_index, REFRESH_QUIE
>> 796.050ms wt_status_collect:622 wt_status_collect_untracked(s)
>> and with watchman ("git status" ran several times to make sure it's cached)
>> 301.950ms read_index_from:1538 if (verify_hdr(hdr, mmap_size) < 0) go
>> 34.918ms read_fs_cache:347 if (verify_hdr(hdr, mmap_size) < 0) go
>> 1564.096ms watchman_load_fs_cache:628 update_fs_cache(istate, result);
>> 161.930ms cmd_status:1300 refresh_index(&the_index, REFRESH_QUIE
>> 251.614ms wt_status_collect:622 wt_status_collect_untracked(s)
>> Given the total time of "git status" without watchman is 1.9s,,
>> update_fs_cache() nearly matches that number alone. All that is spent
>> in the exclude update code in the function, but if you do
>> last_excluding_matching() anyway, why cache it?
> My numbers are different (for my test repository):
> 30.031ms read_index:1386 r = read_index_from(istate, get_index_
> 71.625ms cmd_status:1302 refresh_index(&the_index, REFRESH_QUIE
> 259.712ms wt_status_collect:622 wt_status_collect_untracked(s)
> 41.110ms read_index:1386 r = read_index_from(istate, get_index_
> 9.294ms read_fs_cache:347 if (verify_hdr(hdr, mmap_size) < 0) go
> 0.173ms watchman_load_fs_cache:628 update_fs_cache(istate, result)
> 41.901ms read_index:1386 r = read_index_from(istate, get_index_
> 18.355ms cmd_status:1302 refresh_index(&the_index, REFRESH_QUIE
> 50.911ms wt_status_collect:622 wt_status_collect_untracked(s)
> I think something must be going wrong with update_fs_cache on your
> machine. I have a few hypotheses:
> 1. Maybe watchman isn't fully started up when you run your tests.
> 2. Maybe there is a bug.
It's probably me doing something wrong (I ran it more than a couple
times so watchman must have loaded the whole thing). I got small
numbers in update_fs_cache() now.
>> A bit surprised about wt_status_collect_untracked number. I verified
>> that last_excluding_matching() still runs (on the same number of
>> entries like in no-watchman case). Replacing fs_cache_open() in
>> add_excludes_from_file_to_list() to plain open() does not change the
>> number, so we probably won't need that (unless your worktree is filled
>> with .gitignore, which I doubt it's a norm).
> My test repo has a couple hundred of them. Maybe that's unusual? A
> repo with a lot of projects will tend to have lots of gitignore files,
> because each project will want to maintain them independently.
I tried the worst case, every directory had an empty .gitignore. The
numbers did not change much. And I found out that because
add_excludes.. were called twice, not on every .gitignore because of
the condition "!(dir->flags & DIR_EXCLUDE_CMDL_ONLY)". So the number
of .gitignore does not matter (yet).
This is your quote from above, moved down a bit:
> update_fs_cache should only have to update based on what it has learned
> from watchman. So if no .gitignore has been changed, it should not have
> to do very much work.
> I could take the fe_excluded check and move it above the
> last_exclude_matching check in fs_cache_is_excluded; it causes t7300 to
> fail when run under watchman but presumably that's fixable
So you exclude files early and make the real read_directory() pass do
pretty much nothing. This is probably not a good idea. Assume that I
touch $TOP/.gitignore then do something other than "git status" (or
"git add") then I have to pay read_directory() cost.
Back to the open vs fs_cache_open and the number of .gitignore files
above. I touch $TOP/.gitignore then do "git status" to make it read
all .gitignore files (6k of them) and change between open and
fs_cache_open. I think the numbers still do not make any visible
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html