On Thu, Sep 01, 2016 at 11:16:49AM -0700, gjarms wrote:
> Hi Git Experts,
> We have been exploring various ways to improve git cloning time, one among
> them is using bitmap which is suppose to save time "counting objects". but
> i have problem creating bitmap since the repository contains 100's of pack
> files. the bitmap file is not created when i use "git gc".
> I have the following entries in my .gitconfig.
> packSizeLimit = 10m
> writebitmaps = on
> writeBitmapHashCache = on
> If i just dont use "packSizeLimit = 10m", then bitmap is created just by
> running git gc
> Can you please make me understand ?, What i understood is that the bitmap is
> created when there is a single pack file, but if i split it into multiple
> pack file, the bitmap generation fails with the warning
> "warning: disabling bitmap writing, as some objects are not being packed".
That's weird. I'd expect it to say:
disabling bitmap writing, packs are split due to pack.packSizeLimit
At least since v2.8.3, which has 9cea46cdda. Before that it wouldn't
have printed anything, and just silently turned off bitmaps.
The "some objects are not being packed" warning should only come when
want_object_in_pack() says we don't want an object. That's generally
because the object is either found in a shared alternates repository, or
is in a .keep pack. Though in the latter case, unless you've set
repack.packKeptObjects manually, we'll pack it anyway when bitmaps are
in effect (since ee34a2bead, in v2.0.0).
That confusion aside, you almost certainly should not be setting
packSizeLimit, and definitely not to something so low. Git will not
store cross-pack deltas, so you miss out on tons of delta opportunities.
As a result:
1. Your on-disk repository size will balloon. So you'll have a hundred
10m packs rather than one 200mb pack.
2. Your clone times will also grow, as git will try to find new deltas
between the objects in various packs independently for each clone.