merlimat commented on issue #12537: URL: https://github.com/apache/pulsar/issues/12537#issuecomment-954859813
Thanks @brucelwl for bringing this up.

Most of the space is actually taken up by the `asf-site` branch, where we have automated updates for the website. Part of the issue is the daily update of many files just to refresh timestamps in the HTML files.

I created #12538 to at least stop the bulk of these updates.

I believe we should also cut the history of the generated commits for the website, and that should immediately reduce the repo size to a fraction of the current size.

```
➜ git clone https://github.com/apache/pulsar.git
Cloning into 'pulsar'...
remote: Enumerating objects: 2605457, done.
remote: Counting objects: 100% (26149/26149), done.
remote: Compressing objects: 100% (3288/3288), done.
remote: Total 2605457 (delta 23308), reused 24816 (delta 22221), pack-reused 2579308
Receiving objects: 100% (2605457/2605457), 1.25 GiB | 26.27 MiB/s, done.
Resolving deltas: 100% (1976654/1976654), done.

➜ pulsar git:(master) du -sh pulsar/.git
1.3G	.git
```

The workaround is to just fetch a specific branch when cloning the repo:

```
➜ /tmp git clone -b master --single-branch https://github.com/apache/pulsar.git pulsar-master
Cloning into 'pulsar-master'...
remote: Enumerating objects: 200320, done.
remote: Counting objects: 100% (257/257), done.
remote: Compressing objects: 100% (178/178), done.
remote: Total 200320 (delta 65), reused 127 (delta 29), pack-reused 200063
Receiving objects: 100% (200320/200320), 89.12 MiB | 9.72 MiB/s, done.
Resolving deltas: 100% (91729/91729), done.

➜ pulsar-master git:(master) du -sh pulsar-master/.git
103M	.git
```
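
For anyone who wants to try the single-branch workaround without pulling 1.25 GiB first, here is a rough offline sketch. It builds a tiny throwaway repository (the paths, file names, and the `has_ref` helper are made up for illustration; only the `master` and `asf-site` branch names mirror the real repo), clones it both ways, and then shows one way to pull in the second branch later with `git remote set-branches --add`. It is only a sketch of the idea, not a measurement of the real repository.

```shell
#!/bin/sh
set -eu

# Throwaway workspace; every path below is hypothetical.
work=$(mktemp -d)
origin="$work/origin"

# Build a small local repository standing in for the real remote.
git init -q "$origin"
git -C "$origin" symbolic-ref HEAD refs/heads/master
git -C "$origin" config user.email "demo@example.invalid"
git -C "$origin" config user.name "Demo"
echo "code" > "$origin/code.txt"
git -C "$origin" add code.txt
git -C "$origin" commit -q -m "source commit"

# A second branch with its own commit, standing in for asf-site.
git -C "$origin" checkout -q -b asf-site
echo "<html>generated</html>" > "$origin/index.html"
git -C "$origin" add index.html
git -C "$origin" commit -q -m "generated site commit"
git -C "$origin" checkout -q master

# file:// makes git use its regular transport, so only the
# requested branch is transferred in the single-branch case.
git clone -q -b master --single-branch "file://$origin" "$work/only-master"
git clone -q "file://$origin" "$work/full"

# Helper: report whether a clone has a remote-tracking ref for a branch.
has_ref() {
  if git -C "$1" rev-parse -q --verify "refs/remotes/origin/$2" >/dev/null; then
    echo present
  else
    echo absent
  fi
}

single=$(has_ref "$work/only-master" asf-site)
full=$(has_ref "$work/full" asf-site)

# Later, opt in to the extra branch without re-cloning.
git -C "$work/only-master" remote set-branches --add origin asf-site
git -C "$work/only-master" fetch -q origin
later=$(has_ref "$work/only-master" asf-site)

echo "single-branch clone: asf-site $single"
echo "full clone: asf-site $full"
echo "after set-branches + fetch: asf-site $later"

rm -rf "$work"
```

The point of the last step is that a single-branch clone is not a dead end: the website branch can still be fetched on demand by whoever actually needs it.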
