merlimat commented on issue #12537:
URL: https://github.com/apache/pulsar/issues/12537#issuecomment-954859813


   Thanks @brucelwl for bringing this up. 
   
   Most of the space is actually taken up by the `asf-site` branch, which 
receives automated updates for the website. Part of the problem is that the 
daily updates rewrite many HTML files just to refresh their timestamps.
   
   I created #12538 to at least stop the bulk of these updates. 
   
   I believe we should also truncate the history of the generated website 
commits, which should immediately reduce the repo size to a fraction of its 
current size. 
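   
   As a rough illustration of what truncating generated history could look 
like, here is a minimal sketch on a throwaway local repository. It assumes 
the common orphan-branch approach, which is not necessarily how this would 
be done for `asf-site`; the `snapshot` branch name and the file contents 
are hypothetical.
   
   ```shell
   set -eu
   # Throwaway repository standing in for a branch of generated commits.
   repo=$(mktemp -d)
   git init -q "$repo"
   cd "$repo"
   git config user.email "demo@example.invalid"
   git config user.name "demo"
   # Simulate three daily automated updates of the same generated file.
   for day in 1 2 3; do
     echo "generated on day $day" > index.html
     git add index.html
     git commit -q -m "site update $day"
   done
   branch=$(git symbolic-ref --short HEAD)
   before=$(git rev-list --count HEAD)
   # Start a parentless branch that keeps the current files but no history.
   git checkout -q --orphan snapshot
   git commit -q -m "Website snapshot (history truncated)"
   # Replace the old branch with the single-commit snapshot.
   git branch -M "$branch"
   after=$(git rev-list --count HEAD)
   echo "commits before: $before, after: $after"
   ```
   
   On a shared repository the rewritten branch would still need a force 
push, and the old objects only disappear once the server garbage-collects 
them.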
   
   ```
   ➜  git clone https://github.com/apache/pulsar.git
   Cloning into 'pulsar'...
   remote: Enumerating objects: 2605457, done.
   remote: Counting objects: 100% (26149/26149), done.
   remote: Compressing objects: 100% (3288/3288), done.
   remote: Total 2605457 (delta 23308), reused 24816 (delta 22221), pack-reused 2579308
   Receiving objects: 100% (2605457/2605457), 1.25 GiB | 26.27 MiB/s, done.
   Resolving deltas: 100% (1976654/1976654), done.
   ➜  pulsar git:(master) du -sh pulsar/.git
   1.3G .git
   ```
   
   A workaround is to fetch only the branch you need when cloning the repo:
   
   ```
   ➜  /tmp git clone -b master --single-branch https://github.com/apache/pulsar.git  pulsar-master
   Cloning into 'pulsar-master'...
   remote: Enumerating objects: 200320, done.
   remote: Counting objects: 100% (257/257), done.
   remote: Compressing objects: 100% (178/178), done.
   remote: Total 200320 (delta 65), reused 127 (delta 29), pack-reused 200063
   Receiving objects: 100% (200320/200320), 89.12 MiB | 9.72 MiB/s, done.
   Resolving deltas: 100% (91729/91729), done.
   ➜  pulsar-master git:(master) du -sh pulsar-master/.git
   103M .git
   ```
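   
   A single-branch clone only tracks the branch it was created with, so 
other branches have to be added explicitly if they are needed later. The 
sketch below shows this on a small local repository rather than on 
apache/pulsar; the `site-demo` branch and the directory names are 
hypothetical.
   
   ```shell
   set -eu
   work=$(mktemp -d)
   # Source repository with a default branch plus a "generated" branch.
   git init -q "$work/src"
   cd "$work/src"
   git config user.email "demo@example.invalid"
   git config user.name "demo"
   echo "source code" > README
   git add README
   git commit -q -m "initial commit"
   main=$(git symbolic-ref --short HEAD)
   git checkout -q -b site-demo
   echo "<html>generated</html>" > index.html
   git add index.html
   git commit -q -m "generated site"
   git checkout -q "$main"
   cd "$work"
   # file:// forces the normal transport, so only the needed objects are sent.
   git clone -q -b "$main" --single-branch "file://$work/src" single
   single_before=$(git -C single rev-list --all --objects | wc -l | tr -d ' ')
   # Later on, widen the clone to track one more branch and fetch it.
   git -C single remote set-branches --add origin site-demo
   git -C single fetch -q origin
   single_after=$(git -C single rev-list --all --objects | wc -l | tr -d ' ')
   echo "objects before: $single_before, after: $single_after"
   ```
   
   The same two commands (`git remote set-branches --add` followed by 
`git fetch`) should apply to a single-branch clone of any repository.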


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

