Hi @jeremiedb, that makes sense. Fine-tuning larger networks can be done with a smaller batch size. It worked for me with a batch size of 32 for vgg19, without calling gc(). Since adding gc() might slow training down, it makes sense to reduce the batch size to fit the hardware if training crashes.
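As a rough, framework-agnostic sketch of the "reduce the batch size when training crashes" approach: the Python below halves the batch size after each failure until a run succeeds. `fit_with_batch_backoff`, `train_fn`, and `fake_train` are hypothetical names, not MXNet APIs, and the sketch assumes the crash surfaces as a `MemoryError` or as a `RuntimeError` whose message mentions "out of memory".

```python
def fit_with_batch_backoff(train_fn, batch_size=128, min_batch_size=1):
    """Call train_fn(batch_size), halving batch_size after each memory failure.

    train_fn is a hypothetical callable that runs one training job with the
    given batch size and returns its result.
    """
    while batch_size >= min_batch_size:
        try:
            return batch_size, train_fn(batch_size)
        except (MemoryError, RuntimeError) as err:
            # Assumption: memory exhaustion is reported as MemoryError, or as a
            # RuntimeError mentioning "out of memory". Anything else is re-raised.
            if isinstance(err, RuntimeError) and "out of memory" not in str(err).lower():
                raise
            batch_size //= 2  # try again with half the batch
    raise MemoryError("training did not fit in memory even at the minimum batch size")


def fake_train(batch_size):
    # Stand-in for a real training run: pretend the device fits at most 32 samples.
    if batch_size > 32:
        raise MemoryError("out of memory")
    return "trained"


used, result = fit_with_batch_backoff(fake_train, batch_size=128)
print(used, result)  # prints "32 trained"
```

In practice it is probably simpler to pick a batch size once for the hardware, as described above, than to retry automatically; the loop only illustrates the idea.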
@lichen11 Please try a reduced batch size; it should fix your issue. @sandeep-krishnamurthy Could you please close the issue, as it has been answered and solved? @lichen11 Please feel free to reopen if it was closed in error. [ Full content available at: https://github.com/apache/incubator-mxnet/issues/7968 ]
