DickJC123 opened a new pull request #15922: Refactor for windows CI 'out of 
heap space' errors
URL: https://github.com/apache/incubator-mxnet/pull/15922
   ## Description ##
   This PR is an alternative to PR 
https://github.com/apache/incubator-mxnet/pull/15882 for handling the 'out of 
heap space' errors in the Windows CI.  It follows the suggestion of 
@marcoabreu to break up the problematic large compiles into smaller pieces, 
rather than moving the Windows compiler toolchain to 64-bit.
   So far the PR only breaks apart 
./src/operator/tensor/broadcast_reduce_op_value.{cu,cc}.  The PR also 
cherry-picks a test_shuffle speed-up and a laop_6 flakiness fix to help get the 
PR through CI.
   The pitch: smaller compile tasks are more easily parallelized over the CPU 
cores in a 'make -j N' scenario, though it's not clear whether total compile 
time is reduced.  Splitting may also shorten the time a failing compile takes 
to finish, which helps those who like to isolate a compile problem by 
following up with a plain 'make' command.
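
   To illustrate the parallelism argument, here is a minimal sketch of a toy 
scheduling model.  It is not a measurement of the MXNet build.  The function 
name `makespan` and all durations are hypothetical, and the model assumes 
that splitting a file adds no overhead, which real builds will not quite 
achieve because each piece re-parses the shared headers.

```python
import heapq


def makespan(task_seconds, workers):
    """Wall-clock time when each task goes to the worker that frees up first."""
    finish_times = [0.0] * workers
    heapq.heapify(finish_times)
    for seconds in task_seconds:
        earliest = heapq.heappop(finish_times)
        heapq.heappush(finish_times, earliest + seconds)
    return max(finish_times)


# Hypothetical per-file compile times in seconds (assumed, not measured).
monolithic = [120, 10, 10, 10]
# The same work with the 120 s file split into four 30 s pieces.
split = [30, 30, 30, 30, 10, 10, 10]

for jobs in (1, 4):
    print(jobs, makespan(monolithic, jobs), makespan(split, jobs))
```

   In this model the total work is identical in both layouts, so a serial 
build takes the same time either way.  With four jobs the monolithic layout 
is bounded by its single largest file, while the split layout spreads that 
work across the cores.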
