Quite interested in BytePS. Looking forward to seeing how integration could evolve.
On Wed, Nov 6, 2019, 8:14 AM Yimin Jiang <jym...@gmail.com> wrote: > Hi Zhennan, > > Thanks for your interest. To be honest, our team currently do not have a > plan for CPU training. That said, the notion of BytePS is not GPU-specific > and should also apply to CPU. I do not see a fundamental challenge yet. And > we welcome contributions on this. > > Thank you, > Yimin > > On Wed, Nov 6, 2019 at 2:59 PM Qin, Zhennan <zhennan....@intel.com> wrote: > > > Hi Yimin, > > > > Welcome to make contribution to MXNet project! > > > > From <https://github.com/bytedance/byteps/blob/master/README.md> > > https://github.com/bytedance/byteps/blob/master/README.md I found > another > > limitation that isn't shown in your proposal: > > > > BytePS does not support pure CPU training for now. One reason is that the > > cheap PS assumption< > > https://github.com/bytedance/byteps/blob/master/docs/rationale.md> of > > BytePS do not hold for CPU training. Consequently, you need CUDA and NCCL > > to build and run BytePS. > > > > I have a couple of question for this: How's the status of CPU training > > support? If CPU training isn't supported yet, what's the challenge to > > support it? Do you have a plan to support it? > > > > Thanks, > > Zhennan > > > > On Wed, 2019-11-06 at 12:14 +0800, Yimin Jiang wrote: > > > > Hi MXNet Community, > > > > > > BytePS (https://github.com/bytedance/byteps) is a high-performance, > > > > cross-framework architecture for distributed training. BytePS developers > > > > are planning to integrate a part of BytePS into MXNet. The link below is > > > > the proposal. Feedbacks are welcome. > > > > > > > https://cwiki.apache.org/confluence/display/MXNET/BytePS-MXNet+Integration > > > > > > > > Thank you, > > > > Yimin Jiang > > > > >