[ 
https://issues.apache.org/jira/browse/SYSTEMML-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

LI Guobao updated SYSTEMML-2420:
--------------------------------
    Attachment: systemml_rpc_class_diagram.png

> Communication between ps and workers
> ------------------------------------
>
>                 Key: SYSTEMML-2420
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-2420
>             Project: SystemML
>          Issue Type: Sub-task
>            Reporter: LI Guobao
>            Assignee: LI Guobao
>            Priority: Major
>         Attachments: systemml_rpc_2_seq_diagram.png, 
> systemml_rpc_class_diagram.png, systemml_rpc_sequence_diagram.png
>
>
> It aims to implement the parameter exchange between ps and workers. We could 
> leverage netty framework to implement our own Rpc framework. In general, the 
> netty {{TransportClient}} and {{TransportServer}} provides the sending and 
> receiving service for ps and workers. Extending the {{RpcHandler}} allows to 
> invoke the corresponding ps method (i.e., push/pull method) by handling the 
> different input Rpc call object. And then the {{SparkPsProxy}} wrapping 
> {{TransportClient}} allows the workers to execute the push/pull call to 
> server. At the same time, the ps netty server also provides the file 
> repository service which allows the workers to download the partitioned 
> training data, so that the workers could rebuild the matrix object with the 
> transfered file instead of broadcasting all the files with spark which are 
> not all necessary for each worker.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to