[ 
https://issues.apache.org/jira/browse/SYSTEMML-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15979562#comment-15979562
 ] 

Mike Dusenberry edited comment on SYSTEMML-1466 at 4/21/17 11:31 PM:
---------------------------------------------------------------------

Attaching a distributed convnet file, a Python script for running the convnet 
using dummy data of the same size as the breast cancer data, and an output of 
the statistics with and without forced remote parfor loops.

I ran this on a YARN cluster with 19 executors of 90GB each.  I experimented 
with the driver size to see if I could force it to use a remote Spark parfor 
loop, but that didn't work.

To run, grab a copy of the {{nn}} library and place it next to the attached 
files, then execute {{run_convnet_distrib_sgd.py}}.
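
As a rough sanity check before launching, the layout described above can be 
verified with a few lines of Python.  This is only a sketch: the file names 
come from the attachment list, while the {{missing_items}} helper and the 
assumption that everything sits in one flat working directory are 
illustrative and not part of the attached files.  How the script itself is 
launched (plain Python or through Spark) is not covered here.

```python
from pathlib import Path

# Names taken from the comment and the attachment list. Placing all of them
# in a single flat working directory is an assumption made for this sketch.
REQUIRED = ["nn", "convnet_distrib_sgd.dml", "run_convnet_distrib_sgd.py"]


def missing_items(workdir):
    """Return the required entries that are absent from workdir (hypothetical helper)."""
    root = Path(workdir)
    return [name for name in REQUIRED if not (root / name).exists()]


if __name__ == "__main__":
    missing = missing_items(".")
    if missing:
        print("Missing next to the script:", ", ".join(missing))
    else:
        print("Layout looks complete; run_convnet_distrib_sgd.py can be launched.")
```

An empty result means the {{nn}} library and both attached files are in 
place.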


was (Author: mwdus...@us.ibm.com):
Attaching a distributed convnet file, a python script for running the convnet 
using dummy data of the same size as the breast cancer data, and an output of 
the statistics w/ and w/o forced remote parfor loops.

To run, grab a copy of the {{nn}} library and place it next to the attached 
files, then execute {{run_convnet_distrib_sgd.py}}.

> Update `convnet.dml` to use distributed SGD.
> --------------------------------------------
>
>                 Key: SYSTEMML-1466
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1466
>             Project: SystemML
>          Issue Type: Sub-task
>            Reporter: Mike Dusenberry
>            Assignee: Mike Dusenberry
>         Attachments: convnet_distrib_sgd.dml, run_convnet_distrib_sgd.py, 
> run_convnet_distrib_sgd-stats.txt
>
>




