[
https://issues.apache.org/jira/browse/SUBMARINE-52?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16831428#comment-16831428
]
Szilard Nemeth commented on SUBMARINE-52:
-----------------------------------------
Hi [~sunilg]!
1. Thanks, I hope this will be resolved soon. If you check the latest build
result from jenkins, we are still encountering this issue quite often.
2. Tried to run ./start-build-env.sh in Hadoop's root, but it hangs for me with
the docker run command that would start off the hadoop-build container. I had
some successful docker builds before. Have you ever seen such occasion when
this command hanged?
3, 4, 5, 6: I think these are no-ops at this point in time for this patch
7. I already fixed the majority of the checkstyle issues, the remainders are
the ones that I didn't plan to fix as described above.
As discussed with [~tangzhankun]: If this goes in, I will rebase my
SUBMARINE-49 patch on top of it and could go forward with that.
In the meantime, I can make some progress with the documentation.
[~sunilg], [~wangda]: Ideally, this patch should be reviewed & committed this
week, right?
Please note that I have a day off tomorrow so the earlies time I can fix review
comments will be Monday, next week.
Thanks!
> Generate Service spec + launch script for single-node PyTorch learning job
> --------------------------------------------------------------------------
>
> Key: SUBMARINE-52
> URL: https://issues.apache.org/jira/browse/SUBMARINE-52
> Project: Hadoop Submarine
> Issue Type: Sub-task
> Reporter: Szilard Nemeth
> Assignee: Szilard Nemeth
> Priority: Major
> Attachments: SUBMARINE-52-2.001.patch, SUBMARINE-52.001.patch,
> SUBMARINE-52.002.patch, SUBMARINE-52.003.patch, SUBMARINE-52.004.patch,
> SUBMARINE-52.005.patch, SUBMARINE-52.006.patch, SUBMARINE-52.007.patch,
> SUBMARINE-52.008.patch
>
>
> Similar to what we have for Tensorflow in
> {{org.apache.hadoop.yarn.submarine.runtimes.yarnservice.YarnServiceJobSubmitter}},
> we need a code that generates Service spec file (json) for PyTorch.
> We also need to take care of the separation of CLI/YAML arguments of TF /
> PyTorch.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)