ByronHsu opened a new pull request #491: URL: https://github.com/apache/submarine/pull/491
### What is this PR for?
To save resources, we use a single TensorBoard instance for all experiments.

<img src="https://user-images.githubusercontent.com/24364830/104589865-7657f300-56a5-11eb-969d-60d7f4567625.jpg" width="500px">

The implementation details are shown in this figure:
1. There is a PV for storing TensorBoard log files, and a corresponding PVC.
2. There is a TensorBoard pod serving all the logs, and it can be accessed through the ingress service.
3. MLJob writes logs to the PV, and TensorBoard reads logs from the PV.

### What type of PR is it?
[Feature]

### Todos
* [ ] - Frontend support

### What is the Jira issue?
https://issues.apache.org/jira/browse/SUBMARINE-709

### How should this be tested?

### Screenshots (if appropriate)

### Questions:
* Do the license files need an update? No
* Are there breaking changes for older versions? No
* Does this need documentation? No

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]
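
The shared-volume layout described in the PR (one PV/PVC, one TensorBoard pod reading from it, every MLJob writing to it) can be sketched roughly as follows. This is only an illustration, not the code in this PR: the resource names (`tensorboard-pv`, `tensorboard-pvc`, `tensorboard-logs`), the `/logs` mount path, the `hostPath` backing store, the `tensorflow/tensorflow` image, the `LOG_DIR` environment variable, and the per-experiment subdirectory convention are all assumptions chosen for the example. The ingress resource is left out for brevity.

```python
import json

# All names and paths below are hypothetical, chosen only for illustration.
PV_NAME = "tensorboard-pv"
PVC_NAME = "tensorboard-pvc"
VOLUME_NAME = "tensorboard-logs"
LOG_MOUNT = "/logs"


def log_volume():
    """Pod-level volume entry that points at the shared PVC."""
    return {"name": VOLUME_NAME, "persistentVolumeClaim": {"claimName": PVC_NAME}}


def persistent_volume(size="10Gi"):
    """PV that holds the log files (hostPath is an assumption for a single-node cluster)."""
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolume",
        "metadata": {"name": PV_NAME},
        "spec": {
            "capacity": {"storage": size},
            "accessModes": ["ReadWriteMany"],
            "hostPath": {"path": "/tmp/tensorboard-logs"},
        },
    }


def persistent_volume_claim(size="10Gi"):
    """PVC bound explicitly to the PV above."""
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolumeClaim",
        "metadata": {"name": PVC_NAME},
        "spec": {
            "accessModes": ["ReadWriteMany"],
            "storageClassName": "",
            "volumeName": PV_NAME,
            "resources": {"requests": {"storage": size}},
        },
    }


def tensorboard_pod():
    """Single TensorBoard pod that serves every experiment's logs, mounted read-only."""
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": "tensorboard", "labels": {"app": "tensorboard"}},
        "spec": {
            "containers": [{
                "name": "tensorboard",
                "image": "tensorflow/tensorflow",
                "command": ["tensorboard", "--logdir", LOG_MOUNT, "--host", "0.0.0.0"],
                "ports": [{"containerPort": 6006}],
                "volumeMounts": [
                    {"name": VOLUME_NAME, "mountPath": LOG_MOUNT, "readOnly": True}
                ],
            }],
            "volumes": [log_volume()],
        },
    }


def experiment_pod_spec(experiment_id, image):
    """Pod spec fragment for a training job that writes logs into its own subdirectory."""
    return {
        "containers": [{
            "name": "worker",
            "image": image,
            "env": [{"name": "LOG_DIR", "value": f"{LOG_MOUNT}/{experiment_id}"}],
            "volumeMounts": [{"name": VOLUME_NAME, "mountPath": LOG_MOUNT}],
        }],
        "volumes": [log_volume()],
    }


if __name__ == "__main__":
    manifests = [persistent_volume(), persistent_volume_claim(), tensorboard_pod()]
    print(json.dumps(manifests, indent=2))
```

Because each job writes under its own subdirectory of the shared mount, a single TensorBoard process pointed at the mount root would list each experiment as a separate run, which is what makes one instance sufficient in this sketch.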
