On https://spot.incubator.apache.org/doc/ in section “2. Deployment Recommendations”, we are recommending that the “spot-ml” software (which is all scala-spark-streaming) should be run on the YARN node manager. On the "service layout", it is implied that the spot-ml software is installed on "Worker" node.
But these jobs are intended to be launched from the edge node, where the YARN service packages them up and publishes them to worker nodes. Thus, to make this simpler and more obvious for the users, the "ML" task should be shown as being installed, configured, and executed from the Edge node. Jeremy
