Joydeep Sen Sarma wrote:
if the cluster is unused - why restrict parallelism? if someone's willing to
wake up at 4am to beat the crowd - they would just absolutely hate this.
[It would be better to make your comments in Jira. ]
But if someone starts a long-running job at night that uses the entire
cluster then they could monopolize the cluster into the day. If
speculative execution is enabled, then some tasks could be killed to
make room for other jobs are started in the morning, but that's not
always possible. And, if it's not, pickling a job's state and swapping
it to HDFS would be expensive.
Note also that a task-limiting cluster cluster will still run faster at
night. If you've got 50 nodes with up to 200 tasks running at a time,
then tasks will run faster when only 50 are running. The network is
also a primary bottleneck, and it will be less congested when fewer jobs
are running, and disk contention will be lower too. So night owls would
still have significant advantages.
It's not intended as a perfect solution, but rather a substantial
improvement for many users that's not too hard to implement.
Doug