kbendick commented on pull request #2577:
URL: https://github.com/apache/iceberg/pull/2577#issuecomment-840856172


   I think that this can be configured via `spark.locality.wait`. I think if 
you set it to zero, it will just automatically give up looking for a data local 
node. At least that's what I've done when reading from S3 with yarn (which is 
by definition not local).
   
   ```
   Number of milliseconds to wait to launch a data-local task before giving up 
and launching it on a less-local node. The same wait will be used to step 
through multiple locality levels (process-local, node-local, rack-local and 
then any). It is also possible to customize the waiting time for each level by 
setting spark.locality.wait.node, etc. You should increase this setting if your 
tasks are long and see poor locality, but the default usually works well.
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to