Quick question: is it possible to configure a job such that its map tasks will only run node-local? We have some mappers that need to access data that is only available on the nodes themselves and can therefore not run on other nodes; even if they are in the same rack.
Thanks in advance, -Ives