Hadoop doesn't schedule the tasks close to the data
---------------------------------------------------
Key: CASSANDRA-955
URL: https://issues.apache.org/jira/browse/CASSANDRA-955
Project: Cassandra
Issue Type: Improvement
Components: Core
Reporter: Johan Oskarsson
Hadoop relies on locations for data in input splits being represented as
hostnames and not ip addresses. Currently in my testing tasks are more often
then not being scheduled on a node that does not contain the data requested.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.