[ https://issues.apache.org/jira/browse/SPARK-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14066788#comment-14066788 ]
Colin Patrick McCabe commented on SPARK-1767: --------------------------------------------- I posted a pull request here: https://github.com/apache/spark/pull/1486 This illustrates how to do it with Hadoop 2.5. I'm still not sure about whether we should change the type of getPreferredLocations (see the pull request) > Prefer HDFS-cached replicas when scheduling data-local tasks > ------------------------------------------------------------ > > Key: SPARK-1767 > URL: https://issues.apache.org/jira/browse/SPARK-1767 > Project: Spark > Issue Type: Improvement > Components: Spark Core > Affects Versions: 1.0.0 > Reporter: Sandy Ryza > -- This message was sent by Atlassian JIRA (v6.2#6252)