[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-24 Thread sitalkedia
Github user sitalkedia commented on the pull request: https://github.com/apache/spark/pull/12944#issuecomment-221283774 cc - @srowen --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feat

[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-10 Thread sitalkedia
Github user sitalkedia commented on the pull request: https://github.com/apache/spark/pull/12944#issuecomment-218248308 cc - @rxin --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this featur

[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-06 Thread sitalkedia
Github user sitalkedia commented on a diff in the pull request: https://github.com/apache/spark/pull/12944#discussion_r62334156 --- Diff: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ShuffleIndexRecord.java --- @@ -0,0 +1,39 @@ +/* + * Licensed to

[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-06 Thread sitalkedia
Github user sitalkedia commented on the pull request: https://github.com/apache/spark/pull/12944#issuecomment-217450001 @holdenk - `TransportConf` is not specific to the , it is used to create Transport client in other modules as well. Since number of index cache entry is very specif

[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-06 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/12944#discussion_r62331757 --- Diff: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ShuffleIndexRecord.java --- @@ -0,0 +1,39 @@ +/* + * Licensed to

[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-05 Thread holdenk
Github user holdenk commented on the pull request: https://github.com/apache/spark/pull/12944#issuecomment-217358557 So a very minor style thing; it seems like the rest of the configuration values are exposed through accessor methods on TransportConf rather than directly exposing getI

[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12944#issuecomment-217321362 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your p

[GitHub] spark pull request: [SPARK-15074][Shuffle] Cache shuffle index fil...

2016-05-05 Thread sitalkedia
GitHub user sitalkedia opened a pull request: https://github.com/apache/spark/pull/12944 [SPARK-15074][Shuffle] Cache shuffle index file to speedup shuffle fetch ## What changes were proposed in this pull request? Shuffle fetch on large intermediate dataset is slow because t