[GitHub] spark pull request #19976: [SPARK-22660][BUILD] Use position() and limit() t...

2017-12-14 Thread kellyzly
GitHub user kellyzly opened a pull request: https://github.com/apache/spark/pull/19976 [SPARK-22660][BUILD] Use position() and limit() to fix ambiguity issue in scala-2.12 ## What changes were proposed in this pull request? Missing some changes about limit
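Editorial note on the title: `java.nio.Buffer` has a zero-argument `position()`/`limit()` getter and a one-argument setter of the same name. Scala allows a zero-argument Java method to be called without parentheses, and as far as I understand the issue, a bare `buf.position` can then be reported as an ambiguous overloaded reference under Scala 2.12 on newer JDKs; writing the parentheses explicitly avoids that. The Java sketch below only illustrates the two overloads; `BufferBounds` and `remainingBytes` are hypothetical names and are not taken from the pull request.

```java
import java.nio.ByteBuffer;

// Hypothetical helper, for illustration only; not code from the Spark patch.
final class BufferBounds {
    // Zero-argument overloads are the getters: limit() and position() return int.
    static int remainingBytes(ByteBuffer buf) {
        return buf.limit() - buf.position();
    }

    public static void main(String[] args) {
        ByteBuffer buf = ByteBuffer.allocate(16);
        buf.position(4);  // one-argument overload: sets the position
        buf.limit(10);    // one-argument overload: sets the limit
        System.out.println(remainingBytes(buf)); // prints 6
    }
}
```

In Scala the equivalent explicit calls would be `buf.position()` and `buf.limit()`, which is the spelling the PR title refers to.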

[GitHub] spark issue #19854: [SPARK-22660][BUILD] Use position() and limit() to fix a...

2017-12-06 Thread kellyzly
Github user kellyzly commented on the issue: https://github.com/apache/spark/pull/19854 Thanks for the review, @HyukjinKwon, @srowen, @viirya. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark pull request #19854: [SPARK-22660][BUILD] Use position() and limit() t...

2017-12-06 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/19854#discussion_r155168946 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/Platform.java --- @@ -169,8 +169,10 @@ public static ByteBuffer allocateDirectBuffer(int

[GitHub] spark pull request #19853: SPARK-22659:remove sun.misc.Cleaner references

2017-12-05 Thread kellyzly
Github user kellyzly closed the pull request at: https://github.com/apache/spark/pull/19853

[GitHub] spark pull request #19854: SPARK-22660:Use position() and limit() to fix amb...

2017-12-05 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/19854#discussion_r155083419 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/Platform.java --- @@ -169,8 +169,10 @@ public static ByteBuffer allocateDirectBuffer(int

[GitHub] spark pull request #19854: SPARK-22660:Use position() and limit() to fix amb...

2017-12-05 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/19854#discussion_r155080359 --- Diff: external/kafka-0-10-sql/src/test/scala/org/apache/spark/sql/kafka010/KafkaTestUtils.scala --- @@ -296,7 +296,8 @@ class KafkaTestUtils

[GitHub] spark pull request #19854: SPARK-22660:Use position() and limit() to fix amb...

2017-11-30 Thread kellyzly
GitHub user kellyzly opened a pull request: https://github.com/apache/spark/pull/19854 SPARK-22660:Use position() and limit() to fix ambiguity issue in scala-2.12 and JDK9 ## What changes were proposed in this pull request? (Please fill in changes proposed

[GitHub] spark pull request #19853: SPARK-22659:remove sun.misc.Cleaner references

2017-11-29 Thread kellyzly
GitHub user kellyzly opened a pull request: https://github.com/apache/spark/pull/19853 SPARK-22659:remove sun.misc.Cleaner references ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How was this patch tested

[GitHub] spark pull request: [Spark-5682] Add spark encrypted shuffle by us...

2015-04-09 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/5307#issuecomment-91146737 @vanzin : Left some initial comments. I like the idea of using a library for this, my main concern is about the stability of that library (and who'll

[GitHub] spark pull request: [Spark-5682] Add spark encrypted shuffle by us...

2015-04-08 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/5307#discussion_r28030065 --- Diff: core/src/main/scala/org/apache/spark/storage/BlockObjectWriter.scala --- @@ -123,12 +130,34 @@ private[spark] class DiskBlockObjectWriter

[GitHub] spark pull request: [Spark-5682] Add spark encrypted shuffle by us...

2015-04-08 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/5307#discussion_r28029243 --- Diff: core/src/main/scala/org/apache/spark/storage/BlockObjectWriter.scala --- @@ -123,12 +130,34 @@ private[spark] class DiskBlockObjectWriter

[GitHub] spark pull request: [Spark-5682] Add spark encrypted shuffle by us...

2015-04-01 Thread kellyzly
GitHub user kellyzly opened a pull request: https://github.com/apache/spark/pull/5307 [Spark-5682] Add spark encrypted shuffle by using chimera lib [Chimera](https://github.com/intel-hadoop/chimera) is a project which strips code related to CryptoInputStream/CryptoOutputStream from

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-23 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-85348209 @steveloughran: I have updated the code according to your comments: [make CryptoOutputStream.scala#close safe](https://github.com/kellyzly/spark/blob

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-23 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-84855188 @steveloughran: I don't understand why we need to make CryptoOutputStream.scala#close safe. Is there a situation where multiple threads call this function at the same time
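Editorial note: the snippet does not say what "safe" meant in the review. One common reading is that `close` should be idempotent and tolerate concurrent callers, so the underlying stream is closed only once. The Java sketch below assumes that reading; `CloseOnceOutputStream` is a hypothetical wrapper and is not the `CryptoOutputStream` from the pull request.

```java
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical wrapper, for illustration only; not code from the Spark patch.
final class CloseOnceOutputStream extends FilterOutputStream {
    // Flips from false to true exactly once, even if several threads race.
    private final AtomicBoolean closed = new AtomicBoolean(false);

    CloseOnceOutputStream(OutputStream out) {
        super(out);
    }

    @Override
    public void close() throws IOException {
        // Only the caller that wins the compare-and-set closes the wrapped stream;
        // every later or concurrent call returns without doing anything.
        if (closed.compareAndSet(false, true)) {
            super.close();
        }
    }

    boolean isClosed() {
        return closed.get();
    }
}
```

With a guard like this, a second `close()` from a `finally` block or from another thread is a no-op rather than a second flush and close of the wrapped stream.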

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-23 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-84893111 @steveloughran: In hadoop, if we need to add a native lib path to the hadoop execution path, we need to export LD_LIBRARY_PATH: export LD_LIBRARY_PATH=x in hadoop

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-19 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-83426561 @tgravescs,@vanzin,@srowen,@JoshRosen,@CodingCat: I have updated the code to address your previous comments. Big changes are made in this submission: 1. Delete hadoop2.6

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-19 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/4491#discussion_r26818763 --- Diff: core/src/main/scala/org/apache/spark/crypto/CipherSuite.scala --- @@ -0,0 +1,72 @@ +/* + * Licensed to the Apache Software Foundation (ASF

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-19 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-83903992 @steveloughran: Thanks for your valuable suggestions. * I will update to the latest crypto code in hadoop trunk and rewrite it in scala. I looked into HADOOP

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-19 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/4491#discussion_r26818889 --- Diff: core/src/main/scala/org/apache/spark/crypto/CryptoInputStream.scala --- @@ -0,0 +1,428 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-16 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/4491#discussion_r26463740 --- Diff: core/src/main/scala/org/apache/spark/storage/BlockObjectWriter.scala --- @@ -123,12 +133,30 @@ private[spark] class DiskBlockObjectWriter

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-15 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/4491#discussion_r26455600 --- Diff: core/src/main/scala/org/apache/spark/storage/BlockObjectWriter.scala --- @@ -123,12 +133,30 @@ private[spark] class DiskBlockObjectWriter

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-15 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/4491#discussion_r26455837 --- Diff: core/src/main/scala/org/apache/spark/storage/BlockObjectWriter.scala --- @@ -123,12 +133,30 @@ private[spark] class DiskBlockObjectWriter

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-09 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77972708 @vanzin: I have renamed the jira from "Reuse hadoop encrypted shuffle algorithm to enable spark encrypted shuffle" to "Add encrypted shuffle in spark". I will update

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786363 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786367 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786385 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786348 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786280 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786424 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786413 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-08 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77786426 @srowen,@tgravescs,@vanzin: Encrypted shuffle can make the shuffle process safer. I think it is necessary in spark. Previous design is reusing hadoop encrypted

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-03-06 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-77521213 @tgravescs: My teammates and I are trying to strip classes like CryptoInputStream.java and CryptoOutputStream.java from package org.apache.hadoop.crypto in hadoop

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-02-13 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-74218847 @srowen: The unit test fails. The error message is like /home/jenkins/workspace/SparkPullRequestBuilder/core/src/main/scala/org/apache/spark/storage

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-02-12 Thread kellyzly
Github user kellyzly commented on a diff in the pull request: https://github.com/apache/spark/pull/4491#discussion_r24636611 --- Diff: pom.xml --- @@ -1600,6 +1600,19 @@ </profile> <profile> + <id>hadoop-2.6</id> + <properties>

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-02-12 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-74207662 @tgravescs @vanzin @srowen @JoshRosen I was glad to receive all the suggestions from you. I summarized all of your suggestions: * CryptoInputStream

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-02-12 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-74207814 @tgravescs Did you test it with the YARN external shuffle service? Sorry, I have not tested this situation and will test it later. --- If your

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-02-12 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-74217212 @tgravescs: I want to ask a question: why can org.apache.hadoop.mapreduce.MRJobConfig, which is marked InterfaceAudience.Private, be imported

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-02-11 Thread kellyzly
Github user kellyzly commented on the pull request: https://github.com/apache/spark/pull/4491#issuecomment-74014097 @JoshRosen: I have updated the patch: I restored the hadoop 2.4 profile I had deleted before and added a hadoop2.6 profile.

[GitHub] spark pull request: [SPARK-5682] Reuse hadoop encrypted shuffle al...

2015-02-09 Thread kellyzly
GitHub user kellyzly opened a pull request: https://github.com/apache/spark/pull/4491 [SPARK-5682] Reuse hadoop encrypted shuffle algorithm to enable spark en... You can merge this pull request into a Git repository by running: $ git pull https://github.com/kellyzly/spark