[GitHub] spark pull request #20800: [SPARK-23627][SQL] Provide isEmpty in Dataset

2018-05-14 Thread goungoun
Github user goungoun commented on a diff in the pull request: https://github.com/apache/spark/pull/20800#discussion_r188019259 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -511,6 +511,14 @@ class Dataset[T] private[sql]( */ def isLocal

[GitHub] spark issue #20800: [SPARK-23627][SQL] Provide isEmpty in Dataset

2018-05-14 Thread goungoun
Github user goungoun commented on the issue: https://github.com/apache/spark/pull/20800 Thanks!! --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] spark pull request #20800: [SPARK-23627][SQL] Provide isEmpty in Dataset

2018-03-23 Thread goungoun
Github user goungoun commented on a diff in the pull request: https://github.com/apache/spark/pull/20800#discussion_r176728379 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -511,6 +511,14 @@ class Dataset[T] private[sql]( */ def isLocal

[GitHub] spark pull request #19876: [ML][SPARK-23783][SPARK-11239] Add PMML export to...

2018-03-24 Thread goungoun
Github user goungoun commented on a diff in the pull request: https://github.com/apache/spark/pull/19876#discussion_r176911201 --- Diff: mllib/src/main/scala/org/apache/spark/ml/regression/LinearRegression.scala --- @@ -710,15 +711,58 @@ class LinearRegressionModel private[ml

[GitHub] spark pull request #19876: [ML][SPARK-23783][SPARK-11239] Add PMML export to...

2018-03-31 Thread goungoun
Github user goungoun commented on a diff in the pull request: https://github.com/apache/spark/pull/19876#discussion_r178432400 --- Diff: mllib/src/main/scala/org/apache/spark/ml/regression/LinearRegression.scala --- @@ -710,15 +711,58 @@ class LinearRegressionModel private[ml

[GitHub] spark pull request #20782: [SPARK-SPARK-23627][SQL] Provide isEmpty in DataS...

2018-03-08 Thread goungoun
GitHub user goungoun opened a pull request: https://github.com/apache/spark/pull/20782 [SPARK-SPARK-23627][SQL] Provide isEmpty in DataSet ## What changes were proposed in this pull request? This PR adds a isEmpty in DataSet ## How was this patch tested

[GitHub] spark pull request #20782: [SPARK-SPARK-23627][SQL] Provide isEmpty in DataS...

2018-03-08 Thread goungoun
Github user goungoun closed the pull request at: https://github.com/apache/spark/pull/20782 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #20800: isEmpty in Dataset and its testSuite

2018-03-11 Thread goungoun
GitHub user goungoun opened a pull request: https://github.com/apache/spark/pull/20800 isEmpty in Dataset and its testSuite ## What changes were proposed in this pull request? This PR adds isEmpty() in DataSet ## How was this patch tested? Unit tests added

[GitHub] spark issue #20782: [SPARK-23627][SQL] Provide isEmpty in DataSet

2018-03-11 Thread goungoun
Github user goungoun commented on the issue: https://github.com/apache/spark/pull/20782 As unnecessary information is included, I closed this pull request. Please refer request #20800 instead of #20782. I am sorry for your inconvenience

[GitHub] spark pull request #20800: [SPARK-23627][SQL] Provide isEmpty in DataSet

2018-03-12 Thread goungoun
Github user goungoun commented on a diff in the pull request: https://github.com/apache/spark/pull/20800#discussion_r174002184 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -511,6 +511,12 @@ class Dataset[T] private[sql]( */ def isLocal

[GitHub] spark issue #20800: [SPARK-23627][SQL] Provide isEmpty in Dataset

2018-03-14 Thread goungoun
Github user goungoun commented on the issue: https://github.com/apache/spark/pull/20800 @HyukjinKwon, @maropu Just a gentle reminder. Jenkins is waiting for a comment like 'ok to test'. --- - To unsu

[GitHub] spark pull request #20800: [SPARK-23627][SQL] Provide isEmpty in Dataset

2018-03-14 Thread goungoun
Github user goungoun commented on a diff in the pull request: https://github.com/apache/spark/pull/20800#discussion_r174673621 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -511,6 +511,14 @@ class Dataset[T] private[sql]( */ def isLocal

[GitHub] spark issue #20800: [SPARK-23627][SQL] Provide isEmpty in Dataset

2018-03-14 Thread goungoun
Github user goungoun commented on the issue: https://github.com/apache/spark/pull/20800 @rxin, checking empty is likely to be a common process in every ETL batch job. I think it is the right place to provide that functionality. When a basic function is missing already supposed to be

[GitHub] spark issue #20800: [SPARK-23627][SQL] Provide isEmpty in Dataset

2018-03-15 Thread goungoun
Github user goungoun commented on the issue: https://github.com/apache/spark/pull/20800 For additional check that I mentioned. The following code shows that Spark users does not need to add take(1). ds.rdd.take(1).isEmpty is redundant. [RDD.scala](https://github.com/apache

[GitHub] spark issue #22428: [SPARK-25430][SQL] Add map parameter for withColumnRenam...

2018-10-01 Thread goungoun
Github user goungoun commented on the issue: https://github.com/apache/spark/pull/22428 Awesome! @HyukjinKwon , @gatorsmile thanks for good information. Let me look into it further. By the way, I still hope this conversation is open to users' voice, not limited with devel

[GitHub] spark pull request #22428: [SPARK-25430][SQL] Add map parameter for withColu...

2018-09-15 Thread goungoun
GitHub user goungoun opened a pull request: https://github.com/apache/spark/pull/22428 [SPARK-25430][SQL] Add map parameter for withColumnRenamed ## What changes were proposed in this pull request? This PR allows withColumnRenamed with a map input argument ## How was

[GitHub] spark issue #22428: [SPARK-25430][SQL] Add map parameter for withColumnRenam...

2018-09-16 Thread goungoun
Github user goungoun commented on the issue: https://github.com/apache/spark/pull/22428 @HyukjinKwon , thanks for your review. Actually, that is the reason that I open this pull request. I think it is better to giving reusable option to users than repeating too much of same code in