[GitHub] spark issue #16830: [MINOR][CORE] Fix incorrect documentation of WritableCon...

2017-02-07 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/16830 Updated with the problem description. Please have a look when you are free. I will review the other documentation descriptions ... --- If your project is set up for it, you can reply

[GitHub] spark pull request #16830: [MINOR][CORE] Fix incorrect documentation of Writ...

2017-02-07 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/16830 [MINOR][CORE] Fix incorrect documentation of WritableConverter ## What changes were proposed in this pull request? `WritableConverter` and `WritableFactory` work in opposite

[GitHub] spark issue #16830: [MINOR][CORE] Fix incorrect documentation of WritableCon...

2017-02-07 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/16830 Does this correspond exactly to your intention?

[GitHub] spark pull request #17303: [SPARK-19112][CORE] add codec for ZStandard

2017-03-15 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/17303 [SPARK-19112][CORE] add codec for ZStandard ## What changes were proposed in this pull request? Hadoop[^1] & HBase[^2] started to support ZStandard Compression from their re

[GitHub] spark issue #17225: [CORE] Support ZStandard Compression

2017-03-15 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/17225 @HyukjinKwon Thanks for the information. It seems that both the Jira issue and my PR are messed up; I will re-create the PR with the Jira issue.

[GitHub] spark pull request #17225: [CORE] Support ZStandard Compression

2017-03-15 Thread dongjinleekr
Github user dongjinleekr closed the pull request at: https://github.com/apache/spark/pull/17225

[GitHub] spark issue #17564: [MINOR][DOCS] Fix spacings in Structured Streaming Progr...

2017-04-11 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/17564 @srowen I just completed the review. I could not find any other typos in this document. :)

[GitHub] spark issue #17564: [MINOR][DOCS] Fix spacings in Structured Streaming Progr...

2017-04-07 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/17564 No problem! I will notify you after the review.

[GitHub] spark pull request #17564: [MINOR][DOCS] Fix spacings in Structured Streamin...

2017-04-07 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/17564 [MINOR][DOCS] Fix spacings in Structured Streaming Programming Guide ## What changes were proposed in this pull request? 1. Omitted space between the sentences: `... on static

[GitHub] spark pull request #17225: [CORE] Support ZStandard Compression

2017-03-09 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/17225 [CORE] Support ZStandard Compression ## What changes were proposed in this pull request? Hadoop will support ZStandard Compression from 2.9.0. This update enables saving a file

[GitHub] spark pull request #17225: [CORE] Support ZStandard Compression

2017-03-09 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/17225#discussion_r105342714 --- Diff: core/src/main/scala/org/apache/spark/io/CompressionCodec.scala --- @@ -49,13 +50,14 @@ private[spark] object CompressionCodec

[GitHub] spark pull request #21501: [SPARK-15064][ML] Locale support in StopWordsRemo...

2018-06-12 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/21501#discussion_r194623958 --- Diff: python/pyspark/ml/feature.py --- @@ -2582,25 +2582,31 @@ class StopWordsRemover(JavaTransformer, HasInputCol, HasOutputCol, JavaMLReadabl

[GitHub] spark issue #21525: [SPARK-24513][ML] Attribute support in UnaryTransformer

2018-06-12 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/21525 @jkbradley Excuse me. Could you have a look when you are free? Thanks. --- - To unsubscribe, e-mail: reviews-unsubscr

[GitHub] spark issue #21501: [SPARK-15064][ML] Locale support in StopWordsRemover

2018-06-12 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/21501 @mengxr Thank you very much for your kind guidance. If you have some time, please have a look at my other PR, #21525. I found this issue while I was reading [an issue on HashingTF](https

[GitHub] spark pull request #21501: [SPARK-15064][ML] Locale support in StopWordsRemo...

2018-06-09 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/21501#discussion_r194222392 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/StopWordsRemover.scala --- @@ -84,7 +86,28 @@ class StopWordsRemover @Since("

[GitHub] spark pull request #21501: [SPARK-15064][ML] Locale support in StopWordsRemo...

2018-06-09 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/21501#discussion_r194222407 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/StopWordsRemover.scala --- @@ -84,7 +86,28 @@ class StopWordsRemover @Since("

[GitHub] spark pull request #21501: [SPARK-15064][ML] Locale support in StopWordsRemo...

2018-06-10 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/21501#discussion_r194296251 --- Diff: python/pyspark/ml/feature.py --- @@ -2610,6 +2610,9 @@ def setParams(self, inputCol=None, outputCol=None, stopWords=None, caseSensitive

[GitHub] spark pull request #21525: [SPARK-24513][ML] Attribute support in UnaryTrans...

2018-06-11 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/21525 [SPARK-24513][ML] Attribute support in UnaryTransformer ## What changes were proposed in this pull request? This PR adds Metadata support in `UnaryTransformer`, as a preliminary work

[GitHub] spark issue #21501: [SPARK-15064][ML] Locale support in StopWordsRemover

2018-06-11 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/21501 Finally, I made it work by adding `StopWordsRemover.getDefaultLocale`! How about this approach? @mengxr @viirya
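
For readers unfamiliar with why a locale matters for stop word removal, the following is a minimal, self-contained Python sketch. It is not the code from this PR and does not use Spark; `lower_for_locale` and `remove_stop_words` are hypothetical helpers introduced only for illustration. It assumes case-insensitive matching and uses the Turkish rule that uppercase `I` lowercases to dotless `ı`, so the same tokens can match or miss a stop word depending on the locale used for lowercasing.

```python
def lower_for_locale(text: str, locale: str) -> str:
    """Lowercase `text`; hypothetical helper, not a Spark API.

    For Turkish ("tr") and Azerbaijani ("az"), apply the dotted/dotless-i
    rules before the generic lowercase: "I" -> "ı" (U+0131), "İ" (U+0130) -> "i".
    """
    language = locale.split("_")[0]
    if language in ("tr", "az"):
        text = text.replace("I", "\u0131").replace("\u0130", "i")
    return text.lower()


def remove_stop_words(tokens, stop_words, locale="en_US"):
    """Drop tokens that match a stop word case-insensitively under `locale`."""
    lowered_stop_words = {lower_for_locale(w, locale) for w in stop_words}
    return [t for t in tokens if lower_for_locale(t, locale) not in lowered_stop_words]


tokens = ["ILIK", "su"]
stop_words = ["\u0131l\u0131k"]  # "ılık", written with dotless i

# Under Turkish rules "ILIK" lowercases to "ılık" and is removed.
turkish_result = remove_stop_words(tokens, stop_words, locale="tr_TR")

# Under English rules "ILIK" lowercases to "ilik", which does not match.
english_result = remove_stop_words(tokens, stop_words, locale="en_US")
```

In this sketch the locale only changes how strings are lowercased before comparison; the stop word list itself is unchanged.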

[GitHub] spark pull request #21501: [SPARK-15064][ML] Locale support in StopWordsRemo...

2018-06-11 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/21501#discussion_r194342986 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/StopWordsRemover.scala --- @@ -84,7 +86,28 @@ class StopWordsRemover @Since("

[GitHub] spark issue #21525: [SPARK-24513][ML] Attribute support in UnaryTransformer

2018-06-11 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/21525 If needed, I can propose a draft version of [SPARK-13998](https://issues.apache.org/jira/browse/SPARK-13998) implemented on top of this work

[GitHub] spark pull request #21501: [SPARK-15064][ML] Locale support in StopWordsRemo...

2018-06-06 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/21501 [SPARK-15064][ML] Locale support in StopWordsRemover ## What changes were proposed in this pull request? Add locale support for `StopWordsRemover`. ## How was this patch

[GitHub] spark pull request #22282: [SPARK-23539][SS] Add support for Kafka headers i...

2018-08-30 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/22282#discussion_r214198620 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaWriteTask.scala --- @@ -131,9 +158,25 @@ private[kafka010

[GitHub] spark issue #22282: [SPARK-23539][SS] Add support for Kafka headers in Struc...

2018-09-06 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/22282 retest this please.

[GitHub] spark issue #22282: [SPARK-23539][SS] Add support for Kafka headers in Struc...

2018-09-06 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/22282 Retest this please.

[GitHub] spark pull request #21525: [SPARK-24513][ML] Attribute support in UnaryTrans...

2018-09-10 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/21525#discussion_r216333687 --- Diff: mllib/src/main/scala/org/apache/spark/ml/Transformer.scala --- @@ -116,10 +116,17 @@ abstract class UnaryTransformer[IN, OUT, T

[GitHub] spark issue #22282: [SPARK-23539][SS] Add support for Kafka headers in Struc...

2018-09-10 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/22282 retest this please.

[GitHub] spark pull request #22282: [SPARK-23539][SS] Add support for Kafka headers i...

2018-08-31 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/22282#discussion_r214345173 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaWriteTask.scala --- @@ -131,9 +158,25 @@ private[kafka010

[GitHub] spark pull request #22282: [SPARK-23539][SS] Add support for Kafka headers i...

2018-08-31 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/22282#discussion_r214345393 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeArrayData.java --- @@ -473,17 +474,6 @@ public static

[GitHub] spark pull request #22342: Feature/fix kafka sql trivials

2018-09-05 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/22342 Feature/fix kafka sql trivials ## What changes were proposed in this pull request? Fix unused imports & outdated comments on `kafka-0-10-sql` module. (Found while I was wor

[GitHub] spark issue #22282: [SPARK-23539][SS] Add support for Kafka headers in Struc...

2018-11-12 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/22282 cc/ @zsxwing @tdas @dongjoon-hyun @srowen Rebased onto the latest master. Please have a look when you are free. Thanks in advance

[GitHub] spark pull request #22282: [SPARK-23539][SS] Add support for Kafka headers i...

2018-08-30 Thread dongjinleekr
GitHub user dongjinleekr opened a pull request: https://github.com/apache/spark/pull/22282 [SPARK-23539][SS] Add support for Kafka headers in Structured Streaming ## What changes were proposed in this pull request? This update adds support for Kafka Headers functionality

[GitHub] spark issue #22282: [SPARK-23539][SS] Add support for Kafka headers in Struc...

2018-08-30 Thread dongjinleekr
Github user dongjinleekr commented on the issue: https://github.com/apache/spark/pull/22282 As you can see, this PR consists of 3 parts: 1. Extend `UnsafeArrayData`, `UnsafeMapData` (commit 1~6) 2. Implement Kafka Headers functionality (commit 7, 10) 3. Update unit

[GitHub] spark pull request #22282: [SPARK-23539][SS] Add support for Kafka headers i...

2018-09-04 Thread dongjinleekr
Github user dongjinleekr commented on a diff in the pull request: https://github.com/apache/spark/pull/22282#discussion_r214856258 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaWriteTask.scala --- @@ -88,7 +92,30 @@ private[kafka010] abstract