[GitHub] [spark] AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496203279 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496203276 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496214750 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496214759 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496180912 **[Test build #105834 has

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496216722 Test PASSed. Refer to this link for build

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496216713 Build finished. Test PASSed.

[GitHub] [spark] SparkQA commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
SparkQA commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496217361 **[Test build #105837 has

[GitHub] [spark] AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496240190 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table URL: https://github.com/apache/spark/pull/24721#issuecomment-496245408 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table

2019-05-27 Thread GitBox
SparkQA commented on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table URL: https://github.com/apache/spark/pull/24721#issuecomment-496245256 **[Test build #105838 has

[GitHub] [spark] SparkQA removed a comment on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table URL: https://github.com/apache/spark/pull/24721#issuecomment-496221955 **[Test build #105838 has

[GitHub] [spark] AmplabJenkins commented on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table URL: https://github.com/apache/spark/pull/24721#issuecomment-496245400 Merged build finished. Test FAILed. This is an

[GitHub] [spark] AmplabJenkins removed a comment on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host URL: https://github.com/apache/spark/pull/24554#issuecomment-496247636 Build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins removed a comment on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host URL: https://github.com/apache/spark/pull/24554#issuecomment-496247639 Test PASSed. Refer to this link for build results

[GitHub] [spark] SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496247855 **[Test build #105835 has

[GitHub] [spark] AmplabJenkins commented on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module URL: https://github.com/apache/spark/pull/24416#issuecomment-496249677 Merged build finished. Test PASSed. This is an

[GitHub] [spark] AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496249668 Test PASSed. Refer to this link for build results

[GitHub] [spark] AmplabJenkins commented on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module URL: https://github.com/apache/spark/pull/24416#issuecomment-496249687 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496249660 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496249668 Test PASSed. Refer to this link for build

[GitHub] [spark] AmplabJenkins removed a comment on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module URL: https://github.com/apache/spark/pull/24416#issuecomment-496249677 Merged build finished. Test PASSed. This is an

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496249660 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins removed a comment on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24416: [SPARK-27521][SQL] move data source v2 to catalyst module URL: https://github.com/apache/spark/pull/24416#issuecomment-496249687 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host URL: https://github.com/apache/spark/pull/24554#issuecomment-496261411 Test PASSed. Refer to this link for build results (access

[GitHub] [spark] AmplabJenkins commented on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24554: [SPARK-27622][Core] Avoiding the network when block manager fetches disk persisted RDD blocks from the same host URL: https://github.com/apache/spark/pull/24554#issuecomment-496261407 Merged build finished. Test PASSed.

[GitHub] [spark] SparkQA commented on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData

2019-05-27 Thread GitBox
SparkQA commented on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData URL: https://github.com/apache/spark/pull/24011#issuecomment-496177647 **[Test build #105829 has

[GitHub] [spark] SparkQA removed a comment on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData URL: https://github.com/apache/spark/pull/24011#issuecomment-496140830 **[Test build #105829 has

[GitHub] [spark] AmplabJenkins commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496196811 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496196804 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins commented on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second"

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second" URL: https://github.com/apache/spark/pull/24472#issuecomment-496202241 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second"

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second" URL: https://github.com/apache/spark/pull/24472#issuecomment-496202231 Merged build finished. Test PASSed. This

[GitHub] [spark] gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#discussion_r287766069 ## File path:

[GitHub] [spark] BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed

2019-05-27 Thread GitBox
BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed URL: https://github.com/apache/spark/pull/24720#issuecomment-496216253 > This would make it update metrics on every write. It appears this is purposely done only every 16,000

[GitHub] [spark] AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496216722 Test PASSed. Refer to this link for build results

[GitHub] [spark] AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496216713 Build finished. Test PASSed.

[GitHub] [spark] BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed

2019-05-27 Thread GitBox
BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed URL: https://github.com/apache/spark/pull/24720#issuecomment-496216253 > This would make it update metrics on every write. It appears this is purposely done only every 16,000

[GitHub] [spark] BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed

2019-05-27 Thread GitBox
BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed URL: https://github.com/apache/spark/pull/24720#issuecomment-496216253 > This would make it update metrics on every write. It appears this is purposely done only every 16,000

[GitHub] [spark] cloud-fan closed pull request #24565: [SPARK-27665][Core] Split fetch shuffle blocks protocol from OpenBlocks

2019-05-27 Thread GitBox
cloud-fan closed pull request #24565: [SPARK-27665][Core] Split fetch shuffle blocks protocol from OpenBlocks URL: https://github.com/apache/spark/pull/24565 This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496241577 Test PASSed. Refer to this link for build

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496241567 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496241567 Merged build finished. Test PASSed.

[GitHub] [spark] gengliangwang commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
gengliangwang commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496241867 retest this please. This is an

[GitHub] [spark] SparkQA commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
SparkQA commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496250208 **[Test build #105843 has

[GitHub] [spark] hvanhovell closed pull request #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData

2019-05-27 Thread GitBox
hvanhovell closed pull request #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData URL: https://github.com/apache/spark/pull/24011 This is an automated message from the Apache Git Service. To

[GitHub] [spark] cloud-fan commented on a change in pull request #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
cloud-fan commented on a change in pull request #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#discussion_r287836766 ## File path:

[GitHub] [spark] SparkQA commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
SparkQA commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496180912 **[Test build #105834 has

[GitHub] [spark] gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#discussion_r287766069 ## File path:

[GitHub] [spark] HyukjinKwon closed pull request #24675: [SPARK-27803][SQL][PYTHON] Fix column pruning for Python UDF

2019-05-27 Thread GitBox
HyukjinKwon closed pull request #24675: [SPARK-27803][SQL][PYTHON] Fix column pruning for Python UDF URL: https://github.com/apache/spark/pull/24675 This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] srowen commented on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed

2019-05-27 Thread GitBox
srowen commented on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed URL: https://github.com/apache/spark/pull/24720#issuecomment-496201707 This would make it update metrics on every write. It appears this is purposely done only every 16,000 _records_ for

[GitHub] [spark] SparkQA commented on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second"

2019-05-27 Thread GitBox
SparkQA commented on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second" URL: https://github.com/apache/spark/pull/24472#issuecomment-496201716 **[Test build #105833 has

[GitHub] [spark] SparkQA removed a comment on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second"

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24472: [SPARK-27578][SQL] Add support for "interval '23:59:59' hour to second" URL: https://github.com/apache/spark/pull/24472#issuecomment-496167670 **[Test build #105833 has

[GitHub] [spark] BestOreo commented on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed

2019-05-27 Thread GitBox
BestOreo commented on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed URL: https://github.com/apache/spark/pull/24720#issuecomment-496216253 > This would make it update metrics on every write. It appears this is purposely done only every 16,000 _records_

[GitHub] [spark] BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed

2019-05-27 Thread GitBox
BestOreo edited a comment on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed URL: https://github.com/apache/spark/pull/24720#issuecomment-496216253 > This would make it update metrics on every write. It appears this is purposely done only every 16,000

[GitHub] [spark] srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics

2019-05-27 Thread GitBox
srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics URL: https://github.com/apache/spark/pull/24717#discussion_r287792850 ## File path: mllib/src/main/scala/org/apache/spark/mllib/evaluation/MultilabelMetrics.scala

[GitHub] [spark] srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics

2019-05-27 Thread GitBox
srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics URL: https://github.com/apache/spark/pull/24717#discussion_r287800833 ## File path: mllib/src/main/scala/org/apache/spark/mllib/evaluation/MultilabelMetrics.scala

[GitHub] [spark] srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics

2019-05-27 Thread GitBox
srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics URL: https://github.com/apache/spark/pull/24717#discussion_r287788064 ## File path: mllib/src/main/scala/org/apache/spark/mllib/evaluation/MulticlassMetrics.scala

[GitHub] [spark] srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics

2019-05-27 Thread GitBox
srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics URL: https://github.com/apache/spark/pull/24717#discussion_r287792619 ## File path: mllib/src/main/scala/org/apache/spark/mllib/evaluation/MultilabelMetrics.scala

[GitHub] [spark] srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics

2019-05-27 Thread GitBox
srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics URL: https://github.com/apache/spark/pull/24717#discussion_r287792148 ## File path: mllib/src/main/scala/org/apache/spark/mllib/evaluation/MultilabelMetrics.scala

[GitHub] [spark] srowen commented on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed

2019-05-27 Thread GitBox
srowen commented on issue #24720: [SPARK-27852][Spark Core] updateBytesWritten() operaton is missed URL: https://github.com/apache/spark/pull/24720#issuecomment-496222455 Look at `recordWritten()`. This is an automated

[GitHub] [spark] srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics

2019-05-27 Thread GitBox
srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics URL: https://github.com/apache/spark/pull/24717#discussion_r287791954 ## File path: mllib/src/main/scala/org/apache/spark/mllib/evaluation/MultilabelMetrics.scala

[GitHub] [spark] srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics

2019-05-27 Thread GitBox
srowen commented on a change in pull request #24717: [SPARK-27847][ML] One-Pass MultilabelMetrics & MulticlassMetrics URL: https://github.com/apache/spark/pull/24717#discussion_r287793291 ## File path: mllib/src/main/scala/org/apache/spark/mllib/evaluation/MultilabelMetrics.scala

[GitHub] [spark] AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496241577 Test PASSed. Refer to this link for build results

[GitHub] [spark] Ngone51 commented on a change in pull request #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
Ngone51 commented on a change in pull request #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#discussion_r287832950 ## File path:

[GitHub] [spark] AmplabJenkins removed a comment on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData URL: https://github.com/apache/spark/pull/24011#issuecomment-496178100 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData URL: https://github.com/apache/spark/pull/24011#issuecomment-496178100 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData URL: https://github.com/apache/spark/pull/24011#issuecomment-496178095 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins commented on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData

2019-05-27 Thread GitBox
AmplabJenkins commented on issue #24011: [SPARK-27071][CORE] Expose additional metrics in status.api.v1.StageData URL: https://github.com/apache/spark/pull/24011#issuecomment-496178095 Merged build finished. Test PASSed.

[GitHub] [spark] HyukjinKwon commented on a change in pull request #24335: [SPARK-27425][SQL] Add count_if functions

2019-05-27 Thread GitBox
HyukjinKwon commented on a change in pull request #24335: [SPARK-27425][SQL] Add count_if functions URL: https://github.com/apache/spark/pull/24335#discussion_r287773618 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CountIf.scala

[GitHub] [spark] AmplabJenkins removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496214750 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496214759 Test PASSed. Refer to this link for build results (access rights to CI server

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496217895 Test FAILed. Refer to this link for build

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496242706 Merged build finished. Test FAILed.

[GitHub] [spark] Ngone51 commented on issue #20056: [SPARK-22878] [CORE] Count totalDroppedEvents for LiveListenerBus

2019-05-27 Thread GitBox
Ngone51 commented on issue #20056: [SPARK-22878] [CORE] Count totalDroppedEvents for LiveListenerBus URL: https://github.com/apache/spark/pull/20056#issuecomment-496242898 Hi, @zuotingbing , can you file a JIRA ticket for the issue and given more details please ? We can discuss more at

[GitHub] [spark] SparkQA removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496242115 **[Test build #105840 has

[GitHub] [spark] AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496242713 Test FAILed. Refer to this link for build

[GitHub] [spark] SparkQA removed a comment on issue #24711: [Minor][SS]avoid inefficient sort when getLatest in HDFSMetadataLog

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24711: [Minor][SS]avoid inefficient sort when getLatest in HDFSMetadataLog URL: https://github.com/apache/spark/pull/24711#issuecomment-496201065 **[Test build #4785 has

[GitHub] [spark] SparkQA commented on issue #24711: [Minor][SS]avoid inefficient sort when getLatest in HDFSMetadataLog

2019-05-27 Thread GitBox
SparkQA commented on issue #24711: [Minor][SS]avoid inefficient sort when getLatest in HDFSMetadataLog URL: https://github.com/apache/spark/pull/24711#issuecomment-496255115 **[Test build #4785 has

[GitHub] [spark] cloud-fan commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
cloud-fan commented on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496255028 LGTM This is an automated

[GitHub] [spark] SparkQA removed a comment on issue #24565: [SPARK-27665][Core] Split fetch shuffle blocks protocol from OpenBlocks

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24565: [SPARK-27665][Core] Split fetch shuffle blocks protocol from OpenBlocks URL: https://github.com/apache/spark/pull/24565#issuecomment-496151176 **[Test build #105831 has

[GitHub] [spark] SparkQA commented on issue #24565: [SPARK-27665][Core] Split fetch shuffle blocks protocol from OpenBlocks

2019-05-27 Thread GitBox
SparkQA commented on issue #24565: [SPARK-27665][Core] Split fetch shuffle blocks protocol from OpenBlocks URL: https://github.com/apache/spark/pull/24565#issuecomment-496184894 **[Test build #105831 has

[GitHub] [spark] SparkQA commented on issue #24716: [SPARK-25944][R][BUILD] AppVeyor change to latest R version (3.6.0)

2019-05-27 Thread GitBox
SparkQA commented on issue #24716: [SPARK-25944][R][BUILD] AppVeyor change to latest R version (3.6.0) URL: https://github.com/apache/spark/pull/24716#issuecomment-496191131 **[Test build #105832 has

[GitHub] [spark] gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#discussion_r287765847 ## File path:

[GitHub] [spark] SparkQA removed a comment on issue #24716: [SPARK-25944][R][BUILD] AppVeyor change to latest R version (3.6.0)

2019-05-27 Thread GitBox
SparkQA removed a comment on issue #24716: [SPARK-25944][R][BUILD] AppVeyor change to latest R version (3.6.0) URL: https://github.com/apache/spark/pull/24716#issuecomment-496158744 **[Test build #105832 has

[GitHub] [spark] gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
gengliangwang commented on a change in pull request #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#discussion_r287766069 ## File path:

[GitHub] [spark] AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496196804 Merged build finished. Test PASSed.

[GitHub] [spark] AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496196811 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496197389 **[Test build #105835 has

[GitHub] [spark] HyukjinKwon commented on issue #24675: [SPARK-27803][SQL][PYTHON] Fix column pruning for Python UDF

2019-05-27 Thread GitBox
HyukjinKwon commented on issue #24675: [SPARK-27803][SQL][PYTHON] Fix column pruning for Python UDF URL: https://github.com/apache/spark/pull/24675#issuecomment-496199352 Merged to master. This is an automated message from

[GitHub] [spark] HyukjinKwon commented on issue #24700: [SPARK-27834][SQL][R][PYTHON] Make separate PySpark/SparkR vectorization configurations

2019-05-27 Thread GitBox
HyukjinKwon commented on issue #24700: [SPARK-27834][SQL][R][PYTHON] Make separate PySpark/SparkR vectorization configurations URL: https://github.com/apache/spark/pull/24700#issuecomment-496199115 Adding @BryanCutler, @viirya too. Let me go ahead with it. The naming is a bit odd but I

[GitHub] [spark] HyukjinKwon commented on issue #24700: [SPARK-27834][SQL][R][PYTHON] Make separate PySpark/SparkR vectorization configurations

2019-05-27 Thread GitBox
HyukjinKwon commented on issue #24700: [SPARK-27834][SQL][R][PYTHON] Make separate PySpark/SparkR vectorization configurations URL: https://github.com/apache/spark/pull/24700#issuecomment-496199224 If there are no more concerns than that, let me go ahead.

[GitHub] [spark] SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496203886 **[Test build #105836 has

[GitHub] [spark] cloud-fan opened a new pull request #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table

2019-05-27 Thread GitBox
cloud-fan opened a new pull request #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table URL: https://github.com/apache/spark/pull/24721 ## What changes were proposed in this pull request? When inserting data to a data source v1 table, Spark will forcibly cast

[GitHub] [spark] SparkQA commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series

2019-05-27 Thread GitBox
SparkQA commented on issue #24643: [SPARK-26412][PySpark][SQL][WIP] Allow Pandas UDF to take an iterator of pd.Series or an iterator of tuple of pd.Series URL: https://github.com/apache/spark/pull/24643#issuecomment-496242115 **[Test build #105840 has

[GitHub] [spark] SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase

2019-05-27 Thread GitBox
SparkQA commented on issue #24719: [SPARK-27849][SQL] Redact treeString of FileTable and DataSourceV2ScanExecBase URL: https://github.com/apache/spark/pull/24719#issuecomment-496242114 **[Test build #105839 has

[GitHub] [spark] AmplabJenkins removed a comment on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table URL: https://github.com/apache/spark/pull/24721#issuecomment-496245408 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24721: [SPARK-27856][SQL] do not forcibly add cast when inserting table URL: https://github.com/apache/spark/pull/24721#issuecomment-496245400 Merged build finished. Test FAILed.

[GitHub] [spark] srowen closed pull request #24648: [SPARK-27777][ML] Eliminate uncessary sliding job in AreaUnderCurve

2019-05-27 Thread GitBox
srowen closed pull request #24648: [SPARK-2][ML] Eliminate uncessary sliding job in AreaUnderCurve URL: https://github.com/apache/spark/pull/24648 This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] kiszk commented on a change in pull request #24709: [SPARK-27841][SQL] Improve UTF8String to/fromString()/numBytesForFirstByte() performance

2019-05-27 Thread GitBox
kiszk commented on a change in pull request #24709: [SPARK-27841][SQL] Improve UTF8String to/fromString()/numBytesForFirstByte() performance URL: https://github.com/apache/spark/pull/24709#discussion_r287837386 ## File path:

[GitHub] [spark] AmplabJenkins removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens

2019-05-27 Thread GitBox
AmplabJenkins removed a comment on issue #24569: [SPARK-23191][CORE] Warn rather than terminate when duplicate worker register happens URL: https://github.com/apache/spark/pull/24569#issuecomment-496180456 Test PASSed. Refer to this link for build results (access rights to CI server

  1   2   3   4   5   6   7   8   >