[jira] [Commented] (SPARK-29302) dynamic partition overwrite with speculation enabled

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165653#comment-17165653 ] Apache Spark commented on SPARK-29302: -- User 'WinkerDu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32457: Assignee: Apache Spark > logParam thresholds in DT/GBT/FM/LR/MLP >

[jira] [Assigned] (SPARK-30794) Stage Level scheduling: Add ability to set off heap memory

2020-07-27 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves reassigned SPARK-30794: - Assignee: Zhongwei Zhu > Stage Level scheduling: Add ability to set off heap memory >

[jira] [Resolved] (SPARK-30794) Stage Level scheduling: Add ability to set off heap memory

2020-07-27 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-30794. --- Fix Version/s: 3.1.0 Resolution: Fixed > Stage Level scheduling: Add ability to set

[jira] [Commented] (SPARK-29918) RecordBinaryComparator should check endianness when compared by long

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165611#comment-17165611 ] Apache Spark commented on SPARK-29918: -- User 'mundaym' has created a pull request for this issue:

[jira] [Created] (SPARK-32459) UDF regression of WrappedArray supporting caused by SPARK-31826

2020-07-27 Thread wuyi (Jira)
wuyi created SPARK-32459: Summary: UDF regression of WrappedArray supporting caused by SPARK-31826 Key: SPARK-32459 URL: https://issues.apache.org/jira/browse/SPARK-32459 Project: Spark Issue Type:

[jira] [Updated] (SPARK-32458) Mismatched row access sizes in tests

2020-07-27 Thread Michael Munday (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Munday updated SPARK-32458: --- Description: The RowEncoderSuite and UnsafeMapSuite tests fail on big-endian systems. This

[jira] [Created] (SPARK-32458) Mismatched row access sizes in tests

2020-07-27 Thread Michael Munday (Jira)
Michael Munday created SPARK-32458: -- Summary: Mismatched row access sizes in tests Key: SPARK-32458 URL: https://issues.apache.org/jira/browse/SPARK-32458 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-19169) columns changed orc table encouter 'IndexOutOfBoundsException' when read the old schema files

2020-07-27 Thread bianqi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165657#comment-17165657 ] bianqi commented on SPARK-19169: [~hyukjin.kwon] hello We also encountered this problem in the

[jira] [Commented] (SPARK-32458) Mismatched row access sizes in tests

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165604#comment-17165604 ] Apache Spark commented on SPARK-32458: -- User 'mundaym' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32458) Mismatched row access sizes in tests

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32458: Assignee: (was: Apache Spark) > Mismatched row access sizes in tests >

[jira] [Assigned] (SPARK-32458) Mismatched row access sizes in tests

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32458: Assignee: Apache Spark > Mismatched row access sizes in tests >

[jira] [Commented] (SPARK-29918) RecordBinaryComparator should check endianness when compared by long

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165614#comment-17165614 ] Apache Spark commented on SPARK-29918: -- User 'mundaym' has created a pull request for this issue:

[jira] [Resolved] (SPARK-32435) Remove heapq3 port from Python 3

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32435. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29229

[jira] [Assigned] (SPARK-32435) Remove heapq3 port from Python 3

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-32435: Assignee: Hyukjin Kwon > Remove heapq3 port from Python 3 >

[jira] [Commented] (SPARK-32425) Spark sequence() fails if start and end of range are identical timestamps

2020-07-27 Thread JinxinTang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165634#comment-17165634 ] JinxinTang commented on SPARK-32425: cc [~viirya]  Because I have fixed it by 

[jira] [Commented] (SPARK-27194) Job failures when task attempts do not clean up spark-staging parquet files

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165649#comment-17165649 ] Apache Spark commented on SPARK-27194: -- User 'WinkerDu' has created a pull request for this issue:

[jira] [Commented] (SPARK-29302) dynamic partition overwrite with speculation enabled

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165651#comment-17165651 ] Apache Spark commented on SPARK-29302: -- User 'WinkerDu' has created a pull request for this issue:

[jira] [Commented] (SPARK-20680) Spark-sql do not support for void column datatype of view

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165669#comment-17165669 ] Apache Spark commented on SPARK-20680: -- User 'ulysses-you' has created a pull request for this

[jira] [Commented] (SPARK-31851) Redesign PySpark documentation

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165546#comment-17165546 ] Hyukjin Kwon commented on SPARK-31851: -- The base work is done. I will create one more PR soon to

[jira] [Commented] (SPARK-32455) LogisticRegressionModel prediction optimization

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165552#comment-17165552 ] Apache Spark commented on SPARK-32455: -- User 'zhengruifeng' has created a pull request for this

[jira] [Commented] (SPARK-32455) LogisticRegressionModel prediction optimization

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165553#comment-17165553 ] Apache Spark commented on SPARK-32455: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-32455) LogisticRegressionModel prediction optimization

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32455: Assignee: (was: Apache Spark) > LogisticRegressionModel prediction optimization >

[jira] [Assigned] (SPARK-32455) LogisticRegressionModel prediction optimization

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32455: Assignee: Apache Spark > LogisticRegressionModel prediction optimization >

[jira] [Created] (SPARK-32456) Give better error message for union streams in append mode that don't have a watermark

2020-07-27 Thread Yuanjian Li (Jira)
Yuanjian Li created SPARK-32456: --- Summary: Give better error message for union streams in append mode that don't have a watermark Key: SPARK-32456 URL: https://issues.apache.org/jira/browse/SPARK-32456

[jira] [Commented] (SPARK-32417) Flaky test: BlockManagerDecommissionIntegrationSuite.verify that an already running task which is going to cache data succeeds on a decommissioned executor

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165466#comment-17165466 ] Apache Spark commented on SPARK-32417: -- User 'agrawaldevesh' has created a pull request for this

[jira] [Assigned] (SPARK-32417) Flaky test: BlockManagerDecommissionIntegrationSuite.verify that an already running task which is going to cache data succeeds on a decommissioned executor

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32417: Assignee: Apache Spark > Flaky test: BlockManagerDecommissionIntegrationSuite.verify

[jira] [Issue Comment Deleted] (SPARK-24194) HadoopFsRelation cannot overwrite a path that is also being read from

2020-07-27 Thread philipse (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] philipse updated SPARK-24194: - Comment: was deleted (was: Hi  is the issue closed ? can i try it in product env?   Thanks) >

[jira] [Commented] (SPARK-32434) Support Scala 2.13 in AbstractCommandBuilder and load-spark-env scripts

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165535#comment-17165535 ] Apache Spark commented on SPARK-32434: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-32434) Support Scala 2.13 in AbstractCommandBuilder and load-spark-env scripts

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165536#comment-17165536 ] Apache Spark commented on SPARK-32434: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Comment Edited] (SPARK-32453) Remove SPARK_SCALA_VERSION environment and let load-spark-env scripts detect it in AppVeyor

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165549#comment-17165549 ] Hyukjin Kwon edited comment on SPARK-32453 at 7/27/20, 8:57 AM: It will

[jira] [Created] (SPARK-32455) LogisticRegressionModel prediction optimization

2020-07-27 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-32455: Summary: LogisticRegressionModel prediction optimization Key: SPARK-32455 URL: https://issues.apache.org/jira/browse/SPARK-32455 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-32453) Remove SPARK_SCALA_VERSION environment and let load-spark-env scripts detect it in AppVeyor

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32453. -- Resolution: Invalid > Remove SPARK_SCALA_VERSION environment and let load-spark-env scripts

[jira] [Commented] (SPARK-32453) Remove SPARK_SCALA_VERSION environment and let load-spark-env scripts detect it in AppVeyor

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165549#comment-17165549 ] Hyukjin Kwon commented on SPARK-32453: -- It will be fixed together at

[jira] [Assigned] (SPARK-32456) Give better error message for union streams in append mode that don't have a watermark

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32456: Assignee: Apache Spark > Give better error message for union streams in append mode that

[jira] [Assigned] (SPARK-32456) Give better error message for union streams in append mode that don't have a watermark

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32456: Assignee: (was: Apache Spark) > Give better error message for union streams in

[jira] [Commented] (SPARK-32456) Give better error message for union streams in append mode that don't have a watermark

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165567#comment-17165567 ] Apache Spark commented on SPARK-32456: -- User 'xuanyuanking' has created a pull request for this

[jira] [Created] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-32457: Summary: logParam thresholds in DT/GBT/FM/LR/MLP Key: SPARK-32457 URL: https://issues.apache.org/jira/browse/SPARK-32457 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-32417) Flaky test: BlockManagerDecommissionIntegrationSuite.verify that an already running task which is going to cache data succeeds on a decommissioned executor

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32417: Assignee: (was: Apache Spark) > Flaky test:

[jira] [Commented] (SPARK-31587) R installation in Github Actions is being failed

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165511#comment-17165511 ] Apache Spark commented on SPARK-31587: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-29664) Column.getItem behavior is not consistent with Scala version

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165510#comment-17165510 ] Apache Spark commented on SPARK-29664: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Resolved] (SPARK-32188) API Reference

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32188. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29188

[jira] [Resolved] (SPARK-32179) Replace and redesign the documentation base

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32179. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29188

[jira] [Assigned] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32457: Assignee: (was: Apache Spark) > logParam thresholds in DT/GBT/FM/LR/MLP >

[jira] [Commented] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165573#comment-17165573 ] Apache Spark commented on SPARK-32457: -- User 'zhengruifeng' has created a pull request for this

[jira] [Commented] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165572#comment-17165572 ] Apache Spark commented on SPARK-32457: -- User 'zhengruifeng' has created a pull request for this

[jira] [Resolved] (SPARK-32425) Spark sequence() fails if start and end of range are identical timestamps

2020-07-27 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-32425. - Resolution: Duplicate > Spark sequence() fails if start and end of range are identical

[jira] [Commented] (SPARK-32459) UDF regression of WrappedArray supporting caused by SPARK-31826

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165698#comment-17165698 ] Apache Spark commented on SPARK-32459: -- User 'Ngone51' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32459) UDF regression of WrappedArray supporting caused by SPARK-31826

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32459: Assignee: (was: Apache Spark) > UDF regression of WrappedArray supporting caused by

[jira] [Commented] (SPARK-32459) UDF regression of WrappedArray supporting caused by SPARK-31826

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165696#comment-17165696 ] Apache Spark commented on SPARK-32459: -- User 'Ngone51' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32459) UDF regression of WrappedArray supporting caused by SPARK-31826

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32459: Assignee: Apache Spark > UDF regression of WrappedArray supporting caused by SPARK-31826

[jira] [Commented] (SPARK-32425) Spark sequence() fails if start and end of range are identical timestamps

2020-07-27 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165810#comment-17165810 ] L. C. Hsieh commented on SPARK-32425: - Thanks [~JinxinTang]. > Spark sequence() fails if start and

[jira] [Issue Comment Deleted] (SPARK-32431) The .schema() API behaves incorrectly for nested schemas that have column duplicates in case-insensitive mode

2020-07-27 Thread Maxim Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Gekk updated SPARK-32431: --- Comment: was deleted (was: I cannot reproduce the issue on master, branch-3.0 and branch-2.4. I

[jira] [Assigned] (SPARK-32420) Add handling for unique key in non-codegen hash join

2020-07-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-32420: --- Assignee: Cheng Su > Add handling for unique key in non-codegen hash join >

[jira] [Updated] (SPARK-31993) Generated code in 'concat_ws' fails to compile when splitting method is in effect

2020-07-27 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-31993: Component/s: (was: Spark Core) SQL > Generated code in 'concat_ws' fails to compile

[jira] [Commented] (SPARK-32429) Standalone Mode allow setting CUDA_VISIBLE_DEVICES on executor launch

2020-07-27 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165934#comment-17165934 ] Thomas Graves commented on SPARK-32429: --- So this doesn't address the task side, it addresses the

[jira] [Assigned] (SPARK-32424) Fix silent data change for timestamp parsing if overflow happens

2020-07-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-32424: --- Assignee: Kent Yao > Fix silent data change for timestamp parsing if overflow happens >

[jira] [Resolved] (SPARK-32424) Fix silent data change for timestamp parsing if overflow happens

2020-07-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-32424. - Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29220

[jira] [Commented] (SPARK-32332) AQE doesn't adequately allow for Columnar Processing extension

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165897#comment-17165897 ] Apache Spark commented on SPARK-32332: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-32332) AQE doesn't adequately allow for Columnar Processing extension

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165898#comment-17165898 ] Apache Spark commented on SPARK-32332: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-32429) Standalone Mode allow setting CUDA_VISIBLE_DEVICES on executor launch

2020-07-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165903#comment-17165903 ] Xiangrui Meng commented on SPARK-32429: --- Couple questions: 1. Which GPU resource name do we use?

[jira] [Updated] (SPARK-32431) The .schema() API behaves incorrectly for nested schemas that have column duplicates in case-insensitive mode

2020-07-27 Thread Maxim Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Gekk updated SPARK-32431: --- Description: The code below throws org.apache.spark.sql.AnalysisException: Found duplicate

[jira] [Created] (SPARK-32460) how spark collects non-match results after performing broadcast left outer join

2020-07-27 Thread farshad delavarpour (Jira)
farshad delavarpour created SPARK-32460: --- Summary: how spark collects non-match results after performing broadcast left outer join Key: SPARK-32460 URL: https://issues.apache.org/jira/browse/SPARK-32460

[jira] [Assigned] (SPARK-32443) Fix testCommandAvailable to use POSIX compatible `command -v`

2020-07-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-32443: - Assignee: Hyukjin Kwon (was: Dongjoon Hyun) > Fix testCommandAvailable to use POSIX

[jira] [Resolved] (SPARK-32443) Fix testCommandAvailable to use POSIX compatible `command -v`

2020-07-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-32443. --- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29241

[jira] [Resolved] (SPARK-32420) Add handling for unique key in non-codegen hash join

2020-07-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-32420. - Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29216

[jira] [Resolved] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread Huaxin Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Huaxin Gao resolved SPARK-32457. Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 29257

[jira] [Assigned] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread Huaxin Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Huaxin Gao reassigned SPARK-32457: -- Assignee: zhengruifeng > logParam thresholds in DT/GBT/FM/LR/MLP >

[jira] [Updated] (SPARK-32421) Add code-gen for shuffled hash join

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-32421: - Parent: SPARK-32461 Issue Type: Sub-task (was: Improvement) > Add code-gen for shuffled hash

[jira] [Created] (SPARK-32462) Don't save the previous search text for datatable

2020-07-27 Thread Kousuke Saruta (Jira)
Kousuke Saruta created SPARK-32462: -- Summary: Don't save the previous search text for datatable Key: SPARK-32462 URL: https://issues.apache.org/jira/browse/SPARK-32462 Project: Spark Issue

[jira] [Updated] (SPARK-21505) A dynamic join operator to improve the join reliability

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-21505: - Parent: SPARK-32461 Issue Type: Sub-task (was: New Feature) > A dynamic join operator to

[jira] [Commented] (SPARK-32461) Shuffled hash join improvement

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165951#comment-17165951 ] Cheng Su commented on SPARK-32461: -- Just FYI - I am working on each sub-tasks separately now. >

[jira] [Commented] (SPARK-32417) Flaky test: BlockManagerDecommissionIntegrationSuite.verify that an already running task which is going to cache data succeeds on a decommissioned executor

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165979#comment-17165979 ] Apache Spark commented on SPARK-32417: -- User 'holdenk' has created a pull request for this issue:

[jira] [Updated] (SPARK-32462) Don't save the previous search text for datatable

2020-07-27 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-32462: --- Description: DataTable is used in stage-page and executors-page for pagination and filter

[jira] [Updated] (SPARK-32383) Preserve hash join (BHJ and SHJ) stream side ordering

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-32383: - Parent: SPARK-32461 Issue Type: Sub-task (was: Improvement) > Preserve hash join (BHJ and SHJ)

[jira] [Updated] (SPARK-32420) Add handling for unique key in non-codegen hash join

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-32420: - Parent: SPARK-32461 Issue Type: Sub-task (was: Improvement) > Add handling for unique key in

[jira] [Updated] (SPARK-32399) Support full outer join in shuffled hash join and broadcast hash join

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-32399: - Parent: SPARK-32461 Issue Type: Sub-task (was: Improvement) > Support full outer join in

[jira] [Comment Edited] (SPARK-28210) Shuffle Storage API: Reads

2020-07-27 Thread Tianchen Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166027#comment-17166027 ] Tianchen Zhang edited comment on SPARK-28210 at 7/27/20, 11:40 PM: --- Hi

[jira] [Commented] (SPARK-28210) Shuffle Storage API: Reads

2020-07-27 Thread Tianchen Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166027#comment-17166027 ] Tianchen Zhang commented on SPARK-28210: Hi [~devaraj], do you mind share some ideas about your

[jira] [Created] (SPARK-32461) Shuffled hash join improvement

2020-07-27 Thread Cheng Su (Jira)
Cheng Su created SPARK-32461: Summary: Shuffled hash join improvement Key: SPARK-32461 URL: https://issues.apache.org/jira/browse/SPARK-32461 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-32330) Preserve shuffled hash join build side partitioning

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-32330: - Parent: SPARK-32461 Issue Type: Sub-task (was: Improvement) > Preserve shuffled hash join

[jira] [Updated] (SPARK-32286) Coalesce bucketed tables for shuffled hash join if applicable

2020-07-27 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-32286: - Parent: SPARK-32461 Issue Type: Sub-task (was: Improvement) > Coalesce bucketed tables for

[jira] [Assigned] (SPARK-32462) Don't save the previous search text for datatable

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32462: Assignee: Kousuke Saruta (was: Apache Spark) > Don't save the previous search text for

[jira] [Commented] (SPARK-32462) Don't save the previous search text for datatable

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166025#comment-17166025 ] Apache Spark commented on SPARK-32462: -- User 'sarutak' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32462) Don't save the previous search text for datatable

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32462: Assignee: Apache Spark (was: Kousuke Saruta) > Don't save the previous search text for

[jira] [Updated] (SPARK-31753) Add missing keywords in the SQL documents

2020-07-27 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-31753: - Affects Version/s: (was: 3.0.0) 3.0.1 > Add missing keywords

[jira] [Updated] (SPARK-31753) Add missing keywords in the SQL documents

2020-07-27 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-31753: - Affects Version/s: 3.0.0 > Add missing keywords in the SQL documents >

[jira] [Resolved] (SPARK-31753) Add missing keywords in the SQL documents

2020-07-27 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-31753. -- Fix Version/s: 3.0.1 Assignee: philipse Resolution: Fixed Resolved by 

[jira] [Created] (SPARK-32463) Document Data Type inference rule in SQL reference

2020-07-27 Thread Huaxin Gao (Jira)
Huaxin Gao created SPARK-32463: -- Summary: Document Data Type inference rule in SQL reference Key: SPARK-32463 URL: https://issues.apache.org/jira/browse/SPARK-32463 Project: Spark Issue Type:

[jira] [Created] (SPARK-32464) Support skew handling on join with one side that has no query stage

2020-07-27 Thread Wang, Gang (Jira)
Wang, Gang created SPARK-32464: -- Summary: Support skew handling on join with one side that has no query stage Key: SPARK-32464 URL: https://issues.apache.org/jira/browse/SPARK-32464 Project: Spark

[jira] [Resolved] (SPARK-32439) Override datasource implementation during look up via configuration

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32439. -- Resolution: Won't Fix > Override datasource implementation during look up via configuration >

[jira] [Commented] (SPARK-32361) Remove project if output is subset of child

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166094#comment-17166094 ] Hyukjin Kwon commented on SPARK-32361: -- Please fill JIRA description. > Remove project if output

[jira] [Updated] (SPARK-32369) pyspark foreach/foreachPartition send http request failed

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32369: - Description: I use urllib.request to send http request in foreach/foreachPartition. pyspark

[jira] [Commented] (SPARK-32359) Implement max_error metric evaluator for spark regression mllib

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166095#comment-17166095 ] Hyukjin Kwon commented on SPARK-32359: -- Please fill JIRA description. > Implement max_error metric

[jira] [Commented] (SPARK-32423) class 'DataFrame' returns instance of type(self) instead of DataFrame

2020-07-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166114#comment-17166114 ] Hyukjin Kwon commented on SPARK-32423: -- Can you show some pseudo codes? It's a bit difficult to

[jira] [Commented] (SPARK-32463) Document Data Type inference rule in SQL reference

2020-07-27 Thread Huaxin Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166045#comment-17166045 ] Huaxin Gao commented on SPARK-32463: [~planga82] You are welcomed to work on this if you have free

[jira] [Updated] (SPARK-32464) Support skew handling on join that has one side with no query stage

2020-07-27 Thread Wang, Gang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wang, Gang updated SPARK-32464: --- Summary: Support skew handling on join that has one side with no query stage (was: Support skew

[jira] [Assigned] (SPARK-32464) Support skew handling on join that has one side with no query stage

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32464: Assignee: (was: Apache Spark) > Support skew handling on join that has one side with

[jira] [Assigned] (SPARK-32464) Support skew handling on join that has one side with no query stage

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32464: Assignee: Apache Spark > Support skew handling on join that has one side with no query

[jira] [Assigned] (SPARK-32464) Support skew handling on join that has one side with no query stage

2020-07-27 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32464: Assignee: Apache Spark > Support skew handling on join that has one side with no query

  1   2   >