[GitHub] [spark] SparkQA commented on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
SparkQA commented on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976236833 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50011/ -- This is an automated message from the Apache

[GitHub] [spark] wankunde commented on pull request #34629: [SPARK-37355][CORE]Avoid Block Manager registrations when Executor is shutting down

2021-11-23 Thread GitBox
wankunde commented on pull request #34629: URL: https://github.com/apache/spark/pull/34629#issuecomment-976248881 Hi @Ngone51, could you help me review this PR?

[GitHub] [spark] JoshRosen opened a new pull request #34691: [SPARK-37447][SQL] Cache LogicalPlan.isStreaming() result in a lazy val

2021-11-23 Thread GitBox
JoshRosen opened a new pull request #34691: URL: https://github.com/apache/spark/pull/34691 ### What changes were proposed in this pull request? This PR adds caching to `LogicalPlan.isStreaming()`: the default implementation's result will now be cached in a `private lazy val`.
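The caching idea in this description can be illustrated with a minimal Python analog. This is only a sketch, not Spark's Scala code: `PlanNode`, `streaming_source`, and `computations` are hypothetical names invented for the example, and `functools.cached_property` stands in for a Scala `lazy val`.

```python
from functools import cached_property


class PlanNode:
    """Hypothetical stand-in for a plan node; not Spark's LogicalPlan."""

    def __init__(self, children=(), streaming_source=False):
        self.children = tuple(children)
        self.streaming_source = streaming_source
        self.computations = 0  # counts how often the recursive check actually runs

    @cached_property
    def is_streaming(self):
        # Evaluated on first access only; later reads return the stored value,
        # so repeated calls do not walk the subtree again.
        self.computations += 1
        return self.streaming_source or any(c.is_streaming for c in self.children)


leaf = PlanNode(streaming_source=True)
root = PlanNode(children=[PlanNode(children=[leaf])])

first = root.is_streaming   # walks the tree once
second = root.is_streaming  # served from the cache
```

Without the cache, every read of the property on a deep tree would repeat the full recursive traversal; with it, each node computes its answer at most once.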

[GitHub] [spark] HyukjinKwon removed a comment on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
HyukjinKwon removed a comment on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976191464 retest this please

[GitHub] [spark] SparkQA commented on pull request #34688: [WIP][SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
SparkQA commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976261386 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50009/

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976292574 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50014/

[GitHub] [spark] SparkQA commented on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
SparkQA commented on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976302458 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50011/

[GitHub] [spark] AmplabJenkins commented on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976302494 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50011/

[GitHub] [spark] SparkQA commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
SparkQA commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976327890 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50012/

[GitHub] [spark] SparkQA commented on pull request #34691: [SPARK-37447][SQL] Cache LogicalPlan.isStreaming() result in a lazy val

2021-11-23 Thread GitBox
SparkQA commented on pull request #34691: URL: https://github.com/apache/spark/pull/34691#issuecomment-976339112 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50015/

[GitHub] [spark] SparkQA commented on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
SparkQA commented on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976389654 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50019/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976238045 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50008/

[GitHub] [spark] AngersZhuuuu commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AngersZhuuuu commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976238124 ping @sunchao @dongjoon-hyun @HyukjinKwon

[GitHub] [spark] SparkQA commented on pull request #34687: [SPARK-36231][PYTHON] Support arithmetic operations of decimal(nan) series

2021-11-23 Thread GitBox
SparkQA commented on pull request #34687: URL: https://github.com/apache/spark/pull/34687#issuecomment-976258058 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50010/

[GitHub] [spark] gengliangwang commented on a change in pull request #34596: [SPARK-37326][SQL] Support TimestampNTZ in CSV data source

2021-11-23 Thread GitBox
gengliangwang commented on a change in pull request #34596: URL: https://github.com/apache/spark/pull/34596#discussion_r754904724 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala ## @@ -442,17 +442,22 @@ object DateTimeUtils {

[GitHub] [spark] SparkQA commented on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
SparkQA commented on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976299121 **[Test build #145538 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145538/testReport)** for PR 34677 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976207850 **[Test build #145538 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145538/testReport)** for PR 34677 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976300656 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145538/

[GitHub] [spark] SparkQA removed a comment on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976209156 **[Test build #145539 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145539/testReport)** for PR 34689 at commit

[GitHub] [spark] SparkQA commented on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
SparkQA commented on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976336510 **[Test build #145539 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145539/testReport)** for PR 34689 at commit

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
HyukjinKwon commented on a change in pull request #34688: URL: https://github.com/apache/spark/pull/34688#discussion_r754951023 ## File path: python/pyspark/__init__.pyi ## @@ -38,7 +38,7 @@ from pyspark.profiler import ( # noqa: F401 from pyspark.rdd import RDD as RDD,

[GitHub] [spark] SparkQA removed a comment on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976236122 **[Test build #145541 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145541/testReport)** for PR 34070 at commit

[GitHub] [spark] SparkQA commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
SparkQA commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976391877 **[Test build #145535 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145535/testReport)** for PR 34611 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976191725 **[Test build #145535 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145535/testReport)** for PR 34611 at commit

[GitHub] [spark] SparkQA commented on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
SparkQA commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976422968 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50018/

[GitHub] [spark] Yikun commented on a change in pull request #34687: [SPARK-36231][PYTHON] Support arithmetic operations of decimal(nan) series

2021-11-23 Thread GitBox
Yikun commented on a change in pull request #34687: URL: https://github.com/apache/spark/pull/34687#discussion_r754789113 ## File path: python/pyspark/pandas/tests/data_type_ops/testing_utils.py ## @@ -49,8 +49,15 @@ def numeric_pdf(self): dtypes = [np.int32, int,

[GitHub] [spark] AngersZhuuuu commented on pull request #28034: [SPARK-31268][CORE]Initial Task Executor Metrics with latestMetrics

2021-11-23 Thread GitBox
AngersZhuuuu commented on pull request #28034: URL: https://github.com/apache/spark/pull/28034#issuecomment-976304141 ping @mridulm Not sure if LinkedIn will care about this problem. Are you interested in this?

[GitHub] [spark] AngersZhuuuu removed a comment on pull request #28034: [SPARK-31268][CORE]Initial Task Executor Metrics with latestMetrics

2021-11-23 Thread GitBox
AngersZhuuuu removed a comment on pull request #28034: URL: https://github.com/apache/spark/pull/28034#issuecomment-885439832

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976333736 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50014/

[GitHub] [spark] SparkQA commented on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
SparkQA commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976353055 **[Test build #145546 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145546/testReport)** for PR 34688 at commit

[GitHub] [spark] HyukjinKwon commented on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
HyukjinKwon commented on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976353742 retest this please

[GitHub] [spark] SparkQA commented on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
SparkQA commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976391017 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50018/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976398396

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976398398 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145546/

[GitHub] [spark] AmplabJenkins commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976398396

[GitHub] [spark] AmplabJenkins commented on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976398398 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145546/

[GitHub] [spark] AmplabJenkins commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976398401

[GitHub] [spark] AmplabJenkins commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976398397 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145535/

[GitHub] [spark] AmplabJenkins commented on pull request #34691: [SPARK-37447][SQL] Cache LogicalPlan.isStreaming() result in a lazy val

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34691: URL: https://github.com/apache/spark/pull/34691#issuecomment-976398404 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50015/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34691: [SPARK-37447][SQL] Cache LogicalPlan.isStreaming() result in a lazy val

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34691: URL: https://github.com/apache/spark/pull/34691#issuecomment-976398404 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50015/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976398397 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145535/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976398399

[GitHub] [spark] AmplabJenkins commented on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976440342 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50018/

[GitHub] [spark] AmplabJenkins commented on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976440343 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50019/

[GitHub] [spark] AmplabJenkins commented on pull request #34692: [SPARK-11792][FOLLOWUP] Update scaladoc of KnownSizeEstimation

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34692: URL: https://github.com/apache/spark/pull/34692#issuecomment-976440746 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976440343 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50019/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976440342 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50018/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976294816 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145542/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34687: [SPARK-36231][PYTHON] Support arithmetic operations of decimal(nan) series

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34687: URL: https://github.com/apache/spark/pull/34687#issuecomment-976294818 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50010/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34688: [WIP][SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976294817 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50009/

[GitHub] [spark] AmplabJenkins commented on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976300656 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145538/

[GitHub] [spark] gengliangwang commented on a change in pull request #34596: [SPARK-37326][SQL] Support TimestampNTZ in CSV data source

2021-11-23 Thread GitBox
gengliangwang commented on a change in pull request #34596: URL: https://github.com/apache/spark/pull/34596#discussion_r754907789 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala ## @@ -66,10 +68,23 @@ sealed trait

[GitHub] [spark] HyukjinKwon commented on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
HyukjinKwon commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976343235 cc @ueshin @viirya @BryanCutler @JoshRosen @WeichenXu123 @mengxr FYI

[GitHub] [spark] AmplabJenkins commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976351448 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50014/

[GitHub] [spark] AmplabJenkins commented on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976351446 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145539/

[GitHub] [spark] AmplabJenkins commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976351445 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50012/

[GitHub] [spark] AmplabJenkins commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976351442 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50013/

[GitHub] [spark] SparkQA commented on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
SparkQA commented on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976356000 **[Test build #145547 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145547/testReport)** for PR 34677 at commit

[GitHub] [spark] SparkQA commented on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
SparkQA commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976375766 **[Test build #145546 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145546/testReport)** for PR 34688 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #34686: [SPARK-37444][SQL] ALTER NAMESPACE ... SET LOCATION should handle empty location consistently across v1 and v2 command

2021-11-23 Thread GitBox
cloud-fan commented on a change in pull request #34686: URL: https://github.com/apache/spark/pull/34686#discussion_r754984133 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala ## @@ -349,7 +351,7 @@ case class

[GitHub] [spark] SparkQA commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
SparkQA commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976460698 **[Test build #145540 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145540/testReport)** for PR 34611 at commit

[GitHub] [spark] SparkQA commented on pull request #34691: [SPARK-37447][SQL] Cache LogicalPlan.isStreaming() result in a lazy val

2021-11-23 Thread GitBox
SparkQA commented on pull request #34691: URL: https://github.com/apache/spark/pull/34691#issuecomment-976296694 **[Test build #145544 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145544/testReport)** for PR 34691 at commit

[GitHub] [spark] SparkQA commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
SparkQA commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976297392 **[Test build #145545 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145545/testReport)** for PR 34070 at commit

[GitHub] [spark] SparkQA commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
SparkQA commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976338712 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50017/

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976370765 **[Test build #145543 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145543/testReport)** for PR 34690 at commit

[GitHub] [spark] SparkQA commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
SparkQA commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976371216 **[Test build #145541 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145541/testReport)** for PR 34070 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976244692 **[Test build #145543 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145543/testReport)** for PR 34690 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976238045 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50008/

[GitHub] [spark] AngersZhuuuu opened a new pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AngersZhuuuu opened a new pull request #34690: URL: https://github.com/apache/spark/pull/34690 ### What changes were proposed in this pull request? Hive has provided the function `getWithoutRegisterFns` since 2.3.9, but users may run Hive 2.3.8 or a lower version. Here we should use

[GitHub] [spark] SparkQA commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
SparkQA commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976238023 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50008/

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976244692 **[Test build #145543 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145543/testReport)** for PR 34690 at commit

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976290699 **[Test build #145542 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145542/testReport)** for PR 34690 at commit

[GitHub] [spark] SparkQA commented on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
SparkQA commented on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976290689 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50012/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
SparkQA commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976291317 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50013/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA removed a comment on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976239896 **[Test build #145542 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145542/testReport)** for PR 34690 at commit

[GitHub] [spark] gengliangwang commented on a change in pull request #34596: [SPARK-37326][SQL] Support TimestampNTZ in CSV data source

2021-11-23 Thread GitBox
gengliangwang commented on a change in pull request #34596: URL: https://github.com/apache/spark/pull/34596#discussion_r754901303 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/CSVOptions.scala ## @@ -164,6 +164,10 @@ class CSVOptions(

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976302494 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50011/

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
HyukjinKwon commented on a change in pull request #34688: URL: https://github.com/apache/spark/pull/34688#discussion_r754950755 ## File path: python/pyspark/__init__.py ## @@ -136,7 +136,7 @@ def wrapper(self, *args, **kwargs): "Accumulator", "AccumulatorParam",

[GitHub] [spark] HyukjinKwon removed a comment on pull request #34677: [SPARK-37436][PYTHON] Uses Python's standard string formatter for SQL API in pandas API on Spark

2021-11-23 Thread GitBox
HyukjinKwon removed a comment on pull request #34677: URL: https://github.com/apache/spark/pull/34677#issuecomment-976353742 retest this please -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [spark] cloud-fan commented on a change in pull request #34686: [SPARK-37444][SQL] ALTER NAMESPACE ... SET LOCATION should handle empty location consistently across v1 and v2 command

2021-11-23 Thread GitBox
cloud-fan commented on a change in pull request #34686: URL: https://github.com/apache/spark/pull/34686#discussion_r754983417 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala ## @@ -349,7 +351,7 @@ case class

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976386911 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50016/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34691: [SPARK-37447][SQL] Cache LogicalPlan.isStreaming() result in a lazy val

2021-11-23 Thread GitBox
SparkQA commented on pull request #34691: URL: https://github.com/apache/spark/pull/34691#issuecomment-976391393 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50015/ -- This is an automated message from the

[GitHub] [spark] pan3793 opened a new pull request #34692: [SPARK-11792][FOLLOWUP] Update scaladoc of KnownSizeEstimation

2021-11-23 Thread GitBox
pan3793 opened a new pull request #34692: URL: https://github.com/apache/spark/pull/34692 ### What changes were proposed in this pull request? Followup #9813 ### Why are the changes needed? Fix scaladoc. ### Does this PR introduce _any_ user-facing change? No.

[GitHub] [spark] SparkQA commented on pull request #34693: [SPARK-37259][SQL] Support CTE queries with MSSQL JDBC

2021-11-23 Thread GitBox
SparkQA commented on pull request #34693: URL: https://github.com/apache/spark/pull/34693#issuecomment-976453455 **[Test build #145548 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145548/testReport)** for PR 34693 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #34687: [SPARK-36231][PYTHON] Support arithmetic operations of decimal(nan) series

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34687: URL: https://github.com/apache/spark/pull/34687#issuecomment-976294818 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50010/ --

[GitHub] [spark] AmplabJenkins commented on pull request #34688: [WIP][SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976294817 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50009/ --

[GitHub] [spark] AmplabJenkins commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AmplabJenkins commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976294816 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145542/ -- This

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34689: [SPARK-37445][BUILD] Upgrade hadoop profile to hadoop-3.3 since we support hadoop-3.3 as default now

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34689: URL: https://github.com/apache/spark/pull/34689#issuecomment-976351446 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/145539/

[GitHub] [spark] HyukjinKwon commented on a change in pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
HyukjinKwon commented on a change in pull request #34688: URL: https://github.com/apache/spark/pull/34688#discussion_r754955918 ## File path: python/pyspark/serializers.py ## @@ -19,7 +19,7 @@ PySpark supports custom serializers for transferring data; this can improve

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976351442 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50013/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976351445 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50012/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
AmplabJenkins removed a comment on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976351448 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/50014/

[GitHub] [spark] SparkQA commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
SparkQA commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976376809 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50017/ -- This is an automated message from the

[GitHub] [spark] SparkQA removed a comment on pull request #34688: [SPARK-32079][PYTHON] Remove namedtuple hack by replacing built-in pickle to cloudpickle

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34688: URL: https://github.com/apache/spark/pull/34688#issuecomment-976353055 **[Test build #145546 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145546/testReport)** for PR 34688 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #34611: [SPARK-35867][SQL] Enable vectorized read for VectorizedPlainValuesReader.readBooleans

2021-11-23 Thread GitBox
SparkQA removed a comment on pull request #34611: URL: https://github.com/apache/spark/pull/34611#issuecomment-976235724 **[Test build #145540 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145540/testReport)** for PR 34611 at commit

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976239896 **[Test build #145542 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/145542/testReport)** for PR 34690 at commit

[GitHub] [spark] Yikun commented on a change in pull request #34687: [SPARK-36231][PYTHON] Support arithmetic operations of decimal(nan) series

2021-11-23 Thread GitBox
Yikun commented on a change in pull request #34687: URL: https://github.com/apache/spark/pull/34687#discussion_r754789113 ## File path: python/pyspark/pandas/tests/data_type_ops/testing_utils.py ## @@ -49,8 +49,15 @@ def numeric_pdf(self): dtypes = [np.int32, int,

[GitHub] [spark] wankunde commented on pull request #34234: [SPARK-36967][CORE] Report accurate shuffle block size if its skewed

2021-11-23 Thread GitBox
wankunde commented on pull request #34234: URL: https://github.com/apache/spark/pull/34234#issuecomment-976278207 Hi, @Ngone51 @JoshRosen @attilapiros Could you help me to review this PR? Thanks -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] SparkQA commented on pull request #34070: [SPARK-36840][SQL] Support DPP if there is no selective predicate on the filtering side

2021-11-23 Thread GitBox
SparkQA commented on pull request #34070: URL: https://github.com/apache/spark/pull/34070#issuecomment-976330327 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50013/ -- This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #34690: [SPARK-37446][SQL] When create Hive client, we should use reflection when call getWithoutRegisterFns

2021-11-23 Thread GitBox
SparkQA commented on pull request #34690: URL: https://github.com/apache/spark/pull/34690#issuecomment-976336286 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/50016/ -- This is an automated message from the Apache
