[
https://issues.apache.org/jira/browse/SPARK-40502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607523#comment-17607523
]
CaoYu commented on SPARK-40502:
---
I am a teacher
Recently designed Python language basic course, big data
[
https://issues.apache.org/jira/browse/SPARK-40511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40511:
Assignee: Apache Spark
> Upgrade slf4j to 2.x
>
>
>
[
https://issues.apache.org/jira/browse/SPARK-40511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40511:
Assignee: (was: Apache Spark)
> Upgrade slf4j to 2.x
>
>
>
[
https://issues.apache.org/jira/browse/SPARK-40511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607522#comment-17607522
]
Apache Spark commented on SPARK-40511:
--
User 'LuciferYang' has created a pull request for this
Yang Jie created SPARK-40511:
Summary: Upgrade slf4j to 2.x
Key: SPARK-40511
URL: https://issues.apache.org/jira/browse/SPARK-40511
Project: Spark
Issue Type: Improvement
Components:
[
https://issues.apache.org/jira/browse/SPARK-40496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan reassigned SPARK-40496:
---
Assignee: Ivan Sadikov
> Configs to control "enableDateTimeParsingFallback" are
[
https://issues.apache.org/jira/browse/SPARK-40496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan resolved SPARK-40496.
-
Fix Version/s: 3.4.0
Resolution: Fixed
Issue resolved by pull request 37942
[
https://issues.apache.org/jira/browse/SPARK-40506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
王俊博 updated SPARK-40506:
Description:
Spark StreamingSource Metrics sourceName is inappropriate.The label now looks
like
[
https://issues.apache.org/jira/browse/SPARK-40332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607489#comment-17607489
]
Apache Spark commented on SPARK-40332:
--
User 'zhengruifeng' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-40332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607488#comment-17607488
]
Apache Spark commented on SPARK-40332:
--
User 'zhengruifeng' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-40501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
BingKun Pan updated SPARK-40501:
Summary: Enhance 'SpecialLimits' to support project(..., limit(...)) (was:
Add
[
https://issues.apache.org/jira/browse/SPARK-40510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607483#comment-17607483
]
Apache Spark commented on SPARK-40510:
--
User 'zhengruifeng' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-40510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40510:
Assignee: Apache Spark
> Implement `ddof` in `Series.cov`
>
[
https://issues.apache.org/jira/browse/SPARK-40510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607481#comment-17607481
]
Apache Spark commented on SPARK-40510:
--
User 'zhengruifeng' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-40510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40510:
Assignee: (was: Apache Spark)
> Implement `ddof` in `Series.cov`
>
Ruifeng Zheng created SPARK-40510:
-
Summary: Implement `ddof` in `Series.cov`
Key: SPARK-40510
URL: https://issues.apache.org/jira/browse/SPARK-40510
Project: Spark
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/SPARK-40491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon resolved SPARK-40491.
--
Fix Version/s: 3.4.0
Resolution: Fixed
Issue resolved by pull request 37937
[
https://issues.apache.org/jira/browse/SPARK-40491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon reassigned SPARK-40491:
Assignee: jiaan.geng
> Remove too old TODO for JdbcRDD
> ---
[
https://issues.apache.org/jira/browse/SPARK-40500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon resolved SPARK-40500.
--
Fix Version/s: 3.4.0
Resolution: Fixed
Issue resolved by pull request 37947
[
https://issues.apache.org/jira/browse/SPARK-40500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon reassigned SPARK-40500:
Assignee: Ruifeng Zheng
> Use `pd.items` instead of `pd.iteritems`
>
[
https://issues.apache.org/jira/browse/SPARK-40499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon updated SPARK-40499:
-
Priority: Major (was: Blocker)
> Spark 3.2.1 percentlie_approx query much slower than Spark
[
https://issues.apache.org/jira/browse/SPARK-40502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607447#comment-17607447
]
Hyukjin Kwon commented on SPARK-40502:
--
{quote}
For some reasons, i can't using DataFrame API, only
Jungtaek Lim created SPARK-40509:
Summary: Construct an example of applyInPandasWithState in
examples directory
Key: SPARK-40509
URL: https://issues.apache.org/jira/browse/SPARK-40509
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-40508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607384#comment-17607384
]
Apache Spark commented on SPARK-40508:
--
User 'tedyu' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-40508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40508:
Assignee: (was: Apache Spark)
> Treat unknown partitioning as UnknownPartitioning
>
[
https://issues.apache.org/jira/browse/SPARK-40508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40508:
Assignee: Apache Spark
> Treat unknown partitioning as UnknownPartitioning
>
[
https://issues.apache.org/jira/browse/SPARK-40508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ted Yu updated SPARK-40508:
---
Description:
When running spark application against spark 3.3, I see the following :
{code}
Ted Yu created SPARK-40508:
--
Summary: Treat unknown partitioning as UnknownPartitioning
Key: SPARK-40508
URL: https://issues.apache.org/jira/browse/SPARK-40508
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-40477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kazuyuki Tanimura resolved SPARK-40477.
---
Resolution: Won't Fix
gave another thought and decided to close this one not to be
[
https://issues.apache.org/jira/browse/SPARK-40416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gengliang Wang resolved SPARK-40416.
Fix Version/s: 3.4.0
Resolution: Fixed
Issue resolved by pull request 37840
[
https://issues.apache.org/jira/browse/SPARK-40416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gengliang Wang reassigned SPARK-40416:
--
Assignee: Daniel
> Add error classes for subquery expression CheckAnalysis failures
[
https://issues.apache.org/jira/browse/SPARK-40507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anil Dasari updated SPARK-40507:
Description:
Dataframe saveAsTable sets all columns as optional/nullable while creating the
Anil Dasari created SPARK-40507:
---
Summary: Spark creates an optional columns in hive table for
fields that are not null
Key: SPARK-40507
URL: https://issues.apache.org/jira/browse/SPARK-40507
Project:
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:23 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:23 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:22 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:22 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:21 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:21 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:20 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:20 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:18 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-31404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607326#comment-17607326
]
Sachit commented on SPARK-31404:
Hi [~cloud_fan] ,
Could you please confirm if we need to use below
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys edited comment on SPARK-40439 at 9/20/22 5:17 PM:
---
[~hyukjin.kwon]: Thank you
[
https://issues.apache.org/jira/browse/SPARK-40439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607314#comment-17607314
]
xsys commented on SPARK-40439:
--
[~hyukjin.kwon]: Thank you for your response! Setting
[
https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-39494:
Assignee: Apache Spark
> Support `createDataFrame` from a list of scalars when schema is
[
https://issues.apache.org/jira/browse/SPARK-39494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-39494:
Assignee: (was: Apache Spark)
> Support `createDataFrame` from a list of scalars
[
https://issues.apache.org/jira/browse/SPARK-40357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607277#comment-17607277
]
Max Gekk commented on SPARK-40357:
--
[~lvshaokang] Sure, go ahead.
> Migrate window type check failures
[
https://issues.apache.org/jira/browse/SPARK-40357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607275#comment-17607275
]
Shaokang Lv commented on SPARK-40357:
-
Hi, [~maxgekk] , I would like to do some work and pick up
[
https://issues.apache.org/jira/browse/SPARK-34805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607270#comment-17607270
]
Joost Farla commented on SPARK-34805:
-
[~cloud_fan] I was running into the exact same issue using
[
https://issues.apache.org/jira/browse/SPARK-40479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Max Gekk resolved SPARK-40479.
--
Fix Version/s: 3.4.0
Resolution: Fixed
Issue resolved by pull request 37921
[
https://issues.apache.org/jira/browse/SPARK-40479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Max Gekk reassigned SPARK-40479:
Assignee: Max Gekk
> Migrate unexpected input type error to an error class
>
[
https://issues.apache.org/jira/browse/SPARK-40491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean R. Owen updated SPARK-40491:
-
Issue Type: Task (was: New Feature)
Priority: Trivial (was: Major)
This didn't need a
[
https://issues.apache.org/jira/browse/SPARK-40489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607188#comment-17607188
]
Garret Wilson edited comment on SPARK-40489 at 9/20/22 1:19 PM:
[
https://issues.apache.org/jira/browse/SPARK-40489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607188#comment-17607188
]
Garret Wilson edited comment on SPARK-40489 at 9/20/22 1:18 PM:
[
https://issues.apache.org/jira/browse/SPARK-40489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607188#comment-17607188
]
Garret Wilson commented on SPARK-40489:
---
{quote}It sounds like the new major version upgrade is
[
https://issues.apache.org/jira/browse/SPARK-40489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17606852#comment-17606852
]
Garret Wilson edited comment on SPARK-40489 at 9/20/22 1:13 PM:
#
[
https://issues.apache.org/jira/browse/SPARK-40506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40506:
Assignee: (was: Apache Spark)
> Spark Streaming metrics name don't need application
[
https://issues.apache.org/jira/browse/SPARK-40506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40506:
Assignee: Apache Spark
> Spark Streaming metrics name don't need application name
>
[
https://issues.apache.org/jira/browse/SPARK-40506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607167#comment-17607167
]
Apache Spark commented on SPARK-40506:
--
User 'Kwafoor' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-40506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
王俊博 updated SPARK-40506:
Summary: Spark Streaming metrics name don't need application name (was:
Spark Streaming Metrics SourceName is
[
https://issues.apache.org/jira/browse/SPARK-40506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
王俊博 updated SPARK-40506:
Description:
Spark StreamingSource Metrics sourceName is inappropriate.The label now looks
like
王俊博 created SPARK-40506:
---
Summary: Spark Streaming Metrics SourceName is unsuitable
Key: SPARK-40506
URL: https://issues.apache.org/jira/browse/SPARK-40506
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-40505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607151#comment-17607151
]
Apache Spark commented on SPARK-40505:
--
User 'bryanck' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-40505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40505:
Assignee: Apache Spark
> Remove min heap setting in Kubernetes Dockerfile entrypoint
>
[
https://issues.apache.org/jira/browse/SPARK-40505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40505:
Assignee: (was: Apache Spark)
> Remove min heap setting in Kubernetes Dockerfile
[
https://issues.apache.org/jira/browse/SPARK-40505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607149#comment-17607149
]
Apache Spark commented on SPARK-40505:
--
User 'bryanck' has created a pull request for this issue:
Bryan Keller created SPARK-40505:
Summary: Remove min heap setting in Kubernetes Dockerfile
entrypoint
Key: SPARK-40505
URL: https://issues.apache.org/jira/browse/SPARK-40505
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-40501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
BingKun Pan updated SPARK-40501:
Description:
h4. It took a long time to fetch out, still running after 20 minutes...
when run as
[
https://issues.apache.org/jira/browse/SPARK-40501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
BingKun Pan updated SPARK-40501:
Summary: Add PushProjectionThroughLimit for Optimizer (was: add
PushProjectionThroughLimit for
[
https://issues.apache.org/jira/browse/SPARK-40504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607092#comment-17607092
]
Apache Spark commented on SPARK-40504:
--
User 'zhengchenyu' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-40504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40504:
Assignee: Apache Spark
> Make yarn appmaster load config from client
>
[
https://issues.apache.org/jira/browse/SPARK-40504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40504:
Assignee: (was: Apache Spark)
> Make yarn appmaster load config from client
>
[
https://issues.apache.org/jira/browse/SPARK-40457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607082#comment-17607082
]
Bilna commented on SPARK-40457:
---
[~hyukjin.kwon] it is org.codehaus.jackson:jackson-mapper-asl:jar:1.9.13
[
https://issues.apache.org/jira/browse/SPARK-40504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengchenyu updated SPARK-40504:
Description:
In yarn federation mode, config in client side and nm side may be different.
[
https://issues.apache.org/jira/browse/SPARK-40327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607079#comment-17607079
]
Apache Spark commented on SPARK-40327:
--
User 'zhengruifeng' has created a pull request for this
zhengchenyu created SPARK-40504:
---
Summary: Make yarn appmaster load config from client
Key: SPARK-40504
URL: https://issues.apache.org/jira/browse/SPARK-40504
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-40327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40327:
Assignee: (was: Apache Spark)
> Increase pandas API coverage for pandas API on Spark
[
https://issues.apache.org/jira/browse/SPARK-40327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40327:
Assignee: Apache Spark
> Increase pandas API coverage for pandas API on Spark
>
[
https://issues.apache.org/jira/browse/SPARK-40327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607077#comment-17607077
]
Apache Spark commented on SPARK-40327:
--
User 'zhengruifeng' has created a pull request for this
Ruifeng Zheng created SPARK-40503:
-
Summary: Add resampling to API references
Key: SPARK-40503
URL: https://issues.apache.org/jira/browse/SPARK-40503
Project: Spark
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/SPARK-40491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607045#comment-17607045
]
CaoYu commented on SPARK-40491:
---
Maybe we can just not remove these.
I have already created
CaoYu created SPARK-40502:
-
Summary: Support dataframe API use jdbc data source in PySpark
Key: SPARK-40502
URL: https://issues.apache.org/jira/browse/SPARK-40502
Project: Spark
Issue Type: New
[
https://issues.apache.org/jira/browse/SPARK-40500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607034#comment-17607034
]
Apache Spark commented on SPARK-40500:
--
User 'zhengruifeng' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-40500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40500:
Assignee: Apache Spark
> Use `pd.items` instead of `pd.iteritems`
>
[
https://issues.apache.org/jira/browse/SPARK-40500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40500:
Assignee: (was: Apache Spark)
> Use `pd.items` instead of `pd.iteritems`
>
[
https://issues.apache.org/jira/browse/SPARK-40501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
BingKun Pan updated SPARK-40501:
Description:
h4. It took a long time to fetch out
when run as follow code in spark-shell:
[
https://issues.apache.org/jira/browse/SPARK-40501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607033#comment-17607033
]
Apache Spark commented on SPARK-40501:
--
User 'panbingkun' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-40501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40501:
Assignee: (was: Apache Spark)
> add PushProjectionThroughLimit for Optimizer
>
[
https://issues.apache.org/jira/browse/SPARK-40501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-40501:
Assignee: Apache Spark
> add PushProjectionThroughLimit for Optimizer
>
[
https://issues.apache.org/jira/browse/SPARK-40499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
xuanzhiang updated SPARK-40499:
---
Priority: Blocker (was: Major)
> Spark 3.2.1 percentlie_approx query much slower than Spark 2.4.0
BingKun Pan created SPARK-40501:
---
Summary: add PushProjectionThroughLimit for Optimizer
Key: SPARK-40501
URL: https://issues.apache.org/jira/browse/SPARK-40501
Project: Spark
Issue Type:
Ruifeng Zheng created SPARK-40500:
-
Summary: Use `pd.items` instead of `pd.iteritems`
Key: SPARK-40500
URL: https://issues.apache.org/jira/browse/SPARK-40500
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-40499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
xuanzhiang updated SPARK-40499:
---
Priority: Blocker (was: Minor)
> Spark 3.2.1 percentlie_approx query much slower than Spark 2.4.0
[
https://issues.apache.org/jira/browse/SPARK-40499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
xuanzhiang updated SPARK-40499:
---
Priority: Major (was: Blocker)
> Spark 3.2.1 percentlie_approx query much slower than Spark 2.4.0
[
https://issues.apache.org/jira/browse/SPARK-40499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
xuanzhiang updated SPARK-40499:
---
Environment:
hadoop: 3.0.0
spark: 2.4.0 / 3.2.1
shuffle:spark 2.4.0
was:
hadoop 3.0.0
[
https://issues.apache.org/jira/browse/SPARK-40499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
xuanzhiang updated SPARK-40499:
---
Attachment: spark3.2-shuffle-data.png
> Spark 3.2.1 percentlie_approx query much slower than Spark
[
https://issues.apache.org/jira/browse/SPARK-40419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607022#comment-17607022
]
Apache Spark commented on SPARK-40419:
--
User 'itholic' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-40419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607021#comment-17607021
]
Apache Spark commented on SPARK-40419:
--
User 'itholic' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-40499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
xuanzhiang updated SPARK-40499:
---
Description:
spark.sql(
s"""
|SELECT
| Info ,
|
1 - 100 of 129 matches
Mail list logo