jzhuge commented on PR #38699:
URL: https://github.com/apache/spark/pull/38699#issuecomment-1445858855
> > If we are setting it in `SparkContext`, do we want to get rid of this
from other places like `PythonRunner.compute` ?
>
> I think we can remove code in PythonRunner.compute
WweiL opened a new pull request, #40187:
URL: https://github.com/apache/spark/pull/40187
### What changes were proposed in this pull request?
https://github.com/apache/spark/pull/40073 accidentally changed the
relationship of the two `if` statement in
cloud-fan closed pull request #40121: [SPARK-42528][CORE] Optimize
PercentileHeap
URL: https://github.com/apache/spark/pull/40121
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
cloud-fan commented on PR #40121:
URL: https://github.com/apache/spark/pull/40121#issuecomment-1445779613
The failed HealthTrackerIntegrationSuite is definitely unrelated, I'm
merging it to master, thanks!
--
This is an automated message from the Apache Git Service.
To respond to the
cloud-fan commented on PR #40115:
URL: https://github.com/apache/spark/pull/40115#issuecomment-1445778387
the change LGTM but the PR title is a bit confusing. How is it related to
subquery?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log
dongjoon-hyun commented on PR #40183:
URL: https://github.com/apache/spark/pull/40183#issuecomment-1445745535
Thank you, @viirya . Sorry for missing at the first PR.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
viirya commented on PR #40183:
URL: https://github.com/apache/spark/pull/40183#issuecomment-1445724214
Looks good.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To
allisonwang-db closed pull request #40146: [SPARK-42120][SQL] Add built-in
table-valued function json_tuple
URL: https://github.com/apache/spark/pull/40146
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to
allisonwang-db commented on PR #40146:
URL: https://github.com/apache/spark/pull/40146#issuecomment-1445719261
Combined in https://github.com/apache/spark/pull/40151
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
allisonwang-db closed pull request #40149: [SPARK-42122][SQL] Add built-in
table-valued function stack
URL: https://github.com/apache/spark/pull/40149
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go
allisonwang-db commented on PR #40149:
URL: https://github.com/apache/spark/pull/40149#issuecomment-1445718970
Merged in https://github.com/apache/spark/pull/40151
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
navinvishy commented on code in PR #38947:
URL: https://github.com/apache/spark/pull/38947#discussion_r1118279709
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala:
##
@@ -1399,6 +1399,119 @@ case class ArrayContains(left:
navinvishy commented on code in PR #38947:
URL: https://github.com/apache/spark/pull/38947#discussion_r1118278674
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala:
##
@@ -1399,6 +1399,149 @@ case class ArrayContains(left:
hvanhovell opened a new pull request, #40186:
URL: https://github.com/apache/spark/pull/40186
### What changes were proposed in this pull request?
This PR adds the `SQLImplicits` class to Spark Connect. This makes it easier
for end users to work with Connect Datasets.
The current
navinvishy commented on code in PR #38947:
URL: https://github.com/apache/spark/pull/38947#discussion_r1118278902
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/collectionOperations.scala:
##
@@ -1399,6 +1399,149 @@ case class ArrayContains(left:
dongjoon-hyun commented on code in PR #40179:
URL: https://github.com/apache/spark/pull/40179#discussion_r1118276085
##
connector/connect/client/jvm/src/test/scala/org/apache/spark/sql/connect/client/CompatibilitySuite.scala:
##
@@ -155,6 +156,7 @@ class CompatibilitySuite
hvanhovell commented on PR #40184:
URL: https://github.com/apache/spark/pull/40184#issuecomment-1445682796
@amaliujia can you please update the compatibility test for these?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and
hvanhovell commented on code in PR #40184:
URL: https://github.com/apache/spark/pull/40184#discussion_r1118271294
##
connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala:
##
@@ -190,6 +190,22 @@ class SparkSession(
range(start, end, step,
hvanhovell commented on code in PR #40184:
URL: https://github.com/apache/spark/pull/40184#discussion_r1118271144
##
connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/SparkSession.scala:
##
@@ -190,6 +190,22 @@ class SparkSession(
range(start, end, step,
hvanhovell opened a new pull request, #40185:
URL: https://github.com/apache/spark/pull/40185
### What changes were proposed in this pull request?
This PR adds the RuntimeConfig class for the Spark Connect Scala Client.
### Why are the changes needed?
API Parity.
### Does
gatorsmile commented on PR #39558:
URL: https://github.com/apache/spark/pull/39558#issuecomment-1445674657
@smallzhongfeng Could you help add it to the migration guide?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use
beliefer commented on PR #39990:
URL: https://github.com/apache/spark/pull/39990#issuecomment-1445670336
ping @huaxingao cc @cloud-fan @sadikovi
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to
amaliujia commented on PR #40184:
URL: https://github.com/apache/spark/pull/40184#issuecomment-1445656241
@hvanhovell
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To
amaliujia opened a new pull request, #40184:
URL: https://github.com/apache/spark/pull/40184
### What changes were proposed in this pull request?
Throw exceptions for unsupported session API:
1. newSession
2. getActiveSession
3. getDefaultSession
4. active
dongjoon-hyun closed pull request #40183:
[SPARK-42587][CONNECT][TESTS][FOLLOWUP] Fix `scalafmt` failure
URL: https://github.com/apache/spark/pull/40183
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go
dongjoon-hyun commented on PR #40183:
URL: https://github.com/apache/spark/pull/40183#issuecomment-1445652117
Merged to master/3.4.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
dongjoon-hyun commented on PR #40183:
URL: https://github.com/apache/spark/pull/40183#issuecomment-1445651565
Thank you so much, @hvanhovell . Sorry for the troubles.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use
dongjoon-hyun commented on PR #40183:
URL: https://github.com/apache/spark/pull/40183#issuecomment-1445636922
Now, it passed.
![Screenshot 2023-02-26 at 7 25 33
PM](https://user-images.githubusercontent.com/9700541/221465853-833cb047-d751-43f1-a341-7a6b01f5ce21.png)
--
This is an
dongjoon-hyun commented on PR #40183:
URL: https://github.com/apache/spark/pull/40183#issuecomment-1445628852
cc @viirya
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
dongjoon-hyun opened a new pull request, #40183:
URL: https://github.com/apache/spark/pull/40183
### What changes were proposed in this pull request?
### Why are the changes needed?
### Does this PR introduce _any_ user-facing change?
###
dongjoon-hyun commented on PR #40181:
URL: https://github.com/apache/spark/pull/40181#issuecomment-1445625931
Let me close this and fix the branch first.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above
dongjoon-hyun closed pull request #40181: [SPARK-42589][CONNECT][TESTS] Exclude
`RelationalGroupedDataset.apply` from `CompatibilitySuite`
URL: https://github.com/apache/spark/pull/40181
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to
dongjoon-hyun commented on code in PR #40181:
URL: https://github.com/apache/spark/pull/40181#discussion_r1118237094
##
connector/connect/client/jvm/src/test/scala/org/apache/spark/sql/connect/client/CompatibilitySuite.scala:
##
@@ -39,8 +39,8 @@ import
dongjoon-hyun commented on code in PR #40181:
URL: https://github.com/apache/spark/pull/40181#discussion_r1118236801
##
connector/connect/client/jvm/src/test/scala/org/apache/spark/sql/connect/client/CompatibilitySuite.scala:
##
@@ -159,6 +159,7 @@ class CompatibilitySuite
dongjoon-hyun commented on PR #40181:
URL: https://github.com/apache/spark/pull/40181#issuecomment-1445615890
Hi, @viirya . Could you review this PR too?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above
zml1206 opened a new pull request, #40182:
URL: https://github.com/apache/spark/pull/40182
### What changes were proposed in this pull request?
Extend the CollapseWindow rule to collapse Window nodes with the equivalent
partition/order expressions
### Why are the changes
dongjoon-hyun opened a new pull request, #40181:
URL: https://github.com/apache/spark/pull/40181
…
### What changes were proposed in this pull request?
### Why are the changes needed?
### Does this PR introduce _any_ user-facing change?
hvanhovell closed pull request #40179: [SPARK-42560][CONNECT] Add ColumnName
class
URL: https://github.com/apache/spark/pull/40179
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
hvanhovell commented on PR #40179:
URL: https://github.com/apache/spark/pull/40179#issuecomment-1445591135
Merging.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To
dongjoon-hyun closed pull request #40180: [SPARK-42587][CONNECT][TESTS] Use
wrapper versions for SBT and Maven in `connect` module tests
URL: https://github.com/apache/spark/pull/40180
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to
dongjoon-hyun commented on PR #40180:
URL: https://github.com/apache/spark/pull/40180#issuecomment-1445587270
Thank you! Merged to master/3.4.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
dongjoon-hyun commented on PR #40176:
URL: https://github.com/apache/spark/pull/40176#issuecomment-1445586703
Thank you for doing this too!
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
wankunde commented on PR #40157:
URL: https://github.com/apache/spark/pull/40157#issuecomment-1445583743
cc @cloud-fan Could you help to review this PR? Thanks
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL
LuciferYang commented on code in PR #40136:
URL: https://github.com/apache/spark/pull/40136#discussion_r1118213407
##
connector/connect/client/jvm/pom.xml:
##
@@ -125,6 +125,11 @@
${mima.version}
test
+
Review Comment:
> On the other side, maven is
LuciferYang commented on code in PR #40136:
URL: https://github.com/apache/spark/pull/40136#discussion_r1118213407
##
connector/connect/client/jvm/pom.xml:
##
@@ -125,6 +125,11 @@
${mima.version}
test
+
Review Comment:
> On the other side, maven is
LuciferYang commented on code in PR #40136:
URL: https://github.com/apache/spark/pull/40136#discussion_r1118209034
##
connector/connect/client/jvm/pom.xml:
##
@@ -125,6 +125,11 @@
${mima.version}
test
+
Review Comment:
> Do we still need rules for
LuciferYang commented on code in PR #40136:
URL: https://github.com/apache/spark/pull/40136#discussion_r1118212051
##
connector/connect/client/jvm/pom.xml:
##
@@ -125,6 +125,11 @@
${mima.version}
test
+
Review Comment:
Update comments
--
This is
zhengruifeng commented on code in PR #40013:
URL: https://github.com/apache/spark/pull/40013#discussion_r1118211205
##
connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala:
##
@@ -1346,16 +1346,16 @@ class
LuciferYang commented on code in PR #40136:
URL: https://github.com/apache/spark/pull/40136#discussion_r1118209034
##
connector/connect/client/jvm/pom.xml:
##
@@ -125,6 +125,11 @@
${mima.version}
test
+
Review Comment:
> Do we still need rules for
dongjoon-hyun commented on PR #40180:
URL: https://github.com/apache/spark/pull/40180#issuecomment-1445559302
Could you review this editorial patch, @HyukjinKwon and @viirya ?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub
dongjoon-hyun opened a new pull request, #40180:
URL: https://github.com/apache/spark/pull/40180
…
### What changes were proposed in this pull request?
### Why are the changes needed?
### Does this PR introduce _any_ user-facing change?
ulysses-you commented on code in PR #40177:
URL: https://github.com/apache/spark/pull/40177#discussion_r1118205294
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala:
##
@@ -194,12 +194,12 @@ object EliminateOuterJoin extends Rule[LogicalPlan]
ulysses-you commented on code in PR #40177:
URL: https://github.com/apache/spark/pull/40177#discussion_r1118205294
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala:
##
@@ -194,12 +194,12 @@ object EliminateOuterJoin extends Rule[LogicalPlan]
viirya commented on code in PR #40178:
URL: https://github.com/apache/spark/pull/40178#discussion_r1118205272
##
docs/building-spark.md:
##
@@ -276,34 +276,6 @@ Enable the profile (e.g. 2.13):
# For sbt
./build/sbt -Pscala-2.13 compile
-## Running Jenkins tests with
zhengruifeng commented on code in PR #40013:
URL: https://github.com/apache/spark/pull/40013#discussion_r1118194094
##
connector/connect/server/src/main/scala/org/apache/spark/sql/connect/planner/SparkConnectPlanner.scala:
##
@@ -1346,16 +1346,16 @@ class
HyukjinKwon commented on code in PR #40178:
URL: https://github.com/apache/spark/pull/40178#discussion_r1118182170
##
docs/building-spark.md:
##
@@ -276,34 +276,6 @@ Enable the profile (e.g. 2.13):
# For sbt
./build/sbt -Pscala-2.13 compile
-## Running Jenkins tests
zhengruifeng commented on code in PR #40013:
URL: https://github.com/apache/spark/pull/40013#discussion_r1118191339
##
connector/connect/server/src/main/scala/org/apache/spark/sql/connect/dsl/package.scala:
##
@@ -703,21 +703,13 @@ package object dsl {
def drop(columns:
zhengruifeng commented on PR #40170:
URL: https://github.com/apache/spark/pull/40170#issuecomment-1445528865
late LGTM, thanks!
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
zhengruifeng closed pull request #39995: [WIP][CONNECT] Initial runtime SQL
configuration implementation
URL: https://github.com/apache/spark/pull/39995
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go
hvanhovell commented on PR #40176:
URL: https://github.com/apache/spark/pull/40176#issuecomment-1445521369
Thanks for doing this!
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
hvanhovell closed pull request #40176: [SPARK-42564][CONNECT] Implement
SparkSession.version and SparkSession.time
URL: https://github.com/apache/spark/pull/40176
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL
hvanhovell commented on PR #40176:
URL: https://github.com/apache/spark/pull/40176#issuecomment-1445521025
Merging
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To
hvanhovell commented on code in PR #40179:
URL: https://github.com/apache/spark/pull/40179#discussion_r1118185169
##
connector/connect/client/jvm/src/test/scala/org/apache/spark/sql/connect/client/CompatibilitySuite.scala:
##
@@ -155,6 +156,7 @@ class CompatibilitySuite extends
HyukjinKwon closed pull request #39991: [SPARK-42419][CONNECT][PYTHON] Migrate
into error framework for Spark Connect Column API.
URL: https://github.com/apache/spark/pull/39991
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub
HyukjinKwon commented on PR #39991:
URL: https://github.com/apache/spark/pull/39991#issuecomment-1445518870
Merged to master and branch-3.4.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
HyukjinKwon closed pull request #40172: [SPARK-42569][CONNECT][FOLLOW-UP] Throw
unsupported exceptions for persist
URL: https://github.com/apache/spark/pull/40172
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL
HyukjinKwon commented on PR #40172:
URL: https://github.com/apache/spark/pull/40172#issuecomment-1445518268
Merged to master and branch-3.4.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
HyukjinKwon commented on code in PR #40179:
URL: https://github.com/apache/spark/pull/40179#discussion_r1118183276
##
connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/Column.scala:
##
@@ -26,7 +26,7 @@ import
HyukjinKwon closed pull request #40170: [SPARK-42574][CONNECT][PYTHON] Fix
toPandas to handle duplicated column names
URL: https://github.com/apache/spark/pull/40170
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
HyukjinKwon commented on PR #40170:
URL: https://github.com/apache/spark/pull/40170#issuecomment-1445516304
Merged to master and branch-3.4.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
HyukjinKwon commented on code in PR #40178:
URL: https://github.com/apache/spark/pull/40178#discussion_r1118182170
##
docs/building-spark.md:
##
@@ -276,34 +276,6 @@ Enable the profile (e.g. 2.13):
# For sbt
./build/sbt -Pscala-2.13 compile
-## Running Jenkins tests
hvanhovell opened a new pull request, #40179:
URL: https://github.com/apache/spark/pull/40179
### What changes were proposed in this pull request?
This PR adds the ColumnName for the Spark Connect Scala Client. This is a
stepping stone to implement the SQLImplicits.
### Why are
dongjoon-hyun commented on PR #40178:
URL: https://github.com/apache/spark/pull/40178#issuecomment-1445501633
Lastly, are you claiming a followup across `spark-website` and `spark`
repositories? To me, `[FOLLOWUP]` doesn't make sense at all in that case,
@bjornjorgensen .
--
This is an
dongjoon-hyun commented on PR #40178:
URL: https://github.com/apache/spark/pull/40178#issuecomment-1445501723
Also, cc @HyukjinKwon .
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
dongjoon-hyun commented on PR #40178:
URL: https://github.com/apache/spark/pull/40178#issuecomment-1445501146
To be clear, the code change itself looks okay, @bjornjorgensen .
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub
wangyum commented on PR #40177:
URL: https://github.com/apache/spark/pull/40177#issuecomment-1445494765
cc @cloud-fan @ulysses-you
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
NarekDW commented on PR #40040:
URL: https://github.com/apache/spark/pull/40040#issuecomment-1445466115
Also, I'd like to share some performance measurements from my local machine,
using JMH:
code example:
```java
...
@Benchmark
public void
hvanhovell commented on PR #40175:
URL: https://github.com/apache/spark/pull/40175#issuecomment-1445462573
@cloud-fan can you take a look?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
bjornjorgensen commented on PR #40178:
URL: https://github.com/apache/spark/pull/40178#issuecomment-1445419415
And CC @xinrong-meng This is for updating documentation for spark 3.4
release.
--
This is an automated message from the Apache Git Service.
To respond to the message, please
bjornjorgensen commented on PR #40178:
URL: https://github.com/apache/spark/pull/40178#issuecomment-1445418994
@srowen @dongjoon-hyun @viirya
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
bjornjorgensen opened a new pull request, #40178:
URL: https://github.com/apache/spark/pull/40178
### What changes were proposed in this pull request?
Remove Jenkins from web page.
This is a followup on https://github.com/apache/spark-website/pull/442
### Why are the changes
dtenedor commented on PR #39678:
URL: https://github.com/apache/spark/pull/39678#issuecomment-1445414348
Hi @RyanBerti just checking up on this :) are you back from PTO and still
interested in this work?
--
This is an automated message from the Apache Git Service.
To respond to the
NarekDW commented on PR #40040:
URL: https://github.com/apache/spark/pull/40040#issuecomment-1445410662
@srielau could you take a look, pls?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the
srowen commented on PR #40116:
URL: https://github.com/apache/spark/pull/40116#issuecomment-1445388789
Looks better. Title should start with `[SPARK-41391]` to link it. Please
include the description in the title; there is nothing there now
--
This is an automated message from the Apache
wangyum opened a new pull request, #40177:
URL: https://github.com/apache/spark/pull/40177
### What changes were proposed in this pull request?
Enhance `EliminateOuterJoin` by removing the outer join if they are all
distinct aggregate functions. For example:
```sql
SELECT
panbingkun opened a new pull request, #40176:
URL: https://github.com/apache/spark/pull/40176
### What changes were proposed in this pull request?
The pr aims to implement SparkSession.version and SparkSession.time.
### Why are the changes needed?
API coverage.
### Does
ritikam2 commented on PR #40116:
URL: https://github.com/apache/spark/pull/40116#issuecomment-1445296889
Sean I tried to correct the two things pointed out by you. Let me know if
that works
--
This is an automated message from the Apache Git Service.
To respond to the message, please
87 matches
Mail list logo