GitHub user jgoleary opened a pull request:
https://github.com/apache/spark/pull/19326
[SPARK-22107] Change as to alias in python quickstart
## What changes were proposed in this pull request?
Updated docs so that a line of python in the quick start guide executes.
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19326
Can one of the admins verify this patch?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
GitHub user joseph-torres opened a pull request:
https://github.com/apache/spark/pull/19327
[WIP] Implement stream-stream outer joins.
## What changes were proposed in this pull request?
Allow one-sided outer joins between two streams when a watermark is defined.
Github user BryanCutler commented on a diff in the pull request:
https://github.com/apache/spark/pull/19122#discussion_r140600118
--- Diff: python/pyspark/ml/tests.py ---
@@ -836,6 +836,27 @@ def test_save_load_simple_estimator(self):
loadedModel =
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19311
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82091/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19311
**[Test build #82091 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82091/testReport)**
for PR 19311 at commit
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/19325
retest this please
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user vanzin commented on a diff in the pull request:
https://github.com/apache/spark/pull/19311#discussion_r140612177
--- Diff:
core/src/test/scala/org/apache/spark/storage/MemoryStoreSuite.scala ---
@@ -407,4 +407,119 @@ class MemoryStoreSuite
})
Github user mallman commented on the issue:
https://github.com/apache/spark/pull/16578
> @mallman how about adding comment explaining why such workaround was done
+ bug number in parquet-mr ? So in future once that bug is fixed, code can be
cleaned.
It will take me more time
GitHub user BryanCutler opened a pull request:
https://github.com/apache/spark/pull/19325
[SPARK--22106][PYSPARK][SQL] Disable 0-parameter pandas_udf and add doctests
## What changes were proposed in this pull request?
This change disables the use of 0-parameter pandas_udfs
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19323
**[Test build #82095 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82095/testReport)**
for PR 19323 at commit
Github user mallman commented on a diff in the pull request:
https://github.com/apache/spark/pull/16578#discussion_r140611282
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaPruning.scala
---
@@ -0,0 +1,130 @@
+/*
+ *
Github user kevinyu98 closed the pull request at:
https://github.com/apache/spark/pull/19302
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19323
**[Test build #82095 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82095/testReport)**
for PR 19323 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19323
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19323
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82095/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19325
**[Test build #82093 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82093/testReport)**
for PR 19325 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19325
**[Test build #82096 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82096/testReport)**
for PR 19325 at commit
Github user kevinyu98 commented on the issue:
https://github.com/apache/spark/pull/19302
I am opening a new jira SPARK-22088 for this. I will close this PR. The
style fails is because a new JIRA SPARK-22088 fixed a style issue after I
submit my PR. I have included that JIRA in my new
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/19325
cc @HyukjinKwon @viirya
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19311
**[Test build #82091 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82091/testReport)**
for PR 19311 at commit
Github user tgravescs commented on the issue:
https://github.com/apache/spark/pull/19194
Jenkins, test this please
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19327
**[Test build #82094 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82094/testReport)**
for PR 19327 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19325
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82093/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19194
**[Test build #82092 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82092/testReport)**
for PR 19194 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16578
**[Test build #82097 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82097/testReport)**
for PR 16578 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19327
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82094/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19327
**[Test build #82094 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82094/testReport)**
for PR 19327 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19327
Merged build finished. Test FAILed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19194
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82092/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19194
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/19325
Adding the patch to enable 0-parameter pandas_udf if it is requested in the
future
Github user hvanhovell commented on the issue:
https://github.com/apache/spark/pull/19323
retest this please
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19194
**[Test build #82092 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82092/testReport)**
for PR 19194 at commit
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/19325
This is a followup to #18659
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands,
Github user BryanCutler commented on the issue:
https://github.com/apache/spark/pull/19325
@cloud-fan @ueshin I'm not sure if you are ok with merging this soon, but
in adding the doctests I found there were problems with using the decorator and
having empty partitions. I fixed those
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19325
**[Test build #82093 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82093/testReport)**
for PR 19325 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19325
Merged build finished. Test FAILed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19311
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/19320
I see, you're saying the display of a bunch of job IDs is like "[1] [2]
[3]" elsewhere? consistency is the most important thing IMHO. I'd just match
whatever a similar column does elsewhere.
---
Github user viirya commented on the issue:
https://github.com/apache/spark/pull/19301
I asked it because by considering subexpressionElimination, you may not
actually run it multiple times. So the benchmark numbers can tell if your fix
really improves the performance.
---
Github user guoxiaolongzte commented on the issue:
https://github.com/apache/spark/pull/19144
@HyukjinKwon @jerryshao
Help to review the code, thanks.
---
-
To unsubscribe, e-mail:
Github user guoxiaolongzte commented on the issue:
https://github.com/apache/spark/pull/19242
@HyukjinKwon @jerryshao
Help to review the code, thanks.
---
-
To unsubscribe, e-mail:
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19321
**[Test build #82073 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82073/testReport)**
for PR 19321 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19321
Merged build finished. Test FAILed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19308
**[Test build #82077 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82077/testReport)**
for PR 19308 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19312
**[Test build #82071 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82071/testReport)**
for PR 19312 at commit
Github user ala commented on the issue:
https://github.com/apache/spark/pull/19308
@hvanhovell How about this?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19312
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82071/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19312
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/19144
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org
Github user guoxiaolongzte commented on the issue:
https://github.com/apache/spark/pull/19320
In JDBC / ODBC Server page.There are no other places yet.
---
-
To unsubscribe, e-mail:
Github user yanboliang commented on the issue:
https://github.com/apache/spark/pull/19020
@sethah To the issue that whether huber linear regression share codebase
with ```LinearRegression```, we have discussion at
[JIRA](https://issues.apache.org/jira/browse/SPARK-3181). At last
Github user zuotingbing commented on the issue:
https://github.com/apache/spark/pull/19300
if we find some mistakes likes this , shouldn't we to report it at first?
---
-
To unsubscribe, e-mail:
Github user srowen commented on a diff in the pull request:
https://github.com/apache/spark/pull/19242#discussion_r140450417
--- Diff: docs/configuration.md ---
@@ -740,6 +740,20 @@ Apart from these, the following properties are also
available, and may be useful
Github user srowen commented on a diff in the pull request:
https://github.com/apache/spark/pull/19242#discussion_r140450356
--- Diff: docs/configuration.md ---
@@ -740,6 +740,20 @@ Apart from these, the following properties are also
available, and may be useful
Github user sathiyapk commented on a diff in the pull request:
https://github.com/apache/spark/pull/19295#discussion_r140450459
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/SparkOptimizer.scala ---
@@ -28,12 +28,18 @@ class SparkOptimizer(
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/18015
**[Test build #82072 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82072/testReport)**
for PR 18015 at commit
Github user steveloughran commented on the issue:
https://github.com/apache/spark/pull/19294
As I play with commit logic all the way through the stack, I can' t help
thinking everyone's lives would be better if we tagged the MRv1 commit APIs as
deprecated in Hadoop 3. and uses of the
Github user viirya commented on a diff in the pull request:
https://github.com/apache/spark/pull/18945#discussion_r140419255
--- Diff: python/pyspark/sql/dataframe.py ---
@@ -1760,13 +1760,39 @@ def toPandas(self):
"if using
Github user HyukjinKwon commented on the issue:
https://github.com/apache/spark/pull/18945
Hey @logannc, let's don't make it complicated for now and go with their
ways first - https://github.com/apache/spark/pull/18945#discussion_r134033952
and
Github user daweicheng closed the pull request at:
https://github.com/apache/spark/pull/19279
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/19300
@HyukjinKwon will make this change in another PR. We do need to push back
on tiny PRs. It's a problem because some people are actually incentivized based
on number of patches and it costs the
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/19320
I agree with making things consistent. The question is, I guess, what the
JDBC/ODBC page does. You can match that.
---
-
To
Github user HyukjinKwon commented on the issue:
https://github.com/apache/spark/pull/19300
I'd help review other PRs and leave a comment when someone fixes some codes
around it rather than proposing it alone, or you can buffer it in your local
and flush it.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/18015
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/18015
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82072/
Test PASSed.
---
Github user guoxiaolongzte commented on the issue:
https://github.com/apache/spark/pull/19320
Do you agree with my PR or against my PR?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/19144
If this isn't getting support, it's not worth pinging indefinitely, please.
That said, I'll merge this.
---
-
To unsubscribe,
Github user guoxiaolongzte commented on the issue:
https://github.com/apache/spark/pull/19320
In JDBC / ODBC Server page, 'job ids' are separated by braces, as shown
below:
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19144
**[Test build #3931 has
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3931/testReport)**
for PR 19144 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19307
**[Test build #82076 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82076/testReport)**
for PR 19307 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19144
**[Test build #3931 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3931/testReport)**
for PR 19144 at commit
Github user sathiyapk commented on a diff in the pull request:
https://github.com/apache/spark/pull/19295#discussion_r140464377
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/SparkOptimizer.scala ---
@@ -28,12 +28,18 @@ class SparkOptimizer(
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19222
**[Test build #82078 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82078/testReport)**
for PR 19222 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19020
**[Test build #82075 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82075/testReport)**
for PR 19020 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19020
Merged build finished. Test FAILed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19020
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82075/
Test FAILed.
---
Github user yanboliang commented on the issue:
https://github.com/apache/spark/pull/19020
@jkbradley Thanks for your comments, I have addressed all your inline
comments. Please see replies to your other questions below:
> Echoing @WeichenXu123 's comment: Why use "epsilon" as the
Github user zuotingbing commented on the issue:
https://github.com/apache/spark/pull/19300
@cloud-fan @gatorsmile Could you please help to review this and merge it
to master? Thanks.
---
-
To unsubscribe, e-mail:
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19321
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82073/
Test FAILed.
---
Github user guoxiaolongzte commented on the issue:
https://github.com/apache/spark/pull/18015
And failed, who submitted the code in question.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For
Github user ala commented on a diff in the pull request:
https://github.com/apache/spark/pull/19308#discussion_r140459738
--- Diff:
sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/OffHeapColumnVector.java
---
@@ -517,6 +517,7 @@ public void
Github user kiszk commented on a diff in the pull request:
https://github.com/apache/spark/pull/19222#discussion_r140462525
--- Diff:
common/unsafe/src/main/java/org/apache/spark/unsafe/array/ByteArrayMethods.java
---
@@ -46,6 +47,42 @@ public static int
GitHub user guoxiaolongzte opened a pull request:
https://github.com/apache/spark/pull/19320
[SPARK-22099] The 'job ids' list style needs to be changed in the SQL page.
## What changes were proposed in this pull request?
The 'job ids' list style needs to be changed in the
Github user gatorsmile commented on a diff in the pull request:
https://github.com/apache/spark/pull/18015#discussion_r140421501
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala
---
@@ -61,7 +59,37 @@ private[ui] class
Github user HyukjinKwon commented on the issue:
https://github.com/apache/spark/pull/19319
I'd go with this PR / approach. This approach and PR look pretty good. Let
me help double check this tonight.
---
-
To
Github user gatorsmile commented on the issue:
https://github.com/apache/spark/pull/19281
Thanks! Merged to master.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19319
Merged build finished. Test FAILed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19319
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82069/
Test FAILed.
---
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/19315
Just ctrl-F and find?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19319
Merged build finished. Test PASSed.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19319
**[Test build #82070 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82070/testReport)**
for PR 19319 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/19319
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82070/
Test PASSed.
---
Github user wzhfy commented on the issue:
https://github.com/apache/spark/pull/19321
cc @cloud-fan @gatorsmile
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail:
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19321
**[Test build #82073 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82073/testReport)**
for PR 19321 at commit
Github user wzhfy commented on a diff in the pull request:
https://github.com/apache/spark/pull/19321#discussion_r140429643
--- Diff:
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/aggregate/ApproximatePercentileSuite.scala
---
@@ -270,7 +270,6 @@ class
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/19302
**[Test build #82074 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82074/testReport)**
for PR 19302 at commit
Github user zhengruifeng commented on the issue:
https://github.com/apache/spark/pull/19229
I am not familiar with SQL source, but I think it's great to transform all
columns at a time
---
-
To unsubscribe, e-mail:
Github user yanboliang commented on a diff in the pull request:
https://github.com/apache/spark/pull/19020#discussion_r140440076
--- Diff:
mllib/src/test/scala/org/apache/spark/ml/regression/LinearRegressionSuite.scala
---
@@ -998,6 +1047,172 @@ class LinearRegressionSuite
1 - 100 of 362 matches
Mail list logo