Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21320
@mallman Glad to see this got merged in. Thanks for all of your work
pushing through. I'm looking forward to the next phase. Please let me know if I
can help again. I did notice that w
Github user ajacques closed the pull request at:
https://github.com/apache/spark/pull/21889
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
Thanks for the response all. @mailman If it's really your preference, I
will create a PR against that branch and close this one. My intention was never
to take away from your efforts, and I
Github user ajacques commented on a diff in the pull request:
https://github.com/apache/spark/pull/21889#discussion_r210170646
--- Diff:
sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaPruningSuite.scala
---
@@ -0,0 +1,245
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21320
@mallman if you're planning on making more code changes, would you be
willing to work on a shared branch or something? I've been working to
incorporate the C
Github user ajacques commented on a diff in the pull request:
https://github.com/apache/spark/pull/21889#discussion_r209830673
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaPruning.scala
---
@@ -0,0 +1,200
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@gatorsmile Do you think there is a on deterministic failure in this change
that causes it to inconsistently fail?
---
-
To
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@mallman, while we wait for the go-no-go, do you have the changes for the
next PR ready? Is there anything you need help with
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
>> but @gatorsmile wants to review it in a follow-on PR.
> Where did he say it after the comment above?
It was my interpretation of this comment:
https://github.com/apa
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@HyukjinKwon Looks like most of your comments have been already addressed,
but I've gone ahead and made a few more tweaks to help this get merged. Please
let me know if any blocking comments
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
Is there anything I can do to help with this PR?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
Jenkins build successful. Any PR comments/blockers to merge for phase 1?
cc @HyukjinKwon, @gatorsmile, @cloud-fan
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
Alright to make sure we're all on the same page, it sounds like we're ready
to merge this PR pending:
* Successful build by Jenkins
* Any PR comments from a maintainer
Th
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@mallman Is it related to [this revert in
ParquetReadSupport](https://github.com/apache/spark/pull/21889/commits/0312a5188f0d6c9fc5304195dbdc703bf0aa3fb7#diff-245e70c1f41e353e34cf29bd00fd9029L86
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@mallman
`select id, name.middle, address from temp` - **Works**
`select name.middle, address from temp` - **Fails
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
The tests as committed pass for me, but I removed the `order by id` and I
got that error. Are you saying it works with the specific query in my comment
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@mallman: I've rebased on top of your changes and pushed. I'm seeing the
following:
Given the following schema:
```
root
|-- id: integer (nullable = true)
Github user ajacques commented on a diff in the pull request:
https://github.com/apache/spark/pull/21889#discussion_r207718713
--- Diff:
sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetSchemaPruningSuite.scala
---
@@ -0,0 +1,205
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@mallman: [This
one](https://github.com/apache/spark/pull/21889/files#diff-0c6c7481232e9637b91c179f1005426aR120)?
I just enabled it on my branch and the test passed. Was it fixed by your
latest
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
Are there any other blockers to enabling this by default now that @mallman
fixed the currently known broken queries?
---
-
To
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
Anybody else able to reproduce this failure? It succeeded on my developer
machine.
---
-
To unsubscribe, e-mail: reviews
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
These test failures are in Spark streaming. Is this just an intermittent
test failure or actually caused by this PR?
---
-
To
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
@mallman, sounds good I'll get this PR updated with your latest changes as
soon as I can.
---
-
To unsubscribe, e
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21889
Where does that leave both of these PRs? Do we still want this one with the
code refactoring or to go back to the original? Are there any comments for this
PR that would block merging? I'v
GitHub user ajacques opened a pull request:
https://github.com/apache/spark/pull/21889
[SPARK-4502][SQL] Parquet nested column pruning - foundation (2nd attempt)
(Link to Jira: https://issues.apache.org/jira/browse/SPARK-4502)
**This is a restart of apache/spark#21320. Most
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21320
To confirm we want to start a secondary PR based on my stylistic/minor
fixes? As I get up to speed on this code, I won't be able to make heavy
changes. I'll have some time tomorrow to t
Github user ajacques commented on a diff in the pull request:
https://github.com/apache/spark/pull/21320#discussion_r205329769
--- Diff:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/planning/SelectedField.scala
---
@@ -0,0 +1,134 @@
+/*
+ * Licensed to the
Github user ajacques commented on a diff in the pull request:
https://github.com/apache/spark/pull/21320#discussion_r205329633
--- Diff:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/planning/ProjectionOverSchema.scala
---
@@ -0,0 +1,62 @@
+/*
+ * Licensed to
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21320
@HyukjinKwon, I'm not totally familiar with Spark internals yet, so to be
honest I don't feel confident making big changes and hopefully can keep them
simple at first.
I'
Github user ajacques commented on the issue:
https://github.com/apache/spark/pull/21320
Hey @mallman, I want to thank you for your work on this so far. I've been
watching this pull request hoping this would get merged into 2.4 since it would
be a benefit to me, but can see h
30 matches
Mail list logo