[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-08-15 Thread MaxGekk
Github user MaxGekk commented on the issue:

https://github.com/apache/spark/pull/21296
  
>  ... does this push down the column selection into the reading of data

Yes, it does. 


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-08-15 Thread koertkuipers
Github user koertkuipers commented on the issue:

https://github.com/apache/spark/pull/21296
  
if i do not select a schema (and i use inferSchema), and i do a select for 
only a few column, does this push down the column selection into the reading of 
data (for schema inference and for the actual data read)?


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-23 Thread gatorsmile
Github user gatorsmile commented on the issue:

https://github.com/apache/spark/pull/21296
  

https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91045/testReport/org.apache.spark.sql.execution.datasources.csv/UnivocityParserSuite/_It_is_not_a_test_it_is_a_sbt_testing_SuiteSelector_/history/


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-23 Thread gatorsmile
Github user gatorsmile commented on the issue:

https://github.com/apache/spark/pull/21296
  
This PR breaks the tests. Let me revert it. Please resubmit the PR with the 
fix. Thanks!


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-22 Thread cloud-fan
Github user cloud-fan commented on the issue:

https://github.com/apache/spark/pull/21296
  
thanks, merging to master!


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-22 Thread MaxGekk
Github user MaxGekk commented on the issue:

https://github.com/apache/spark/pull/21296
  
I added the word `parser` to the feature name because as @HyukjinKwon wrote 
above we do pruning in type conversion already. This PR enables column pruning 
by CSV parser only.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-22 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue:

https://github.com/apache/spark/pull/21296
  
Just for clarification, I'm okay.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-22 Thread cloud-fan
Github user cloud-fan commented on the issue:

https://github.com/apache/spark/pull/21296
  
LGTM except a minor comment about the config name


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-22 Thread MaxGekk
Github user MaxGekk commented on the issue:

https://github.com/apache/spark/pull/21296
  
@cloud-fan @HyukjinKwon Could you look at the PR, please.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/90797/
Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Merged build finished. Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-18 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90797 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90797/testReport)**
 for PR 21296 at commit 
[`6ff6d4f`](https://github.com/apache/spark/commit/6ff6d4fda9f7e8ee43d7aa04818204de4c49440b).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-18 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90797 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90797/testReport)**
 for PR 21296 at commit 
[`6ff6d4f`](https://github.com/apache/spark/commit/6ff6d4fda9f7e8ee43d7aa04818204de4c49440b).


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Merged build finished. Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/90752/
Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90752 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90752/testReport)**
 for PR 21296 at commit 
[`f89eeb7`](https://github.com/apache/spark/commit/f89eeb7f7ba86888ad3f7994577a4d4ebbf09197).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/90751/
Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Merged build finished. Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90751 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90751/testReport)**
 for PR 21296 at commit 
[`7dcfc7a`](https://github.com/apache/spark/commit/7dcfc7a7664fcd5311cb352f0ea7a24b3cc1c639).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `  case class WidenSetOperationTypes(conf: SQLConf) extends 
Rule[LogicalPlan] `
  * `  case class FunctionArgumentConversion(conf: SQLConf) extends 
TypeCoercionRule `
  * `  case class CaseWhenCoercion(conf: SQLConf) extends TypeCoercionRule `
  * `  case class IfCoercion(conf: SQLConf) extends TypeCoercionRule `
  * `  case class ImplicitTypeCasts(conf: SQLConf) extends TypeCoercionRule 
`
  * `class ContinuousMemoryStream[A : Encoder](id: Int, sqlContext: 
SQLContext, numPartitions: Int = 2)`


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90752 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90752/testReport)**
 for PR 21296 at commit 
[`f89eeb7`](https://github.com/apache/spark/commit/f89eeb7f7ba86888ad3f7994577a4d4ebbf09197).


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-17 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90751 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90751/testReport)**
 for PR 21296 at commit 
[`7dcfc7a`](https://github.com/apache/spark/commit/7dcfc7a7664fcd5311cb352f0ea7a24b3cc1c639).


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Merged build finished. Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue:

https://github.com/apache/spark/pull/21296
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/90555/
Test PASSed.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-13 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90555 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90555/testReport)**
 for PR 21296 at commit 
[`f90daa7`](https://github.com/apache/spark/commit/f90daa7ea33d119be978c27de10978c2d6281e25).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-13 Thread SparkQA
Github user SparkQA commented on the issue:

https://github.com/apache/spark/pull/21296
  
**[Test build #90555 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/90555/testReport)**
 for PR 21296 at commit 
[`f90daa7`](https://github.com/apache/spark/commit/f90daa7ea33d119be978c27de10978c2d6281e25).


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-11 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue:

https://github.com/apache/spark/pull/21296
  
Can we update the migration guide then? I want to see if the note makes 
sense.


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark issue #21296: [SPARK-24244][SQL] Passing only required columns to the ...

2018-05-11 Thread cloud-fan
Github user cloud-fan commented on the issue:

https://github.com/apache/spark/pull/21296
  
I'm fine with breaking the CSV behavior of malformed input a little, as 
long as we documented it well in the migration guide. cc @gatorsmile 


---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org