[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/70336/ Test PASSed. ---

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-18 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #70336 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70336/testReport)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-18 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #70336 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70336/testReport)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-18 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 ping @cloud-fan @yhuai May you take a look? Thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14365 It looks good to me. Thank you for updating, @viirya ! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69856/ Test PASSed. ---

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #69856 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69856/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #69856 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69856/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-08 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69852/ Test FAILed. ---

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 @cloud-fan @yhuai @dongjoon-hyun I've updated this as: * Assume metastore schema matches with physical Orc schema by column, disregarding column names. * Mapping required schema to

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #69852 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69852/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 We have two options. First one is to map metastore schema to physical Orc schema like this. But we don't infer physical schema of Orc file now. I will update this to have this mapping in

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 @dongjoon-hyun yeah, I see. Because we directly use metastore schema of converted Orc table, when the physical schema in Orc file and metastore schema mismatch, this issue happens. --- If your

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-07 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14365 It seems to exist some cases like the following (on the current master). On hive, ```sql CREATE TABLE t1 (a string) PARTITIONED BY (b string) STORED AS ORC; INSERT INTO

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 As we replace schema inferring with metastore schema completely by #14690 for converted Hive tables, we may not have the chance to detect the mismatch between Orc file's physical schema and

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-12-02 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14365 Hi, @viirya , @cloud-fan , @yhuai . #14690 and the parent issue SPARK-17861 seems to be resolved? What about restarting this issue? --- If your project is set up for it, you can reply

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-10-10 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/14365 @cloud-fan actually, this conversion was disabled because of this bug. btw, pr that @cloud-fan mentioned is https://github.com/apache/spark/pull/14690. I think it is better to hold

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-10-09 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14365 I noticed that `spark.sql.hive.convertMetastoreOrc` is false by default, so this bug is not so critical. And we may stop inferring file schema when reading ORC tables, for performance

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-10-05 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 ping @cloud-fan Can you have a decision about this? Are we going to have complex logic for this issue? Or just disable it? --- If your project is set up for it, you can reply to this email and have

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-12 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 @cloud-fan So do we have decision on this? A simpler approach to disable the conversion if the schema is inconsistent, or a complex one to work around this Hive bug? --- If your project is set up

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-04 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63214/ Test PASSed. ---

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-04 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-04 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #63214 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63214/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-04 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #63214 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63214/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-02 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 @cloud-fan I've submitted a PR #14282 previously that disables the conversion if the schema is inconsistent. --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-02 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14365 @rxin @yhuai , is it worth adding this complex logic to work around this hive bug? This feature(convert Metastore Orc tables to data source table) is not a major feature and the broken case seems

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-08-02 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 cc @yhuai @liancheng @rxin --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62993/ Test PASSed. ---

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-28 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #62993 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62993/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-28 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #62993 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62993/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-28 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14365 cc @cloud-fan @yhuai @liancheng @rxin Please review this change. Thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62880/ Test PASSed. ---

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #62880 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62880/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14365 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62875/ Test FAILed. ---

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #62875 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62875/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #62880 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62880/consoleFull)** for PR 14365 at commit

[GitHub] spark issue #14365: [SPARK-16628][SQL] Translate file-based relation schema ...

2016-07-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14365 **[Test build #62875 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62875/consoleFull)** for PR 14365 at commit