[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/13907 I see. Let me close it. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13907 I don't think it's a very useful feature, and we may surprise users, since they did use Hive syntax to specify the row format. Advanced users can easily use `USING xxx` to explicitly create a data source table for better performance.
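To make the contrast in this comment concrete, here is a minimal sketch of the two DDL styles being discussed: the data source form (`USING xxx`) and the Hive form (`STORED AS`). The helper `create_table_ddl` and the table and column names are hypothetical and exist only for illustration; the sketch only builds the statement text and does not run anything against Spark.

```python
def create_table_ddl(table, columns, fmt, data_source=True):
    """Build a CREATE TABLE statement in one of the two styles.

    Hypothetical helper for illustration only; it is not part of Spark.
    columns is a list of (name, type) pairs.
    """
    cols = ", ".join(f"{name} {dtype}" for name, dtype in columns)
    if data_source:
        # Data source table: the format is named explicitly with USING.
        return f"CREATE TABLE {table} ({cols}) USING {fmt.lower()}"
    # Hive-syntax table: the format is given with STORED AS.
    return f"CREATE TABLE {table} ({cols}) STORED AS {fmt.upper()}"


# Hypothetical table and column names.
columns = [("id", "INT"), ("name", "STRING")]
print(create_table_ddl("events", columns, "parquet"))
print(create_table_ddl("events", columns, "parquet", data_source=False))
```

The first statement is the form suggested here for users who want a data source table; the second is the Hive-syntax form whose conversion the PR proposes.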
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/13907 This is not contained in https://github.com/apache/spark/pull/14482. Should I leave it open? Or should I fix the conflict after https://github.com/apache/spark/pull/14482 is merged?
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/13907 Nope. If users do not specify the input and output formats, we will use the default `INPUTFORMAT`, which is `org.apache.hadoop.mapred.TextInputFormat`, and the default `OUTPUTFORMAT`, which is `org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat`. These differ from the standard input and output formats for `ORC`: `org.apache.hadoop.hive.ql.io.orc.OrcInputFormat` and `org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat`. I am not sure whether we should still convert such tables. Please let me know if you think we should. Thanks!
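The mismatch described in this comment can be sketched as a small check. This is an illustration under stated assumptions, not the PR's code: `HiveStorage` and `uses_standard_orc_formats` are hypothetical names, and the only facts taken from the thread are the class names and the defaults applied when the formats are left unspecified.

```python
from dataclasses import dataclass

# Class names quoted in the thread.
TEXT_INPUT = "org.apache.hadoop.mapred.TextInputFormat"
TEXT_OUTPUT = "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat"
ORC_INPUT = "org.apache.hadoop.hive.ql.io.orc.OrcInputFormat"
ORC_OUTPUT = "org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat"
ORC_SERDE = "org.apache.hadoop.hive.ql.io.orc.OrcSerde"


@dataclass(frozen=True)
class HiveStorage:
    """Hypothetical stand-in for a table's storage description.

    The formats fall back to the text defaults when the user
    gives only ROW FORMAT SERDE, as described in the comment.
    """
    serde: str
    input_format: str = TEXT_INPUT
    output_format: str = TEXT_OUTPUT


def uses_standard_orc_formats(storage):
    """True only if serde, input format and output format are all the ORC ones."""
    return (
        storage.serde == ORC_SERDE
        and storage.input_format == ORC_INPUT
        and storage.output_format == ORC_OUTPUT
    )


# Only the serde is specified, so the text defaults apply.
serde_only = HiveStorage(serde=ORC_SERDE)
print(uses_standard_orc_formats(serde_only))

# All three pieces are the ORC ones.
full_orc = HiveStorage(ORC_SERDE, ORC_INPUT, ORC_OUTPUT)
print(uses_standard_orc_formats(full_orc))
```

A table declared with only the ORC serde fails this check because its formats are the text defaults, which is the case the comment is unsure about converting.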
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/13907 With your PR, if users specify `ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.orc.OrcSerde'`, will we convert?
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13907 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/61253/ Test PASSed.
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13907 Merged build finished. Test PASSed.
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13907 **[Test build #61253 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61253/consoleFull)** for PR 13907 at commit [`a9ce0d8`](https://github.com/apache/spark/commit/a9ce0d8342a2c3768823b4dd120fda0997b1c313).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.
[GitHub] spark issue #13907: [SPARK-16209] [SQL] Convert Hive Tables in PARQUET/ORC t...
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13907 **[Test build #61253 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/61253/consoleFull)** for PR 13907 at commit [`a9ce0d8`](https://github.com/apache/spark/commit/a9ce0d8342a2c3768823b4dd120fda0997b1c313).