Github user HyukjinKwon commented on a diff in the pull request:
https://github.com/apache/spark/pull/19871#discussion_r154881449
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala
---
@@ -587,7 +601,8 @@ object DataSource extends Logging {
if (provider1.toLowerCase(Locale.ROOT) == "orc" ||
provider1.startsWith("org.apache.spark.sql.hive.orc")) {
throw new AnalysisException(
- "The ORC data source must be used with Hive support
enabled")
+ "Hive-based ORC data source must be used with Hive
support enabled. " +
+ "Please use native ORC data source instead")
--- End diff --
I think we should make this more actionable, saying `spark.sql.orc.impl`
should be set to `native` explicitly.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]