Github user dongjoon-hyun commented on the issue:
https://github.com/apache/spark/pull/19470
Ya, that was my question, too.
- What kind of difference does Spark support, especially in ORC? Apache
Spark only supports HiveFileFormat so far, not old OrcFileFormat.
- In addition, there is no Schema Merging. Randomly (usually the bigging
ORC file?), the first correct ORC file schema is used now. For old ORC cases,
those are meaningless like `_colX`. For me, HiveMetastore schema is the only
valid one in Apache Spark.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]