I wanted to confirm an issue I'm seeing when a Hive table's schema is modified (columns deleted or re-ordered). It seems strange that ORC wouldn't support column re-ordering when the column name is written into the file. Possibly I'm doing something wrong? For example, if I have a Hive table with columns A, B, and C which contains existing data. If I alter the Hive table and remove column B. The new query results will return the old values of B for column C. Column C is no longer accessible since the table only contains 2 columns now. Is this a known issue? Are table modifications recommended against as a result? Thanks!
BenTimber.io -Blog -Github -Twitter
