WTa-hash opened a new issue #4593:
URL: https://github.com/apache/hudi/issues/4593


   Does Hudi support column re-ordering if there are no column 
additions/removals? According to 
https://hudi.apache.org/docs/schema_evolution/, it states that "Add a new 
nullable column and change the ordering of fields" is not supported. So if I 
perform only the 2nd half of that statement, which is "change the ordering of 
fields", will Hudi support this?
   
   **To Reproduce**
   
   Steps to reproduce the behavior: In my test script, I have 3 sets of data, 
1) initial data, 2) update #1, 3) update #2. In update #1, I'll be re-ordering 
the columns.
   
   1. Run test script: 
[TestData.txt](https://github.com/apache/hudi/files/7865586/TestData.txt)
   2. Observe output.
   Initial Data 
   
![1](https://user-images.githubusercontent.com/64644025/149400788-5df30ebd-2ecd-425e-ad16-963842b0bbb4.PNG)
   Update #1 
   
![2](https://user-images.githubusercontent.com/64644025/149400821-93afa4d8-5826-4a5f-9fbf-25483c9ba6ca.PNG)
   Update #2 
   
![3](https://user-images.githubusercontent.com/64644025/149400844-0ee7755b-d170-4c54-8bf1-6b2ce8053b3b.PNG)
   
   
   **Expected behavior**
   Does Hudi officially support just column-reordering? My existing Hudi table 
may have a certain column order, while the incoming data batch will contain the 
same columns but in different order.
   
   ENVIRONMENT:
   AWS EMR: 5.33.1
   Hudi version : 0.7.0-amzn-1
   Spark version : 2.4.7
   Hive version : 2.3.7
   Hadoop version : 2.10.1
   Storage (HDFS/S3/GCS..) : S3
   Running on Docker? (yes/no) : no


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to