rdblue commented on issue #830: Support name mapping resolution for parquet
URL: https://github.com/apache/incubator-iceberg/pull/830#issuecomment-610535477
 
 
   > The remaining needs a double check.
   
   I don't see replies to my comments. Can you reply so I know what you want me 
to double-check and why?
   
   > The ID fallback way cannot work when the iceberg table schema is different 
from the parquet file schema as the original issue mentioned
   
   Non-Iceberg table schemas are maintained either by position (like CSV) or by 
name (like JSON). The fallback strategy needs to match how the schema has been 
maintained for a table. We implemented position-based fallback because, before 
Iceberg, we maintained schemas by position: we could rename and add fields, but 
not drop or reorder them. Name mapping adds support for the name-based approach 
to maintaining a schema. Both approaches are valid.
   
   You're right that some files are incompatible with position-based fallback, 
but that just shows that you have to maintain a schema consistently. You can't 
mix position-based and name-based maintenance in the same table.
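
As a rough illustration of the two fallback strategies described above, here is a minimal Python sketch of how field IDs might be assigned to the columns of a file that carries no IDs. This is not Iceberg's implementation: the helper names (`ids_by_position`, `ids_by_name`), the example schema, and the shape of the name mapping are all hypothetical and chosen only to show why the two strategies disagree on a reordered file.

```python
# Hypothetical sketch; none of these names are Iceberg APIs.

def ids_by_position(table_fields, file_columns):
    """Position-based fallback: the i-th file column takes the i-th table field's ID."""
    return {col: field_id for col, (field_id, _name) in zip(file_columns, table_fields)}


def ids_by_name(name_mapping, file_columns):
    """Name-based fallback: look each file column up in a name -> field ID mapping."""
    return {col: name_mapping[col] for col in file_columns if col in name_mapping}


# Assumed table schema as (field_id, current_name) pairs.
table_fields = [(1, "id"), (2, "event_ts"), (3, "payload")]

# Assumed name mapping; "ts" is kept as an older name for field 2.
name_mapping = {"id": 1, "ts": 2, "event_ts": 2, "payload": 3}

# A file written before "ts" was renamed and before "payload" was added.
# Columns were only renamed or appended, so both strategies agree.
old_file = ["id", "ts"]
old_by_position = ids_by_position(table_fields, old_file)
old_by_name = ids_by_name(name_mapping, old_file)

# A file whose columns are in a different order, as can happen when the
# schema is maintained by name. Position-based assignment gives wrong IDs.
reordered_file = ["payload", "id"]
reordered_by_position = ids_by_position(table_fields, reordered_file)
reordered_by_name = ids_by_name(name_mapping, reordered_file)

print(old_by_position, old_by_name)
print(reordered_by_position, reordered_by_name)
```

In this sketch the position-based result for the reordered file maps `payload` to field 1, which is the kind of incompatibility raised in the quoted comment. The file is only readable correctly if the table sticks to the strategy that matches how its schema was maintained.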

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

