geruh commented on code in PR #2941:
URL: https://github.com/apache/iceberg-python/pull/2941#discussion_r2722631506


##########
pyiceberg/catalog/hive.py:
##########
@@ -551,23 +551,32 @@ def commit_table(
 
                 if hive_table and current_table:
                     # Table exists, update it.
-                    new_parameters = _construct_parameters(
+
+                    # Note on table properties:
+                    # - Iceberg table properties are stored in both HMS and 
Iceberg metadata JSON.
+                    # - Updates are reflected in both locations
+                    # - Existing HMS table properties (set by external systems 
like Hive/Spark) are preserved.
+                    #
+                    # While it is possible to modify HMS table properties 
through this API, it is not recommended:
+                    # - Mixing HMS-specific properties in Iceberg metadata can 
cause confusion
+                    # - New/updated HMS table properties will also be stored 
in Iceberg metadata (even though it is HMS-specific)
+                    # - HMS-native properties (set outside Iceberg) cannot be 
deleted since they are not visible to Iceberg
+                    #   (However, if you first SET an HMS property via 
Iceberg, it becomes tracked in Iceberg metadata,
+                    #   and can then be deleted via Iceberg - which removes it 
from both Iceberg metadata and HMS)
+                    new_iceberg_properties = _construct_parameters(
                         
metadata_location=updated_staged_table.metadata_location,
                         
previous_metadata_location=current_table.metadata_location,
                         metadata_properties=updated_staged_table.properties,
                     )
-
                     # Detect properties that were removed from Iceberg metadata
-                    removed_keys = current_table.properties.keys() - 
updated_staged_table.properties.keys()
-
-                    # Sync HMS parameters: Iceberg metadata is the source of 
truth, HMS parameters are
-                    # a projection of Iceberg state plus any HMS-only 
properties.
-                    # Start with existing HMS params, remove deleted Iceberg 
properties, then apply Iceberg values.
-                    merged_params = dict(hive_table.parameters or {})
-                    for key in removed_keys:
-                        merged_params.pop(key, None)
-                    merged_params.update(new_parameters)
-                    hive_table.parameters = merged_params
+                    deleted_iceberg_properties = 
current_table.properties.keys() - updated_staged_table.properties.keys()

Review Comment:
   nit: this is non blocking but seems like this logic is technically still 
"constructing parameters". wdyt of moving this into constructing parameters and 
adding these comments as a python docstring?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to