JingsongLi opened a new pull request, #8077:
URL: https://github.com/apache/paimon/pull/8077

   ## Summary
   
   Fix updating BLOB columns in data-evolution append tables for both Java and 
Python paths. This PR also adds regression coverage for updating a blob column 
while preserving untouched blob values.
   
   ## Changes
   
   - Allow Java dedicated-format writes to produce only dedicated blob/vector 
files when the write type contains no normal columns, avoiding empty parquet 
schema writes.
   - Include matching blob files when Python row-id updates read original rows 
for merge, so non-updated rows keep their previous blob values.
   - Add Java and Python regression tests for updating blob columns.
   
   ## Testing
   
   - [x] `mvn -pl paimon-core -Pfast-build -Dtest=BlobTableTest#testBasic test`
   - [x] `mvn -pl paimon-core -Pfast-build 
-Dtest=BlobTableTest#testUpdateBlobColumn test`
   - [x] `python -m pytest 
paimon-python/pypaimon/tests/blob_table_test.py::DedicatedFormatWriterTest::test_update_blob_column
 -q`
   - [x] `python -m pytest 
paimon-python/pypaimon/tests/table_update_test.py::TableUpdateBatchTest::test_update_existing_column
 
paimon-python/pypaimon/tests/table_update_test.py::TableUpdateBatchTest::test_update_preserves_other_columns
 
paimon-python/pypaimon/tests/blob_table_test.py::DedicatedFormatWriterTest::test_dedicated_format_writer_partial_write_with_write_type
 -q`
   - [x] `git diff --check`
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to