JingsongLi opened a new pull request, #8077: URL: https://github.com/apache/paimon/pull/8077
## Summary Fix updating BLOB columns in data-evolution append tables for both Java and Python paths. This PR also adds regression coverage for updating a blob column while preserving untouched blob values. ## Changes - Allow Java dedicated-format writes to produce only dedicated blob/vector files when the write type contains no normal columns, avoiding empty parquet schema writes. - Include matching blob files when Python row-id updates read original rows for merge, so non-updated rows keep their previous blob values. - Add Java and Python regression tests for updating blob columns. ## Testing - [x] `mvn -pl paimon-core -Pfast-build -Dtest=BlobTableTest#testBasic test` - [x] `mvn -pl paimon-core -Pfast-build -Dtest=BlobTableTest#testUpdateBlobColumn test` - [x] `python -m pytest paimon-python/pypaimon/tests/blob_table_test.py::DedicatedFormatWriterTest::test_update_blob_column -q` - [x] `python -m pytest paimon-python/pypaimon/tests/table_update_test.py::TableUpdateBatchTest::test_update_existing_column paimon-python/pypaimon/tests/table_update_test.py::TableUpdateBatchTest::test_update_preserves_other_columns paimon-python/pypaimon/tests/blob_table_test.py::DedicatedFormatWriterTest::test_dedicated_format_writer_partial_write_with_write_type -q` - [x] `git diff --check` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
