JingsongLi opened a new pull request, #8275:
URL: https://github.com/apache/paimon/pull/8275

   ## Summary
   
   This PR merges the previous singleton and parallel single-column global 
index writer APIs into one `GlobalIndexSingleColumnWriter` interface. 
Single-column index writers now receive the caller-provided shard-relative row 
id through `write(@Nullable Object key, long relativeRowId)`.
   
   ## Changes
   
   - Replace `GlobalIndexSingletonWriter` and `GlobalIndexParallelWriter` with 
`GlobalIndexSingleColumnWriter`.
   - Update BTree, Vector, Lumina, and Tantivy global index writers to 
implement the unified single-column writer API.
   - Pass explicit shard-relative row ids from BTree, Flink, and Spark index 
build paths.
   - Keep vector/full-text row counts as logical row counts while persisting 
caller-provided row ids for non-null indexed entries.
   - Update test helper index formats and affected tests to read/write explicit 
relative row ids.
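
   A minimal sketch of how the unified API could look from a caller's side. Only the interface name and the `write(@Nullable Object key, long relativeRowId)` signature come from this PR description. The `RecordingWriter` class, the `writeShard` helper, and all field names are hypothetical and are not Paimon code. The `@Nullable` annotation is omitted so the block compiles without extra dependencies.

   ```java
   import java.util.ArrayList;
   import java.util.List;

   public class SingleColumnWriterSketch {

       /** Stand-in for the unified interface; the real one lives in Paimon and may declare more methods. */
       public interface GlobalIndexSingleColumnWriter {
           // key may be null; relativeRowId is relative to the start of the shard
           void write(Object key, long relativeRowId);
       }

       /** Hypothetical toy writer: counts every row, but records row ids only for non-null keys. */
       public static final class RecordingWriter implements GlobalIndexSingleColumnWriter {
           public final List<Long> indexedRowIds = new ArrayList<>();
           public long rowCount = 0;

           @Override
           public void write(Object key, long relativeRowId) {
               rowCount++; // logical row count includes null keys
               if (key != null) {
                   indexedRowIds.add(relativeRowId); // persist the caller-provided id
               }
           }
       }

       /** Hypothetical build path: the caller, not the writer, derives the shard-relative row id. */
       public static void writeShard(
               GlobalIndexSingleColumnWriter writer,
               Object[] keys,
               long shardStartRowId,
               long firstRowId) {
           for (int i = 0; i < keys.length; i++) {
               long absoluteRowId = firstRowId + i;
               writer.write(keys[i], absoluteRowId - shardStartRowId);
           }
       }

       public static void main(String[] args) {
           RecordingWriter writer = new RecordingWriter();
           writeShard(writer, new Object[] {"a", null, "c"}, 1000L, 1002L);
           System.out.println("rows=" + writer.rowCount + ", indexed=" + writer.indexedRowIds);
       }
   }
   ```

   Because the writer no longer derives row ids from its own call counter, a null key in this sketch leaves a gap in the recorded ids rather than shifting later entries.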
   
   ## Testing
   
   - `mvn -pl paimon-vector/paimon-vector-index -am -Pfast-build -DskipTests test-compile`
   - `mvn -pl paimon-lumina -am -Pfast-build -DskipTests test-compile`
   - `mvn -pl paimon-tantivy/paimon-tantivy-index -am -Pfast-build -DskipTests test-compile`
   - `mvn -pl paimon-flink/paimon-flink-common -am -Pfast-build -DskipTests test-compile`
   - `mvn -pl paimon-spark/paimon-spark-common -am -Pfast-build -DskipTests compile`
   - `git diff --check`
   

