JingsongLi opened a new pull request, #8275: URL: https://github.com/apache/paimon/pull/8275
## Summary This PR merges the previous singleton and parallel single-column global index writer APIs into one `GlobalIndexSingleColumnWriter` interface. Single-column index writers now receive the caller-provided shard-relative row id through `write(@Nullable Object key, long relativeRowId)`. ## Changes - Replace `GlobalIndexSingletonWriter` and `GlobalIndexParallelWriter` with `GlobalIndexSingleColumnWriter`. - Update BTree, Vector, Lumina, and Tantivy global index writers to implement the unified single-column writer API. - Pass explicit shard-relative row ids from BTree, Flink, and Spark index build paths. - Keep vector/full-text row counts as logical row counts while persisting caller-provided row ids for non-null indexed entries. - Update test helper index formats and affected tests to read/write explicit relative row ids. ## Testing - `mvn -pl paimon-vector/paimon-vector-index -am -Pfast-build -DskipTests test-compile` - `mvn -pl paimon-lumina -am -Pfast-build -DskipTests test-compile` - `mvn -pl paimon-tantivy/paimon-tantivy-index -am -Pfast-build -DskipTests test-compile` - `mvn -pl paimon-flink/paimon-flink-common -am -Pfast-build -DskipTests test-compile` - `mvn -pl paimon-spark/paimon-spark-common -am -Pfast-build -DskipTests compile` - `git diff --check` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
