andygrove commented on code in PR #1987:
URL: https://github.com/apache/datafusion-comet/pull/1987#discussion_r2260899713
##########
.github/workflows/iceberg_spark_test.yml:
##########
@@ -63,24 +63,96 @@ jobs:
with:
rust-version: ${{env.RUST_VERSION}}
jdk-version: ${{ matrix.java-version }}
+ - name: Build Comet
+ shell: bash
+ run: |
+ PROFILES="-Pspark-${{matrix.spark-version.short}}
-Pscala-${{matrix.scala-version}}" make release
- name: Setup Iceberg
uses: ./.github/actions/setup-iceberg-builder
with:
iceberg-version: ${{ matrix.iceberg-version.full }}
- scala-version: ${{ matrix.scala-version }}
- spark-short-version: ${{ matrix.spark-version.short }}
- - name: Build local Spark jar with comet patch
- uses: ./.github/actions/setup-spark-local-jar
- with:
- spark-short-version: ${{ matrix.spark-version.short }}
- spark-version: ${{ matrix.spark-version.full }}
- scala-version: ${{ matrix.scala-version }}
- name: Run Iceberg Spark tests
run: |
cd apache-iceberg
rm -rf /root/.m2/repository/org/apache/parquet # somehow parquet
cache requires cleanups
- ENABLE_COMET=true ./gradlew -DsparkVersions=${{
matrix.spark-version.short }} -DscalaVersion=${{ matrix.scala-version }}
-DflinkVersions= -DkafkaVersions= \
- :iceberg-spark:iceberg-spark-${{ matrix.spark-version.short }}_${{
matrix.scala-version }}:check \
- :iceberg-spark:iceberg-spark-extensions-${{
matrix.spark-version.short }}_${{ matrix.scala-version }}:check \
- :iceberg-spark:iceberg-spark-runtime-${{
matrix.spark-version.short }}_${{ matrix.scala-version }}:check \
+ ENABLE_COMET=true COMET_PARQUET_SCAN_IMPL=native_iceberg_compat
./gradlew -DsparkVersions=${{ matrix.spark-version.short }} -DscalaVersion=${{
matrix.scala-version }} -DflinkVersions= -DkafkaVersions= \
Review Comment:
I don't think that we want to set `COMET_PARQUET_SCAN_IMPL` here. The
Iceberg integration explicitly calls low level APIS for `native_comet` scan.
```suggestion
ENABLE_COMET=true ./gradlew -DsparkVersions=${{
matrix.spark-version.short }} -DscalaVersion=${{ matrix.scala-version }}
-DflinkVersions= -DkafkaVersions= \
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]