Re: [D] [GLUTEN][FLINK] Nexmark q0 performance [incubator-gluten]
GitHub user ParyshevSergey added a comment to the discussion: [GLUTEN][FLINK] Nexmark q0 performance @lgbo-ustc Flink config above, results: Gluten w/ Kafka, 3 of 4 native operators (StreamRecordTimestampInserter is java operator) Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |0 |2995.692 |0.000|33.38 K/s |9.22 E/s | -- metric monitor is off |q0|100,000,000 |1 |2985.706 |2983.679 |33.49 K/s |33.52 K/s | Gluten w/ Kafka, 3 of 3 native operators (StreamRecordTimestampInserter is deleted in code) Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |0.36|525.402 |187.356 |190.33 K/s |533.74 K/s | |Total |100,000,000 |0.357 |525.402 |187.356 |190.33 K/s |533.74 K/s | +--+-++--+-+--+-+ 2026-01-28 16:42:33,100 INFO org.apache.gluten.client.OffloadedJobGraphGenerator [] - OperatorChainSliceGraph: Slice ID: 7, offloadable: false Inputs: [] Outputs: [] Operator Configs: nexmark_q0[3]: Writer(7) 2026-01-28 16:42:33,108 INFO org.apache.gluten.client.OffloadedJobGraphGenerator [] - OperatorChainSliceGraph: Slice ID: 2, offloadable: true Inputs: [] Outputs: [3] Operator Configs: Source: kafka[1](2) Slice ID: 3, offloadable: true Inputs: [2] Outputs: [5] Operator Configs: gluten-calc(3) Slice ID: 5, offloadable: true Inputs: [3] Outputs: [] Operator Configs: Writer(5) Flink 1.20 Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |0.88|275.188 |242.630 |363.39 K/s |412.15 K/s | |Total |100,000,000 |0.882 |275.188 |242.630 |363.39 K/s |412.15 K/s | +--+-++--+-+--+-+ GitHub link: https://github.com/apache/incubator-gluten/discussions/11508#discussioncomment-15638439 This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
Re: [D] [GLUTEN][FLINK] Nexmark q0 performance [incubator-gluten]
GitHub user lgbo-ustc edited a comment on the discussion: [GLUTEN][FLINK] Nexmark q0 performance @ParyshevSergey Can you provide more details about the test settings. It is expected that the new version of the code may experience a certain degree of performance regression, with a potential decrease of around 10%. Currently, our priority is the improvement of functionalities, and performance optimization in this aspect will be considered later. However, as you mentioned, we have not encountered such significant slowdowns before. We have re-run the stress tests, and the results are as follows (without `StreamRecordTimestampInserter ` which is not supported at present). datagen source # gluten ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.01|40.124|40.460 |2.49 M/s |2.47 M/s | |Total |100,000,000 |1.008 |40.124|40.460 |2.49 M/s |2.47 M/s | +--+-++--+-+--+-+ ``` # flink ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.03|115.690 |119.576 |864.38 K/s |836.29 K/s | |Total |100,000,000 |1.034 |115.690 |119.576 |864.38 K/s |836.29 K/s | +--+-++--+-+--+-+ ``` kafka source # gluten https://github.com/user-attachments/assets/0198a4e3-3c6f-4685-a77d-9d66c54c5cc9"; /> # flink https://github.com/user-attachments/assets/e5ee37ef-6cc3-44ab-b63d-74f225f27f24"; /> GitHub link: https://github.com/apache/incubator-gluten/discussions/11508#discussioncomment-15638130 This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
Re: [D] [GLUTEN][FLINK] Nexmark q0 performance [incubator-gluten]
GitHub user lgbo-ustc edited a comment on the discussion: [GLUTEN][FLINK] Nexmark q0 performance @ParyshevSergey Can you provide more details about the test settings. It is expected that the new version of the code may experience a certain degree of performance regression, with a potential decrease of around 10%. Currently, our priority is the improvement of functionalities, and performance optimization in this aspect will be considered later. However, as you mentioned, we have not encountered such significant slowdowns before. We have re-run the stress tests, and the results are as follows (remove `StreamRecordTimestampInserter `). datagen source # gluten ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.01|40.124|40.460 |2.49 M/s |2.47 M/s | |Total |100,000,000 |1.008 |40.124|40.460 |2.49 M/s |2.47 M/s | +--+-++--+-+--+-+ ``` # flink ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.03|115.690 |119.576 |864.38 K/s |836.29 K/s | |Total |100,000,000 |1.034 |115.690 |119.576 |864.38 K/s |836.29 K/s | +--+-++--+-+--+-+ ``` kafka source # gluten https://github.com/user-attachments/assets/0198a4e3-3c6f-4685-a77d-9d66c54c5cc9"; /> # flink https://github.com/user-attachments/assets/e5ee37ef-6cc3-44ab-b63d-74f225f27f24"; /> GitHub link: https://github.com/apache/incubator-gluten/discussions/11508#discussioncomment-15638130 This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
Re: [D] [GLUTEN][FLINK] Nexmark q0 performance [incubator-gluten]
GitHub user lgbo-ustc edited a comment on the discussion: [GLUTEN][FLINK] Nexmark q0 performance @ParyshevSergey Can you provide more detail about the test settings. It is expected that the new version of the code may experience a certain degree of performance regression, with a potential decrease of around 10%. Currently, our priority is the improvement of functionalities, and performance optimization in this aspect will be considered later. However, as you mentioned, we have not encountered such significant slowdowns before. We have re-run the stress tests, and the results are as follows (remove `StreamRecordTimestampInserter `). datagen source # gluten ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.01|40.124|40.460 |2.49 M/s |2.47 M/s | |Total |100,000,000 |1.008 |40.124|40.460 |2.49 M/s |2.47 M/s | +--+-++--+-+--+-+ ``` # flink ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.03|115.690 |119.576 |864.38 K/s |836.29 K/s | |Total |100,000,000 |1.034 |115.690 |119.576 |864.38 K/s |836.29 K/s | +--+-++--+-+--+-+ ``` kafka source # gluten https://github.com/user-attachments/assets/0198a4e3-3c6f-4685-a77d-9d66c54c5cc9"; /> # flink https://github.com/user-attachments/assets/e5ee37ef-6cc3-44ab-b63d-74f225f27f24"; /> GitHub link: https://github.com/apache/incubator-gluten/discussions/11508#discussioncomment-15638130 This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
Re: [D] [GLUTEN][FLINK] Nexmark q0 performance [incubator-gluten]
GitHub user lgbo-ustc added a comment to the discussion: [GLUTEN][FLINK] Nexmark q0 performance @ParyshevSergey Can you provide more detail about the test settings. It is expected that the new version of the code may experience a certain degree of performance regression, with a potential decrease of around 10%. Currently, our priority is the improvement of functionalities, and performance optimization in this aspect will be considered later. However, as you mentioned, we have not encountered such significant slowdowns before. We have re-run the stress tests, and the results are as follows (remove `StreamRecordTimestampInserter `). datagen source # gluten ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.01|40.124|40.460 |2.49 M/s |2.47 M/s | |Total |100,000,000 |1.008 |40.124|40.460 |2.49 M/s |2.47 M/s | +--+-++--+-+--+-+ ``` # flink ``` Nexmark Results +--+-++--+-+--+-+ | Query| Events Num | Cores | Time(s) | Cores * Time(s) | Throughput | Throughput/Cores| +--+-++--+-+--+-+ |q0|100,000,000 |1.03|115.690 |119.576 |864.38 K/s |836.29 K/s | |Total |100,000,000 |1.034 |115.690 |119.576 |864.38 K/s |836.29 K/s | +--+-++--+-+--+-+ ``` datagen source # gluten https://github.com/user-attachments/assets/0198a4e3-3c6f-4685-a77d-9d66c54c5cc9"; /> # flink https://github.com/user-attachments/assets/e5ee37ef-6cc3-44ab-b63d-74f225f27f24"; /> GitHub link: https://github.com/apache/incubator-gluten/discussions/11508#discussioncomment-15638130 This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
Re: [D] [GLUTEN][FLINK] Nexmark q0 performance [incubator-gluten]
GitHub user KevinyhZou added a comment to the discussion: [GLUTEN][FLINK] Nexmark q0 performance I have reproduced this on my local environment, and it seems the pr: https://github.com/apache/incubator-gluten/pull/11365 leads to the performance regression. @lgbo-ustc could you take a look at this? GitHub link: https://github.com/apache/incubator-gluten/discussions/11508#discussioncomment-15628161 This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
Re: [D] [GLUTEN][FLINK] Nexmark q0 performance [incubator-gluten]
GitHub user ParyshevSergey edited a discussion: [GLUTEN][FLINK] Nexmark q0 performance Hello community, according [flink docs](https://github.com/apache/incubator-gluten/blob/main/gluten-flink/docs/Flink.md) I builded velox4j myself with gluten and run local cluster to test nexmark q0 query. With the latest branch of nexmark datagen not working for me (pure virtual call issue in logs) then I try to use kafka connector with local kafka. And in this case q0 pass after 49min - I noticed that conversion between RowData and RowVector is heavy operation so I removed code in CommonExecSink of StreamRecordTimestampInserter to avoid conversion. So after run the job was (NativeKafka -> gluten_calc -> Writer) as one chained operator - result is 8min. Without gluten same config the job pass after 4min. Why the performance is so terrible? @shuai-xu @KevinyhZou What do you think? flink config.yaml ``` taskmanager.memory.process.size: 4G jobmanager.rpc.address: localhost jobmanager.rpc.port: 6123 jobmanager.memory.process.size: 4G taskmanager.numberOfTaskSlots: 1 parallelism.default: 1 io.tmp.dirs: /tmp/flink/tmp/ jobmanager.bind-host: 0.0.0.0 rest.bind-host: 0.0.0.0 #== # JVM #== # JVM options for GC env.java.opts: -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:NewRatio=3 -XX:ConcGCThreads=4 --add-opens=java.base/java.util=ALL-UNNAMED --add-opens=java.base/java.nio=org.apache.arrow.memory.core,ALL-UNNAMED env.java.opts.jobmanager: -Xloggc:$FLINK_LOG_DIR/jobmanager-gc.log -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=2 -XX:GCLogFileSize=512M --add-opens=java.base/java.util=ALL-UNNAMED --add-opens=java.base/java.nio=org.apache.arrow.memory.core,ALL-UNNAMED env.java.opts.taskmanager: -Xloggc:$FLINK_LOG_DIR/taskmanager-gc.log -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=2 -XX:GCLogFileSize=512M --add-opens=java.base/java.util=ALL-UNNAMED --add-opens=java.base/java.nio=org.apache.arrow.memory.core,ALL-UNNAMED #== # State & Checkpoint #== state.backend.type: rocksdb # for example, hdfs://benchmark01/checkpoint execution.checkpointing.dir: file:///tmp/flink/checkpoint execution.checkpointing.incremental: true execution.checkpointing.interval: 1800 execution.checkpointing.mode: EXACTLY_ONCE state.backend.local-recovery: true #== # Runtime Others #== # configuration options for adjusting and tuning table programs. #table.exec.mini-batch.enabled: true #table.exec.mini-batch.allow-latency: 2s #table.exec.mini-batch.size: 5 table.optimizer.distinct-agg.split.enabled: true # disable final checkpoint to avoid test waiting for the last checkpoint complete execution.checkpointing.checkpoints-after-tasks-finish.enabled: false execution.checkpointing.externalized-checkpoint-retention: RETAIN_ON_CANCELLATION jobstore.expiration-time: 100 execution.checkpointing.tolerable-failed-checkpoints: 10 ``` GitHub link: https://github.com/apache/incubator-gluten/discussions/11508 This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
