MaxGekk opened a new pull request #26189: [SPARK-29533][SQL][TEST] Benchmark 
casting strings to intervals
URL: https://github.com/apache/spark/pull/26189
 
 
   ### What changes were proposed in this pull request?
   Added new benchmark `IntervalBenchmark` to measure performance of interval 
related functions. In the PR, I added benchmarks for casting strings to 
interval. In particular, interval strings with `interval` prefix and without it 
because there is special code for this 
https://github.com/apache/spark/blob/da576a737c2db01e5ba5ce19ed0e8f900cb5efaf/common/unsafe/src/main/java/org/apache/spark/unsafe/types/CalendarInterval.java#L100-L103
 . And also I added benchmarks for different number of units in interval 
strings, for example 1 unit is `interval 10 years`, 2 units w/o interval is `10 
years 5 months`, and etc.
   
   
   ### Why are the changes needed?
   - To find out current performance issues in casting to intervals
   - The benchmark can be used while refactoring/re-implementing 
`CalendarInterval.fromString()` or 
`CalendarInterval.fromCaseInsensitiveString()`.
   
   ### Does this PR introduce any user-facing change?
   No
   
   ### How was this patch tested?
   By running the benchmark via the command:
   ```shell
   SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain 
org.apache.spark.sql.execution.benchmark.IntervalBenchmark"
   ```
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to