MaxGekk opened a new pull request #30118:
URL: https://github.com/apache/spark/pull/30118


   ### What changes were proposed in this pull request?
   1. Turn off/on the SQL config 
`spark.sql.legacy.parquet.int96RebaseModeInWrite` which was added by 
https://github.com/apache/spark/pull/30056 in `DateTimeRebaseBenchmark`. The 
parquet readers should infer correct rebasing mode automatically from metadata.
   2. Regenerate benchmark results of `DateTimeRebaseBenchmark` in the 
environment:
   
   | Item | Description |
   | ---- | ----|
   | Region | us-west-2 (Oregon) |
   | Instance | r3.xlarge (spot instance) |
   | AMI | ami-06f2f779464715dc5 
(ubuntu/images/hvm-ssd/ubuntu-bionic-18.04-amd64-server-20190722.1) |
   | Java | OpenJDK8/11 installed by`sudo add-apt-repository ppa:openjdk-r/ppa` 
& `sudo apt install openjdk-11-jdk`|
   
   ### Why are the changes needed?
   To have up-to date info about INT96 performance which is the default type 
for Catalyst's timestamp type.
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   ### How was this patch tested?
   By updating benchmark results:
   ```
   $ SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain 
org.apache.spark.sql.execution.benchmark.DateTimeRebaseBenchmark"
   ```


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to