andylam-db commented on PR #44084: URL: https://github.com/apache/spark/pull/44084#issuecomment-1872466384
> Our goal is to guarantee Spark produces the same result as the reference DBMS, why do we want to have one golden file for each DBMS? What if they are different? Which one should Spark follow? That makes sense. I think all DBMS can share the same golden file, if we make the queries simple enough that they are compatible with all DBMS. I wish there was a way to only have one set of golden files, but the current golden files have the Spark schema and hence aren't able to be generated with the DBMS. I don't want to make too many changes to the golden file. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
