peter-toth commented on a change in pull request #31682:
URL: https://github.com/apache/spark/pull/31682#discussion_r584847686



##########
File path: sql/catalyst/src/test/scala/org/apache/spark/sql/RowTest.scala
##########
@@ -108,6 +108,17 @@ class RowTest extends AnyFunSpec with Matchers {
     }
   }
 
+  describe("SPARK-34545: rows with different schema are not equal") {

Review comment:
       I've updated the PR description to focus on the correctness issue. I 
think it now clearly describes what is the issue with comparing 
`GenericRowWithSchema` objects.
   
   First I thought that this is a generic issue with `Row` that's why I tried 
to fix `.equals()` there. But that raised other compatibility issues 
(https://github.com/apache/spark/pull/31682#discussion_r584488733).
   
   Then the current PR fixes `.equals()` in `GenericRowWithSchema` only, but 
you can see that this also breaks a lots of UTs as many times in UTs we compare 
`GenericRowWithSchema` to `GenericRow` just to verify its content. 
   
   Are those UTs incorrect and shall we fix them? Or shall I give up this idea 
of fixing `.equals()` and we should just disable `valueCompare` in pyrolite: 
https://github.com/apache/spark/pull/31682#issuecomment-787904759 I've checked 
it, it also fixes the issue.
   
   I will add an end to end test definitely. (Just need to figure out where we 
have python e2e test. Sorry I'm not familiar with them.)




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to