Lokesh Jain created HUDI-5799:
---------------------------------

             Summary: Fix Spark partition validation in 
TestBulkInsertInternalPartitionerForRows
                 Key: HUDI-5799
                 URL: https://issues.apache.org/jira/browse/HUDI-5799
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Lokesh Jain
            Assignee: Lokesh Jain


This jira fixes the test failure in 
`TestBulkInsertInternalPartitionerForRows#testBulkInsertInternalPartitioner` 
with `GLOBAL_SORT`. The behavior of 
`GlobalSortPartitionerWithRows`(`BulkInsertSortMode.GLOBAL_SORT`) is the same 
as before.  The new assertion on the number of output partitions added in the 
test causes it to fail, because `df.sort` may generate smaller number of 
partitions than the specified parallelism.


{code:java}
[ERROR] testBulkInsertInternalPartitioner{BulkInsertSortMode, boolean, boolean, 
boolean, boolean}[1]  Time elapsed: 0.961 s  <<< FAILURE!
org.opentest4j.AssertionFailedError: expected: <2> but was: <1>
    at org.junit.jupiter.api.AssertionUtils.fail(AssertionUtils.java:55)
    at org.junit.jupiter.api.AssertionUtils.failNotEqual(AssertionUtils.java:62)
    at org.junit.jupiter.api.AssertEquals.assertEquals(AssertEquals.java:150)
    at org.junit.jupiter.api.AssertEquals.assertEquals(AssertEquals.java:145)
    at org.junit.jupiter.api.Assertions.assertEquals(Assertions.java:510)
    at 
org.apache.hudi.execution.bulkinsert.TestBulkInsertInternalPartitionerForRows.testBulkInsertInternalPartitioner(TestBulkInsertInternalPartitionerForRows.java:152)
    at 
org.apache.hudi.execution.bulkinsert.TestBulkInsertInternalPartitionerForRows.testBulkInsertInternalPartitioner(TestBulkInser
 {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to