Raymond Xu created HUDI-5287:
--------------------------------

             Summary: Make test datagen ensure records in specified partitions
                 Key: HUDI-5287
                 URL: https://issues.apache.org/jira/browse/HUDI-5287
             Project: Apache Hudi
          Issue Type: Improvement
            Reporter: Raymond Xu
             Fix For: 0.13.0


For some test cases we generate 100 or 1000 record to ensure there are records 
in all 3 fixed partitions. This wastes a lot of time in running the tests. UT 
or FT should not need to test against many records. Usually 10-20 records with 
parallelism of 2 will suffice.

We should audit the current datagen usage through all test cases and optimize 
in this aspect.

Tests with large number of records should be moved to long-running integration 
tests. (this is for another story)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to