[ 
https://issues.apache.org/jira/browse/HUDI-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-1303:
---------------------------------
    Labels: pull-request-available  (was: )

> Some improvements for the HUDI Test Suite
> -----------------------------------------
>
>                 Key: HUDI-1303
>                 URL: https://issues.apache.org/jira/browse/HUDI-1303
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: Prashant Wason
>            Assignee: Prashant Wason
>            Priority: Minor
>              Labels: pull-request-available
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> # Use the DAG Node's label from the yaml as its name instead of UUID names 
> which are not descriptive when debugging issues from logs.
>  # Fix CleanNode constructor which is not correctly implemented
>  # When generating upsets, allows more granualar control over the number of 
> inserts and upserts - zero or more inserts and upserts can be specified 
> instead of always requiring both inserts and upserts.
>  # Fixed generation of records of specific size
>  ** The current code was using a class variable "shouldAddMore" which was 
> reset to false after the first record generation causing subsequent records 
> to be of minimum size.
>  ** In this change, we pre-calculate the extra size of the complex fields. 
> When generating records, for complex fields we read the field size from this 
> map.
>  # Refresh the timeline of the DeltaSync service before calling 
> readFromSource. This ensures that only the newest generated data is read and 
> data generated in the older Dag Nodes is ignored (as their AVRO files will 
> have an older timestamp).
>  # Making --workload-generator-classname an optional parameter as most 
> probably the default will be used



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to