[
https://issues.apache.org/jira/browse/HUDI-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-1303:
---------------------------------
Labels: pull-request-available (was: )
> Some improvements for the HUDI Test Suite
> -----------------------------------------
>
> Key: HUDI-1303
> URL: https://issues.apache.org/jira/browse/HUDI-1303
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Prashant Wason
> Assignee: Prashant Wason
> Priority: Minor
> Labels: pull-request-available
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> # Use the DAG node's label from the YAML as its name instead of UUID names,
> which are not descriptive when debugging issues from logs.
> # Fix the CleanNode constructor, which is not correctly implemented.
> # When generating upserts, allow more granular control over the number of
> inserts and upserts: zero or more inserts and upserts can be specified
> instead of always requiring both inserts and upserts.
> # Fix generation of records of a specific size
> ** The current code was using a class variable "shouldAddMore" which was
> reset to false after the first record generation, causing subsequent records
> to be of minimum size.
> ** In this change, we pre-calculate the extra size of the complex fields and
> store it in a map. When generating records, for complex fields we read the
> field size from this map.
> # Refresh the timeline of the DeltaSync service before calling
> readFromSource. This ensures that only the newest generated data is read and
> data generated in the older DAG nodes is ignored (as their Avro files will
> have an older timestamp).
> # Make --workload-generator-classname an optional parameter, since the
> default will most likely be used.
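
The record-size fix described above (replacing the one-shot "shouldAddMore" flag with pre-calculated sizes) can be illustrated with a minimal Java sketch. This is not the code from the pull request. The class name RecordSizeSketch, the methods extraSizePerField and generatedRecordSize, the field names, and the choice to split the shortfall evenly across complex fields are all assumptions made for the example. The point it shows is that a map computed once holds no per-record state, so every generated record gets the same padding.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; names and the even-split policy are assumptions,
// not taken from the Hudi test suite.
class RecordSizeSketch {

    // Pre-calculate how many extra bytes each complex field should carry so
    // that a record grows from minRecordSize to targetRecordSize.
    // The shortfall is split evenly; leftover bytes go to the first fields.
    static Map<String, Integer> extraSizePerField(List<String> complexFields,
                                                  int minRecordSize,
                                                  int targetRecordSize) {
        Map<String, Integer> extra = new LinkedHashMap<>();
        if (complexFields.isEmpty()) {
            return extra;
        }
        int shortfall = Math.max(0, targetRecordSize - minRecordSize);
        int base = shortfall / complexFields.size();
        int remainder = shortfall % complexFields.size();
        for (int i = 0; i < complexFields.size(); i++) {
            extra.put(complexFields.get(i), base + (i < remainder ? 1 : 0));
        }
        return extra;
    }

    // Size of one generated record: the minimum size plus the padding read
    // from the pre-calculated map. No flag is flipped between calls, so the
    // result is identical for the first record and every later one.
    static int generatedRecordSize(Map<String, Integer> extra, int minRecordSize) {
        int size = minRecordSize;
        for (int bytes : extra.values()) {
            size += bytes;
        }
        return size;
    }

    public static void main(String[] args) {
        Map<String, Integer> extra =
            extraSizePerField(List.of("tags", "attributes", "history"), 100, 1024);
        for (int i = 0; i < 3; i++) {
            System.out.println("record " + i + " size = "
                + generatedRecordSize(extra, 100));
        }
    }
}
```

With the values in main, all three records report a size of 1024, whereas a flag reset after the first record would leave the second and third at the 100-byte minimum.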
--
This message was sent by Atlassian Jira
(v8.3.4#803005)