[
https://issues.apache.org/jira/browse/HUDI-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-2322:
---------------------------------
Labels: pull-request-available (was: )
> Only include meta fields to reorder while preparing dataset for bulk insert
> ---------------------------------------------------------------------------
>
> Key: HUDI-2322
> URL: https://issues.apache.org/jira/browse/HUDI-2322
> Project: Apache Hudi
> Issue Type: Bug
> Reporter: Sagar Sumit
> Assignee: Sagar Sumit
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.9.0
>
>
> Below filter in `HoodieDatasetBulkInsertHelper` will result in
> `_hoodie_is_deleted` to be reordered as well even though it is not part of
> meta columns.
> {code:java}
> List<Column> originalFields =
> Arrays.stream(rowsWithMetaCols.schema().fields()).filter(field ->
> !field.name().contains("_hoodie_")).map(f -> new
> Column(f.name())).collect(Collectors.toList());
> List<Column> metaFields =
> Arrays.stream(rowsWithMetaCols.schema().fields()).filter(field ->
> field.name().contains("_hoodie_")).map(f -> new
> Column(f.name())).collect(Collectors.toList());
> {code}
> The fix is to check only for
> `HoodieRecord.HOODIE_META_COLUMNS_WITH_OPERATION`.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)