Sagar Sumit created HUDI-2322:
---------------------------------
Summary: Only include meta fields to reorder while preparing
dataset for bulk insert
Key: HUDI-2322
URL: https://issues.apache.org/jira/browse/HUDI-2322
Project: Apache Hudi
Issue Type: Bug
Reporter: Sagar Sumit
Assignee: Sagar Sumit
Fix For: 0.9.0
Below filter in `HoodieDatasetBulkInsertHelper` will result in
`_hoodie_is_deleted` to be reordered as well even though it is not part of meta
columns.
{code:java}
List<Column> originalFields =
Arrays.stream(rowsWithMetaCols.schema().fields()).filter(field ->
!field.name().contains("_hoodie_")).map(f -> new
Column(f.name())).collect(Collectors.toList());
List<Column> metaFields =
Arrays.stream(rowsWithMetaCols.schema().fields()).filter(field ->
field.name().contains("_hoodie_")).map(f -> new
Column(f.name())).collect(Collectors.toList());
{code}
The fix is to check only for `HoodieRecord.HOODIE_META_COLUMNS_WITH_OPERATION`.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)