[
https://issues.apache.org/jira/browse/HUDI-7833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sivabalan narayanan updated HUDI-7833:
--------------------------------------
Fix Version/s: 1.0.2
> Validate that fg reader works with nested column as record key
> --------------------------------------------------------------
>
> Key: HUDI-7833
> URL: https://issues.apache.org/jira/browse/HUDI-7833
> Project: Apache Hudi
> Issue Type: Sub-task
> Reporter: Jonathan Vexler
> Assignee: Lin Liu
> Priority: Critical
> Labels: pull-request-available
> Fix For: 1.0.1, 1.0.2
>
> Original Estimate: 2h
> Remaining Estimate: 2h
>
> Ensure that fg reader works if the record key is a nested column
>
> Progress:
> Created a PR to reproduce the problem:
> [https://github.com/apache/hudi/pull/12253]
> From the PR, we turn on and off the fg reader, and run write operations
> (insert, update and delete), and read. We want to test if the update and
> delete could succeed on the map typed key column.
> From the test result, we can see that with or without fg reader enabled, the
> test failed for map typed key. We can conclude that nested keys are not
> supported in Hudi so far.
> I did some investigation on the root cause for this specific test:
> in `BuiltinKeyGenerator.combineRecordKeyInternal`, the `UnsafeMapData`
> object's hash is returned, without caring the content of the map object.
> to fix it, we need to create a until function that deserialize these map
> objects, and generate the hash based on their content.
> We should also do this for all other nested data types.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)