furquan1993 opened a new issue, #14259:
URL: https://github.com/apache/hudi/issues/14259
### Bug Description
**What happened:**
When querying a Hudi table that has a complex-typed column (array or map) with
an incremental CDC query, the resulting `after` row contains an encoded value
for the complex field instead of the actual data.
For example, I have a column named `policy_types`, which is an array of strings.
Running a CDC query on the table returns the JSON below, where `policy_types`
is an encoded object (`baseObject`, `baseOffset`, `sizeInBytes`) rather than a list.
**What you expected:**
The actual value of the field, for example a list of strings.
**Steps to reproduce:**
1. Create a table with a column of type array of strings.
2. Insert a few records.
3. Run an incremental CDC query to fetch the before/after records.
4. The `after` image contains an encoded value for the array column:
```
{
  "_hoodie_record_key": "2",
  "_hoodie_partition_path": "",
  "policy_types": {
    "baseObject": "AAAAAAAAAAARAAAAWAAAABUAAABwAAAAAQAAAIgAAAAAAAAAkAAAAEgAAACQAAAAAgAAAAAAAAAMTQAAAAAAAGYAAAAAAAAABwAAANgAAAAgAAAA4AAAADIwMjUxMTEzMTExNzI3MzAzAAAAAAAAADIwMjUxMTEzMTExNzI3MzAzXzBfMQAAADIAAAAAAAAANDEyZjJmOTAtZmM0MS00ZDNjLThlMzMtZTI4N2JkYzUwMTY2LTBfMC0yNi0yOV8yMDI1MTExMzExMTcyNzMwMy5wYXJxdWV0UEVORElORwABAAAAAAAAAAAAAAAAAAAABAAAABgAAABBVVRPAAAAAA==",
    "baseOffset": 240,
    "sizeInBytes": 32
  },
  "_hoodie_file_name": "412f2f90-fc41-4d3c-8e33-e287bdc50166-0_0-26-29_20251113111727303.parquet",
  "_hoodie_commit_seqno": "20251113111727303_0_1",
  "order_id": 2,
  "order_customer_id": 102,
  "order_status": "PENDING",
  "_hoodie_commit_time": "20251113111727303",
  "order_date": 19724
}
```
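For triage: the `baseObject` / `baseOffset` / `sizeInBytes` triple looks like the fields of Spark's `UnsafeArrayData`, serialized as-is instead of being converted to a list. The sketch below is a rough Python check of that guess, not Hudi or Spark code. It assumes a little-endian layout, a 16-byte JVM byte-array header (so `baseOffset` 240 maps to byte 224 of the decoded buffer), and string elements. The helper name `decode_unsafe_string_array` is made up for illustration.

```python
import base64
import struct

# Assumption: typical 64-bit JVM byte[] header size (Spark's Platform.BYTE_ARRAY_OFFSET).
BYTE_ARRAY_OFFSET = 16

# The "baseObject" value from the JSON above, split only for readability.
BASE_OBJECT = (
    "AAAAAAAAAAARAAAAWAAAABUAAABwAAAAAQAAAIgAAAAAAAAAkAAAAEgAAACQAAAAAgAAAAAAAAAM"
    "TQAAAAAAAGYAAAAAAAAABwAAANgAAAAgAAAA4AAAADIwMjUxMTEzMTExNzI3MzAzAAAAAAAAADIw"
    "MjUxMTEzMTExNzI3MzAzXzBfMQAAADIAAAAAAAAANDEyZjJmOTAtZmM0MS00ZDNjLThlMzMtZTI4"
    "N2JkYzUwMTY2LTBfMC0yNi0yOV8yMDI1MTExMzExMTcyNzMwMy5wYXJxdWV0UEVORElORwABAAAA"
    "AAAAAAAAAAAAAAAABAAAABgAAABBVVRPAAAAAA=="
)


def decode_unsafe_string_array(base_object_b64, base_offset, size_in_bytes):
    """Decode an array<string> stored in the assumed UnsafeArrayData layout."""
    buf = base64.b64decode(base_object_b64)
    start = base_offset - BYTE_ARRAY_OFFSET
    arr = buf[start:start + size_in_bytes]

    # Layout: 8-byte element count, null bitmap (one 8-byte word per 64
    # elements), then one 8-byte (offset << 32 | size) word per element.
    (num_elements,) = struct.unpack_from("<q", arr, 0)
    header = 8 + ((num_elements + 63) // 64) * 8

    values = []
    for i in range(num_elements):
        word_index, bit = divmod(i, 64)
        (null_bits,) = struct.unpack_from("<Q", arr, 8 + word_index * 8)
        if (null_bits >> bit) & 1:
            values.append(None)
            continue
        (offset_and_size,) = struct.unpack_from("<q", arr, header + 8 * i)
        offset = offset_and_size >> 32        # relative to the array start
        size = offset_and_size & 0xFFFFFFFF   # length in bytes
        values.append(arr[offset:offset + size].decode("utf-8"))
    return values


print(decode_unsafe_string_array(BASE_OBJECT, 240, 32))  # ['AUTO']
```

Under those assumptions the payload for record key 2 decodes to `['AUTO']`, which suggests the data itself is intact and only the conversion of the array value in the CDC `after` image is missing.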
### Environment
**Hudi version:**
**Query engine:** (Spark/Flink/Trino etc)
**Relevant configs:**
### Logs and Stack Trace
_No response_