cajil commented on issue #6047:
URL: https://github.com/apache/hudi/issues/6047#issuecomment-1177265155
@yihua I have tried with the suggested setting by providing in
propsFilePath. Still running into the same issue.
I am attaching a sample parquet file for reproducing the issue. Only change
being CREATED(timestamp) column instead of CREATEDBY(string) in the schema.
Schema definition
`{
"name": "bootstraptest",
"type": "record",
"fields": [
{
"name": "CREATED",
"type": {
"type": "string",
"logicalType" : "timestamp-micros"
}
},
{
"name": "ID",
"type": {
"type": "bytes",
"logicalType": "decimal",
"precision": 12,
"scale": 0
}
},
{
"name": "CLIENT_ID",
"type": {
"type": "bytes",
"logicalType": "decimal",
"precision": 12,
"scale": 0
}
}
]
}`
Command used:
`hdfsparquetimport --upsert false --srcPath
/Users/user1/hudi-res/parquet_data/ --targetPath
/Users/user1/hudi-res/hudi_converted/ --tableName BOOTSTRAPTEST --tableType
COPY_ON_WRITE --rowKeyField ID --partitionPathField CLIENT_ID --parallelism 50
--schemaFilePath /Users/user1/hudi-res/schema/test.avsc --format parquet
--sparkMemory 2G --retry 1 --sparkMaster local --propsFilePath
/Users/user1/hudi-res/config/hudi-config.properties
Renaming parquet files to .txt for sharing.
[data_0_0_0.snappy.parquet.txt](https://github.com/apache/hudi/files/9061908/data_0_0_0.snappy.parquet.txt)
[data_0_1_0.snappy.parquet.txt](https://github.com/apache/hudi/files/9061909/data_0_1_0.snappy.parquet.txt)
`
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]