ArnavBalyan opened a new pull request, #3311:
URL: https://github.com/apache/parquet-java/pull/3311
### Rationale for this change
- INT96 fields use a hardcoded "INT96" string as schema name.
- Avro re-uses the same schema reference on reoccurrence, which causes
subsequent values as bare "INT96" instead of the complete definition.
- Move away from the hardcoding and use name/namespace map to prevent
collisions.
Before:
```
{
"type" : "record",
"name" : "int96Schema",
"fields" : [ {
"name" : "timestamp_1",
"type" : [ "null", {
"type" : "fixed",
"name" : "INT96",
"doc" : "INT96 represented as byte[12]",
"size" : 12
} ],
"default" : null
}, {
"name" : "timestamp_2",
"type" : [ "null", "INT96" ],
"default" : null
} ]
}
```
After:
```
{
"type" : "record",
"name" : "int96Schema",
"fields" : [ {
"name" : "timestamp_1",
"type" : [ "null", {
"type" : "fixed",
"name" : "timestamp_1",
"doc" : "INT96 represented as byte[12]",
"size" : 12
} ],
"default" : null
}, {
"name" : "timestamp_2",
"type" : [ "null", {
"type" : "fixed",
"name" : "timestamp_2",
"doc" : "INT96 represented as byte[12]",
"size" : 12
} ],
"default" : null
} ]
}
```
### Are these changes tested?
- Yes
### Are there any user-facing changes?
- Yes
Closes: #2972
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]