Max Gekk created SPARK-57456:
--------------------------------
Summary: Support nanosecond-precision timestamp types in the JSON
datasource (v1 and v2)
Key: SPARK-57456
URL: https://issues.apache.org/jira/browse/SPARK-57456
Project: Spark
Issue Type: Sub-task
Components: SQL
Affects Versions: 5.0.0
Reporter: Max Gekk
Umbrella: SPARK-56822 (Timestamps with nanosecond precision).
Add read and write support for the nanosecond-capable timestamp types
TIMESTAMP_NTZ(p) and TIMESTAMP_LTZ(p), with p from 7 to 9, so that the JSON
datasource reaches parity with the microsecond TimestampType / TimestampNTZType.
Once read and write are implemented and tested, remove the SPARK-57166 rejection
guardrail (supportDataType / supportsDataType) and update
FileBasedDataSourceSuite accordingly. Tests should cover precisions 7-9 for
both NTZ and LTZ.
Scope:
- Read: JacksonParser nanos cases via TimestampFormatter parseNanos /
parseWithoutTimeZoneNanos with the column precision.
- Write: JacksonGenerator via formatNanos / formatWithoutTimeZoneNanos.
- Schema inference: JsonInferSchema (keep inferring microsecond by default;
nanos only via an explicit user schema).
- Guardrails: JsonFileFormat (v1), JsonTable (v2).
- Note: the legacy formatter policy rejects nanos; document the limitation.
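
The read and write items above amount to a precision-aware round trip of the
fractional seconds through a JSON string. The following is only a rough sketch
of that idea in plain Python, not Spark code: the function names
(parse_ntz_nanos, format_ntz_nanos, round_trip_through_json) are hypothetical,
the value is modelled as nanoseconds since the epoch without a time zone, and
truncating (rather than rounding) digits beyond p is an assumption that the
ticket does not state.

```python
import calendar
import json
import time

NANOS_PER_SECOND = 1_000_000_000
_FMT = "%Y-%m-%dT%H:%M:%S"


def _check_precision(precision: int) -> None:
    # The ticket targets precisions 7 through 9 only.
    if not 7 <= precision <= 9:
        raise ValueError("precision must be 7, 8 or 9")


def parse_ntz_nanos(text: str, precision: int) -> int:
    """Parse 'YYYY-MM-DDTHH:MM:SS[.fraction]' into nanoseconds since the epoch.

    Fractional digits beyond `precision` are truncated (an assumption).
    """
    _check_precision(precision)
    head, _, frac = text.partition(".")
    if frac and not frac.isdigit():
        raise ValueError(f"bad fractional seconds: {frac!r}")
    seconds = calendar.timegm(time.strptime(head, _FMT))
    # Keep at most `precision` digits, then pad to 9 to get nanoseconds.
    nanos = int(frac[:precision].ljust(9, "0"))
    return seconds * NANOS_PER_SECOND + nanos


def format_ntz_nanos(epoch_nanos: int, precision: int) -> str:
    """Format nanoseconds since the epoch with exactly `precision` fractional digits."""
    _check_precision(precision)
    seconds, nanos = divmod(epoch_nanos, NANOS_PER_SECOND)
    head = time.strftime(_FMT, time.gmtime(seconds))
    # Render all 9 digits, then cut the string down to the column precision.
    return f"{head}.{nanos:09d}"[: len(head) + 1 + precision]


def round_trip_through_json(text: str, precision: int) -> int:
    """Write the timestamp as a JSON string field, read it back, and parse it."""
    value = parse_ntz_nanos(text, precision)
    line = json.dumps({"ts": format_ntz_nanos(value, precision)})
    return parse_ntz_nanos(json.loads(line)["ts"], precision)
```

With the input "2024-01-02T03:04:05.123456789", precision 9 keeps all nine
fractional digits, while precision 7 keeps .1234567 and drops the last two.
A microsecond type would keep only .123456, which is the gap this sub-task
closes for JSON.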