Babu Mahesh created SPARK-49763:
-----------------------------------

             Summary: [CSV Reader] Add Flag to Control Inference of Time-Only 
Columns as String or Timestamp During Schema Detection
                 Key: SPARK-49763
                 URL: https://issues.apache.org/jira/browse/SPARK-49763
             Project: Spark
          Issue Type: Task
          Components: SQL
    Affects Versions: 3.5.2
            Reporter: Babu Mahesh


This task adds a configurable flag to the CSV reader that controls how 
time-only columns are inferred during schema detection. By default, Spark infers 
time-only columns as {{Timestamp}}, which can lead to unintended behavior 
in certain use cases. The new flag lets users specify whether 
time-only columns should be inferred as {{Timestamp}} or as {{String}}.
h3. Key Changes:
 * Introduce a flag (e.g., {{inferStringTypeForTimeOnlyColumn}}).
 * When the flag is set to {{true}}, time-only columns will be inferred as 
{{String}}.
 * When the flag is set to {{false}} (default), time-only columns will be 
inferred as {{Timestamp}}.
 * Update documentation to reflect the new option.
 * Ensure backward compatibility by defaulting to the current behavior.
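
The intended semantics of the flag can be sketched in a few lines of plain Python. This is only an illustration of the decision described in the list above, not Spark's inference code: the regular expression used to recognize a time-only value, the function names, and the returned type names are all assumptions made for this example, and the parameter name simply mirrors the flag name suggested above.

```python
import re

# Assumed definition of a "time-only" value: HH:mm, HH:mm:ss, or HH:mm:ss.fraction.
# The actual pattern Spark would use is not specified in this issue.
TIME_ONLY = re.compile(r"^([01]\d|2[0-3]):[0-5]\d(:[0-5]\d(\.\d{1,9})?)?$")


def is_time_only(value):
    """Return True if the string looks like a time with no date part."""
    return bool(TIME_ONLY.match(value.strip()))


def infer_time_only_column(values, infer_string_type_for_time_only_column=False):
    """Return the inferred type name for a time-only column.

    Returns None when the column is empty or contains any value that is
    not time-only, because the flag does not apply in that case.
    """
    non_empty = [v for v in values if v is not None and v.strip() != ""]
    if not non_empty or not all(is_time_only(v) for v in non_empty):
        return None
    # Flag off (default): keep the current behavior and infer a timestamp.
    # Flag on: keep the raw text and infer a string.
    return "string" if infer_string_type_for_time_only_column else "timestamp"
```

With the flag left at its default, a column such as {{09:15:00}}, {{23:59:59}} maps to the timestamp branch, which matches the backward-compatibility requirement; passing {{True}} switches the same column to the string branch.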



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
