brijrajk opened a new pull request, #56469:
URL: https://github.com/apache/spark/pull/56469

   ### What changes were proposed in this pull request?
   
   Added the missing `singleVariantColumn` option to the Data Source Options 
tables in:
   - `docs/sql-data-sources-csv.md`
   - `docs/sql-data-sources-json.md`
   - `docs/sql-data-sources-xml.md`
   
   The option was introduced in Spark 4.1.0 (SPARK-51298 added it for CSV; it is 
also supported for JSON and XML) but was never documented in the reference tables. 
It is defined as a shared constant in `DataSourceOptions.scala` and consumed by 
`CSVOptions`, `JSONOptions`, and `XmlOptions`.
   
   ### Why are the changes needed?
   
   Users have no way to discover `singleVariantColumn` from the official data 
source options reference. The option allows ingesting an entire CSV/JSON/XML 
record as a single `VariantType` column instead of parsing it into individual 
fields — a key use case for the Variant type introduced in Spark 4.0.
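
   For reviewers unfamiliar with the option, here is a minimal sketch of how it might be used from PySpark for the JSON source. The helper name `read_json_as_variant`, the default column name `v`, and any file path passed to it are illustrative assumptions, not part of Spark or of this PR. Only the option key `singleVariantColumn` comes from the description above.

```python
from typing import Any


def read_json_as_variant(spark: Any, path: str, column: str = "v") -> Any:
    """Load each JSON record under `path` into one Variant column.

    `spark` is expected to be a SparkSession from a Spark build that
    supports the option. The helper name and the default column name
    are hypothetical and used only for illustration.
    """
    return (
        spark.read.format("json")
        # The option value is the name of the single output column.
        .option("singleVariantColumn", column)
        .load(path)
    )
```

   With an active session, `read_json_as_variant(spark, "/tmp/events.json")` (a hypothetical path) should return a DataFrame with one column `v`, rather than one column per JSON field.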
   
   ### Does this PR introduce _any_ user-facing change?
   
   No. Documentation only.
   
   ### How was this patch tested?
   
   No code change — documentation only. Verified the option is defined in 
`DataSourceOptions.scala` (line 78), `CSVOptions.scala` (line 338), 
`JSONOptions.scala` (line 215), and `XmlOptions.scala` (line 194).
   
   ### Was this patch authored or co-authored using generative AI tooling?
   
   Generated-by: Claude (Anthropic)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

