[ 
https://issues.apache.org/jira/browse/SPARK-55444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Max Gekk reassigned SPARK-55444:
--------------------------------

    Assignee: David Milicevic

> Types Framework - Phase 3 - Storage Formats
> -------------------------------------------
>
>                 Key: SPARK-55444
>                 URL: https://issues.apache.org/jira/browse/SPARK-55444
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 4.2.0
>            Reporter: David Milicevic
>            Assignee: David Milicevic
>            Priority: Major
>              Labels: pull-request-available
>
> *Summary:*
> Add storage format support to the Types Framework.
> *Description:*
> Extend the framework to cover storage format integration points (Parquet, 
> ORC, Avro, CSV, JSON, XML, columnar caching).
> *What this includes:*
>  * New interface(s) for storage format operations (schema conversion, 
> read/write support)
>  * Integration in ~24 files (Scala + Java) across Parquet, ORC, Avro, CSV, 
> JSON, XML, and columnar caching, including vectorized Java files 
> ({{OffHeapColumnVector}}, {{OnHeapColumnVector}}, 
> {{ParquetVectorUpdaterFactory}}, {{VectorizedColumnReader}})
> *Design doc:*
> Linked in the parent work item.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

