[ 
https://issues.apache.org/jira/browse/ARROW-3968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bhaskar Mookerji updated ARROW-3968:
------------------------------------
    Description: 
As part of exploring the Arrow C++ implementation, I wrote a standalone tool 
for streaming Arrow to a file from CSV, using the new CSV parser 
implementation from [~pitrou]. I realize that Arrow's emphasis is on in-memory 
representation, as opposed to efficient storage to disk, so I'd love to know 
if this has any utility for the project. At the very least, it seems like a 
quick way to start exploring the format from something easily inspectable and 
familiar (i.e., CSV).

In either case, I'm making this issue here as a placeholder for an accompanying 
PR on GitHub. Also, I think this is my first issue for this project, so please 
let me know if I should do anything differently.

 

PR is now available at: https://github.com/apache/arrow/pull/3136

  was:
As part of exploring the Arrow C++ implementation, I wrote a standalone tool 
for streaming Arrow to a file from CSV, using the new CSV parser 
implementation from [~pitrou]. I realize that Arrow's emphasis is on in-memory 
representation, as opposed to efficient storage to disk, so I'd love to know 
if this has any utility for the project. At the very least, it seems like a 
quick way to start exploring the format from something easily inspectable and 
familiar (i.e., CSV).

In either case, I'm making this issue here as a placeholder for an accompanying 
PR on GitHub. Also, I think this is my first issue for this project, so please 
let me know if I should do anything differently.


> Standalone CSV to Arrow Conversion Tool
> ---------------------------------------
>
>                 Key: ARROW-3968
>                 URL: https://issues.apache.org/jira/browse/ARROW-3968
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Bhaskar Mookerji
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> As part of exploring the Arrow C++ implementation, I wrote a standalone tool 
> for streaming Arrow to a file from CSV, using the new CSV parser 
> implementation from [~pitrou]. I realize that Arrow's emphasis is on 
> in-memory representation, as opposed to efficient storage to disk, so I'd 
> love to know if this has any utility for the project. At the very least, it 
> seems like a quick way to start exploring the format from something easily 
> inspectable and familiar (i.e., CSV).
> In either case, I'm making this issue here as a placeholder for an 
> accompanying PR on GitHub. Also, I think this is my first issue for this 
> project, so please let me know if I should do anything differently.
>  
> PR is now available at: https://github.com/apache/arrow/pull/3136



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
