[
https://issues.apache.org/jira/browse/ARROW-3968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bhaskar Mookerji updated ARROW-3968:
------------------------------------
Description:
As part of exploring the Arrow C++ implementation, I wrote standalone tool for
streaming Arrow to a file from CSV, using the new CSV parser implementation
from [~pitrou]. I realize that Arrow's emphasis is on in-memory representation,
as opposed to efficient storage to disk, so I'd love to know if this has any
utility for the project. At the very least, it seems like a quick way to get
into exploring the format from a something easily inspectable/familiar (i.e.,
CSV).
In either case, I'm making this issue here as a placeholder for an accompanying
PR on Github. Also, I think this is my first issue for this project, so please
let me know if I should do anything differently.
PR is now available at: https://github.com/apache/arrow/pull/3136
was:
As part of exploring the Arrow C++ implementation, I wrote standalone tool for
streaming Arrow to a file from CSV, using the new CSV parser implementation
from [~pitrou]. I realize that Arrow's emphasis is on in-memory representation,
as opposed to efficient storage to disk, so I'd love to know if this has any
utility for the project. At the very least, it seems like a quick way to get
into exploring the format from a something easily inspectable/familiar (i.e.,
CSV).
In either case, I'm making this issue here as a placeholder for an accompanying
PR on Github. Also, I think this is my first issue for this project, so please
let me know if I should do anything differently.
> Standalone CSV to Arrow Conversion Tool
> ---------------------------------------
>
> Key: ARROW-3968
> URL: https://issues.apache.org/jira/browse/ARROW-3968
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++
> Reporter: Bhaskar Mookerji
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> As part of exploring the Arrow C++ implementation, I wrote standalone tool
> for streaming Arrow to a file from CSV, using the new CSV parser
> implementation from [~pitrou]. I realize that Arrow's emphasis is on
> in-memory representation, as opposed to efficient storage to disk, so I'd
> love to know if this has any utility for the project. At the very least, it
> seems like a quick way to get into exploring the format from a something
> easily inspectable/familiar (i.e., CSV).
> In either case, I'm making this issue here as a placeholder for an
> accompanying PR on Github. Also, I think this is my first issue for this
> project, so please let me know if I should do anything differently.
>
> PR is now available at: https://github.com/apache/arrow/pull/3136
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)