[
https://issues.apache.org/jira/browse/ORC-199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16087926#comment-16087926
]
ASF GitHub Bot commented on ORC-199:
------------------------------------
Github user spasam commented on the issue:
https://github.com/apache/orc/pull/131
I was finally able to test this end to end. I had to tweak CsvReader, adding
custom converters for boolean and timestamp. I can upload the changes in a
separate pull request.
Ideally, the Driver's **json-schema** command line option would be renamed to
**schema**, and the CSV schema would be determined on the fly (assuming there
is a header line with column names). That would be awesome.
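To make the "schema on the fly" idea concrete, here is a rough sketch of what I
have in mind. It is not code from the pull request: the class
`CsvSchemaSketch` and its methods are hypothetical names, it uses only the JDK,
and it assumes a simple comma-separated file with no quoted fields. It takes
column names from the header line, guesses a type per column from a few sample
rows, and builds an ORC-style `struct<...>` type string.

```java
import java.sql.Timestamp;
import java.util.List;
import java.util.StringJoiner;

// Hypothetical helper for illustration only; not part of the ORC code base.
final class CsvSchemaSketch {

  // Guess an ORC type name for one CSV cell, trying the narrowest type first.
  static String guessType(String cell) {
    String v = cell.trim();
    if (v.equalsIgnoreCase("true") || v.equalsIgnoreCase("false")) {
      return "boolean";
    }
    try {
      Long.parseLong(v);
      return "bigint";
    } catch (NumberFormatException ignored) {
      // not an integer, keep trying
    }
    try {
      Double.parseDouble(v);
      return "double";
    } catch (NumberFormatException ignored) {
      // not a floating point number, keep trying
    }
    try {
      // Accepts the JDBC escape format: yyyy-[m]m-[d]d hh:mm:ss[.f...]
      Timestamp.valueOf(v);
      return "timestamp";
    } catch (IllegalArgumentException ignored) {
      // not a timestamp, fall back to string
    }
    return "string";
  }

  // Combine the guesses from two rows for the same column.
  static String widen(String a, String b) {
    if (a.equals(b)) {
      return a;
    }
    boolean bothNumeric = (a.equals("bigint") || a.equals("double"))
        && (b.equals("bigint") || b.equals("double"));
    return bothNumeric ? "double" : "string";
  }

  // Build a struct type string from the header line and some sample rows.
  // Assumption: fields are split on plain commas (no quoting or escaping).
  static String inferSchema(String header, List<String> sampleRows) {
    String[] names = header.split(",", -1);
    String[] types = new String[names.length];
    for (String row : sampleRows) {
      String[] cells = row.split(",", -1);
      for (int i = 0; i < names.length && i < cells.length; i++) {
        String guess = guessType(cells[i]);
        types[i] = (types[i] == null) ? guess : widen(types[i], guess);
      }
    }
    StringJoiner fields = new StringJoiner(",", "struct<", ">");
    for (int i = 0; i < names.length; i++) {
      // Columns with no sample data default to string.
      fields.add(names[i].trim() + ":" + (types[i] == null ? "string" : types[i]));
    }
    return fields.toString();
  }
}
```

Calling `CsvSchemaSketch.inferSchema(headerLine, firstFewRows)` would give the
converter a type string to use when no schema option is passed. A real version
would need a proper CSV parser for the header and samples, and would have to
deal with column names that are not valid ORC field names.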
> Include a CSV to ORC converter
> ------------------------------
>
> Key: ORC-199
> URL: https://issues.apache.org/jira/browse/ORC-199
> Project: ORC
> Issue Type: New Feature
> Reporter: Carter Shanklin
> Assignee: Owen O'Malley
>
> It would be good to have a utility to convert CSV to ORC in a way that doesn't
> require any complex setup.
> To get things started I've created
> https://github.com/cartershanklin/csv-to-orc which uses ORC core and OpenCSV
> (which is Apache licensed).
> If there's interest it might be better to fold this into the ORC project to
> make it easier for users to find.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)