[ https://issues.apache.org/jira/browse/CRUNCH-632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15824105#comment-15824105 ]
Jim McStanton commented on CRUNCH-632:
--------------------------------------

[~mkwhitacre] absolutely, thanks for the quick response on this. It may be a few days before I have a chance to do so though, hopefully I'll have a chance by Wednesday afternoon.

> Add compression support for CSVFileSource
> -----------------------------------------
>
>                 Key: CRUNCH-632
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-632
>             Project: Crunch
>          Issue Type: Improvement
>            Reporter: Jim McStanton
>            Assignee: Micah Whitacre
>            Priority: Minor
>         Attachments: CRUNCH-632b.patch, CRUNCH-632.patch
>
>
> Currently CSVFileSource does not support decompressing files before reading
> them, and simply opens the file and starts reading the contents:
> https://github.com/apache/crunch/blob/6280983179e9c690af69c2bf0e296b054122d724/crunch-core/src/main/java/org/apache/crunch/io/text/csv/CSVRecordReader.java#L127.
>
> This source would more closely match TextFileSource if this support was
> added. The {{LineRecordReader}} supports this behavior
> [here|http://grepcode.com/file/repo1.maven.org/maven2/org.apache.hadoop/hadoop-mapreduce-client-core/2.7.1/org/apache/hadoop/mapreduce/lib/input/LineRecordReader.java?av=f#87].

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
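
For readers following the quoted issue description, here is a rough sketch of the general idea of "decompress before reading": choose a decompressor from the file name and wrap the raw stream before handing it to the reader. It uses only JDK classes and handles only gzip. The class name `CompressedCsvOpenSketch` and the helpers `openPossiblyCompressed`, `readFirstLine`, and `gzip` are hypothetical names made up for this illustration. This is not the Crunch or Hadoop code and not the contents of the attached patches; Hadoop's `LineRecordReader` resolves a codec from the file path through its own codec factory rather than a hard-coded extension check.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

public class CompressedCsvOpenSketch {

    // Hypothetical helper: pick a decompressor from the file extension.
    // Assumption: only ".gz" is recognized; anything else is read as-is.
    static InputStream openPossiblyCompressed(String fileName, InputStream raw) throws IOException {
        if (fileName.endsWith(".gz")) {
            return new GZIPInputStream(raw);
        }
        return raw;
    }

    // Reads the first line of the (possibly compressed) stream as UTF-8 text.
    static String readFirstLine(String fileName, InputStream raw) throws IOException {
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(openPossiblyCompressed(fileName, raw), StandardCharsets.UTF_8))) {
            return reader.readLine();
        }
    }

    // Test utility: gzip a string in memory.
    static byte[] gzip(String text) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
            out.write(text.getBytes(StandardCharsets.UTF_8));
        }
        return buffer.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        String csv = "id,name\n1,crunch\n";
        byte[] plain = csv.getBytes(StandardCharsets.UTF_8);

        // The same reading code works for both the plain and the gzipped input.
        System.out.println(readFirstLine("data.csv", new ByteArrayInputStream(plain)));
        System.out.println(readFirstLine("data.csv.gz", new ByteArrayInputStream(gzip(csv))));
    }
}
```

Wrapping the stream at open time keeps the record-parsing code unchanged, which is why the description frames this as bringing CSVFileSource in line with TextFileSource rather than as a new parsing feature.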