[ https://issues.apache.org/jira/browse/CRUNCH-429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14230442#comment-14230442 ]
Danny Morgan commented on CRUNCH-429:
-------------------------------------

[~mkwhitacre] Moving the FileSystem retrieval outside the loop broke reading from alternative filesystem sources. If the Crunch job is running on a Hadoop cluster and the input paths are on S3, then:
{code:java}
FileSystem fileSystem = FileSystem.get(job.getConfiguration());
{code}
isn't correct, because it returns the cluster's default filesystem rather than the filesystem the input paths belong to.

> The CSVFileSource does not always function properly
> ---------------------------------------------------
>
>                 Key: CRUNCH-429
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-429
>             Project: Crunch
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.8.3
>            Reporter: mac champion
>            Assignee: mac champion
>            Priority: Minor
>              Labels: csv, csvparser
>             Fix For: 0.8.4, 0.11.0
>
>         Attachments: 0001-CRUNCH-429-Fix-CSVInputFormat.patch, CRUNCH-429_a.patch
>
>   Original Estimate: 336h
>  Remaining Estimate: 336h
>
> The "configure" method of CSVInputFormat does not have any effect on its
> configuration and is never called. Instead, the class needs to implement
> Configurable and set its configuration options in an overridden setConf
> method.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
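
The filesystem mismatch described in the comment can be illustrated with a small sketch. It is not the patch for this issue and does not use Hadoop classes: `FsSchemeSketch`, `effectiveScheme`, the `hdfs://namenode:8020` default and both input paths are hypothetical names chosen for illustration. It only shows the idea that each input path should be served by the filesystem named in its own URI, with the cluster default used only when the path has no scheme.

```java
import java.net.URI;

// Hypothetical stand-in for illustration only; it does not use Hadoop classes.
class FsSchemeSketch {

    // Returns the scheme that should serve `path`: the path's own scheme if it
    // has one, otherwise the scheme of the default filesystem.
    static String effectiveScheme(URI path, URI defaultFs) {
        String scheme = path.getScheme();
        return scheme != null ? scheme : defaultFs.getScheme();
    }

    public static void main(String[] args) {
        // Assumed cluster default filesystem (hypothetical host and port).
        URI defaultFs = URI.create("hdfs://namenode:8020");
        URI[] inputs = {
            URI.create("s3://bucket/input/data.csv"),   // hypothetical S3 input
            URI.create("/user/crunch/input/data.csv")   // no scheme: uses the default
        };
        // Resolve per path, inside the loop, instead of once for the whole job.
        for (URI input : inputs) {
            System.out.println(input + " -> " + effectiveScheme(input, defaultFs));
        }
    }
}
```

In Hadoop itself, `Path.getFileSystem(Configuration)` resolves the filesystem from the path's URI in this way, so calling it for each input path inside the loop is one possible way to handle S3 inputs. Whether that matches the eventual fix for this issue is not stated in the comment.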