[ 
https://issues.apache.org/jira/browse/CRUNCH-565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14939926#comment-14939926
 ] 

mac champion edited comment on CRUNCH-565 at 10/1/15 3:55 PM:
--------------------------------------------------------------

[~mkwhitacre]
Well, at first I started using it just because that's what I'm comfortable 
with. But later I realized i wasn't completely certain how to manipulate it 
into returning null instead of blank strings. With Mockito that's easy, just 
don't mock anything and the return value will be null.

BUT, If I switch all of these to get(opt,default) I will have to do some extra 
stuff, but I shouldn't have to handle nulls or do anything weird like that. Can 
you take another look here? https://github.com/champgm/crunch/pull/7

Also, sorry about the pull request to apache/crunch. I've forked that and I use 
it play around and create pull requests so I can have a nice place to review 
and comment on the diffs. When the code looks good and it builds, I'll squash, 
create a patch, and attach it to the JIRA. Is that an okay workflow? The 
official one is pretty sparse and doesn't include any kind of review steps: 
https://cwiki.apache.org/confluence/display/CRUNCH/Committer+Workflow





was (Author: champgm):
[~mkwhitacre]
Well, at first I started using it just because that's what I'm comfortable 
with. But later I realized i wasn't completely certain how to manipulate it 
into returning null instead of blank strings. With Mockito that's easy, just 
don't mock anything and the return value will be null.

BUT, If I switch all of these to get(opt,default) I will have to do some extra 
stuff, but I shouldn't have to handle nulls or do anything weird like that. Can 
you take another look here? https://github.com/champgm/crunch/pull/7

Also, sorry about the pull request to apache/crunch. I've forked that and I use 
it play around and create pull requests so I can have a nice place to review 
and comment on the diffs. When the code looks good and it builds, I'll create a 
patch and attach it to the JIRA. Is that an okay workflow? The official one is 
pretty sparse and doesn't include any kind of review steps: 
https://cwiki.apache.org/confluence/display/CRUNCH/Committer+Workflow




> CSVInputFormat needs to be more defensive when configuring itself
> -----------------------------------------------------------------
>
>                 Key: CRUNCH-565
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-565
>             Project: Crunch
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.10.0, 0.8.3
>            Reporter: mac champion
>            Assignee: mac champion
>            Priority: Minor
>              Labels: csv, csvparser
>
> It seems that some behavior has changed somewhere along the line where hadoop 
> Configuration is concerned. It is possible that a call to .get(OPTION) will 
> return null. CSVInputFormat does not handle that case gracefully:
> https://github.com/apache/crunch/blob/apache-crunch-0.10.0/crunch-core/src/main/java/org/apache/crunch/io/text/csv/CSVInputFormat.java#L178-L183
> Some more relevant details can be found in this JIRA:
> https://issues.apache.org/jira/browse/CRUNCH-564?focusedCommentId=14938186&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14938186



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to