[ 
https://issues.apache.org/jira/browse/CRUNCH-502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Josh Wills updated CRUNCH-502:
------------------------------
    Attachment: CRUNCH-502c.patch

Managed to simplify this and make it less awful, if still a bit odd. I verified 
it works on both hadoop1 and hadoop2, will commit it soon.

> OutputFormat has inconsistent context state in interface functions
> ------------------------------------------------------------------
>
>                 Key: CRUNCH-502
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-502
>             Project: Crunch
>          Issue Type: Bug
>          Components: IO
>    Affects Versions: 0.12.0
>            Reporter: Mārtiņš Kalvāns
>            Assignee: Josh Wills
>         Attachments: CRUNCH-502.patch, CRUNCH-502b.patch, CRUNCH-502c.patch
>
>
> I created example project to demonstrate problematic behaviour:
> https://github.com/sisidra/crunch-ofb
> 1. FormatBundle config is not populated to Configuration in checkOutputSpecs:
> https://github.com/sisidra/crunch-ofb/blob/master/src/main/java/com/spotify/crunch/bugreport/MyOutputFormat.java#L39
> {code}
> 15/03/02 15:40:24 INFO bugreport.MyOutputFormat: my.config.key 
> (checkOutputSpecs): null
> 15/03/02 15:40:24 ERROR bugreport.MyOutputFormat: Wrong my.config.key value 
> in checkOutputSpecs!
> {code}
> 2. TaskAttemptContext. getTaskAttemptID().toString() is different in 
> getRecordWriter and getOutputCommitter:
> {code}
> 2015-03-02 15:40:38,960 INFO [main] 
> com.spotify.crunch.bugreport.MyOutputFormat: TaskAttemptID 
> (getOutputCommitter): attempt_1422406067005_0121_m_000000_0
> ...
> 2015-03-02 15:40:39,789 INFO [main] 
> com.spotify.crunch.bugreport.MyOutputFormat: TaskAttemptID (getRecordWriter): 
> attempt_1422406067005_out0_0121_m_000000_0
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to