[ 
https://issues.apache.org/jira/browse/CRUNCH-300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13827966#comment-13827966
 ] 

Gabriel Reid commented on CRUNCH-300:
-------------------------------------

I've integrated this one locally, but I'm still working on putting together 
more substantial integration tests for writing from MemPipeline in general. 

I'll wait until CRUNCH-293 is done before going further with it. 

An added topic on writing from MemPipelines in General is that there doesn't 
seem to be a standard way of handling the output file naming and directory 
structure. I think that coming to a standard will mean breaking some some 
external stuff that depends on the current output structure, but I'm thinking 
(hoping) that there aren't many people using it yet. 

> Support reflected Avro record writing from MemPipeline
> ------------------------------------------------------
>
>                 Key: CRUNCH-300
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-300
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>            Reporter: David Whiting
>            Priority: Minor
>         Attachments: 
> 0001-Allow-MemPipeline-to-write-Avro-files-by-reflection.patch
>
>
> MemPipeline doesn't support writing Avro records via reflection. It seems 
> that this was half implemented but never finished, but I needed it to create 
> some test data to run through a cluster MapReduce test. The current 
> implementation correctly reflects the schema, but then uses a 
> GenericDatumWriter to try and write the record, causing a ClassCastException. 
> The correct way would be to get a ReflectDatumWriter from the 
> ReflectDataFactory.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to