[ 
https://issues.apache.org/jira/browse/CRUNCH-293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13825634#comment-13825634
 ] 

Ryan Blue commented on CRUNCH-293:
----------------------------------

Micah, Josh just pointed me at this issue, which is well-timed. I just ran into 
the same problem recently and implemented a solution I was going to submit a 
patch for. My problem was that I wanted to override the avro generic classes 
rather than the specific. My implementation is very similar to yours, only I 
changed the reflect factory to a ReaderWriterFactory and added an AvroMode enum 
to handle each case (REFLECT, SPECIFIC, GENERIC). The enum approach brings all 
of the code together like your AvroDataFactory, but keeps the handling for each 
mode separate. Each mode can be individually overridden:
{code:java}
  AvroMode.SPECIFIC.override(specificFactory);
{code}

It also cleans up some of the places that hard-code specific or reflect readers 
to use the right one based on the AvroType:
{code:java}
  AvroMode.forType(atype).configure(bundle)
{code}

This is what makes it possible for me to override the generics correctly. A lot 
of places simply used Reflect because it is the most general... but that causes 
problems if you need to change specific (e.g., use a different ClassLoader) or 
generic.

Let me know what you think of the 
[branch|https://github.com/rdblue/crunch/commit/110614da91dc2b7609520883bb3cdfd40ea68700]
 and whether it might work for your needs. Please ignore all of the unnecessary 
import changes, I still need to clean it up.

> Injection of reader into AvroRecordReader
> -----------------------------------------
>
>                 Key: CRUNCH-293
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-293
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 0.7.0, 0.8.0
>            Reporter: Micah Whitacre
>            Assignee: Micah Whitacre
>         Attachments: CRUNCH-293.patch, CRUNCH-293_v2.patch
>
>
> With CRUNCH-243, I wanted to support injecting custom readers to handle the 
> cases like passivity between Avro Schema.  The changes made however were not 
> complete as we also need to be able to inject a reader into the 
> AvroRecordReader which constructs its own SpecificDatumReader.
> We could create a SpecificDataFactory which emulates the ReflectDataFactory.  
> Or simplify to a single DataFactory which will create either 
> Reflect/Specific/Generic.  Thoughts?



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to