[ 
https://issues.apache.org/jira/browse/AVRO-570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13087815#comment-13087815
 ] 

Doug Cutting commented on AVRO-570:
-----------------------------------

Great to see this working!

Some cosmetic comments on the patch:
 - please don't include comments with your name, the date, stack traces, etc.
 - please don't use tabs for indentation, but only spaces, 2 per indent level
 - please check your patch to make sure no whitespace-only changes are made
 - please put a space after // and before {

These guidelines make it much easier to review patches.

For the enum TRANSPROTO:
 - this might better be called simply TRANSPORT or PROTOCOL or maybe 
TRANSPORT_PROTOCOL.
 - if it's public in a public, non-test class then it needs javadoc
 - i'd prefer when values of this are processed, rather than using an if-else 
structure, a switch statement is used with a 'default' clause that throws an 
exception.  this makes it clearer that all values are handled.
 - do we need a NONE value, or should we rather simply default the value to 
HTTP or SASL?

The need for the flush in TetherKeySerialization is mysterious, since that's 
using a DirectBinaryEncoder, which has no buffering.  So the flush only has an 
effect on the underlying InputStream.  If that's required then it may be a bug 
in Hadoop.  Perhaps rather than including the entire stacktrace here, file a 
separate Jira issue about this with the stack trace, then just add an 
end-of-line comment referring to this issue, e.g., 'flush(); // Possible bug, 
see: AVRO-XXX'

We shouldn't reference Jira issue numbers in sources except in comments.  So, 
instead of adding a test-570 target in lang/py/build.xml you might define 
test.include and test.exclude properties near the top of that build.xml, then 
run it with something like 'ant -Dtest.include=...'.

It would be great if someone more fluent in Python could look over the Python 
stuff here.

And it would be great if someone more fluent in Maven could look over the Maven 
stuff here.

> python implementation of mapreduce connector
> --------------------------------------------
>
>                 Key: AVRO-570
>                 URL: https://issues.apache.org/jira/browse/AVRO-570
>             Project: Avro
>          Issue Type: New Feature
>          Components: python
>    Affects Versions: 1.6.0
>            Reporter: Doug Cutting
>            Assignee: Jeremy Lewi
>            Priority: Critical
>              Labels: hadoop
>         Attachments: AVRO-570.patch, AVRO-570.patch, AVRO-570.patch
>
>
> AVRO-512 defines protocols for implementing mapreduce tasks.  It would be 
> good to have a Python implementation of this.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to