[
https://issues.apache.org/jira/browse/PIG-20?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12545942
]
Alan Gates commented on PIG-20:
-------------------------------
Mostly looks good. There are a few issues that need addressed:
1) Data in the provided unit tests is already sorted before it is fed to the
sort tests. It should randomized to assure we're really sorting the data and
not merely passing it through.
2) In EvalSpec, the comparator can be instantiated either via a call to
instantiateFunc or getComparator. But in getComparator there is no way to
instantiate the user defined comparator. It depends instead on instantiateFunc
having been called, otherwise it returns the default comparator. This doesn't
seem right. This means there is a possible code path where the user would
provide a comparator but the default one would still get used. If we cannot
guarantee that instantiateFunc has always been called before getComparator,
then getComparator needs to be changed to take a FunctionInstantiator so it can
create the user defined function if necessary. If we can guarantee that
instantiateFunc _should_ always be called before getComparator, then
getComparator should throw a RuntimeException if the member variable comparator
is null.
3) Nitpicky code formatting issues:
a) The pig coding standard is now to use spaces instead of tabs. In the
existing files you were correct to stay with standard of the surrounding code,
but in the new files that were created spaces should be used.
b) Lines shouldn't exceed 80 characters in length when possible.
> Sorting using custom comparison functions
> ------------------------------------------
>
> Key: PIG-20
> URL: https://issues.apache.org/jira/browse/PIG-20
> Project: Pig
> Issue Type: New Feature
> Components: impl
> Reporter: Olga Natkovich
> Attachments: usercompare.patch
>
>
> Currently, onlu string based sorting is supported. Once we have types,
> numeric sort will be supported as well. However, soem users express need for
> custome comparison functions for sort.
> Alan put together a design document for this:
> http://wiki.apache.org/pig/UserDefinedOrdering
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.