[
https://issues.apache.org/jira/browse/PIG-5044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
liyunzhang_intel updated PIG-5044:
----------------------------------
Attachment: PIG-5044_4.patch
[~Szita]: I have fixed the issues raised on the review board. Please help
review PIG-5044_4.patch (I also uploaded it to the review board).
> Create SparkCompiler#getSamplingJob in spark mode
> -------------------------------------------------
>
> Key: PIG-5044
> URL: https://issues.apache.org/jira/browse/PIG-5044
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: liyunzhang_intel
> Assignee: liyunzhang_intel
> Fix For: spark-branch
>
> Attachments: PIG-5044_2.patch, PIG-5044_3.patch, PIG-5044_4.patch
>
>
> Like MRCompiler#getSamplingJob, spark mode needs a function that samples
> data from a file, sorts the sampled data, and generates output with the
> UDF org.apache.pig.impl.builtin.FindQuantiles.
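The three steps in the description (sample, sort, derive quantiles) can be
illustrated with a minimal standalone sketch. This is not the code in the
patch and not how FindQuantiles is implemented: the class QuantileSketch and
its methods sample and quantileBoundaries are hypothetical names, and plain
integers stand in for Pig tuples.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Hypothetical illustration only; none of these names exist in Pig.
public class QuantileSketch {

    /** Step 1: keep each record with probability `rate` (Bernoulli sampling). */
    public static List<Integer> sample(List<Integer> records, double rate, long seed) {
        Random rng = new Random(seed);
        List<Integer> out = new ArrayList<>();
        for (Integer r : records) {
            if (rng.nextDouble() < rate) {
                out.add(r);
            }
        }
        return out;
    }

    /**
     * Steps 2 and 3: sort the sample, then pick numPartitions - 1 evenly
     * spaced cut points that can serve as range-partition boundaries.
     */
    public static List<Integer> quantileBoundaries(List<Integer> sample, int numPartitions) {
        List<Integer> sorted = new ArrayList<>(sample);
        Collections.sort(sorted);
        List<Integer> boundaries = new ArrayList<>();
        if (sorted.isEmpty()) {
            return boundaries; // nothing sampled, so no boundaries
        }
        for (int i = 1; i < numPartitions; i++) {
            int idx = (int) ((long) i * sorted.size() / numPartitions);
            boundaries.add(sorted.get(Math.min(idx, sorted.size() - 1)));
        }
        return boundaries;
    }

    public static void main(String[] args) {
        List<Integer> records = new ArrayList<>();
        for (int i = 0; i < 1000; i++) {
            records.add((i * 37) % 1000); // synthetic, unsorted input
        }
        List<Integer> sampled = sample(records, 0.1, 42L);
        System.out.println(quantileBoundaries(sampled, 4));
    }
}
```

In an order-by plan, boundaries of this kind let a range partitioner spread
keys roughly evenly across reducers without sorting the full input first.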
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)