Poisson Sample Loader should compute the number of samples required only once
-----------------------------------------------------------------------------

                 Key: PIG-1143
                 URL: https://issues.apache.org/jira/browse/PIG-1143
             Project: Pig
          Issue Type: Bug
            Reporter: Sriranjan Manjunath
            Assignee: Sriranjan Manjunath


The current Poisson sampler forces every map task to compute the number of 
samples itself. This is redundant and causes problems when a large directory 
is specified in the join. The sampler should be changed to calculate the 
sample count only once and to share that value with the remaining mappers.
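
The proposed change can be sketched in plain Java as follows. This is only an
illustration of the "compute once, then share" idea, not Pig's actual code:
the class SampleCountSketch, the key "sketch.poisson.sample.count", the
placeholder formula in computeSampleCount, and the use of a Map as a stand-in
for the job configuration are all assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;

public class SampleCountSketch {
    // Hypothetical configuration key; not an actual Pig property name.
    static final String SAMPLE_COUNT_KEY = "sketch.poisson.sample.count";

    // Counts how often the expensive computation runs (illustration only).
    static int computeCalls = 0;

    // Placeholder for the expensive step. The real sampler would need to
    // inspect the input (e.g. a large directory); this formula is assumed.
    static long computeSampleCount(long totalInputBytes, long bytesPerSample) {
        computeCalls++;
        return Math.max(1L, totalInputBytes / bytesPerSample);
    }

    // Job setup side: compute the sample count once and store it in the
    // shared configuration (a Map stands in for the job configuration).
    static void prepareJob(Map<String, String> conf,
                           long totalInputBytes, long bytesPerSample) {
        long count = computeSampleCount(totalInputBytes, bytesPerSample);
        conf.put(SAMPLE_COUNT_KEY, Long.toString(count));
    }

    // Mapper side: read the precomputed value instead of recomputing it.
    static long sampleCountForMapper(Map<String, String> conf) {
        return Long.parseLong(conf.get(SAMPLE_COUNT_KEY));
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        prepareJob(conf, 1_000_000L, 1_000L);
        for (int mapper = 0; mapper < 4; mapper++) {
            System.out.println("mapper " + mapper + " uses sample count "
                    + sampleCountForMapper(conf));
        }
        System.out.println("computeSampleCount calls: " + computeCalls);
    }
}
```

In this sketch the expensive computation runs once during job setup, however
many mappers later read the shared value.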
