[ 
https://issues.apache.org/jira/browse/PIG-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13637830#comment-13637830
 ] 

Gianmarco De Francisci Morales commented on PIG-3225:
-----------------------------------------------------

Hi Saiph,
I am happy to see interest in this project idea.

To prepare a GSoC project proposal, this idea should be combined with the other 
sampling projects for Pig listed at 
https://cwiki.apache.org/confluence/display/PIG/GSoc2013.

In my view, reservoir and bootstrap sampling are the easiest to implement, while 
stratified sampling might be more complicated.
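
As a rough illustration of the difference in complexity, here is a minimal 
sketch in Python (not Pig code). The function names and the choice of a fixed 
sample size per stratum are assumptions made for the example, not part of 
Pig's SAMPLE operator or of any agreed design. Reservoir sampling needs a 
single fixed-size buffer, while this stratified variant has to keep one buffer 
and one counter per stratum.

```python
import random
from collections import defaultdict


def reservoir_sample(items, k, rng):
    """Uniform sample of up to k items from a stream of unknown length."""
    reservoir = []
    for i, item in enumerate(items):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Item i replaces a random slot with probability k / (i + 1).
            j = rng.randint(0, i)  # randint is inclusive on both ends
            if j < k:
                reservoir[j] = item
    return reservoir


def stratified_sample(records, key, k, rng):
    """Up to k records per stratum, using one reservoir per stratum.

    `key` maps a record to its stratum (hypothetical helper argument).
    """
    reservoirs = defaultdict(list)
    seen = defaultdict(int)  # number of records seen so far in each stratum
    for rec in records:
        stratum = key(rec)
        n = seen[stratum]
        if n < k:
            reservoirs[stratum].append(rec)
        else:
            j = rng.randint(0, n)
            if j < k:
                reservoirs[stratum][j] = rec
        seen[stratum] += 1
    return dict(reservoirs)


if __name__ == "__main__":
    rng = random.Random(42)
    data = [("a", i) for i in range(50)] + [("b", i) for i in range(3)]
    print(reservoir_sample(data, 5, rng))
    print(stratified_sample(data, lambda r: r[0], 4, rng))
```

Even in this small sketch the stratified version carries per-stratum state. In 
a distributed setting that state would presumably have to be grouped or merged 
across tasks, which may be part of what makes the stratified case harder. 
Sampling each stratum in proportion to its size would be another possible 
variant and is not shown here.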
                
> Stratified sampling
> -------------------
>
>                 Key: PIG-3225
>                 URL: https://issues.apache.org/jira/browse/PIG-3225
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Gianmarco De Francisci Morales
>              Labels: gsoc2013
>
> Implement a stratified sampling option ( 
> http://en.wikipedia.org/wiki/Stratified_sampling ) in Pig's SAMPLE operator.

