[
https://issues.apache.org/jira/browse/PIG-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13637830#comment-13637830
]
Gianmarco De Francisci Morales commented on PIG-3225:
-----------------------------------------------------
Hi Saiph,
I am happy to see interest in this project idea.
This idea should be combined with the other sampling projects in Pig as shown
in https://cwiki.apache.org/confluence/display/PIG/GSoc2013 to prepare a GSoC
project proposal.
In my view, reservoir and bootstrap sampling are the easiest, while stratified
sampling might be more complicated.
> Stratified sampling
> -------------------
>
> Key: PIG-3225
> URL: https://issues.apache.org/jira/browse/PIG-3225
> Project: Pig
> Issue Type: New Feature
> Reporter: Gianmarco De Francisci Morales
> Labels: gsoc2013
>
> Implement a stratified sampling option (
> http://en.wikipedia.org/wiki/Stratified_sampling ) in Pig's SAMPLE operator.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira