[
https://issues.apache.org/jira/browse/PIG-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13614617#comment-13614617
]
Gianmarco De Francisci Morales commented on PIG-3225:
-----------------------------------------------------
Hi Dishara,
Happy to see your interest.
While we haven't discussed in detail with the rest of the Committers, my
personal view on this project is that it should be combined with the one on
Bootstrap sampling PIG-3221 to be worth of GSoC.
Regarding the sampling, this part of the project requires designing and
changing the parser to recognize new part of the syntax for the SAMPLE operator
(to specify the strata), and implementing the logical and physical operators
connected to it.
> Stratified sampling
> -------------------
>
> Key: PIG-3225
> URL: https://issues.apache.org/jira/browse/PIG-3225
> Project: Pig
> Issue Type: New Feature
> Reporter: Gianmarco De Francisci Morales
> Labels: gsoc2013
>
> Implement a stratified sampling option (
> http://en.wikipedia.org/wiki/Stratified_sampling ) in Pig's SAMPLE operator.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira