[
https://issues.apache.org/jira/browse/PIG-2784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13943165#comment-13943165
]
camelia_c commented on PIG-2784:
--------------------------------
Hello,
My name is Camelia, I'm a PhD Student and I'm interested in working on this
project. I just uploaded the project proposal on Google Melange and I hope that
it is a good starting point for my future work on this contribution.
Best regards,
Camelia
> Framework for dynamic query optimization
> ----------------------------------------
>
> Key: PIG-2784
> URL: https://issues.apache.org/jira/browse/PIG-2784
> Project: Pig
> Issue Type: New Feature
> Reporter: Jie Li
> Labels: GSOC2014
>
> We need a framework to implement dynamic query optimization, i.e. changing
> the query plan at runtime. Currently we support estimating the number of
> reducers dynamically, which works well as the first step but was not
> perfectly implemented. In near future, we'll support more dynamic
> optimization, like [removing sample job for
> order-by|https://issues.apache.org/jira/browse/PIG-483], [removing limit
> job|https://issues.apache.org/jira/browse/PIG-2675], dynamically detecting
> skew and using skew-join, etc.
> Currently estimating #reducer is implemented in JobControlCompiler after
> MRCompiler compiles all the MapReduceOperators and generate the complete
> MRPlan. One place (discussed with Thejas) to implement the framework is at
> the MRCompiler, where the MRPlan'll be generated at batches and adjusted
> dynamically.
> Any comment?
> This is a candidate project for Google summer of code 2014. More information
> about the program can be found at
> https://cwiki.apache.org/confluence/display/PIG/GSoc2014
--
This message was sent by Atlassian JIRA
(v6.2#6252)