John Omernik created DRILL-4460:
-----------------------------------

             Summary: Provide feature that allows fall back to sort aggregation
                 Key: DRILL-4460
                 URL: https://issues.apache.org/jira/browse/DRILL-4460
             Project: Apache Drill
          Issue Type: Improvement
          Components: Execution - Flow
    Affects Versions: 1.5.0
            Reporter: John Omernik


Currently, Drill defaults to a hash-based (in-memory) model for aggregations
(planner.enable_hashagg = true by default).  This works well, but it is memory
dependent, and an out-of-memory condition causes the query to fail.  At that
point, a user can run alter session set `planner.enable_hashagg` = false and
run the query again. If memory is constrained again, the sort-based approach
spills to disk, allowing the query to complete (more slowly).

What I am requesting is a feature, off by default (so Drill's default behavior
stays the same after this feature is added), that would allow a query that
tried hash aggregation and failed due to an out-of-memory condition to restart
with sort aggregation.  Basically, the query would try hash first, then fall
back to sort, so that it succeeds.  This would make for a better user
experience.  Perhaps a warning could be returned to the user so that they
understand the fallback occurred and can choose sort-based aggregation by
default in the future.
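
The requested behavior can be sketched as follows. This is only an
illustration of the control flow, not Drill code: `run_with_fallback`,
`HashAggOutOfMemory`, and the `run_query(sql, hashagg)` callable are all
hypothetical names, and the callable is assumed to execute the query with
`planner.enable_hashagg` set to the given value.

```python
class HashAggOutOfMemory(Exception):
    """Hypothetical stand-in for the out-of-memory failure of a hash aggregation."""


def run_with_fallback(run_query, sql, fallback_enabled=False):
    """Try hash aggregation first; optionally retry with sort aggregation.

    run_query(sql, hashagg) is a hypothetical callable that runs the query
    with planner.enable_hashagg set to `hashagg`.
    Returns a (result, warnings) tuple.
    """
    try:
        # First attempt: hash aggregation, matching the current default.
        return run_query(sql, True), []
    except HashAggOutOfMemory:
        if not fallback_enabled:
            # Feature off (the proposed default): fail exactly as today.
            raise
        # Feature on: restart the same query with sort aggregation and
        # tell the user that the fallback happened.
        warning = ("hash aggregation ran out of memory; "
                   "query was restarted with sort aggregation")
        return run_query(sql, False), [warning]
```

With the flag left at its default of False the failure propagates unchanged,
so existing behavior is preserved; the warning gives the user the hint to
disable hash aggregation up front next time.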



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
