[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15420368#comment-15420368 ]
Michael J. Carey commented on ASTERIXDB-1556: --------------------------------------------- Two final thoughts: If we change the data structure to make things "more moveable", that would probably be bad - we would be making the normal case suffer (more bloated) at the expense of something we hope will rarely or never happen. So, we have to keep that in mind. Also, I agree that separate hash tables make sense - it's how I think I'd have done it, too, if I'd been doing it - just because it wouldn't have occurred to me to have a global one. HOWEVER - I worry that that might be a bigger change? (And I'd like to contain this one for now and have everyone working on our issues in "most-pressing DESC" sort order.) > Hash Table used by External hash group-by doesn't conform to the budget. > ------------------------------------------------------------------------ > > Key: ASTERIXDB-1556 > URL: https://issues.apache.org/jira/browse/ASTERIXDB-1556 > Project: Apache AsterixDB > Issue Type: Bug > Reporter: Taewoo Kim > Assignee: Taewoo Kim > Attachments: 2wayjoin.pdf, 2wayjoin.rtf, 2wayjoinplan.rtf, > 3wayjoin.pdf, 3wayjoin.rtf, 3wayjoinplan.rtf > > > When we enable prefix-based fuzzy-join and apply the multi-way fuzzy-join ( > > 2), the system generates an out-of-memory exception. > Since a fuzzy-join is created using 30-40 lines of AQL codes and this AQL is > translated into massive number of operators (more than 200 operators in the > plan for a 3-way fuzzy join), it could generate out-of-memory exception. > /// Update: as the discussion goes, we found that hash table in the external > hash group by doesn't conform to the frame limit. So, an out of memory > exception happens during the execution of an external hash group by operator. -- This message was sent by Atlassian JIRA (v6.3.4#6332)