[ https://issues.apache.org/jira/browse/PHOENIX-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14062662#comment-14062662 ]
Maryann Xue commented on PHOENIX-539:
-------------------------------------

Thanks, [~jamestaylor]! I tried setting the batch size to 2 for HashJoinTest. It turned out that the cache close was not a problem, but the ChunkedResultIterator still cannot work with a hash join scan. The reason is that a single row in the left table can produce multiple rows after joining with the right table (the hash cache). If a batch stops in the middle of such a group of results, then when the next batch starts, the remaining rows of that group (produced by the last left-table row of the previous batch) are lost.

> Implement parallel scanner that does not spool to disk
> ------------------------------------------------------
>
>                 Key: PHOENIX-539
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-539
>             Project: Phoenix
>          Issue Type: Task
>            Reporter: James Taylor
>            Assignee: Gabriel Reid
>             Fix For: 5.0.0, 3.1, 4.1
>
>         Attachments: PHOENIX-539.1.patch, PHOENIX-539.patch
>
>
> In scenarios where a LIMIT is not present on a non aggregate query that will
> return a lot of results, Phoenix spools the results to disk. This is less
> than ideal in these situations. @larsh has created a very good and relatively
> simple implementation that is queue based to replace this.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
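
The failure mode described in the comment can be illustrated with a small stand-alone simulation. This is only a sketch and not Phoenix code: the table contents, `hash_join_scan`, `chunked`, and `unchunked` are hypothetical names, and it assumes that each new batch resumes the scan just after the last left-table row key seen by the previous batch.

```python
from itertools import islice

# Hypothetical data: left-table rows (key, value) and a hash cache
# built from the right table, mapping a join key to its matching rows.
LEFT_ROWS = [(1, "a"), (2, "b"), (3, "c")]
HASH_CACHE = {1: ["x"], 2: ["p", "q", "r"], 3: ["y"]}


def hash_join_scan(start_after=None):
    """Yield joined rows for left rows whose key is greater than start_after."""
    for key, value in LEFT_ROWS:
        if start_after is not None and key <= start_after:
            continue
        # One left row can expand into several joined rows.
        for right in HASH_CACHE.get(key, []):
            yield (key, value, right)


def unchunked():
    """Read the whole join result in a single scan."""
    return list(hash_join_scan())


def chunked(batch_size):
    """Read batch_size joined rows per scan, resuming after the last left key seen."""
    results = []
    last_key = None
    while True:
        batch = list(islice(hash_join_scan(start_after=last_key), batch_size))
        if not batch:
            return results
        results.extend(batch)
        # Assumption: the next scan restarts after this left-table key, so any
        # joined rows for this key that did not fit in the batch are skipped.
        last_key = batch[-1][0]
```

With a batch size of 2, as in the HashJoinTest experiment, the first batch ends after the first of the three joined rows for left key 2. The next batch starts after key 2, so `chunked(2)` returns three rows while `unchunked()` returns five.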