[jira] [Commented] (PHOENIX-539) Implement parallel scanner that does not spool to disk

Maryann Xue (JIRA) Tue, 15 Jul 2014 14:22:36 -0700

    [ 
https://issues.apache.org/jira/browse/PHOENIX-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14062662#comment-14062662
 ]


Maryann Xue commented on PHOENIX-539:
-------------------------------------

Thanks, [~jamestaylor]! I tried setting the batch size to 2 for HashJoinTest. 
It turned out that the cache close was not a problem, but the 
ChunkedResultIterator still could not work with a hash join scan, for the 
following reason:
      A single row in the left table can produce multiple rows after joining 
the right table (hash cache). If a batch stops in the middle of those 
results, then when the next batch starts, the remaining results produced by 
that same left row are lost.
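
The failure described above can be reproduced with a small toy model. This is 
only a sketch, not Phoenix code: LEFT_ROWS, HASH_CACHE, join_scan and 
chunked_scan are hypothetical names, and it assumes that each new batch 
resumes the scan at the left row following the last left row key seen.

```python
# Toy model only: none of these names are Phoenix APIs.
LEFT_ROWS = ["a", "b", "c"]  # left-table row keys, in scan order
HASH_CACHE = {               # right-side matches per left key (the "hash cache")
    "a": ["x1", "x2", "x3"],
    "b": ["y1"],
    "c": ["z1", "z2"],
}


def join_scan(start_after=None):
    """Yield (left_key, right_value) pairs for left rows after start_after."""
    for key in LEFT_ROWS:
        if start_after is not None and key <= start_after:
            continue
        for value in HASH_CACHE.get(key, []):
            yield key, value


def chunked_scan(batch_size):
    """Fetch batch_size joined rows per scan, resuming after the last left key."""
    results, last_key = [], None
    while True:
        batch = []
        for row in join_scan(start_after=last_key):
            batch.append(row)
            if len(batch) == batch_size:
                break  # batch is full, possibly in the middle of one left row
        if not batch:
            return results
        results.extend(batch)
        # Assumed resume point: the next scan skips this left row entirely,
        # so any joined rows it had not yet emitted are never returned.
        last_key = batch[-1][0]


full = list(join_scan())
chunked = chunked_scan(batch_size=2)
missing = [row for row in full if row not in chunked]
```

With a batch size of 2, missing holds ("a", "x3") and ("c", "z2"): the joined 
rows that were left over when a batch ended partway through a left row.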

> Implement parallel scanner that does not spool to disk
> ------------------------------------------------------
>
>                 Key: PHOENIX-539
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-539
>             Project: Phoenix
>          Issue Type: Task
>            Reporter: James Taylor
>            Assignee: Gabriel Reid
>             Fix For: 5.0.0, 3.1, 4.1
>
>         Attachments: PHOENIX-539.1.patch, PHOENIX-539.patch
>
>
> In scenarios where a LIMIT is not present on a non-aggregate query that will 
> return a lot of results, Phoenix spools the results to disk. This is less 
> than ideal in these situations. @larsh has created a very good and relatively 
> simple queue-based implementation to replace this.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
