[ 
https://issues.apache.org/jira/browse/TAJO-178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyunsik Choi updated TAJO-178:
------------------------------

    Attachment: TAJO-178_3.patch

+1

As I mentioned above, this feature is promising if we optimize this. I've 
uploaded the rebased and cleaned-up patch.

If there is no objection until tomorrow, I'll commit it.

Thank you
                
> Implements StorageManager for scanning asynchronously
> -----------------------------------------------------
>
>                 Key: TAJO-178
>                 URL: https://issues.apache.org/jira/browse/TAJO-178
>             Project: Tajo
>          Issue Type: Improvement
>          Components: storage
>    Affects Versions: 0.2-incubating
>            Reporter: hyoungjunkim
>            Assignee: hyoungjunkim
>         Attachments: TAJO-178_1.path, TAJO-178_3.patch, TAJO-178.patch_2, 
> TAJO-178.path, tajo_storage_manager.png
>
>
> The current StorageManager does not provide scan scheduling function. All 
> scan operations run concurrently. This is the cause of random disk access and 
> disk read performance is not good.
> The proposed StorageManager is based on double buffering. Each disk has a 
> scheduler to schedule by order of scanned adjust. Each Scanner has a 
> InputStream and a Tuple pool. The next() operation of ScanNode is blocked 
> until Tuple pool is filled. Assigned Scanner by the scheduler read data(xMB) 
> and fills Tuple Pool and notifies to next() operation. After scanning Scanner 
> re-enter DiskScanQueue.
> In this way Scanner can pass column vector to Vectorized Query Engine.
> See the attached file.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to