[
https://issues.apache.org/jira/browse/TAJO-5?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jihoon Son resolved TAJO-5.
---------------------------
Resolution: Resolved
Fix Version/s: 0.11.1
Already fixed in TAJO-1950.
> Cache mechanism to keep instances of opened BSTIndexs in PullServerService
> --------------------------------------------------------------------------
>
> Key: TAJO-5
> URL: https://issues.apache.org/jira/browse/TAJO-5
> Project: Tajo
> Issue Type: Improvement
> Components: Data Shuffle
> Reporter: Hyunsik Choi
> Labels: newbie
> Fix For: 0.11.1
>
>
> PullServerAuxService is an auxiliary service of Yarn to repartition
> intermediate data. It is similar to ShuffleHandler of MRv2.
> PullServerAuxService supports hash repartition as well as range repartition.
> It works through netty-based HTTP web server.
> For retrieval of range partition data, PullServerAuxService uses a binary
> search tree (BSTIndex.java). For each request of range partitioned data, it
> opens BSTIndex every time. It may cause overheads. See messageReceived in
> PullServer and getFileChunks in PullServerAuxService.
> If PullServerAuxService uses some cache mechanism that keeps instances of
> opened BSTIndex and data files, it could get rid of this overhead.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)