[
https://issues.apache.org/jira/browse/IMPALA-11886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on IMPALA-11886 started by Ye Zihao.
-----------------------------------------
> Data cache should support asynchronous writes
> ---------------------------------------------
>
> Key: IMPALA-11886
> URL: https://issues.apache.org/jira/browse/IMPALA-11886
> Project: IMPALA
> Issue Type: Improvement
> Affects Versions: Impala 4.3.0
> Reporter: Ye Zihao
> Assignee: Ye Zihao
> Priority: Major
>
> Currently, writes to the data cache are synchronized with hdfs file reads,
> and both are handled by remote hdfs IO threads. In other words, if a cache
> miss occurs, the IO thread needs to take additional responsibility for cache
> writes, which will lead to query performance deterioration in some cases.
> Therefore, the data cache should be able to defer the writes to another
> thread(or thread pool) which writes asynchronously, allowing the IO thread to
> copy the data into the temporary buffer and immediately return it to the
> Scanner. Also need to bound the extra memory consumption for holding the
> temporary buffer though.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]