[
https://issues.apache.org/jira/browse/DRILL-6164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16366593#comment-16366593
]
ASF GitHub Bot commented on DRILL-6164:
---------------------------------------
GitHub user vrozov opened a pull request:
https://github.com/apache/drill/pull/1122
DRILL-6164: Heap memory leak during parquet scan and OOM
@parthchandra Please take a look
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/vrozov/drill DRILL-6164
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/drill/pull/1122.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1122
----
commit ec6b6ae017fd22ab6cfe539430053c7caf5174c0
Author: Vlad Rozov <vrozov@...>
Date: 2018-02-16T03:25:21Z
DRILL-6164: Heap memory leak during parquet scan and OOM
----
> Heap memory leak during parquet scan and OOM
> --------------------------------------------
>
> Key: DRILL-6164
> URL: https://issues.apache.org/jira/browse/DRILL-6164
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Vlad Rozov
> Assignee: Vlad Rozov
> Priority: Major
>
> During a scan of a large set of parquet files, Drill iterates over the set
> initializing parquet readers. Such initialization may require a significant
> memory usage (both heap and direct). When scan moves to the next parquet file
> in the set, it does not remove reference to the reader from the set it
> iterates over and does not remove references created during initialization.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)