[ https://issues.apache.org/jira/browse/HADOOP-17195?focusedWorklogId=485138&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-485138 ]
ASF GitHub Bot logged work on HADOOP-17195:
-------------------------------------------
Author: ASF GitHub Bot
Created on: 16/Sep/20 13:21
Start Date: 16/Sep/20 13:21
Worklog Time Spent: 10m
Work Description: steveloughran commented on pull request #2294:
URL: https://github.com/apache/hadoop/pull/2294#issuecomment-693400920
Closing this, but leaving up as the PoC to say "we should have a shared
thread pool for lower startup costs"; it would be a switch to buffering on
which will be the way to guarantee an end to OOM problems.
I am happy for the S3A blocks class to be moved to hadoop-common to address
this.
Issue Time Tracking
-------------------
Worklog Id: (was: 485138)
Time Spent: 40m (was: 0.5h)
> Intermittent OutOfMemory error while performing hdfs CopyFromLocal to abfs
> ---------------------------------------------------------------------------
>
> Key: HADOOP-17195
> URL: https://issues.apache.org/jira/browse/HADOOP-17195
> Project: Hadoop Common
> Issue Type: Bug
> Components: fs/azure
> Affects Versions: 3.3.0
> Reporter: Mehakmeet Singh
> Assignee: Bilahari T H
> Priority: Major
> Labels: abfsactive, pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> An OutOfMemory error occurs because a new thread pool is created each time
> an AbfsOutputStream is created. Since the thread pools aren't limited, a lot
> of data is loaded into buffers, which causes the OutOfMemory error.
> Possible fixes:
> - Limit the thread count while performing hdfs copyFromLocal (using the
> -t option).
> - Reduce OUTPUT_BUFFER_SIZE significantly, which would limit the amount of
> data buffered by the threads.
> - Don't create a new thread pool each time an AbfsOutputStream is created,
> and limit the number of thread pools each AbfsOutputStream can create.
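
To illustrate the third fix and the "shared thread pool" idea from the comment, here is a minimal sketch in plain Java. It is not the ABFS or S3A code: the class name SharedUploadPool, its constructor parameters, and the availableSlots() helper are hypothetical, and it assumes each queued upload holds exactly one buffer. One fixed-size pool serves every stream, and a semaphore caps how many buffers can be queued or uploading at once, so writers block instead of filling the heap.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.Semaphore;

/** Hypothetical sketch: one pool shared by all streams, with a cap on in-flight buffers. */
public class SharedUploadPool {
    // A single fixed-size pool for the whole process, instead of one pool per stream.
    private final ExecutorService pool;
    // Each permit stands for one buffer that is queued or currently uploading.
    private final Semaphore bufferPermits;

    public SharedUploadPool(int threads, int maxBuffersInFlight) {
        this.pool = Executors.newFixedThreadPool(threads);
        this.bufferPermits = new Semaphore(maxBuffersInFlight);
    }

    /** Blocks the caller until a buffer slot is free, then queues the upload. */
    public Future<?> submit(Runnable upload) throws InterruptedException {
        bufferPermits.acquire();
        try {
            return pool.submit(() -> {
                try {
                    upload.run();
                } finally {
                    // The buffer can be reused once the upload finishes or fails.
                    bufferPermits.release();
                }
            });
        } catch (RuntimeException e) {
            // The pool rejected the task, so it will never run: give the slot back.
            bufferPermits.release();
            throw e;
        }
    }

    /** Number of buffer slots that are currently free. */
    public int availableSlots() {
        return bufferPermits.availablePermits();
    }

    public void shutdown() {
        pool.shutdown();
    }
}
```

With this shape, peak buffer memory is bounded by roughly maxBuffersInFlight times the buffer size, regardless of how many output streams are open at once.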