[
https://issues.apache.org/jira/browse/HIVE-24328?focusedWorklogId=509542&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-509542
]
ASF GitHub Bot logged work on HIVE-24328:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 10/Nov/20 06:48
Start Date: 10/Nov/20 06:48
Worklog Time Spent: 10m
Work Description: aasha commented on a change in pull request #1648:
URL: https://github.com/apache/hive/pull/1648#discussion_r520328435
##########
File path: ql/src/java/org/apache/hadoop/hive/ql/parse/repl/CopyUtils.java
##########
@@ -82,27 +87,31 @@ public void copyAndVerify(Path destRoot,
List<ReplChangeManager.FileInfo> srcFil
}
FileSystem sourceFs = srcFiles.get(0).getSrcFs();
boolean useRegularCopy = regularCopy(sourceFs, srcFiles);
+ ExecutorService executorService = null;
try {
if (useRegularCopy || readSrcAsFilesList) {
+ executorService = Executors.newFixedThreadPool(maxParallelCopyTask);
Review comment:
Distcp supports running distributed copy for blocks of the same file in
parallel. Customers can use this option.
-blocksperchunk <blocksperchunk> | Number of blocks per chunk. When
specified, split files into chunks to copy in parallel | If set to a positive
value, files with more blocks than this value will be split into chunks of
<blocksperchunk> blocks to be transferred in parallel, and reassembled on the
destination. By default, <blocksperchunk> is 0 and the files will be
transmitted in their entirety without splitting. This switch is only applicable
when the source file system implements getBlockLocations method and the target
file system implements concat method.
-- | -- | --
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 509542)
Time Spent: 2.5h (was: 2h 20m)
> Run distcp in parallel for all file entries in repl load.
> ---------------------------------------------------------
>
> Key: HIVE-24328
> URL: https://issues.apache.org/jira/browse/HIVE-24328
> Project: Hive
> Issue Type: Task
> Reporter: Aasha Medhi
> Assignee: Aasha Medhi
> Priority: Major
> Labels: pull-request-available
> Attachments: HIVE-24328.01.patch
>
> Time Spent: 2.5h
> Remaining Estimate: 0h
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)