[
https://issues.apache.org/jira/browse/OOZIE-2402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15027449#comment-15027449
]
Robert Kanter commented on OOZIE-2402:
--------------------------------------
The overall approach looks good to me. This should be helpful in speeding
things up.
Some minor things:
# Please remove the trailing whitespaces from the 7 lines
# Can you update the docs? Update this section
(http://oozie.apache.org/docs/4.2.0/AG_Install.html#Oozie_Server_Setup) with
the new usage info. You can find the source for this page in AG_Install.twiki
in the code
# In {{concurrentCopyFromLocal}}, I think we should put the call to
{{copyFolderRecursively}} in a try-catch-finally block. For instance, we
should make sure to always call {{threadpool.shutdown()}}.
# It would be nice if {{checkCopyResults}} could print out the exception for
each failure, instead of just one. Otherwise, if there are multiple problems,
the user will have to keep trying after resolving each issue.
# In {{copyFolderRecursively}}, the Streams should be closed in a finally
block, not a catch block. Otherwise, they're only closed if an Exception occurs
# The description for the concurrency should say what the default is (e.g.
"(default=1)")
# Why use Streams instead of {{fs.copyFromLocalFile}}?
# Can you add/update unit tests in {{TestOozieSharelibCLI}}?
> oozie-setup.sh sharelib create takes a long time on large clusters
> ------------------------------------------------------------------
>
> Key: OOZIE-2402
> URL: https://issues.apache.org/jira/browse/OOZIE-2402
> Project: Oozie
> Issue Type: Improvement
> Components: tools
> Affects Versions: 4.2.0
> Reporter: Illya Yalovyy
> Assignee: Illya Yalovyy
> Attachments: OOZIE-2402-1.patch
>
>
> When cluster has 256+ nodes it can take up to 5 minutes to create a sharelib.
> Copy the tarball itself takes only around 10 seconds. It seems like
> performance could be improved by loading files concurrently in many threads.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)