[ 
https://issues.apache.org/jira/browse/MESOS-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13968922#comment-13968922
 ] 

Bernd Mathiske commented on MESOS-336:
--------------------------------------

I want what is in MESOS-700, too, as an option for the user. But that scheme 
requires the framework to have a checksum ready and where does this checksum 
come from? The framework programmer needs to have access to the 
URI-downloadable resources and generate the checksum ahead of time. In 
contrast, with the above scheme, this extra work is not necessary. Instead, it 
is assumed that each URI is (typically) only ever used once and that subsequent 
downloads would have exactly the same result. This should be a common scenario 
for most frameworks.

Since neither variant is ideal for every scenario that may occur in the wild, I 
suggest to have both, i.e. make the checksum optional as explained in the 
comments to MESOS-700, and ALSO provide caching if no checksum is given. This 
makes provisioning easier if no third party download site is involved.

An important use case for checksums might be upgrading executor software on the 
fly. You can force just that by introducing a fresh checksum. But employing 
this kind of trigger is only strictly necessary when the content of a third 
party URI changes while a framework is running. In all cases where the 
downloaded resources are under the control of the framework user, the framework 
has alternative control mechanisms to deal with upgrades, e.g. using different 
file names.


> Mesos slave should cache executors
> ----------------------------------
>
>                 Key: MESOS-336
>                 URL: https://issues.apache.org/jira/browse/MESOS-336
>             Project: Mesos
>          Issue Type: Improvement
>          Components: slave
>            Reporter: brian wickman
>            Assignee: Bernd Mathiske
>              Labels: newbie
>
> The slave should be smarter about how it handles pulling down executors.  In 
> our environment, executors rarely change but the slave will always pull it 
> down from regardless HDFS.  This puts undue stress on our HDFS clusters, and 
> is not resilient to reduced HDFS availability.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to