Hi Andrew, here's what I found. Maybe it will be relevant for people with
the same issue:
1) There are 3 types of local resources in YARN (public, private,
application). More about it here:
http://hortonworks.com/blog/management-of-application-dependencies-in-yarn/
2) The Spark cache is an APPLICATION-type resource.
3) Currently it's not possible to specify a quota for application
resources (https://issues.apache.org/jira/browse/YARN-882).
4) It's only possible to specify these 2 settings (see the second sketch
near the end of this message for example values):
yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage
- The maximum percentage of disk space utilization allowed, after which a
disk is marked as bad. Values can range from 0.0 to 100.0. If the value
is greater than or equal to 100, the nodemanager will check for a full
disk. This applies to yarn.nodemanager.local-dirs and
yarn.nodemanager.log-dirs.
yarn.nodemanager.disk-health-checker.min-free-space-per-disk-mb - The
minimum space that must be available on a disk for it to be used. This
applies to yarn.nodemanager.local-dirs and yarn.nodemanager.log-dirs.
5) YARN's cache cleanup doesn't clean application resources:
https://github.com/apache/hadoop/blob/8d58512d6e6d9fe93784a9de2af0056bcc316d96/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/ResourceLocalizationService.java#L511
As I understand it, application resources are cleaned up when the Spark
application terminates correctly (using sc.stop(); see the first sketch
below). But in my case, when it filled all the disk space, the application
got stuck and couldn't stop correctly. And after I restarted YARN, I don't
know of an easy way to trigger the cache cleanup other than doing it
manually on all the nodes.
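
Just to illustrate what I mean by terminating correctly, here is a minimal
sketch (the names are made up, it's not my actual job): wrap the work in
try/finally so that sc.stop() runs even when a stage fails, which is what
lets YARN clean up the application's appcache directory afterwards.

    import org.apache.spark.{SparkConf, SparkContext}

    object MlWorkflowSketch {  // placeholder name, not the real job
      def main(args: Array[String]): Unit = {
        // master comes from spark-submit (e.g. --master yarn-cluster)
        val sc = new SparkContext(new SparkConf().setAppName("ml-workflow"))
        try {
          // ... build the pipeline, persist/unpersist intermediate data, fit models ...
        } finally {
          // Always stop the context; after a clean shutdown YARN can remove
          // usercache/<user>/appcache/<applicationId> for this application.
          sc.stop()
        }
      }
    }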
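
And for point 4, a second sketch with illustrative values (my own guesses,
not tuned recommendations). These properties really belong in yarn-site.xml
on every NodeManager; the Configuration object below is only there to show
the property names and value types in a runnable form.

    import org.apache.hadoop.conf.Configuration

    object DiskCheckerSettingsSketch {
      def main(args: Array[String]): Unit = {
        val nmConf = new Configuration()
        // Mark a local/log disk as bad once it is more than 90% full (illustrative value).
        nmConf.setFloat(
          "yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage",
          90.0f)
        // Additionally require at least 10 GB free on every disk used for
        // yarn.nodemanager.local-dirs / yarn.nodemanager.log-dirs (illustrative value).
        nmConf.setLong(
          "yarn.nodemanager.disk-health-checker.min-free-space-per-disk-mb",
          10L * 1024)
        println(nmConf.get("yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage"))
        println(nmConf.get("yarn.nodemanager.disk-health-checker.min-free-space-per-disk-mb"))
      }
    }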
Thanks,
Peter Rudenko
On 2015-07-10 20:07, Andrew Or wrote:
Hi Peter,
AFAIK Spark assumes infinite disk space, so there isn't really a way
to limit how much space it uses. Unfortunately I'm not aware of a
simpler workaround than to simply provision your cluster with more
disk space. By the way, are you sure that it's the disk space that
exceeded the limit, rather than the number of inodes? If it's the latter,
maybe you could control the ulimit of the container.
To answer your other question: if it can't persist to disk then yes, it
will fail. It will only recompute from the data source if for some
reason someone evicted our blocks from memory, but that shouldn't
happen in your case since you're using MEMORY_AND_DISK_SER.
-Andrew
2015-07-10 3:51 GMT-07:00 Peter Rudenko <petro.rude...@gmail.com>:
Hi, I have a Spark ML workflow. It uses some persist calls (roughly like
the sketch below). When I launch it with a 1 TB dataset, it brings down
the whole cluster, because it fills all the disk space at
/yarn/nm/usercache/root/appcache:
http://i.imgur.com/qvRUrOp.png
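
For context, the persist calls look roughly like this. It's a simplified,
self-contained sketch with a placeholder dataset and a local master, not my
actual workflow:

    import org.apache.spark.{SparkConf, SparkContext}
    import org.apache.spark.storage.StorageLevel

    object PersistSketch {  // placeholder, not the real job
      def main(args: Array[String]): Unit = {
        // local[2] only so the sketch runs standalone; the real job runs on YARN.
        val sc = new SparkContext(new SparkConf().setAppName("persist-sketch").setMaster("local[2]"))

        // Placeholder RDD standing in for the 1 TB input.
        val features = sc.parallelize(1 to 1000000).map(i => (i, i.toDouble))

        // Partitions that don't fit in memory are spilled, serialized, to the
        // executors' local dirs (on YARN that is usercache/<user>/appcache/<appId>).
        features.persist(StorageLevel.MEMORY_AND_DISK_SER)

        println(features.count())            // materializes the cached blocks
        features.unpersist(blocking = true)  // frees memory and deletes the spilled files

        sc.stop()
      }
    }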
I found a YARN setting:
yarn.nodemanager.localizer.cache.target-size-mb - Target size of the
localizer cache in MB, per nodemanager. It is a target retention size
that only includes resources with PUBLIC and PRIVATE visibility and
excludes resources with APPLICATION visibility.
But it excludes resources with APPLICATION visibility, and the Spark
cache, as I understand it, is of the APPLICATION type, so it doesn't help.
Is it possible to restrict the disk space available to a Spark
application? And will Spark fail if it isn't able to persist to disk
(StorageLevel.MEMORY_AND_DISK_SER), or will it recompute from the data
source?
Thanks,
Peter Rudenko