Hello!
We have Hadoop/HDFS running with Yarn/Spark on worker nodes for
processing jobs that are run on a schedule. We would like to introduce a
queue for long-running Spark streaming jobs that run indefinitely and do
not exit, without interfering with the scheduled jobs or
Hadoop/HBase/HDFS. We currently limit Yarn to 11 CPUs and want to bump
it up to 14 CPUs to handle this additional queue. Is this a sensible
thing to do on the workers themselves? From a bit of profiling, it seems
the non-Yarn/Spark processes don't require much CPU, but is there a
recommended resource allocation for Hadoop/HBase/HDFS that I can
reference? Each worker has 24 CPUs, 125 GB RAM, and 8 disks.
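
For reference, here is roughly the change I have in mind (just a
sketch; the "streaming" queue name and the 80/20 capacity split are
placeholders, and this assumes we stay on the CapacityScheduler):

  <!-- yarn-site.xml: raise NodeManager vcores from 11 to 14 -->
  <property>
    <name>yarn.nodemanager.resource.cpu-vcores</name>
    <value>14</value>
  </property>

  <!-- capacity-scheduler.xml: add a queue for the streaming jobs -->
  <property>
    <name>yarn.scheduler.capacity.root.queues</name>
    <value>default,streaming</value>
  </property>
  <property>
    <name>yarn.scheduler.capacity.root.default.capacity</name>
    <value>80</value>
  </property>
  <property>
    <name>yarn.scheduler.capacity.root.streaming.capacity</name>
    <value>20</value>
  </property>
  <property>
    <!-- hard cap so streaming jobs can't starve the scheduled jobs -->
    <name>yarn.scheduler.capacity.root.streaming.maximum-capacity</name>
    <value>20</value>
  </property>

The maximum-capacity cap is there so the long-running streaming jobs
cannot spill over and starve the scheduled batch jobs.
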
Thanks!