Re: Resource Manager UI showing running jobs but no actual jobs running

2019-04-02 Thread Prabhu Josephraj
Hi George, I have seen this issue: the RM UI shows the old job list and the RM process heap usage is high. This is due to a bug fixed by YARN-7163. Can you test with the patch from YARN-7163? Thanks, Prabhu Joseph On Tue, Apr 2, 2019 at 4:59 AM George Liaw wrote: > Hi all, > > Using

Re: Resource Manager UI showing running jobs but no actual jobs running

2019-04-02 Thread Prabhu Josephraj
're the same issue? > > > On Tue, Apr 2, 2019, 2:15 AM Prabhu Josephraj > wrote: > >> Hi George, >> >> Have seen this issue - RM UI will show the old job list and the RM >> process heap usage will be high. This is due to a Bug fixed by YARN-7163. >> Can you

Re: Spark pools support on Yarn

2019-02-26 Thread Prabhu Josephraj
Hi Anton, Spark Pools / Spark Fair Scheduler is scheduling the tasks within a Spark Job. Each Spark job will have multiple stages and each stage will have multiple tasks. This is different from YARN Fair Scheduler which schedules the jobs submitted to YARN Cluster. Spark Pools within a

How Useful YARN Placement Constraint for MapReduce Jobs

2019-02-08 Thread Prabhu Josephraj
Hi, I was thinking of supporting YARN Placement Constraints for MapReduce applications, but want to check with you on how useful it would be. Mappers usually run on data-local machines and so won't need it, but reducers can gain by being distributed to different machines using Anti

Re: yarn usercache dir not resolved properly when running an example application

2019-02-14 Thread Prabhu Josephraj
Hi Vinay, Can you try specifying below configs under Docker section in container-executor.cfg which will allow Docker Containers to use the NM Local Dirs. docker.allowed.ro-mounts=/data/yarn/local,,/usr/jdk64/jdk1.8.0_112/bin

Re: yarn usercache dir not resolved properly when running an example application

2019-02-14 Thread Prabhu Josephraj
ny chance?. > Also, is there any log or out information which says that the docker > container runtime has been picked up.? > > > > On Thu, Feb 14, 2019 at 9:38 PM Prabhu Josephraj > wrote: > >> Hi Vinay, >> >> Can you try specifying below configs under Docker

Re: The AvailableVCores of the scheduler queue is a negative number

2019-02-19 Thread Prabhu Josephraj
This is expected for DefaultResourceCalculator (memory-based scheduling), which allocates the requested memory and 1 (logical) core per container. Say a node has 100 GB and 5 cores, and 15 containers are requested with 10 GB each: 10 containers will be allocated, and the available node resource will be 0 GB and
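The arithmetic in this reply can be sketched in a few lines of Python. This is only an illustrative model, not YARN code: the function name `allocate_memory_only` and its parameters are made up here, and it assumes each container is charged exactly 1 vcore while only memory is checked before allocating.

```python
def allocate_memory_only(node_mem_gb, node_vcores, requested,
                         mem_per_container_gb, vcores_per_container=1):
    """Toy model of memory-only allocation (hypothetical helper, not a YARN API).

    Containers are admitted while memory fits; vcores are still subtracted
    for bookkeeping but never checked, so they can go below zero.
    """
    allocated = 0
    avail_mem = node_mem_gb
    avail_vcores = node_vcores
    for _ in range(requested):
        if avail_mem < mem_per_container_gb:
            break  # memory is the only limiting resource in this model
        avail_mem -= mem_per_container_gb
        avail_vcores -= vcores_per_container
        allocated += 1
    return allocated, avail_mem, avail_vcores


# Numbers from the message: 100 GB node, 5 cores, 15 requests of 10 GB each.
print(allocate_memory_only(100, 5, 15, 10))
```

With the numbers from the message, the model admits 10 containers, leaving 0 GB of memory and a vcore balance of 5 - 10 = -5, which is how a negative available vcore count can appear.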

Re: How to restrict users who can post domains/entities to the YARN Timeline Server?

2019-05-30 Thread Prabhu Josephraj
Hi Junseung, You are right, anyone who has a valid Kerberos ticket is allowed to put a domain, but the owner of the domain can decide who can write and read entities in the domain. We can write a custom Filter with extra logic to restrict certain users from creating a domain and add the

Re: RM web got HTTP ERROR 500

2019-06-12 Thread Prabhu Josephraj
Hi Kevin, It looks like different versions of the hadoop-yarn-api jar are on the classpath of the YARN ResourceManager. Can you remove the older jars, if any, from the classpath? lsof -p or adding -verbose in YARN_OPTS in the yarn.cmd file will help to find the wrong jars. Thanks, Prabhu Joseph On Wed, Jun 12, 2019

Re: Set yarn.nodemanager.resource.memory-mb higher than node physical memory

2019-08-15 Thread Prabhu Josephraj
YARN allocates based on the configuration (yarn.nodemanager.resource.memory-mb) the user has set. It has allocated the AM container of size 1536 MB because it fits in the 5120 MB available node size. yarn.nodemanager.pmem-check-enabled will kill the container if the physical memory usage of the

Re: Set yarn.nodemanager.resource.memory-mb higher than node physical memory

2019-08-15 Thread Prabhu Josephraj
Are you aware of any job syntax to tune the 'container physical memory > usage' to 'force' job kill/log? > > thanks/Guido > > > > On Thu, Aug 15, 2019 at 1:50 PM Prabhu Josephraj > wrote: > >> YARN allocates based on the configuration >> (yarn.nodemanag

Re: Set yarn.nodemanager.resource.memory-mb higher than node physical memory

2019-08-15 Thread Prabhu Josephraj
esource Utilization by Containers : PMem:110 MB, VMem:4014 MB, > VCores:0.9735 > ...unquote > > My question is : which job setting may I use to force a node physical > memory usage >512MB and force a job kill due (or thanks) pmem check. > Hope above better explain my q

Re: Could not find or load main class org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer

2019-08-20 Thread Prabhu Josephraj
On a secure cluster, the ContainerLocalizer JVM runs as the job user. The issue below happens when the job user does not have access to the hadoop-yarn-server-nodemanager-.jar present on the Hadoop classpath of the NodeManager machine. Could not find or load main class org.apache.hadoop.yarn.server.

Re: Docker container executor is failing

2019-08-30 Thread Prabhu Josephraj
Can you test after adding local to docker.trusted.registries in container-executor.cfg? FYI: https://community.cloudera.com/t5/Support-Questions/Not-able-to-run-docker-container-on-yarn-even-after/m-p/224259 On Fri, Aug 30, 2019 at 2:07 PM Yen-Onn Hiu wrote: > hi all, > > I have a bash script

Re: NegativeArraySizeException during map segment merge

2019-09-04 Thread Prabhu Josephraj
1. Looking at IFile$Reader#nextRawValue, not sure why we create valBytes array of size 2 * currentValueLength even though it tries to read data of currentValueLength size. If there is no reason, this can be fixed which will fix the problem. public void nextRawValue(DataInputBuffer value) throws

Re: user notification upon application error

2019-12-04 Thread Prabhu Josephraj
A MapReduce application can be configured to report its status on completion through mapreduce.job.end-notification.url. You would need to write a web service to collect the status and send email to users. Below has examples
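Separately from the examples the message refers to, here is a minimal Python sketch of such a web service. It assumes the job sets mapreduce.job.end-notification.url to something like http://HOST:8080/notify?jobId=$jobId&jobStatus=$jobStatus (host, port, and path are placeholders chosen for this sketch) and that the notification arrives as an HTTP GET. The names `parse_notification`, `notify_user`, and `EndNotificationHandler` are hypothetical, and the email step is left as a stub.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlparse, parse_qs


def parse_notification(path):
    """Extract (jobId, jobStatus) from the request path, or None if missing."""
    query = parse_qs(urlparse(path).query)
    job_id = query.get("jobId", [None])[0]
    status = query.get("jobStatus", [None])[0]
    return job_id, status


def notify_user(job_id, status):
    # Hypothetical hook: replace with real email delivery (e.g. via smtplib).
    print(f"job {job_id} finished with status {status}")


class EndNotificationHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        job_id, status = parse_notification(self.path)
        if job_id is None or status is None:
            self.send_response(400)  # required parameters are missing
        else:
            notify_user(job_id, status)
            self.send_response(200)
        self.end_headers()


if __name__ == "__main__":
    # Port 8080 is an arbitrary choice for this sketch.
    HTTPServer(("0.0.0.0", 8080), EndNotificationHandler).serve_forever()
```

Keeping the parsing in a plain function makes it easy to check without starting the server; who gets emailed for which job is left open, since the message does not say how users map to jobs.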

Re: Exception in thread "main" org.codehaus.jackson.map.exc.UnrecognizedPropertyException: Unrecognized field "Token" (Class org.apache.hadoop.yarn.api.records.timeline.TimelineDelegationTokenResponse

2019-10-17 Thread Prabhu Josephraj
I suspect the TimelineClient and ApplicationHistoryServer are using different Hadoop libraries. Can you make sure the client uses the same Hadoop jars and dependency jars as the ApplicationHistoryServer process? A simple workaround is to disable the timeline service for this job. hbase

Re: how to list yarn applications by creation time and filter by username?

2019-11-27 Thread Prabhu Josephraj
The YARN CLI does not do that; I think you need to write a script that does it on top of the output provided by the YARN CLI. On Wed, Nov 27, 2019 at 9:19 AM Manuel Sopena Ballesteros < manuel...@garvan.org.au> wrote: > Dear Hadoop community, > > > > I am learning yarn and would like to find an
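A script of the kind suggested in this reply might look like the Python sketch below. It rests on assumptions that should be checked against your cluster: that `yarn application -list` prints tab-separated rows with the application ID in the first column and the user in the fourth, and that ordering by the cluster timestamp and sequence number embedded in the application ID is an acceptable stand-in for creation order (the list output carries no start time). The function names and the sample text are made up for illustration.

```python
def parse_app_list(cli_output):
    """Parse rows of `yarn application -list` output (assumed tab-separated)."""
    apps = []
    for line in cli_output.splitlines():
        fields = [f.strip() for f in line.split("\t")]
        # Skip banner and header lines; data rows start with "application_".
        if len(fields) < 4 or not fields[0].startswith("application_"):
            continue
        _, cluster_ts, seq = fields[0].split("_")
        apps.append({
            "id": fields[0],
            "name": fields[1],
            "user": fields[3],
            # Submission order as encoded in the ID, not a real timestamp.
            "order": (int(cluster_ts), int(seq)),
        })
    return apps


def apps_for_user(cli_output, user):
    """Return the given user's applications, oldest ID first."""
    mine = [a for a in parse_app_list(cli_output) if a["user"] == user]
    return sorted(mine, key=lambda a: a["order"])


# Made-up sample in the assumed layout (not real cluster output).
SAMPLE = (
    "Application-Id\tApplication-Name\tApplication-Type\tUser\tQueue\tState\n"
    "application_1570681803028_0018\tjob-b\tSPARK\talice\tdefault\tRUNNING\n"
    "application_1570681803028_0003\tjob-a\tMAPREDUCE\talice\tdefault\tFINISHED\n"
    "application_1570681803028_0007\tjob-c\tMAPREDUCE\tbob\tdefault\tFINISHED\n"
)

for app in apps_for_user(SAMPLE, "alice"):
    print(app["id"], app["name"])
```

In practice the input would be the captured output of `yarn application -list -appStates ALL`, for example through `subprocess.run(..., capture_output=True, text=True).stdout`.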

Re: Can't Change Retention Period for YARN Log Aggregation

2019-11-21 Thread Prabhu Josephraj
The deletion service runs as part of MapReduce JobHistoryServer. Can you try restarting it? On Fri, Nov 22, 2019 at 3:42 AM David M wrote: > All, > > > > I have an HDP 2.6.1 cluster where we’ve had > yarn.log-aggregation.retain-seconds set to 30 days for a while, and > everything was working

Re: how to list yarn applications by creation time and filter by username?

2019-11-27 Thread Prabhu Josephraj
allesteros < manuel...@garvan.org.au> wrote: > Thanks Prabhu, > > > > Do you know which yarn command can I use in order to get application > creation time? > > > > Thank you > > > > Manuel > > > > *From:* Prabhu Josephraj [mailt

Re: can't start spark thrift after Configuring YARN container executor

2019-10-10 Thread Prabhu Josephraj
As per the error, the spark user does not have permission to create a directory under the NodeManager local directory, or the existing spark user directory has a stale uid or gid. *Permission denied Can't create directory /d1/hadoop/yarn/local/usercache/spark/appcache/application_1570681803028_0018* 1.