[jira] [Created] (YARN-9039) App ACLs are not validated when serving logs from Logs CLI/Yarn UI2

2018-11-20 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-9039:
--

 Summary: App ACLs are not validated when serving logs from Logs 
CLI/Yarn UI2
 Key: YARN-9039
 URL: https://issues.apache.org/jira/browse/YARN-9039
 Project: Hadoop YARN
  Issue Type: Bug
  Components: log-aggregation
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad
 Attachments: YARN-9039.1.patch

App ACLs are not validated when logs are served through the YARN CLI.

The same applies when logs are served through YARN UI2 via the ATSv2 log 
web service.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org



[jira] [Created] (YARN-9030) Log aggregation changes to handle filesystems which do not support permissions

2018-11-16 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-9030:
--

 Summary: Log aggregation changes to handle filesystems which do 
not support permissions
 Key: YARN-9030
 URL: https://issues.apache.org/jira/browse/YARN-9030
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Some cloud stores, such as ABFS, do not support permissions and throw an 
UnsupportedOperationException when permission operations are attempted. Log 
aggregation should handle this case and not set permissions on the log 
aggregation base directory or its subdirectories.

{noformat}

2018-11-12 15:37:28,726 WARN  logaggregation.LogAggregationService (LogAggregationService.java:initApp(209)) - Application failed to init aggregation
org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to check permissions for dir [abfs://testc...@test.blob.core.windows.net/app-logs]
    at org.apache.hadoop.yarn.logaggregation.filecontroller.LogAggregationFileController.verifyAndCreateRemoteLogDir(LogAggregationFileController.java:277)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.logaggregation.LogAggregationService.initAppAggregator(LogAggregationService.java:238)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.logaggregation.LogAggregationService.initApp(LogAggregationService.java:204)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.logaggregation.LogAggregationService.handle(LogAggregationService.java:347)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.logaggregation.LogAggregationService.handle(LogAggregationService.java:69)
    at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:197)
    at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:126)
    at java.lang.Thread.run(Thread.java:748)

{noformat}
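
One way to handle this could look like the sketch below. It is plain Java with no Hadoop dependency: `RemoteStore`, `setPermission` and `trySetPermissions` are hypothetical stand-ins invented for illustration, not the real file system API and not the contents of any patch. The idea is to treat an UnsupportedOperationException as "this store has no permission model" and carry on.

```java
import java.util.List;

/** Hypothetical stand-in for a remote file system; not a Hadoop API. */
interface RemoteStore {
    void setPermission(String path, short mode);
}

final class LogDirPermissionSketch {
    /**
     * Tries to set permissions on each directory. Returns false as soon as
     * the store reports that permissions are unsupported, so the caller can
     * skip any later permission handling as well.
     */
    static boolean trySetPermissions(RemoteStore store, List<String> dirs, short mode) {
        for (String dir : dirs) {
            try {
                store.setPermission(dir, mode);
            } catch (UnsupportedOperationException e) {
                // The store has no permission model: stop trying.
                return false;
            }
        }
        return true;
    }
}
```

Returning a flag rather than rethrowing lets application init continue instead of failing as in the trace above.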






[jira] [Resolved] (YARN-4249) Many options in "yarn application" command is not documented

2018-10-25 Thread Suma Shivaprasad (JIRA)


 [ 
https://issues.apache.org/jira/browse/YARN-4249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suma Shivaprasad resolved YARN-4249.

Resolution: Not A Problem

> Many options in "yarn application" command is not documented
> 
>
> Key: YARN-4249
> URL: https://issues.apache.org/jira/browse/YARN-4249
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: nijel
>Priority: Major
>
> Only a few options are specified in the documentation.
> {code}
> Usage: `yarn application [options] `
> | COMMAND\_OPTIONS | Description |
> |: |: |
> | -appStates \ | Works with -list to filter applications based on 
> input comma-separated list of application states. The valid application state 
> can be one of the following:  ALL, NEW, NEW\_SAVING, SUBMITTED, ACCEPTED, 
> RUNNING, FINISHED, FAILED, KILLED |
> | -appTypes \ | Works with -list to filter applications based on 
> input comma-separated list of application types. |
> | -list | Lists applications from the RM. Supports optional use of -appTypes 
> to filter applications based on application type, and -appStates to filter 
> applications based on application state. |
> | -kill \ | Kills the application. |
> | -status \ | Prints the status of the application. |
> {code}
> Some options are missing, for example:
> -appId            Specify the application ID to be operated on.
> -help             Displays help for all commands.
> -movetoqueue      Moves the application to a different queue.
> -queue            Works with the movetoqueue command to specify 
>                   which queue to move an application to.
> -updatePriority   Updates the priority of an application. The 
>                   application ID can be passed using the 'appId' option.






[jira] [Created] (YARN-8923) Cleanup references to ENV file type in code

2018-10-19 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8923:
--

 Summary: Cleanup references to ENV file type in code 
 Key: YARN-8923
 URL: https://issues.apache.org/jira/browse/YARN-8923
 Project: Hadoop YARN
  Issue Type: Improvement
  Components: yarn-native-services
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


YARN Service code still has references to the ENV file type, which should be 
removed. A few other unused methods in configuration file handling could be 
removed as well.






[jira] [Created] (YARN-8920) LogAggregation should be configurable to allow writing to underlying storage as appOwner or yarn user

2018-10-19 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8920:
--

 Summary: LogAggregation should be configurable to allow writing to 
underlying storage as appOwner or yarn user
 Key: YARN-8920
 URL: https://issues.apache.org/jira/browse/YARN-8920
 Project: Hadoop YARN
  Issue Type: Improvement
  Components: log-aggregation, yarn
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad









[jira] [Created] (YARN-8711) YARN UI2 : Display component state in Component list and details page for a Service

2018-08-24 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8711:
--

 Summary: YARN UI2 : Display component state in Component list and 
details page for a Service
 Key: YARN-8711
 URL: https://issues.apache.org/jira/browse/YARN-8711
 Project: Hadoop YARN
  Issue Type: Bug
  Components: yarn-ui-v2
Reporter: Suma Shivaprasad
Assignee: Akhil PB


YARN-8488 adds component states and service state = SUCCEEDED. Users could then 
track overall component status on the UI for terminating jobs. 
cc [~sunil.gov...@gmail.com]






[jira] [Created] (YARN-8710) Service AM should set a finite limit on NM container max retries

2018-08-24 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8710:
--

 Summary: Service AM should set a finite limit on NM container max 
retries 
 Key: YARN-8710
 URL: https://issues.apache.org/jira/browse/YARN-8710
 Project: Hadoop YARN
  Issue Type: Bug
  Components: yarn-native-services
 Environment: Container retries are currently set to a default of -1 in 
AbstractProviderService.buildContainerRetry. If this is not overridden via the 
service spec with a finite value for yarn.service.container-failure.retry.max, 
it causes infinite NM retries for the container under the ALWAYS/ON_FAILURE 
restart policies. Ideally the container should be retried a finite number of 
times on the same NM, after which the Service AM can retry on another node.

We can set this to a default value of 3.
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad
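
A minimal sketch of the proposed default, assuming the service configuration is available as a plain string map. The property name and the value 3 come from the text above; the class and method names are hypothetical, and mapping an explicit negative value back to the default is an assumption of this sketch, not something the issue specifies.

```java
import java.util.Map;

final class ContainerRetrySketch {
    // Property name quoted in the issue.
    static final String RETRY_MAX = "yarn.service.container-failure.retry.max";
    // Default proposed in the issue.
    static final int DEFAULT_MAX_RETRIES = 3;

    /** Returns a finite retry limit; unset or negative values become the default. */
    static int resolveMaxRetries(Map<String, String> serviceConf) {
        String raw = serviceConf.get(RETRY_MAX);
        if (raw == null) {
            return DEFAULT_MAX_RETRIES;
        }
        int value = Integer.parseInt(raw.trim());
        // Assumption: a negative value (retry forever) is replaced by the default.
        return value < 0 ? DEFAULT_MAX_RETRIES : value;
    }
}
```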









[jira] [Created] (YARN-8675) Setting hostname of docker container breaks with "host" networking mode for Apps which do not run as a YARN service

2018-08-16 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8675:
--

 Summary: Setting hostname of docker container breaks with "host" 
networking mode for Apps which do not run as a YARN service
 Key: YARN-8675
 URL: https://issues.apache.org/jira/browse/YARN-8675
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Applications like the Spark AM currently do not run as a YARN service, and 
setting the hostname breaks driver/executor communication with Docker versions 
1.13.1 and later, especially with wire encryption turned on.

The proposal is to not set the hostname when "host" networking mode is enabled.
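
The proposal can be sketched as follows. This is an illustration only: `DockerHostnameSketch` and `runArgs` are hypothetical names and do not reflect how YARN actually assembles the docker command; only the `--network` and `--hostname` flags are real `docker run` options.

```java
import java.util.ArrayList;
import java.util.List;

final class DockerHostnameSketch {
    /** Builds docker run arguments, omitting --hostname when the network is "host". */
    static List<String> runArgs(String image, String network, String hostname) {
        List<String> args = new ArrayList<>();
        args.add("run");
        args.add("--network=" + network);
        // In host networking mode the container shares the node's network
        // stack, so the hostname is left alone.
        if (!"host".equals(network) && hostname != null) {
            args.add("--hostname=" + hostname);
        }
        args.add(image);
        return args;
    }
}
```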






[jira] [Created] (YARN-8588) Logging improvements for better debuggability

2018-07-26 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8588:
--

 Summary: Logging improvements for better debuggability
 Key: YARN-8588
 URL: https://issues.apache.org/jira/browse/YARN-8588
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Capacity allocations decided in GuaranteedCapacityOvertimePolicy are available 
via AutoCreatedLeafQueueConfig. However, this class lacks a toString method, 
and some additional DEBUG-level logs are needed for better debuggability.






[jira] [Created] (YARN-8572) YarnClient getContainers API should support filtering by container status

2018-07-24 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8572:
--

 Summary: YarnClient getContainers API should support filtering by 
container status
 Key: YARN-8572
 URL: https://issues.apache.org/jira/browse/YARN-8572
 Project: Hadoop YARN
  Issue Type: Improvement
  Components: yarn
Reporter: Suma Shivaprasad


YarnClient.getContainers should support filtering containers by their status 
(RUNNING, COMPLETED, etc.). This may require corresponding changes in ATS to 
filter by container status for a given application attempt.
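
To illustrate the kind of filtering being asked for, here is a small self-contained sketch. The `State` enum, the `Container` class and `idsInState` are hypothetical and are not the YarnClient API; they only show a status filter applied to a list of containers.

```java
import java.util.List;
import java.util.stream.Collectors;

final class ContainerFilterSketch {
    /** Hypothetical states; the issue names RUNNING and COMPLETED. */
    enum State { RUNNING, COMPLETED }

    /** Hypothetical, minimal container record. */
    static final class Container {
        final String id;
        final State state;

        Container(String id, State state) {
            this.id = id;
            this.state = state;
        }
    }

    /** Returns the ids of containers in the wanted state, preserving input order. */
    static List<String> idsInState(List<Container> containers, State wanted) {
        return containers.stream()
                .filter(c -> c.state == wanted)
                .map(c -> c.id)
                .collect(Collectors.toList());
    }
}
```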






[jira] [Created] (YARN-8519) Yarn UI2 : Changes to depict Auto Created leaf Queues/Managed Queues differently from other queues

2018-07-11 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8519:
--

 Summary: Yarn UI2 : Changes to depict Auto Created leaf 
Queues/Managed Queues differently from other queues
 Key: YARN-8519
 URL: https://issues.apache.org/jira/browse/YARN-8519
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad


YARN-7420 covers changes to depict auto-created leaf queues in a separate color 
notation, but this was done in the old YARN UI. Similar changes need to be 
incorporated in the new YARN UI to depict Managed Parent queues and 
auto-created leaf queues separately.






[jira] [Created] (YARN-8419) In "New Service" section of new YARN UI, user cannot submit service as Submit button is always disabled.

2018-06-11 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8419:
--

 Summary: In "New Service" section of new YARN UI, user cannot 
submit service as Submit button is always disabled.
 Key: YARN-8419
 URL: https://issues.apache.org/jira/browse/YARN-8419
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad


This happens because the user.name check, which is needed for a non-secure 
cluster, is still mandatory. In a secure cluster, user.name is not exposed to 
the UI, so the check fails there.






[jira] [Created] (YARN-8415) TimelineWebServices should throw a ForBiddenException ( 403

2018-06-11 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8415:
--

 Summary: TimelineWebServices should throw a ForBiddenException ( 
403
 Key: YARN-8415
 URL: https://issues.apache.org/jira/browse/YARN-8415
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad









[jira] [Created] (YARN-8339) Service AM should localize static/archive resource types to container working directory instead of 'resources' directory

2018-05-22 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8339:
--

 Summary: Service AM should localize static/archive resource types 
to container working directory instead of 'resources' directory
 Key: YARN-8339
 URL: https://issues.apache.org/jira/browse/YARN-8339
 Project: Hadoop YARN
  Issue Type: Bug
  Components: yarn-native-services
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


This is to address one of the review comments posted by [~wangda] in YARN-8079 
at 
https://issues.apache.org/jira/browse/YARN-8079?focusedCommentId=16482065=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16482065






[jira] [Created] (YARN-8300) Fix NPE in DefaultUpgradeComponentsFinder

2018-05-15 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8300:
--

 Summary: Fix NPE in DefaultUpgradeComponentsFinder 
 Key: YARN-8300
 URL: https://issues.apache.org/jira/browse/YARN-8300
 Project: Hadoop YARN
  Issue Type: Bug
  Components: yarn
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad



In current upgrades for YARN native services, we do not support addition or 
deletion of components during upgrade. On trying to upgrade with the same 
number of components in the target spec as in the current service spec, but 
with one of the components having a new target spec and name, the following 
NPE is seen in the service AM logs:

{noformat}
2018-05-15 00:10:41,489 [IPC Server handler 0 on 37488] ERROR service.ClientAMService - Error while trying to upgrade service {} 
java.lang.NullPointerException
    at org.apache.hadoop.yarn.service.UpgradeComponentsFinder$DefaultUpgradeComponentsFinder.lambda$findTargetComponentSpecs$0(UpgradeComponentsFinder.java:103)
    at java.util.ArrayList.forEach(ArrayList.java:1257)
    at org.apache.hadoop.yarn.service.UpgradeComponentsFinder$DefaultUpgradeComponentsFinder.findTargetComponentSpecs(UpgradeComponentsFinder.java:100)
    at org.apache.hadoop.yarn.service.ServiceManager.processUpgradeRequest(ServiceManager.java:259)
    at org.apache.hadoop.yarn.service.ClientAMService.upgrade(ClientAMService.java:163)
    at org.apache.hadoop.yarn.service.impl.pb.service.ClientAMProtocolPBServiceImpl.upgradeService(ClientAMProtocolPBServiceImpl.java:81)
    at org.apache.hadoop.yarn.proto.ClientAMProtocol$ClientAMProtocolService$2.callBlockingMethod(ClientAMProtocol.java:5972)
    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:523)
    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:991)
    at org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:872)
    at org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:818)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1682)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2678)
{noformat}
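
The trace does not show what was null at line 103. Assuming the cause is a lookup of a target component by a name that is absent from the current spec, a pre-check along these lines would turn the NPE into a clear validation error. The class and method are hypothetical and this is not the actual fix.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

final class UpgradeNameCheckSketch {
    /** Returns target component names that have no match in the current spec. */
    static List<String> unknownComponents(Set<String> currentNames, List<String> targetNames) {
        List<String> unknown = new ArrayList<>();
        for (String name : targetNames) {
            if (!currentNames.contains(name)) {
                // A renamed or added component: not supported during upgrade.
                unknown.add(name);
            }
        }
        return unknown;
    }
}
```

A caller could reject the upgrade request with the returned names in the error message whenever the list is non-empty.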






[jira] [Created] (YARN-8296) Update YarnServiceApi documentation and yarn service UI code to remove references to unique_component_support

2018-05-15 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8296:
--

 Summary: Update YarnServiceApi documentation and yarn service UI 
code to remove references to unique_component_support
 Key: YARN-8296
 URL: https://issues.apache.org/jira/browse/YARN-8296
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


yarn-servicedef.js/service-component-table.hbs and YarnServiceAPI.md have 
references to unique_component_support, which is unused in the YARN service 
implementation code. We should remove it from the docs and UI code if it is no 
longer used.

cc [~billie.rinaldi] [~gsaha] 






[jira] [Created] (YARN-8255) Allow option to disable flex for a service component

2018-05-07 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8255:
--

 Summary: Allow option to disable flex for a service component 
 Key: YARN-8255
 URL: https://issues.apache.org/jira/browse/YARN-8255
 Project: Hadoop YARN
  Issue Type: Bug
  Components: yarn-native-services
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


YARN-8080 implements restart capabilities for service component instances. YARN 
service components should add an option to disallow flexing, to support 
workloads that are essentially batch/iterative jobs which terminate with 
restart_policy=NEVER/ON_FAILURE. Flexing could be disabled by default for 
components where restart_policy=NEVER/ON_FAILURE and enabled by default when 
restart_policy=ALWAYS (which is the default restart_policy), unless explicitly 
set in the service spec.

The option could be exposed as part of the component spec as "allow_flexing". 
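
The defaulting rule described above can be sketched as a small function. The `RestartPolicy` enum and `allowFlexing` method are hypothetical illustrations; a null `explicit` value stands for "allow_flexing not set in the spec".

```java
final class AllowFlexingSketch {
    enum RestartPolicy { ALWAYS, ON_FAILURE, NEVER }

    /**
     * Resolves whether flexing is allowed. An explicit setting wins;
     * otherwise flexing is enabled only for ALWAYS.
     */
    static boolean allowFlexing(RestartPolicy policy, Boolean explicit) {
        if (explicit != null) {
            return explicit;
        }
        return policy == RestartPolicy.ALWAYS;
    }
}
```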

cc [~billie.rinaldi] [~gsaha] [~eyang] [~csingh] [~wangda]








[jira] [Created] (YARN-8217) RmAuthenticationFilterInitializer /TimelineAuthenticationFilterInitializer should use Configuration.getPropsWithPrefix instead of iterator

2018-04-26 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8217:
--

 Summary: RmAuthenticationFilterInitializer 
/TimelineAuthenticationFilterInitializer should use 
Configuration.getPropsWithPrefix instead of iterator
 Key: YARN-8217
 URL: https://issues.apache.org/jira/browse/YARN-8217
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


HADOOP-15411 fixed a similar issue for AuthenticationFilterInitializer. This 
issue can occur in 
RmAuthenticationFilterInitializer/TimelineAuthenticationFilterInitializer as 
well
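
For readers unfamiliar with the prefix lookup, the sketch below mimics the idea over a plain Map: collect the entries whose key starts with a prefix and strip that prefix from the key. It is a stand-alone illustration with hypothetical names, not Hadoop's implementation of Configuration.getPropsWithPrefix.

```java
import java.util.HashMap;
import java.util.Map;

final class PrefixPropsSketch {
    /** Returns entries whose key starts with prefix, with the prefix stripped from each key. */
    static Map<String, String> propsWithPrefix(Map<String, String> conf, String prefix) {
        Map<String, String> out = new HashMap<>();
        for (Map.Entry<String, String> e : conf.entrySet()) {
            if (e.getKey().startsWith(prefix)) {
                out.put(e.getKey().substring(prefix.length()), e.getValue());
            }
        }
        return out;
    }
}
```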






[jira] [Created] (YARN-8210) AMRMClient logging on every heartbeat to track updation of AM RM token causes too many log lines to be generated in AM logs

2018-04-25 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8210:
--

 Summary: AMRMClient logging on every heartbeat to track updation 
of AM RM token causes too many log lines to be generated in AM logs
 Key: YARN-8210
 URL: https://issues.apache.org/jira/browse/YARN-8210
 Project: Hadoop YARN
  Issue Type: Bug
  Components: yarn
Affects Versions: 3.0.0-alpha1, 2.9.0
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


YARN-4682 added logs to track when the AM RM token is updated, for 
debuggability. However, this is printed on every heartbeat and could flood the 
AM logs whenever the RM's master key is rolled over, especially for a 
long-running AM. Hence proposing to remove this log line.
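
The issue proposes removing the log line outright. As a possible alternative, not what the issue proposes, the line could be emitted only when the token actually changes. The sketch below uses a hypothetical `TokenLogSketch` class and a counter in place of a real logger call.

```java
import java.util.Objects;

final class TokenLogSketch {
    private String lastLoggedToken;
    private int logLines;

    /** Handles one heartbeat; counts a log line only when the token differs from the last one logged. */
    void onHeartbeat(String token) {
        if (token != null && !Objects.equals(token, lastLoggedToken)) {
            lastLoggedToken = token;
            // A real client would call its logger here.
            logLines++;
        }
    }

    int logLines() {
        return logLines;
    }
}
```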






[jira] [Created] (YARN-8177) Fix documentation for node label support

2018-04-18 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8177:
--

 Summary: Fix documentation for node label support 
 Key: YARN-8177
 URL: https://issues.apache.org/jira/browse/YARN-8177
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Capacity Scheduler Dynamic Queues feature documentation needs to be fixed for 
node label support with examples.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org



[jira] [Created] (YARN-8121) Container runtime and runtime context information should be available in UI

2018-04-05 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8121:
--

 Summary: Container runtime and runtime context information should 
be available in UI
 Key: YARN-8121
 URL: https://issues.apache.org/jira/browse/YARN-8121
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Suma Shivaprasad


Currently we do not publish container runtime and other runtime context 
information (important in the case of Docker), such as the Docker image used 
or the Docker network the container ran on, in ATS. This needs to be added to 
ATS and made available in the UI to give end users visibility into the 
container runtime chosen and its runtime parameters.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org



[jira] [Created] (YARN-8086) ManagedParentQueue with no leaf queues cause JS error in new UI

2018-03-29 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-8086:
--

 Summary: ManagedParentQueue with no leaf queues cause JS error in 
new UI
 Key: YARN-8086
 URL: https://issues.apache.org/jira/browse/YARN-8086
 Project: Hadoop YARN
  Issue Type: Bug
Affects Versions: 3.1.0
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


A ManagedParentQueue can have no leaf queues, and capacity-queue.js does not 
do null checks when looking up child queues under managed parent queues. This 
causes a JS error on the "Cluster Overview/Queues" page if such queues exist 
in the capacity-scheduler configuration.






[jira] [Resolved] (YARN-7987) Docker container name(--name) needs to be DNS friendly for DNS resolution to work in user defined networks.

2018-03-12 Thread Suma Shivaprasad (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suma Shivaprasad resolved YARN-7987.

Resolution: Won't Fix

> Docker container name(--name) needs to be DNS friendly for DNS resolution to 
> work in user defined networks. 
> 
>
> Key: YARN-7987
> URL: https://issues.apache.org/jira/browse/YARN-7987
> Project: Hadoop YARN
>  Issue Type: Sub-task
>Reporter: Suma Shivaprasad
>Assignee: Suma Shivaprasad
>Priority: Major
>
> User defined networks like overlays support DNS resolution through Docker 
> Embedded DNS which needs the container name (--name parameter value in docker 
> run) to be a FQDN for container names to be resolved - Please refer 
> documentation 
> [https://docs.docker.com/v17.09/engine/userguide/networking/configure-dns/]
> However Yarn sets the container name to the container's id which is not DNS 
> friendly(eg: container_e26_1519402686002_0035_01_03) and is not a FQDN. 
> The proposal is to set a FQDN(eg: 
> ctr-e26-1519402686002-0035-01-03.domain-name) as the docker container's 
> name for containers to be able to communicate to each other via hostnames in 
> user defined networks like overlays, bridges etc. The domain name will be 
> picked up from the YARN DNS registry configuration 
> (hadoop.registry.dns.domain-name)






[jira] [Created] (YARN-7994) Add support for network-alias in docker run for user defined networks

2018-03-02 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7994:
--

 Summary: Add support for network-alias in docker run for user 
defined networks 
 Key: YARN-7994
 URL: https://issues.apache.org/jira/browse/YARN-7994
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Docker Embedded DNS supports DNS resolution for containers by one or more of 
its configured {{--network-alias}} within a user-defined network. 
DockerRunCommand should support this option for DNS resolution to work through 
docker embedded DNS 






[jira] [Created] (YARN-7987) Docker container name(--name) needs to be DNS friendly for DNS resolution to work in user defined networks.

2018-02-28 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7987:
--

 Summary: Docker container name(--name) needs to be DNS friendly 
for DNS resolution to work in user defined networks. 
 Key: YARN-7987
 URL: https://issues.apache.org/jira/browse/YARN-7987
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


User defined networks like overlays support DNS resolution through Docker 
Embedded DNS which needs the container name (--name parameter value in docker 
run) to be a FQDN for container names to be resolved - Please refer 
documentation 
[https://docs.docker.com/v17.09/engine/userguide/networking/configure-dns/]

However Yarn sets the container name to the container's id which is not DNS 
friendly(eg: container_e26_1519402686002_0035_01_03) and is not a FQDN. 

The proposal is to set a FQDN(eg: 
ctr-e26-1519402686002-0035-01-03.domain-name) as the docker container's 
name for containers to be able to communicate to each other via hostnames in 
user defined networks like overlays, bridges etc. The domain name will be 
picked up from the YARN DNS registry configuration 
(hadoop.registry.dns.domain-name)
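
The renaming in the example above can be sketched as a simple string transform. `DnsFriendlyNameSketch` and `toFqdn` are hypothetical names; the mapping rule (swap the `container_` prefix for `ctr-`, turn underscores into hyphens, append the domain) is inferred from the two example names in this message.

```java
final class DnsFriendlyNameSketch {
    /**
     * Converts a YARN container id into a DNS-friendly name by replacing the
     * "container_" prefix with "ctr-", turning underscores into hyphens, and
     * appending the domain.
     */
    static String toFqdn(String containerId, String domain) {
        String prefix = "container_";
        if (!containerId.startsWith(prefix)) {
            throw new IllegalArgumentException("not a container id: " + containerId);
        }
        String rest = containerId.substring(prefix.length()).replace('_', '-');
        return "ctr-" + rest + "." + domain;
    }
}
```

With the id from this message, `toFqdn("container_e26_1519402686002_0035_01_03", "domain-name")` yields `ctr-e26-1519402686002-0035-01-03.domain-name`.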






[jira] [Created] (YARN-7935) Expose container's hostname to applications running within the docker container

2018-02-14 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7935:
--

 Summary: Expose container's hostname to applications running 
within the docker container
 Key: YARN-7935
 URL: https://issues.apache.org/jira/browse/YARN-7935
 Project: Hadoop YARN
  Issue Type: Sub-task
  Components: yarn
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Some applications have a need to bind to the container's hostname (like Spark) 
which is different from the NodeManager's hostname(NM_HOST which is available 
as an env during container launch) when launched through Docker runtime. The 
container's hostname can be exposed to applications via an env 
CONTAINER_HOSTNAME. Another potential candidate is the container's IP but this 
can be addressed in a separate jira.






[jira] [Created] (YARN-7657) Queue Mapping could provide options to provide 'user' specific auto-created queues under a specified group parent queue

2017-12-14 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7657:
--

 Summary: Queue Mapping could provide options to provide 'user' 
specific auto-created queues under a specified group parent queue
 Key: YARN-7657
 URL: https://issues.apache.org/jira/browse/YARN-7657
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Current Queue-Mapping only provides %user as an option for 'user' specific 
queues, as in u:%user:%user. We can also support %user with a group, as in 
'g:marketing-group:marketing.%user', so that user-specific queues can be 
automatically created under a group queue in this case.
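
A minimal sketch of the substitution, assuming mappings have the form type:source:queue as in the examples above. `QueueMappingSketch` and `resolveQueue` are hypothetical, and the sketch only resolves the queue name; it does not check group membership or create queues.

```java
final class QueueMappingSketch {
    /**
     * Resolves the queue part of a mapping of the form type:source:queue,
     * substituting %user with the submitting user.
     */
    static String resolveQueue(String mapping, String user) {
        String[] parts = mapping.split(":", 3);
        if (parts.length != 3) {
            throw new IllegalArgumentException("expected type:source:queue, got " + mapping);
        }
        return parts[2].replace("%user", user);
    }
}
```

For user alice, the group mapping above resolves to `marketing.alice`, a leaf under the `marketing` parent queue.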

cc [~leftnoteasy]



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org



[jira] [Created] (YARN-7643) Handle recovery of applications on auto-created leaf queues

2017-12-12 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7643:
--

 Summary: Handle recovery of applications on auto-created leaf 
queues
 Key: YARN-7643
 URL: https://issues.apache.org/jira/browse/YARN-7643
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


CapacityScheduler application recovery should auto-create the leaf queue if it 
doesn't exist. Also, RMAppManager needs to set the queue-mapping placement 
context so that the scheduler has the context it needs to recreate the queue.






[jira] [Created] (YARN-7639) Queue Management scheduling edit policy class needs to be configured dynamically

2017-12-11 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7639:
--

 Summary: Queue Management scheduling edit policy class needs to be 
configured dynamically
 Key: YARN-7639
 URL: https://issues.apache.org/jira/browse/YARN-7639
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


This needs to be configured dynamically for 
yarn.resourcemanager.monitor.capacity.queue-management.monitoring-interval 
whenever auto leaf queue creation is enabled for a parent queue.






[jira] [Created] (YARN-7638) Add unit tests for Preemption and Recovery

2017-12-11 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7638:
--

 Summary: Add unit tests for Preemption and Recovery
 Key: YARN-7638
 URL: https://issues.apache.org/jira/browse/YARN-7638
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Add unit tests for inter-leaf-queue preemption based on utilization and for 
work-preserving start/recovery.






[jira] [Created] (YARN-7634) Queue ACL validations should validate parent queue ACLs before auto-creating leaf queues

2017-12-09 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7634:
--

 Summary: Queue ACL validations should validate parent queue ACLs 
before auto-creating leaf queues
 Key: YARN-7634
 URL: https://issues.apache.org/jira/browse/YARN-7634
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


RMAppManager currently validates only leaf queue ACLs. If the leaf queue doesn't 
exist, which is the case for auto-created leaf queues, queue mapping may return 
a parent queue. However, parent queue ACLs are not validated. They need to be 
validated before auto-creating leaf queues under the mapped parent queue.
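
The check described above could be sketched roughly as follows. This is only 
an illustration, not the RMAppManager code: the class name, the method name 
and the map-based ACL representation are all assumptions made for the example, 
and a leaf queue is treated as "existing" simply when it has an ACL entry.

```java
import java.util.Map;
import java.util.Set;

// Illustrative sketch only; these are not YARN classes or APIs.
final class ParentQueueAclSketch {

  /**
   * Decides whether a user may submit to the queue chosen by queue mapping.
   *
   * @param submitAcls hypothetical map from queue path to users allowed to submit;
   *                   a leaf queue counts as existing if it has an entry here
   * @param user       submitting user
   * @param leafPath   full path of the target leaf queue
   * @param parentPath full path of the mapped parent queue
   */
  static boolean maySubmit(Map<String, Set<String>> submitAcls,
                           String user, String leafPath, String parentPath) {
    Set<String> leafAcl = submitAcls.get(leafPath);
    if (leafAcl != null) {
      // Leaf queue already exists: validate against its own ACL.
      return leafAcl.contains(user);
    }
    // Leaf queue does not exist yet: validate against the parent queue ACL
    // before any auto-creation is attempted.
    Set<String> parentAcl = submitAcls.get(parentPath);
    return parentAcl != null && parentAcl.contains(user);
  }
}
```

With a parent ACL that lists only "alice", a submission by "alice" to a 
not-yet-created leaf under that parent passes, while one by any other user is 
rejected before a queue is created.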






[jira] [Created] (YARN-7633) Add documentation for auto queue creation feature and related configurations

2017-12-09 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7633:
--

 Summary: Add documentation for auto queue creation feature and 
related configurations
 Key: YARN-7633
 URL: https://issues.apache.org/jira/browse/YARN-7633
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad









[jira] [Created] (YARN-7632) Effective min and max resource need to be set for auto created leaf queues upon creation and capacity management

2017-12-09 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7632:
--

 Summary: Effective min and max resource need to be set for auto 
created leaf queues upon creation and capacity management
 Key: YARN-7632
 URL: https://issues.apache.org/jira/browse/YARN-7632
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


YARN-5881 introduced the notion of configuring queues with absolute resource 
specifications instead of percentages. As part of that, each leaf queue has an 
effective min/max capacity that needs to be set when the queue is created and 
whenever the queue capacity is changed.






[jira] [Created] (YARN-7586) Application Placement should be done before ACL checks in ResourceManager

2017-11-30 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7586:
--

 Summary: Application Placement should be done before ACL checks in 
ResourceManager
 Key: YARN-7586
 URL: https://issues.apache.org/jira/browse/YARN-7586
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


YARN-7473 moved application placement from RMAppManager to RMAppImpl, which 
causes issues because ApplicationSubmissionContext still has the original queue 
specified by the user and not the mapped queue. This causes issues while doing 
ACL checks in RMAppManager.






[jira] [Created] (YARN-7574) Add support for Node Labels on Auto Created Leaf Queue Template

2017-11-28 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7574:
--

 Summary: Add support for Node Labels on Auto Created Leaf Queue 
Template
 Key: YARN-7574
 URL: https://issues.apache.org/jira/browse/YARN-7574
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


YARN-7473 adds support for auto-created leaf queues to inherit node label 
capacities from parent queues. However, there is no support for the leaf queue 
template to allow different configured capacities for different node labels. 






[jira] [Created] (YARN-7520) Queue Ordering policy changes for ordering auto created leaf queues within Managed parent Queues

2017-11-16 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7520:
--

 Summary: Queue Ordering policy changes for ordering auto created 
leaf queues within Managed parent Queues
 Key: YARN-7520
 URL: https://issues.apache.org/jira/browse/YARN-7520
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad









[jira] [Created] (YARN-7473) Implement Framework and policy for capacity management of auto created queues

2017-11-10 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7473:
--

 Summary: Implement Framework and policy for capacity management of 
auto created queues 
 Key: YARN-7473
 URL: https://issues.apache.org/jira/browse/YARN-7473
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad


This jira mainly addresses the following:
 
1. Support adding pluggable policies on parent queue for dynamically managing 
capacity/state for leaf queues.

2. Implement a default policy that manages capacity based on pending 
applications and grants either guaranteed or zero capacity to queues based on 
the parent's available guaranteed capacity.

3. Integrate with SchedulingEditPolicy framework to trigger this periodically 
and signal scheduler to take necessary actions for capacity/queue management.
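
A rough sketch of the "guaranteed or zero" idea in item 2 is shown below. It 
is not the policy implemented for this jira: the class, the method and its 
parameters are hypothetical, capacities are plain integer units (for example 
percentage points), and queues are served in map iteration order, which is an 
assumption of the example only.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative sketch only; not the policy class added by this jira.
final class GuaranteedOrZeroPolicySketch {

  /**
   * Assigns each leaf queue either the full template capacity or zero.
   *
   * @param pendingAppsByLeaf hypothetical map from leaf queue name to pending app count
   * @param parentAvailable   guaranteed capacity still available on the parent
   * @param templateCapacity  guaranteed capacity a single leaf queue receives
   */
  static Map<String, Integer> computeCapacities(Map<String, Integer> pendingAppsByLeaf,
                                                int parentAvailable,
                                                int templateCapacity) {
    Map<String, Integer> result = new LinkedHashMap<>();
    int remaining = parentAvailable;
    for (Map.Entry<String, Integer> entry : pendingAppsByLeaf.entrySet()) {
      boolean hasPendingApps = entry.getValue() > 0;
      if (hasPendingApps && remaining >= templateCapacity) {
        // Enough guaranteed capacity left on the parent: grant the full share.
        result.put(entry.getKey(), templateCapacity);
        remaining -= templateCapacity;
      } else {
        // Idle queue, or the parent is exhausted: grant zero capacity.
        result.put(entry.getKey(), 0);
      }
    }
    return result;
  }
}
```

A periodic trigger, as in item 3, would call something like 
computeCapacities on every interval and hand the resulting changes to the 
scheduler.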






[jira] [Created] (YARN-7420) YARN UI changes to depict auto created queues

2017-10-31 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7420:
--

 Summary: YARN UI changes to depict auto created queues 
 Key: YARN-7420
 URL: https://issues.apache.org/jira/browse/YARN-7420
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


Auto created queues will be depicted in a different color to indicate they have 
been auto created and for easier distinction from manually pre-configured 
queues.






[jira] [Created] (YARN-7419) Implement Auto Queue Creation with modifications to queue mapping flow

2017-10-31 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7419:
--

 Summary: Implement Auto Queue Creation with modifications to queue 
mapping flow
 Key: YARN-7419
 URL: https://issues.apache.org/jira/browse/YARN-7419
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


This involves changes to queue mapping flow to pass along context information 
for auto queue creation. Auto creation of queues will be part of Capacity 
Scheduler flow while attempting to resolve queues during application 
submission. The leaf queues which do not exist are auto created under parent 
queues which have been explicitly enabled for auto queue creation. To 
determine which parent queue to create the leaf queues under, parent queues 
need to be specified in the queue mapping configuration.
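
The resolution flow described above might look roughly like this. The sketch 
is an assumption-laden illustration rather than Capacity Scheduler code: the 
class and method names are invented, queues are tracked as plain path strings, 
and the mapping result is passed in as a parent path plus a leaf name.

```java
import java.util.HashSet;
import java.util.Set;

// Illustrative sketch only; these names are not Capacity Scheduler APIs.
final class AutoQueueCreationSketch {
  private final Set<String> existingQueues = new HashSet<>();
  private final Set<String> autoCreateParents = new HashSet<>();

  /** Registers a pre-configured queue by its full path. */
  void addQueue(String path) {
    existingQueues.add(path);
  }

  /** Registers a parent queue and explicitly enables auto queue creation on it. */
  void enableAutoCreate(String parentPath) {
    existingQueues.add(parentPath);
    autoCreateParents.add(parentPath);
  }

  boolean exists(String path) {
    return existingQueues.contains(path);
  }

  /**
   * Resolves the leaf queue for a submission, creating it when allowed.
   * The parent path is assumed to come from the queue mapping configuration.
   */
  String resolve(String parentPath, String leafName) {
    String leafPath = parentPath + "." + leafName;
    if (existingQueues.contains(leafPath)) {
      return leafPath; // Leaf already exists: nothing to create.
    }
    if (!autoCreateParents.contains(parentPath)) {
      throw new IllegalArgumentException("Queue " + leafPath
          + " does not exist and " + parentPath
          + " is not enabled for auto queue creation");
    }
    existingQueues.add(leafPath); // Auto-create the missing leaf queue.
    return leafPath;
  }
}
```

Keeping the parent path as an explicit argument reflects the point that the 
mapping has to name the parent, since a missing leaf name alone does not say 
where the queue should be created.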







[jira] [Created] (YARN-7394) Merge code paths for Reservation/Plan queues and Auto Created queues

2017-10-25 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7394:
--

 Summary: Merge code paths for Reservation/Plan queues and Auto 
Created queues
 Key: YARN-7394
 URL: https://issues.apache.org/jira/browse/YARN-7394
 Project: Hadoop YARN
  Issue Type: Sub-task
Reporter: Suma Shivaprasad
Assignee: Suma Shivaprasad


The initialization/reinitialization logic for ReservationQueue and auto-created 
leaf queues is similar. The proposal is to rename ReservationQueue to a more 
generic name, AutoCreatedLeafQueue, whose instances are managed either by 
PlanQueue (already exists) or by AutoCreatedEnabledParentQueue (new class). 








[jira] [Created] (YARN-7103) Add logs for errors while signalling containers for better debuggability

2017-08-25 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-7103:
--

 Summary: Add logs for errors while signalling containers for 
better debuggability
 Key: YARN-7103
 URL: https://issues.apache.org/jira/browse/YARN-7103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Suma Shivaprasad









[jira] [Created] (YARN-3346) Deadlock in Capacity Scheduler

2015-03-12 Thread Suma Shivaprasad (JIRA)
Suma Shivaprasad created YARN-3346:
--

 Summary: Deadlock in Capacity Scheduler
 Key: YARN-3346
 URL: https://issues.apache.org/jira/browse/YARN-3346
 Project: Hadoop YARN
  Issue Type: Bug
  Components: capacityscheduler
Affects Versions: 2.6.0
Reporter: Suma Shivaprasad


{noformat}
Found one Java-level deadlock:
=
2144051991@qtp-383501499-6:
  waiting to lock monitor 0x7fa700eec8e8 (object 0x0004589fec18, a 
org.apache.hadoop.yarn.server.resourcemanager.scheduler.common.fica.FiCaSchedulerApp),
  which is held by ResourceManager Event Processor
ResourceManager Event Processor:
  waiting to lock monitor 0x7fa700aadf88 (object 0x000441c05ec8, a 
org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue),
  which is held by IPC Server handler 0 on 54311
IPC Server handler 0 on 54311:
  waiting to lock monitor 0x7fa700e20798 (object 0x000441d867f8, a 
org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue),
  which is held by ResourceManager Event Processor
{noformat}





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)