Github user vanzin commented on a diff in the pull request:
https://github.com/apache/spark/pull/23030#discussion_r233948762
--- Diff:
resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala
---
@@ -598,13 +598,24 @@ private[yarn] class YarnAllocator(
(false, s"Container ${containerId}${onHostStr} was preempted.")
// Should probably still count memory exceeded exit codes
towards task failures
case VMEM_EXCEEDED_EXIT_CODE =>
- (true, memLimitExceededLogMessage(
- completedContainer.getDiagnostics,
- VMEM_EXCEEDED_PATTERN))
+ val suggestion = if
(conf.getBoolean(YarnConfiguration.NM_VMEM_CHECK_ENABLED,
--- End diff --
Ah, and another option that can help here is
`yarn.nodemanager.vmem-pmem-ratio`.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]