Is there any way to integrate Apex metrics/stats into an external
monitoring system (e.g. graphite)? Also what's the best way for enabling
jmx for a submitted job?
Thanks
Can you please attach the stacktrace of the operator?
You can increase the attribute TIMEOUT_WINDOW_COUNT , AppMaster uses that
to decide when to kill the blocked operator.
For taking stack trace, find the information in the blog.
I don't see such warning in the appmaster log.
On 2017-02-28 12:16 (-0800), Ashwin Chandra Putta
> wrote:
> Sunil,
> This might be related to checkpointing. See:
>
Sunil,
This might be related to checkpointing. See:
https://github.com/apache/apex-core/blob/master/engine/src/main/java/com/datatorrent/stram/StreamingContainerManager.java#L2211-L2217
Also check this piece of code:
That doesn't seems to be the case. We do see window id moving in UI as well.
On 2017-02-28 11:19 (-0800), Munagala Ramanath
> wrote:
> It likely means that that operator is taking too long to return from one of
> the callbacks like beginWindow(),
Ok so I figured it out. It wasn't Scala exactly. The yarn container memory
configuration was at the default, just on the threshold between java jar
and a scala-based jar (containing the scala libs in addition to everything)
so the scala jobs were just big enough to cause the yarn containers to not
It likely means that that operator is taking too long to return from one of
the callbacks like beginWindow(), endWindow(),
emitTuples(), etc. Do you have any potentially blocking calls to external
systems in any of those callbacks ?
Ram
On Tue, Feb 28, 2017 at 11:09 AM, Sunil Parmar
2017-02-27 19:43:21,926 INFO com.datatorrent.stram.StreamingContainerManager:
Blocked operator PTOperator[id=3,name=eventUpdatesFormatter] container
PTContainer[id=1(container_1487310232732_0027_02_000111),state=ACTIVE] time
61905ms
2017-02-27 19:43:22,928 INFO
Hello All,
I have created an app in which I am using Delay Operator for adding a
feedback loop in the DAG. Also I am doing static partitioning of some
operator in the loop. When I run the app, the operators in the loop are not
being check-pointed.
To narrow down the cause of this, I experimented