My Flink job failed to checkpoint with a "The job has failed" error.  The
logs contained no other recent errors.  I keep hitting the error even if I
cancel the jobs and restart them.  When I restarted my jobmanager and
taskmanager, the error went away.

What error am I hitting?  It looks like there is bad state that lives
outside the scope of a job.

How often do people restart their jobmanagers and taskmanager to deal with
errors like this?

Reply via email to