Hey John! It looks like the task managers are not picking up the correct configuration. Can you please verify that all nodes (JobManager and TaskManager) use the same configuration?
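
If it helps, here is a minimal Python sketch for comparing the flink-conf.yaml contents of two nodes. It assumes the file consists of flat `key: value` lines with `#` comments. The function names (parse_flink_conf, diff_confs) are made up for illustration and are not part of Flink.

```python
def parse_flink_conf(text):
    """Parse flat 'key: value' lines (assumed flink-conf.yaml layout) into a dict."""
    conf = {}
    for line in text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Split on the first colon only, so values like localhost:2181 stay intact.
        key, value = line.split(":", 1)
        conf[key.strip()] = value.strip()
    return conf


def diff_confs(a, b):
    """Return {key: (value_in_a, value_in_b)} for every key whose value differs."""
    keys = sorted(set(a) | set(b))
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}


if __name__ == "__main__":
    # Hypothetical file contents; in practice, read the file from each node.
    jobmanager = parse_flink_conf(
        "recovery.mode: zookeeper\nrecovery.zookeeper.quorum: localhost:2181\n"
    )
    taskmanager = parse_flink_conf("recovery.zookeeper.quorum: localhost:2181\n")
    for key, (jm_value, tm_value) in diff_confs(jobmanager, taskmanager).items():
        print(f"{key}: JobManager={jm_value!r} TaskManager={tm_value!r}")
```

An empty result from diff_confs means both nodes parsed to the same settings. A key that is present on only one node shows up with None on the other side.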

The task managers use ZooKeeper to look up the JobManager and not the configuration.

From the docs (https://ci.apache.org/projects/flink/flink-docs-release-1.0/setup/jobmanager_high_availability.html), you need at least the following configuration:

recovery.mode: zookeeper
recovery.zookeeper.quorum: localhost:2181
recovery.zookeeper.path.root: /flink # important: customize per cluster
state.backend: filesystem
state.backend.fs.checkpointdir: hdfs:///flink/checkpoints
recovery.zookeeper.storageDir: hdfs:///flink/recovery

Does this help?

– Ufuk

On Wed, May 4, 2016 at 8:42 PM, John Sherwood <j...@vt.edu> wrote:
> I'm attempting to move to an HA configuration with a trio of JobManagers on
> top of a ZK cluster. From the docs, it appears that I should have them in my
> 'masters' file (as I do), but when I attempt to start the TaskManagers, they
> die complaining there is no jobmanager.rpc.address config - which seems
> counter to the purpose of having the masters. Is there some other setting
> I'm missing to tell the taskmanagers to use the masters file?