On Tue, Oct 11, 2016 at 10:55 AM, Michael Armbrust <mich...@databricks.com> wrote:
> *Complex event processing and state management:* Several groups I've >> talked to want to run a large number (tens or hundreds of thousands now, >> millions in the near future) of state machines over low-rate partitions of >> a high-rate stream. Covering these use cases translates roughly into a >> three sub-requirements: maintaining lots of persistent state efficiently, >> feeding tuples to each state machine in the right order, and exposing >> convenient programmer APIs for complex event detection and signal >> processing tasks. >> > > I've heard this one too, but don't know of anyone actively working on it. > Would be awesome to open a JIRA and start discussing what the APIs would > look like. > There is an existing ticket for CEP: https://issues.apache.org/jira/browse/SPARK-14745