On a related note, one bug we hit when using Samza was not populating
all the partitions in our test environment for a bootstrap stream. That happened because we partition by something we call an application ID and we were testing using only one application's data. The job would catch up to the bootstrap stream
and then freeze. The solution was to add more data from other applications.

I know this was kind of by design, but it seems like a behavior that is bound
to cause many problems for others as well. I would like to see this change.

Lukas

-----Original Message----- From: Roger Hoover
Sent: Sunday, January 4, 2015 3:31 PM
To: [email protected]
Subject: Samza 0.8.0 - Job hangs if bootstrap topic is empty

Hi,

If someone forgets to populate a bootstrap topic before starting a job, the
job will hang indefinitely without processing any non-bootstrap messages,
even if the bootstrap topic later populated.  This is unexpected and hard
for operations to debug.

I think Samza should either treat empty bootstrap topics as already caught
up or as fatal errors.  I created a JIRA here:
https://issues.apache.org/jira/browse/SAMZA-502

Thanks,

Roger

Reply via email to