Robin Knox created SAMZA-1792:
---------------------------------
Summary: Samza whitelist for output topic names
Key: SAMZA-1792
URL: https://issues.apache.org/jira/browse/SAMZA-1792
Project: Samza
Issue Type: Improvement
Reporter: Robin Knox
We are currently unable to obtain a list of topic names produced to per samza
job. The proposal here is that samza could use a *whitelist* of topic names
which are allowed to be written to. That list of topic names could be checked
against in the send method of
[KafkaSystemProducer.scala|https://github.com/apache/samza/blob/032a16079db8cbb1caf00117b985f7575d2c823a/samza-kafka/src/main/scala/org/apache/samza/system/kafka/KafkaSystemProducer.scala#L78].
The whitelist could be defined as part of initialisation of a samza job.
The benefits of this are two-fold:
# We can log the defined whitelist of output topics which the job is intended
to produce to
# We can prevent users from creating samza jobs which log to dynamic topics,
creating a huge number of additional topics (potentially unintentionally)
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)