Robin Knox created SAMZA-1792:
---------------------------------

             Summary: Samza whitelist for output topic names
                 Key: SAMZA-1792
                 URL: https://issues.apache.org/jira/browse/SAMZA-1792
             Project: Samza
          Issue Type: Improvement
            Reporter: Robin Knox


We are currently unable to obtain a list of topic names produced to per samza 
job. The proposal here is that samza could use a *whitelist* of topic names 
which are allowed to be written to. That list of topic names could be checked 
against in the send method of 
[KafkaSystemProducer.scala|https://github.com/apache/samza/blob/032a16079db8cbb1caf00117b985f7575d2c823a/samza-kafka/src/main/scala/org/apache/samza/system/kafka/KafkaSystemProducer.scala#L78].

The whitelist could be defined as part of initialisation of a samza job.

The benefits of this are two-fold:
 # We can log the defined whitelist of output topics which the job is intended 
to produce to
 # We can prevent users from creating samza jobs which log to dynamic topics, 
creating a huge number of additional topics (potentially unintentionally)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to