Hao Song created SAMZA-1387:
-------------------------------

             Summary: Unable to Start Samza App Because Regex Check
                 Key: SAMZA-1387
                 URL: https://issues.apache.org/jira/browse/SAMZA-1387
             Project: Samza
          Issue Type: Bug
    Affects Versions: 0.13.0
            Reporter: Hao Song


[I created this as a bug but feel free to change it if it's a feature]

When I'm trying to upgrade Samza to 0.13 (from 0.11), I got the error below 
when trying to start our job: 

[ec2-user@namenode-shadow-1] out: 2017-08-09 17:26:31.977 [main] 
KafkaSystemAdmin [INFO] Attempting to create coordinator stream 
__samza_coordinator_<env>.<AppName>_1.
[ec2-user@namenode-shadow-1] out: Exception in thread "main" 
java.lang.IllegalArgumentException: Identifier 'streamId' is 
'__samza_coordinator_<env>.<AppName>_1'. It must match the expression 
[A-Za-z0-9_-]+
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.system.StreamSpec.validateLogicalIdentifier(StreamSpec.java:201)
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.system.StreamSpec.<init>(StreamSpec.java:140)
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.system.kafka.KafkaStreamSpec.<init>(KafkaStreamSpec.java:152)
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.system.kafka.KafkaSystemAdmin.createCoordinatorStream(KafkaSystemAdmin.scala:334)
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.job.JobRunner.run(JobRunner.scala:88)
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.job.JobRunner$.doOperation(JobRunner.scala:52)
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.job.JobRunner$.main(JobRunner.scala:47)
[ec2-user@namenode-shadow-1] out:       at 
org.apache.samza.job.JobRunner.main(JobRunner.scala)

Looking at Samza code on Github, it seems Samza just started to restrict topic 
names to a stricter sets. It doesn't allow '.' anymore whereas we have been 
using '.' for all of our topics. That means we cannot upgrade to Samza 0.13 
without changing all the topics. That also breaks our convention for defining 
topic names. Having '.' in the topic name is actually very convenient for us 
because they can be shown in Grafana/StatsD in a more organized way.

Here is the change in Github: 
https://github.com/apache/samza/commit/e6c1eed4f1d576661abafce8477c1749c2554b39#diff-98a114809fb1ab416f8d1d0b4d318b96R201

Can you expand the regex set to a larger set including '.'? Thanks in advance!



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to