Hao Song created SAMZA-1387:
-------------------------------
Summary: Unable to Start Samza App Because Regex Check
Key: SAMZA-1387
URL: https://issues.apache.org/jira/browse/SAMZA-1387
Project: Samza
Issue Type: Bug
Affects Versions: 0.13.0
Reporter: Hao Song
[I created this as a bug but feel free to change it if it's a feature]
While upgrading Samza from 0.11 to 0.13, I got the error below when starting
our job:
[ec2-user@namenode-shadow-1] out: 2017-08-09 17:26:31.977 [main] KafkaSystemAdmin [INFO] Attempting to create coordinator stream __samza_coordinator_<env>.<AppName>_1.
[ec2-user@namenode-shadow-1] out: Exception in thread "main" java.lang.IllegalArgumentException: Identifier 'streamId' is '__samza_coordinator_<env>.<AppName>_1'. It must match the expression [A-Za-z0-9_-]+
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.system.StreamSpec.validateLogicalIdentifier(StreamSpec.java:201)
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.system.StreamSpec.<init>(StreamSpec.java:140)
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.system.kafka.KafkaStreamSpec.<init>(KafkaStreamSpec.java:152)
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.system.kafka.KafkaSystemAdmin.createCoordinatorStream(KafkaSystemAdmin.scala:334)
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.job.JobRunner.run(JobRunner.scala:88)
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.job.JobRunner$.doOperation(JobRunner.scala:52)
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.job.JobRunner$.main(JobRunner.scala:47)
[ec2-user@namenode-shadow-1] out:     at org.apache.samza.job.JobRunner.main(JobRunner.scala)
Looking at the Samza code on GitHub, it seems Samza has started restricting
topic names to a stricter character set. It no longer allows '.', whereas we
have been using '.' in all of our topics. That means we cannot upgrade to
Samza 0.13 without renaming all the topics, which also breaks our convention
for defining topic names. Having '.' in the topic name is very convenient for
us because the topics then show up in Grafana/StatsD in a more organized way.
Here is the change on GitHub:
https://github.com/apache/samza/commit/e6c1eed4f1d576661abafce8477c1749c2554b39#diff-98a114809fb1ab416f8d1d0b4d318b96R201
Could you expand the regex to a larger character set that includes '.'? Thanks in advance!
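To make the request concrete, here is a minimal Java sketch of the check as the exception message describes it, next to a relaxed expression that also admits '.'. Only the expression [A-Za-z0-9_-]+ comes from the error above. The class name, the method name, the relaxed pattern, and the "env" / "AppName" placeholder values are assumptions for illustration, not Samza's actual code.

```java
import java.util.regex.Pattern;

// Illustrative sketch only; this is not the Samza StreamSpec implementation.
final class StreamIdRegexSketch {
    // Expression quoted in the IllegalArgumentException message.
    static final Pattern CURRENT = Pattern.compile("[A-Za-z0-9_-]+");

    // Hypothetical relaxed expression that additionally allows '.'.
    // Inside a character class '.' is a literal dot, and a trailing '-' is a literal hyphen.
    static final Pattern RELAXED = Pattern.compile("[A-Za-z0-9_.-]+");

    // Returns true only if the whole identifier matches the pattern.
    static boolean isValid(Pattern pattern, String id) {
        return pattern.matcher(id).matches();
    }

    public static void main(String[] args) {
        // "env" and "AppName" stand in for the redacted values in the log.
        String id = "__samza_coordinator_env.AppName_1";
        System.out.println(isValid(CURRENT, id)); // false: '.' is rejected
        System.out.println(isValid(RELAXED, id)); // true: '.' is accepted
    }
}
```

Identifiers without a '.' pass both expressions, so relaxing the pattern this way would not reject any name that is accepted today.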