kennknowles commented on code in PR #31682:
URL: https://github.com/apache/beam/pull/31682#discussion_r1725286758


##########
sdks/java/io/kafka/src/main/java/org/apache/beam/sdk/io/kafka/KafkaIO.java:
##########
@@ -2684,26 +2687,49 @@ public PCollection<KafkaRecord<K, V>> 
expand(PCollection<KafkaSourceDescriptor>
                             .getSchemaRegistry()
                             .getSchemaCoder(KafkaSourceDescriptor.class),
                         recordCoder));
+
         if (isCommitOffsetEnabled() && !configuredKafkaCommit() && 
!isRedistribute()) {
-          outputWithDescriptor =
-              outputWithDescriptor
-                  .apply(Reshuffle.viaRandomKey())
-                  .setCoder(
-                      KvCoder.of(
-                          input
-                              .getPipeline()
-                              .getSchemaRegistry()
-                              .getSchemaCoder(KafkaSourceDescriptor.class),
-                          recordCoder));
-
-          PCollection<Void> unused = outputWithDescriptor.apply(new 
KafkaCommitOffset<K, V>(this));
-          unused.setCoder(VoidCoder.of());
+          // Add transform for committing offsets to Kafka with consistency 
with beam pipeline data
+          // processing.
+          boolean useLegacyImplementation = false;
+          String requestedVersionString =
+              input
+                  .getPipeline()
+                  .getOptions()
+                  .as(StreamingOptions.class)
+                  .getUpdateCompatibilityVersion();
+          if (requestedVersionString != null) {
+            List<String> requestedVersion = 
Arrays.asList(requestedVersionString.split("\\."));
+            List<String> targetVersion = Arrays.asList("2", "59", "0");
+
+            if (Comparators.lexicographical(Comparator.<String>naturalOrder())
+                    .compare(requestedVersion, targetVersion)
+                < 0) {
+              useLegacyImplementation = true;

Review Comment:
   Readability: I prefer the approach of a hard fork, where the first thing you 
do with expand is to take a different path depending on the version. This keeps 
the cyclomatic complexity of the main logic to a minimum. It is also more 
precise than "legacy" which has some connotations but isn't as good as 
`expand_2_59_0`. I can see how here you are only changing a tiny bit of logic 
based on this, but I would still prefer distinct methods called where each 
method is a straight-line implementation for a particular version range.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@beam.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to