FrankChen021 commented on code in PR #19514:
URL: https://github.com/apache/druid/pull/19514#discussion_r3298113265


##########
processing/src/main/java/org/apache/druid/data/input/impl/RegexInputFormat.java:
##########
@@ -56,7 +60,7 @@ public RegexInputFormat(
     this.pattern = pattern;
     this.listDelimiter = listDelimiter;
     this.columns = columns;
-    this.compiledPatternSupplier = Suppliers.memoize(() -> 
Pattern.compile(pattern));
+    this.compiledPatternSupplier = Suppliers.memoize(() -> 
RegexPatternFactory.compile(regexConfig.getEngine(), pattern));

Review Comment:
   [P2] Regex engine config is not wired into all ingestion runtimes
   
   `RegexInputFormat` now depends on injected `RegexConfig`, but the module 
that binds the configured engine is not installed in every ingestion runtime 
that deserializes this type, such as standalone indexer and sampler/overlord 
paths. In those paths Guice can fall back to the default Java engine, so 
`druid.regex.engine=RE2J` is silently ignored and the ReDoS mitigation does not 
cover supported ingestion flows. Please install the regex engine module 
wherever input formats are materialized, or otherwise ensure this constructor 
receives the configured engine in those runtimes.



##########
docs/ingestion/data-formats.md:
##########
@@ -897,7 +965,8 @@ This query returns:
 |---------------------|-----------------|
 | `1680795276351`     | `partition-1`   |
 
-## FlattenSpec
+## Flat

Review Comment:
   [P3] Restore the FlattenSpec heading
   
   This heading was split into `## Flat` and `tenSpec`, so the rendered docs no 
longer expose the `FlattenSpec` section title or the existing `#flattenspec` 
anchor. Please restore the single `## FlattenSpec` heading.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to