andygrove opened a new issue, #4561: URL: https://github.com/apache/datafusion-comet/issues/4561
## Background `split_part(str, delimiter, partNum)` currently falls back to Spark. It is `RuntimeReplaceable` and lowers to `element_at(StringSplitSQL(str, delimiter), partNum)`. Comet already supports `element_at`, but it does not support the inner `StringSplitSQL` expression, so the whole function falls back with `stringsplitsql is not supported`. `StringSplitSQL` differs from `StringSplit` (the `split` function) in that it splits on a literal string rather than a regex. ## Proposal Add a serde for `StringSplitSQL` (a native string split on a literal delimiter). This would enable `split_part` to run natively, since `element_at` over the resulting array is already supported. ## Acceptance criteria - `StringSplitSQL` executes natively and matches Spark. - `split_part` no longer falls back; add SQL file test coverage. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
