andygrove opened a new issue, #4561:
URL: https://github.com/apache/datafusion-comet/issues/4561

   ## Background
   
   `split_part(str, delimiter, partNum)` currently falls back to Spark. It is 
`RuntimeReplaceable` and lowers to `element_at(StringSplitSQL(str, delimiter), 
partNum)`. Comet already supports `element_at`, but it does not support the 
inner `StringSplitSQL` expression, so the whole function falls back with 
`stringsplitsql is not supported`.
   
   `StringSplitSQL` differs from `StringSplit` (the `split` function) in that 
it splits on a literal string rather than a regex.
   
   ## Proposal
   
   Add a serde for `StringSplitSQL` (a native string split on a literal 
delimiter). This would enable `split_part` to run natively, since `element_at` 
over the resulting array is already supported.
   
   ## Acceptance criteria
   
   - `StringSplitSQL` executes natively and matches Spark.
   - `split_part` no longer falls back; add SQL file test coverage.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to