XiangpengHao opened a new issue, #11677: URL: https://github.com/apache/datafusion/issues/11677
### Is your feature request related to a problem or challenge?

Part of #10918

Initial `StringView` support landed in #11667, which covers some of the most performance-critical workloads, such as loading from parquet, reasonable aggregation, and faster filtering. However, many operators are not yet supported, and using them causes DataFusion to cast `StringViewArray` to `StringArray`, which is often unnecessary and slow.

### Describe the solution you'd like

We should gradually implement native `StringView` support for string operators, such as `length`, `reverse`, etc.

Here's a list of items to help implement the support:

- [ ] Add `Utf8View` to the function signature, e.g., https://github.com/apache/datafusion/blob/main/datafusion/functions/src/unicode/character_length.rs#L46-L50
- [ ] Handle `Utf8View` in the invocation. The function likely already needs to handle `Utf8` and `LargeUtf8`, so we should just add a new case to the type match.
- [ ] Write tests for the new `Utf8View`. As we now have three string types, the test logic often needs to be restructured to reduce duplicated code.
- [ ] Make sure the test cases cover both long (> 12 bytes) and short strings, since strings of 12 bytes or fewer are stored inline in the view while longer ones live in a separate data buffer.

### Describe alternatives you've considered

_No response_

### Additional context

_No response_
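As a rough illustration of the checklist above, here is a minimal std-only sketch of the "add a new case to the type match" pattern. It does not use the real Arrow or DataFusion APIs: `StringColumn`, `char_len`, and this `character_length` are hypothetical stand-ins, and the actual change would match on the Arrow `DataType` of the argument and downcast to the corresponding array type.

```rust
// Hypothetical stand-in for the three Arrow string array types.
// Real code would match on `DataType::Utf8`, `DataType::LargeUtf8`,
// and `DataType::Utf8View` instead of this enum.
#[derive(Debug, Clone)]
pub enum StringColumn {
    Utf8(Vec<Option<String>>),
    LargeUtf8(Vec<Option<String>>),
    Utf8View(Vec<Option<String>>),
}

// Shared kernel: counts Unicode scalar values, passing nulls through.
// Keeping it generic over an iterator means each string type only
// needs one extra match arm, not a new copy of the logic.
fn char_len<'a>(values: impl Iterator<Item = Option<&'a str>>) -> Vec<Option<usize>> {
    values.map(|v| v.map(|s| s.chars().count())).collect()
}

// Dispatch on the string type. The `Utf8View` arm is the new case;
// without it the input would first have to be cast to `Utf8`.
pub fn character_length(col: &StringColumn) -> Vec<Option<usize>> {
    match col {
        StringColumn::Utf8(v) => char_len(v.iter().map(|s| s.as_deref())),
        StringColumn::LargeUtf8(v) => char_len(v.iter().map(|s| s.as_deref())),
        StringColumn::Utf8View(v) => char_len(v.iter().map(|s| s.as_deref())),
    }
}
```

A test in this style can build one input vector containing a short string, a string longer than 12 bytes, a null, and a non-ASCII string, then run it through all three variants and compare the results, which keeps the per-type test code from being duplicated.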