XiangpengHao opened a new issue, #11677:
URL: https://github.com/apache/datafusion/issues/11677

   ### Is your feature request related to a problem or challenge?
   
   Part of #10918
   
   Initial `StringView` support was added in #11667, which covers some of the 
most performance-critical workloads, such as loading from Parquet, reasonable 
aggregation, and faster filtering. However, many operators are not yet 
supported, and using them causes DataFusion to cast `StringViewArray` to 
`StringArray`, which is often unnecessary and slow.
   
   ### Describe the solution you'd like
   
   We should gradually implement native `StringView` support for string 
operators, such as `length`, `reverse`, etc.
   
   Here's a list of items to help implement the support:
   
   - [ ] Add `Utf8View` to the function signature, e.g.,  
https://github.com/apache/datafusion/blob/main/datafusion/functions/src/unicode/character_length.rs#L46-L50
   
   - [ ] Handle `Utf8View` in the invocation. The function likely already needs 
to handle `Utf8` and `LargeUtf8`, so we should just add a new case to the type 
match.
   - [ ] Write tests for the new `Utf8View`. As we now have three string types, 
the test logic often needs to be restructured to reduce duplicated code.
   - [ ] Make sure the test cases cover long (> 12 byte) and short strings. 
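
The checklist above can be sketched roughly as follows. This is a minimal, standard-library-only illustration, not DataFusion code: `StringColumn`, `count_chars`, `character_length`, and `check_all_types` are hypothetical stand-ins for the Arrow array types and the real function implementation. It only shows the shape of "add one more match arm" and "run the same test cases against all three string types".

```rust
/// Hypothetical stand-in for the three Arrow string array types
/// (`StringArray`, `LargeStringArray`, `StringViewArray`).
#[derive(Debug)]
enum StringColumn {
    Utf8(Vec<Option<String>>),
    LargeUtf8(Vec<Option<String>>),
    Utf8View(Vec<Option<String>>),
}

/// Shared kernel: number of characters per value, nulls stay null.
fn count_chars(values: &[Option<String>]) -> Vec<Option<usize>> {
    values
        .iter()
        .map(|v| v.as_ref().map(|s| s.chars().count()))
        .collect()
}

/// Toy `character_length` with one match arm per string type.
fn character_length(col: &StringColumn) -> Vec<Option<usize>> {
    match col {
        StringColumn::Utf8(v) => count_chars(v),
        StringColumn::LargeUtf8(v) => count_chars(v),
        // The new case: handled directly instead of casting to `Utf8` first.
        StringColumn::Utf8View(v) => count_chars(v),
    }
}

/// Test helper (hypothetical): runs the same cases against all three
/// string types so the test body is written only once.
fn check_all_types(input: &[Option<&str>], expected: &[Option<usize>]) {
    let owned: Vec<Option<String>> =
        input.iter().map(|v| v.map(str::to_string)).collect();
    let columns = [
        StringColumn::Utf8(owned.clone()),
        StringColumn::LargeUtf8(owned.clone()),
        StringColumn::Utf8View(owned),
    ];
    for col in &columns {
        assert_eq!(
            character_length(col).as_slice(),
            expected,
            "failed for {:?}",
            col
        );
    }
}
```

A call such as `check_all_types(&[Some("abc"), Some("0123456789abc"), None], &[Some(3), Some(13), None])` covers a short string, a 13-byte string, and a null. In this stand-in the length makes no difference, but in a real `StringViewArray` values of up to 12 bytes are stored inline in the view while longer ones live in a separate data buffer, so both paths need test coverage.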
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

