XiangpengHao commented on code in PR #11662: URL: https://github.com/apache/datafusion/pull/11662#discussion_r1693060380
########## datafusion/functions/src/utils.rs: ########## @@ -41,8 +41,8 @@ macro_rules! get_optimal_return_type { DataType::LargeUtf8 | DataType::LargeBinary => $largeUtf8Type, // Binary inputs are automatically coerced to Utf8 DataType::Utf8 | DataType::Binary => $utf8Type, - // Utf8View inputs will yield Utf8View outputs - DataType::Utf8View => DataType::Utf8View, + // Utf8View max offset size is u32::MAX, the same as UTF8 Review Comment: Used in string related `ScalarUDF`s, for example: https://github.com/XiangpengHao/datafusion/blob/string-view2-local/datafusion/functions/src/unicode/character_length.rs#L70 I added a small unit test to the function. Note that there are a bunch of string related functions that only accepts `Utf8` and `LargeUtf8`, we currently rely on coerce rules to cast them, which won't panic but may be slower than it should be. I think we should add native support to `Utf8View`, I'm working on it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org