alamb opened a new issue, #12415:
URL: https://github.com/apache/datafusion/issues/12415

   ### Is your feature request related to a problem or challenge?
   
   @2010YOUY01  found a bug where `substr` behaved differently for StringView 
than for String: https://github.com/apache/datafusion/pull/12383
   
   As we continue to implement special versions of the string functions for 
different representations this drift may get worse
   
   ### Describe the solution you'd like
   
   I would like a systematic and SQL level set of tests that ensure the string 
functions compute the same answer for all different combinations. 
   
   ### Describe alternatives you've considered
   
   I think we could do this in sqllogictest by separating the table definitions 
into different .slt files and then includeing the same queries file. Here is an 
example:
   
   
https://github.com/apache/datafusion/blob/02eab80cd62e02fcb68dee8b99d63aaac680a66c/datafusion/sqllogictest/test_files/join_disable_repartition_joins.slt#L26-L25
   
   So this would look something like 
   
   ## in `strings_stringview.slt`
   ```sql
   statement ok
   -- Populate the "t" table table with the "standard" values in StringView form
   CREATE TABLE temp AS VALUES
    ('one', 'two'),
     ('three, 'four)
   
   statement ok
   CREATE TABLE t as 
   SELECT 
     arrow_cast(column1, 'Utf8View') as column1,
     arrow_cast(column2, 'Utf8View') as column2,
   FROM temp
   
   -- run queries from string_queries.slt
   include string_queries.slt
   ```
   
   ## in `strings_queries.slt`
   ```sql
   -- Note this file is intended to be run with a table 't' already defined 
   -- with standard data with the appropriate data type (StringView, String, 
etc)
   
   -- Test concat
   query
   SELECT concat(column1, column2) from t;
   
   ...
   ```
   
   
   
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to