Omega359 commented on issue #13470:
URL: https://github.com/apache/datafusion/issues/13470#issuecomment-2537407016

   Copied from 
[Discord](https://discord.com/channels/885562378132000778/1314317688150949978/1316455175376081028):
   
   I've finally gotten to the point of having most of the sqlite slt test files 
running in df sqllogictest suite. After attempting to run and fix things 
manually  (for days ... I'm an idiot sometimes) I smartened up and hacked up a 
version of the sqllogictest crate to do most of the work for me. The result is 
that everything runs however a LOT of queries are skipped for DF for various 
reasons.
   
   I'm going to push all of this to a df branch in my github account. I would 
really really like some assistance with evaluating the files and submitting 
tickets for any issues that were uncovered.
   
   There are apparently almost 5.7 million queries of which 78,444 were marked 
to be skipped by Datafusion because of errors or differences in results when 
compared to sqlite or postgres. If DF results were the same as postgres however 
(even attempting equality of floats) the results in the slt were updated. This 
is mostly because sqlite is rather simple and just doesn't seem to properly 
handle floats, etc correctly.
   
   Because of the format differences between the slt files in DF vs sqlite a 
custom version of sqllogictest dependency is required. I'm hoping to push a PR 
to that crate's repo at some point to cover the additional change required to 
run this. run command is cargo test --test sqllogictests -- --include-sqlite
   
   changes are @ 
https://github.com/Omega359/arrow-datafusion/tree/feature/sqllogictest_add_sqlite
   
   Also I've been looking at duckdb's test - found a conway's game of life impl 
in sql. It doesn't even parse in df 😦 
   
   https://gist.github.com/janickr/58fab629ee3ea7e5638a
   
   The largest issue I had with porting the tests was that DF is just missing 
some sql support compared to sqlite. No delete. No create index. No duplicate 
column names. No scalar subqueries. Type conversion doesn't handle null all 
that well. And it seems that CTE expressions have a lot of limitations after 
playing with that for the last hour.
   
   [Here is a 
gist](https://gist.github.com/Omega359/e5d6557078f6e6ee1c077235f3607922) of the 
most common errors. Note that the delete errors will not show up for anyone who 
looks at my branch linked above - I just removed that portion of tests rather 
than try and deal with each file given DF's lack of support


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to