alamb commented on code in PR #8857:
URL: https://github.com/apache/arrow-datafusion/pull/8857#discussion_r1452321151

##########
datafusion/sqllogictest/test_files/order.slt:
##########
@@ -578,3 +578,209 @@ SortPreservingMergeExec: [log_c12_base_c11@0 DESC]
 statement ok
 drop table aggregate_test_100;
+
+
+# Sort with lots of repetition values
+# Test sorting a parquet file with 2 million records that has lots of values that are repeated
+statement ok
+CREATE EXTERNAL TABLE repeat_much STORED AS PARQUET LOCATION '../../parquet-testing/data/repeat_much.snappy.parquet';

Review Comment:
   I think you could use a path like this and leave the file at its current location:

   ```suggestion
   CREATE EXTERNAL TABLE repeat_much STORED AS PARQUET LOCATION '../../datafusion/core/tests/data/repeat_much.snappy.parquet';
   ```

   If it is only used in sqllogictests, maybe you could move the file to

   ```
   datafusion/sqllogictest/data/repeat_much.snappy.parquet
   ```

   instead
