0ax1 opened a new pull request, #16230:
URL: https://github.com/apache/datafusion/pull/16230

   For queries 2, 3, 10, 18 and 21 the TPC-H spec defines a row count limit.
   
   ```
   2.1.2.9 Queries 2, 3, 10, 18 and 21 require that a given number of rows are
   to be returned (e.g., “Return the first 10 selected rows”). If N is the
   number of rows to be returned, the query must return exactly the first N
   rows unless fewer than N rows qualify, in which case all rows must be
   returned. There are three permissible ways of satisfying this requirement.
   A test sponsor must select any one of them and use it consistently for all
   the queries that require that a specified number of rows be returned.
   ```
   
   https://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v2.17.1.pdf
   
   ## Which issue does this PR close?
   
   
   - Closes #16229.
   
   ## Rationale for this change
   
   Returned row counts should match the TPC-H spec.
   
   ## What changes are included in this PR?
   
   A `limit` clause was added to TPC-H queries 2, 3, 10, 18, and 21.
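
To illustrate the behavior the spec asks for, here is a minimal sketch of how a `limit` clause returns exactly the first N rows, or all rows when fewer than N qualify. It is not taken from this PR: the `toy_orders` table, its columns, and the helper names are hypothetical, and it runs against Python's built-in `sqlite3` rather than DataFusion, purely so that it is self-contained.

```python
import sqlite3


def make_toy_table(row_count):
    """Build an in-memory table with `row_count` rows (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE toy_orders (id INTEGER PRIMARY KEY, revenue REAL)")
    conn.executemany(
        "INSERT INTO toy_orders (id, revenue) VALUES (?, ?)",
        [(i, float(i * 10)) for i in range(1, row_count + 1)],
    )
    return conn


def first_n_rows(conn, n):
    """Return at most `n` rows in a deterministic order."""
    # ORDER BY makes "the first N rows" well defined; LIMIT caps the result at N.
    query = "SELECT id, revenue FROM toy_orders ORDER BY revenue DESC, id LIMIT ?"
    return conn.execute(query, (n,)).fetchall()


# 25 qualifying rows with N = 10: exactly 10 rows come back.
many = first_n_rows(make_toy_table(25), 10)
# Only 4 qualifying rows with N = 10: all 4 rows come back.
few = first_n_rows(make_toy_table(4), 10)
print(len(many), len(few))
```

The same pattern, an `ORDER BY` followed by a `limit`, is what lets a benchmark query's returned row count be compared against the count the spec requires.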
   
   ## Are these changes tested?
   
   I re-ran all of the affected queries locally to double-check their row counts.
   
   ## Are there any user-facing changes?
   
   Users will now see the correct row counts when running TPC-H benchmarks.


