EdisonWang created SPARK-27789:
----------------------------------

             Summary: Use stopEarly in codegen of ColumnarBatchScan
                 Key: SPARK-27789
                 URL: https://issues.apache.org/jira/browse/SPARK-27789
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.4.3, 2.4.2, 2.4.1, 2.4.0
            Reporter: EdisonWang


Suppose that we have a hive table like this
```sql("create table parquet_test (id int) using parquet")```, and our query 
sql is `select id from parquet_test limit 10`.
With `spark.sql.hive.convertMetastoreParquet`  set to false, the sql execution 
will go into `InputAdapter`, in its codegen, it can use `stopEarly` to 
accelerate local limit.
But if we set  `spark.sql.hive.convertMetastoreParquet`  to true, the sql 
exectuion will go into `ColumnarBatchScan`, which didn't optimize local limit.

In this patch, We use `stopEarly` in `ColumnarBatchScan` as well



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to