[ 
https://issues.apache.org/jira/browse/DRILL-8528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18004646#comment-18004646
 ] 

ASF GitHub Bot commented on DRILL-8528:
---------------------------------------

shfshihuafeng opened a new pull request, #3000:
URL: https://github.com/apache/drill/pull/3000

   # [DRILL-8528](https://issues.apache.org/jira/browse/DRILL-8528):
   
   HBase Limit Push Down
   
   ## Description
   
   support  Limit Push Down for HBase.i test 
   
   select * from clicks limit 3
   The log shows that The storage layer hbase (HBaseRecordReader)only got 3 
rows of data
   `2025-07-11 01:48:17,297 [178f302e-26db-37dd-500d-db2797b17a5c:frag:0:0] 
INFO  o.a.d.e.s.hbase.HBaseRecordReader - Took 7 ms to get 3 records`
   
   plan is as follow
   
   ```
   00-00    Screen : rowType = RecordType(ANY row_key, (VARCHAR(65535), ANY) 
MAP clickinfo, (VARCHAR(65535), ANY) MAP iteminfo): rowcount = 3.0, cumulative 
cost = {18.3 rows, 51.3 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 301
   00-01      Project(row_key=[$0], clickinfo=[$1], iteminfo=[$2]) : rowType = 
RecordType(ANY row_key, (VARCHAR(65535), ANY) MAP clickinfo, (VARCHAR(65535), 
ANY) MAP iteminfo): rowcount = 3.0, cumulative cost = {18.0 rows, 51.0 cpu, 0.0 
io, 0.0 network, 0.0 memory}, id = 300
   00-02        SelectionVectorRemover : rowType = RecordType(ANY row_key, 
(VARCHAR(65535), ANY) MAP clickinfo, (VARCHAR(65535), ANY) MAP iteminfo): 
rowcount = 3.0, cumulative cost = {15.0 rows, 42.0 cpu, 0.0 io, 0.0 network, 
0.0 memory}, id = 297
   00-03          Limit(fetch=[3]) : rowType = RecordType(ANY row_key, 
(VARCHAR(65535), ANY) MAP clickinfo, (VARCHAR(65535), ANY) MAP iteminfo): 
rowcount = 3.0, cumulative cost = {12.0 rows, 39.0 cpu, 0.0 io, 0.0 network, 
0.0 memory}, id = 296
   00-04            Scan(table=[[hbase, clicks]], groupscan=[HBaseGroupScan 
[HBaseScanSpec=HBaseScanSpec [tableName="clicks"], columns=[`row_key`, 
`clickinfo`, `iteminfo`], maxRecords=3]]) : rowType = RecordType(ANY row_key, 
(VARCHAR(65535), ANY) MAP clickinfo, (VARCHAR(65535), ANY) MAP iteminfo): 
rowcount = 9.0, cumulative cost = {9.0 rows, 27.0 cpu, 0.0 io, 0.0 network, 0.0 
memory}, id = 295
   ```
   
   
   ## Documentation
   (Please describe user-visible changes similar to what should appear in the 
Drill documentation.)
   
   ## Testing
   add test:  org.apache.drill.hbase.TestHBaseFilterPushDown#testLimitPushDown
   




> Hbase Limit PushDown
> --------------------
>
>                 Key: DRILL-8528
>                 URL: https://issues.apache.org/jira/browse/DRILL-8528
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - HBase
>    Affects Versions: 1.23.0
>            Reporter: shihuafeng
>            Priority: Major
>             Fix For: 1.23.0
>
>
> Hbase limit pushdown



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to