[ 
https://issues.apache.org/jira/browse/HUDI-9353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17949504#comment-17949504
 ] 

Davis Zhang commented on HUDI-9353:
-----------------------------------

next step:
 # Need to showcase when batch look up on 1 index file group will happen
 # in that case, when we use the old hfile reader streaming mode it out 
performs the point look up mode

> New hfile reader should support "stream" mode
> ---------------------------------------------
>
>                 Key: HUDI-9353
>                 URL: https://issues.apache.org/jira/browse/HUDI-9353
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: Davis Zhang
>            Priority: Major
>
> The HFile reader in HBase has two primary modes: # *Pread mode* - This is the 
> first mode that starts with "p"
>  # *Streaming mode* - This is the second mode that prefixes with "stream"
>  
> {*}Pread mode{*}: (point look up, already supported) * Uses random access 
> reads (positioned reads)
>  * Good for seeking to specific positions in the file
>  * Optimized for random access patterns
>  * Typically used for point lookups or when reading non-contiguous parts of a 
> file
>  * More efficient when you need to jump around in the file
> {*}Streaming mode{*}: * Uses sequential reads
>  * Optimized for reading large contiguous sections of data
>  * More efficient when reading entire blocks or scanning through data 
> sequentially
>  * Better performance for scan operations or when reading a file from 
> beginning to end
>  * Reduces the number of I/O operations
>  
> The streaming mode we should add similar support in hudi hfile reader.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to