[ 
https://issues.apache.org/jira/browse/HBASE-17917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15994633#comment-15994633
 ] 

Duo Zhang commented on HBASE-17917:
-----------------------------------

Test command (run once for each --scanReadType value: pread, stream, default):
{noformat}
./bin/hbase pe --rows=1000000 --cacheBlocks=false --caching=30 \
  --scanReadType=pread/stream/default --nomapred scan 10
{noformat}

|| ReadType || %us || RS %CPU || DN %CPU ||
| pread | 16 | 400 | 160 |
| stream | 13 | 390 | 30 |
| default | 13 | 390 | 30 |

So for reading the data of a whole region, streaming read is better (since we 
now open a separate reader for streaming reads). The main difference is on the 
DN side: streaming read saves a lot of seeks, so the DN %CPU is much lower 
(30 vs. 160 with pread), while the RS %CPU is about the same (390 vs. 400).
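
To make the pread vs. stream distinction concrete, here is a minimal sketch on a local file. It is not how HBase or HDFS implement these read types; it only contrasts a positional read, where every call carries its own offset, with one sequential stream read front to back. The helper names (read_positional, read_streaming, make_sample_file) and the 4096-byte block size are assumptions made for illustration, and os.pread is only available on Unix-like systems.

```python
import os
import tempfile

BLOCK_SIZE = 4096  # hypothetical block size, for illustration only


def read_positional(path, block_size=BLOCK_SIZE):
    """Read the whole file with pread-style calls, one explicit offset per call."""
    fd = os.open(path, os.O_RDONLY)
    try:
        size = os.fstat(fd).st_size
        chunks = []
        offset = 0
        while offset < size:
            # The offset is passed in; the descriptor's file position is not used.
            chunk = os.pread(fd, block_size, offset)
            if not chunk:
                break  # file shrank underneath us; stop instead of looping
            chunks.append(chunk)
            offset += len(chunk)
        return b"".join(chunks)
    finally:
        os.close(fd)


def read_streaming(path, block_size=BLOCK_SIZE):
    """Read the whole file front to back through a single sequential stream."""
    chunks = []
    with open(path, "rb") as stream:
        while True:
            chunk = stream.read(block_size)
            if not chunk:
                break
            chunks.append(chunk)
    return b"".join(chunks)


def make_sample_file(num_blocks=8, block_size=BLOCK_SIZE):
    """Create a temporary file of num_blocks * block_size bytes; return its path."""
    with tempfile.NamedTemporaryFile(delete=False) as f:
        for i in range(num_blocks):
            f.write(bytes([i % 256]) * block_size)
        return f.name
```

Both functions return the same bytes; what differs is the access pattern the storage layer sees, which is where the benchmark above shows the gap for a full-region scan.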

Let me run the random scan test.

> Use pread by default for all user scan
> --------------------------------------
>
>                 Key: HBASE-17917
>                 URL: https://issues.apache.org/jira/browse/HBASE-17917
>             Project: HBase
>          Issue Type: Sub-task
>          Components: scan
>    Affects Versions: 2.0.0
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>             Fix For: 2.0.0
>
>         Attachments: HBASE-17917.patch, HBASE-17917-v1.patch, 
> HBASE-17917-v2.patch, HBASE-17917-v2.patch, HBASE-17917-v3.patch, 
> HBASE-17917-v4.patch, HBASE-17917-v5.patch
>
>
> As said in the parent issue, we need some benchmarks here first.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
