[ 
https://issues.apache.org/jira/browse/HBASE-17917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15994770#comment-15994770
 ] 

Duo Zhang commented on HBASE-17917:
-----------------------------------

The test command is below; --scanReadType was set to pread, stream, or default
for each run:
{noformat}
./bin/hbase pe --rows=100000 --cacheBlocks=false --caching=10 \
  --scanReadType=pread/stream/default --nomapred scanRange10 10
{noformat}
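
For anyone repeating the comparison, a minimal sketch of a wrapper that issues the same command once per read type. HBASE_BIN, RUN and pe_command are hypothetical names introduced here for illustration and are not part of hbase pe. By default the script only prints the command lines; set RUN=1 to actually execute them.

```shell
#!/bin/sh
# Sketch only: HBASE_BIN and RUN are illustrative knobs, not hbase options.
HBASE_BIN="${HBASE_BIN:-./bin/hbase}"

# Print the PerformanceEvaluation command line for one read type ($1).
pe_command() {
  echo "$HBASE_BIN pe --rows=100000 --cacheBlocks=false --caching=10" \
       "--scanReadType=$1 --nomapred scanRange10 10"
}

for read_type in pread stream default; do
  cmd=$(pe_command "$read_type")
  echo "$cmd"
  # Execute only when explicitly requested, so a dry run is always safe.
  if [ "${RUN:-0}" = "1" ]; then
    $cmd
  fi
done
```

Printing first makes it easy to check the exact flags before a long run such as the 1190s streaming case.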

|| ReadType || %us || RS %CPU || DN %CPU ||
| pread | 20 | 450 | 250 |
| stream | 14 | 340 | 130 |
| default | 20 | 450 | 250 |

Streaming read uses less CPU but takes much longer to finish the test (1190s 
vs. 150s). In fact, with the patch here, we create a new DFSInputStream each 
time a new scanner is opened for a streaming read, so almost every request to 
the RS leads to a request to the NN. That does not scale, so the poor 
performance is expected.
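
As a rough worked example of the trade-off, the sketch below multiplies the reported %CPU by the run time to estimate total core-seconds. It assumes that 100 %CPU means one fully busy core and that the sampled figure held for the whole run; neither is stated in the comment, so treat the result as an approximation. cpu_seconds is a hypothetical helper.

```shell
#!/bin/sh
# Assumption: %CPU of 100 equals one core, sustained for the whole run.
# $1 = %CPU, $2 = wall-clock seconds; prints estimated core-seconds.
cpu_seconds() {
  echo $(( $1 * $2 / 100 ))
}

# RS figures from the table: pread 450 %CPU for 150s, stream 340 %CPU for 1190s.
pread_rs=$(cpu_seconds 450 150)
stream_rs=$(cpu_seconds 340 1190)
echo "RS core-seconds: pread=$pread_rs stream=$stream_rs"
# prints "RS core-seconds: pread=675 stream=4046"
```

Under those assumptions the streaming run would consume more total RS CPU despite the lower instantaneous usage, because it runs about eight times longer.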

> Use pread by default for all user scan
> --------------------------------------
>
>                 Key: HBASE-17917
>                 URL: https://issues.apache.org/jira/browse/HBASE-17917
>             Project: HBase
>          Issue Type: Sub-task
>          Components: scan
>    Affects Versions: 2.0.0
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>             Fix For: 2.0.0
>
>         Attachments: HBASE-17917.patch, HBASE-17917-v1.patch, 
> HBASE-17917-v2.patch, HBASE-17917-v2.patch, HBASE-17917-v3.patch, 
> HBASE-17917-v4.patch, HBASE-17917-v5.patch
>
>
> As said in the parent issue, we need some benchmarks here first.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
