[ 
https://issues.apache.org/jira/browse/KYLIN-1636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15261998#comment-15261998
 ] 

liyang commented on KYLIN-1636:
-------------------------------

Results from MemMappedFileScanBenchmark:

10000000 records read without mmap, 5485K rec/sec
10000000 records read with mmap, 5614K rec/sec

At 50 bytes per record, the IO speed is ~250MB/sec, which is far beyond the 
~75MB/sec of a normal magnetic disk. This is because the newly created file was 
still in the OS cache. If we cap disk speed at 75MB/sec, the expected rate is 
5500/(250/75) = *1650K* rec/sec
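
The estimate above can be reproduced with a few lines of arithmetic. This is only a sketch: the function and constant names are made up for illustration, and the 75MB/sec disk figure is the assumption stated in the comment, not a measurement.

```python
# Figures taken from the comment; all names here are illustrative only.
RECORD_BYTES = 50               # stated record size
MEASURED_K_REC_PER_SEC = 5500   # rounded from the measured 5485K / 5614K
CACHED_MB_PER_SEC = 250         # the comment's rounded IO speed from OS cache
DISK_MB_PER_SEC = 75            # assumed throughput of a magnetic disk


def io_mb_per_sec(k_rec_per_sec, record_bytes):
    """Convert a scan rate in K rec/sec to MB/sec (1 MB = 10**6 bytes)."""
    return k_rec_per_sec * 1000 * record_bytes / 1e6


def disk_bound_rate(measured_k_rec_per_sec, cached_mb_per_sec, disk_mb_per_sec):
    """Scale the measured rate down by the ratio of disk to cache throughput."""
    return measured_k_rec_per_sec * disk_mb_per_sec / cached_mb_per_sec


if __name__ == "__main__":
    print(disk_bound_rate(MEASURED_K_REC_PER_SEC, CACHED_MB_PER_SEC, DISK_MB_PER_SEC))
    print(io_mb_per_sec(5485, RECORD_BYTES))
```

Note that 5485K rec/sec times 50 bytes works out to about 274MB/sec, so the ~250MB/sec used here is a rounder figure. Plugging in the unrounded value would give an expected rate closer to 1500K rec/sec.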

> Benchmark scan speed on different storage
> -----------------------------------------
>
>                 Key: KYLIN-1636
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1636
>             Project: Kylin
>          Issue Type: Improvement
>            Reporter: liyang
>
> We want to better understand the scan speed of different storage, namely 
> HBase, Cassandra, Parquet, MemMappedFile etc.
> To be specific, we test reading 10 million records with a single thread. Each 
> record has 5 dimensions of type int4 and 2 measures of type long8, ~50 bytes in 
> total including cuboid ID and other overhead.
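
For illustration, the test setup described above might be sketched as follows. This is not the MemMappedFileScanBenchmark code; it is a minimal Python sketch. The record layout is an assumption: 5 int4 dimensions and 2 long8 measures account for 36 bytes, and the remaining 14 bytes are modelled here as a hypothetical 8-byte cuboid ID plus 6 padding bytes. Absolute numbers from a Python loop will not be comparable to the figures in the comment, and a freshly written file will be served from the OS cache, as noted there.

```python
import mmap
import os
import struct
import tempfile
import time

# Hypothetical 50-byte layout: cuboid ID (long8), 5 dimensions (int4),
# 2 measures (long8), 6 pad bytes standing in for "other overhead".
RECORD = struct.Struct("<q5i2q6x")


def write_records(path, count):
    """Write `count` fixed-size records; the first measure is the row index."""
    with open(path, "wb") as f:
        for i in range(count):
            f.write(RECORD.pack(i, 1, 2, 3, 4, 5, i, 2 * i))


def scan_plain(path):
    """Scan with ordinary buffered reads and sum the first measure."""
    total = 0
    with open(path, "rb") as f:
        while True:
            chunk = f.read(RECORD.size * 4096)
            if not chunk:
                break
            for rec in RECORD.iter_unpack(chunk):
                total += rec[6]
    return total


def scan_mmap(path):
    """Scan through a read-only memory map (the file must be non-empty)."""
    total = 0
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            for offset in range(0, len(mm), RECORD.size):
                total += RECORD.unpack_from(mm, offset)[6]
    return total


def records_per_sec(scan, path, count):
    """Time one single-threaded scan and return records per second."""
    start = time.perf_counter()
    scan(path)
    return count / (time.perf_counter() - start)


def run_benchmark(count=100_000):
    """Write a temporary file, scan it both ways, and report both rates."""
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        write_records(path, count)
        return {
            "plain": records_per_sec(scan_plain, path, count),
            "mmap": records_per_sec(scan_mmap, path, count),
        }
    finally:
        os.remove(path)
```

Calling `run_benchmark()` returns a dict with the two scan rates. To approximate disk-bound behaviour, the file would need to be larger than memory or the OS cache would have to be dropped between writing and scanning.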



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
