[ 
https://issues.apache.org/jira/browse/KYLIN-1636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15261764#comment-15261764
 ] 

liyang edited comment on KYLIN-1636 at 4/28/16 8:40 AM:
--------------------------------------------------------

Some general processing numbers (not storage dependent), measured by GTScannerBenchmark on my Intel i7-4790 3.6 GHz:

10000000 records filtered to 3000000, 10482K rec/sec
10000000 records filtered to 800000, 12391K rec/sec
10000000 records filtered to 400400, 9267K rec/sec
10000000 records aggregated to 100, 4800K rec/sec
10000000 records aggregated to 100, 5022K rec/sec
10000000 records aggregated to 10000, 2340K rec/sec
10000000 records aggregated to 1000000, 1183K rec/sec
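
For readers who want to see what a "filtered to" or "aggregated to" measurement of this kind might look like, here is a minimal Java sketch. It is not the GTScannerBenchmark source: the class `ScanBenchSketch`, its methods, and the synthetic data layout are hypothetical, and it scans plain arrays rather than Kylin's own data structures. It only assumes the record shape from the issue description (5 int dimensions, 2 long measures) and a single thread.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical single-thread scan micro-benchmark; an illustration, not Kylin code. */
final class ScanBenchSketch {
    static final int DIMS = 5;      // five int dimensions per record
    static final int MEASURES = 2;  // two long measures per record

    final int n;
    final int[] dims;       // n * DIMS values, row-major
    final long[] measures;  // n * MEASURES values, row-major

    /** Builds n synthetic records; each dimension cycles through `cardinality` values. */
    ScanBenchSketch(int n, int cardinality) {
        this.n = n;
        this.dims = new int[n * DIMS];
        this.measures = new long[n * MEASURES];
        for (int i = 0; i < n; i++) {
            for (int d = 0; d < DIMS; d++) {
                dims[i * DIMS + d] = (i + d) % cardinality;
            }
            measures[i * MEASURES] = 1L;     // a count-like measure
            measures[i * MEASURES + 1] = i;  // a sum-like measure
        }
    }

    /** Scans every record and counts those whose first dimension is below `limit`. */
    long filter(int limit) {
        long kept = 0;
        for (int i = 0; i < n; i++) {
            if (dims[i * DIMS] < limit) {
                kept++;
            }
        }
        return kept;
    }

    /** Scans every record and sums both measures per distinct first-dimension value. */
    Map<Integer, long[]> aggregate() {
        Map<Integer, long[]> groups = new HashMap<>();
        for (int i = 0; i < n; i++) {
            long[] sums = groups.computeIfAbsent(dims[i * DIMS], k -> new long[MEASURES]);
            for (int m = 0; m < MEASURES; m++) {
                sums[m] += measures[i * MEASURES + m];
            }
        }
        return groups;
    }

    /** Converts a record count and an elapsed time in nanoseconds to records per second. */
    static long recordsPerSecond(long records, long elapsedNanos) {
        return (long) (records * 1e9 / Math.max(1L, elapsedNanos));
    }

    public static void main(String[] args) {
        int n = 1_000_000;  // kept below 10 million so the default heap is enough
        ScanBenchSketch bench = new ScanBenchSketch(n, 100);

        long t0 = System.nanoTime();
        long kept = bench.filter(30);
        long t1 = System.nanoTime();
        int groupCount = bench.aggregate().size();
        long t2 = System.nanoTime();

        System.out.printf("%d records filtered to %d, %dK rec/sec%n",
                n, kept, recordsPerSecond(n, t1 - t0) / 1000);
        System.out.printf("%d records aggregated to %d, %dK rec/sec%n",
                n, groupCount, recordsPerSecond(n, t2 - t1) / 1000);
    }
}
```

A single timed pass like this ignores JIT warm-up and garbage collection pauses, so its output is only indicative. A harness such as JMH would be a better fit for numbers meant to be compared across machines.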





> Benchmark scan speed on different storage
> -----------------------------------------
>
>                 Key: KYLIN-1636
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1636
>             Project: Kylin
>          Issue Type: Improvement
>            Reporter: liyang
>
> We want to better understand the scan speed of different storage backends, namely
> HBase, Cassandra, Parquet, MemMappedFile, etc.
> To be specific, we test reading 10 million records with a single thread. Each
> record has 5 dimensions of type int4 and 2 measures of type long8, about
> 50 bytes in total including the cuboid ID and other overhead.
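
As a quick check of the record size quoted above, the arithmetic can be sketched in Java. The class and method names are hypothetical. The only inputs are the figures from the description, read as 4-byte ints and 8-byte longs, and the overhead is simply whatever remains of the stated ~50 bytes, not a measured value.

```java
/** Hypothetical helper that works through the record-size arithmetic. */
final class RecordSizeSketch {
    static final int DIMENSIONS = 5;          // int4 columns per record
    static final int MEASURES = 2;            // long8 columns per record
    static final int APPROX_RECORD_BYTES = 50; // "~50 bytes" from the description

    /** Bytes taken by the dimension and measure values alone. */
    static int payloadBytes() {
        return DIMENSIONS * Integer.BYTES + MEASURES * Long.BYTES;
    }

    /** Total bytes for a given number of records at a given size per record. */
    static long totalBytes(long records, int bytesPerRecord) {
        return records * bytesPerRecord;
    }

    public static void main(String[] args) {
        int payload = payloadBytes();
        System.out.println("payload bytes per record: " + payload);
        System.out.println("implied cuboid ID and other overhead: ~"
                + (APPROX_RECORD_BYTES - payload) + " bytes");
        System.out.println("approximate bytes for 10 million records: "
                + totalBytes(10_000_000L, APPROX_RECORD_BYTES));
    }
}
```

So the values themselves take 36 bytes, and a full scan of 10 million records reads roughly 500 MB under the stated ~50-byte estimate.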



