[
https://issues.apache.org/jira/browse/KYLIN-1636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15261764#comment-15261764
]
liyang edited comment on KYLIN-1636 at 4/28/16 8:40 AM:
--------------------------------------------------------
Some general processing (not storage dependent) on my Intel i7-4790 3.6 GHz.
By GTScannerBenchmark
10000000 records filtered to 3000000, 10482K rec/sec
10000000 records filtered to 800000, 12391K rec/sec
10000000 records filtered to 400400, 9267K rec/sec
10000000 records aggregated to 100, 4800K rec/sec
10000000 records aggregated to 100, 5022K rec/sec
10000000 records aggregated to 10000, 2340K rec/sec
10000000 records aggregated to 1000000, 1183K rec/sec
was (Author: [email protected]):
Some general processing (not storage dependent) on my Intel i7-4790 3.6 GHz.
10000000 records filtered to 3000000, 10482K rec/sec
10000000 records filtered to 800000, 12391K rec/sec
10000000 records filtered to 400400, 9267K rec/sec
10000000 records aggregated to 100, 4800K rec/sec
10000000 records aggregated to 100, 5022K rec/sec
10000000 records aggregated to 10000, 2340K rec/sec
10000000 records aggregated to 1000000, 1183K rec/sec
> Benchmark scan speed on different storage
> -----------------------------------------
>
> Key: KYLIN-1636
> URL: https://issues.apache.org/jira/browse/KYLIN-1636
> Project: Kylin
> Issue Type: Improvement
> Reporter: liyang
>
> We want to better understand the scan speed of different storage. Namely
> HBase, Cassandra, Parquet, MemMappedFile etc.
> To be specific, we test reading 10 millions records with single thread. Each
> record is 5 dimensions of type int4, and 2 measures of type long8, totally
> ~50 bytes if include cuboid ID and other overhead.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)