[
https://issues.apache.org/jira/browse/HIVE-396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yuntao Jia updated HIVE-396:
----------------------------
Attachment: hive_benchmark_2009-07-21.tar.gz
Updated the benchmark script to make it more automatic. Now it outputs all the
timings to a csv file which looks like:
Timings, grep select, rankings select, uservisits aggregation,
uservisits-rankings join
Trial 1
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1
Trial 2
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1
Trial 3
Hive,126.3,25.0,546.1,447.9,
PIG,240.5,31.0,672.3,658.3,
Hadoop,135.4,21.6,394.9,486.1
The first line shows the queries, followed by query timings from different
trials. Within each trial, there are three lines showing the query timings on
Hive, PIG and Hadoop, respectively. The numbers here are for illustration
purpose only.
The file can be directly opened in excel. User can then easily generate a
performance graph on top of it
> Hive performance benchmarks
> ---------------------------
>
> Key: HIVE-396
> URL: https://issues.apache.org/jira/browse/HIVE-396
> Project: Hadoop Hive
> Issue Type: New Feature
> Reporter: Zheng Shao
> Assignee: Yuntao Jia
> Attachments: hive_benchmark_2009-06-18.pdf,
> hive_benchmark_2009-06-18.tar.gz, hive_benchmark_2009-07-12.pdf,
> hive_benchmark_2009-07-21.tar.gz
>
>
> We need some performance benchmark to measure and track the performance
> improvements of Hive.
> Some references:
> PIG performance benchmarks PIG-200
> PigMix: http://wiki.apache.org/pig/PigMix
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.