[
https://issues.apache.org/jira/browse/PIG-4435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371319#comment-14371319
]
Rohini Palaniswamy commented on PIG-4435:
-----------------------------------------
[~ahamadh],
Elaborating more on my previous comment, this project involves
1) Building a test framework that has a pluggable datagen framework
(different plugins to generate data for TPC-DI and TPC-DS) and runs the TPC-DI
pig scripts on a cluster and collects benchmark numbers
For this we would like to see a design for the test framework in the
proposal - You can refer to existing Pig e2e test framework. It was written
years ago, but the concept still holds but can be simplified and improved with
newer technologies like YAML for configuration.
2) Translating TPC-DI queries to Pig Latin. They will have to be then added
to the test framework. Translation of TPC-DI queries to Pig Latin is a
independent task from building the test framework, but a test framework is
needed for repeated automated execution of the queries in a Jenkins build.
For this, would like to see a couple of translated TPC-DI queries as
examples in the proposal.
> TPC-DI queries for Pig
> ----------------------
>
> Key: PIG-4435
> URL: https://issues.apache.org/jira/browse/PIG-4435
> Project: Pig
> Issue Type: Improvement
> Reporter: Daniel Dai
> Labels: gsoc2015
>
> Migrate TPC-DI queries to Pig so we can compare performance with other tool.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)