[ https://issues.apache.org/jira/browse/PIG-4435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371319#comment-14371319 ]
Rohini Palaniswamy commented on PIG-4435: ----------------------------------------- [~ahamadh], Elaborating more on my previous comment, this project involves 1) Building a test framework that has a pluggable datagen framework (different plugins to generate data for TPC-DI and TPC-DS) and runs the TPC-DI pig scripts on a cluster and collects benchmark numbers For this we would like to see a design for the test framework in the proposal - You can refer to existing Pig e2e test framework. It was written years ago, but the concept still holds but can be simplified and improved with newer technologies like YAML for configuration. 2) Translating TPC-DI queries to Pig Latin. They will have to be then added to the test framework. Translation of TPC-DI queries to Pig Latin is a independent task from building the test framework, but a test framework is needed for repeated automated execution of the queries in a Jenkins build. For this, would like to see a couple of translated TPC-DI queries as examples in the proposal. > TPC-DI queries for Pig > ---------------------- > > Key: PIG-4435 > URL: https://issues.apache.org/jira/browse/PIG-4435 > Project: Pig > Issue Type: Improvement > Reporter: Daniel Dai > Labels: gsoc2015 > > Migrate TPC-DI queries to Pig so we can compare performance with other tool. -- This message was sent by Atlassian JIRA (v6.3.4#6332)