[jira] Subscription: PIG patch available

2015-08-05 Thread jira
Issue Subscription Filter: PIG patch available (31 issues) Subscriber: pigdaily Key Summary PIG-4649[Pig on Tez] Union followed by HCatStorer misses some data https://issues.apache.org/jira/browse/PIG-4649 PIG-4644PORelationToExprProject.clone() is broken

[jira] [Updated] (PIG-4638) Allow TOMAP to accept dynamically sized input

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4638: -- Status: Open (was: Patch Available) Allow TOMAP to accept dynamically sized input

[jira] [Updated] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4405: -- Status: Open (was: Patch Available) Adding 'map[]' support to mock/Storage

[jira] [Commented] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14654920#comment-14654920 ] Niels Basjes commented on PIG-4405: --- I agree, we should go for the consistency. I was

[jira] [Updated] (PIG-4526) Make setting up the build environment easier

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4526: -- Status: Open (was: Patch Available) Ran into the same problem as HADOOP-11936 Make setting up the

[jira] [Updated] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4405: -- Attachment: PIG-4405-20150805-1058.patch Updated patch that uses the form indicated by [~daijy] Adding

[jira] [Updated] (PIG-4638) Allow TOMAP to accept dynamically sized input

2015-08-05 Thread Niels Basjes (JIRA)
Fix For: 0.16.0 Attachments: PIG-4638-20150723.patch, PIG-4638-20150805-1058.patch Currently the TOMAP function only supports a parameter list of values. Triggered by reading http://stackoverflow.com/q/17847970/ {quote}I want to convert a bag of tuples to a map with specific value

[jira] [Updated] (PIG-4638) Allow TOMAP to accept dynamically sized input

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4638: -- Attachment: PIG-4638-20150805-1058.patch Updated patch to follow the changes in PIG-4405. Allow TOMAP

[jira] [Updated] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Niels Basjes (JIRA)
Attachments: PIG-4405-20150723.patch, PIG-4405-20150805-1058.patch The mock/Storage contains convenience methods for creating a bag and a tuple when doing unit tests. Pig has however 3 complex data types ( see http://pig.apache.org/docs/r0.14.0/basic.html#Simple+and+Complex

[jira] [Updated] (PIG-4638) Allow TOMAP to accept dynamically sized input

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4638: -- Attachment: PIG-4638-20150805-1247.patch New patch created with {{--no-prefix}} Allow TOMAP to accept

[jira] [Updated] (PIG-4639) Add better parser for Apache HTTPD access log.

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4639: -- Attachment: PIG-4639-20150805-1247.patch The appropriate files are now downloaded from maven central

[jira] [Updated] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Niels Basjes (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niels Basjes updated PIG-4405: -- Attachment: PIG-4405-20150805-1247.patch New patch created with {{--no-prefix}} Adding 'map[]' support

[jira] [Updated] (PIG-4639) Add better parser for Apache HTTPD access log.

2015-08-05 Thread Niels Basjes (JIRA)
Assignee: Niels Basjes Fix For: 0.16.0 Attachments: PIG-4639-20150723-classnotfound.patch, PIG-4639-20150725.patch, PIG-4639-20150805-1247.patch Currently there are two parsers for Apache HTTPD acces log files in piggybank that only allow parsing the 'combined' and 'common

[jira] [Updated] (PIG-4638) Allow TOMAP to accept dynamically sized input

2015-08-05 Thread Niels Basjes (JIRA)
Basjes Assignee: Niels Basjes Fix For: 0.16.0 Attachments: PIG-4638-20150723.patch, PIG-4638-20150805-1058.patch, PIG-4638-20150805-1247.patch Currently the TOMAP function only supports a parameter list of values. Triggered by reading http://stackoverflow.com/q

[jira] [Updated] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Niels Basjes (JIRA)
Assignee: Niels Basjes Fix For: 0.16.0 Attachments: PIG-4405-20150723.patch, PIG-4405-20150805-1058.patch, PIG-4405-20150805-1247.patch The mock/Storage contains convenience methods for creating a bag and a tuple when doing unit tests. Pig has however 3 complex data types

[jira] [Updated] (PIG-4639) Add better parser for Apache HTTPD access log.

2015-08-05 Thread Niels Basjes (JIRA)
: piggybank Affects Versions: 0.15.0 Reporter: Niels Basjes Assignee: Niels Basjes Fix For: 0.16.0 Attachments: PIG-4639-20150723-classnotfound.patch, PIG-4639-20150725.patch, PIG-4639-20150805-1247.patch Currently there are two parsers

Re: Running piggybank unit tests?

2015-08-05 Thread Niels Basjes
I got it running in Eclipse. ant clean compile-test piggybank jar -Dhadoopversion=23 Then import it into eclipse Then manually remove pigmix from the buildpath and add the piggybank stuff to the build path. Just wondering: Wouldn't having it in Maven (with piggybank being a module/subproject)

[jira] [Commented] (PIG-4639) Add better parser for Apache HTTPD access log.

2015-08-05 Thread Niels Basjes (JIRA)
-classnotfound.patch, PIG-4639-20150725.patch, PIG-4639-20150805-1247.patch Currently there are two parsers for Apache HTTPD acces log files in piggybank that only allow parsing the 'combined' and 'common' logformats. These two also only parse the 'basics'. This is proposed patch to add the existing

[jira] [Updated] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Daniel Dai (JIRA)
Reporter: Niels Basjes Assignee: Niels Basjes Fix For: 0.16.0 Attachments: PIG-4405-20150723.patch, PIG-4405-20150805-1058.patch, PIG-4405-20150805-1247.patch The mock/Storage contains convenience methods for creating a bag and a tuple when doing

[jira] [Updated] (PIG-4638) Allow TOMAP to accept dynamically sized input

2015-08-05 Thread Daniel Dai (JIRA)
Versions: 0.15.0 Reporter: Niels Basjes Assignee: Niels Basjes Fix For: 0.16.0 Attachments: PIG-4638-20150723.patch, PIG-4638-20150805-1058.patch, PIG-4638-20150805-1247.patch Currently the TOMAP function only supports a parameter list of values

[jira] [Created] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4650: --- Summary: ant mvn-deploy target is broken Key: PIG-4650 URL: https://issues.apache.org/jira/browse/PIG-4650 Project: Pig Issue Type: Bug Components: build

[jira] [Updated] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4650: Attachment: PIG-4650-1.patch ant mvn-deploy target is broken ---

[jira] [Updated] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4650: Status: Patch Available (was: Open) ant mvn-deploy target is broken ---

[jira] [Commented] (PIG-4649) [Pig on Tez] Union followed by HCatStorer misses some data

2015-08-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14658752#comment-14658752 ] Rohini Palaniswamy commented on PIG-4649: - bq. We can remove this once HIVE-11456 is

[jira] [Updated] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Ken Wu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Wu updated PIG-4623: Hadoop Flags: Reviewed Fix Version/s: site Affects Version/s: 0.15.0 Status: Patch

[jira] [Updated] (PIG-4649) [Pig on Tez] Union followed by HCatStorer misses some data

2015-08-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohini Palaniswamy updated PIG-4649: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch

[jira] [Commented] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14658843#comment-14658843 ] Daniel Dai commented on PIG-4623: - I also mean to generate a patch and attach the patch file

[jira] [Commented] (PIG-4644) PORelationToExprProject.clone() is broken

2015-08-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14658844#comment-14658844 ] Rohini Palaniswamy commented on PIG-4644: - {code} if (this.isProjectToEnd()) { +

[jira] [Commented] (PIG-4644) PORelationToExprProject.clone() is broken

2015-08-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14658848#comment-14658848 ] Rohini Palaniswamy commented on PIG-4644: - Also you can skip the test in

[jira] [Updated] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Ken Wu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Wu updated PIG-4623: Attachment: TestCSVStorage.java CSVLoader.java Here are the patches Fixed the 'new line' character

[jira] [Updated] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4623: Attachment: PIG-4623-1.patch Attach patch. Next time, please use svn diff, or git diff/git show to generate

[jira] [Updated] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4623: Resolution: Fixed Fix Version/s: (was: site) 0.16.0 Status: Resolved

[jira] [Commented] (PIG-4644) PORelationToExprProject.clone() is broken

2015-08-05 Thread Anthony Hsu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14659289#comment-14659289 ] Anthony Hsu commented on PIG-4644: -- Thanks for the review, Rohini. I had to do it this way

[jira] [Updated] (PIG-4645) Support hadoop-like Counter using spark accumulator

2015-08-05 Thread kexianda (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kexianda updated PIG-4645: -- Attachment: PIG-4645.patch Support hadoop-like Counter using spark accumulator

[jira] [Commented] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Ken Wu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14659338#comment-14659338 ] Ken Wu commented on PIG-4623: - Hi Rohini Palaniswamy, Thanks for your comments. Let me comment

[jira] [Commented] (PIG-4645) Support hadoop-like Counter using spark accumulator

2015-08-05 Thread kexianda (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14659317#comment-14659317 ] kexianda commented on PIG-4645: --- Implement a Hadoop-like Counter, using Spark Accumulator.

[jira] [Updated] (PIG-4645) Support hadoop-like Counter using spark accumulator

2015-08-05 Thread kexianda (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kexianda updated PIG-4645: -- Summary: Support hadoop-like Counter using spark accumulator (was: Collect Input/Output Statistics of Spark Job)

[jira] [Updated] (PIG-4644) PORelationToExprProject.clone() is broken

2015-08-05 Thread Anthony Hsu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated PIG-4644: - Attachment: PIG-4644.2.patch PORelationToExprProject.clone() is broken

[jira] [Commented] (PIG-4644) PORelationToExprProject.clone() is broken

2015-08-05 Thread Anthony Hsu (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14659293#comment-14659293 ] Anthony Hsu commented on PIG-4644: -- I've removed the test, updated my

[jira] [Commented] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14659175#comment-14659175 ] Rohini Palaniswamy commented on PIG-4623: - The patch has a couple of bugs - So

[jira] [Commented] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14659180#comment-14659180 ] Rohini Palaniswamy commented on PIG-4650: - +1 ant mvn-deploy target is broken

[jira] [Updated] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4650: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Reopened] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai reopened PIG-4623: - Thanks Rohini for capturing this. Rollback the patch to address the issue raised. Fixed the 'new line'

[jira] [Commented] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Rohini Palaniswamy (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14659206#comment-14659206 ] Rohini Palaniswamy commented on PIG-4623: - Actually my 3rd comment was wrong. That

[jira] [Reopened] (PIG-4639) Add better parser for Apache HTTPD access log.

2015-08-05 Thread Rohini Palaniswamy (JIRA)
: piggybank Affects Versions: 0.15.0 Reporter: Niels Basjes Assignee: Niels Basjes Fix For: 0.16.0 Attachments: PIG-4639-20150723-classnotfound.patch, PIG-4639-20150725.patch, PIG-4639-20150805-1247.patch Currently there are two parsers for Apache