[jira] [Updated] (PIG-4635) NPE while running pig script in tez mode( pig 0.15 with tez 0.7)

2015-09-25 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4635: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Updated] (PIG-4635) NPE while running pig script in tez mode( pig 0.15 with tez 0.7)

2015-09-24 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4635: Attachment: PIG-4635-2.patch Yes, maintaining another variable is safer, I can do that. I do want

[jira] [Updated] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4683: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Commented] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905011#comment-14905011 ] Daniel Dai commented on PIG-4683: - New patch fixed unit test failures. > Nested order is broken after

[jira] [Updated] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4683: Attachment: PIG-4683-2.patch > Nested order is broken after PIG-3591 in some ca

[jira] [Updated] (PIG-4635) NPE while running pig script in tez mode( pig 0.15 with tez 0.7)

2015-09-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4635: Status: Patch Available (was: Open) > NPE while running pig script in tez mode( pig 0.15 with tez

[jira] [Assigned] (PIG-4635) NPE while running pig script in tez mode( pig 0.15 with tez 0.7)

2015-09-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai reassigned PIG-4635: --- Assignee: Daniel Dai (was: Rohini Palaniswamy) > NPE while running pig script in tez mode( pig 0

[jira] [Updated] (PIG-4635) NPE while running pig script in tez mode( pig 0.15 with tez 0.7)

2015-09-23 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4635: Attachment: PIG-4635-1.patch Finally get a reproduction. The issue is caused by transient variable

[jira] [Updated] (PIG-4674) TOMAP should infer schema

2015-09-22 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4674: Attachment: PIG-4674-fixtest2.patch Another unit test failure patch. > TOMAP should infer sch

[jira] [Updated] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-22 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4679: Attachment: PIG-4679-fixtest.patch Fix a unit test failure. > Performance degradation

[jira] [Commented] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14901641#comment-14901641 ] Daniel Dai commented on PIG-4683: - [~rohini] 1) No, because currently SecondaryKeyOptimizerTez does

[jira] [Updated] (PIG-3635) Fix e2e tests for Hadoop 2.X on Windows

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-3635: Resolution: Not A Problem Status: Resolved (was: Patch Available) Yes, Windows tests has been fixed

[jira] [Updated] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4683: Attachment: PIG-4683-1.patch > Nested order is broken after PIG-3591 in some ca

[jira] [Updated] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4683: Status: Patch Available (was: Open) > Nested order is broken after PIG-3591 in some ca

[jira] [Comment Edited] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14901228#comment-14901228 ] Daniel Dai edited comment on PIG-4683 at 9/21/15 7:23 PM: -- The reason

[jira] [Updated] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4683: Fix Version/s: 0.15.1 > Nested order is broken after PIG-3591 in some ca

[jira] [Created] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4683: --- Summary: Nested order is broken after PIG-3591 in some cases Key: PIG-4683 URL: https://issues.apache.org/jira/browse/PIG-4683 Project: Pig Issue Type: Bug

[jira] [Updated] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4683: Description: The following script fail after > Nested order is broken after PIG-3591 in some ca

[jira] [Updated] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4683: Description: The following script fail after PIG-3591. {code} a = load '1.txt' using PigStorage(',') as (a0

[jira] [Commented] (PIG-4683) Nested order is broken after PIG-3591 in some cases

2015-09-21 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14901228#comment-14901228 ] Daniel Dai commented on PIG-4683: - The reason is PigSecondaryKeyComparator doesn't sort on index

[jira] [Updated] (PIG-4674) TOMAP should infer schema

2015-09-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4674: Attachment: PIG-4674-fixtest.patch Commit PIG-4674-fixtest.patch to fix unit test failure. > TOMAP sho

[jira] [Commented] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14790701#comment-14790701 ] Daniel Dai commented on PIG-4673: - I cannot see the patch, can you attach to the Jira? > Built In

[jira] [Updated] (PIG-4676) Upgrade Hive to 1.2.1

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4676: Attachment: PIG-4676-fixtest.patch TestLoaderStorerShipCacheFiles is broken by the patch. Attach fix

[jira] [Updated] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4679: Attachment: PIG-4679-1.patch > Performance degradation due to InputSizeReducerEstimator since PIG-3

[jira] [Updated] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4679: Status: Patch Available (was: Open) > Performance degradation due to InputSizeReducerEstimator since

[jira] [Updated] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4673: Attachment: PIG-4673-1.patch Move the code to piggybank, adjust the format. Otherwise looks good. Will check

[jira] [Updated] (PIG-4674) TOMAP should infer schema

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4674: Attachment: PIG-4674-3.patch Yes, there is a hole. Apply suggested change. > TOMAP should infer sch

[jira] [Updated] (PIG-4674) TOMAP should infer schema

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4674: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Comment Edited] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791485#comment-14791485 ] Daniel Dai edited comment on PIG-4679 at 9/17/15 2:39 AM: -- Patch committed to trunk

[jira] [Commented] (PIG-4676) Upgrade Hive to 1.2.1

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791481#comment-14791481 ] Daniel Dai commented on PIG-4676: - PIG-4676-fixtest.patch committed. > Upgrade Hive to 1.

[jira] [Updated] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4673: Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: (was: site) 0.16.0

[jira] [Updated] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4679: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Updated] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4679: Attachment: PIG-4679-0.patch We don't estimate size for non-hdfs inputs before 0.12. However, we will use

[jira] [Created] (PIG-4679) Performance degradation due to InputSizeReducerEstimator since PIG-3754

2015-09-15 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4679: --- Summary: Performance degradation due to InputSizeReducerEstimator since PIG-3754 Key: PIG-4679 URL: https://issues.apache.org/jira/browse/PIG-4679 Project: Pig Issue

[jira] [Updated] (PIG-4676) Upgrade Hive to 1.2.1

2015-09-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4676: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Updated] (PIG-4674) TOMAP should infer schema

2015-09-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4674: Attachment: PIG-4674-2.patch Yes, that is missing. Attach second patch. > TOMAP should infer sch

[jira] [Updated] (PIG-1387) Syntactical Sugar for PIG-1385

2015-09-12 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-1387: Assignee: Gianmarco De Francisci Morales (was: Daniel Dai) > Syntactical Sugar for PIG-1

[jira] [Assigned] (PIG-1387) Syntactical Sugar for PIG-1385

2015-09-12 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai reassigned PIG-1387: --- Assignee: Daniel Dai (was: Gianmarco De Francisci Morales) > Syntactical Sugar for PIG-1

[jira] [Created] (PIG-4674) TOMAP should infer schema

2015-09-11 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4674: --- Summary: TOMAP should infer schema Key: PIG-4674 URL: https://issues.apache.org/jira/browse/PIG-4674 Project: Pig Issue Type: Bug Components: impl

[jira] [Updated] (PIG-4674) TOMAP should infer schema

2015-09-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4674: Attachment: PIG-4674-1.patch > TOMAP should infer schema > - > >

[jira] [Updated] (PIG-4629) org.apache.hadoop.hive.ql.exec.FunctionRegistry#getFunctionInfo() throws SemanticException since Hive 1.1.0

2015-09-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4629: Resolution: Duplicate Status: Resolved (was: Patch Available

[jira] [Updated] (PIG-4676) Upgrade Hive to 1.2.1

2015-09-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4676: Status: Patch Available (was: Open) > Upgrade Hive to 1.2.1 > - > >

[jira] [Updated] (PIG-4676) Upgrade Hive to 1.2.1

2015-09-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4676: Attachment: PIG-4676-1.patch > Upgrade Hive to 1.2.1 > - > >

[jira] [Commented] (PIG-4676) Upgrade Hive to 1.2.1

2015-09-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14741885#comment-14741885 ] Daniel Dai commented on PIG-4676: - Note unlike hive-exec.jar, hive-exec-core.jar does not contain shaded

[jira] [Commented] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-10 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14739882#comment-14739882 ] Daniel Dai commented on PIG-4673: - Sounds useful to have it in piggybank. I can review it once you put

[jira] [Commented] (PIG-3294) Allow Pig use Hive UDFs

2015-09-10 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14740159#comment-14740159 ] Daniel Dai commented on PIG-3294: - This should be a Parquet issue. I upload a patch to PARQUET-334. > Al

Re: pig cogroup by null

2015-09-09 Thread Daniel Dai
There is no setting, however you can rewrite your query as the following: table1x = foreach table1 generate (a is null?'':a) as a, (b is null?'':b) as b; table2x = foreach table2 generate (a is null?'':a) as a, (b is null?'':b) as b; k = cogroup table1x by (a,b),table2x by (a,b); Daniel On

Re: Hive UDF's vs. "native" Pig UDF's

2015-09-09 Thread Daniel Dai
There are some moderate overhead for Hive UDF. My test shows around 10%-20% slow down than Pig native UDF. I will create a document Jira. Thanks, Daniel On 9/9/15, 9:27 AM, "Rohini Palaniswamy" wrote: >Daniel, > Not sure you saw this. We will have to document the

[jira] [Created] (PIG-4672) Document performance implication for Hive UDF

2015-09-09 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4672: --- Summary: Document performance implication for Hive UDF Key: PIG-4672 URL: https://issues.apache.org/jira/browse/PIG-4672 Project: Pig Issue Type: Task

[jira] [Commented] (PIG-4627) [Pig on Tez] Self join does not handle null values correctly

2015-09-02 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14728059#comment-14728059 ] Daniel Dai commented on PIG-4627: - +1 for PIG-4627-fix-testfailures.patch. > [Pig on Tez] Self join d

[jira] [Commented] (PIG-4574) Eliminate identity vertex for order by and skewed join right after LOAD

2015-09-02 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14728055#comment-14728055 ] Daniel Dai commented on PIG-4574: - +1 for PIG-4574-fix-testfailures.patch. > Eliminate identity ver

[jira] [Commented] (PIG-3622) Allow casting bytearray fields to bytearray type

2015-09-01 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726481#comment-14726481 ] Daniel Dai commented on PIG-3622: - +1. Thanks for fixing. > Allow casting bytearray fields to bytear

Re: Problem when running our code with tez

2015-08-25 Thread Daniel Dai
JobID is vague is Tez, you shall use dagId instead. However, I don¹t see a way you can get DagId within RecordWriter/OutputCommitter. A possible solution is to use conf.get(³mapreduce.workflow.id²) + conf.get(³mapreduce.workflow.node.name²). Note both are Pig specific configuration and only

[jira] [Commented] (PIG-4656) Improve String serialization and comparator performance in BinInterSedes

2015-08-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700312#comment-14700312 ] Daniel Dai commented on PIG-4656: - Yes, you are right. You skip the length to do

[jira] [Updated] (PIG-2597) Move grunt from javacc to ANTLR

2015-08-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-2597: Assignee: Dilip Ramesh (was: Daniel Dai) Move grunt from javacc to ANTLR

[jira] [Commented] (PIG-2597) Move grunt from javacc to ANTLR

2015-08-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700725#comment-14700725 ] Daniel Dai commented on PIG-2597: - QueryParser is now handling commands with/without semi

[jira] [Commented] (PIG-4656) Improve String serialization and comparator performance in BinInterSedes

2015-08-15 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698434#comment-14698434 ] Daniel Dai commented on PIG-4656: - Chararray serde change looks fine, just curious why

[jira] [Commented] (PIG-4654) Reduce tez memory.reserve-fraction and clear spillables for better memory utilization

2015-08-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14697892#comment-14697892 ] Daniel Dai commented on PIG-4654: - +1 Reduce tez memory.reserve-fraction and clear

[jira] [Commented] (PIG-4657) [Pig on Tez] Optimize GroupBy and Distinct key comparison

2015-08-14 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14697869#comment-14697869 ] Daniel Dai commented on PIG-4657: - Wow, that's much more than I expected. I was hoping

[jira] [Commented] (PIG-4657) [Pig on Tez] Optimize GroupBy and Distinct key comparison

2015-08-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14696149#comment-14696149 ] Daniel Dai commented on PIG-4657: - +1. Do you have any performance numbers with/without

[jira] [Commented] (PIG-4651) Optimize NullablePartitionWritable serialization for skewed join

2015-08-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14692443#comment-14692443 ] Daniel Dai commented on PIG-4651: - +1. Same with PIG-4627, we shall eventually use

[jira] [Commented] (PIG-4651) Optimize NullablePartitionWritable serialization for skewed join

2015-08-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14682441#comment-14682441 ] Daniel Dai commented on PIG-4651: - +1. Note this patch also fix the comparator used

[jira] [Commented] (PIG-4627) [Pig on Tez] Self join does not handle null values correctly

2015-08-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14692337#comment-14692337 ] Daniel Dai commented on PIG-4627: - The best solution is to use PigWritableComparator

[jira] [Updated] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4405: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Updated] (PIG-4638) Allow TOMAP to accept dynamically sized input

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4638: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Created] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4650: --- Summary: ant mvn-deploy target is broken Key: PIG-4650 URL: https://issues.apache.org/jira/browse/PIG-4650 Project: Pig Issue Type: Bug Components: build

[jira] [Updated] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4650: Attachment: PIG-4650-1.patch ant mvn-deploy target is broken

[jira] [Updated] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4650: Status: Patch Available (was: Open) ant mvn-deploy target is broken

[jira] [Commented] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14658843#comment-14658843 ] Daniel Dai commented on PIG-4623: - I also mean to generate a patch and attach the patch file

[jira] [Updated] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4623: Attachment: PIG-4623-1.patch Attach patch. Next time, please use svn diff, or git diff/git show to generate

[jira] [Updated] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4623: Resolution: Fixed Fix Version/s: (was: site) 0.16.0 Status: Resolved

[jira] [Updated] (PIG-4650) ant mvn-deploy target is broken

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4650: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Reopened] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-05 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai reopened PIG-4623: - Thanks Rohini for capturing this. Rollback the patch to address the issue raised. Fixed the 'new line

[jira] [Commented] (PIG-4623) Fixed the 'new line' character inside double-quote causing the csv parsing failure

2015-08-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14654796#comment-14654796 ] Daniel Dai commented on PIG-4623: - Can you generate a patch and attach to the ticket

Re: pig and parquet-bundle*jar

2015-08-04 Thread Daniel Dai
The idea is only include dependencies of most popular UDF/LoadFunc/StoreFunc in lib. If Pig include dependencies of all existing Pig UDF/LoadFunc/StoreFunc, Pig might end up bundling too many jars. Clearly popularity is not a measurable term and it will change over time, and we make such a

[jira] [Commented] (PIG-4612) accumulating upon filters is still accumulating

2015-08-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14654829#comment-14654829 ] Daniel Dai commented on PIG-4612: - You will need to call initial/intermediate, otherwise

[jira] [Commented] (PIG-4629) org.apache.hadoop.hive.ql.exec.FunctionRegistry#getFunctionInfo() throws SemanticException since Hive 1.1.0

2015-08-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14653978#comment-14653978 ] Daniel Dai commented on PIG-4629: - I see several Jiras opened for Hive 1.1.0 issue. Can we

[jira] [Commented] (PIG-4405) Adding 'map[]' support to mock/Storage

2015-08-04 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14653973#comment-14653973 ] Daniel Dai commented on PIG-4405: - Bag has the same issue. Take a sample from existing code

[jira] [Resolved] (PIG-4624) Error on ORC empty file without schema

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4624. - Resolution: Fixed Hadoop Flags: Reviewed Patch committed to trunk. Thanks [~thejas], [~rohini

[jira] [Commented] (PIG-4642) Function call error, when comparing two instances of class Tuple, should use compareTuple(), while in current version the method compare() compares two instances of

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652906#comment-14652906 ] Daniel Dai commented on PIG-4642: - No, it should use mComparator.compare, mComparator

[jira] [Resolved] (PIG-4646) PushUpFilter should not push before nested projection with FILTER operators

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4646. - Resolution: Fixed Assignee: Daniel Dai Fix Version/s: 0.12.0 Thanks for reporting

[jira] [Commented] (PIG-4640) Compiling Pig with JDK8 or JDK7 Update 85 breaks Ruby UDFs

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652773#comment-14652773 ] Daniel Dai commented on PIG-4640: - I definitely prefer a more efficient implementation. We

[jira] [Resolved] (PIG-3622) Allow casting bytearray fields to bytearray type

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-3622. - Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.16.0 The script is completely fine

[jira] [Commented] (PIG-4647) OrcStorage should refer to shaded kryo

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652724#comment-14652724 ] Daniel Dai commented on PIG-4647: - And since Hive shade all dependent jars, Pig shall pull

[jira] [Updated] (PIG-4647) OrcStorage should refer to shaded kryo

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4647: Fix Version/s: 0.16.0 OrcStorage should refer to shaded kryo

[jira] [Commented] (PIG-1769) Consistency for HBaseStorage

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652781#comment-14652781 ] Daniel Dai commented on PIG-1769: - Definitely a new issue. A closed ticket cannot

Re: Running piggybank unit tests?

2015-08-03 Thread Daniel Dai
ant -Dhadoopversion=23 -Dtestcase=TestXMLLoader test I am using eclipse and I can run piggybank UT within eclipse. What issue with IntelliJ? Thanks, Daniel On 7/31/15, 2:49 PM, Niels Basjes ni...@basjes.nl wrote: Hi, How do I correctly run a single unit test (or single class) that resides in

[jira] [Updated] (PIG-4636) Occurred spelled incorrectly in error message for Launcher and POMergeCogroup

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4636: Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 0.16.0 Status: Resolved

[jira] [Commented] (PIG-3294) Allow Pig use Hive UDFs

2015-08-03 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14653108#comment-14653108 ] Daniel Dai commented on PIG-3294: - setFuncInputSchema does invoke setInputSchema. However

Re: Review Request 35491: PIG-4574: Eliminate identity vertex for order by and skewed join right after LOAD

2015-06-24 Thread Daniel Dai
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/35491/#review89294 --- Ship it! Ship It! - Daniel Dai On June 16, 2015, 7:19 a.m

[jira] [Commented] (PIG-3159) TestAvroStorage.testArrayWithSnappyCompression fails on mac with Java 7

2015-06-18 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-3159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14592047#comment-14592047 ] Daniel Dai commented on PIG-3159: - Which version of Pig? Pig trunk is using 1.1.0.1

[jira] [Resolved] (PIG-4602) The test org.apache.pig.builtin.TestOrcStoragePushdown.testPredicatePushdownTimestamp is failing

2015-06-17 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4602. - Resolution: Fixed Yes, Hive-1.2.0 has known issue. The coming 1.2.1 should be fine. The test

Re: test case org.apache.pig.builtin.TestOrcStoragePushdown.java - testPredicatePushdownTimestamp() failed when PredicatePushdownOptimizer enabled for timestamp filter

2015-06-17 Thread Daniel Dai
commons-lang3 is a new dependency in Hive 1.2.0. Will need to add the following line to ivy.xml and recompile: dependency org=org.apache.commons name=commons-lang3 rev=${commons-lang3.version} conf=compile-master / Daniel On 6/17/15, 1:44 AM, Shi Ju SJ Xie xiesh...@cn.ibm.com wrote:

[jira] [Updated] (PIG-4592) Pig 0.15 stopped working with Hadoop 1.x

2015-06-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4592: Status: Patch Available (was: Open) Pig 0.15 stopped working with Hadoop 1.x

[jira] [Updated] (PIG-4592) Pig 0.15 stopped working with Hadoop 1.x

2015-06-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4592: Attachment: PIG-4592-1.patch I see the problem. It is a by-produce of PIG-4499. Attach a patch. Pig 0.15

[jira] [Updated] (PIG-4592) Pig 0.15 stopped working with Hadoop 1.x

2015-06-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4592: Fix Version/s: 0.15.0 Pig 0.15 stopped working with Hadoop 1.x

[jira] [Commented] (PIG-4592) Pig 0.15 stopped working with Hadoop 1.x

2015-06-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14588820#comment-14588820 ] Daniel Dai commented on PIG-4592: - Target signanddeploy takes output.jarfile.core instead

[jira] [Updated] (PIG-4592) Pig 0.15 stopped working with Hadoop 1.x

2015-06-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4592: Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Patch

[jira] [Commented] (PIG-4602) The test org.apache.pig.builtin.TestOrcStoragePushdown.testPredicatePushdownTimestamp is failing

2015-06-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14588800#comment-14588800 ] Daniel Dai commented on PIG-4602: - It pass on Apache build (https://builds.apache.org/view

[jira] [Resolved] (PIG-4533) Document error: Pig does support concatenated gz file

2015-06-16 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai resolved PIG-4533. - Resolution: Fixed Hadoop Flags: Reviewed Patch committed to trunk. Thanks Tomas, Rohini! Document

<    4   5   6   7   8   9   10   11   12   13   >