[
https://issues.apache.org/jira/browse/PIG-798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-798:
---
Description:
In the following script I have a tab separated text file, which I load using
PigStorage
Components: impl
Affects Versions: 0.2.0
Reporter: Viraj Bhat
Fix For: 0.2.0
In the following script I have a tab separated text file, which I load using
PigStorage() and store using BinStorage()
{code}
A = load '/user/viraj/visits.txt' using PigStorage
[
https://issues.apache.org/jira/browse/PIG-798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-798:
---
Summary: Schema errors when using PigStorage and none when using BinStorage
in FOREACH?? (was: Schema errors
[
https://issues.apache.org/jira/browse/PIG-564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12704455#action_12704455
]
Viraj Bhat commented on PIG-564:
Another special character / is not handled correctly
[
https://issues.apache.org/jira/browse/PIG-564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12704455#action_12704455
]
Viraj Bhat edited comment on PIG-564 at 4/29/09 8:04 PM:
-
Another
[
https://issues.apache.org/jira/browse/PIG-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12703937#action_12703937
]
Viraj Bhat commented on PIG-774:
Daniel,
Thanks again for your patch, I worked with Pradeep
[
https://issues.apache.org/jira/browse/PIG-619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12703941#action_12703941
]
Viraj Bhat commented on PIG-619:
So when does the Multi-Store query optimization get committed
/browse/PIG-790
Project: Pig
Issue Type: Bug
Affects Versions: 0.0.0
Reporter: Viraj Bhat
Priority: Minor
I have a simple Pig script which loads integer data and does an Bincond, where
it compares, col1 eq ''. There is an error message
[
https://issues.apache.org/jira/browse/PIG-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12703977#action_12703977
]
Viraj Bhat commented on PIG-774:
I modified the file PigScriptParser.jj, and it works.
Pig
[
https://issues.apache.org/jira/browse/PIG-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12702620#action_12702620
]
Viraj Bhat commented on PIG-774:
One workaround for this issue is using the FilterFunc, which
[
https://issues.apache.org/jira/browse/PIG-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12702621#action_12702621
]
Viraj Bhat commented on PIG-774:
Ciemo, as stated in the original problem description
[
https://issues.apache.org/jira/browse/PIG-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12702620#action_12702620
]
Viraj Bhat edited comment on PIG-774 at 4/24/09 4:35 PM:
-
One
[
https://issues.apache.org/jira/browse/PIG-755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12702645#action_12702645
]
Viraj Bhat commented on PIG-755:
Ciemo presently there is an option in Pig known as dryrun
[
https://issues.apache.org/jira/browse/PIG-774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-774:
---
Description:
I created a very small test case in which I did the following.
1) Created a UTF-8 file which
: https://issues.apache.org/jira/browse/PIG-772
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.3.0
Reporter: Viraj Bhat
Priority: Minor
Fix For: 0.3.0
I have a Pig script which tries to display all bags
[
https://issues.apache.org/jira/browse/PIG-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-772:
---
Description:
I have a Pig script which tries to display all bags which are greater than the
average value
[
https://issues.apache.org/jira/browse/PIG-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-772:
---
Issue Type: Improvement (was: Bug)
Semantics of Filter statement inside ForEach should support filtering
[
https://issues.apache.org/jira/browse/PIG-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-772:
---
Attachment: half.txt
Input file
Semantics of Filter statement inside ForEach should support filtering
[
https://issues.apache.org/jira/browse/PIG-754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12696263#action_12696263
]
Viraj Bhat commented on PIG-754:
Ciemo there is a workaround in this form, if we make
[
https://issues.apache.org/jira/browse/PIG-754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12696265#action_12696265
]
Viraj Bhat commented on PIG-754:
Another workaround as suggested in PIG:564 :)
{code}
pig
[
https://issues.apache.org/jira/browse/PIG-754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12696265#action_12696265
]
Viraj Bhat edited comment on PIG-754 at 4/6/09 2:43 PM:
Another
[
https://issues.apache.org/jira/browse/PIG-754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12696273#action_12696273
]
Viraj Bhat commented on PIG-754:
Something that I am still not understanding is why does
/browse/PIG-755
Project: Pig
Issue Type: Bug
Components: grunt
Affects Versions: 0.3.0
Reporter: Viraj Bhat
Fix For: 0.3.0
I have a script in which I do a parameter substitution for the input file. I
have a use case where I find
[
https://issues.apache.org/jira/browse/PIG-755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-755:
---
Attachment: localparamsub.pig
inputfile.txt
Script and testfile
Difficult to debug parameter
://issues.apache.org/jira/browse/PIG-751
Project: Pig
Issue Type: Bug
Components: grunt
Affects Versions: 0.3.0
Reporter: Viraj Bhat
Fix For: 0.3.0
I have an input file which is being loaded by BinStorage()
{code}
myinput = LOAD 'partfile' USING
Issue Type: Bug
Components: impl
Affects Versions: 0.3.0
Reporter: Viraj Bhat
Fix For: 0.3.0
I have a Pig script in which I count the number of distinct records resulting
from the filter, this statement is embedded in a foreach. The number of records
I get
[
https://issues.apache.org/jira/browse/PIG-738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12693970#action_12693970
]
Viraj Bhat commented on PIG-738:
This works, as the Pig parser ignores single front slash
Reporter: Viraj Bhat
Fix For: 0.3.0
Consider a pig script which parses and counts regular expressions from a text
file.
The regular expression supplied in the Pig script needs to escape the .
(dot) character.
{code}
register myregexp.jar;
-- pattern not picked up
[
https://issues.apache.org/jira/browse/PIG-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-738:
---
Attachment: regexpinput.txt
myregexp.jar
RegexGroupCount.java
Java,Jar for UDF
[
https://issues.apache.org/jira/browse/PIG-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-738:
---
Attachment: regexp.pig
Pig script
Regexp passed from pigscript fails in UDF
[
https://issues.apache.org/jira/browse/PIG-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-736:
---
Attachment: pig_latestversion_errmsg.log
pig_oldversion_errmsg.log
Mixed up the files
[
https://issues.apache.org/jira/browse/PIG-514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12689772#action_12689772
]
Viraj Bhat commented on PIG-514:
Another test case: consider the following input file:
1
-736
URL: https://issues.apache.org/jira/browse/PIG-736
Project: Pig
Issue Type: Bug
Affects Versions: 1.0.1
Reporter: Viraj Bhat
Fix For: 1.0.1
Suppose I have Pig script which accesses a directory in HDFS for which I do not
have
[
https://issues.apache.org/jira/browse/PIG-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-736:
---
Attachment: pig_oldversion_errmsg.log
pig_newversion_errmsg.log
Pig error logs
Inconsistent
[
https://issues.apache.org/jira/browse/PIG-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-736:
---
Attachment: (was: pig_newversion_errmsg.log)
Inconsistent error message when the message should be about
[
https://issues.apache.org/jira/browse/PIG-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-736:
---
Attachment: pig_oldversion_errmsg.log
pig_newversion_errmsg.log
re-attaching with ASF inclusion
[
https://issues.apache.org/jira/browse/PIG-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-736:
---
Attachment: (was: pig_oldversion_errmsg.log)
Inconsistent error message when the message should be about
: grunt
Affects Versions: 1.0.1
Reporter: Viraj Bhat
Fix For: 1.0.1
Pig script, which uses a UDF loads in 3 chararray columns, and then
concatenates columns 2 and 3 using a semicolon.
{code}
register CONCATSEP.jar;
A = LOAD 'someinput/*' USING PigStorage(';') as
(col1
[
https://issues.apache.org/jira/browse/PIG-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-731:
---
Attachment: semicolonerr.pig
CONCATSEP.jar
Pig script and jar file for testing the script
/PIG-693
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
Consider the following Pig Script
{code}
register myudf.jar;
A = load 'one.txt' using PigStorage() as ( one
[
https://issues.apache.org/jira/browse/PIG-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-693:
---
Attachment: URLDECODE.java
Eval UDF
Parameter to UDF which is an alias returned in another UDF in nested
[
https://issues.apache.org/jira/browse/PIG-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-693:
---
Attachment: one.txt
Test input file to start execution
Parameter to UDF which is an alias returned in another
[
https://issues.apache.org/jira/browse/PIG-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-656:
---
Attachment: TOKENIZE.jar
TOKENIZE.jar with java source file included
Use of eval word in the package
[
https://issues.apache.org/jira/browse/PIG-564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-564:
---
Summary: Parameter Substitution using -param option does not seem to work
when parameters contain special
Affects Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
Consider the following Pig script where we generate column names b and b in the
FOREACH
{code}
DATA = LOAD 'blah.txt' as (a:long, b:long);
RESULT = FOREACH DATA GENERATE a, b, (b20?b:0) as b
[
https://issues.apache.org/jira/browse/PIG-644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-644:
---
Attachment: blah.txt
Sample input
Duplicate column names in foreach do not throw parser error
-619
URL: https://issues.apache.org/jira/browse/PIG-619
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: types_branch
Environment: Hadoop 18, Multi-node hadoop installation
Reporter: Viraj Bhat
Fix
[
https://issues.apache.org/jira/browse/PIG-619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-619:
---
Attachment: mydata.txt
Test data
Dumping empty results produces Unable to get results for
/tmp/temp
[
https://issues.apache.org/jira/browse/PIG-613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-613:
---
Attachment: myfloatdata.txt
Test input file
Casting elements inside a tuple does not take effect
Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
Attachments: myfloatdata.txt
Consider the following Pig script which casts return values of the SQUARE UDF
which are tuples of doubles to long. The describe output of B shows it is
long, however
[
https://issues.apache.org/jira/browse/PIG-613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-613:
---
Attachment: SQUARE.java
SQUARE UDF
Casting elements inside a tuple does not take effect
[
https://issues.apache.org/jira/browse/PIG-595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-595:
---
Attachment: querypairs.txt
Input file
Use of Combiner causes java.lang.ClassCastException in ForEach
: impl
Affects Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
Attachments: querypairs.txt
The following Pig script causes a ClassCastException when QueryPairs is used in
the ForEach statement. This is due to the use of the combiner.
{code
: Pig
Issue Type: Bug
Components: impl
Affects Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
I have a UDF known as INSETFROMFILE, which matches data against a set of values
stored in an HDFS file. The INSETFROMFILE extends
[
https://issues.apache.org/jira/browse/PIG-594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-594:
---
Attachment: myurldata.txt
Input data for Pig Script
Inconsistent behaviour of FilterFunc UDF when used
[
https://issues.apache.org/jira/browse/PIG-594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-594:
---
Attachment: INSETFROMFILE.java
INSETFROMFILE UDF which uses FilterFunc
Inconsistent behaviour of FilterFunc
[
https://issues.apache.org/jira/browse/PIG-568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-568:
---
Attachment: myudfint.pig
Pig Script causing the exception
Reducer plan generation fails when UDF contains
URL: https://issues.apache.org/jira/browse/PIG-564
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
Consider the following Pig script which uses parameter
Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
There is a need to sometimes generate empty tuples and bags as a part of the
Pig syntax rather than using UDF's
{code}
a = load 'mydata.txt' using PigStorage();
b =foreach a generate ( ) as emptytuple;
c
[
https://issues.apache.org/jira/browse/PIG-558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-558:
---
Attachment: table1
Table1 for test
Distinct followed by a Join results in Invalid size 0 for a tuple error
[
https://issues.apache.org/jira/browse/PIG-558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-558:
---
Attachment: table2
Table2 for test
Distinct followed by a Join results in Invalid size 0 for a tuple error
Versions: types_branch
Reporter: Viraj Bhat
Fix For: types_branch
The UDF RegexMatcher, reports its progress using the reporter (PigProgressable)
object in the exec method. It seems that the reporter object is not being set
in the EvalFunc and hence the following piece
[
https://issues.apache.org/jira/browse/PIG-537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-537:
---
Attachment: mymarks.txt
Test file mymarks.txt
Failure in Hadoop map collect stage due to type mismatch
101 - 163 of 163 matches
Mail list logo