Consider passing result of COUNT/COUNT_STAR to LIMIT
-
Key: PIG-1660
URL: https://issues.apache.org/jira/browse/PIG-1660
Project: Pig
Issue Type: Improvement
Affects Versions: 0.7.0
Support param_files to be loaded into HDFS
--
Key: PIG-1630
URL: https://issues.apache.org/jira/browse/PIG-1630
Project: Pig
Issue Type: New Feature
Affects Versions: 0.7.0
Reporter:
Support to 2 level nested foreach
-
Key: PIG-1631
URL: https://issues.apache.org/jira/browse/PIG-1631
Project: Pig
Issue Type: New Feature
Affects Versions: 0.7.0
Reporter: Viraj Bhat
What I
Using an alias withing Nested Foreach causes indeterminate behaviour
Key: PIG-1633
URL: https://issues.apache.org/jira/browse/PIG-1633
Project: Pig
Issue Type: Bug
Multiple names for the group field
Key: PIG-1634
URL: https://issues.apache.org/jira/browse/PIG-1634
Project: Pig
Issue Type: New Feature
Affects Versions: 0.7.0, 0.6.0, 0.5.0, 0.4.0, 0.3.0, 0.2.0,
Return code from Pig is 0 even if the job fails when using -M flag
--
Key: PIG-1615
URL: https://issues.apache.org/jira/browse/PIG-1615
Project: Pig
Issue Type: Bug
Affects
[
https://issues.apache.org/jira/browse/PIG-1615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12910414#action_12910414
]
Viraj Bhat commented on PIG-1615:
-
I tested this on Pig 0.8, but with a downloaded version,
[
https://issues.apache.org/jira/browse/PIG-282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-282:
---
Release Note:
This feature allows to specify Hadoop Partitioner for the following operations:
GROUP/COGROUP,
Parameter subsitution using -param option runs into problems when substituing
entire pig statements in a shell script (maybe this is a bash problem)
[
https://issues.apache.org/jira/browse/PIG-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1586:
Description:
I have a Pig script as a template:
{code}
register Countwords.jar;
A = $INPUT;
B = FOREACH A
Difference in Semantics between Load statement in Pig and HDFS client on
Command line
-
Key: PIG-1576
URL: https://issues.apache.org/jira/browse/PIG-1576
Project:
XMLLoader in Piggybank does not support bz2 or gzip compressed XML files
Key: PIG-1561
URL: https://issues.apache.org/jira/browse/PIG-1561
Project: Pig
Issue Type: Bug
Piggybank MultiStorage does not scale when processing around 7k records per
bucket
--
Key: PIG-1547
URL: https://issues.apache.org/jira/browse/PIG-1547
Project: Pig
[
https://issues.apache.org/jira/browse/PIG-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12895858#action_12895858
]
Viraj Bhat commented on PIG-1537:
-
Hi Olga, I have given the specific script with UDF's for
[
https://issues.apache.org/jira/browse/PIG-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1537:
Description:
I have script which is of this pattern and it uses 2 StoreFunc's:
{code}
register loader.jar
Column pruner causes wrong results when using both Custom Store UDF and
PigStorage
--
Key: PIG-1537
URL: https://issues.apache.org/jira/browse/PIG-1537
Project: Pig
[
https://issues.apache.org/jira/browse/PIG-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12864963#action_12864963
]
Viraj Bhat commented on PIG-1345:
-
Richard thanks for suggesting a workaround. The error
[
https://issues.apache.org/jira/browse/PIG-798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12861097#action_12861097
]
Viraj Bhat commented on PIG-798:
Hi Ashutosh,
Yes that is possible, I know that we can do
[
https://issues.apache.org/jira/browse/PIG-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12861106#action_12861106
]
Viraj Bhat commented on PIG-1211:
-
Ashutosh, yes as more and more people adopt Pig, they
[
https://issues.apache.org/jira/browse/PIG-798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12861134#action_12861134
]
Viraj Bhat commented on PIG-798:
Ashutosh thanks for clarifying, we will wait till that bug is
[
https://issues.apache.org/jira/browse/PIG-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12860397#action_12860397
]
Viraj Bhat commented on PIG-1345:
-
Which release will PIG:908 be fixed?
Does it guarantee
[
https://issues.apache.org/jira/browse/PIG-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12860419#action_12860419
]
Viraj Bhat commented on PIG-1211:
-
Ashutosh, I feel that the user may not be interested in
[
https://issues.apache.org/jira/browse/PIG-1339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12860445#action_12860445
]
Viraj Bhat commented on PIG-1339:
-
Hi Ashutosh this does not work in trunk. I am using the
[
https://issues.apache.org/jira/browse/PIG-798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12860452#action_12860452
]
Viraj Bhat commented on PIG-798:
Hi Ashutosh,
The problem here is not about using the data
[
https://issues.apache.org/jira/browse/PIG-798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-798:
---
Affects Version/s: 0.6.0
0.5.0
0.4.0
0.3.0
[
https://issues.apache.org/jira/browse/PIG-1378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859384#action_12859384
]
Viraj Bhat commented on PIG-1378:
-
har:// currently works in Pig 0.7 when the hdfs location
har url not usable in Pig scripts
-
Key: PIG-1378
URL: https://issues.apache.org/jira/browse/PIG-1378
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.7.0
Reporter:
[
https://issues.apache.org/jira/browse/PIG-1378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1378:
Description:
I am trying to use har (Hadoop Archives) in my Pig script.
I can use them through the HDFS
[
https://issues.apache.org/jira/browse/PIG-518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12857157#action_12857157
]
Viraj Bhat commented on PIG-518:
The above script generates the following error in Pig 0.7
[
https://issues.apache.org/jira/browse/PIG-518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat resolved PIG-518.
Fix Version/s: 0.7.0
Resolution: Fixed
LOBinCond exception in LogicalPlanValidationExecutor when
[
https://issues.apache.org/jira/browse/PIG-829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat resolved PIG-829.
Fix Version/s: 0.7.0
Resolution: Fixed
Pig 0.7 yields the correct result.
{code}
x = LOAD 'something'
Pig/Zebra fails without proper error message when the
mapred.jobtracker.maxtasks.per.job exceeds threshold
--
Key: PIG-1377
URL:
Order by fails with java.lang.String cannot be cast to
org.apache.pig.data.DataBag
--
Key: PIG-1374
URL: https://issues.apache.org/jira/browse/PIG-1374
Project: Pig
[
https://issues.apache.org/jira/browse/PIG-756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854762#action_12854762
]
Viraj Bhat commented on PIG-756:
In Pig 0.7 we have moved local mode of Pig to local mode of
[
https://issues.apache.org/jira/browse/PIG-756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat resolved PIG-756.
Resolution: Fixed
Fix Version/s: 0.7.0
https://issues.apache.org/jira/browse/PIG-1053 fixes this
Link casting errors in POCast to actual lines numbers in Pig script
---
Key: PIG-1345
URL: https://issues.apache.org/jira/browse/PIG-1345
Project: Pig
Issue Type: Improvement
International characters in column names not supported
--
Key: PIG-1339
URL: https://issues.apache.org/jira/browse/PIG-1339
Project: Pig
Issue Type: Bug
Components: impl
Cannot convert DataByeArray to Chararray and results in
FIELD_DISCARDED_TYPE_CONVERSION_FAILED 20
-
Key: PIG-1341
URL: https://issues.apache.org/jira/browse/PIG-1341
[
https://issues.apache.org/jira/browse/PIG-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1341:
Component/s: impl
Summary: Cannot convert DataByeArray to Chararray and results in
pig_log file missing even though Main tells it is creating one and an M/R job
fails
Key: PIG-1343
URL: https://issues.apache.org/jira/browse/PIG-1343
Project: Pig
Inifinite loop in JobClient when reading from BinStorage Message:
[org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to
process : 2]
[
https://issues.apache.org/jira/browse/PIG-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1308:
Description:
Simple script fails to read files from BinStorage() and fails to submit jobs to
JobTracker.
Type mismatch in key from map: expected
org.apache.pig.impl.io.NullableFloatWritable, recieved
org.apache.pig.impl.io.NullableText
---
Key: PIG-1278
Detect org.apache.pig.data.DataByteArray cannot be cast to
org.apache.pig.data.Tuple type of errors at Compile Type during creation of
logical plan
---
[
https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840339#action_12840339
]
Viraj Bhat commented on PIG-1252:
-
A modified version of the script works, does this have to
Column pruner causes wrong results
--
Key: PIG-1272
URL: https://issues.apache.org/jira/browse/PIG-1272
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.6.0
[
https://issues.apache.org/jira/browse/PIG-1272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840389#action_12840389
]
Viraj Bhat commented on PIG-1272:
-
Now with Pig 0.7 or trunk we have the following error:
Script producing varying number of records when COGROUPing value of map data
type with and without types
Key: PIG-1263
URL:
Diamond splitter does not generate correct results when using Multi-query
optimization
--
Key: PIG-1252
URL: https://issues.apache.org/jira/browse/PIG-1252
Project:
[
https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1252:
Description:
I have script which uses split but somehow does not use one of the split
branch. The skeleton
Error Number makes it hard to debug: ERROR 2999: Unexpected internal error.
org.apache.pig.backend.datastorage.DataStorageException cannot be cast to
java.lang.Error
Passing Complex map types to and from streaming causes a problem
Key: PIG-1243
URL: https://issues.apache.org/jira/browse/PIG-1243
Project: Pig
Issue Type: Bug
Affects
[
https://issues.apache.org/jira/browse/PIG-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat reopened PIG-1194:
-
Hi Richard,
I ran the script attached on the ticket and found out that the map tasks fails
with the
[
https://issues.apache.org/jira/browse/PIG-1131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12831248#action_12831248
]
Viraj Bhat commented on PIG-1131:
-
Olga I marked it as critical since we mention that Pig can
[
https://issues.apache.org/jira/browse/PIG-1131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12831251#action_12831251
]
Viraj Bhat commented on PIG-1131:
-
Ashutosh I was able to recreate a similar problem using
Document unknown keywords as missing or to do in future
---
Key: PIG-1220
URL: https://issues.apache.org/jira/browse/PIG-1220
Project: Pig
Issue Type: Bug
Components:
[
https://issues.apache.org/jira/browse/PIG-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1174:
Fix Version/s: 0.7.0
Creation of output path should be done by storage function
[
https://issues.apache.org/jira/browse/PIG-940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-940:
---
Affects Version/s: (was: 0.3.0)
0.5.0
Fix Version/s: 0.7.0
Cross site HDFS
[
https://issues.apache.org/jira/browse/PIG-531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-531:
---
Fix Version/s: 0.5.0
Hi Olga,
I think we have a way to handle it in multi-query optimization. Is it
ERROR 2055: Received Error while processing the map plan
Key: PIG-1194
URL: https://issues.apache.org/jira/browse/PIG-1194
Project: Pig
Issue Type: Bug
Components: impl
[
https://issues.apache.org/jira/browse/PIG-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1194:
Attachment: inputdata.txt
Testdata to run with this script
ERROR 2055: Received Error while processing the
[
https://issues.apache.org/jira/browse/PIG-1187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12800315#action_12800315
]
Viraj Bhat commented on PIG-1187:
-
Hi Jeff,
This is specific to the data we are using and it
UTF-8 (international code) breaks with loader when load with schema is specified
Key: PIG-1187
URL: https://issues.apache.org/jira/browse/PIG-1187
Project: Pig
Sucessive replicated joins do not generate Map Reduce plan and fails due to OOM
---
Key: PIG-1157
URL: https://issues.apache.org/jira/browse/PIG-1157
Project: Pig
[
https://issues.apache.org/jira/browse/PIG-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1157:
Attachment: oomreplicatedjoin.pig
replicatedjoinexplain.log
Explain output and Pig script.
set default_parallelism construct does not set the number of reducers correctly
---
Key: PIG-1144
URL: https://issues.apache.org/jira/browse/PIG-1144
Project: Pig
[
https://issues.apache.org/jira/browse/PIG-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1144:
Attachment: brokenparallel.out
genericscript_broken_parallel.pig
Script and explain output
[
https://issues.apache.org/jira/browse/PIG-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12788436#action_12788436
]
Viraj Bhat commented on PIG-1144:
-
This happens on the real cluster, where the sorting job
[
https://issues.apache.org/jira/browse/PIG-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12788439#action_12788439
]
Viraj Bhat commented on PIG-1144:
-
Hi Daniel,
One more thing to note is that the Last Sort
[
https://issues.apache.org/jira/browse/PIG-1144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12788481#action_12788481
]
Viraj Bhat commented on PIG-1144:
-
Hi Daniel,
Thanks again for your input. This is more of a
Pig simple join does not work when it contains empty lines
--
Key: PIG-1131
URL: https://issues.apache.org/jira/browse/PIG-1131
Project: Pig
Issue Type: Bug
Components: impl
[
https://issues.apache.org/jira/browse/PIG-1131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1131:
Attachment: simplejoinscript.pig
junk2.txt
junk1.txt
Dummy datasets and pig
Unable to set Custom Job Name using the -Dmapred.job.name parameter
---
Key: PIG-1124
URL: https://issues.apache.org/jira/browse/PIG-1124
Project: Pig
Issue Type: Bug
Pig parser does not recognize its own data type in LIMIT statement
--
Key: PIG-1101
URL: https://issues.apache.org/jira/browse/PIG-1101
Project: Pig
Issue Type: Bug
PigCookBook use of PARALLEL keyword
---
Key: PIG-1081
URL: https://issues.apache.org/jira/browse/PIG-1081
Project: Pig
Issue Type: Bug
Components: documentation
Affects Versions: 0.5.0
Pig CookBook documentation Take Advantage of Join Optimization
additions:Merge and Skewed Join
Key: PIG-1084
URL: https://issues.apache.org/jira/browse/PIG-1084
[
https://issues.apache.org/jira/browse/PIG-1060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12773744#action_12773744
]
Viraj Bhat commented on PIG-1060:
-
Hi Ankur and Richard,
I have a script which demonstrates
Behvaiour of COGROUP with and without schema when using * operator
Key: PIG-1064
URL: https://issues.apache.org/jira/browse/PIG-1064
Project: Pig
Issue Type: Bug
PigStorage interpreting chararray/bytearray for a tuple element inside a bag as
float or double
---
Key: PIG-1031
URL: https://issues.apache.org/jira/browse/PIG-1031
[
https://issues.apache.org/jira/browse/PIG-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-1031:
Description:
I have a data stored in a text file as:
{(4153E765)}
{(AF533765)}
I try reading it using
ERROR 2100 (hdfs://localhost/tmp/temp175740929/tmp-1126214010 does not exist)
and ERROR 2999: (Unexpected internal error. null) when using Multi-Query
optimization
Issues with mv command when used after store when using -param_file/-param
options
--
Key: PIG-974
URL: https://issues.apache.org/jira/browse/PIG-974
Project: Pig
[
https://issues.apache.org/jira/browse/PIG-974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-974:
---
Attachment: studenttab10k
Testdata
Issues with mv command when used after store when using -param_file/-param
[
https://issues.apache.org/jira/browse/PIG-974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12758962#action_12758962
]
Viraj Bhat commented on PIG-974:
It turns out that the problem was due to single quotes.
Cross site HDFS access using the default.fs.name not possible in Pig
Key: PIG-940
URL: https://issues.apache.org/jira/browse/PIG-940
Project: Pig
Issue Type: Bug
[
https://issues.apache.org/jira/browse/PIG-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12749722#action_12749722
]
Viraj Bhat commented on PIG-940:
One important point to add:
{code}
localmachine.company.com
Type mismatch in key from map: expected
org.apache.pig.impl.io.NullableBytesWritable, recieved
org.apache.pig.impl.io.NullableText when doing simple group
[
https://issues.apache.org/jira/browse/PIG-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12742668#action_12742668
]
Viraj Bhat commented on PIG-919:
This problem can be solved simply by casting the firstname to
[
https://issues.apache.org/jira/browse/PIG-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740360#action_12740360
]
Viraj Bhat commented on PIG-913:
The following works though..
{code}
data = LOAD
Problem accessing a tuple within a bag
--
Key: PIG-828
URL: https://issues.apache.org/jira/browse/PIG-828
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.3.0
[
https://issues.apache.org/jira/browse/PIG-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-828:
---
Attachment: tupleacc.pig
studenttab5
Input script and data.
Problem accessing a tuple within
PigStorage() does not accept Unicode characters in its contructor
--
Key: PIG-816
URL: https://issues.apache.org/jira/browse/PIG-816
Project: Pig
Issue Type: Bug
[
https://issues.apache.org/jira/browse/PIG-816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-816:
---
Attachment: pig_1243043613713.log
Log file for detailed error message
PigStorage() does not accept Unicode
[
https://issues.apache.org/jira/browse/PIG-656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710862#action_12710862
]
Viraj Bhat commented on PIG-656:
Another pig parse issue when a udf was defined within a
[
https://issues.apache.org/jira/browse/PIG-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat reopened PIG-656:
Documentation should be updated on the eval keyword and what it actually does
otherwise the user can be lost
[
https://issues.apache.org/jira/browse/PIG-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-656:
---
Summary: Use of eval or any other keyword in the package hierarchy of a UDF
causes parse exception (was: Use
COUNT(*) does not work
---
Key: PIG-812
URL: https://issues.apache.org/jira/browse/PIG-812
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.2.0
Reporter: Viraj Bhat
[
https://issues.apache.org/jira/browse/PIG-812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-812:
---
Attachment: studenttab10k
Input file
COUNT(*) does not work
---
Key:
[
https://issues.apache.org/jira/browse/PIG-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710619#action_12710619
]
Viraj Bhat commented on PIG-774:
Hi Daniel,
For this patch to work, is it important to set:
[
https://issues.apache.org/jira/browse/PIG-798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Bhat updated PIG-798:
---
Description:
In the following script I have a tab separated text file, which I load using
PigStorage() and
1 - 100 of 162 matches
Mail list logo