Re: pig on tez NoSuchMethodError decodeBase64

2015-03-10 Thread Patcharee Thongtra
load_u.pig it just worked. Patcharee On 03/09/2015 08:42 PM, Daniel Dai wrote: Sounds like a commons-codec version conflict. Tez is using commons-codec 1.4. Do you see another version of commons-codec.jar in your Hadoop lib or CLASSPATH? Thanks, Daniel On 3/9/15, 5:28 AM, "Patc

pig on tez NoSuchMethodError decodeBase64

2015-03-09 Thread Patcharee Thongtra
Hi, I tried to run pig on TEZ> pig -Dpig.additional.jars=... -useHCatalog -x TEZ load_u.pig But I got an exception> Error cause TezChild exit.:java.lang.NoSuchMethodError: org.apache.commons.codec.binary.Base64.decodeBase64(Ljava/lang/String;)[B It seems the problem is from commons-codec-*.

Load columns changed name

2015-01-15 Thread Patcharee Thongtra
Hi, I have a hive table with a column which was changed its name. Pig is not able to load data from this column, it is all empty. Any ideas how to fix it? BR, Patcharee

left join on multiple columns

2015-01-08 Thread Patcharee Thongtra
Hi, I am new to pig. I am using pig version 0.12. I found an unexpected behaviour from left join on multiple columns as listed below -- ... ... dump r_four_dim1; describe r_four_dim1; dump result_height; describe result_height; j

extract tuple from bag in an order

2014-06-05 Thread Patcharee Thongtra
Hi, I have the following data (2009-09-09,2,1,{(70)},{(80)},{(90)}) (2010-10-10,2,12,{(71),(75)},{(81),(85)},{(91),(95)}) (2012-12-12,2,9,{(76),(77),(78)},{(86),(87),(88)},{(96),(97),(98)}) which is in the format {date: chararray, zone: int, z: int, uTmp: {(varvalue: int)}, vTmp: {(varvalue:

Re: java.lang.String cannot be cast to java.lang.Integer

2014-05-30 Thread Patcharee Thongtra
ot;11 "). Ultimately, you can read the z column as chararray and process it with udf. This will give you chance to log the faulty record. Tell if any of this removed the problem. Piotr 30 maj 2014 12:02 "Patcharee Thongtra" napisał(a): Hi, I got very strange exception.

java.lang.String cannot be cast to java.lang.Integer

2014-05-30 Thread Patcharee Thongtra
Hi, I got very strange exception. 80693 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias ordered. Backend error : java.lang.String cannot be cast to java.lang.Integer 14/05/30 11:53:22 ERROR grunt.Grunt: ERROR 1066: Unable to open iterator for ali

HCatalog select one column from hive

2014-05-28 Thread Patcharee Thongtra
Hi, I am very new to pig and hive. In pig, how can I execute this command 'select a_column from a_table' through HCatalog? Any guidance is appreciated. Patcharee

HCatLoader Table not found

2014-05-16 Thread Patcharee Thongtra
Hi, I am using HCatLoader to load data from a table (existing in hive). A = load 'rwf_data' USING org.apache.hcatalog.pig.HCatLoader(); describe A; I got Error 1115: Table not found : ... It is weird. Any suggestions on this? Thanks Patcharee

store to defined filename

2014-05-14 Thread Patcharee Thongtra
Hi, Is it possible to store results in to a file with determined filename, instead of part-r-0? How to do that? Patcharee

Re: pass command line parameters to custom LOAD

2014-05-12 Thread Patcharee Thongtra
14/04/custom-parameters-to-pig-script.html Hope this answers your question. ​ On Tue, May 6, 2014 at 1:54 PM, Patcharee Thongtra < patcharee.thong...@uni.no> wrote: Hi, How can I pass command line parameters to my custom LOAD function? Patcharee

pass command line parameters to custom LOAD

2014-05-06 Thread Patcharee Thongtra
Hi, How can I pass command line parameters to my custom LOAD function? Patcharee

increase memory size used by Pig UDF

2014-04-30 Thread Patcharee Thongtra
Hi, How can I increase memory size used by Pig UDF? I got OutOfMemoryError exception which was thrown before Pig submitted jobs to Hadoop, see error log. 426405 [JobControl] ERROR org.apache.pig.backend.hadoop23.PigJobControl - Error while trying to run jobs. java.lang.RuntimeException: ja

How do I flatten bag after group?

2014-04-23 Thread Patcharee Thongtra
Hi, From the schema C: {group: (int,int,int),{(varvalue: {t: (varname: chararray,shape: float)})}} I would like to get {int,int,int,(varname,shape)}, where there are multiple varname and shape value of each varname. How can I write the pig script to generate that? Patcharee

Number of map task

2014-04-22 Thread Patcharee Thongtra
Hi, I wrote a custom InputFormat. When I ran the pig script Load function using this InputFormat, the number of InputSplit = 16, but there was only 2 map tasks handling these splits. Apparently the no. of map tasks = the no. of input files. Does the number of Map task not correspond to the n

Pass user configurations/arguments to UDF

2014-04-09 Thread Patcharee Thongtra
Hi, I implemented a custom load function. How to pass some user settings to this function? Any help is appreciated, Patcharee

Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses

2014-04-03 Thread Patcharee Thongtra
Hi, I am trying to run pig test. When I execute mvn test, I got error org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias data at org.apache.pig.PigServer.openIterator(PigServer.java:880) at com.mortardata.pig.TestExampleLoader.testLoader(