[ 
https://issues.apache.org/jira/browse/PIG-3656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sudhir Ranjan updated PIG-3656:
-------------------------------

    Description: 
Getting exception on storing null valued record/tupple as avro.
The input file having one column with long values (one of them is null means 
nothing) and when I am trying to store the data in avro format ,it throws error.
Please suggest if I am missing any thing some where as per the bellow codebase 
or else please provide the patch.
input file: /home/hadoop/work/sudhir/AvroAnalysis/input/TSV_uncompressed/part*
content: 
2037179309
2037179338
2037179367
2037179433
2037179437
2037179449
2037179547
2037179631
2037179631

2037179309
2037179338
2037179367
2037179433
2037179437
2037179449
2037179547
2037179631
2037179631


******My code base.
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/snappy-java-1.0.4.1.jar
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/avro-1.7.5.jar
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/json-simple-1.1.jar;
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/piggybank.jar;
REGISTER 
/home/hadoop/work/sudhir/AvroAnalysis/Avrojars/jackson-core-asl-1.5.5.jar;
REGISTER 
/home/hadoop/work/sudhir/AvroAnalysis/Avrojars/jackson-mapper-asl-1.5.5.jar;
– The input file only have 1 column (normal TEXT data i.e TSV format) and the 
file having a null value means nothing
A = load '/home/hadoop/work/sudhir/AvroAnalysis/input/TSV_uncompressed/part*' 
using PigStorage('\t') as (USER_ID:long);
– The soutput to be stored in avro data format
STORE A INTO '/home/hadoop/work/sudhir/AvroAnalysis/output/AvroStore/' USING 
org.apache.pig.piggybank.storage.avro.AvroStorage('schema','{"namespace":"com.sudhir.schema.users.avro","type":"long","name":"users_avro","doc":"Avro
 storing with schema using Pig.","fields":[
{"name":"USER_ID","type":["null","long"],"default":null}
]}');
*******Getting Error like:
INFO 
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher 
- 100% complete
ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 2997: Unable to recreate 
exception from backed error: 
org.apache.avro.file.DataFileWriter$AppendWriteException: 
java.lang.NullPointerException: null of long
ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 1 map reduce job(s) failed!

  was:
Getting exception on storing null valued record/tupple as avro.
The input file having one column with long values (one of them is null means 
nothing) and when I am trying to store the data in avro format ,it throws error.
Please suggest if I am missing any thing some where as per the bellow codebase 
or else please provide the patch.
******My code base.
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/snappy-java-1.0.4.1.jar
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/avro-1.7.5.jar
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/json-simple-1.1.jar;
REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/piggybank.jar;
REGISTER 
/home/hadoop/work/sudhir/AvroAnalysis/Avrojars/jackson-core-asl-1.5.5.jar;
REGISTER 
/home/hadoop/work/sudhir/AvroAnalysis/Avrojars/jackson-mapper-asl-1.5.5.jar;
– The input file only have 1 column (normal TEXT data i.e TSV format) and the 
file having a null value means nothing
A = load '/home/hadoop/work/sudhir/AvroAnalysis/input/TSV_uncompressed/part*' 
using PigStorage('\t') as (USER_ID:long);
– The soutput to be stored in avro data format
STORE A INTO '/home/hadoop/work/sudhir/AvroAnalysis/output/AvroStore/' USING 
org.apache.pig.piggybank.storage.avro.AvroStorage('schema','{"namespace":"com.sudhir.schema.users.avro","type":"long","name":"users_avro","doc":"Avro
 storing with schema using Pig.","fields":[
{"name":"USER_ID","type":["null","long"],"default":null}
]}');
*******Getting Error like:
INFO 
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher 
- 100% complete
ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 2997: Unable to recreate 
exception from backed error: 
org.apache.avro.file.DataFileWriter$AppendWriteException: 
java.lang.NullPointerException: null of long
ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 1 map reduce job(s) failed!


> Exception on storing Null value through AvroStorage using PIG
> -------------------------------------------------------------
>
>                 Key: PIG-3656
>                 URL: https://issues.apache.org/jira/browse/PIG-3656
>             Project: Pig
>          Issue Type: Bug
>         Environment: Hadoop 0.20.2-cdh3u5
> Apache Pig version 0.8.1-cdh3u5
> java version "1.6.0_27"
>            Reporter: Sudhir Ranjan
>
> Getting exception on storing null valued record/tupple as avro.
> The input file having one column with long values (one of them is null means 
> nothing) and when I am trying to store the data in avro format ,it throws 
> error.
> Please suggest if I am missing any thing some where as per the bellow 
> codebase or else please provide the patch.
> input file: /home/hadoop/work/sudhir/AvroAnalysis/input/TSV_uncompressed/part*
> content: 
> 2037179309
> 2037179338
> 2037179367
> 2037179433
> 2037179437
> 2037179449
> 2037179547
> 2037179631
> 2037179631
> 2037179309
> 2037179338
> 2037179367
> 2037179433
> 2037179437
> 2037179449
> 2037179547
> 2037179631
> 2037179631
> ******My code base.
> REGISTER 
> /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/snappy-java-1.0.4.1.jar
> REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/avro-1.7.5.jar
> REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/json-simple-1.1.jar;
> REGISTER /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/piggybank.jar;
> REGISTER 
> /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/jackson-core-asl-1.5.5.jar;
> REGISTER 
> /home/hadoop/work/sudhir/AvroAnalysis/Avrojars/jackson-mapper-asl-1.5.5.jar;
> – The input file only have 1 column (normal TEXT data i.e TSV format) and the 
> file having a null value means nothing
> A = load '/home/hadoop/work/sudhir/AvroAnalysis/input/TSV_uncompressed/part*' 
> using PigStorage('\t') as (USER_ID:long);
> – The soutput to be stored in avro data format
> STORE A INTO '/home/hadoop/work/sudhir/AvroAnalysis/output/AvroStore/' USING 
> org.apache.pig.piggybank.storage.avro.AvroStorage('schema','{"namespace":"com.sudhir.schema.users.avro","type":"long","name":"users_avro","doc":"Avro
>  storing with schema using Pig.","fields":[
> {"name":"USER_ID","type":["null","long"],"default":null}
> ]}');
> *******Getting Error like:
> INFO 
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher
>  - 100% complete
> ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 2997: Unable to recreate 
> exception from backed error: 
> org.apache.avro.file.DataFileWriter$AppendWriteException: 
> java.lang.NullPointerException: null of long
> ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 1 map reduce job(s) failed!



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to