Hi Shakun,
SequenceFile always contains key-value pairs - that is how the format is 
defined. However this doesn’t suite Sqoop as we consider entire row as “key” 
and hence we’re not using the “value” field - and that is the null that you’re 
observing. If you use generic tool such as (hadoop dfs -text) you will get 
generic output that will include the value field and hence will show a null 
string. Simply don’t use the “value” field in your application and you will be 
good to go!

Jarcec

On Sep 23, 2014, at 11:00 PM, shakun grover <[email protected]> wrote:

> Hi All,
> 
> Whenever I do a Sqoop Import using the following command:
> 
> Name: test
> 
> Database configuration
> 
> Schema name: test
> Table name: emp
> Table SQL statement:
> Table column names: name,id
> Partition column name: id
> Nulls in partition column: true
> Boundary query:
> 
> Output configuration
> 
> Storage type:
>  0 : HDFS
> Choose: 0
> Output format:
>  0 : TEXT_FILE
>  1 : SEQUENCE_FILE
> Choose: 1
> Output directory: /tmp/Seq1/1
> 
> Throttling resources
> 
> Extractors:
> Loaders:
> Job was successfully updated with status FINE
> 
> It gives me the following output file:
> 'Tom',1 (null)
> ''Blue',2 (null)
> 'James',3 (null)
> 'Tom',4 (null)
> 'Erik',5 (null)
> 
> 
> I want to know that why it is appending (null) in the output sequence file.
> 
> Any help will be highly appreciated.
> 
> Thanks in advance!!
> 
> -- 
> Thanks & Regards,
> Shakun Grover

Reply via email to