BinStorage/PigStorageSchema cannot load data from a different namenode
----------------------------------------------------------------------
Key: PIG-1865
URL: https://issues.apache.org/jira/browse/PIG-1865
Project: Pig
Issue Type: Bug
Affects Versions: 0.8.0, 0.7.0, 0.9.0
Reporter: Vivek Padmanabhan
BinStorage/PigStorageSchema cannot load data from a different namenode. The
main reason for this is that, in the getSchema method , they use
org.apache.pig.impl.io.FileLocalizer to check whether the exists, but the
filesystem in HDataStorage refers to the natively configured dfs.
The test case is simple :
a = load 'hdfs://<nn2>/input' using BinStorage();
dump a;
Here if I specify -Dmapreduce.job.hdfs-servers, it should have worked , by pig
still takes the fs from fs.default.name so to make it work i had to override
fs.default.name in pig command line.
Raising this as a bug since the same scenario works with PigStorage.
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira