I can call listdir to read from local filesystem in a python UDF. Did you 
implement your function as a proper UDF?
________________________________________
From: Haider [[email protected]]
Sent: Monday, December 02, 2013 5:22 AM
To: [email protected]
Subject: listdir() python function is not wokring on hadoop

Hi all

   is there any one who successfully used listdir() function to retrieve
files one by one from HDFS using python script.


 if __name__ == '__main__':

    for filename in os.listdir("/user/hdmaster/XML2"):
    print filename

ERROR streaming.StreamJob: Job not successful. Error: # of failed Map Tasks
exceeded allowed limit. FailedCount: 1. LastFailedTask:
task_201312020139_0025_m_000000
13/12/02 05:20:50 INFO streaming.StreamJob: killJob...

My intention is to take files one by one to parse.

Any help or suggestion on this will be so much helpful to me

Thanks
Haider

Reply via email to