Refer to the hadoop put and get syntax for placing input files on HDFS (you can automate this with a script), and to Pig's dump and store for writing results to your output directory after MapReduce - http://pig.apache.org/docs/r0.9.2/start.html#Pig+Tutorial+Files
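
As a rough sketch of how the put step could be scripted: the Python below derives the year/month/day/account directory from a file name and builds the matching hadoop fs commands. It assumes names always follow the prefix_YYYYMMDD_account.csv pattern from the example; the /data root and the function names are hypothetical, and HDFS paths use forward slashes. The -p flag on -mkdir is assumed to be available (Hadoop 2.x and later); on older 1.x releases -mkdir is understood to create parent directories on its own, so drop the flag there.

```python
import os
import re
import subprocess

# Assumed file name pattern: <prefix>_<YYYYMMDD>_<account>.csv
FILENAME_RE = re.compile(
    r"^(?P<prefix>[^_]+)_(?P<year>\d{4})(?P<month>\d{2})(?P<day>\d{2})"
    r"_(?P<account>[^_.]+)\.csv$"
)


def hdfs_target_dir(filename, hdfs_home="/data"):
    """Return <hdfs_home>/<year>/<month>/<day>/<account> for a file name.

    hdfs_home is a hypothetical root directory; substitute your own.
    """
    match = FILENAME_RE.match(filename)
    if match is None:
        raise ValueError("unexpected file name: %r" % filename)
    return "/".join([
        hdfs_home.rstrip("/"),
        match.group("year"),
        match.group("month"),
        match.group("day"),
        match.group("account"),
    ])


def put_commands(local_path, hdfs_home="/data"):
    """Build the two hadoop fs commands that upload one local file."""
    filename = os.path.basename(local_path)
    target_dir = hdfs_target_dir(filename, hdfs_home)
    return [
        # Assumption: -p is accepted (Hadoop 2.x+); remove it on 1.x.
        ["hadoop", "fs", "-mkdir", "-p", target_dir],
        ["hadoop", "fs", "-put", local_path, target_dir + "/" + filename],
    ]


def upload(local_path, hdfs_home="/data", dry_run=True):
    """Print the commands (dry run) or execute them via the hadoop CLI."""
    for command in put_commands(local_path, hdfs_home):
        if dry_run:
            print(" ".join(command))
        else:
            subprocess.check_call(command)


if __name__ == "__main__":
    # Dry run only: prints the commands without contacting a cluster.
    for name in ["Test_20120917_ACC1.csv", "Test_20120916_ACC2.csv"]:
        upload(name)
```

With the hypothetical /data root, Test_20120917_ACC1.csv maps to /data/2012/09/17/ACC1. Call upload with dry_run=False on a machine where the hadoop command is on the PATH to actually copy the files.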

Thanks
Rekha

From: A Geek <[email protected]>
Reply-To: <[email protected]>
Date: Tue, 18 Sep 2012 05:04:05 +0000
To: <[email protected]>
Subject: Hadoop HDFS and Mapreducer question

Hello All,

I'm learning Hadoop, HDFS, etc. and am currently trying to solve one issue. Can someone help me with how to start attacking the following problem?

I'm trying to come up with some sample code to store files in a \YEAR\Month\Date\account structure using Hadoop techniques.

Example: the files will be submitted to the program as below:
Test_20120917_ACC1.csv and Test_20120916_ACC2.csv

HDFS has to create the structure as below:
HDFS_HOME\2012\09\17\ACC1\Test_20120917_ACC1.csv
HDFS_HOME\2012\09\16\ACC2\Test_20120916_ACC2.csv

Can someone give me pointers on how to start on this? Highly appreciated. Thanks for reading the question.

Thanks,
DW
