Hi, I want to execute my pig script for multiple sub directories and then want that output should match with the input directorory structure.
Example /input/parent/child1/ /input/parent/child2/ /input/parent/child3/ etc Output should be /output/parent/child1/ /output/parent/child2/ /output/parent/child3/ Which Pig Storage format can i use. to explain it better I want to make sure that my pig script executes only under folder which doubt have any further childs. Basically my folder structure is hive partitions /input/parent/child1/YYYY/MM/DD /input/parent/child2/YYYY/MM/DD /input/parent/child3/YYYY/MM/DD I want to process all data in hive partitions and have result in format /output/parent/child1/YYYY/MM/DD /output/parent/child2/YYYY/MM/DD /output/parent/child3/YYYY/MM/DD Thanks in advance
