Hi, I'm setting up Hadoop and Pig to read data out of a Cassandra cluster. I have TaskTrackers installed on each Cass node, and a separate NameNode/JobTracker. Where does Pig fit into the puzzle? Should it be installed on the Namenode? Or on one of the Cass nodes with the TaskTracker?
Thanks, Gabriel Rosendorf
