Dear Wiki user, You have subscribed to a wiki page or wiki category on "Cassandra Wiki" for change notification.
The "HadoopSupport" page has been changed by JonathanEllis.
http://wiki.apache.org/cassandra/HadoopSupport?action=diff&rev1=6&rev2=7

--------------------------------------------------

  }}}

  Cassandra also provides a [[http://hadoop.apache.org/pig/|Pig]] `LoadFunc` for running jobs in the Pig DSL instead of writing Java code by hand. This is in https://svn.apache.org/repos/asf/cassandra/trunk/contrib/pig/.

- Cassandra's splits are location-aware (this is the nature of the Hadoop InputSplit design). Cassandra gives hadoop a list of locations with the data, and Hadoop tries to schedule jobs on instances near that data, which in practice means you should have Hadoop instances on each of your Cassandra machines.
+ Cassandra's splits are location-aware (this is the nature of the Hadoop InputSplit design). Cassandra gives hadoop a list of locations with each split of data, and Hadoop tries to schedule jobs on instances near that data, which in practice means you should have Hadoop instances on each of your Cassandra machines.
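
The location-aware scheduling described in the changed paragraph can be illustrated with a small simulation. This is only a sketch under stated assumptions, not Hadoop's or Cassandra's actual scheduler: in Hadoop the hosts for a split come from `InputSplit.getLocations()`, while the `Split` class, the `assign` function, and the host names below are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Split:
    """Hypothetical stand-in for an input split: a token range plus the hosts holding it."""
    token_range: str
    locations: tuple  # hosts that store this split's data


def assign(splits, workers):
    """Assign each split to a worker, preferring a worker on a host that holds the data.

    Returns a dict mapping token_range -> (worker, is_local).
    Falls back to the least-loaded worker when no data-local worker exists.
    """
    load = {w: 0 for w in workers}
    assignments = {}
    for split in splits:
        local = [w for w in workers if w in split.locations]
        candidates = local or list(workers)
        chosen = min(candidates, key=lambda w: load[w])
        load[chosen] += 1
        assignments[split.token_range] = (chosen, bool(local))
    return assignments


splits = [
    Split("0-100", ("cass1", "cass2")),
    Split("100-200", ("cass3",)),
]

# Workers co-located with the Cassandra hosts: every task can read locally.
colocated = assign(splits, ["cass1", "cass3"])

# Workers on separate machines: every task has to read over the network.
separate = assign(splits, ["hadoop9"])
```

With workers on the same hosts as the data, every split in `colocated` is marked local; with a worker on a separate machine, none in `separate` is. That is the practical reason for running Hadoop instances on each Cassandra machine.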
