Hello everyone, I have successfully installed Shark 0.9 and Spark 0.9 in standalone mode in a cluster of 6 nodes for testing purposes.
I would like to use the Shark API in Spark programs. So far I have only found the following:

    $ ./bin/shark-shell
    scala> val youngUsers = sc.sql2rdd("SELECT * FROM users WHERE age < 20")
    scala> println(youngUsers.count)
    ...
    scala> val featureMatrix = youngUsers.map(extractFeatures(_))
    scala> kmeans(featureMatrix)

Is there a more complete code sample showing how to start a program that uses the Shark API in Spark?

Thanks!
Jerry