According to the api docs for the pipe operator, def pipe(command: String): RDD <http://spark.apache.org/docs/1.0.0/api/scala/org/apache/spark/rdd/RDD.html> [String]: Return an RDD created by piping elements to a forked external process. However, its not clear to me:
Will the outputted RDD capture the standard out from the process as its output (i assume that is the most common implementation)? Incidentally, I have not been able to use the pipe command to run an external process yet, so any hints on that would be appreciated. -- jay vyas