I'm new to Spark. I have a bunch of files in HDFS that contain tweets in JSON format. I want to read and parse these into an RDD so that I can do some interactive processing on the tweets.
Has anyone done something like this before? Is there an example? I thought I would ask before implementing one myself from scratch. Thanks -Soumya
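
For anyone landing on this thread, here is a rough sketch of one way this could look in PySpark. It assumes each line of each file holds exactly one JSON-encoded tweet, which may not match how the files were actually written. The names `parse_tweet` and `load_tweets` and the HDFS path in the usage note are made up for illustration; `sc` is assumed to be an existing SparkContext, such as the one the `pyspark` shell provides.

```python
import json


def parse_tweet(line):
    """Parse one line of text as JSON; return None if the line is malformed."""
    try:
        return json.loads(line)
    except ValueError:
        # json.JSONDecodeError is a subclass of ValueError, so truncated
        # or empty lines end up here instead of failing the whole job.
        return None


def load_tweets(sc, path):
    """Build an RDD of parsed tweets from line-delimited JSON files.

    `sc` is assumed to be a SparkContext and `path` a location that
    SparkContext.textFile can read (an HDFS directory or glob, for example).
    """
    lines = sc.textFile(path)            # RDD with one string per line
    parsed = lines.map(parse_tweet)      # RDD of parsed objects or None
    return parsed.filter(lambda tweet: tweet is not None)
```

In the `pyspark` shell this might be used as `tweets = load_tweets(sc, "hdfs:///path/to/tweets/*.json")` followed by something like `tweets.take(5)`, where the path is a placeholder. Dropping malformed lines silently is a design choice; counting them instead may be preferable if data quality matters.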
