Can anybody help here? On 22 July 2015 at 10:38, Jeetendra Gangele <gangele...@gmail.com> wrote:
> Hi All, > > I am trying to capture the user activities for real estate portal. > > I am using RabbitMS and Spark streaming combination where all the Events I > am pushing to RabbitMQ and then 1 secs micro job I am consuming using Spark > streaming. > > Later on I am thinking to store the consumed data for analytics or near > real time recommendations. > > Where should I store this data in Spark RDD itself and using SparkSQL > people can query this data for analytics or real time recommendations, this > data is not huge currently its 10 GB per day. > > Another alternatiove will be either Hbase or Cassandra, which one will be > better? > > Any suggestions? > > > Also for this use cases should I use any existing big data platform like > hortonworks or I can deploy standalone spark cluster ? >