Re: spark shared RDD

2015-12-09 Thread Calvin Jia
Hi Ben, Tachyon can be used to share data between spark jobs. If you specify the input to your jobs as a Tachyon path, you can leverage Tachyon's memory centric storage on reads, improving the performance when reading the same dataset multiple times. The examples on this page may be helpful: http:

spark shared RDD

2015-11-10 Thread Ben
Hi, After reading some documentations about spark and ignite, I am wondering if shared RDD from ignite can be used to share data in memory without any duplication between multiple spark jobs. Running on mesos I can collocate them, but will this be enough to avoid memory duplication or not? I am als

spark shared RDD

2015-11-10 Thread Ben
Hi, After reading some documentations about spark and ignite, I am wondering if shared RDD from ignite can be used to share data in memory without any duplication between multiple spark jobs. Running on mesos I can collocate them, but will this be enough to avoid memory duplication or not? I am als

spark shared RDD

2015-11-09 Thread Ben
Hi, After reading some documentations about spark and ignite, I am wondering if shared RDD from ignite can be used to share data in memory without any duplication between multiple spark jobs. Running on mesos I can collocate them, but will this be enough to avoid memory duplication or not? I am als