Hi Ben,
Tachyon can be used to share data between spark jobs. If you specify the
input to your jobs as a Tachyon path, you can leverage Tachyon's memory
centric storage on reads, improving the performance when reading the same
dataset multiple times. The examples on this page may be helpful:
http:
Hi,
After reading some documentations about spark and ignite,
I am wondering if shared RDD from ignite can be used to share data in
memory without any duplication between multiple spark jobs.
Running on mesos I can collocate them, but will this be enough to avoid
memory duplication or not?
I am als
Hi,
After reading some documentations about spark and ignite,
I am wondering if shared RDD from ignite can be used to share data in
memory without any duplication between multiple spark jobs.
Running on mesos I can collocate them, but will this be enough to avoid
memory duplication or not?
I am als
Hi,
After reading some documentations about spark and ignite,
I am wondering if shared RDD from ignite can be used to share data in
memory without any duplication between multiple spark jobs.
Running on mesos I can collocate them, but will this be enough to avoid
memory duplication or not?
I am als