n Thu, Jan 28, 2016 at 4:41 AM, Philip Lee wrote:
> Hi,
>
> Simple Question about Spark Distribution of Small Dataset.
>
> Let's say I have 8 machine with 48 cores and 48GB of RAM as a cluster.
> Dataset (format is ORC by Hive) is so small like 1GB, but I copied it to
>
Hi,
Simple Question about Spark Distribution of Small Dataset.
Let's say I have 8 machine with 48 cores and 48GB of RAM as a cluster.
Dataset (format is ORC by Hive) is so small like 1GB, but I copied it to
HDFS.
1) if spark-sql run the dataset distributed on HDFS in each machine, what
ha