Re: about hdfs in flink cluster

Robert Metzger Thu, 16 Oct 2014 03:19:32 -0700

Hi,

good to hear that you are using Flink.

HDFS is a distributed file system for reliably storing huge amounts of
data. Many users of Flink use HDFS to store all kinds of data on it.
This includes both input data for their jobs and also results.
Often, Flink and HDFS are installed next to each other in a cluster so that
the same machines that contain the data also process it.

An example for you could be that you store the data from MySQL in HDFS. Or
you could join data from MySQL with data in HDFS.

Let us know if you have more questions. We are happy to help!

On Thu, Oct 16, 2014 at 9:56 AM, Márton Balassi <[email protected]>
wrote:

> Flink does not require HDFS if you decide not to use it. The version you
> are currently using should fit you then. You might change it if you need/do
> not need YARN support.
>
> On Thu, Oct 16, 2014 at 4:27 AM, 杨洪波 <[email protected]> wrote:
>
> > hey all，
> > i am new in flink and i have a question：  i use flink cluster to read
> > mysql datasource，my biz don't need hdfs， do i must set HDFS in my flink
> > cluster? if yes,then for what flink use it ?
> > thanks!
>

Re: about hdfs in flink cluster

Reply via email to