That's a good point. The HDFS setup is definitely not a hard requirement. However, Flink on a cluster does only work together with a storage system that can be accessed in parallel. HDFS is likely the most popular choice for that.
The HDFS setup could be marked as optional in the cluster setup. 2014-10-16 15:16 GMT+02:00 杨洪波 <[email protected]>: > > > thanks. maybe i need to build flink myself and then setup in cluster > mode without hdfs . > > > why we need hdfs setup in "cluster setup ? ( it said "This involves > two steps. First, installing and configuring Flink and second installing > and configuring the Hadoop Distributed Filesystem (HDFS)." ) > > > local setup & cluster setup without hdfs & cluster setup with hdfs & yarn > setup ? > > > > > > > > At 2014-10-16 18:27:08, "Kostas Tzoumas" <[email protected]> wrote: > >No, you don't need to install HDFS. You can use Flink without HDFS. > > > >On Thu, Oct 16, 2014 at 12:17 PM, Robert Metzger <[email protected]> > >wrote: > > > >> Hi, > >> > >> good to hear that you are using Flink. > >> > >> HDFS is a distributed file system for reliably storing huge amounts of > >> data. Many users of Flink use HDFS to store all kinds of data on it. > >> This includes both input data for their jobs and also results. > >> Often, Flink and HDFS are installed next to each other in a cluster so > that > >> the same machines that contain the data also process it. > >> > >> An example for you could be that you store the data from MySQL in HDFS. > Or > >> you could join data from MySQL with data in HDFS. > >> > >> > >> Let us know if you have more questions. We are happy to help! > >> > >> > >> > >> On Thu, Oct 16, 2014 at 9:56 AM, Márton Balassi < > [email protected]> > >> wrote: > >> > >> > Flink does not require HDFS if you decide not to use it. The version > you > >> > are currently using should fit you then. You might change it if you > >> need/do > >> > not need YARN support. > >> > > >> > On Thu, Oct 16, 2014 at 4:27 AM, 杨洪波 <[email protected]> wrote: > >> > > >> > > hey all, > >> > > i am new in flink and i have a question: i use flink cluster to > read > >> > > mysql datasource,my biz don't need hdfs, do i must set HDFS in my > flink > >> > > cluster? if yes,then for what flink use it ? > >> > > thanks! > >> > > >> >
