Hi all,

We have a number of files, each 600-700 GB in size, that we want to bulk
upload into HDFS.

We are planning to use Apache NiFi to ingest this high volume of data.

Could you please advise whether Apache NiFi is the right technology for
ingesting data at this volume?

If we instead use hadoop fs -put to bulk upload these files into HDFS, do
you foresee any performance difference between hadoop fs -put and Apache
NiFi?

We also have a couple of further questions:

  *   What is the difference between NiFi and hadoop fs -put in terms of
architecture, performance, scalability, high availability, and fault tolerance?
  *   Is there any comparison of the two that covers supported features,
unsupported features, limitations, availability of a free GA version, and
availability for commercial use?
  *   Which Hadoop distributions are supported by NiFi?

Thanks & regards,
Rajib
