What is the best way to bring such a huge file from an FTP server into Hadoop and persist it in HDFS? Since a single JVM process might run out of memory, I was wondering if I could use Spark or Flume to do this. Any help on this matter is appreciated. I would prefer an application/process running inside Hadoop to do this transfer. Thanks.
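For illustration only (not from the thread itself): one way to keep a single JVM's memory bounded is to stream the FTP source directly into an HDFS output stream in fixed-size chunks instead of buffering the whole file. Below is a minimal Java sketch of that idea; the host, credentials, and paths are placeholders, and it assumes the Hadoop client and Apache Commons Net libraries are on the classpath.

```java
import org.apache.commons.net.ftp.FTP;
import org.apache.commons.net.ftp.FTPClient;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

import java.io.IOException;
import java.io.InputStream;

public class FtpToHdfs {
    public static void main(String[] args) throws IOException {
        FTPClient ftp = new FTPClient();
        ftp.connect("ftp.example.com");                // placeholder host
        ftp.login("user", "password");                 // placeholder credentials
        ftp.enterLocalPassiveMode();
        ftp.setFileType(FTP.BINARY_FILE_TYPE);

        Configuration conf = new Configuration();      // picks up core-site.xml / hdfs-site.xml
        FileSystem hdfs = FileSystem.get(conf);

        // Copy in fixed-size chunks: heap usage stays bounded by the buffer size,
        // regardless of how large the source file is.
        try (InputStream in = ftp.retrieveFileStream("/data/huge-file.dat");          // placeholder path
             FSDataOutputStream out = hdfs.create(new Path("/ingest/huge-file.dat"))) // placeholder path
        {
            byte[] buffer = new byte[8 * 1024 * 1024]; // 8 MB chunks
            int read;
            while ((read = in.read(buffer)) != -1) {
                out.write(buffer, 0, read);
            }
        }
        ftp.completePendingCommand();                  // finish the FTP transfer cleanly
        ftp.logout();
        ftp.disconnect();
    }
}
```

This is a single-process sketch; whether a tool like Flume, distcp, or a Spark job is a better fit depends on the cluster setup being asked about.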
- Setting up Spark/Flume/? to Ingest 10TB from FTP - Varadhan, Jawahar
- Re: Setting up Spark/Flume/? to Ingest 10TB from FTP - Marcelo Vanzin