you'll need a filesystem with

* consistency
* accessibility everywhere
* supports a binding through one of the hadoop fs connectors

NFS-style distributed filesystems work with file:// ; things like glusterfs 
need their own connectors.

you can use azure's wasb:// as a drop in replacement for HDFS in Azure.I think 
google cloud storage is similar, but haven't played with it. Ask google.

You cannot do the same for S3 except on EMR and Amazon's premium emrfs:// 
offering, which adds the consistency layer.



On 22 Jun 2017, at 00:50, Alaa Zubaidi (PDF) 
<alaa.zuba...@pdf.com<mailto:alaa.zuba...@pdf.com>> wrote:

Hi,

Can we run Spark on YARN with out installing HDFS?
If yes, where would HADOOP_CONF_DIR point to?

Regards,

This message may contain confidential and privileged information. If it has 
been sent to you in error, please reply to advise the sender of the error and 
then immediately permanently delete it and all attachments to it from your 
systems. If you are not the intended recipient, do not read, copy, disclose or 
otherwise use this message or any attachments to it. The sender disclaims any 
liability for such unauthorized use. PLEASE NOTE that all incoming e-mails sent 
to PDF e-mail accounts will be archived and may be scanned by us and/or by 
external service providers to detect and prevent threats to our systems, 
investigate illegal or inappropriate behavior, and/or eliminate unsolicited 
promotional e-mails (“spam”). If you have any concerns about this process, 
please contact us at legal.departm...@pdf.com<mailto:legal.departm...@pdf.com>.

Reply via email to