Hello all, I have installed pseudo-distributed YARN and Spark. My Beam pipeline reads a file with TextIO, and it runs fine when I launch it with --master spark://master. However, I am having difficulty getting it to run with --master yarn. I am pretty sure that reading a local file with TextIO is what causes the issue under YARN. I looked into the Beam API (beam.sdk.io.hadoop) and the Spark documentation, but had no luck finding the right information. If you could nudge me in the right direction, that would be great! Thank you for your help.
Regards,
Mahesh

--
Mahesh Vangala
(Ph) 443-326-1957
(web) mvangala.com <http://mvangala.com>
