Re: Deploying Samza Jobs Using S3 and YARN on AWS

2017-09-23 Thread Jagadish Venkatraman
Glad that you found the issue, Xiaochuan! Should you decide to use the HttpFileSystem, please set the config for yarn.package.path to point to the HTTP URI of your job's binary. Do let us know should you hit snags down that path! Best, On Sat, Sep 23, 2017 at 5:24 PM XiaoChuan Yu wrote: > I fou

Re: Deploying Samza Jobs Using S3 and YARN on AWS

2017-09-23 Thread XiaoChuan Yu
I found out that it was necessary to include "hadoop-aws" as part of the package submitted to YARN, similar to the instructions for deploying from HDFS. However, due to a dependency conflict on the AWS SDK between ou

Re: Samza Job Slow to Restart

2017-09-23 Thread XiaoChuan Yu
>> How long does it take? It took around 10 minutes from "Got offset 0 for topic ... " to init() being called on the Task. >> Have you measured which parts of the start up sequence take the most time? >> - is it checkpoint restoration, or restore of local state? Should be checkpoint restoration. T