I ran into problems using 5.19 so I referred to 5.17 and it resolved my
issues.

On Wed, Nov 28, 2018 at 2:48 AM Conrad Lee <con...@parsely.com> wrote:

> Hello Vadim,
>
> Interesting.  I've only been running this job at scale for a couple weeks
> so I can't say whether this is related to recent EMR changes.
>
> Much of the EMR-specific code for spark has to do with writing files to
> s3.  In this case I'm writing files to the cluster's HDFS though so my
> sense is that this is a spark issue, not an EMR (but I'm not sure).
>
> Conrad
>
> On Tue, Nov 27, 2018 at 5:21 PM Vadim Semenov <va...@datadoghq.com> wrote:
>
>> Hey Conrad,
>>
>> has it started happening recently?
>>
>> We recently started having some sporadic problems with drivers on EMR
>> when it gets stuck, up until two weeks ago everything was fine.
>> We're trying to figure out with the EMR team where the issue is coming
>> from.
>> On Tue, Nov 27, 2018 at 6:29 AM Conrad Lee <con...@parsely.com> wrote:
>> >
>> > Dear spark community,
>> >
>> > I'm running spark 2.3.2 on EMR 5.19.0.  I've got a job that's hanging
>> in the final stage--the job usually works, but I see this hanging behavior
>> in about one out of 50 runs.
>> >
>> > The second-to-last stage sorts the dataframe, and the final stage
>> writes the dataframe to HDFS.
>> >
>> > Here you can see the executor logs, which indicate that it has finished
>> processing the task.
>> >
>> > Here you can see the thread dump from the executor that's hanging.
>> Here's the text of the blocked thread.
>> >
>> > I tried to work around this problem by enabling speculation, but
>> speculative execution never takes place.  I don't know why.
>> >
>> > Can anyone here help me?
>> >
>> > Thanks,
>> > Conrad
>>
>>
>>
>> --
>> Sent from my iPhone
>>
>

Reply via email to