[arangodb-google] Re: "Postmortem": using request in Foxx apps

Bart DS Thu, 30 Nov 2017 08:40:09 -0800

Hy Frank,

I'm sorry to reply to an old thread, but I'm also regularly seeing the SSL 
error "{communication} unable to perform ssl handshake: Connection reset by 
peer : 104".


Did you find a cause/solution for this error?

Another SSL related error that regularly appears is: "{communication} 
unable to perform ssl handshake: short read : 335544539"

Thanks,

Bart

On Wednesday, April 19, 2017 at 8:37:45 AM UTC+2, Frank Celler wrote:
>
> That message is issued by Boost ASIO when a client tries to connect to the 
> ArangoDB server and the initial SSL handshake failed. The message 
> "Connection reset by peer" is coming from Boost. I will google to see if 
> there is any explanation.
>
> Am Mittwoch, 19. April 2017 04:25:20 UTC+2 schrieb Thomas Weiss:
>>
>> Guys, it seems that my diagnostic was wrong... Looking through the logs 
>> I've just seen that the issues are still there, with slow queries and lock 
>> timeouts.
>> I've also seen the SSL trace many times: "{communication} unable to 
>> perform ssl handshake: Connection reset by peer : 104".
>>
>> Maybe we should start with that SSL trace: in which situations does 
>> ArangoDB output that log?
>>
>> Thanks,
>> Thomas
>>
>> On Tuesday, April 18, 2017 at 8:41:33 PM UTC+8, Thomas Weiss wrote:
>>>
>>> Also if it can help, it happened with 3.1.15 on Ubuntu 16.04
>>>
>>> On Tuesday, April 18, 2017 at 7:46:12 PM UTC+8, Frank Celler wrote:
>>>>
>>>> Thomas has shared with me a (private) Azure account we can try. Will 
>>>> post the result here.
>>>>
>>>> Am Dienstag, 18. April 2017 13:40:46 UTC+2 schrieb Jan:
>>>>>
>>>>> Hi Thomas,
>>>>>
>>>>> thanks for the analysis you did! 
>>>>> That means you are connecting to Azure Table Storage from Foxx via the 
>>>>> request module and SSL, right? Which SSL protocol are you using to 
>>>>> connect 
>>>>> to it?
>>>>> And the problem seems to happen (not confirmed) when Azure Table 
>>>>> Storage has higher response time than usual? 
>>>>>
>>>>> And do you happen to remember who answered what and when on Slack 
>>>>> regardings the TLS support changes? AFAIK we fixed a few bugs in the TLS 
>>>>> code in 3.1 recently, but I am not aware of any changes that introduced 
>>>>> new 
>>>>> issues there. And TLS support should have been there in 3.0 already. So I 
>>>>> am wondering if you could provide some more info on this.
>>>>>
>>>>> Thanks!
>>>>> Jan
>>>>>
>>>>> Am Montag, 17. April 2017 10:50:21 UTC+2 schrieb Thomas Weiss:
>>>>>>
>>>>>> Hi everyone,
>>>>>>
>>>>>> I just wanted to share with you my recent experience in 
>>>>>> troubleshooting strange problems.
>>>>>>
>>>>>> Background: This project uses Foxx where most of the app logic is 
>>>>>> implemented. From Foxx functions, I used the request module to post 
>>>>>> events 
>>>>>> to Azure Table Storage.
>>>>>>
>>>>>> Everything was really working fine until ~2 weeks ago when I started 
>>>>>> to notice that my ArangoDB instances would sometimes go through some 
>>>>>> "apnea" with:
>>>>>> - requests taking a long time to run (many minutes!)
>>>>>> - lock timeouts in Foxx transactions
>>>>>> - general performance degradation with the web dashboard not available
>>>>>> Those issues would last for 10 to 15 minutes and everything would get 
>>>>>> back to normal.
>>>>>>
>>>>>> I first suspected my code to be at fault and spent a lot of time 
>>>>>> trying to figure out what triggered those problems. But then I found out 
>>>>>> that:
>>>>>> - both staging and production environments were impacted, but they 
>>>>>> were not running the same version of my app (and the prod was >1 week 
>>>>>> older)
>>>>>> - when those apnea happen, I would sometimes get error logs about SSL 
>>>>>> handshakes
>>>>>> - (not confirmed) issues in prod and staging would happen 
>>>>>> approximately at the same time
>>>>>> - (not confirmed) issues would happen when the Azure Table Storage 
>>>>>> would have higher response time
>>>>>>
>>>>>> I asked on Slack about the SSL handshake thing and someone answered 
>>>>>> that there was a bug introduced with TLS support (which I guess was 
>>>>>> 3.1), 
>>>>>> and then it hit me that I upgraded my instances from 3.0.10 to 3.1.15 
>>>>>> not 
>>>>>> too long ago.
>>>>>>
>>>>>> So I decided to change the flow of events within the system (not a 
>>>>>> small change!) to avoid having Arango use the request module. This was 
>>>>>> deployed nearly a week ago, and I didn't have any problem since then!
>>>>>>
>>>>>> Cheers,
>>>>>> Thomas
>>>>>>
>>>>>

-- 
You received this message because you are subscribed to the Google Groups 
"ArangoDB" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

[arangodb-google] Re: "Postmortem": using request in Foxx apps

Reply via email to