HTTP2Transport race may lead to deadlock #10

jshslt · 2017-05-22T20:14:59Z

Hi,
I think there's a race in HTTP2Transport that can lead to deadlock if the networkLoop thread tries to signal out a successful connection, while another thread has asked for the network thread to terminate (and is waiting to join).

full scenario -
A - if any client thread calls MessageRouter::disable. MessageRouter lock is taken. MessageRouter::disconnectAllTransportsLocked is called. the transports are iterated, HTTP2Transport::disconnect is called, which calls join() on the SDK network thread.

B - SDK network thread, when connection is just established, HTTP2Transport::networkLoop calls the onConnected callback MessageRouter::onConnected, which tries to take the MessageRouter lock.

if B happens while A is in process, deadlock.

please advise?

The text was updated successfully, but these errors were encountered:

sanjayrd · 2017-05-23T21:07:19Z

Thanks for pointing this out @jshslt. We haven't had a chance to dive deep into this yet, but we will definitely investigate the matter to see what we can do.

In the meantime, could you expand on the use case of this? More specifically, is this an issue that you actually foresee happening in your production code or just something you noticed as you were going through the code?

Thanks!
Sanjay

jshslt · 2017-05-23T21:12:36Z

@sanjayrd we happened to hit it for real. We've been getting a lot of SERVER_SIDE_DISCONNECTs, and sometimes the SDK appears to get in a PENDING->CONNECTED->PENDING->CONNECTED loop because of these - when we catch this starting to happen, we sometimes forcibly and fully disconnect the transport with MessageRouter::disconnect(), to rest a bit before trying to reconnect again.

garmin-coleman · 2017-05-29T17:43:55Z

@jshslt - FYI we were also seeing the PENDING->CONNECTED loop you mention. For us it would reliably happen if we accidentally had two clients using the same refresh token. The two clients would essentially ping-pong, causing the server to kick the other off when reconnecting.

JamieMeyers · 2017-06-02T17:05:05Z

We have confirmed the problem and are working on a fix.

Thanks,
Jamie

Changes in this update -Implemented Sensory wake word detector functionality -Removed the need for a std::recursive_mutex in MessageRouter -Added AIP unit test -Added handleDirectiveImmediately functionality to SpeechSynthesizer -Added memory profiles for: AIP SpeechSynthesizer ContextManager AVSUtils AVSCommon -Bug fix for MultipartParser.h compiler warning -Suppression of sensitive log data even in debug builds. Use cmake parameter -DACSDK_EMIT_SENSITIVE_LOGS=ON to allow logging of sensitive information in DEBUG builds -Fix crash in ACL when attempting to use more than 10 streams -Updated MediaPlayer to use autoaudiosink instead of requiring pulseaudio -Updated MediaPlayer build to suppport local builds of GStreamer -Fixes for the following Github issues: #5 #8 #9 #10 #17 #24

scotthea-amazon · 2017-06-09T23:56:34Z

A fix for this has been pushed in version 0.4.1.

JamieMeyers added the bug label Jun 2, 2017

yugoren added the ACL label Jun 2, 2017

scotthea-amazon self-assigned this Jun 8, 2017

JamieMeyers added this to the 0.4.1 milestone Jun 13, 2017

JamieMeyers closed this as completed Jun 13, 2017

jade-github mentioned this issue Oct 12, 2017

V1.1.0 Compile Error #209

Closed

kuodehai mentioned this issue Nov 1, 2017

make MediaPlayer test #285

Closed

This was referenced Dec 13, 2017

AuthDelegate Error for installing AVS SDK for arm target #287

Closed

avs-sdk-crash in speaking/thinking state #391

Closed

jie714 mentioned this issue Mar 13, 2018

v1.2 build errors with Android NDK #305

Closed

zeusshuang mentioned this issue Jun 28, 2018

Some issues under AVS Music Self Test #814

Closed

preth-2018 mentioned this issue Jul 31, 2018

setupPipelineFailed #833

Closed

6 tasks

gavinlwz mentioned this issue Aug 14, 2018

Cross compile the AVS Device SDK 1.8.1 for MIPS linux can not work #898

Closed

6 tasks

indrachatterjee86 mentioned this issue Sep 25, 2019

No volume control over BT for Iphone as there is no media syc property to set like some Android phones (Samsumg Galaxy A70) #1472

Open

brett-lynnes mentioned this issue Jan 22, 2020

AVS crash on null pointer in log message (SpeechSynthesizer) #1596

Closed

6 tasks

xlb767923274 mentioned this issue Nov 11, 2021

SampleApp crash on ubuntu 18.04 LTS #1994

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HTTP2Transport race may lead to deadlock #10

HTTP2Transport race may lead to deadlock #10

jshslt commented May 22, 2017

sanjayrd commented May 23, 2017

jshslt commented May 23, 2017

garmin-coleman commented May 29, 2017

JamieMeyers commented Jun 2, 2017

scotthea-amazon commented Jun 9, 2017

HTTP2Transport race may lead to deadlock #10

HTTP2Transport race may lead to deadlock #10

Comments

jshslt commented May 22, 2017

sanjayrd commented May 23, 2017

jshslt commented May 23, 2017

garmin-coleman commented May 29, 2017

JamieMeyers commented Jun 2, 2017

scotthea-amazon commented Jun 9, 2017