Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gh-91351: Fix some bugs in importlib handling of re-entrant imports #94504

Merged
merged 35 commits into from
Jan 21, 2023
Merged
Changes from 3 commits
Commits
Show all changes
35 commits
Select commit Hold shift + click to select a range
2a4d4f3
Add some implementation documentation
exarkun Jun 27, 2022
c9e33c7
Move _blocking_on management into a context manager
exarkun Jun 27, 2022
c5912bf
Support re-entrant imports in _BlockingOnManager
exarkun Jun 30, 2022
27ae2f0
Apply formatting and comment changes from review
exarkun Nov 28, 2022
1fcc78b
Merge remote-tracking branch 'origin/main' into 91351-importlib-reent…
exarkun Nov 28, 2022
5fd15d8
Rename _BlockingOnManager.tid as suggested by review
exarkun Nov 28, 2022
7a24f2c
flip the first two arguments to _has_deadlock as suggested by review
exarkun Nov 28, 2022
08892b4
Mark up parameters following PEP 257 as suggested by review
exarkun Nov 28, 2022
59b53c0
rename the `_blocking_on` parameter as suggested by review
exarkun Nov 28, 2022
20007c5
further document motivation for `blocking_on` parameter as suggested …
exarkun Nov 28, 2022
bad1d3c
Rename more _has_deadlocked parameters as suggested by review
exarkun Nov 28, 2022
2821fcf
Treat None and [] the same for this case as suggested by review
exarkun Nov 28, 2022
95c73cb
update old comment to refer to new names as suggested by review
exarkun Nov 28, 2022
49ff9dd
Make _ModuleLock.count a list of True as suggested by review
exarkun Nov 28, 2022
cd174a8
Adjust the check for a module lock being released as suggest by review
exarkun Nov 28, 2022
719b181
Finish the _BlockingOnManager.tid renaming
exarkun Nov 28, 2022
6e809cd
Fix renaming of `_blocking_on` parameter to `_has_deadlocked`
exarkun Nov 28, 2022
74cbccd
Apply review suggestion
exarkun Jan 6, 2023
c579533
Apply review suggestion
exarkun Jan 6, 2023
decb70b
Apply review suggestion
exarkun Jan 6, 2023
3c91cc3
Apply review suggestion
exarkun Jan 6, 2023
e032ae2
Apply review suggestion
exarkun Jan 6, 2023
dba393a
Apply review suggestion
exarkun Jan 6, 2023
25d554b
Apply review suggestion
exarkun Jan 6, 2023
44157a9
Apply review suggestion
exarkun Jan 6, 2023
f99ed46
Apply review suggestion
exarkun Jan 6, 2023
b6d21f8
Apply review suggestion
exarkun Jan 6, 2023
5643442
Apply review suggestion
exarkun Jan 6, 2023
92036a8
Apply review suggestion
exarkun Jan 6, 2023
bf14ce2
Apply review suggestion
exarkun Jan 6, 2023
1cc6033
Apply review suggestion
exarkun Jan 6, 2023
ab40737
Apply review suggestion
exarkun Jan 6, 2023
36082eb
news blurb
exarkun Jan 6, 2023
55fc599
Merge branch 'main' into 91351-importlib-reentrancy
exarkun Jan 6, 2023
b35f0a8
Apply suggestions from code review
exarkun Jan 14, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
246 changes: 210 additions & 36 deletions Lib/importlib/_bootstrap.py
Original file line number Diff line number Diff line change
Expand Up @@ -54,48 +54,180 @@ def _new_module(name):
# A dict mapping module names to weakrefs of _ModuleLock instances
# Dictionary protected by the global import lock
_module_locks = {}
# A dict mapping thread ids to _ModuleLock instances

# A dict mapping thread ids to lists of _ModuleLock instances. This maps a
# thread to the module locks it is blocking on acquiring. The values are
# lists because a single thread could perform a re-entrant import and be "in
# the process" of blocking on locks for more than one module. "in the
# process" because a thread cannot actually block on acquiring more than one
# lock but it can have set up bookkeeping that reflects that it intends to
# block on acquiring more than one lock.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
_blocking_on = {}


class _BlockingOnManager:
"""
A context manager responsible to updating ``_blocking_on`` to track which
threads are likely blocked on taking the import locks for which modules.
"""
exarkun marked this conversation as resolved.
Show resolved Hide resolved
def __init__(self, tid, lock):
# The id of the thread in which this manager is being used.
self.tid = tid
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# The _ModuleLock for a certain module which the running thread wants
# to take.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.lock = lock

def __enter__(self):
"""
Mark the running thread as waiting for the lock this manager knows
about.
"""
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# Interactions with _blocking_on are *not* protected by the global
# import lock here because each thread only touches the state that it
# owns (state keyed on its thread id). The global import lock is
# re-entrant (ie, a single thread may take it more than once) so it
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# wouldn't help us be correct in the face of re-entrancy either.

# First look up the module locks the running thread already intends to
# take. If this thread hasn't done an import before, it may not be
# present in the dict so be sure to initialize it in this case.
self.blocked_on = _blocking_on.setdefault(self.tid, [])

# Whether we are re-entering or not, add this lock to the list because
# now this thread is going to be blocked on it.
self.blocked_on.append(self.lock)
exarkun marked this conversation as resolved.
Show resolved Hide resolved

def __exit__(self, *args, **kwargs):
"""
Mark the running thread as no longer waiting for the lock this manager
knows about.
"""
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.blocked_on.remove(self.lock)


class _DeadlockError(RuntimeError):
pass



def _has_deadlock(seen, subject, tids, _blocking_on):
exarkun marked this conversation as resolved.
Show resolved Hide resolved
"""
Considering a graph where nodes are threads (represented by their id
as keys in ``blocking_on``) and edges are "blocked on" relationships
(represented by values in ``_blocking_on``), determine whether ``subject``
is reachable starting from any of the threads given by ``tids``.
exarkun marked this conversation as resolved.
Show resolved Hide resolved

:param seen: A set of threads that have already been visited.
:param subject: The thread id to try to reach.
:param tids: The thread ids from which to begin.
:param blocking_on: A dict representing the thread/blocking-on graph.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
"""
if subject in tids:
# If we have already reached the subject, we're done - signal that it
# is reachable.
return True

# Otherwise, try to reach the subject from each of the given tids.
for tid in tids:
blocking_on = _blocking_on.get(tid)
if blocking_on is None:
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# There are no edges out from this node, skip it.
continue

if tid in seen:
# bpo 38091: the chain of tid's we encounter here
# eventually leads to a fixpoint or a cycle, but
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# does not reach 'me'. This means we would not
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# actually deadlock. This can happen if other
# threads are at the beginning of acquire() below.
return False
seen.add(tid)

# Follow the edges out from this thread.
edges = [lock.owner for lock in blocking_on]
if _has_deadlock(seen, subject, edges, _blocking_on):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it worth doing this recursively? From the one use of the function below it looks like reasonable defaults could be provided and then you would use a while loop until the candidate threads to check is exhausted. You could then have an else clause that returns False as the default result.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree it could be implemented iteratively. This seems like a purely stylistic choice though. Unless there is a stronger motivation I would prefer not to rewrite it.

return True

return False


class _ModuleLock:
"""A recursive lock implementation which is able to detect deadlocks
(e.g. thread 1 trying to take locks A then B, and thread 2 trying to
take locks B then A).
"""

def __init__(self, name):
self.lock = _thread.allocate_lock()
# Create an RLock for protecting the import process for the
# corresponding module. Since it is an RLock a single thread will be
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# able to take it more than once. This is necessary to support
# re-entrancy in the import system that arises from (at least) signal
# handlers and the garbage collector. Consider the case of:
#
# import foo
# -> ...
# -> importlib._bootstrap._ModuleLock.acquire
# -> ...
# -> <garbage collector>
# -> __del__
# -> import foo
# -> ...
# -> importlib._bootstrap._ModuleLock.acquire
# -> _BlockingOnManager.__enter__
#
# If a different thread than the running thread holds the lock then it
# will have to block on taking it which is just what we want for
# thread safety.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.lock = _thread.RLock()
self.wakeup = _thread.allocate_lock()

# The name of the module for which this is a lock.
self.name = name

# Either None if this lock is not owned by any thread or the thread
# identifier for the owning thread.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.owner = None
self.count = 0
self.waiters = 0

# This is a count of the number of times the owning thread has
# acquired this lock. This supports RLock-like ("re-entrant lock")
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# behavior, necessary in case a single thread is following a circular
# import dependency and needs to take the lock for a single module
# more than once.
#
# Counts are represented as a list of None because list.append(None)
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# and list.pop() are both atomic and thread-safe and it's hard to find
# another primitive with the same properties.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.count = []

# This is a count of the number of threads that are blocking on
# `self.wakeup.acquire()` to try to get their turn holding this module
exarkun marked this conversation as resolved.
Show resolved Hide resolved
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# lock. When the module lock is released, if this is greater than
# zero, it is decremented and `self.wakeup` is released one time. The
# intent is that this will let one other thread make more progress on
# acquiring this module lock. This repeats until all the threads have
# gotten a turn.
#
# This is incremented in `self.acquire` when a thread notices it is
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# going to have to wait for another thread to finish.
#
# See the comment above count for explanation of the representation.
self.waiters = []

def has_deadlock(self):
# Deadlock avoidance for concurrent circular imports.
me = _thread.get_ident()
tid = self.owner
seen = set()
while True:
lock = _blocking_on.get(tid)
if lock is None:
return False
tid = lock.owner
if tid == me:
return True
if tid in seen:
# bpo 38091: the chain of tid's we encounter here
# eventually leads to a fixpoint or a cycle, but
# does not reach 'me'. This means we would not
# actually deadlock. This can happen if other
# threads are at the beginning of acquire() below.
return False
seen.add(tid)
# To avoid deadlocks for concurrent or re-entrant circular imports,
# look at the "blocking on" state to see if any threads are blocking
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# on getting the import lock for any module for which the import lock
# is held by this thread.
return _has_deadlock(
seen=set(),
# Try to find this thread
exarkun marked this conversation as resolved.
Show resolved Hide resolved
subject=_thread.get_ident(),
# starting from the thread that holds the import lock for this
# module.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
tids=[self.owner],
# using the global "blocking on" state.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
_blocking_on=_blocking_on,
)

def acquire(self):
"""
Expand All @@ -104,35 +236,77 @@ def acquire(self):
Otherwise, the lock is always acquired and True is returned.
"""
tid = _thread.get_ident()
_blocking_on[tid] = self
try:
with _BlockingOnManager(tid, self):
while True:
# Protect interaction with state on self with a per-module
# lock. This makes it safe for more than one thread to try to
# acquire the lock for a single module at the same time.
with self.lock:
if self.count == 0 or self.owner == tid:
if self.count == [] or self.owner == tid:
# If the lock for this module is unowned then we can
# take the lock immediately and succeed. If the lock
# for this module is owned by the running thread then
# we can also allow the acquire to succeed. This
# supports circular imports (thread T imports module A
# which imports module B which imports module A).
self.owner = tid
self.count += 1
self.count.append(None)
return True

# At this point we know the lock is held (because count !=
# 0) by another thread (because owner != tid). We'll have
# to get in line to take the module lock.

# But first, check to see if this thread would create a
# deadlock by acquiring this module lock. If it would
# then just stop with an error.
#
# XXX It's not clear who is expected to handle this error.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# There is one handler in _lock_unlock_module but many
# times this method is called when entering the context
# manager _ModuleLockManager instead - so _DeadlockError
# will just propagate up to application code.
#
# This seems to be more than just a hypothetical -
# https://stackoverflow.com/questions/59509154
# https://github.com/encode/django-rest-framework/issues/7078
if self.has_deadlock():
raise _DeadlockError('deadlock detected by %r' % self)
exarkun marked this conversation as resolved.
Show resolved Hide resolved

# Check to see if we're going to be able to acquire the
# lock. If we are going to have to wait then increment
# the waiters so `self.release` will know to unblock us
# later on. We do this part non-blockingly so we don't
# get stuck here before we increment waiters. We have
# this extra acquire call (in addition to the one below,
# outside the self.lock context manager) to make sure
# self.wakeup is held when the next acquire is called (so
# we block). This is probably needlessly complex and we
# should just take self.wakeup in the return codepath
# above.
if self.wakeup.acquire(False):
self.waiters += 1
# Wait for a release() call
self.waiters.append(None)

# Now blockingly take the lock. This won't complete until the
# thread holding this lock (self.owner) calls self.release.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.wakeup.acquire()

# Taking it has served its purpose (making us wait) so we can
exarkun marked this conversation as resolved.
Show resolved Hide resolved
# give it up now. We'll take it non-blockingly again on the
# next iteration around this while loop.
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.wakeup.release()
finally:
del _blocking_on[tid]

def release(self):
tid = _thread.get_ident()
with self.lock:
if self.owner != tid:
raise RuntimeError('cannot release un-acquired lock')
assert self.count > 0
self.count -= 1
if self.count == 0:
assert len(self.count) > 0
self.count.pop()
if len(self.count) == 0:
exarkun marked this conversation as resolved.
Show resolved Hide resolved
self.owner = None
if self.waiters:
self.waiters -= 1
if len(self.waiters) > 0:
self.waiters.pop()
self.wakeup.release()

def __repr__(self):
Expand Down