2.2 KiB
Background
In Python, multithreading is ineffective at concurrency for CPU bound tasks
due to the GIL (global interpreter lock). Extension modules can release
the GIL in CPU bound tasks, but that isn't an option in pure Python.
Users use libraries such as multiprocessing, subprocess, concurrent.futures.ProcessPoolExecutor,
etc, to work around the GIL. These modules call fork()
underneath the hood. Various issues have
been reported when using these modules with gRPC Python. gRPC Python wraps
gRPC core, which uses multithreading for performance, and hence doesn't support fork()
.
Historically, we didn't support forking in gRPC, but some users seemed
to be doing fine until their code started to break on version 1.6. This was
likely caused by the addition of background c-threads and a background
Python thread.
Current Status
1.11
The background Python thread was removed entirely. This allows forking after creating a channel. However, the channel must not have issued any RPCs prior to the fork. Attempting to fork with an active channel that has been used can result in deadlocks/corrupted wire data.
1.9
A regression was noted in cases where users are doing fork/exec. This
was due to pthread_atfork()
handler that was added in 1.7 to partially
support forking in gRPC. A deadlock can happen when pthread_atfork
handler is running, and an application thread is calling into gRPC.
We have provided a workaround for this issue by allowing users to turn
off the handler using env flag GRPC_ENABLE_FORK_SUPPORT=False
.
This should be set whenever a user expects to always call exec
immediately following fork. It will disable the fork handlers.
1.7
A pthread_atfork()
handler was added in 1.7 to automatically shut down
the background c-threads when fork was called. This does not shut down the
background Python thread, so users could not have any open channels when
forking.
Future Work
1.13
The workaround when using fork/exec by setting
GRPC_ENABLE_FORK_SUPPORT=False
should no longer be needed. Following
this PR, fork
handlers will not automatically run when multiple threads are calling
into gRPC.