Feature #20876
Updated by ioquatix (Samuel Williams) 12 months ago
This is an evolution of the previous proposal: https://bugs.ruby-lang.org/issues/20855
## Background
The current Fiber Scheduler performance can be significantly impacted by blocking operations that cannot be deferred to the event loop, particularly in high-concurrency environments where Fibers rely on non-blocking operations for efficient task execution.
## Proposal
Pull Request: https://github.com/ruby/ruby/pull/12016
We will introduce a new fiber scheduler hook called `blocking_operation_work`:
```ruby
class MySchduler
# ...
def blocking_operation_wait(work)
# Example implementation:
Thread.new(&work).join
end
end
```
We introduce a new flag for `rb_nogvl`: `RB_NOGVL_BLOCKING_OPERATION` which indicates that `rb_nogvl(func, ...)` is a blocking operation that is safe to execute on a different thread or thread pool.
When a C extension invokes `rb_nogvl(..., RB_NOGVL_BLOCKING_OPERATION)`, and a fiber scheduler is available, all the arguments will be saved into a instance of a callable object (at this time a `Proc`) called `work`. When `work` is `#call`ed, it will execute `rb_nogvl` again with all the same arguments.
The fiber scheduler can decide how to execute that work, e.g. on a separate thread, to mitigate the performance impact of the blocking operation on the event loop.

### Cancellation
`rb_nogvl` takes several arguments, a `func` for the actual work, and `unblock_func` to cancel `func` if possible. These arguments are preserved in the `work` proc, and cancellation works the same. However, some extra effort may be required in the fiber scheduler hook, e.g.
```ruby
class MySchduler
# ...
def blocking_operation_wait(work)
thread = Thread.new(&work)
thread.join
thread = nil
ensure
thread&.kill
end
end
```
## Example
Using the branch of `async` gem: https://github.com/socketry/async/pull/352/files and enabling zlib deflate to use this feature, the following performance improvement was achieved:
```ruby
require "zlib"
require "async"
require "benchmark"
DATA = Random.new.bytes(1024*1024*100)
duration = Benchmark.measure do
Async do
10.times do
Async do
Zlib.deflate(DATA)
end
end
end
end
# Ruby 3.3.4: ~16 seconds
# Ruby 3.4.0 + PR: ~2 seconds.
```
To run this benchmark yourself, you must compile CRuby with these two PRs:
- https://github.com/ruby/ruby/pull/12016
- https://github.com/ruby/zlib/pull/88
In addition, enable `RB_NOGVL_BLOCKING_OPERATION` in `zlib.c`'s call to `rb_nogvl`.
Then, use this branch of async: https://github.com/socketry/async/pull/352