Bug #20493
closedSegfault on rb_io_getline_fast
Description
We've spotted a consistent segfault when running bundle install with --jobs 4
When running: bundle install -j 4
we'd get a Segfault at:
/usr/lib64/ruby/3.3.0/rubygems/ext/builder.rb:93: [BUG] Segmentation fault at 0x0000000000000014
ruby 3.3.1 (2024-04-23 revision c56cd86388) [x86_64-linux-gnu]
Full log is available here.
I could not find a shorter reproducer besides using bundler with --jobs 4
or
--jobs 8
.
Here's a sample command to trigger the behavior (it creates the Gemfile and
calls bundler) 1.
We installed all debug symbols and narrowed down the location of the segfault to
rb_io_getline_fast
in io.c
At line 4001 str
is T_NONE
, which makes further usage down the line in
io_enc_str
raise a null pointer dereference.
With the notes from extension.rdoc - Appendix E. RB_GC_GUARD to protect from premature GC I've prepared a patched ruby 3.3.1 package that does not
segfault. It's on OBS Project home:josegomezr:branches:ruby/ruby3.3.
Adding a RB_GC_GUARD
on rb_io_getline_fast
@ io.c:4004
just before the return
--- ruby3.3.orig/ruby-3.3.1/io.c
+++ ruby3.3/ruby-3.3.1/io.c
@@ -4004,6 +4004,7 @@ rb_io_getline_fast(rb_io_t *fptr, rb_enc
ENC_CODERANGE_SET(str, cr);
fptr->lineno++;
+ RB_GC_GUARD(str);
return str;
}
Fixes the segfault in our tests. bundle
finish the installation and the image is built.
I've set up a project in OBS to provide reproduceables.
- ruby3.3.1 package.
- ruby3.3.1 base image with enough dependencies to reproduce with the reproducer script.
And the corresponding container is exported in the containers-patched
repository.
Here I leave the docker images generated by OBS:
- 3.3.1 [without patches, segfaults.]
registry.opensuse.org/home/josegomezr/branches/ruby/containers/containers/base-ruby33:latest
- 3.3.1 [with patch, does not fail]
registry.opensuse.org/home/josegomezr/branches/ruby/containers/containers-patched/base-ruby33:latest
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
- Assignee set to kjtsanaktsidis (KJ Tsanaktsidis)
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
Thank you for your bug report and excellent reproduction! I've had a look at this today - I haven't reached a conclusion yet, but I want to write down where I got so far.
The crash is happening because:
- One thread is calling
rb_io_getline_fast
- The VALUE variable
str
is saved in register%r15
-
rb_io_getline_fast
calls a chain of functions that eventually yields the GVL and performs a syscall - None of these other functions spill
%r15
to the stack. It's present only in%r15
for the duration of the blocking system call. - Another Ruby thread runs and performs a GC
- As part of that, it marks
ec->machine
for all other threads. This includes a representation of that thread's register state injmpbuf
. - However, the value for
str
seems not to be injmpbuf
when it performs GC marking.
In theory, we're supposed to capture Ruby VALUE pointers only present in machine registers by (ab)using setjmp
to capture an opaque blob, which should include all of the values of the machine's registers at the point setjmp
was called. But that seems not to be happening here.
Your patch works around the problem by explicitly spilling str
to the stack with RB_GC_GUARD, so that the other thread can see it. However, this shouldn't be nescessary. I believe the C extension rules here are that an RB_GC_GUARD is only necessary when you dereference and use part of a VALUE but don't retain a reference to the whole thing - i.e. code like this needs a guard:
VALUE str = rb_str_new_cstr(...);
some_c_api(RSTRING_PTR(str), RSTRING_LEN(str)); // actually does ((struct RString *)str)->as.heap.ptr and ((struct RString *)str)->as.heap.len
something_which_might_gc();
some_other_api(RSTRING_PTR(str), RSTRING_LEN(str));
RB_GC_GUARD(str); // nescessary here!
Code like this shouldn't need a guard, i think, and plenty such code exists without a guard
VALUE str = rb_str_new_cstr(...);
some_c_api(RSTRING_PTR(str), RSTRING_LEN(str)); // actually does ((struct RString *)str)->as.heap.ptr and ((struct RString *)str)->as.heap.len
something_which_might_gc();
some_other_api(RSTRING_PTR(str), RSTRING_LEN(str));
return str; // Not guard needed; we still use str after the GC, so the compiler will necessarily keep the value of it around on the stack or in a register
Of course, this whole scheme could collapse if the compiler decided to do something like increment str
in-place and then decrement it later, but it seems to work well enough in practice :/
So I'm going to keep investigating why the str
is not being captured in ec->machine.jmpbuf
.
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
OK, I believe I've worked out what's wrong here.
The BLOCKING_REGION macro is used to release the GVL, run some code, and re-acquire the GVL. It looks like this:
#define BLOCKING_REGION(th, exec, ubf, ubfarg, fail_if_interrupted) do { \
struct rb_blocking_region_buffer __region; \
if (blocking_region_begin(th, &__region, (ubf), (ubfarg), fail_if_interrupted) || \
/* always return true unless fail_if_interrupted */ \
!only_if_constant(fail_if_interrupted, TRUE)) { \
exec; \
blocking_region_end(th, &__region); \
}; \
} while(0)
blocking_region_begin
calls RB_VM_SAVE_MACHINE_CONTEXT
, which:
- Updates the "end of machine stack" pointer in
ec->machine
to be the current stack pointer, - Abuses
setjmp
to save the register state in theec->machine
structure as well
The idea being that while the thread is off doing things in C-land, another thread performing GC can mark the portion of the machine stack from before the BLOCKING_REGION
call.
The problem is that the disassembly for blocking_region_begin
looks like this:
(rr) disassemble
Dump of assembler code for function blocking_region_begin:
0x000074794fa36900 <+0>: push %r15
0x000074794fa36902 <+2>: push %r14
0x000074794fa36904 <+4>: mov %rdx,%r14
0x000074794fa36907 <+7>: push %r13
0x000074794fa36909 <+9>: push %r12
0x000074794fa3690b <+11>: mov %rsi,%r12
0x000074794fa3690e <+14>: push %rbp
0x000074794fa3690f <+15>: push %rbx
i.e. it pushes several of the callee-saved registers to the stack, and uses them for its own purposes - including %r15
, which had the value of str
from the rb_io_getline_fast
frame below. When it calls setjmp
to capture the register state, the value of %r15
that's saved is the value used in this function, not the old value.
That would be OK, because it's spilled to the stack, and the "end of stack pointer" is also incremented, so it should be reachable by the GC that way. However, blocking_region_begin
is a function, and it's not inlined, the machine stack pointer is restored at the function end. Then, the memory where %r15
and hence str
were spilled to gets re-used for the C code doing the without-the-GVL work.
I.e. the sequence of operations is:
- Call
blocking_region_begin
. Stack pointer is decremented (stack made bigger) - Callee-saved registers including the value of
str
are spilled to the stack. - %r15 is overwritten
- Current $sp is set as EC's "end of the stack" value.
- Registers are saved to EC's regs structure
-
blocking_region_begin
returns. Stack pointer is incremented (stack made smaller). The memory containing the value ofstr
now lies beyond the end of the stack. The real value ofstr
is returned to %r15, because it's callee-saved. - More work happens. The stack is made bigger again by other functions, and junk is written to that stack slot
- GC marking then can't see the value of
str
A "quick fix" for this is to inline the saving of machine stack state into the BLOCKING_REGION
macro. That way, it's not in a separate function call, and so the stack frame containing the spilled registers is not invalidated until after the blocking completes.
Longer term, though, I'd like to rethink how this works a bit. I think BLOCKING_REGION
being a kind of "macro accepting a block" is a bad idea, because it mixes up the careful stack management that it has to do with the stack state of the caller. I think it should be converted to use a function pointer callback like rb_thread_call_without_gvl
etc, and we should also consider using hand-written assembly to perform the stack management I think (like what we use in the fiber switching code).
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
I opened https://github.com/ruby/ruby/pull/10795 as a proposed fix for this. Let's see what other people (and CI) think :)
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
After applying that fix to the RPM spec file in https://build.opensuse.org/package/show/devel:languages:ruby/ruby3.3, I can't get the reproduction to crash any more.
Updated by josegomezr (Jose Gomez) 6 months ago
Thanks for this extremely detailed explanation!
Just for my own understanding [I'm not as proficient in low level C, took me a while reading through all details], can I assert that: the fix I suggested would've only "fixed" the segfault on a reduced scope [what I understand are the C definition of the ruby IO#read*
methods] but wouldn't solve it across the board because the root cause lies on how the context is saved/restored when yielding execution to C-land functions?
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
You're welcome, thank you for posting such a detailed and actionable report. You're exactly right; your patch deals with this issue in IO methods using rb_io_getline_fast
but this is probably causing crashes in other places as well; hopefully I've fixed that.
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
- Backport changed from 3.1: UNKNOWN, 3.2: UNKNOWN, 3.3: UNKNOWN to 3.1: UNKNOWN, 3.2: REQUIRED, 3.3: REQUIRED
Closed by merging https://github.com/ruby/ruby/pull/10795
I opened two PR's to backport this to 3.2 & 3.3 (I think crashes like this should be backported?)
Updated by kjtsanaktsidis (KJ Tsanaktsidis) 6 months ago
- Status changed from Open to Closed
Updated by josegomezr (Jose Gomez) 6 months ago
Awesome! I was about to ask what was the plan for having this change in ruby 3.2/3.3.
I've submitted the patches to the projects in openSUSE with the contents of the Backport PR's to have our rubies fixed
Is there an estimation for the next ruby release?
Updated by k0kubun (Takashi Kokubun) 6 months ago · Edited
- Backport changed from 3.1: UNKNOWN, 3.2: REQUIRED, 3.3: REQUIRED to 3.1: UNKNOWN, 3.2: REQUIRED, 3.3: DONE
ruby_3_3 b6c07acedb3ca56471754a082b3db20bb863c92e.
Updated by k0kubun (Takashi Kokubun) 6 months ago
Is there an estimation for the next ruby release?
Today. I just released Ruby 3.3.2.
Updated by josegomezr (Jose Gomez) 6 months ago
k0kubun (Takashi Kokubun) wrote in #note-12:
Today. I just released Ruby 3.3.2.
Awesome! I forgot to respond when I read it. I see it propagated to OBS too, thanks all for your work & efforts!
Until the next bug! 😹
Updated by nagachika (Tomoyuki Chikanaga) 5 months ago
- Backport changed from 3.1: UNKNOWN, 3.2: REQUIRED, 3.3: DONE to 3.1: UNKNOWN, 3.2: DONE, 3.3: DONE
ruby_3_2 2f010f31f1887ad0f429709a2906fc5a4dae8e87.