Feature #4990
closedProposal: Internal GC/memory subsystem API
Description
There is significant interest in improving/altering the performance, behavior and features of MRI's GC in 1.8 and 1.9 series.
Proposal: MRI should support an internal GC API -- to separate MRI core from its current GC implementation,
and provide hooks for additional features:
- Interfaces between MRI internals and any GC/allocator implementation:
- stock MRI GC
- malloc() without free() to support valgrind testing (or short-lived programs)
- variants of stock MRI GC (http://engineering.twitter.com/2011/03/building-faster-ruby-garbage-collector.html and REE)
- BDW (http://www.hpl.hp.com/personal/Hans_Boehm/gc/)
- other collectors (https://github.com/kstephens/smal)
- 
Support selecting GC implementations at run-time or compile time. 
- 
Support malloc() replacements, at run-time and/or compile time, such as: 
- tcmalloc
- jemalloc
- Support callback hooks in allocation and GC phases to orthogonally add features, such as:
- performant/correct WeakReferences and ReferenceQueues (http://redmine.ruby-lang.org/issues/4168).
- allocation tracing/debugging.
- instance caching (e.g.: Floats)
- computational caching.
- cache invalidation.
- metrics collection.
- Interfaces to common features of alternate GCs:
- finalization
- weak references
- atomic allocations (e.g.: string or binary data)
- mostly read-only/static allocations (e.g.: code, global bindings)
A prototype GC phase callback API for 1.8, REE and 1.9 is here:
https://github.com/kstephens/ref/tree/master-mri-gc_api/patch
This GC API should be supported on both 1.8 and 1.9 code lines.
        
           Updated by matz (Yukihiro Matsumoto) over 14 years ago
          Updated by matz (Yukihiro Matsumoto) over 14 years ago
          
          
        
        
      
      This GC API should be supported on both 1.8 and 1.9 code lines.
There's no chance to add new API to 1.8. 1.8 has been dead new-feature-wise-ly.
matz.
        
           Updated by kstephens (Kurt  Stephens) about 14 years ago
          Updated by kstephens (Kurt  Stephens) about 14 years ago
          
          
        
        
      
      I've made a small amount of progress on a prototype. I will post links to github branches ASAP.
        
           Updated by kstephens (Kurt  Stephens) about 14 years ago
          Updated by kstephens (Kurt  Stephens) about 14 years ago
          
          
        
        
      
      Branch is here: https://github.com/kstephens/ruby/tree/trunk-mem-api
Current progress:
Supports boot-time selection between the standard gc.c memory system (named "core") and a malloc-only system (named "malloc");
the "core" memory system is default.
Build:
./configure --prefix=... && make
Then try:
RUBY_MEM_SYS=malloc make install
or
RUBY_MEM_SYS=malloc:D  make install # produces debug output using malloc-only.
RUBY_MEM_SYS=malloc:DL make install # produces debug output using malloc-only with @file:line.
Features not yet implemented: hooks for finalization, add hooks in gc.c for GC callbacks (weak reference/reference queue support).
        
           Updated by naruse (Yui NARUSE) about 14 years ago
          Updated by naruse (Yui NARUSE) about 14 years ago
          
          
        
        
      
      Kurt Stephens wrote:
Branch is here: https://github.com/kstephens/ruby/tree/trunk-mem-api
Current progress:
Supports boot-time selection between the standard gc.c memory system (named "core") and a malloc-only system (named "malloc");
the "core" memory system is default.
Why don't you use GC.disable ?
        
           Updated by kstephens (Kurt  Stephens) about 14 years ago
          Updated by kstephens (Kurt  Stephens) about 14 years ago
          
          
        
        
      
      Yui NARUSE wrote:
Why don't you use GC.disable ?
Because GC.disable doesn't leverage memory debuggers (valgrind) effectively; gc.c still continues to allocate large chunks (heaps) via malloc(); thus valgrind can not distinguish between (and instrument) RVALUEs that are parceled from the large chunks in rb_newobj().
Also because this is a proof-of-concept prototype.
        
           Updated by shyouhei (Shyouhei Urabe) about 14 years ago
          Updated by shyouhei (Shyouhei Urabe) about 14 years ago
          
          
        
        
      
      Kurt Stephens wrote:
Because GC.disable doesn't leverage memory debuggers (valgrind) effectively; gc.c still continues to allocate large chunks (heaps) via malloc(); thus valgrind can not distinguish between (and instrument) RVALUEs that are parceled from the large chunks in rb_newobj().
Don't blame valgind it supports such situation http://valgrind.org/docs/manual/mc-manual.html#mc-manual.mempools
        
           Updated by kstephens (Kurt  Stephens) about 14 years ago
          Updated by kstephens (Kurt  Stephens) about 14 years ago
          
          
        
        
      
      Shyouhei Urabe wrote:
Don't blame valgind it supports such situation http://valgrind.org/docs/manual/mc-manual.html#mc-manual.mempools
I'm not "blaming" valgrind, I'm not even "blaming" gc.c. Valgrind has hooks for recognizing custom allocators, but only if the allocator is instrumented. It's obvious, MRI 1.9 uses some of the VALGRIND_*() hooks, however the hooks are enabled at compile-time.
Again, this is part of a proof-of-concept -- that different allocators and features can be selected at boot-time (and/or compile-time). This is not the final work -- there is a larger goal stated in the description.
BTW: using the RUBY_MEM_SYS=malloc allocator during "make" and "make install" appears to improve speed with only a modest memory increase. Some ruby programs are short-lived and do not create much collectable garbage.
If anyone thinks the larger goal, or even this dinky "malloc-only" allocator, is useful, please speak up. :)
        
           Updated by mame (Yusuke Endoh) over 13 years ago
          Updated by mame (Yusuke Endoh) over 13 years ago
          
          
        
        
      
      - Status changed from Open to Assigned
- Assignee set to authorNari (Narihiro Nakamura)
        
           Updated by mame (Yusuke Endoh) almost 13 years ago
          Updated by mame (Yusuke Endoh) almost 13 years ago
          
          
        
        
      
      - Target version set to 2.6
        
           Updated by authorNari (Narihiro Nakamura) almost 12 years ago
          Updated by authorNari (Narihiro Nakamura) almost 12 years ago
          
          
        
        
      
      - Status changed from Assigned to Closed
Hi. I'm sorry to late reply. Could you make more small proposal? It's difficult to introduce big feature at once.