Feature #12967
closedAdd a default for RUBY_GC_HEAP_GROWTH_MAX_SLOTS out-of-the-box
Description
As it stands Ruby heaps grow at a rate that is too fast for the vast majority of applications that use Ruby.
def rss
_,rss = `ps ax -o pid,rss | grep '#{Process.pid} '`.split(/\s+/)
rss.to_i
end
def heap_length
GC.stat[:heap_sorted_length]
end
@array = []
@heap_length = heap_length
while true
@array << ""
if @heap_length < heap_length
@heap_length = heap_length
puts "RSS #{rss}, Heap length: #{@heap_length}"
end
end
RSS 14364, Heap length: 133
RSS 33828, Heap length: 237
RSS 39652, Heap length: 424
RSS 59476, Heap length: 759
RSS 112192, Heap length: 1342
RSS 126452, Heap length: 2257
RSS 142020, Heap length: 3295
RSS 184056, Heap length: 5058
RSS 266648, Heap length: 7810
RSS 339224, Heap length: 11113
compare that to a sane default of say
RUBY_GC_HEAP_GROWTH_MAX_SLOTS=100000
RUBY_GC_HEAP_GROWTH_MAX_SLOTS=100000 ruby test_mem.rb
RSS 15580, Heap length: 133
RSS 30452, Heap length: 237
RSS 37212, Heap length: 424
RSS 55116, Heap length: 667
RSS 107716, Heap length: 911
RSS 114388, Heap length: 1095
RSS 117492, Heap length: 1233
RSS 119196, Heap length: 1309
RSS 122872, Heap length: 1474
RSS 123588, Heap length: 1509
RSS 125396, Heap length: 1589
RSS 129648, Heap length: 1756
RSS 131096, Heap length: 1781
RSS 132812, Heap length: 1863
RSS 136420, Heap length: 2031
RSS 137960, Heap length: 2052
RSS 139832, Heap length: 2135
RSS 143568, Heap length: 2304
RSS 145112, Heap length: 2322
RSS 146936, Heap length: 2406
RSS 150404, Heap length: 2573
RSS 173660, Heap length: 2781
RSS 175984, Heap length: 2866
RSS 179664, Heap length: 3034
RSS 192980, Heap length: 3242
RSS 193784, Heap length: 3325
RSS 197304, Heap length: 3493
Note, this test deals with the absolute minimal amount of RSS, cause all data is stored in RVALUES, with strings like typical apps RSS can grow in much bigger increments.
RSS will jump from 266MB to 339MB, out of the box, this is a massive amount of growth just for Ruby heaps.
The trouble with this huge amount of heap growth is that it amplifies issues around heap fragmentation, in web apps once heap grows it is very unlikely it will shrink again cause there are just enough long living objects that a full slot never frees up properly.
My suggestion is to ship with a far safer default of: RUBY_GC_HEAP_GROWTH_MAX_SLOTS=100000
This has zero impact on "small scripts" < 50M rss, and major advantages for larger apps.
Thoughts?
Honestly I see zero downsides to capping it so Ruby heaps only grow in 5mb increments. It eliminates lots of current bloat and allows the GC to operate more efficiently.
Updated by shyouhei (Shyouhei Urabe) almost 8 years ago
We looked at this issue at yesterday's developer meeting.
Ko1 said he was not sure if the proposed default value is decent. Also he said to me that there should be some kind of comprehensive approach to parameterize GC. It seems he thinks sporadic use of environment variables make things complicated.
Updated by ko1 (Koichi Sasada) almost 8 years ago
- Status changed from Open to Rejected
My suggestion is to ship with a far safer default of: RUBY_GC_HEAP_GROWTH_MAX_SLOTS=100000
To define "safe" is difficult. Providing tuning parameter is enough for this purpose.
Updated by normalperson (Eric Wong) almost 8 years ago
ko1@atdot.net wrote:
Issue #12967 has been updated by Koichi Sasada.
Status changed from Open to Rejected
My suggestion is to ship with a far safer default of: RUBY_GC_HEAP_GROWTH_MAX_SLOTS=100000
To define "safe" is difficult. Providing tuning parameter is enough for this purpose.
I think a problem is few people know tuning parameters (even if
we documented in the ruby(1) manpage), and some tuning parameter
behaviors may change with time.
So, we should try to find better defaults for the common case.
Maybe it's just me, but I have more problems with memory usage
than speed with Ruby. I think Sam and most other Rails users
will agree with me.
For tracking down high memory usage in apps, I prefer
single-threaded servers since it's easier to track down what
code was executing when big growths happen. But single-threaded
is also not memory-efficient....
Unfortunately, tracking/controlling memory growth in a
multi-threaded, shared heap processes is not easy, either;
especially when they have to know how their malloc()
implementation works, and what tuning there is, too.
In my experience, typical Ruby programmers would rather not be
going through their entire code base to hunt memory growth
locations, regardless of single or multi-thread. Instead, I see
people either buy more memory, or abandon Ruby.
Anyways, I don't think RUBY_GC_HEAP_GROWTH_MAX_SLOTS=100000
will hurt anyone.
Updated by nnc (Nemanja Chorlya) almost 7 years ago
IMHO this should be reconsidered, as it provides a huge memory saving for a typical Rails app and it's so easy and unobtrusive, but almost no one is aware of it
But if it was capped like this out of the box, a lot more people would benefit and have a better experience with Ruby, especially on memory constrained environments like Heroku. Just by tweaking RUBY_GC_HEAP_GROWTH_MAX_SLOTS we were able to use smaller dynos and save a lot of money on our Heroku bills.