Feature #9118
closed
In Enumerable#to_a, use size to set array capa when possible
Description
Cross-post from https://github.com/ruby/ruby/pull/444.
Enumerable#to_a works by creating an empty array with small capacity, then populating it and expanding the capacity as it goes. For large enumerables, this causes several resizes, which can hurt performance. When an enumerable exposes a size method, we can guess that the resulting array's size will usually be equal to the enumerable's size. If we're right, we only have to set capacity once, and if we're wrong, we don't lose anything.
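To make the "use size when it's there" decision concrete, here is a minimal Ruby sketch of the check. It is not the patch itself, which lives in C in enum.c; capacity_hint is a hypothetical helper name used only for illustration. It treats a size as usable only when it is a non-negative Integer, since size can also return nil (unknown) or Float::INFINITY.

```ruby
# Hypothetical helper (not part of Ruby or the patch): returns the number of
# elements to pre-allocate for, or nil when no usable hint is available.
def capacity_hint(enum)
  # Enumerables that only define #each have no size method at all.
  return nil unless enum.respond_to?(:size)

  size = enum.size
  # size may be nil (unknown) or Float::INFINITY; only a non-negative
  # Integer is a usable capacity.
  size.is_a?(Integer) && size >= 0 ? size : nil
end

capacity_hint(1..10)                          # 10
capacity_hint([1, 2, 3].each_slice(2))        # 2
capacity_hint(Enumerator.new { |y| y << 1 })  # nil, size is unknown
capacity_hint(loop)                           # nil, size is Float::INFINITY
```

When the hint is nil, to_a would fall back to the current behavior of starting small and growing as needed.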
The attached file (or linked PR) adjusts enum.c's to_a method to take advantage of the size method when it's there. In my tests this makes Range#to_a about 10% faster, and doesn't have any significant effect on a vanilla enum with no size method. I couldn't find any existing benchmark that this consistently made better or worse.
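For anyone who wants to repeat the comparison, a rough sketch of the kind of measurement described above might look like the following. PlainEnum, time_to_a, the element count, and the run count are all made up for illustration; this is not the script behind the numbers quoted here, and timings will vary by machine and Ruby build.

```ruby
require "benchmark"

# Hypothetical "vanilla" enumerable: it defines only #each, so it has no
# size method and to_a cannot know the element count in advance.
class PlainEnum
  include Enumerable

  def initialize(n)
    @n = n
  end

  def each
    i = 0
    while i < @n
      yield i
      i += 1
    end
  end
end

# Total wall-clock seconds spent calling to_a `runs` times.
def time_to_a(enum, runs: 20)
  Benchmark.realtime { runs.times { enum.to_a } }
end

count = 100_000
range_seconds = time_to_a(1..count)
plain_seconds = time_to_a(PlainEnum.new(count))

puts format("Range#to_a:     %.4fs", range_seconds)
puts format("PlainEnum#to_a: %.4fs", plain_seconds)
```

Running this on builds with and without the patch, and comparing the Range#to_a line between them, gives a rough sense of the effect.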
If you like this idea, this could also be done in other classes with custom to_a, like Hash.