Feature #16987: Enumerator::Lazy vs Array methods - Ruby - Ruby Issue Tracking System

Feature #16987

closed

Enumerator::Lazy vs Array methods

Added by zverok (Victor Shepelev) about 5 years ago. Updated about 3 years ago.

Status: Closed

Assignee:

Target version:

[ruby-core:98958]

Description

Enumerable methods are designed to be eager by default: each method call in a chain is executed immediately. Sometimes that is impractical (e.g., given an array of 2 million strings: drop comments, split into fields, and find the first ten whose field 2 equals some value). One then has to either do everything in a single each block or use Enumerable#lazy. There are three problems with the latter:

1. It is much less well known.
2. It is said to be almost always slower than the non-lazy equivalent, and is therefore not recommended.
3. It lacks some methods that are often needed when processing large amounts of data.
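To make the motivating case concrete, here is a minimal sketch of such a pipeline written with Enumerable#lazy. The input stream, its line format, and the names `lines` and `first_ten` are made up for illustration; the source is deliberately endless, so an eager chain would never finish, while the lazy one stops as soon as ten matches are found.

```ruby
# Hypothetical input: an endless stream of lines, some of them comments.
lines = Enumerator.new do |y|
  i = 0
  loop do
    y << (i % 3 == 0 ? "# comment #{i}" : "id#{i},name#{i},#{i.even? ? 'x' : 'y'}")
    i += 1
  end
end

# Each step is applied element by element; nothing runs until #first pulls values.
first_ten = lines.lazy
  .reject { |line| line.start_with?("#") }   # drop comments
  .map    { |line| line.split(",") }         # split into fields
  .select { |fields| fields[2] == "x" }      # keep rows whose field 2 matches
  .first(10)                                 # stop after ten matches
```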

I want to discuss (3) here. Enumerator::Lazy ought to have, but does not have, methods such as #flatten, #product, and #compact. They are all methods of Array, not of Enumerable. In fact:

They probably should belong to Enumerable (none of them requires anything besides #each to function),
They are definitely useful for lazily processing large sequences.
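As a rough illustration of the claim that these methods need nothing beyond #each, the sketch below builds all three on top of it. The module name `EachOnly` and its method signatures are hypothetical and are not part of Ruby or of this proposal; each method returns an Enumerator, so values are produced on demand.

```ruby
# Hypothetical helper module: every method relies only on the source's #each.
module EachOnly
  module_function

  # Like Array#compact: skips nil (and only nil).
  def compact(source)
    Enumerator.new { |y| source.each { |x| y << x unless x.nil? } }
  end

  # Like Array#flatten: recursively unpacks nested arrays.
  def flatten(source)
    Enumerator.new do |y|
      walk = lambda do |items|
        items.each do |x|
          if x.is_a?(Array)
            walk.call(x)
          else
            y << x
          end
        end
      end
      walk.call(source)
    end
  end

  # Like Array#product with one argument: yields every [a, b] pair.
  def product(source, other)
    Enumerator.new do |y|
      source.each { |a| other.each { |b| y << [a, b] } }
    end
  end
end
```

For example, `EachOnly.product(1..Float::INFINITY, [:a]).first(2)` terminates even though the first source is endless, because pairs are generated only as they are requested.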

Related issues 1 (0 open — 1 closed)

Updated by sawa (Tsuyoshi Sawada) about 5 years ago

Description updated (diff)

Updated by sawa (Tsuyoshi Sawada) about 5 years ago

Description updated (diff)

#3 [ruby-core:99182]

Updated by midnight (Sarun R) about 5 years ago

I use Lazy all the time. There is nothing to be done here about its popularity.
FWIW, people know about it but choose not to rely on it because they want to support old versions of Ruby.
Hence, it is not very popular in open-source settings.

Regardless of what should be implemented, for now, you can use

lazy.flat_map(&:itself)

as #flatten, and

lazy.select(&:itself)

as #compact.
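A small sketch of how these two workarounds behave on sample data (the variable names and values are illustrative). Note two differences from the Array methods: flat_map unpacks only one level of nesting, and select(&:itself) drops false as well as nil; reject(&:nil?) is the closer match for #compact.

```ruby
nested = [[1, 2], [3, [4, 5]], 6]

# flat_map(&:itself) unpacks one level only, like Array#flatten(1).
one_level = nested.lazy.flat_map(&:itself).to_a   # [1, 2, 3, [4, 5], 6]
fully_flat = nested.flatten                       # [1, 2, 3, 4, 5, 6]

values = [1, nil, false, 2]

# select(&:itself) keeps truthy values, so false is removed along with nil.
truthy_only = values.lazy.select(&:itself).to_a   # [1, 2]

# reject(&:nil?) removes nil only, which is what Array#compact does.
without_nil = values.lazy.reject(&:nil?).to_a     # [1, false, 2]
```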

Only #product is the tricky one that requires multiple operations, but it is not used very often anyway.
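One possible way to get a lazy two-way product is to nest a map inside flat_map. This is only a sketch with a hypothetical helper name, `lazy_product`; it assumes the second collection is finite and can be iterated repeatedly.

```ruby
# Hypothetical helper: lazily pairs every element of `left` with every element of `right`.
# `right` must be finite and re-iterable, because it is traversed once per left element.
def lazy_product(left, right)
  left.lazy.flat_map { |a| right.map { |b| [a, b] } }
end

pairs = lazy_product(1..Float::INFINITY, %w[a b]).first(3)
# pairs is [[1, "a"], [1, "b"], [2, "a"]]
```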

What I miss most is #scan.
https://ramdajs.com/docs/#scan
It is basically a #reduce that yields at every iteration.
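A minimal sketch of what such a #scan could look like as a standalone helper. The name `lazy_scan` and its signature are assumptions for illustration, not an existing Ruby method; like the Ramda function linked above, it emits the initial value first and then every intermediate accumulator.

```ruby
# Hypothetical helper: a lazy running reduce.
# Emits `init`, then the accumulator after each element of `source`.
def lazy_scan(source, init, &fn)
  Enumerator.new do |y|
    acc = init
    y << acc
    source.each { |x| y << (acc = fn.call(acc, x)) }
  end.lazy
end

# Running factorials over an endless range; only five values are computed.
factorials = lazy_scan(1..Float::INFINITY, 1) { |acc, x| acc * x }.first(5)
# factorials is [1, 1, 2, 6, 24]
```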

Updated by matz (Yukihiro Matsumoto) over 4 years ago

Related to Feature #17312: New methods in Enumerable and Enumerator::Lazy: flatten, product, compact added

Updated by zverok (Victor Shepelev) about 3 years ago

Status changed from Open to Closed
