Project

General

Profile

Actions

Feature #14602

open

Version of dig that raises error if a key is not present

Added by amcaplan (Ariel Caplan) almost 7 years ago. Updated 6 months ago.

Status:
Open
Assignee:
-
Target version:
-
[ruby-core:86102]

Description

Currently, if I have a hash like this:

{
    :name => {
        :first => "Ariel",
        :last => "Caplan"
    }
}

and I want to navigate confidently and raise a KeyError if something is missing, I can do:

hash.fetch(:name).fetch(:first)

Unfortunately, the length of the name, combined with the need to repeat the method name every time, means most programmers are more likely to do this:

hash[:name][:first]

which leads to many unexpected errors.

The Hash#dig method made it easy to access methods safely from a nested hash; I'd like to have something similar for access without error protection, and I'd think the most natural name would be Hash#dig!. It would work like this:

hash = {
    :name => {
        :first => "Ariel",
        :last => "Caplan"
    }
}
hash.dig!(:name, :first) # => Ariel
hash.dig!(:name, :middle) # raises KeyError (key not found: :middle)
hash.dig!(:name, :first, :foo) # raises TypeError (String does not have #dig! method)

Related issues 2 (2 open0 closed)

Is duplicate of Ruby master - Feature #12282: Hash#dig! for repeated applications of Hash#fetchOpenActions
Has duplicate Ruby master - Feature #15563: #dig that throws an exception if a key doesn't existOpenActions

Updated by shevegen (Robert A. Heiler) almost 7 years ago

I think this may be somewhat problematic since it does not appear
to fit to other methods that end with a "!", such as .chop() versus
.chop!() for a String or .map() versus .map!() for an Array.

In the past I thought that "!" would mean mostly "modify in place",
but matz wrote somewhere on the bug tracker that it is more meant
as a "caution" indicator to the ruby user.

Another problem is, I think, that your suggestion of .dig!() does
something that .dig() itself isn't doing (raising multiple
different errors). But that may just be me, perhaps others see
no problem - at the end of the day you'd only have to convince
matz anyway. :)

Updated by duerst (Martin Dürst) almost 7 years ago

Would a keyword parameter to dig work for you?

E.g. hash.dig!(:name, :middle, raise_error: true) or something similar.

Updated by amcaplan (Ariel Caplan) almost 7 years ago

shevegen (Robert A. Heiler) wrote:

I think this may be somewhat problematic since it does not appear
to fit to other methods that end with a "!", such as .chop() versus
.chop!() for a String or .map() versus .map!() for an Array.

In the past I thought that "!" would mean mostly "modify in place",
but matz wrote somewhere on the bug tracker that it is more meant
as a "caution" indicator to the ruby user.

Another problem is, I think, that your suggestion of .dig!() does
something that .dig() itself isn't doing (raising multiple
different errors). But that may just be me, perhaps others see
no problem - at the end of the day you'd only have to convince
matz anyway. :)

You have a good point about the bang methods often signifying an in-place operation rather than an error-prone one; the latter is more of a Rails convention than a Ruby one, and I mixed things up. Thank you for pointing that out. Mea culpa.

Ideally, we'd just have Hash#fetch take an arbitrary number of arguments, each representing another layer of depth. However, that option is closed off due to backwards compatibility issues, since it already takes an optional second argument representing a default return value in case the item isn't found.

So it sounds like we need a new method name. Perhaps something like #deep_fetch would work, but I'll have to think on it more to see if I can come up with something better. EDIT: Apparently there's already a deep_fetch gem that does exactly this, which might indicate that the name #deep_fetch would be understood intuitively by the community.

In terms of the multiple errors: This isn't actually new, and in fact I copied the current behavior of Hash#dig with the exception of adding the KeyError:

hash = {
    :name => {
        :first => "Ariel",
        :last => "Caplan"
    }
}

hash.dig(:name, :first) # => Ariel
hash.dig(:name, :middle) # => nil
hash.dig(:name, :first, :foo) # raises TypeError (String does not have #dig method)

Updated by amcaplan (Ariel Caplan) almost 7 years ago

duerst (Martin Dürst) wrote:

Would a keyword parameter to dig work for you?

E.g. hash.dig!(:name, :middle, raise_error: true) or something similar.

I appreciate the thought. I personally would be more likely to do hash.fetch(:name).fetch(:middle) instead of adding a keyword argument to #dig, unless the list was extremely long (probably at least 4 consecutive keys), which I'd suspect is unusual enough that it's not worth adding to Ruby core for that unusual case.

Actions #5

Updated by k0kubun (Takashi Kokubun) over 5 years ago

  • Is duplicate of Feature #12282: Hash#dig! for repeated applications of Hash#fetch added
Actions #6

Updated by k0kubun (Takashi Kokubun) over 5 years ago

  • Has duplicate Feature #15563: #dig that throws an exception if a key doesn't exist added

Updated by robb (Robb Shecter) almost 5 years ago

amcaplan (Ariel Caplan) wrote:

The Hash#dig method made it easy to access methods safely from a nested hash; I'd like to have something similar for access without error protection, and I'd think the most natural name would be Hash#dig!.

FYI, I've implemented this as a gem: https://github.com/dogweather/digbang

Actions #8

Updated by jaredbeck (Jared Beck) over 2 years ago

I personally don't mind dig!. I interpret the ! as a general sign of caution, rather than some meaning specific to data structures (ie. self-modification). But, if we can't have dig!, how about fetch_dig or dig_fetch?

Whatever we call it, it's a good idea and should be in standard ruby. IMO it's too small to be a gem. We don't want to live in a "left-pad" world. :)

Updated by fpsvogel (Felipe Vogel) over 2 years ago

For me this is a nice shortcut to safely access values in a large config hash. So I would use it if it became part of Ruby core.

I like the name dig! because it's short, but if that has too much of a Rails flavor rather than Ruby, then deep_fetch and dig_fetch seem like fine names too.

I recently wrote about some alternatives for this with benchmarks and other considerations, including the two gems mentioned above as well as custom implementations, at https://fpsvogel.com/posts/2022/ruby-hash-dot-syntax-dig-performance-benchmarks#dig-with-errors

Updated by zverok (Victor Shepelev) over 2 years ago

Just a bit of "design space" analysis:

  1. I think dig! is unusual for core Ruby. A lot of Rubyists are used that in Rails pairs like find_by/find_by! are raising/non-raising, but I don't remember any Ruby core API using this convention
  2. I don't believe the keyword argument is expressive enough. The "visual structure" of the dig signature includes multiple values of user data (and the list of values might be of arbitrary length), so the option at the end of arguments is a) not visible enough and b) not immediately intuitively obvious if it isn't part of user's data
  3. In Hash, we already have at least two examples of using fetch in a sense "get the value or fail": #[] vs #fetch and #values_at vs #fetch_values. It seems like it gives enough precedent to look at the "fetch"-based naming, and it seems like fetch_dig, while grammatically not ideal, would be guessable enough, based on existing experience.

Updated by jeremyevans0 (Jeremy Evans) over 2 years ago

duerst (Martin Dürst) wrote in #note-2:

Would a keyword parameter to dig work for you?

E.g. hash.dig!(:name, :middle, raise_error: true) or something similar.

Keyword approach is not backwards compatible, because keywords are currently treated as positional arguments:

{:name=>{:middle=>{{:raise_error=>true}=>2}}}.dig(:name, :middle, raise_error: true)
# => 2

Updated by nobu (Nobuyoshi Nakada) over 2 years ago

[:name, :middle].inject(hash, :fetch) # raises KeyError (key not found: :middle)

Updated by shyouhei (Shyouhei Urabe) over 2 years ago

nobu (Nobuyoshi Nakada) wrote in #note-12:

[:name, :middle].inject(hash, :fetch) # raises KeyError (key not found: :middle)

Doesn't interface with Arrays.

Updated by mame (Yusuke Endoh) over 2 years ago

shyouhei (Shyouhei Urabe) wrote in #note-13:

Doesn't interface with Arrays.

[0, 0, 0].inject([[[:foo]]], :fetch) #=> :foo ?

I don't think @nobu (Nobuyoshi Nakada) is serious about the idiom, though.

Updated by shyouhei (Shyouhei Urabe) over 2 years ago

mame (Yusuke Endoh) wrote in #note-14:

shyouhei (Shyouhei Urabe) wrote in #note-13:

Doesn't interface with Arrays.

[0, 0, 0].inject([[[:foo]]], :fetch) #=> :foo ?

I don't think @nobu (Nobuyoshi Nakada) is serious about the idiom, though.

Hmm, sorry. I was confusing.

Updated by Eregon (Benoit Daloze) over 2 years ago

There is also this blog post discussing about dig_fetch, which seems very similar: https://shopify.engineering/dig-fetch-truffleruby

message = data.dig_fetch(:response, :message) { IdentityObject.new }
# instead of
message = data.fetch(:response, {}).fetch(:message, IdentityObject.new)

So it still allows a default value if any part is missing when provided a block, and raises an exception otherwise.

Updated by matz (Yukihiro Matsumoto) over 2 years ago

It seems to be a nice idea. But is deep_fetch is the best name for it?

Matz.

Updated by byroot (Jean Boussier) over 2 years ago

A couple other ideas I had:

  • lookup
  • pick
  • traverse

Updated by st0012 (Stan Lo) over 2 years ago

How about dig_for? If we’re digging for something, it kinda makes sense to raise an exception if it’s not there.

Updated by mollemoll (Jonas Molander) over 2 years ago

Some ideas:

reveal
uncover
unfold
dive
enlight
scoop
reel

Updated by p8 (Petrik de Heus) over 2 years ago

Maybe:

fetch_each(:name, :first)
fetch_all(:name, :first)
fetch_tail(:name, :first)
fetch_end(:name, :first)

# or with dig_
dig_each(:name, :first)
dig_tail(:name, :first)
dig_all(:name, :first)
dig_end(:name, :first)
dig_bottom(:name, :first)
dig_deep(:name, :first)
dig_down(:name, :first)

# or maybe
delve(:name, :first)
drill(:name, :first) # as in drill down 

Updated by olivierlacan (Olivier Lacan) over 2 years ago

Considering dig as an operation that can be done manually (with fingers) and address exceptional situations (unexpected objects) more subtly, I would suggest shovel which is a more blunt instrument which tends to raise a notable exception sound (clang!) when it hits an unexpected object.

hash = {
  name: {
    first: "Ariel",
    last: "Caplan"
  }
}

hash.dig(:name, :first) # => Ariel
hash.dig(:name, :middle) # => nil

hash.shovel(:name, : first) # => Ariel
hash.shovel(:name, :middle) # => KeyError (key not found: :middle)

There's a slight semantic overlap with the Hash shovel operator (<<) which could be an issue.

Updated by amcaplan (Ariel Caplan) over 2 years ago

We can think of this as either a variation of fetch or a variation of dig. Ultimately it's both, of course, just depends how you look at it.

If we think of it as fetch-based, deep_fetch would be OK but we also might go with something that really describes quite literally what it does, which is fetch recursively. So, fetch_recursive or rfetch (think of Array#bsearch as prior art - though of course it's not 100% comparable) might be the way to go.

If we take the dig-based perspective, it's dig but non-permissive. So dig_strict might be the most literal way of explaining what it does.

Rather than advocating strongly for 1 specific word, I'd just gently recommend that we avoid introducing more dig-like verbs. While seasoned Rubyists will know the difference, newcomers won't have any obvious reason to assume that dig differs from traverse in strictness. At least there's a convention for fetch vs [] which already exists and we can use it to avoid introducing more new language/concepts.

Updated by duerst (Martin Dürst) over 2 years ago

amcaplan (Ariel Caplan) wrote in #note-23:

We can think of this as either a variation of fetch or a variation of dig. Ultimately it's both, of course, just depends how you look at it.

Or maybe we can think it as a combination of dig with fetch. Then what about dig_fetch or fetch_dig? These names don't look very natural, but it's easy to understand what they are about.

Updated by ufuk (Ufuk Kayserilioglu) over 2 years ago

My humble suggestion would be Hash#retrieve. The operation tries to retrieve a value in the depths of a hash and if it comes back empty handed, that is an error.

hash = {
  name: {
    first: "Ariel",
    last: "Caplan"
  }
}

hash.dig(:name, :first) # => Ariel
hash.dig(:name, :middle) # => nil

hash.retrieve(:name, :first) # => Ariel
hash.retrieve(:name, :middle) # => KeyError (key not found: :middle)

The word "retrieve" is a synonym for "fetch" in English, and has deep roots in CS terminology related to "data retrieval" and similar.

Updated by zverok (Victor Shepelev) over 2 years ago

I fully agree with @duerst (Martin Dürst) in #14602#note-24:

maybe we can think it as a combination of dig with fetch. Then what about dig_fetch or fetch_dig? These names don't look very natural, but it's easy to understand what they are about.

First, we already have examples of fetch-based naming: not only #fetch itself as a variation of #[], but also #fetch_values as a variation of #values_at, so there is a precedent for recognizability

Second, I value short one-word names, so all the witty options like #shovel and #retrieve are nice, but I am afraid that when we have a variation of a known method in an API established long ago, introducing completely new word into Ruby would be a false move. Imagine you started to read code and met with #retrieve (or #shovel) for the first time. There is nothing that might help you to understand what it does; one verb that "a bit resembles dig" is not suggestive enough.

Third, deep_fetch is somewhat suggestive, but the problem "it behaves like dig, but the name logic is nothing like dig" stands. Maybe if it would a pair of, IDK, #deep_fetch and #deep_get it might've been tolerable, but now is too late for that, everybody has used to #dig.

fetch_dig, OTOH, is reasonably short, clearly suggests the meaning, and follows the logic of other methods existing.

Updated by sawa (Tsuyoshi Sawada) over 2 years ago

What about simply allowing fetch to take multiple arguments?

If there are more than one argument, always interpret the last one as the default value unless when there is a block, in which case, the block is evaluated if a key is missing somewhere in the path.

You would need to explicitly write a block if you want to raise an error, but I believe your intention is not to let the raised error go through all the way to the top level to end the program; you intend to catch that error somewhere, and do something with it, right? Then, you can instead write that routine in the block from the beginning.

(1) No change to present behavior:

hash.fetch(:name, :first) # => {:first => "Ariel", :last => "Caplan"}

(2) Perhaps error prone use cases, but these are suited for dig, so do not use fetch in practice in such cases, and it would not be a problem:

hash.fetch(:name, :first, nil) # => "Ariel"
hash.fetch(:name, :middle, nil) # => `nil`

(3) Explicitly raise an error in a block:

hash.fetch(:name, :first){|key| raise KeyError} # => "Ariel"
hash.fetch(:name, :middle){|key| raise KeyError} # !> KeyError

(4) Or, write a routine:

hash.fetch(:name, :middle){|key| process_missing_key(key)}

Updated by sinsoku (Takumi Shotoku) about 1 year ago

I agree with #14602#note-23.
I think a fetch-based or dig-based name would be better, since the function can be inferred from the method name.

Personally, I feel that deep_fetch is a simple and nice name.
I often use deep_dup and deep_merge in Rails apps and are familiar with the method name deep_*.

I found fetch_dig and dig_fetch to be a little strange in that they have verbs lined up.
I think dig_strict is a good name, but I prefer deep_fetch because deep has fewer characters and is simpler than strict.

Updated by Eregon (Benoit Daloze) about 1 year ago

I like deep_fetch too.

Updated by mame (Yusuke Endoh) about 1 year ago

We discussed at the dev meeting but did not reach a conclusion. But there has been some progress.

  • @akr (Akira Tanaka) said we shouldn't add new vocabulary, and Matz agreed.
  • There is Array#fetch which throws an exception when it is not found, so we want to use this vocabulary: fetch_* or *_fetch
    • Matz didn't like any of the proposed fetch_each fetch_all fetch_tail fetch_end
  • Matz prefers dig or recursive to deep to represent this behavior.
    • deep sounds to be an operation on the entire data structure, due to the influence of deep_copy.
  • Matz was interested in how dig_fetch or fetch_dig would sound to English native speakers.
  • Matz also mentioned recursive_fetch, but said it seemed a bit long.

Updated by Eregon (Benoit Daloze) about 1 year ago

mame (Yusuke Endoh) wrote in #note-30:

  • deep sounds to be an operation on the entire data structure, due to the influence of deep_copy.

It actually is a deep operation, on the entire data structure/multiple objects, exactly like deep_copy would.
So it seems a perfect match, no?

Updated by mame (Yusuke Endoh) about 1 year ago

As far as I understand, Matz expects that, for a tree structure:

  • dig_* touches only nodes on a specific path,
  • deep_* touches all nodes.

Updated by edmz (Ed Mz) about 1 year ago

Following what @mame (Yusuke Endoh) explained about Matz expectations, I would like to (humbly) suggest another option:

  • dig_expected

(and a my +1 to dig_strict)

Updated by jordan-brough (Jordan Brough) 6 months ago

Matz was interested in how dig_fetch or fetch_dig would sound to English native speakers.

As a native english speaker, "dig_fetch" sounds natural and makes sense to me.

Actions

Also available in: Atom PDF

Like1
Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like1Like0Like0Like1