Re: [rust-dev] Let’s avoid having both foo() and foo_opt()

spir Fri, 06 Dec 2013 16:50:01 -0800

On 12/06/2013 09:41 PM, Simon Sapin wrote:

We have some functions and methods such as
[std::str::from_utf8](http://static.rust-lang.org/doc/master/std/str/fn.from_utf8.html)
that may succeed and give a result, or fail when the input is invalid.


1. Sometimes we assume the input is valid and don’t want to deal with the error
case. Task failure works nicely.

2. Sometimes we do want to do something different on invalid input, so returning
an `Option<T>` works best.

And so we end up with both `from_utf8` and `from_utf8`. This particular case is
worse because we also have `from_utf8_owned` and `from_utf8_owned_opt`, to cover
everything.

Multiplying names like this is just not good design. I’d like to reduce this
pattern.

Getting behavior 1. when you have 2. is easy: just call `.unwrap()` on the
Option. I think we should rename every `foo_opt()` function or method to just
`foo`, remove the old `foo()` behavior, and tell people (through documentation)
to use `foo().unwrap()` if they want it back?

The downsides are that unwrap is more verbose and gives less helpful error
messages on task failure. But I think it’s worth it.

What do you think?

(PS: I’m guilty of making this worse in #10828, but I’d like to discuss this
before sending pull requests with invasive API changes.)


[A bit long, sorry, this is a topic about which i have thought for a while.]

There may be a more complicated general pattern, of all kinds of functions thatmay not be able to perform their nominal task, due to invalid input, but theclient cannot know whether the input is valid without more or less reproducingsaid task. Checking utf8 validity is about the same job as decoding, forinstance, to reuse your example.

Compare with a function computing the average value of a collection of numbers(or the sum, product, std-dev, etc...) which is passed an empty collection: herethe client can know, thus:1. if this abnormal case does not belong to the app's logic, the client shouldjust call the func stupidly so that the func failure is a signal of logicalerror on the app side2. if instead this case belongs to the app's logic, the client should firstcheck, and never call the func in this special caseThus, despite very possible failure, there should here be only one version ofthe func (no *_opt), one that stupidly fails, with a stupid error msg.

Back to the cases where the client cannot know before calling. To this categorybelong a whole series of search/find functions, and many dealing with the filesystem, user input, input in general. In the latter case, a func's input is infact not (all) provided by the client. But there is the same pattern ofanomalous cases which may, or not, belong to the app logic (1. or 2. above): isit correct (if special or exceptional) that such file does not exist, or suchcollection does not hold the searched item? Meaning, should I deal with suchcases? If not, if such a case does not belong to the application logic, again Ishould stupidly call a func that stupidly fails with a stupid error msg, so I amtold, simply and early, of my logical errors. These are true errors (notso-called exceptions), and the failure is a symptom or signal, a good thing (if,indeed, what you want is making good software). I *want* it; and want it so!

I'm in favor of simple funcs doing simple tasks simply, and just failing inanomalous cases, for these reasons. [1]

Remains the situation combined of such funcs, of which the client cannot knowwhether they will be able to perform their task, and of abnormal cases belongingto the logic of the app (there are also whole categories of maintenance andsafety modes here). For such situations, it looks somewhat logical to have 2versions, I guess. Playing a bit with words:1. when I ask to _find_ something, I take it for granted the thing is somewherethere, and expect a location in return2. when I ask to _search_ something, I do not take anything for granted, andexpect in return _either_ "not there!" or a locationThis duality applies both to the so-called real world (despite blur naturallanguage word meanings) and software worlds, and extends to all cases, it seems.We can certainly find a way, using Option or another mean, to combine bothcategories of situations (1. or 2.) using a single tool, but this should be verywell done, and ensure that:* In cases of category 1., developers are warned about their errors exactly asif they had called a stupidly failing func.* At the semantic (conceptual) level, the duality of categories remains somehowpreserved, including in source code.About the latter, in particular it should be obvious in code, without knowledgeof language arcanes or weird idioms, that (or whether) the caller expects asuccess unconditionally -- because (and in other words that) the anomalous casejust does not belong to this app; this is critical information to the reader.How to do that right?

PS: I take the opportunity of again thanking the initiators of this Rust projectfor their welcoming and open-mindedness to such exchanges. This is really great.I'm not sure anymore Rust is the right language for me (maybe too complicatedand abstract for my poor and old little mind...) but go on following the mailinglist (and please keep it a mailing list! ;-) for this quality of sharing.


Denis

[1] I'm very much against complicated funcs, even more against ones that try toguess what you want in anomolous cases (eg return 0 as sum of no number, or 1 asproduct, lol!), and totally against the pretention that software should notfail, as apparently defended by Go's designers.This leads to illogical things, like their utf8 decoder, precisely, insertingcodes for the replacement character U+FFFD �, a *valid* code, instead ofsignaling invalid source. So that you never know the input was invalid, orwhether this code belong to the source or was inserted by their func...


_______________________________________________
Rust-dev mailing list
Rust-dev@mozilla.org
https://mail.mozilla.org/listinfo/rust-dev

Re: [rust-dev] Let’s avoid having both foo() and foo_opt()

Reply via email to