http://www.perlmonks.org?node_id=316288


in reply to Re: Re: Re: Re: What should be returned in scalar context? (best practice)
in thread What should be returned in scalar context?

This node falls below the community's threshold of quality. You may see it by logging in.

Replies are listed 'Best First'.
•Re: Re: Re: Re: Re: Re: What should be returned in scalar context? (best practice)
by merlyn (Sage) on Dec 22, 2003 at 02:43 UTC
    "returns an array" would be confusing to me, since it's not possible. Well, except maybe in an "lvalue" context.

    An array is a container, holding a list. You can't return containers. Only values. Scalar value is to scalar variable what list is to array. You can no more return a scalar variable than you can return an array.

    Put another way, you can't take a reference to a list, only to an array. And yet you can't take a reference to the return value of a subroutine. You have to construct an anonymous array (a new container) to hold the value.

    These are not just implementation issues. These are user-visible concepts, and affect your behavior if you have the proper model.

    Your casual conversation may make sense to you, but you'll confuse your reader. And the point of communication is to communicate for the reader, not just for your own notes.

    -- Randal L. Schwartz, Perl hacker
    Be sure to read my standard disclaimer if this is a reply.

      Your casual conversation may make sense to you, but you'll confuse your reader.

      My point is not a casual one even if my explanation has failed to be as clear as I would like. Allow me to try to explain it to you as it is fundamentally about documentation, and your skill at technical writing is unquestionable.

      When I document that a sub of mine "returns an array", I mean very specifically that the expression supplied to return() (be it explicit or implicit) is an array.

      My use of the word "return" corresponds directly to the Perl keyword "return" without trying to describe what return() itself actually does. This is a considered choice, not laziness or lack of understanding.

      The behavior of return() is not within my purview. Isn't it to my own benefit to avoid describing it? Don't I, in fact, have a responsibility to yield to its proper documentation?

      Granted this is unlikely, but for the sake of argument consider the possibility of return()'s behavior changing. Imagine that a boolean context and a couple boolean values are added to the language. (Yes, it is a ridiculous notion but bare with me.) By not attempting to enumerate every context in the first place, haven't I insulated my own documentation from the change and avoided the need for edits?

      I believe this to be a fine documentation strategy. It is simple and allows me to be brief, describe the code itself, and communicate exactly what to expect. It avoids redundancy and protects the documentation, the published interface, from outside changes. It ignores the sometimes sticky issue of context which, really, should not have to reappear in every function's documentation. For these reasons, I will continue to use it.

      The one concern that I do have is that it seems several people are confused by it. I find this disconcerting as it is really quite simple. My only explanation is that many people are either too focused on their knowledge of Perl's inner workings or are hung up on documenting their own functions like Perl builtins. (Builtins aren't coded in Perl and hence, it wouldn't make sense to refer to the return keyword in their documentation; this is true of XS code as well.) Whatever the case, this kind of confusion seems to be prevalent wherever lists and list context are discussed. Maybe people still aren't comfortable with the concepts and want to be reminded of the specifics at every turn.

      -sauoq
      "My two cents aren't worth a dime.";
      
        Saying it "returns an array", simply because you see the tokens for "return" and an array name, is like saying "assigns an array" in the statement "$foo = @bar".

        You are ignoring semantics, completely focussing on syntax. You are not describing what it does, you are describing the text of your source code.

        When you do that, you might as well leave all descriptions off, and let the reader go find the source for themselves.

        I find documentation much clearer when I describe what things do. Not what they are.

        It does not return an array. It can't. Writing "return @array" does not return an array.

        I'll hope not to run across your style of mis-documentation in any place other than the Monestary. It will frustrate me and my clients and my students.

        -- Randal L. Schwartz, Perl hacker
        Be sure to read my standard disclaimer if this is a reply.

        A reply falls below the community's threshold of quality. You may see it by logging in.

        I've decided that there's no changing your opinion on the matter. Despite the fact that four people have responded questioning your use of "returns an array", you have stood fast. You called my expectations of an arrayref in scalar context when you say "returns an array" as ridiculous and I certainly appreciate your input on the matter. It does show, however, that there's at least one idiot who reads "returns an array" this way. You state here that you can say "returns an array" and you won't need to update the documentation in the future. I mean no disrespect when I say that I read that as you intentionally use abstract terms and "golf" your documentation to lower the amount of maintainence needed. If this is the case, why bother at all? If you aren't going to properly describe your code in non-ambiguous terminology then what good does your documentation serve?

        But hey, I'm just a dimwit who believes "returns an array" means that the subroutine returns an arrayref in scalar context. =/

        My use of the word "return" corresponds directly to the Perl keyword "return" without trying to describe what return() itself actually does.

        That's where your mistake is.

        Consider that print 1 + 1; does not print 1 + 1. See how return @foo is not returning an array. Realise that split //, $foo is not splitting a regular expression. You can't just take syntax and use the same in English.

        Like I said before: if your documentation is there just to explain HOW your code works, then simply duplicate the code in a verbatim paragraph. Start documenting WHAT your code does. print 1 + 1; prints 2. return @foo returns either a list or a number of elements. Never an array. Just like how print @foo does not print an array and how with yoursub(@foo), yoursub never gets to use that array @foo. It receives a list (that is accessible through the special array @_, which not the array that was "passed").

        Juerd # { site => 'juerd.nl', plp_site => 'plp.juerd.nl', do_not_use => 'spamtrap' }

        A reply falls below the community's threshold of quality. You may see it by logging in.
Re: Re: Re: Re: Re: Re: What should be returned in scalar context? (best practice)
by jarich (Curate) on Dec 22, 2003 at 04:51 UTC
    Now let me try this:

    sub f1 { return (33, 42) } sub f2 { return my @x = (33, 42) } my $nelts = f1(); # apparently returns number of elements (2) my ($first) = f1(); # apparently returns first element (33) my $last = f2(); # apparently returns last element (42) my ($first2) = f2(); # apparently returns first element (33) print "nelts: $nelts (expecting 2)\n"; print "first: $first (expecting 33)\n"; print "last: $last (expecing 42)\n"; print "first2: $first2 (expecting 33)\n";

    Let's run that shall we?

    # Output: nelts: 42 (expecting 2) first: 33 (expecting 33) last: 2 (expecing 42) first2: 33 (expecting 33)

    Hmm. So your suggestion that you can know how it will act isn't even quite true for yourself. But only because you got your examples backwards.

    It's interesting though, because I never really bothered noticing a different between the two return options. What you're saying is that if you give return an array then the returned value gives you the number of elements in the list when taken in scalar context. Whereas if you give return a list then the return value gives the final element of the list in scalar context.

    An unfortunate inconsistency and one which probably causes many subtle bugs.

    To document that the first returns a list and the second returns an array is perfectly acceptable

    I understand why you're arguing that there is a distinction between these two cases and I can guess why you feel its better to describe the second as returning an array. However I do agree that the statement is technically incorrect. The subroutine may be returning as an array but what you get back on the other side is a list until you copy it into an array. And you want it to be a list too, as there's very little value of even thinking of it being an array except for the fact that you can take it in scalar context to get the number of elements.

    Since the issue is on semantics and your attempt to denote a difference between the two ways of returning a list, I cannot recommend a better choice for how you should document this. I would say that the subroutine returns a list or the number of elements in that list in scalar context, rather than saying it returns an array. But I can understand, even if I don't agree, your reasons for your choice of words.

    I'm now going to have to take especial care in future, when calling subroutines which return lists in scalar context. Will I be getting the final value of the length of the list, because I suspect I usually assume the latter.

    jarich

      First, thanks for catching the fact that I had my examples switched. As I explain in my update, I edited my description and failed to switch my examples to match. Good catch and I appreciate your close attention.

      An unfortunate inconsistency and one which probably causes many subtle bugs.

      No, this is not an inconsistency. It is, in fact, consistent:

      $ perl -le 'print scalar(33, 42)' 42 $ perl -le 'print scalar(my @x = (33, 42))' 2
      It is well understood that list literals do not act like arrays in scalar context. The expression is evaluated in the proper context before it is returned.

      But I can understand, even if I don't agree, your reasons for your choice of words.

      I'm not sure you actually do, although you are making an effort to and I appreciate that. Let me see if I can better explain the point that others have so completely missed. When I say, "this sub returns an array," my meaning is very specific and clear. The sub explicitly or implicitly return()s an array. What return() does is already clearly documented in perldoc perlfunc and it is not my responsibility nor would it be prudent for me to duplicate that documentation.

      Let's say I document that the sub "returns a list or the number of elements in that list in scalar context" as you suggest. If the developers decide that return() should behave differently, my documentation may become incorrect. If I say simply that my function "returns an array" and a change is made to return()'s behavior, I don't need to change my documentation. It remains correct. Granted, the likelihood of return() changing is pretty much non-existent... but that's completely beside the point. My documentation methodology is perfectly sound.

      -sauoq
      "My two cents aren't worth a dime.";