Alternative to nested-if on single return functions

Question

I'd like to see if there has been any precedent on alternatives to nested-ifs--particularly for error-code returns. My workplace requires one return per function, so I cannot early exit. Here is some sample code, rewritten to protect confidentiality:

static ErrorCode_t FooBar ( Input_t input ) {
    ErrorCode_t errorCode = FAIL;

    if ( condition1IsNotMet )
    {
        errorCode = ERROR_CODE_1;
    }
    else
    {
        doSomethingHere;

        if ( condition2IsNotMet )
        {
            errorCode = ERROR_CODE_2;
        }
        else
        {
            doAnotherThingHere;
        }
    }

    return errorCode;
}

I would have like to show my attempt toward writing an alternative to this; however, I can't think of any else frankly. I cannot do chained if, else-if, else-if, etc. because procedures must execute between condition checking.

Any suggestions would be appreciated.

Edit: Please note my requirements before marking as duplicate. I need to have a single return. I need to have behavior occur between condition checking. Also note the rationale behind a single return (though I do not agree) is for readability, not performance.

score 6 · Answer 1 · edited Jun 16 '20 at 10:01

Ah, Golden Rules of thumb that are worth too much to cut off.

Might I recommend taking whomever is enforcing that rule and giving them this piece of code to improve. Bring pop corn.

What you are writing is not a mathematical formula with a pithy implementation with a single line of logical execution.

What you are writing is a command. To follow the golden rule to its ultimate end will require serious refactoring.

static ErrorCode_t FooBar_impl_2( Input_t input )
{
    doAnotherThingHere;
    return FAIL;
}
static ErrorCode_t FooBar_impl_1( Input_t input )
{
    doSomethingHere;
return ( condition2IsNotMet )
    ? ERROR_CODE_2
    : FooBar_impl_2(input)
    ;

}
static ErrorCode_t FooBar ( Input_t input )
{
    return ( condition1IsNotMet )
        ? ERROR_CODE_1
        : FooBar_impl_1(input )
        ;
}

Now understand that...

Which is precisely how the optimising compiler is interpreting your example program.

Chalk 1 for the golden rule of single return, you the programmer are the compiler!

BUT.... I'm being disingenuous...

The golden rule of having a single return was coined in the days of assembler (or right now if you are writing in assembler). Back then a function, cough, cough sorry those didn't exist yet...

What I meant was that back then the section of assembler code whose first instruction was labelled with either a line number, or a shiny label could do pretty much whatever it pleased.

When you "called" - I mean jumped to - that instruction it did not have to return back to whomever "called" - I mean jumped to - it.

The convention of the time was to place the "return address" in a special register, or on the stack at a specific offset from the current top of the stack. The idea being that the jumped to set of instructions would do what it did, and jump back to that address.

Guess what smart programmers didn't do?

And that is how the rule of having a single return came to be. The section of jumped to code, must return back to the address it was given, and nowhere else.

Any modern function does exactly that. Whomever called it will be returned to when the function is done doing whatever it does (whether or not it actually finishes what it does is beside the point).

With that in mind, I would refactor as:

static ErrorCode_t FooBar ( Input_t input )
{
    if ( condition1IsNotMet )
        return ERROR_CODE_1;
doSomethingHere;

if ( condition2IsNotMet )
    return ERROR_CODE_2;

doAnotherThingHere;

return FAIL;

}

And the compiler guarantees the rule of thumb.

user949300 · Accepted Answer · 2019-11-15T17:30:04.213

I vote for multiple returns, but the following pattern sometimes fits. It assumes that a null (or 0) "error" means "all is well.

static ErrorCode_t FooBar ( Input_t input ) {
  ErrorCode_t errorCode = null;

  if ( condition1IsNotMet )
  {
    errorCode = ERROR_CODE_1;
  }

  if (!errorCode) {
    doSomethingHere;

    if ( condition2IsNotMet )
    {
        errorCode = ERROR_CODE_2;
    }
  }

  if (!errorCode) {
    doAnotherThingHere;
  }

  return errorCode;
}

(Added much later) Here is how the code might look if you broke it up into several smaller functions, each of which returns an ErrorCode_t.

static ErrorCode_t FooBar ( Input_t input ) {

  ErrorCode_t errorCode = checkPreconditions(input);

  if (!errorCode) {
    doSomethingHere();
    errorCode = checkCondition2();
  }

  if (!errorCode) {
    errorCode  = doAnotherThing();
  }

  return errorCode;
}

ianmandarini · Answer 3 · 2019-11-14T23:19:16.493

You are asking the wrong question

You have asked about alternatives for your nested-ifs but I believe this is a case of a XY Question.

The XY problem is asking about your attempted solution rather than your actual problem.

That is, you are trying to solve problem X, and you think solution Y would work, but instead of asking about X when you run into trouble, you ask about Y.

I believe that the true question you are facing is one about clean code/ code quality.

Asking how to rewrite nested-ifs is just your current attempt at trying to clean the FooBar function. As a result, I will attempt to answer the following question:

How can I improve the code quality of my FooBar function?

For my answers, I will be referencing and quoting the brilliant must-read book by Robert C. Martin Clean Code: A Handbook of Agile Software Craftsmanship

Your functions should do one thing

Take a look at your example. Does it look that it does only one thing?

static ErrorCode_t FooBar (Input_t input)
{
    ErrorCode_t errorCode = FAIL;
    if (condition1IsNotMet) {
        errorCode = ERROR_CODE_1;
    } else {
        // doSomethingHere
        if (condition2IsNotMet)
            errorCode = ERROR_CODE_2;
        else
            // doAnotherThingHere
    }

    return errorCode;
}

It doesn't.

And the various if-elses dividing the code into big chunks is a good indication of so. Your code clearly has sections, making them inherently more complex to reason.

Functions should do one things. They should do it well. They should do it only.

Your functions should be small

Functions should Hardly be 20 lines long.

in fact, most functions could be expected to be less than 10 or even less than 5 lines long. Wait! How am I going to write a nested-if with so few lines? Well, Robert C. Martin writes:

(...) blocks within if statements, else statements, while statements, and so on should be one line long. Probably that line should be a function call. Not only does this keep the enclosing function small, but it also adds documentary value because the function called withing the block can have a nicely descriptive name.

This also implies that functions should not be large enough to hold nested structures. (...)

Refactoring your code

Using these two principles, it should be clear that the doSomethingHere and doAnotherThingHere should be independent functions and that we need to break your nested if. It would end up looking something like this:

static ErrorCode_t FooBar(Input_t input)
{
    if (condition1IsNotMet)
        return ERROR_CODE_1;
    return DoSomethingHere(input);
}

static ErrorCode_t DoSomething(Input_t input)
{
    // doSomethingHere
    return HandleMoreThings(input);
}

static ErrorCode_t HandleMoreThings(Input_t input)
{
    if (condition2IsNotMet)
        return ERROR_CODE_2;
    return DoAnotherThing(input);
}

static ErrorCode_t DoAnotherThing(Input_t input)
{
    // doAnotherThingHere
    return FAIL;
}

How you break up your functions depend of the one thing they do

I can only speculate about the behavior of your code. But now we can ask the questions for each one of the smaller functions:

static ErrorCode_t FooBar(Input_t input);
static ErrorCode_t DoSomething(Input_t input);
static ErrorCode_t HandleMoreThings(Input_t input);
static ErrorCode_t DoAnotherThing(Input_t input);

For each one of them, what is the expected result? The final code may vary according to the actual intent of your code.

For example, you could argue that the code should be written as:

static ErrorCode_t FooBar(Input_t input)
{
    if (condition1IsNotMet)
        return ERROR_CODE_1;

    result = DoSomethingHere(input);

    if(result == SUCCESS);
        return ERROR_CODE_2
    else
        return FAIL;
}

static AnotherErrorCode_t DoSomething(Input_t input)
{
    // doSomethingHere
    return HandleMoreThings(input);
}

static AnotherErrorCode_t HandleMoreThings(Input_t input)
{
    if (condition2IsNotMet)
        return SUCCESS;
    DoAnotherThing(input);
    return FAIL;
}

static void DoAnotherThing(Input_t input)
{
    // doAnotherThingHere
}

I cannot tell without the details.

The one thing your code does cannot be told by only looking at its conditional statements. Hence, you will need to adapt this answer to your code using your best judgment.

A note about the "only one return statement" constraint

In the end of your question, you have added:

Please note my requirements before marking as duplicate. I need to have a single return. I need to have behavior occur between condition checking. Also note the rationale behind a single return (though I do not agree) is for readability, not performance.

By actually dividing your functions in smaller chunks, you get readability without that artificial rule of "having only one return in the end". in fact, that rule seems to be an artificial attempt to solve a "bad code" problem.

Also, even if you have to follow the one return at the end rule (perhaps because of a company guideline), it is much easier to do so if you divide your code into smaller chunks.

Bottom Line

The problem is neither having multiple returns or writing the nested-ifs in a good way.

The problem is that multiple returns indicate that your code does not do one thing only. Same thing for the nested-ifs.

By organizing your code in smaller chunks, you will get a better code.

score 0 · Answer 4 · answered Nov 15 '19 at 06:20

user949300's suggestion is very good, but it assumes that the you want to skip all the "optional" steps as soon as an error is encountered and makes "early success" more cumbersome.

I think those are both valid assumptions nearly all the time, but in case they aren't, here are two ways of skipping some steps independent of the value of the status code you're eventually going to return.

(Ab)use a builtin control-flow structure such as do ... while or switch
Use a keep-going variable, possibly hiding it behind a macro.

Here's an example using a do ... while loop.

// foo_do_while.c
#include <stdio.h>
#include <stdbool.h>

#define CONDITION_1 true
#define CONDITION_2 true

#define ERROR_CODE_INVALID (-1)
#define ERROR_CODE_1 1
#define ERROR_CODE_2 2

void do_something(void) {
    printf("do_something\n");
}

int no_nest(void) {
    int error_code = ERROR_CODE_INVALID;
    do {
        if (CONDITION_1) {
            printf("1\n");
            error_code = ERROR_CODE_1;
            break;
        }
        do_something();
        if (CONDITION_2) {
            printf("2\n");
            error_code = ERROR_CODE_2;
            break;
        }
    } while(0);
    return error_code;
}

int main() {
    printf("no_nest status: %d\n", no_nest());
    return 0;
}

and using a switch statement.

// foo_switch.c
#include <stdio.h>
#include <stdbool.h>

#define CONDITION_1 true
#define CONDITION_2 true

#define ERROR_CODE_INVALID (-1)
#define ERROR_CODE_1 1
#define ERROR_CODE_2 2

void do_something(void) {
    printf("do_something\n");
}

int no_nest(void) {
    int error_code = ERROR_CODE_INVALID;
    switch (0) {
    case 0:
        if (CONDITION_1) {
            printf("1\n");
            error_code = ERROR_CODE_1;
            break;
        }
        do_something();
        if (CONDITION_2) {
            printf("2\n");
            error_code = ERROR_CODE_2;
            break;
        }
    }
    return error_code;
}

int main() {
    printf("no_nest status: %d\n", no_nest());
    return 0;
}

Here's using a keep-going variable explicitly.

#include <stdio.h>
#include <stdbool.h>

#define CONDITION_1 false
#define CONDITION_2 true

#define ERROR_CODE_INVALID (-1)
#define ERROR_CODE_1 1
#define ERROR_CODE_2 2

void do_something(void) {
    printf("do_something\n");
}

int no_nest(void) {
    bool keep_going = 1;
    int error_code = ERROR_CODE_INVALID;
    if (keep_going) {
        if (CONDITION_1) {
            printf("1\n");
            error_code = ERROR_CODE_1;
            keep_going = 0;
        }
    }
    if (keep_going) {
        do_something();
    }
    if (keep_going) {
        if (CONDITION_2) {
            printf("2\n");
            error_code = ERROR_CODE_2;
            keep_going = 0;
        }
    }
    return error_code;
}

int main() {
    printf("no_nest status: %d\n", no_nest());
    return 0;
}

It's also possible to hide the keep_going check behind a macro, assuming that macros are not prohibited by the same standards that prohibit multiple returns. This example shows a macro taking a block and a begin/end pair of macros.

#include <stdio.h>
#include <stdbool.h>

#define CONDITION_1 false
#define CONDITION_2 true

#define ERROR_CODE_INVALID (-1)
#define ERROR_CODE_1 1
#define ERROR_CODE_2 2


#define CHK_KG(block) \
    do { \
        if (keep_going) { \
            block \
        } \
    } while (0);

#define BEGIN_CHK_KG \
    do { if (keep_going) {   {{{{{{{{{{

#define END_CHK_KG \
    }}}}}}}}}} } } while(0);

void do_something(void) {
    printf("do_something\n");
}

int no_nest(void) {
    bool keep_going = 1;
    int error_code = ERROR_CODE_INVALID;
    CHK_KG(
        if (CONDITION_1) {
            printf("1\n");
            error_code = ERROR_CODE_1;
            keep_going = 0;
        }
    )
    BEGIN_CHK_KG
    do_something();
    END_CHK_KG
    if (keep_going) {
        if (CONDITION_2) {
            printf("2\n");
            error_code = ERROR_CODE_2;
            keep_going = 0;
        }
    }
    return error_code;
}

int main() {
    printf("no_nest status: %d\n", no_nest());
    return 0;
}