Programming Patterns - CP1404/Starter GitHub Wiki

This page contains standard 'patterns' that you should get used to. When you need to perform one of these tasks, follow the standard pattern.

Use the menu on the right to choose a section.

Go here for our Style Guide (things like naming and commenting)

For the most part, this guide is not language-specific, so many patterns are presented as pseudocode.
Python 3 is used where actual code is provided... and Python is similar enough to pseudocode that this is usually suitable as a pattern.

Main program structure

For most programs, you will have a main function and a number of other functions.
Think of main as the whole program with the other functions as the tools that main uses, with the details abstracted away.

main should go at the top of your file, and someone reading your code for the first time should be able to read only main and understand what the program does... that is, main should "look like" the whole program.
In the following example pseudocode, the specifics don't matter, but you can see the program structure and how the functions contain the detail. Notice that the order of the first few items is what we expect based on Python best-practices (e.g., global constants come after imports, which come after the module docstring comment).

""" module-level docstring """
import statements
CONSTANTS

function main()
    opening statement
    result = do_step1()
    do_step2(result)
    closing statement

function do_step1()
    ...
    return result

function do_step2(parameter)
    ...


main()

Decision structures

When you need to make a decision in your program, you usually use one of the following patterns.
(See the repetition section below for when you need to repeatedly make decisions, e.g., for most error-checking.)

The examples below use scenarios where you want to print a result for a given score, where score is an integer. Each situation could be stand-alone, or part of a loop, like for score in scores:

if, no else

Use this if you want to do something when the condition is true, but do nothing when it's false. Like, if it's raining, take an umbrella. You don't need to say "else don't take an umbrella".
In this example, we don't want to print anything for the non-outstanding scores, so there's no else.

if score >= 90:
    print("That's outstanding!")

Some beginning programmers use else for no reason, like the following example... This is redundant and never of any value... Don't do it.

if score >= 90:
    print("That's outstanding!")
else:
    pass  # Don't do this :(

if, else

Use this if you want to do something when the condition is true, and something different when it's false.
In this example, we want to print a result for the score no matter what its value is. Note that we do not need a second condition (elif) to handle the "fail" case, because if the score is not >= 50 we already know that it must be < 50.

if score >= 50:
    print("Pass")
else:
    print("Fail")

if, elif, else

Use this if you want to handle all cases in some way - in our example, there will be an output printed for every possible score. This is the pattern that we use for menus as well - handle each menu option we know about and the trailing else handles the invalid option (see below).
In this example, we always want to print one result for the score no matter what its value is.

if score >= 90:
    print("Excellent")
elif score >= 50:
    print("Passable")
else:
    print("Bad")

if, elif, no else

Similar to the if with no else, use this when you want to handle multiple possible cases, but there will be some cases where there is no result/outcome handled. The cases are mutually exclusive, but you're happy to do nothing in some cases.
This is rare.
If you use this pattern, ask yourself:

"what case(s) do I NOT want to handle?", or "What scenario(s) do I want to ignore?"

If there is no answer to that question, you should not use this pattern. (Often you've chosen a last elif that should just be an else.)

In the following example, the very high scores win a prize, but the others don't, and we don't need to tell them. (E.g., at graduation, they announce which graduates get a University medal, but they don't say which ones do not get a medal.)

if score >= 90:
    print("You win a car!")
elif score >= 80:
    print("You win a horse :)")  # but you do not win a car AND a horse

if, if, if

Use this when you want multiple outcomes. That is, the results/outcomes are not mutually exclusive. One condition being true does not affect the other conditions.
In the following example, we want to print all the results that a score could achieve.

if score >= 90:
    print("You win a car!")
if score >= 80:
    print("You win a horse :)")  # here, you can win BOTH a car and a horse
if score >= 50:
    print("You passed")

So, as you design your decision structures, recognise what each pattern is for and how it applies to your situation.
E.g., You would not use the "if, if, if" pattern for determining a grade (F, C, HD...) from a percentage because you don't want multiple outcomes (grades) - you know that would be inefficient since those grades are mutually exclusive - as soon as we know what grade it is, we don't need to ask any more.

Boundary conditions

(This applies to both decision and repetition structures so it's here between them.)
Some of the most common programming errors happen with boundary conditions and so these should usually always be tested explicitly. In the examples above, 50 is a pass. But if we get our boundary condition wrong, we might have something like:

if score > 50:
    print("You passed")
# or
if score > 49:
    print("You passed")

In the first case (score > 50), this works for all values greater than 50, but it would make 50 a fail, not a pass as it should be. If you test your code using the boundaries as input values, you will see that 50 is not a pass. If you have a program with 7 boundaries (e.g., F, P, C, D, HD, too high, too low), you'll need to test all 7, plus some others.
In the second case, this works for now, but we have 2 problems: the problem domain specifies that 50 is a pass, so we should use the value 50, not change it to something we hope works - there's a chance we might make a mistake; secondly, if we change score to be a float instead of an int we now have failing values like 49.1 that will result in pass when they should not!

Did you catch that?
Use the values and names in the problem domain - e.g., the problem description says that a score of 50 or more_ is a pass, so use the values and names: score, 50, pass.
It's much harder to make a mistake when you're following what the problem description says... Just check those boundaries when you write them (> 50 or >= 50 or < 50 or <= 50 or == 50...?) and test them!

Decision or Repetition?

Choose decision (if...) when you wish to do something at most once (0 or 1.
Choose repetition (while/for) when you wish to do something potentially more than once (0 or more).

Here is an example of the difference between if and while

One:

if I am hungry
    eat a mouthful

Two:

while I am hungry
    eat a mouthful

With the first one (if), I only eat one mouthful... I might still be hungry!
With the second (while), there's repetition, so I eat as many mouthfuls as needed until I am not hungry.
In both cases, if/while I'm not hungry, I don't eat any mouthfuls.

Repetition structures

In most languages, there are multiple kinds of loops and you should choose the most appropriate kind.
The most common choice is:

Use for loops for definite iteration, like for item in sequence...
Use while loops for indefinite iteration, like while condition...

Using a while loop and maintaining your own counter (e.g., using a while loop to iterate through the numbers from 1 to 10) would be considered an anti-pattern, since this is what for loops are for!
Using a for loop and maintaining your own counter (e.g., iterating through elements in a list and manually using +1 for the index) would also be poor, since for loops can do this for you.

For loops (definite iteration)

For loops are mostly used when you want to do something with each item in a sequence. Let's consider three common scenarios.

Sequence of numbers

Sequences of numbers can be generated with range.

for i in range(10):
    # do something with i

Collection of values

When you have a variable or expression containing items, you could iterate over the range of indexes (sequence[i]), but this would be unnecessary and not very 'Pythonic'.
We want to use the standard convention:

for item in sequence:
    # do something with item

Note that we will very often end up with loops of the form:

for singular in plural:
    ...

Example, for dog in dogs, for number in numbers, for book in books...

Both index and element

Use enumerate when you want both the index and the element. Example:

names = ["Barry", "Tux", "Ada", "Maggie"]
for i, name in enumerate(names):
    print(i, " - ", name)

While loops (indefinite iteration)

Almost all while loops follow the same standard pattern (as below with menus and error checking).
Do not force the loop to be True the first time by setting a value for your loop condition variable, and do not use while True... unless this is really the best way to do it.

<priming read - do something the loop will depend on, e.g., get/calculate a number>
while <condition based on something from above>
    <body of the loop - do the thing you want to repeat>
    <same as the priming read again>
<do next thing now that the loop is finished (condition was false)>

Example - number guessing game

SECRET = 6
guess = int(input("? "))
while guess != SECRET:
    print("Guess again!")
    guess = int(input("? "))
print("You got it!")

Menus

Use the if/elif.../else pattern in Python (switch statements in other languages) inside a while loop that handles the quit option.

display menu
get choice
while choice != <quit option>
    if choice == <first option>
        <do first task>
    else if choice == <second option>
        <do second task>
    ...
    else if choice == <n-th option>
        <do n-th task>
    else
        display invalid input error message
    display menu
    get choice
<do final thing, if needed>

Sometimes inexperienced coders use a while True or while Boolean loop for menus, and handle the quit option with an extra if, which makes it less readable than our standard pattern since it doesn't clearly indicate how the loop stops.
With a readable (meaningful) condition you know without having to read the rest of the code how the loop will end.
Also, by having the "final thing" outside the loop, you have less indenting (a Zen of Python value) and it's easier to see that this code runs after the menu loop quits.

Error checking

<priming read - get some input>
while <input is bad>
    display error message
    <same as the priming read again - get some input>
do next thing now that you know the input is valid

Example:

age = int(input("Age: "))
while age < 0:
    print("Invalid age!")
    age = int(input("Age: "))
print("You are {} years old".format(age))

Exception-based error checking (CP1404 only; not used in CP1401)

You can't have a 'normal' priming read since it might crash before you get to the condition, so you need your try/except inside a loop that you control.

Example:

is_valid_input = False
while not is_valid_input:
    try:
        age = int(input("Age: "))
        if age < 0:
            print("Age must be >= 0")
        else:
            is_valid_input = True
    except ValueError:
        print("Invalid (not an integer)")
print("Next year you will be", age + 1)

Function with error checking

Suppose you have a function that should do a task only there are no errors, like adding a value to a collection if it's valid.
In this case, you can check for errors first, then do the task if there are no errors... instead of checking it's valid and doing the task if it's valid.
The difference is in the nesting level. You want your main task to be at the highest (outermost) level, not inside an if/else.
Example structure:

function do_task(input)
    if input has error 1
        display error message 1
        return
    if input has error 2
        display error message 2
        return
    do task (knowing we don't have any of the above errors)

Boolean-returning error-checking function

Similar to the above pattern, but when you want to check if an input is valid using a function.
This pattern uses a kind of "innocent until proven guilty" approach. The last line returns True (input is valid), but only if the code reaches there. If any problem is found (you can have as many tests as you need), the function returns False.
Note that you do not need else/elif after a return.

function is_valid(input)
    if input has error 1
        return False
    if input has error 2
        return False
    return True

Finding

function find(needle, haystack)
    for each item in haystack
        if item == needle (or however we compare to find the needle in the haystack)
            return item (or whatever is meant to happen)
    return None (since we did not find it)

There's no need for else or continue since the loop will move to the next item.

Filtering

filtered_items = new list
for each item in items
    if item matches what we want
        add item to filtered_items

There's no need for else or continue since the loop will move to the next item.

Note that Python (and many languages) have neat shortcuts for filtering, e.g., using list comprehensions:

filtered_items = [item for item in items if item matches what we want]

Accumulation

The accumulation pattern varies depending on how you need to loop (definite, indefinite) and what it is you are accumulating, but always has common steps, such as:

initialising a total (before the loop)
accumulating (adding onto the total) (inside the loop)
using the value (outside the loop... but you may also want it inside the loop)

The following pseudocode also shows a counter, assuming we don't know how many times the loop will run.

total = 0
number_of_things = 0
repeat <somehow>
    get/determine/generate/calculate value
    total = total + value
    number_of_things += 1
average = total / number_of_things
print total, average

Constants

In some cases, replacing a literal with a named constant can help make your code more readable and maintainable.
Consider the following program:

print("If you buy over 5 items, save 10%!")
number_of_products = int(input("Number of products: "))
while number_of_products <= 0:
    print("Invalid number")
    number_of_products = int(input("Number of products: "))
total = number_of_products * 32.5
if number_of_products > 5:
    total -= total * 0.1
print(f"{number_of_products} x ${32.5:.2f} products = ${total:.2f}")

The above program works, but contains "magic numbers" that are used more than once.
Magic numbers are not evil. You do NOT need to change any literal into a named constant, but there are some general guidelines to help decide when you should probably use constants.

If the value is used more than once
If introducing a name is more helpful than the number

Notice in this program that we reuse the numbers 5 and 32.5 more than once.
Again, it's not a rule that we replace these with constants, but it could be considered good practice.
Note that these numbers are used in both strings and as numbers in calculations.

A good way to consider the use of constants is to ask yourself a question like:
"What if I wanted to change this later?"
E.g., what if the threshold for a discount changed from 5 items to 10 items? How many places would I need to change my code? One is better than two, so let's use a constant.

There's another interesting one in this example. 10% is here twice, so... "what if the discount changed from 10% to 15%?"
The value 10% only appears in the string, but the value 0.1 appears in the calculation, and these are the same.
We do not want to break our program by having a constant for the 0.1 and forgetting to use it for the print 10% part.
This brings us to a rule for using constants:

If you have a constant, then you MUST use it everywhere the value exists.

Here's our code with three introduced constants. It works the same way, but is more readable and maintainable (easier to modify and extend).
You might notice that the constants are all at the top, like configuration 'variables'.

One more thing to notice: There's no real benefit in turning the literal 0 into a constant as it's not a value that will ever change or that needs further explaining. It's just... zero (no products or some products).

# version 2 - notice how it is easier to read,
# and now we only have one place to change if we need to update the values
DISCOUNT_THRESHOLD = 5
ITEM_PRICE = 32.5
DISCOUNT_RATE = 0.1

print(f"If you buy over {DISCOUNT_THRESHOLD} items, save {DISCOUNT_RATE * 100:.0f}%!")
number_of_products = int(input("Number of products: "))
while number_of_products <= 0:
    print("Invalid number")
    number_of_products = int(input("Number of products: "))
total = number_of_products * ITEM_PRICE
if number_of_products > DISCOUNT_THRESHOLD:
    total -= total * DISCOUNT_RATE
print(f"{number_of_products} x ${ITEM_PRICE:.2f} products = ${total:.2f}")

Don't Repeat Yourself (DRY)

DRY is a principle to help avoid 'bad patterns' rather than a pattern itself.
Here's a counter-example to show you what NOT to do.

score = int(input("Score: "))
if score < 0:  # condition 1
    result = score * 2
    print("Bad score :(")
elif score >= 0 and score < 20:  # condition 2
    result = score * 2
    print("Score is OK.")
elif score > 20:  # condition 3
    result = score * 2
    print("Your score is good!")
print("Double your score is", result)

This program works, so what's the problem?

Remember, we don't just want working code, we want good code!
condition 2 is only checked if condition 1 is False. if condition 1 is False, this is because score must be not < 0, so score >= 0 is redundant. It will always be True. condition 2 should be replaced by just score < 20
condition 3 appears to check if condition 2 was False, which we already know, but because this code uses "elif no else" we might just make a mistake like getting the boundary condition wrong. What happens if the user enters 20? Oops! The right choice of pattern is important! We can fix this by changing condition 3 to score >= 20 but then we ask a question we can guarantee will always be True when we get to it (since the first 2 were False), so that's repeating ourselves. DRY.
Lastly, in all three paths, we repeat the line result = score * 2. Again, this works, but is not good. Since we always want to do this, it should go outside the decision structure.

Here's the code with these problems fixed. Ah, that's better :)

score = int(input("Score: "))
if score < 0:
    print("Bad score :(")
elif score < 20:
    print("Score is OK.")
else:
    print("Your score is good!")
result = score * 2
print("Double your score is", result)

Working with Booleans

In most cases, where you are dealing with a condition or value and you care about whether it is true or false, then you never need to compare to True or False. E.g., instead of:

if condition == True:
... or
if condition == False:

You can just use:

if condition:
... or
if not condition:

If you are ever returning (or setting a variable to) True/False depending on a condition, you can just return (or set) to the condition. So, instead of:

if condition:
    return True
else:
    return False

You can just use:

return condition

Function design

The most important aspect of function design is the Single Responsibility Principle (SRP), which means that functions should "do one thing". What "one thing" means depends on the context, but a single function should be an abstraction of a single task.

In general, there are 3 kinds of functions, those that are designed to:

get input (from the user or another source)
process data
produce output (to the console, file, or another sink)

Very commonly, the structure in terms of parameters and return statements will look something like (example):

function main():
    data = get_input()
    result = process(data)
    display(result)

(This is a simplification to make the point about reusability, not a rule that never changes.)
That is:

input-getting functions don't always take in parameters, but they do return what they get.
data-processing functions do take in parameters (they do NOT get the input data from the user or other source), and they do return the results (they do NOT display/print/save the result)
output-producing functions do take in parameter (what they are to display), but do not return anything

Why is this important?

Some good ways to understand function design include asking these questions about function reuse:

What if we wanted to rewrite the program's interface in French or Farsi? We should not have to change the processing function, because it should not do any user interface things (input or output on the screen).
What if we wanted to get our input from a file instead of the user? We should not have to change the processing function because it should not get any user input. A well-designed function can be used with input either from the user or a file (or anywhere), because it takes in the input as parameters.
Same as above for if we wanted to write the output to a file instead of display it on the screen... the processing function shouldn't care where the data (input parameters) comes from, or where the results (return values) go, since that's not its job.
Functions designed like this are more testable. You can write test code that passes in inputs and compares outputs (returned values) to known correct results for those inputs (e.g., using Python's assert statement or doctest module). You really can't easily "test" functions in any automated way if they get user input or print results.

Note about functions that populate lists

When you write a function to populate a list, such as a function that loads data from a file or gets user input, create the empty list inside the function. Do not pass in an empty list for the function to change.
Doing it this way is more common, easier to understand, easier to name, and makes the function more reusable.
If you did pass in a list, then you can't use the function without first creating a list.
An exception to this would be if you wanted to add to an existing list. However, this is easily overcome by returning a new list and then adding that to your existing list outside the function.
So...

Do

    values = get_values()

Don't

    values = []
    get_values(values)

Data storage

Always store data in the best, most correct, format.
E.g., if you read a price from console or file input, it will initially be a string, but you should convert it and store it as a float.
If you want to print it using string formatting (e.g., $23.40), don't store it as a string, just print it that way... leaving the variable as a float.

(Another example) If you're asking the user to make a yes/no choice, they might click a button or type "yes", but then you would convert this and store it as a Boolean because that's the most appropriate type to store a yes/no (True/False) state.

In general, don't store derivable data.
This creates a maintenance burden. Even if your code works correctly, when you maintain it and add to it, you have to remember to update the same information in multiple places.
E.g., don't store age if you already have a date of birth (DOB). Doing so can lead to inconsistency, e.g., your age might not get updated when the date changes. Just calculate the age when you need it and it can't be wrong.
Don't store the length of a list... that's derivable and can be retrieved at any time (unless you're using a language where this is not the case).
Don't store two constants/variables when one is just derived from the other. Like in the example below... we should NOT have to change two constants, just the first one:

MAXIMUM = 10
HALF_MAX = 5

print(f"Enter a number between {HALF_MAX} and {MAXIMUM}")

Never

Here are a few things you should never do... You can consider these to be "anti-patterns".
("Never" is a strong word, and there will likely be some rare situations where you might maybe sometimes want to do these things, but it's very unlikely.)

Never set a variable value you don't use. Why define/set a variable, ignore what you just did and set it to something new?
Never replace function parameters: If you have a function that takes in a parameter (x), you will never want to set that variable (x) immediately... otherwise, why would you pass it in? (This is the same as the previous point.)
Never convert to the same type: don't convert from type A to type A. E.g., in Python, the input function always returns a str type, so you never need to write something like x = str(input("?"))... or y = int(0).
Never programmatically modify a literal. Since they're literals, just use the value you want. So, instead of: float(0), use 0.0. Don't use "hello".upper(), just use "HELLO". Why make the computer work more by using 11 / 2 when it's more efficient to use 5.5?
Never use the verbose (unbound) syntax for method calls unless you need it: You should always prefer the concise (bound) format. E.g., use "Hello".upper() not str.upper("Hello").
Never use an existing variable as a for loop's target variable. It replaces what the original variable is. The target variable should always be a new variable (i is common).
Similarly, never replace the target variable with a different value. If you have something like for thing in things: thing = something_else then what's the point of the first thing?
Never use while True loops if you can easily enough use a "standard" while loop. If you have to write an if statement to break out of a loop, that if-condition should probably just be your normal loop condition.