Recursion and Backtracking - A Practical Deep Dive

All Notes

Code

Recursion and backtracking are two fundamental algorithmic techniques that often go hand-in-hand. While distinct, understanding one often paves the way for grasping the other, especially as backtracking is frequently implemented using recursion. This post will demystify both, showing you how they work with practical, runnable code examples.

We’ll focus on understanding the core concepts, common pitfalls, and when to apply these powerful tools.

Understanding Recursion: The Self-Referential Path

At its heart, recursion is a method of solving a problem where the solution depends on solutions to smaller instances of the same problem. Think of it as a function that calls itself, directly or indirectly, to solve a piece of the puzzle.

Every recursive function must have two main components:

Base Case(s): One or more conditions that stop the recursion. Without a base case, the function would call itself indefinitely, leading to a stack overflow. This is the “trivial” solution that can be solved directly without further recursion.
Recursive Step: The part where the function calls itself with a modified (usually smaller) input, moving closer to the base case.

How Recursion Works: The Call Stack

When a function is called, information about that call (parameters, local variables, return address) is pushed onto a data structure called the call stack. When a recursive function calls itself, new frames are pushed onto the stack for each call. When a base case is reached, the function starts returning, and these frames are popped off the stack, unwinding the calls.

Let’s illustrate with simple examples.

Example 1: Factorial Calculation

The factorial of a non-negative integer n, denoted n!, is the product of all positive integers less than or equal to n. For example, 5! = 5 * 4 * 3 * 2 * 1 = 120.

The recursive definition is:

n! = n * (n-1)! for n > 0
0! = 1 (This is our base case)

# factorial.py

def factorial(n: int) -> int:
    """
    Calculates the factorial of a non-negative integer recursively.
    """
    # Base case: factorial of 0 is 1
    if n == 0:
        return 1
    # Recursive step: n! = n * (n-1)!
    else:
        return n * factorial(n - 1)

if __name__ == "__main__":
    print(f"Factorial of 0: {factorial(0)}")
    print(f"Factorial of 1: {factorial(1)}")
    print(f"Factorial of 5: {factorial(5)}")
    print(f"Factorial of 10: {factorial(10)}")

    # Example of a large number (might hit recursion limit in Python for very large N)
    # Note: Python has a default recursion limit (often 1000). For larger numbers,
    # you might need to increase it (sys.setrecursionlimit) or use an iterative solution.
    # print(f"Factorial of 1000: {factorial(1000)}")

Factorial of 0: 1
Factorial of 1: 1
Factorial of 5: 120
Factorial of 10: 3628800

Execution Flow for factorial(3):

factorial(3) is called. n is not 0, so it returns 3 * factorial(2).
factorial(2) is called. n is not 0, so it returns 2 * factorial(1).
factorial(1) is called. n is not 0, so it returns 1 * factorial(0).
factorial(0) is called. n is 0 (base case), so it returns 1.
factorial(1) receives 1 from factorial(0). It computes 1 * 1 = 1 and returns 1.
factorial(2) receives 1 from factorial(1). It computes 2 * 1 = 2 and returns 2.
factorial(3) receives 2 from factorial(2). It computes 3 * 2 = 6 and returns 6.

Example 2: Fibonacci Sequence

The Fibonacci sequence is a series of numbers where each number is the sum of the two preceding ones, usually starting with 0 and 1. F(n) = F(n-1) + F(n-2)

F(0) = 0 (Base case 1)
F(1) = 1 (Base case 2)

# fibonacci.py

def fibonacci(n: int) -> int:
    """
    Calculates the n-th Fibonacci number recursively.
    """
    if n <= 0:
        return 0
    elif n == 1:
        return 1
    else:
        return fibonacci(n - 1) + fibonacci(n - 2)

if __name__ == "__main__":
    print(f"Fibonacci(0): {fibonacci(0)}")
    print(f"Fibonacci(1): {fibonacci(1)}")
    print(f"Fibonacci(2): {fibonacci(2)}")
    print(f"Fibonacci(5): {fibonacci(5)}")
    print(f"Fibonacci(10): {fibonacci(10)}")

    # Note: Naive recursive Fibonacci is very inefficient due to redundant calculations.
    # For example, fibonacci(5) calls fibonacci(3) and fibonacci(4).
    # fibonacci(4) then calls fibonacci(2) and fibonacci(3) again!
    # For larger numbers (e.g., fibonacci(30)), you'll notice a significant delay.
    # This is a classic example where memoization (dynamic programming) or an iterative
    # approach is preferred for performance.
    # print(f"Fibonacci(30): {fibonacci(30)}")

Fibonacci(0): 0
Fibonacci(1): 1
Fibonacci(2): 1
Fibonacci(5): 5
Fibonacci(10): 55

This Fibonacci example highlights a common issue with naive recursion: redundant computation. Many subproblems are solved multiple times. This is where techniques like memoization (caching results of expensive function calls) or dynamic programming become crucial.

Example 3: Directory Listing (File System Traversal)

Recursion is naturally suited for problems involving tree-like or graph-like data structures, such as file systems. Listing files and subdirectories recursively is a great real-world use case.

# list_files.py
import os

def list_files_recursively(path: str, indent: int = 0):
    """
    Recursively lists files and directories from a given path.
    """
    prefix = " " * indent * 4
    try:
        for item in os.listdir(path):
            item_path = os.path.join(path, item)
            if os.path.isfile(item_path):
                print(f"{prefix}- {item}")
            elif os.path.isdir(item_path):
                print(f"{prefix}+ {item}/")
                # Recursive call for subdirectories
                list_files_recursively(item_path, indent + 1)
    except PermissionError:
        print(f"{prefix}Error: Permission denied for {path}")
    except FileNotFoundError:
        print(f"{prefix}Error: Path not found {path}")

if __name__ == "__main__":
    # Create a dummy directory structure for demonstration
    dummy_dir = "temp_recursive_test_dir"
    os.makedirs(os.path.join(dummy_dir, "subdir1", "subsubdir_a"), exist_ok=True)
    os.makedirs(os.path.join(dummy_dir, "subdir2"), exist_ok=True)

    with open(os.path.join(dummy_dir, "file1.txt"), "w") as f: f.write("content")
    with open(os.path.join(dummy_dir, "subdir1", "file2.txt"), "w") as f: f.write("content")
    with open(os.path.join(dummy_dir, "subdir1", "subsubdir_a", "file3.py"), "w") as f: f.write("content")

    print(f"Listing contents of: {dummy_dir}/\n")
    list_files_recursively(dummy_dir)

    # Clean up dummy directory (optional)
    import shutil
    shutil.rmtree(dummy_dir)

Listing contents of: temp_recursive_test_dir/

- file1.txt
+ subdir1/
    - file2.txt
    + subsubdir_a/
        - file3.py
+ subdir2/

Note: This example uses os.listdir, os.path.isfile, and os.path.isdir from Python’s os module. The base case here is implicitly when os.listdir(path) returns an empty list, meaning there are no more items to process in that directory. The indent parameter helps visualize the depth of recursion.

Backtracking: Exploring Paths and Undoing Choices

Backtracking is an algorithmic technique for solving problems recursively by trying to build a solution incrementally, one piece at a time. It works by exploring all possible paths (or combinations of choices) to find a solution.

The core idea is:

Explore a choice: Make a decision and add it to the current partial solution.
Recurse: Move to the next step, trying to complete the solution with the current choice.
Check for validity: If the current path leads to an invalid or non-optimal state, or a dead end, we discard the current choice.
Backtrack (Un-try): Remove the last choice made and try a different one. This “undoing” of choices is what gives backtracking its name.

Backtracking problems often involve:

Finding all permutations or combinations.
Solving puzzles (e.g., Sudoku, N-Queens).
Finding paths in a maze.

Example 1: N-Queens Problem

The N-Queens puzzle is the problem of placing n non-attacking queens on an n × n chessboard. A queen can attack horizontally, vertically, and diagonally. This is a classic backtracking problem.

The strategy:

Place queens one row at a time.
For each row, try placing a queen in every column.
Before placing, check if the position is safe (not attacked by previously placed queens).
If safe, place the queen, and recursively try to place a queen in the next row.
If a placement leads to a dead end (cannot place queens in subsequent rows), backtrack: remove the queen from the current position and try the next column.

We can keep track of occupied columns and diagonals using sets for efficient lookups.

# n_queens.py

def solve_n_queens(n: int) -> list[list[str]]:
    """
    Finds all distinct solutions to the N-Queens puzzle.
    Returns a list of board configurations, where each configuration is a list of strings.
    """
    solutions = []
    # Sets to keep track of occupied columns and diagonals
    cols = set()
    pos_diag = set() # (row + col)
    neg_diag = set() # (row - col)

    board = [["." for _ in range(n)] for _ in range(n)]

    def backtrack(r: int):
        # Base case: All queens placed (reached the end of the board)
        if r == n:
            # Found a valid solution, add a copy of the current board state
            solutions.append(["".join(row) for row in board])
            return

        # Iterate through columns for the current row
        for c in range(n):
            # Check if the current position (r, c) is safe
            if c in cols or (r + c) in pos_diag or (r - c) in neg_diag:
                continue # This position is attacked, try next column

            # Place queen (make a choice)
            cols.add(c)
            pos_diag.add(r + c)
            neg_diag.add(r - c)
            board[r][c] = "Q"

            # Recurse: Try to place queen in the next row
            backtrack(r + 1)

            # Backtrack (undo the choice): Remove queen
            cols.remove(c)
            pos_diag.remove(r + c)
            neg_diag.remove(r - c)
            board[r][c] = "."

    backtrack(0) # Start placing queens from row 0
    return solutions

def print_solutions(solutions: list[list[str]]):
    """Prints the found N-Queens solutions in a readable format."""
    if not solutions:
        print("No solutions found.")
        return

    print(f"Found {len(solutions)} solution(s):\n")
    for i, solution in enumerate(solutions):
        print(f"--- Solution {i + 1} ---")
        for row in solution:
            print(row)
        print("-" * (len(solution[0]) + 8) + "\n") # Adjust line length for readability

if __name__ == "__main__":
    n_value = 4 # Example for N=4
    print(f"Solving N-Queens for N = {n_value}")
    n_queens_solutions = solve_n_queens(n_value)
    print_solutions(n_queens_solutions)

    # Example for N=8 (might take a bit longer)
    # n_value_large = 8
    # print(f"\nSolving N-Queens for N = {n_value_large}")
    # n_queens_solutions_large = solve_n_queens(n_value_large)
    # print_solutions(n_queens_solutions_large)

Solving N-Queens for N = 4
Found 2 solution(s):

--- Solution 1 ---
.Q..
...Q
Q...
..Q.
----------

--- Solution 2 ---
..Q.
Q...
...Q
.Q..
----------

Understanding the N-Queens Backtracking:

backtrack(r): This function attempts to place a queen in row r.
Base Case: If r == n, it means we have successfully placed queens in all n rows. A valid configuration is found, so we record it and return.
Iteration for c in range(n): For the current row r, we try every possible column c.
if c in cols or (r + c) in pos_diag or (r - c) in neg_diag:: This is the pruning step. Before making a choice, we check if placing a queen at (r, c) would conflict with any previously placed queens. If it conflicts, we continue to the next column, effectively abandoning this invalid path.
Place Queen (Make a Choice): If (r, c) is safe, we “place” the queen by:
- Adding c to cols (this column is now occupied).
- Adding r + c to pos_diag (this positive diagonal is occupied).
- Adding r - c to neg_diag (this negative diagonal is occupied).
- Updating board[r][c] to “Q”.
backtrack(r + 1) (Recurse): We then recursively call backtrack for the next row (r + 1), aiming to place the next queen.
Backtrack (Undo the Choice): After the recursive call returns (meaning all possibilities from (r, c) were explored, or a solution was found and returned), we “undo” our choice:
- Remove c, r + c, and r - c from their respective sets.
- Set board[r][c] back to “.”. This crucial step allows the algorithm to explore other columns in the current row r without interference from the queen just placed. It’s like saying, “Okay, that path didn’t work (or we found all solutions down that path), so let’s un-make that decision and try another one.”

Example 2: Generating All Permutations of a List

Generating permutations is another classic use case for backtracking. Given a list of distinct numbers, we want to return all possible permutations.

Strategy:

Start with an empty permutation.
At each step, iterate through the remaining available numbers.
Pick an available number, add it to the current permutation.
Recursively call to build the rest of the permutation.
After the recursive call, “un-pick” the number to allow other numbers to be chosen at that position (backtrack).

# permutations.py

def generate_permutations(nums: list[int]) -> list[list[int]]:
    """
    Generates all permutations of a list of distinct integers using backtracking.
    """
    solutions = []
    n = len(nums)
    current_permutation = []
    used = [False] * n # To keep track of which numbers have been used

    def backtrack():
        # Base case: If the current permutation has 'n' elements, it's complete
        if len(current_permutation) == n:
            solutions.append(list(current_permutation)) # Add a copy
            return

        # Explore choices
        for i in range(n):
            if not used[i]: # If the number at index i hasn't been used yet
                # Make a choice
                used[i] = True
                current_permutation.append(nums[i])

                # Recurse
                backtrack()

                # Backtrack (undo the choice)
                current_permutation.pop()
                used[i] = False

    backtrack() # Start the process
    return solutions

if __name__ == "__main__":
    test_nums1 = [1, 2, 3]
    print(f"Permutations of {test_nums1}:")
    perms1 = generate_permutations(test_nums1)
    for p in perms1:
        print(p)
    print(f"Total permutations: {len(perms1)}\n")

    test_nums2 = ["A", "B"]
    print(f"Permutations of {test_nums2}:")
    perms2 = generate_permutations(test_nums2)
    for p in perms2:
        print(p)
    print(f"Total permutations: {len(perms2)}\n")

Permutations of [1, 2, 3]:
[1, 2, 3]
[1, 3, 2]
[2, 1, 3]
[2, 3, 1]
[3, 1, 2]
[3, 2, 1]
Total permutations: 6

Permutations of ['A', 'B']:
['A', 'B']
['B', 'A']
Total permutations: 2

Understanding the Permutations Backtracking:

backtrack(): This function tries to extend the current_permutation.
Base Case: If len(current_permutation) == n, a full permutation has been built, so it’s added to solutions.
Iteration for i in range(n): We loop through all available numbers (checked by if not used[i]).
Make a Choice: If nums[i] hasn’t been used, we mark it as used[i] = True and add nums[i] to current_permutation.
Recurse: We then call backtrack() again to fill the next position in the permutation.
Backtrack (Undo the Choice): After the recursive call returns, we “un-make” the choice by current_permutation.pop() and used[i] = False. This frees up nums[i] to be used in other permutations at different positions, or for another choice at the current level of recursion.

Recursion vs. Iteration: A Quick Comparison

While many recursive problems can be solved iteratively (using loops and explicit stacks/queues), there are trade-offs:

Feature	Recursion	Iteration (Loops)
Readability	Often more concise and natural for problems with self-similar substructures (e.g., trees).	Can be more verbose for complex problems, but clearer flow for sequential tasks.
Call Stack	Uses the implicit call stack. Deep recursion can lead to `Stack Overflow` errors.	Uses a constant amount of stack space (excluding data structures like queues/stacks you might explicitly manage).
Performance	Can be slower due to function call overhead. May recompute values (Fibonacci) without memoization.	Generally faster as it avoids function call overhead. More direct control over execution.
Memory	Each recursive call adds a new stack frame.	Typically uses less memory, especially for deep problems.

Note: Some languages (like Scheme, Scala, C++ in certain cases) offer Tail Call Optimization (TCO), where recursive calls that are the very last operation in a function can be optimized to not consume additional stack space. Python, however, does not perform TCO, so be mindful of the recursion depth limit for very deep recursive calls.

Key Takeaways and When to Use Them

Recursion:
- When to use: Problems that can be broken down into smaller, self-similar subproblems. Ideal for tree/graph traversals, mathematical functions defined recursively (e.g., factorial, Fibonacci, if memoized).
- Think: “What is the simplest version of this problem (base case)? How can I define the problem in terms of a smaller version of itself (recursive step)?”
- Pitfalls: Missing base case (infinite recursion), redundant computation (fix with memoization/DP), stack overflow (too many recursive calls without TCO).
Backtracking:
- When to use: Problems that involve finding all (or some) solutions by exploring a set of choices. Typically involves combinatorial search, constraint satisfaction, or pathfinding where choices are made sequentially and might need to be undone.
- Think: “What choices can I make at this step? If I make this choice, what’s the next step? If this path fails, how do I undo my choice and try another one?”
- Core elements:
  - Choice: Identify the options available at each step.
  - Constraints: Rules that validate a choice or prune invalid paths.
  - Goal: The condition that defines a complete, valid solution.
  - Un-choice (Backtrack): Crucially, reverting the state after exploring a path.

Recursion is a technique, and backtracking is an algorithmic strategy often implemented using recursion. Master these, and you’ll unlock elegant solutions to a wide array of complex problems. The best way to truly grasp them is to practice – draw out the call stack, trace the execution of backtracking, and write your own code. Good luck!

How the Holographic Principle Inspires Compression Models

Jun 17, 2025

What Are Embeddings? From Theory to Real Use

Jun 17, 2025

Y Combinator (Not the Startup One) What It Actually Does in Code

Jun 17, 2025

India’s AI Talent 30% of Global LLM Researchers by 2026 – Your Automation Goldmine

Jun 27, 2025

India’s AI Policy $1.2B to Boost LLM Development by 2026

Jun 27, 2025

Feynman Diagrams and Data Flow in Compilation A Visual Analogy

Jun 17, 2025