Debugging Techniques

Advanced R Package Development

Outline

Reading error messages and traceback()
Print debugging
browser(): interactive debugging
Breakpoints, debug(), and debugonce()
rlang::last_trace()
Strategy summary

Reading Error Messages

Read the message first

When something goes wrong, R prints an error. The first step is always to read it carefully.

load_all()

biomass_index(cpue = "ten", area_swept = 5)

The error tells you what went wrong, but not where in the call chain.

traceback() shows the sequence of function calls that led to the error:

traceback()

Read it bottom to top: your entry point is at the bottom, the error location is at the top.

Deeper call chains

biomass_index(area_swept = 5, catch = "bad", effort = 10)

traceback()

3: validate_numeric_inputs(catch = catch, effort = effort) at cpue.R#5
2: cpue(catch, effort, ...) at biomass.R#6
1: biomass_index(area_swept = 5, catch = "bad", effort = 10)

Frame 1: your call.
Frame 2: cpue().
Frame 3: validate_numeric_inputs() — where the error was raised.

Print Debugging

Quick variable inspection

The quick-and-dirty approach: add a print() or message() to see what is happening at a given point.

biomass_index <- function(cpue = NULL, area_swept, catch = NULL, effort = NULL, ...) {
  rlang::check_dots_used()

  if (is.null(cpue) && (!is.null(catch) && !is.null(effort))) {
    cpue <- cpue(catch, effort, ...)
  }

  validate_numeric_inputs(cpue = cpue, area_swept = area_swept)

  print(paste("cpue value:", cpue)) # <-- temporary debugging output

  cpue * area_swept
}

load_all()

biomass_index(area_swept = 5, catch = 100, effort = 10)

Once you have found the problem, delete the print() statement.

When print debugging is enough

Quick checks of variable values
Confirming which branch of an if statement runs
Simple, localised problems

When you need more — complex logic, many intermediate values, full environment inspection — reach for browser().

browser()

Basic usage

browser() pauses execution and drops you into an interactive debugger:

cpue <- function(
  catch,
  effort,
  gear_factor = 1,
  method = c("ratio", "log"),
  verbose = getOption("fishr.verbose", FALSE)
) {
  method <- match.arg(method)
  validate_numeric_inputs(catch = catch, effort = effort)

  browser() # <-- execution pauses here

  raw_cpue <- switch(method, ratio = catch / effort, log = log(catch / effort))
  raw_cpue * gear_factor
}

load_all()

cpue(c(100, 200, 300), c(10, 20, 30))

Browser commands

Once inside the browser you have a mini command language:

Command	Action
`n`	Next: execute the current line and step to the next
`s`	Step into: if the current line calls a function, enter it
`c`	Continue: resume until the next `browser()` or end
`Q`	Quit: stop execution immediately
`where`	Show the call stack (like `traceback()` while paused)

You can also run any R expression: ls(), str(catch), etc.

Conditional browser

You don’t always want to pause on every call. Stop only when something is wrong:

cpue <- function(catch, effort, gear_factor = 1, ...) {
  if (any(effort == 0)) browser() # only pause when there is a problem

  # ... rest of function
}

Run the function as many times as needed — it only stops when the suspicious condition is met.

Remove before committing

Warning

browser() calls should never be committed to your package.

R CMD check (via check()) will warn about leftover browser() calls, but build the habit of removing them immediately after debugging.

Breakpoints, debug(), debugonce()

Visual breakpoints in RStudio

Click in the editor gutter (the grey margin to the left of line numbers) to set a breakpoint (red dot).

Same effect as browser() — but does not modify your source file.

After setting a breakpoint, run load_all() to activate it.

debug() and debugonce()

For functions you can’t easily edit (other packages, or when you don’t want to touch source):

# Enter browser on EVERY call to cpue
debug(cpue)
cpue(100, 10)  # pauses
cpue(200, 20)  # pauses again

undebug(cpue)  # turn it off

# Enter browser on the NEXT call only
debugonce(cpue)
cpue(100, 10)  # pauses
cpue(200, 20)  # runs normally, auto-cleared

Breakpoints: your own package code during development debugonce(): quick one-off inspection of any function debug(): step through a function repeatedly (remember undebug())

options(error = recover)

Post-mortem debugging — inspect state after an error:

options(error = recover)

biomass_index(cpue = "ten", area_swept = 5)

R prints the call stack as a numbered list and prompts you to pick a frame:

Enter a frame number, or 0 to exit

1: biomass_index(cpue = "ten", area_swept = 5)
2: validate_numeric_inputs(cpue = cpue, area_swept = area_swept)

Selection:

Type a frame number to browse into it, inspect variables, then 0 to exit.

options(error = NULL)  # reset to default when done

rlang::last_trace()

Cleaner backtraces with rlang

When working with tidyverse or rlang-based packages, rlang::last_trace() gives a cleaner backtrace than traceback():

# Standard R: flat list, can be noisy
traceback()

# rlang: tree-structured, hides internal frames
rlang::last_trace()

rlang::last_trace() organises the call stack as a tree and collapses internal package machinery, making it easier to see the path through your code.

Inspecting the error object

rlang::last_error()

Returns the error object itself — message, call, and (for rlang errors) the backtrace attached to the error.

Useful when working with tidyverse internals or rlang-style errors. Not essential for everyday debugging.

Debugging Strategy

A practical order of operations

Read the error message — often sufficient on its own
traceback() — find where in the call chain the error occurred
Print debugging — quick variable inspection for simple problems
browser() — interactive inspection for complex logic
debugonce() / debug() — step through functions from other packages
options(error = recover) — post-mortem exploration of the full call stack

Your Turn

Exercise

There is a bug in this standardize_effort() function. It is supposed to convert effort from hours to days (dividing by 24), then calculate CPUE.

Add this to R/utils.R:

standardize_effort <- function(catch, effort_hours) {
  validate_numeric_inputs(catch = catch, effort_hours = effort_hours)

  effort_days <- effort_hours * 24

  cpue(catch = catch, effort = effort_days)
}

load_all()

# Effort is 48 hours = 2 days, so CPUE should be 100/2 = 50
standardize_effort(catch = 100, effort_hours = 48)

The result should be 50 but returns something much smaller. Use the debugging tools you just learned to find and fix it.

Solution

The bug is on the conversion line — it multiplies by 24 instead of dividing:

# Bug
effort_days <- effort_hours * 24

# Fix
effort_days <- effort_hours / 24

You could find this by:

Adding browser() before the cpue() call and checking effort_days
Adding message("effort_days: ", effort_days) to see the intermediate value
Stepping through with debugonce(standardize_effort) and inspecting each line

Debugging Techniques

Outline

Reading Error Messages

Read the message first

Deeper call chains

Print Debugging

Quick variable inspection

When print debugging is enough

browser()

Basic usage

Browser commands

RStudio debug toolbar

Conditional browser

Remove before committing

Breakpoints, debug(), debugonce()

Visual breakpoints in RStudio

debug() and debugonce()

options(error = recover)

rlang::last_trace()

Cleaner backtraces with rlang

Inspecting the error object

Debugging Strategy

A practical order of operations

Your Turn

Exercise

Solution