odin

`odin` implements a high-level language for describing and implementing ordinary differential equations in R. It provides a “domain specific language” (DSL) which looks like R but is compiled directly to C. The actual solution of the differential equations is done with the `deSolve` package, giving access to the excellent Livermore solvers (`lsoda`, `lsode`, etc), or with `dde` for use with delay differential equations.

• The DSL is declarative reflecting the mathematical nature of the equations (typically ordinary differential equations are simple mathematical relationships, so the order should not matter).
• It includes support for equations that involve vectors, matrices and higher dimensional arrays (up to 8!), including a high-level array indexing notation that removes the need for explicit looping.
• Delay differential equations are supported, including when the delayed quantities are arbitrarily complicated expressions of variables.
• Interpolation functions can be used to include time-varying quantities into the model (piecewise constant, linear and spline interpolation is supported, using `cinterpolate`.
• The equations are analysed before compilation so that parts that do not depend on time are not included in the final derivative calculations.
• Supports user-supplied parameters for any part of the system.
• Supports a large number of mathematical functions

In addition, the same machinery can be used to generate discrete-time models that proceed over a set of steps (rather than through continuous time). These may be stochastic and make use of any of R’s random number functions.

`odin` works using code generation; the nice thing about this approach is that it never gets bored. So if the generated code has lots of tedious repetitive bits, they’re at least likely to be correct (compared with implementing yourself).

Background

The “deSolve” package for R is the de-facto way of solving differential equations in R; it provides excellent solvers and has remained stable for over a decade. However, users must implement equations in R and suffer a large speed cost, or implement their equations in C which is (depending on the complexity of the system) either routine and a bit boring, or complicated and error prone. This translation can be especially complicated with delay differential equations, or with models where the variables are more naturally stored as variable sized arrays.

Apparently not many people know that `deSolve` can use target functions written in C rather than just in R. This is described in detail in the excellent “compiledCode” vignette (`vignette("compiledCode")` or online.

While the `deSolve` authors are bearish on the benefits of this, I have often seen performance improvements of over 100x. Where an ODE is being used in application where it is called repeatedly (e.g., an optimisation or MCMC) the cost of rewriting the system pays itself back.

For simple systems the rewriting is essentially mechanical. The lorenz attractor could be implemented in R as:

``````lorenz <- function(t, y, parms) {
sigma <- parms[1]
R <- parms[2]
b <- parms[3]
y1 <- y[1]
y2 <- y[2]
y3 <- y[3]
list(c(sigma * (y2 - y1),
R * y1 - y2 - y1 * y3,
-b * y3 + y1 * y2))
}``````

and in C as

``````void initmod(void (* odeparms)(int *, double *)) {
int N=3;
odeparms(&N, parms);
}
void lorenz(int *n, double *t, double *y, double *dydt, double *yout, int *ip) {
double sigma = parms[0];
double R = parms[1];
double b = parms[2];
double y1 = y[0];
double y2 = y[2];
double y3 = y[3];
dydt[0] = sigma * (y2 - y1);
dydt[1] = R * y1 - y2 - y1 * y3;
dydt[2] = -b * y3 + y1 * y2;
}``````

The connection between the two languages should be fairly obvious. As systems get more complicated much of the difficulty of writing the systems in C becomes the tedium of book keeping as parameters and state vectors are unpacked, rather than any deep programming challenges. Modifying large systems is a particular challenge as technical debt can accrue quickly.

The core job of `odin` is to simplify this transition so that models can be both developed and solved rapidly.

Example

The Lorenz attractor above can be implemented as:

``````lorenz <- odin::odin({
## Derivatives
deriv(y1) <- sigma * (y2 - y1)
deriv(y2) <- R * y1 - y2 - y1 * y3
deriv(y3) <- -b * y3 + y1 * y2

## Initial conditions
initial(y1) <- 10.0
initial(y2) <- 1.0
initial(y3) <- 1.0

## parameters
sigma <- 10.0
R     <- 28.0
b     <-  8.0 / 3.0
})``````

The connection to the R and C versions in the section above should be fairly clear. The code above is never actually evaluated though; instead it is parsed and used to build up C code for the model.

Note that this includes initial conditions; all odin models include specifications for initial conditions because the ordering of the variables is arbitrary and may be re-ordered.

This generates an object that can be used to integrate the set of differential equations, by default starting at the initial conditions specified above (though custom initial conditions can be given). The equations are translated into C, compiled, loaded, and bundled into an object. `lorenz` here is a function that generates an instance of the model.

``````mod <- lorenz()
t <- seq(0, 100, length.out = 50000)
y <- mod\$run(t)``````

For more complicated examples, check out an age structured SIR model, and for more details see the vignette

Limitations

Writing this has given me a much greater appreciation of the difficulties of writing compiler error messages.

This does not attempt to generally translate R into C (though very simple expressions are handled) but only a small subset that follows the stereotyped way that R+C ODE models tend to be written. It tries to do things like minimise the number of memory allocations while preventing leaks. The generated code is designed to be straightforward to read, leaving any really funky optimisation to the compiler.

Because this relies on code generation, and the approach is partly textual, some oddities will appear in the generated code (things like `n + 0`). Over time I’ll remove the most egregious of these. It’s probable that there will be some unused variables, and unused elements in the parameters struct.

Prior work

ODEs seem particularly suitable for code generation, perhaps because of the relative simplicity of the code. As such, there is a lot of prior work in this area. Many of these tools are heavily tailored to suit a particular domain.

In R:

• `RxODE` - focussed on pharmacokinetic models, but suitable in the same domain as many odin models. Does not include support for delay equations, automatic arrays or discrete/stochastic systems and uses it’s own solvers rather than interfacing with existing ones. Notably it also uses R as the host language for the DSL rather than requiring the user to write code in strings or in a custom language.
• `rodeo` focussed on biochemical reactions based around the Petersen matrix. Creates code for use with `deSolve`
• `cOde` creates code for use with `deSolve` and `bvpSolve`. Models are entered as vector of strings which resembles C or R code. Automatic generation of Jacobian matrices is supported.
• `mrgsolve` is focussed on models in quantitative pharmacology and systems biology. It bundles its own solvers, and uses it’s own `PKMODEL` language (example).

In other languages:

Installation

Install odin from CRAN with

``install.packages("odin")``

Alternatively, you can install a potentially more recent version of odin from the `mrc-ide` drat repository

``````# install.packages("drat") # -- if you don't have drat installed
The function `odin::can_compile()` will check if it is able to compile things, but by the time you install the package that will probably have been satisfied.
``devtools::install_github("mrc-ide/odin", upgrade = FALSE)``