This tutorial was generated from an IPython notebook that can be downloaded here.
exoplanet comes bundled with a few utilities that can make it easier to use and debug PyMC3 models for fitting exoplanet data. This tutorial briefly describes these features and their use.
The main extra is the
that extends the PyMC3 sampling procedure to include support for
learning off-diagonal elements of the mass matrix. This is very
important for any problems where there are covariances between the
parameters (this is true for pretty much all exoplanet models). A
thorough discussion of this can be found elsewhere
online, but here is a
simple demo where we sample a covariant Gaussian using
First, we generate a random positive definite covariance matrix for the Gaussian:
import numpy as np ndim = 5 np.random.seed(42) L = np.random.randn(ndim, ndim) L[np.diag_indices_from(L)] = 0.1 * np.exp(L[np.diag_indices_from(L)]) L[np.triu_indices_from(L, 1)] = 0.0 cov = np.dot(L, L.T)
And then we can sample this using PyMC3 and
import pymc3 as pm import exoplanet as xo with pm.Model() as model: pm.MvNormal("x", mu=np.zeros(ndim), chol=L, shape=(ndim,)) trace = pm.sample( tune=2000, draws=2000, chains=2, cores=2, step=xo.get_dense_nuts_step() )
Multiprocess sampling (2 chains in 2 jobs) NUTS: [x] Sampling 2 chains, 0 divergences: 100%|██████████| 8000/8000 [00:12<00:00, 634.12draws/s]
This is a little more verbose than the standard use of PyMC3, but the
performance is several orders of magnitude better than you would get
without the mass matrix tuning. As you can see from the
pymc3.summary, the autocorrelation time of this chain is about 1 as
we would expect for a simple problem like this.
I find that when I’m debugging a PyMC3 model, I often want to inspect
the value of some part of the model for a given set of parameters. As
far as I can tell, there isn’t a simple way to do this in PyMC3, so
exoplanet comes with a hack for doing this:
exoplanet.eval_in_model(). This function handles the mapping
between named PyMC3 variables and the input required by the Theano
function that can evaluate the requested variable or tensor.
As a demo, let’s say that we’re fitting a parabola to some data:
np.random.seed(42) x = np.sort(np.random.uniform(-1, 1, 50)) with pm.Model() as model: logs = pm.Normal("logs", mu=-3.0, sd=1.0) a0 = pm.Normal("a0") a1 = pm.Normal("a1") a2 = pm.Normal("a2") mod = a0 + a1 * x + a2 * x ** 2 # Sample from the prior prior_sample = pm.sample_prior_predictive(samples=1) y = xo.eval_in_model(mod, prior_sample) y += np.exp(prior_sample["logs"]) * np.random.randn(len(y)) # Add the likelihood pm.Normal("obs", mu=mod, sd=pm.math.exp(logs), observed=y) # Fit the data map_soln = xo.optimize() trace = pm.sample(cores=2)
optimizing logp for variables: [a2, a1, a0, logs] 25it [00:00, 587.54it/s, logp=4.272312e+01] message: Optimization terminated successfully. logp: -180.51036900636572 -> 42.72311721910288 Auto-assigning NUTS sampler... Initializing NUTS using jitter+adapt_diag... Multiprocess sampling (2 chains in 2 jobs) NUTS: [a2, a1, a0, logs] Sampling 2 chains, 0 divergences: 100%|██████████| 2000/2000 [00:02<00:00, 869.89draws/s] The acceptance probability does not match the target. It is 0.8913461853394434, but should be close to 0.8. Try to increase the number of tuning steps.
After running the fit, it might be interesting to look at the
predictions of the model. We could have added a
node for eveything, but that can end up taking up a lot of memory and
sometimes its useful to be able to experiement with different outputs.
exoplanet.utils.eval_in_model() we can, for example,
evaluate the maximum a posteriori (MAP) model prediction on a fine grid:
import matplotlib.pyplot as plt x_grid = np.linspace(-1.1, 1.1, 5000) with model: pred = xo.eval_in_model(a0 + a1 * x_grid + a2 * x_grid ** 2, map_soln) plt.plot(x, y, ".k", label="data") plt.plot(x_grid, pred, label="map") plt.legend(fontsize=12) plt.xlabel("x") plt.ylabel("y") _ = plt.xlim(-1.1, 1.1)
We can also combine this with
to plot this prediction for a set of samples in the trace.
samples = np.empty((50, len(x_grid))) with model: y_grid = a0 + a1 * x_grid + a2 * x_grid ** 2 for i, sample in enumerate(xo.get_samples_from_trace(trace, size=50)): samples[i] = xo.eval_in_model(y_grid, sample) plt.plot(x, y, ".k", label="data") plt.plot(x_grid, pred, label="map") plt.plot(x_grid, samples, color="C1", alpha=0.1, label="posterior") plt.plot(x_grid, samples[1:].T, color="C1", alpha=0.1) plt.legend(fontsize=12) plt.xlabel("x") plt.ylabel("y") _ = plt.xlim(-1.1, 1.1)