Self-diffusion coefficients, D*, are routinely estimated from molecular dynamics simulations by fitting a linear model to the observed mean-squared displacements (MSDs) of mobile species. MSDs derived from simulation suffer from statistical noise, which introduces uncertainty in the resulting estimate of D*. An optimal scheme for estimating D* will minimise this uncertainty, i.e., will have high statistical efficiency, and will give an accurate estimate of the uncertainty itself. We present a scheme for estimating D* from a single simulation trajectory with high statistical efficiency and accurately estimating the uncertainty in the predicted value. The statistical distribution of MSDs observable from a given simulation is modelled as a multivariate normal distribution using an analytical covariance matrix for an equivalent system of freely diffusing particles, which we parameterise from the available simulation data. We then perform Bayesian regression to sample the distribution of linear models that are compatible with this model multivariate normal distribution, to obtain a statistically efficient estimate of D* and an accurate estimate of the associated statistical uncertainty.
Andrew R. McCluskey*,
Samuel W. Coles
and
Benjamin J. Morgan†
*[email protected]/†[email protected]
This is the electronic supplementary information (ESI) associated with the publication "Estimation of diffusive properties for in-silico materials using a Gaussian process".
This ESI uses showyourwork
to provide a completely reproducible and automated analysis, plotting, and paper generation workflow.
To run the workflow and generate the paper locally using the cached data run the following:
git clone [email protected]:arm61/msd-errors.git
cd msd-errors
pip install showyourwork
showyourwork build
Full details of the workflow can be determined from the Snakefile
and the showyourwork.yml
.