I was very glad that I was able to attend my PhD graduation ceremony in Brisbane last week. My extended family were there to cheer me on, as well as both of my supervisors and my dear friend, Kal. Out of 28 doctorates there were a dozen from maths, including four statisticians. Unfortunately none of us snagged an Outstanding Doctoral Thesis Award, but it was nice to be nominated!
Once again I would like to thank my supervisors Kerrie Mengersen & Fiona Harden for their mentorship, as well as my co-authors Cathy Hargrave, Chris Drovandi, Christian Robert, Tony Pettitt, Tim Deegan & Mike Poulsen for their contributions to my research.
I will be presenting a talk at the ACEMS International Workshop on Monte Carlo Methods for Spatial Stochastic Systems (MCMSS) at the University of Queensland, Brisbane, July 21-23 (abstract below). Other speakers include Gareth Roberts, Adrian Baddeley, Robert Kohn & Kevin Burrage. The workshop programme is now available online.
Moores, Pettitt & Mengersen (2015) “Scalable Bayesian Inference for the Inverse Temperature of a Hidden Potts Model” arXiv:1503.08066 [stat.CO]
Moores, Drovandi, Mengersen & Robert (2015) “Pre-processing for approximate Bayesian computation in image analysis”
Statistics & Computing 25(1): 23-33. DOI: 10.1007/s11222-014-9525-6
My R package, bayesImageS version 0.1-21, is now available online. It uses a hidden Potts model with additive Gaussian noise for image segmentation of 2D and 3D datasets. The latent labels z can be simulated using chequerboard updating or the Swendsen-Wang algorithm. Several methods for full Bayesian inference with intractable likelihoods are supported, including pseudolikelihood, the exchange algorithm, path sampling, and approximate Bayesian computation (ABC-MCMC & ABC-SMC).
The R source package is released under the GNU General Public License, version 2. Its computational engine is implemented in C++ using RcppArmadillo and OpenMP. There is zero documentation available at present, but I’m working on that…
The process for Australian doctoral candidates is different from Europe and elsewhere. Due mainly to geographic isolation, there is no “viva voce” where external examiners are invited to interrogate the candidate in person. Rather, there is a public seminar followed by a meeting with the university thesis committee. Once recommended changes to the thesis have been made, it it sent to the external examiners for review. So if you’ve recently received my thesis in your inbox, then thanks in advance!
Some of my thesis papers have recently appeared online:
“Pre-processing for approximate Bayesian computation in image analysis”
Moores, Drovandi, Mengersen & Robert (2015)
Statistics & Computing 25(1): 23-33. DOI: 10.1007/s11222-014-9525-6
“An external field prior for the hidden Potts model, with application to cone-beam computed tomography”
Moores, Hargrave, Deegan, Poulsen, Harden & Mengersen (2015)
Computational Statistics & Data Analysis 86: 27–41. DOI: 10.1016/j.csda.2014.12.001
Previously I listed all of the software that I usually install on Windows 7. Now that I’ve got an iMac with OS X Mavericks, it’s time to give an updated list:
From the App Store
Follow these instructions for installing the Command Line Tools (you don’t need a full install of Xcode on Mavericks)
From other websites
gcc and gfortran 4.9.0 from “High Performance Computing for Mac OS X”
- requires sudo to untar into /usr/local/bin
- don’t forget to edit your Makeconf using these instructions…
Problems with Gate Keeper
I’ve submitted a poster to the Network on Computational Statistics and Machine Learning (NCSML) workshop “Big Data, Big Models, it is a Big Deal” at the University of Warwick on the 1st & 2nd of September. More details are available from the workshop homepage. My abstract is as follows:
Scalable Bayesian computation for intractable likelihoods in image analysis
The availability of inexpensive, high-quality imaging has given scientists the capacity to generate more data than ever before. In medicine, some patients are scanned daily throughout their course of treatment, to monitor their progress as well as for image-guided therapies. Satellites such as Landsat and MODIS orbit the globe, regularly providing remotely-sensed imagery of the Earth’s surface. Automated methods of image analysis are vital in order to keep pace with the volumes of data that are generated in these settings. Increases in image resolution and sample depth have improved the quality of these images, but this has also resulted in a vast increase in the size of the digital representation. Many methods that were originally developed for much smaller images are infeasible for the image dimensions that are required by current applications. Thus, the scalability of automated methods to meet the needs of real world data is a major concern.
The hidden Potts model is widely applied in image analysis to segment the image pixels and label them according to their underlying classification. The inverse temperature parameter of this model governs the strength of spatial cohesion and therefore has a substantial influence over the resulting model fit. The difficulty arises from the dependence of an intractable normalising constant on the value of the inverse temperature, thus there is no closed form solution for sampling from the distribution directly. We review three computational approaches for addressing this issue, namely pseudolikelihood, path sampling, and the approximate exchange algorithm. We compare the accuracy and scalability of these methods using a simulation study.
This is joint work with Clair Alston and Kerrie Mengersen, Queensland University of Technology, Australia.
Thesis Title: Bayesian Computational Methods for Spatial Analysis of Images.
Candidate: Matthew T. Moores
Discipline: Statistical Science
Prof. Ian Turner
Prof. Tony Pettitt
Dr. Fiona Harden
Dr. Chris Drovandi
This thesis is motivated by an important applied problem in image-guided radiation therapy. The aim is to assist in interpreting cone-beam computed tomography (CT) of radiotherapy patients by labelling the image pixels according to tissue type. These medical images have poor contrast-to-noise ratio (CNR), thus external sources of information are needed for accurate segmentation. Such sources include the individualised treatment plan, which is based on a diagnostic-quality CT scan that has been manually labelled by a clinician. We also use published studies of physiological variability to derive an estimate of spatial uncertainty.
To address this problem, we adopt a hidden Potts model of the image lattice. We introduce a method for deriving and representing the spatial prior for each patient as an external field in the hidden Potts model. Tissue density estimates are derived from the planning CT and adjusted to account for differences in image modality, forming priors for the noise parameters. These priors can be updated sequentially as more images of the patient are acquired.
Scalable computational algorithms are required for Bayesian inference on 3D volumetric images of this size. We evaluate the existing methods for intractable likelihood problems, including path sampling, pseudolikelihood and the approximate exchange algorithm. We introduce a precomputation step that involves fitting a binding function between the parameters and sufficient statistic of the Potts model. Using this precomputation, we achieve two orders of magnitude improvement in the scalability of approximate Bayesian computation with sequential Monte Carlo (ABC-SMC).
Matthew completed a Bachelor of Information Technology at QUT in 1996 and a Master of Mathematical Science in 2008. Prior to commencing his PhD, he was involved with the Visible Cell project at the Institute for Molecular Bioscience, UQ. He has also worked in R&D for various international companies, including DEC, Compaq and Oracle. He was a member of the development teams for the FaceWorks facial animation software, SpeechBot multimedia search engine, AgileTV speech-enabled programme guide, and the Healthcare Transaction Base. Matthew’s research interests include computational statistics, Bayesian inference, and image analysis