presents
Marc Suchard
Biomathematics Department
UCLA
Title: Ridiculously Parallel Statistical Inference
Abstract
Massive numerical integration plagues the statistical inference ofpartially observed stochastic processes. An important biologicalexample entertains partially observed continuous-time Markov chains(CTMCs) to model molecular sequence evolution. Joint inference ofphylogenetic trees and codon-based substitution models of sequenceevolution remains computationally impractical. Parallelizing datalikelihood calculations is an obvious strategy; however, across acluster-computer, this scales with the total number of processingcores, incurring considerable cost to achieve reasonable run-time. To solve this problem, I describe many-core computing algorithms thatharness inexpensive graphics processing units (GPUs) for calculationof the likelihood under CTMC models of evolution. High-end GPUscontaining hundreds of cores and are low-cost. These novel algorithmsare particularly efficient for large state-spaces, including codonmodels, and large data sets, such as full genome alignments where wedemonstrate up to 150-fold speed-up. I conclude with a discussion of thefuture of many-core computing in statistics and touch upon recentexperiences with massively large and high-dimensional mixture models.
Friday, April 16
3:00pm - 4:00pm
2203 SAS Hall