ABSTRACT
Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, USA
Paul O. Lewis
Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut, USA
David L. Swofford
Department of Biology and Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, USA
David Bryant
Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand
CONTENTS
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 5.2 The generalized stepping-stone (GSS) method . . . . . . . . . . . . . . . . . . 96 5.3 Reference distribution for tree topology . . . . . . . . . . . . . . . . . . . . . . . . . 98
5.3.1 Tree topology reference distribution . . . . . . . . . . . . . . . . . . . . . 98 5.3.1.1 Tree simulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 5.3.1.2 Tree topology reference distribution . . . . . . . . . 100
5.3.2 Edge length reference distribution . . . . . . . . . . . . . . . . . . . . . . 103 5.3.3 Comparison with CCD methods . . . . . . . . . . . . . . . . . . . . . . . . 104
5.4 Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 5.4.1 Model details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 5.4.2 Brute-force approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106 5.4.3 GSS performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
5.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 5.6 Funding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Algorithms, and
The marginal likelihood is central to Bayesian model selection. It is the normalizing constant in Bayes’ formula, and the Bayes factor used to compare two models is a ratio of marginal likelihoods. The marginal likelihood is defined as the expected value of the likelihood with respect to the prior. Methods for accurately estimating the marginal likelihood in phylogenetics were developed only recently (Lartillot and Philippe, 2006a; Xie et al., 2011; Fan et al., 2011; Arima and Tardella, 2012). Two of these methods-thermodynamic integration (TI; Lartillot and Philippe, 2006a) and the stepping-stone method (SS; Xie et al., 2011)—allow marginal likelihood estimation when the tree topology varies, but the most efficient methods to date-generalized stepping-stone (GSS; Fan et al., 2011) and the inflated density ratio method (IDR; Arima and Tardella, 2012)—have thus far remained restricted to estimating marginal likelihoods for a fixed tree topology. This chapter is concerned with updating GSS to allow variable tree topology, and the chapter by Wu et al. (Chapter 6) is concerned with updating the IDR method to allow variable tree topology.