Big Data Predictive Phylogenetics with Bayesian Learning
Project Number5K25AI153816-05
Contact PI/Project LeaderHOLBROOK, ANDREW JAMES
Awardee OrganizationUNIVERSITY OF CALIFORNIA LOS ANGELES
Description
Abstract Text
Big Data Predictive Phylogenetics with Bayesian Learning
Abstract
Andrew Holbrook, Ph.D., is a Bayesian statistician with a broad background in applied, theoretical and compu-
tational data science. His proposed research Big Data Predictive Phylogenetics with Bayesian Learning tackles
viral outbreak forecasting by combining Bayesian phylogenetic modeling with flexible, `self-exciting' stochastic
process models. The development and publication of open-source, high-performance computing software for his
models will facilitate fast epidemiological field response in a big data setting. Dr. Holbrook will apply his method-
ology to the reconstruction of the 2015-2016 Zika virus epidemic in the Americas, focusing on identifying key
geographical routes of transmission and phylogenetic clades with enhanced infectiousness.
Candidate: Dr. Holbrook is Postdoctoral Scholar at the UCLA Department of Human Genetics. He earned his
Ph.D. in Statistics from the Department of Statistics at UC Irvine, during which time he completed his dissertation
Geometric Bayes, an investigation into Bayesian modeling and computing on abstract mathematical spaces, and
simultaneously participated in scientific collaborations at the UC Irvine Alzheimer's Disease Research Center.
The proposed career development plan will establish Dr. Holbrook as an independent leader in data intensive
viral epidemiology by 1) facilitating coursework to build biological domain knowledge, 2) affording Dr. Holbrook
the opportunity to lead his own project while remaining under the expert oversight of UCLA Prof. Marc Suchard,
M.D., Ph.D., and 3) allowing Dr. Holbrook to continue his focus on quantitative viral epidemiology once he has
moved to a faculty commitment.
Mentors: During the first three years of the award period, Dr. Holbrook will work closely with Prof. Suchard,
continuing their current schedule of weekly meetings. Prof. Suchard is a leading expert in both Bayesian phylo-
genetics and high-performance statistical computing; and with his medical background, Prof. Suchard will advise
Dr. Holbrook in his expansion of domain knowledge in viral epidemiology. As secondary mentor, Prof. KristianAndersen, Ph.D., of the Scripps Institute will advise Dr. Holbrook in the impactful application of his statistical
and computational methodologies to the 2015-2016 Zika virus epidemic. Dr. Holbrook and Profs. Suchard and
Andersen will maintain their collaborations after the postdoctoral period.
Research: Bayesian phylogenetics successfully reconstructs evolutionary histories but fails to predict viral
spread. Self-exciting point processes are devoid of biological insight and fail to account for geographic networks
of diffusion. Aim 1 addresses deficiencies in these two complementary viral epidemiological modeling techniques
by innovating a combined model where the phylogenetic and self-excitatory components support each other.
Aim 2 makes widespread adoption a reality by publishing open-source, massively parallel computing software
suitable for big data analysis. Aim 3 reconstructs the 2015-2016 Zika epidemic, learns key geographical routes
of transmission and identifies phylogenetic clades with enhanced infectiousness.
Public Health Relevance Statement
Project Narrative
Tracking and predicting viral outbreaks remains an open epidemiological problem with deadly consequences.
Dr. Holbrook will attack the problem with his Bayesian phylogenetic Hawkes processes, a class of models tailored
to simultaneously reconstruct evolutionary histories and predict viral diffusion dynamics. With the mentorship
of Profs. Marc Suchard (primary) and KristianAndersen (secondary), Dr. Holbrook will develop open-source,
high-performance computing software and apply his statistical computing methodology to the analysis of the
2015-2016 Zika virus epidemic of the Americas, learning key routes of transmission and identifying phylogenetic
clades with enhanced infectiousness.
National Institute of Allergy and Infectious Diseases
CFDA Code
855
DUNS Number
092530369
UEI
RN64EPNH8JC6
Project Start Date
01-June-2020
Project End Date
31-May-2025
Budget Start Date
01-June-2024
Budget End Date
31-May-2025
Project Funding Information for 2024
Total Funding
$106,467
Direct Costs
$98,581
Indirect Costs
$7,886
Year
Funding IC
FY Total Cost by IC
2024
National Institute of Allergy and Infectious Diseases
$106,467
Year
Funding IC
FY Total Cost by IC
Sub Projects
No Sub Projects information available for 5K25AI153816-05
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 5K25AI153816-05
Patents
No Patents information available for 5K25AI153816-05
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 5K25AI153816-05
Clinical Studies
No Clinical Studies information available for 5K25AI153816-05
News and More
Related News Releases
No news release information available for 5K25AI153816-05
History
No Historical information available for 5K25AI153816-05
Similar Projects
No Similar Projects information available for 5K25AI153816-05