Building Tools and Community to Make Pangenomes Accessible
Project Number1U01HG013760-01
Contact PI/Project LeaderGARRISON, ERIK
Awardee OrganizationUNIVERSITY OF TENNESSEE HEALTH SCI CTR
Description
Abstract Text
PROJECT ABSTRACT
The human pangenome encompasses global genetic diversity missing from any single reference genome. This
project will develop sequence alignment, population genetic, and visualization tools that enable new ways to
access and understand the pangenome. These innovations aim to catalyze discoveries by making expansive
pangenome resources more intuitive for diverse genomics communities. We will create sequence mapping and
alignment methods that allow rapid search and comparison to complement graph-based pangenome approaches.
Using succinct data structures, our techniques will enable interpretation of pangenomes while also developing
sublinear indexes of the pangenome for sequence search based on exploring all-vs-all genome homology. For
population genetics, we will implement analyses for GWAS, heritability, selection, and phylogeny directly on a
novel encoding of pangenome graphs based on node coverage, which captures allele zygosity across the entire
pangenome. This representation naturally captures all variation including everything from SNPs to structural vari-
ants to centromeric haplotypes. To aid exploration, we will build interactive visualizations using GPU-accelerated
algorithms to enable real-time interaction with massive graphs which provide a human-scale interface to the
overwhelming diversity of data present in pangenomes. Throughout this work, we will engage diverse genomics
communities via open source software, hands-on workshops, conferences, and integration with public databases.
We will target bioinformaticians, geneticists, clinicians, and evolutionary biologists to refine methodology. Broader
pangenome adoption can help overcome reference bias, empower more equitable genomics, and increase un-
derstanding of human genetic diversity to advance health. Our user-focused pangenome tools seek to make
these expansive resources tangible and integrated for the genomics community to catalyze new findings.
Public Health Relevance Statement
PROJECT NARRATIVE
The human pangenome encompasses global genetic diversity missing from the reference genome. This project
will develop sequence alignment, population genetics, and data visualization tools that enable new ways to access
and understand the pangenome. These innovations aim to catalyze discoveries by making expansive pangenome
resources more intuitive for diverse genomics communities.
No Sub Projects information available for 1U01HG013760-01
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 1U01HG013760-01
Patents
No Patents information available for 1U01HG013760-01
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 1U01HG013760-01
Clinical Studies
No Clinical Studies information available for 1U01HG013760-01
News and More
Related News Releases
No news release information available for 1U01HG013760-01
History
No Historical information available for 1U01HG013760-01
Similar Projects
No Similar Projects information available for 1U01HG013760-01