Large-scale annotation-free disease correlation analysis of the iHMP
Project Number: 1R03OD030596-01
Former Number: 1R03DE030189-01
Contact PI/Project Leader: BROWN, C. TITUS
Awardee Organization: UNIVERSITY OF CALIFORNIA AT DAVIS
Description
Abstract Text
Project Summary
We will work with the iHMP data resource to apply novel tools and data analysis methodologies
to the challenge of associating large microbiome data sets with inflammatory bowel disease
(IBD) and the onset of diabetes. We will start with an annotation-free approach using
k-mers to preprocess IBD and diabetes cohorts. We will then apply a novel scaling technology
implemented in the sourmash software to reduce the data set size by a factor of 2000, rendering
it tractable to machine learning approaches. Next, we will use random forests to determine a
subset of predictive k-mers, and will measure their accuracy on validation data sets not used in
the initial training. Finally, we will annotate the predictive k-mers using all available genome
databases as well as a novel method to infer the metagenomic presence of the accessory genomes
of known genomes. Our outcomes will include a catalog of microbial genomes that correlate
with IBD subtype and the onset of diabetes, as well as automated workflows to apply similar
approaches to other data sets.
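
The k-mer preprocessing and size reduction described in the summary can be illustrated with a minimal sketch. It is not the sourmash implementation: it assumes that the reduction works by hashing each canonical k-mer and keeping only hashes below a fixed fraction (1/scaled) of the hash space, and it uses BLAKE2b from the Python standard library as a stand-in hash function. The function names, the k-mer size of 31, and the helper structure are all hypothetical choices made for illustration.

```python
import hashlib

MAX_HASH = 2**64  # size of the 64-bit hash space used in this sketch

# Translation table for complementing DNA bases (other characters pass through).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def kmers(seq, k):
    """Yield every overlapping k-mer of a DNA sequence."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def canonical(kmer):
    """Return the smaller of a k-mer and its reverse complement,
    so that both strands of the same DNA give the same k-mer."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def hash64(kmer):
    """Map a k-mer to a 64-bit integer (BLAKE2b is an assumption,
    chosen only because it ships with Python)."""
    digest = hashlib.blake2b(kmer.encode("ascii"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def scaled_sketch(seq, k=31, scaled=2000):
    """Keep only k-mer hashes in the lowest 1/scaled of the hash space.

    On average this retains about one in `scaled` distinct k-mers, and
    the same k-mer is kept or dropped consistently in every sample.
    """
    threshold = MAX_HASH // scaled
    sketch = set()
    for kmer in kmers(seq, k):
        h = hash64(canonical(kmer))
        if h < threshold:
            sketch.add(h)
    return sketch
```

Because the decision to keep a hash depends only on the hash value, sketches built from different samples remain directly comparable, and the retained hashes could serve as presence/absence features for a classifier such as a random forest.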
Public Health Relevance Statement
Project Narrative
We propose to work with the iHMP data, a large central microbiome resource, to study how the
microbiome correlates with inflammatory bowel disease and diabetes. We will work to associate
specific microbial species with these disease conditions. We will also produce resources that
will help other researchers perform similar studies.