Unsupervised Statistical Methods for Data-driven Analyses in Spatially Resolved Transcriptomics Data
Project Number4R00HG012229-03
Former Number5K99HG012229-02
Contact PI/Project LeaderWEBER, LUKAS MARTIN
Awardee OrganizationBOSTON UNIVERSITY MEDICAL CAMPUS
Description
Abstract Text
Project Summary/Abstract
Recently developed spatially resolved transcriptomics (ST) technologies measure transcriptome-wide gene
expression at a near-single-cell, single-cell, or sub-cellular resolution in intact tissue, preserving the spatial
organization of complex tissues. These technologies build upon widely-adopted single-cell RNA sequencing
(scRNA-seq) technologies by adding spatial coordinates to the transcriptome-wide gene expression
measurements, thus enabling an understanding of how the spatial organization of cells in complex tissues
influences function, disease initiation, progression, and therapeutic response in human health and disease.
However, these technologies also present new statistical and computational challenges, which need to
be addressed to accurately interpret this complex data. While initial studies applying these tools have reused
data analysis methods and data storage techniques designed for scRNA-seq, unfortunately these approaches
largely ignore spatial information. Furthermore, existing methodologies for ST data rely on external information
such as marker genes or reference cell types, potentially leading to systematic errors and biased results during
preprocessing, feature selection, classification of spatially resolved cell types, and differential discovery. There
do not yet exist robust and accurate preprocessing and unsupervised statistical methodologies to investigate
ST data in a data-driven manner. The overall goals of this K99/R00 Pathway to Independence Award proposal
are to request support to address this fundamental gap in statistical methodology to develop spatially-aware (1)
methods for preprocessing, (2) unsupervised methods for spatially resolved clustering and differential
discovery between conditions, and (3) data infrastructure and benchmarking resources to standardize the
storage and access of ST data. These proposed methods will lead to an improved understanding of health and
disease mechanisms.
This proposal will provide the training, mentoring, and professional development to accomplish my
research goals and transition to a tenure track faculty position at a research institution with independent
extramural funding. As the demand for ST technologies grows, in particular now that it has been highlighted as
the Nature Methods 2020 Method of the Year, these urgently needed statistical methods and open-source
software proposed in this project will enable ST technologies to transform precision medicine through novel
biological insights relating to spatial properties of cell populations and gene expression in healthy and diseased
tissues. At the completion of this award, I will become part of a new generation of researchers, proficient in
spatial statistics, machine learning, and spatial transcriptomics technologies, enabling me to work closely with
biomedical researchers spatially profiling the transcriptomes of complex tissues.
Public Health Relevance Statement
Project Narrative
Spatially resolved transcriptomics (ST), the Nature Methods 2020 Method of the Year, is poised to transform
our understanding of how the spatial organization of cells in complex tissues influences function, disease
initiation, progression, and therapeutic response. However, existing statistical methods to analyze these data
fail to address new challenges, thereby limiting appropriate interpretation of these increasingly complex data.
These urgently needed spatially-aware, unsupervised statistical methods and open-source software proposed
in this award will enable ST technologies to transform precision medicine through novel biological insights
relating to spatial properties of cell populations and gene expression in healthy and diseased tissues.
No Sub Projects information available for 4R00HG012229-03
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 4R00HG012229-03
Patents
No Patents information available for 4R00HG012229-03
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 4R00HG012229-03
Clinical Studies
No Clinical Studies information available for 4R00HG012229-03
News and More
Related News Releases
No news release information available for 4R00HG012229-03
History
No Historical information available for 4R00HG012229-03
Similar Projects
No Similar Projects information available for 4R00HG012229-03