Prediction of nearest neighbor parameters for folding RNAs with modified nucleotides
Project Number5R21GM148835-02
Contact PI/Project LeaderAVIRAN, SHARON
Awardee OrganizationUNIVERSITY OF CALIFORNIA AT DAVIS
Description
Abstract Text
Natural and synthetic RNAs play key roles in cellular function, biotechnology, and medicine. RNAs fold into
intricate structures, which often drive their functions, thus determining RNA structure is fundamental to biology
and biotechnology. Computational thermodynamics-based secondary structure modeling (TSSM) is a popular,
low-cost, and rapid approach to structure prediction, which has enabled transcriptome-wide structure-function
studies and massive structure-based screens of synthetic RNA libraries. However, recent evidence suggests
that a diversity of post-transcriptional chemical nucleotide modifications additionally exert profound impact on
local and/or global structure, to ultimately modulate the RNA’s stability, expression, or regulatory function. Such
modifications are widespread in all life domains and represent a new and poorly understood layer of gene
regulation, which has been implicated in disease. Moreover, they are routinely introduced into RNA medicines
as a means of evading the innate immune response. Taken together, the wealth of natural modifications and
development of novel artificial ones, the growing interest in their mechanism, and their centrality to RNA medicine
underscore a pressing need to determine structures of RNAs with modified nucleotides rapidly and accurately.
However, TSSM methods cannot account for the effects of modifications due to a lack of parameters to estimate
their folding stabilities. They rely on the feature-rich Turner nearest-neighbor (NN) thermodynamic model, which
is parameterized by 294 free-energy change values derived for canonical bases from 802 costly and laborious
UV melting experiments. Given the diverse and rapidly expanding pool of modifications, it is impractical to repeat
such experiments for each type. The premise of this proposal is that NN parameters can be learned more
efficiently from alternative experiments, which are affordable, widely accessible, and high throughput.
Specifically, next-generation sequencing has transformed RNA Structure Probing (SP) into a routine massively
parallel experiment, which reports structural information about local nucleotide dynamics. SP is widely used to
gain insights into RNA structure and function from genome-wide studies and to constrain TSSM algorithms to
improve their predictions. However, unlike melting assays, the relationship between RNA folding stability and SP
measurements is highly nontrivial, and thus the problem of recovering the parameters from SP data is difficult.
The goal of this proposal is to develop novel algorithms and software to estimate NN parameters from
high-throughput SP data. We will design statistical inference methods that reconcile information from folding
algorithms and SP experiments and apply them to data for unmodified and modified RNAs to estimate new
parameters for modified nucleotides. As the link between SP data and folding thermodynamics is complex, and
furthermore, the ability to fit the Turner parameters from SP data has not been explored, we will assess the
feasibility, accuracy, performance, and computational efficiency of the developed methods. Validation
efforts will include comparing to experimentally derived values and evaluating predictions over held-out data.
Public Health Relevance Statement
Project Narrative
Cells use a diversity of chemical nucleotide modifications to regulate genes, and scientists use them to create
potent RNA vaccines and drugs, where such modifications often exert their effect by altering RNA folding to
trigger additional changes. This underscores a need to determine how modified RNAs fold, but while
computational folding is widely used, accounting for the effects of modifications warrants revising its many
parameters via a multitude of costly and laborious UV melting assays. We will develop computational methods
for rapid estimation of these parameters from alternative, high-throughput structure probing assays, to advance
studies of the roles of modifications in cellular function and to accelerate engineering of efficacious RNA
medicines.
No Sub Projects information available for 5R21GM148835-02
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 5R21GM148835-02
Patents
No Patents information available for 5R21GM148835-02
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 5R21GM148835-02
Clinical Studies
No Clinical Studies information available for 5R21GM148835-02
News and More
Related News Releases
No news release information available for 5R21GM148835-02
History
No Historical information available for 5R21GM148835-02
Similar Projects
No Similar Projects information available for 5R21GM148835-02