Expanding the capabilities of third-generation sequencing for multi-dimensional genome and transcriptome analysis.
Project Number1DP2GM159179-01
Former Number1DP2OD036996-01
Contact PI/Project LeaderFLAVAHAN, WILLIAM ALEXANDER
Awardee OrganizationUNIV OF MASSACHUSETTS MED SCH WORCESTER
Description
Abstract Text
Project Summary
In cells, nucleic acids exist as molecular complexes, whose function depends not only on their base-pairing
potential but also on their shapes and chemical structures. The DNA in a cell’s genome forms a highly dynamic
macromolecular structure, called ‘chromatin’, and the DNA can be modified by regulatory proteins (e.g. 5-
methylation of cytosine) or it can be damaged by oxidation or genotoxic compounds. RNA is subject to over 170
different modifications, which can impact its stability, structure, or function. There is a wealth of information in
these nucleic acids beyond their ability to form Watson-Crick base pairings, yet next-generation sequencing
approaches are only able to interpret DNA through its base pairing and loses much of this information. Third-
generation sequencing technologies, such as nanopore sequencing, can theoretically measure modified or
damaged bases. Nanopore sequencing works by passing nucleic acids through an engineered pore and
measuring the current that flows past the five bases of the DNA molecule currently in the pore. Thus, if a modified
base is in the pore, the current will be slightly different than it would be for the unmodified base. However, in
order to recognize these subtle current differences, training libraries with the modified (and unmodified) bases in
every single five-base combination must be sequenced on the device to create a set of reference values that
can be used to decode these subtle changes. This is a complicated and expensive process, beyond the reach
of a single lab, and as such only a couple of modified bases can currently be sequenced by nanopore. My lab
has developed an approach that allows for the rapid and cheap (on the order of several hundred dollars)
generation of these training libraries using procedural barcoded synthesis. With this approach, the generation
of a reference library is easily achievable for a single project by a small academic lab. My lab has generated
training libraries and successfully sequenced numerous modified DNA bases, which has allowed us to perform
innovative multi-dimensional sequencing experiments; reading DNA damage or encoded cellular properties such
as proliferation along with the base sequence. In this project, I seek to expand our technology. First, we will
generate training libraries to allow for direct detection of common damaged bases resulting from processes such
as oxidation or alkylation, which will permit deeper understanding of how DNA damage occurs in settings such
as aging or cancer treatment. Second, we will adapt our method to work with RNA, allowing for direct detection
of the myriad modifications that occur in the transcriptome and are important in the structure, function, and
regulation of mRNA and non-coding functional RNAs alike. The experiments performed as part of this project
will enhance our understanding of how nucleic acids are modified and damaged within cells, and the further
development of our technology will provide invaluable tools for further interrogation of altered nucleic acid bases.
Public Health Relevance Statement
Project Narrative
Nucleic acids, such as DNA and RNA, exist in cells as highly complex molecular structures subject to numerous
modifications and vulnerable to damage, either of which can change their function and impact cellular states;
however current sequencing technology can only analyze their base sequence, ignoring much of this information.
Here, I present preliminary data and propose a path to expand a technique developed by my lab to directly
sequence modified or damaged DNA or RNA, using an innovative approach to modify novel sequencing
technology. With these tools, we hope to understand the processes that converge on DNA and RNA, such as
chemotherapy in cancer or natural damage that occurs during aging or neurodegeneration, to better learn how
cells modify, repair, and interpret their genomes.
No Sub Projects information available for 1DP2GM159179-01
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 1DP2GM159179-01
Patents
No Patents information available for 1DP2GM159179-01
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 1DP2GM159179-01
Clinical Studies
No Clinical Studies information available for 1DP2GM159179-01
News and More
Related News Releases
No news release information available for 1DP2GM159179-01
History
No Historical information available for 1DP2GM159179-01
Similar Projects
No Similar Projects information available for 1DP2GM159179-01