ABSTRACT
Somatic mosaicism (SM), i.e. the presence of cells with somatically acquired mutations, is a driving feature of
cancer and several developmental diseases. However, whereas today we have detailed understanding and
predictive models of benign and pathogenic inherited polymorphisms, germline de novo mutations, and tumor
mutations, we have only limited knowledge of the burden, allele frequency spectrum, clonal patterns, and
mutational signatures of healthy somatic mosaicism. Realizing that such currently missing knowledge is critical
for informing experimental design in future studies of mosaicism’s biological and clinical consequences, NIH is
launching an ambitious initiative, the Somatic Mosaicism across Human Tissues (SMaHT) project to construct a
comprehensive human somatic mosaicism atlas. As part of this initiative, funding announcement RFA-RM-22-
011 calls for Tool Development Projects to develop “approaches that significantly improve the sensitivity,
accuracy, and threshold of detection of all types of somatic variants across the complete genome”. Such
comprehensive detection is currently challenging because somatic mosaicism mutations occur across a wide
range of mutation types and lengths, but the majority of today’s variant detection tools have low sensitivity for
larger, structural events. Furthermore, somatic mutations are typically at very low allele frequency (<1%), but
accurate detection of low-frequency variation today is beyond the capabilities of most tools. We have pioneered
a unique-kmer guided detection approach in our RUFUS tool, designed for germline de novo mutation detection.
This approach focuses on identifying the novel DNA sequence created by a mutation, which allows the same
underlying algorithm, with uniform algorithmic behavior and sensitivity, to be applied across the full range of
mutation types. RUFUS has been validated for accurately detecting germline de novo mutations in large
discovery datasets and rare-disease diagnostic studies. Our preliminary analyses also indicate that RUFUS has
high sensitivity across a full range of somatic mutations. This application proposes to adapt the RUFUS
algorithm for somatic mosaic mutation detection with high sensitivity and specificity across the entire
mutation type, mutation length, and allele frequency spectrum; and thus, substantially contribute to the
construction of a comprehensive mosaicism atlas. To achieve this overall goal, in the first (UG3) phase of
the project we will focus on algorithmic development to improve low-frequency allele detection, empirically
characterize RUFUS’s sensitivity and specificity, and ready the tool for adoption into the SMaHT Network’s
central analysis pipelines. In the second (UH3) phase of the project, we will integrate RUFUS into the central
analysis workflow of the SMaHT consortium; optimize and extend its performance for analyzing the vast SMaHT
somatic mosaicism dataset. We anticipate that RUFUS will contribute substantially to the SMaHT Initiative's goal
to comprehensively map out human somatic mosaicism across individuals, organs, and tissues.
Public Health Relevance Statement
NARRATIVE
The overarching goal of this Tool Development Project is to adapt our RUFUS germline de novo mutation
detection software for accurate and sensitive discovery of somatic mutations. We anticipate that this
development will lead to a computational mutation detection tool that can facilitate effective and accurate
discovery of somatic mutations of all types, lengths, and allele frequencies, and thus substantially contribute to
the cataloging and understanding of healthy human somatic mosaicism.
NIH Spending Category
No NIH Spending Category available.
Project Terms
AdoptionAlgorithmsAtlasesAutomobile DrivingBehaviorBenignBiologicalCalibrationCatalogingCellsClinicalComputational algorithmComputer softwareDNADNA SequenceDataData AnalysesData DiscoveryData SetDetectionDevelopmentEventExperimental DesignsFrequenciesFundingFutureGene FrequencyGenetic PolymorphismGenomeGoalsHumanIndividualInheritedKnowledgeLengthLinkMalignant NeoplasmsMapsMeasuresMosaicismMutationMutation AnalysisMutation DetectionNoiseOrganPathogenicityPatternPerformancePhaseRare DiseasesRepetitive SequenceResourcesSamplingSensitivity and SpecificitySomatic MutationSpecificityTissuesTrainingUnited States National Institutes of HealthValidationVariantalgorithm developmentanalysis pipelinebasede novo mutationdesigndetection sensitivitydevelopmental diseasedisease diagnostichuman tissueimprovedmosaic variantnovelpredictive modelingtooltool developmenttumorvariant detection
National Institute of Neurological Disorders and Stroke
CFDA Code
853
DUNS Number
009095365
UEI
LL8GLEVH6MG3
Project Start Date
01-April-2023
Project End Date
31-March-2025
Budget Start Date
01-April-2024
Budget End Date
31-March-2025
Project Funding Information for 2024
Total Funding
$384,694
Direct Costs
$249,801
Indirect Costs
$134,893
Year
Funding IC
FY Total Cost by IC
2024
NIH Office of the Director
$384,694
Year
Funding IC
FY Total Cost by IC
Sub Projects
No Sub Projects information available for 5UG3NS132134-02
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 5UG3NS132134-02
Patents
No Patents information available for 5UG3NS132134-02
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 5UG3NS132134-02
Clinical Studies
No Clinical Studies information available for 5UG3NS132134-02
News and More
Related News Releases
No news release information available for 5UG3NS132134-02
History
No Historical information available for 5UG3NS132134-02
Similar Projects
No Similar Projects information available for 5UG3NS132134-02