Awardee OrganizationUNIV OF NORTH CAROLINA CHAPEL HILL
Description
Abstract Text
A Data Commons that realizes the goal of efficiency in research needs to transform the way we
access, use, and generate data. This vision will require the efforts of a multidisciplinary, multiinstitutional
investigative team with complementary expertise in biomedicine, cloud architecture,
software engineering, analytical tools, and data harmonization. Each of the eight Key
Capabilities (KCs) addresses specific challenges faced by scientists working with large-scale
biomedical data. The proposed projects are designed such that each KC has unique objectives
and deliverables in the form of stand-alone Minimum Viable Products (MVPs), yet together, the
KCs form a continuum of insights and approaches that capture the five V’s of data and reflect
FAIR principles.
The specific scientific use
case for KC8 is sex as
biological variable (SABV).
SABV is agnostic with respect
to any disease or medical
condition, manifests across
multiple clinical and model
systems, is relevant to all
types of data and datasets
emphasized in the RFA,
requires a data model and
data harmonization across
data, and addresses
challenges in scientific rigor
and transparency, as recently
emphasized by NIH.
Moreover, the use case
achieves these goals by
examining and computing
over the data sources
identified by NIH, namely,
TOPMed, GTEx, and MODs.
Furthermore, SABV as a use
case enables contributing
KCs to examine each KC in
the context of real-life
challenges. Indeed, a key challenges in data integration across multiple knowledge domains is
the identification of commonalities and trends that can be used in a predictive manner. SABV
serves as an exemplar that requires maximizing data utility for computational use.
To exemplify cross-KC connectivity with the proposed work, consider a collaborative team with
interest in determining whether specific dietary interventions differ in effectiveness by sex. Using
KC8.MVP1, the team examines the impact of SABV on gene expression in the pancreas
(GTEx) and on metabolic gene products (MODs). Results are moved into the cloud environment
provided by KC4 PIVOT and the compute capabilities provided by KC5 Data Science Stacks
and CWL Execution Tools, where the team leverages whole-genome sequencing and
phenotypic data from TOPMed via KC8.MVP2 and using KC3 API and Tool Suite, KC2 GUIDs
Best Practices and Registry, and KC7 Indexing/Search Capabilities to facilitate the process.
The team conducts analyses to identify covariates and develop models for further analysis of
loci that exhibit sexual dimorphisms and/or sex interactions with dietary interventions. All results
are deposited into the Data Commons and used to guide the efficient design of randomized
controlled clinical trials. The team’s resources and products are assessed using KC1 FAIR-TLC
METRICS, and KC6 Governance Council oversees team activities to ensure that all ethical,
security, and privacy issues have been considered and requirements are enforced.
We envision a set of independent, yet interoperable, KCs designed to seamlessly address
biomedical data challenges in the context of SABV, with likely complementarity to the KCs
proposed by other groups. Our team was assembled specifically for its collaborative and open
science values and has circulated a draft Consortium Agreement among the partnering
institutions.
Public Health Relevance Statement
Data not available.
NIH Spending Category
No NIH Spending Category available.
Project Terms
AddressAgreementArchitectureBiologicalBiological ModelsClinicalCollaborationsDataData ScienceData SetData SourcesDepositionDietary InterventionDiseaseEffectivenessEnsureEnvironmentEthicsExhibitsFAIR principlesGene ExpressionGoalsInstitutionKnowledgeLifeMedicalMetabolicModelingPancreasPrivacyProcessRandomized Controlled Clinical TrialsRegistriesResearchResourcesScientistSecuritySoftware EngineeringTrans-Omics for Precision MedicineUnited States National Institutes of HealthVisionWorkanalytical tooldata integrationdata modelingdesigngene productgenome sequencingindexinginsightinterestinteroperabilitymultidisciplinaryopen dataphenotypic datasexsexual dimorphismtooltrendwhole genome
No Sub Projects information available for 3OT3OD025464-01S2
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 3OT3OD025464-01S2
Patents
No Patents information available for 3OT3OD025464-01S2
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 3OT3OD025464-01S2
Clinical Studies
No Clinical Studies information available for 3OT3OD025464-01S2
News and More
Related News Releases
No news release information available for 3OT3OD025464-01S2
History
No Historical information available for 3OT3OD025464-01S2
Similar Projects
No Similar Projects information available for 3OT3OD025464-01S2