Awardee OrganizationUNIVERSITY OF NEBRASKA MEDICAL CENTER
Description
Abstract Text
Scientific Abstract
As the leading cause of cancer death in the United States, lung cancer accounts for about 20% of all cancer
deaths. While there are two major types of lung cancer (i.e., 80%~85% for non-small cell lung cancer (NSCLC)
and 10%~15% for small cell lung cancer (SCLC)), each type of lung cancer has multiple distinct subtypes
characterized by morphological, molecular, and genetic alterations. Identifying lung cancer subtypes can
facilitate downstream risk stratification and tailored treatment design. While various conventional methods like
morphological analysis, computed tomography (CT) and imaging techniques, cytogenetic analysis,
immunophenotyping, or molecular profiling have been used for lung cancer subtype identification, they are
usually costly, time-consuming, labor-intensive, and sometimes inaccurate. Recent progress has witnessed the
application of next generation sequencing (NGS) for identifying lung cancer subtypes, but they are limited to bulk
NGS data, or single omics data only. With tons of omics data being generated within and beyond the Common
Fund data sets (e.g., GTEx and HuBMAP), we hypothesize that integration of single-cell and bulk multi-omics
data including genomics, transcriptomics, and epigenetics data will significantly facilitate subtype-specific
biomarker discovery and boost the accuracy of lung cancer subtype identification. To address these concerns,
we propose to develop an integrated machine learning (ML) framework for accurate and cost-effective
lung cancer subtype identification by combining single-cell and bulk multi-omics data within and beyond
Common Fund data sets. To achieve this, two specific aims are undertaken. Aim 1, to establish a gene-
signature-transfer ML model that leverages large-scale bulk and single-cell transcriptomics data within and
beyond Common Fund data sets for lung cancer subtype identification. Besides identifying well-annotated lung
cancer subtypes, we will also explore novel lung cancer subtypes by detecting rare cell types from large-scale
single cell data, from which cluster-specific and rare-cell-type specific gene signatures can be transferred to the
bulk transcriptomics data for improving performance of lung cancer subtype identification. Aim 2, to develop a
multi-omics integration framework to systematically combine single-cell and bulk multi-omics data (including
genomics, transcriptomics, epigenomics) to further boost lung cancer subtype identification. Our model is flexible
to tackle cases when only partial or incomplete multi-omics data are available for new patients. We believe
successful completion of this study will have direct impacts on improving downstream lung cancer risk
stratification, facilitating diagnosis and prognosis, and optimizing treatment selection. We also expect that our
proposed framework in this study can be customized and extensible to identifying subtypes of other types of
cancer.
Public Health Relevance Statement
Project Narrative
In this project, we propose to develop an accurate and cost-effective machine learning framework for lung cancer
subtype identification by integrating multi-modal genomics data within and beyond the Common Fund data sets.
If successful, our study will directly improve downstream lung cancer diagnosis, prognosis, and treatment. Our
proposed framework in this study can also be extensible to identifying subtypes of other cancers.
National Institute of Dental and Craniofacial Research
$1
2024
NIH Office of the Director
$306,999
Year
Funding IC
FY Total Cost by IC
Sub Projects
No Sub Projects information available for 1R03OD038391-01
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 1R03OD038391-01
Patents
No Patents information available for 1R03OD038391-01
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 1R03OD038391-01
Clinical Studies
No Clinical Studies information available for 1R03OD038391-01
News and More
Related News Releases
No news release information available for 1R03OD038391-01
History
No Historical information available for 1R03OD038391-01
Similar Projects
No Similar Projects information available for 1R03OD038391-01