Bayesian Methods for Genomics with Variable Selection
Project Number5R01HG003319-02
Contact PI/Project LeaderVANNUCCI, MARINA
Awardee OrganizationTEXAS A&M UNIVERSITY
Description
Abstract Text
DESCRIPTION (provided by applicant): The overall objective of this research proposal is to develop new Bayesian methodologies for the analysis of data that arise in genomics. Of particular interest are situations where a large number of variables is available and selection of a predictive subset is one of the goals. The theoretical developments we propose are motivated by a variety of studies, some conducted by our biomedical collaborators, using DNA microarray technologies. One of the goals of this project is to contribute novel theoretical developments in variable and feature selection in statistics. Another goal is to provide the biomedical community with sound methods for the analysis of high-dimensional data. The identification of important biomarkers will provide a better understanding of the molecular mechanisms involved in specific diseases, and will in turn improve diagnosis, drug development, and treatment of patients.The specific aims of our proposed research are:
1. Clustering of High-Dimensional Data: We will develop novel Bayesian methods for simultaneously clustering experimental units and identifying the variables that best discriminate the different groups.
2. Analysis of High-Dimensional Data with Censored Survival Outcomes: We will investigate novel methods for variable selection in parametric survival models. The methods will lead to estimates of the survival and to the identification of the predictive variables.
3. Application to Microarray Studies: We will apply the methods of Specific Aims #1 and #2 to a series of biomedical studies involving microarray data. These include studies on rheumatoid arthritis and osteoarthritis and adult acute lymphobiastic leukemia.
4. Application to Proteomic Data: We will adapt our methodologies to the problem of extracting important features in proteomics data, incorporating dimension reduction wavelet techniques.
5. Software development: We will develop statistical software and will make it available to the public.
Public Health Relevance Statement
Data not available.
NIH Spending Category
No NIH Spending Category available.
Project Terms
acute lymphocytic leukemiabiotechnologycomputer program /softwarecomputer system design /evaluationfunctional /structural genomicshigh throughput technologyhuman datainformation retrievalmathematical modelmicroarray technologymodel design /developmentmolecular biology information systemmolecular geneticsnucleic acid sequenceproteomicsrheumatoid arthritisstatistics /biometry
No Sub Projects information available for 5R01HG003319-02
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 5R01HG003319-02
Patents
No Patents information available for 5R01HG003319-02
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 5R01HG003319-02
Clinical Studies
No Clinical Studies information available for 5R01HG003319-02
News and More
Related News Releases
No news release information available for 5R01HG003319-02
History
No Historical information available for 5R01HG003319-02
Similar Projects
No Similar Projects information available for 5R01HG003319-02