DESCRIPTION: Funding is sought for the Summer Institute for Statistics of Big Data (SISBID) at the University of Washington. This program will provide workshops on the statistical and computational skills needed to access, process, manage, and analyze large biomedical data sets. It will be co-directed by Ali Shojaie and Daniela Witten, faculty in the Department of Biostatistics at the University of Washington. The SISBID program will consist of five 2.5-day in-person courses, or modules, taught at the University of Washington each July. Participants can register for any combination of modules. The five modules are as follows: (1) Accessing Biomedical Big Data; (2) Data Visualization; (3) Supervised Methods for Statistical Machine Learning; (4) Unsupervised Methods for Statistical Machine Learning; (5) Reproducible Research for Biomedical Big Data. Each module will combine formal lectures with hands-on computing labs. Participants will work in teams to apply the skills they develop in each module to important problems drawn from relevant case studies. The primary audience for SISBID will consist of biomedical scientists who would like to develop the statistical and computational training needed to make use of Biomedical Big Data. The secondary audience will consist of individuals with stronger statistical or computational backgrounds but little exposure to biology, who will learn how to apply their skills to problems associated with Biomedical Big Data. Participants will include advanced undergraduates, graduate students, post-doctoral fellows, and researchers, and will be drawn from industry, government, and academia. To ensure that all participants can fully engage in the program, they will be expected to have some background in R programming and statistical inference, which can be obtained by taking two free online courses before the program begins.
Each of the five modules will be co-taught by two instructors. The ten instructors will be drawn from top universities and research centers across the U.S., such as the University of Washington, Rice University, University of Iowa, Johns Hopkins University, MD Anderson Cancer Research Center, Fred Hutchinson Cancer Research Center, and University of North Carolina. They have been selected based on research expertise and excellence in teaching. Lecture videos and slides will be made freely available online so that individuals who are unable to attend SISBID in person can still benefit from the program. This proposal specifically requests funds for 55 student / postdoctoral fellow travel scholarships per year, 130 student / postdoctoral fellow registration scholarships per year, instructor travel and stipends, teaching assistant stipends, and PI salary support.
Public Health Relevance Statement
PUBLIC HEALTH RELEVANCE: In recent years, the biomedical sciences have been inundated by Big Data, such as DNA sequence data and electronic medical records. In principle, it should be possible to use such data for a variety of tasks, such as predicting an individual's risk of developing diabetes or cancer, and tailoring therapies to an individual should he or she become ill. The Summer Institute for Statistics of Big Data will provide biomedical researchers with the computational and statistical training needed in order to take advantage of Big Data, so that they can more effectively use it to understand human diseases and to improve human health.
NIH Spending Category
Networking and Information Technology R&D
Project Terms
Academia; Area; Big Data; Biology; Biomedical Computing; Biomedical Research; Biometry; Cancer Center; Case Study; Collection; Computer software; Computerized Medical Record; DNA Sequence; Data; Data Set; Diabetes Mellitus; Educational process of instructing; Educational workshop; Ensure; Environment; Exposure to; Faculty; Fred Hutchinson Cancer Research Center; Funding; Government; Health; Human; Hybrids; Imagery; Individual; Industry; Institutes; Iowa; Knowledge; Learning; Learning Module; Machine Learning; Malignant Neoplasms; NCI Center for Cancer Research; North Carolina; Participant; Persons; Postdoctoral Fellow; Process; Records; Research; Research Personnel; Resources; Rice; Risk; Running; Scholarship; Science; Slide; Statistical Computing; Statistical Methods; Students; Training; Training Activity; Training Programs; Travel; United States; Universities; Videotape; Wages; Washington; Work; base; biomedical scientist; data visualization; graduate student; human disease; improved; instructor; lectures; member; open source; programs; skills; statistics; teacher; web site
No Sub Projects information available for 5R25EB020380-02