Project Summary
Harnessing Machine Learning & Object Detection for Automated Evaluation of Student-folded Protein Models
This project will impact public health through improved bioscience education. Mini-toobers are one of 3D Molecular
Designs’ (3DMD) most innovative, popular, and effective modeling tools. This free-form modeling material (foam-
covered wire) typically represents the backbone of a folded protein, allowing students to model actual protein
structures. The problem is that mini-toobers lack an immediate feedback system. As a free-form modeling tool,
students can use mini-toobers to model biological structures correctly just as easily as incorrectly (right vs. left-handed
helices). 3DMD’s other modeling tools have feedback features like magnets, arrows, post/hole connections, etc. that
help students self-correct. Educators observe student’s use of models to see their understanding of a concept; this
observation is a type of formative assessment. However, there is often too much to observe in a mini-toober model
(shape, sidechains, secondary structures, etc.) for an educator to assess all the data in real-time effectively.
Our solution is to apply machine learning and object detection to create digital applications that deliver immediate
feedback and precise assessment of student-folded mini-toober protein models. Two applications will be created:
1) Student Training App. The app's augmented reality features will give students immediate feedback on their modeling
of alpha helices, beta sheets, and other protein motifs to help them learn protein structure and function. 2) Assessment
App. This password-protected app will assess the model’s accuracy, generate a numerical score, and identify inaccurate
regions of the protein. This Phase I proposal focuses on two specific aims necessary to establish the feasibility of
achieving this goal:
1. Develop AI technology to locate student-folded zinc finger protein models with an 80% average precision rating.
2. Create a data gathering plan to collect the 3D coordinates of student-folded protein models.
We will focus our research and development efforts on feasibility testing machine learning frameworks, including
TensorFlow, PyTorch, and Unity Barracuda; on computer vision libraries such as OpenCV; and on object detection and
conversion systems, including YOLO and ONNX to achieve these specific aims. One of the most powerful aspects of
object detection is the ability to identify various shapes and forms. This particularly applies to 3DMD’s Mini-Toober
protein modeling materials, as they can be folded into various structures depending on how a student interacts with the
material. We will collect student-folded zinc finger models from outreach programs run by other 3DMD staff and
partnering organizations. To test different approaches, we will scan the model with the developed app while recording
video for documentation. The team will continue to develop and refine the application until at least an 80% average
precision rating is achieved. We will then create a data-collection plan to document enough models at Science
Olympiad competitions to build the asset library needed for the next phase of this project.
Public Health Relevance Statement
Project Narrative
Harnessing Machine Learning & Object Detection for Automated Evaluation of Student-folded Protein Models
The Harnessing Machine Learning & Object Detection for Automated Evaluation of Student-folded Protein Models
will, in the long term, impact public health through improved bioscience education in secondary schools. This application
will give students real-time feedback on their modeling of proteins in the physical world. As students build their physical
models of proteins, they simultaneously build their conceptual understanding of proteins, the critical part of nearly all
biological systems.
No Sub Projects information available for 1R43GM154526-01
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 1R43GM154526-01
Patents
No Patents information available for 1R43GM154526-01
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 1R43GM154526-01
Clinical Studies
No Clinical Studies information available for 1R43GM154526-01
News and More
Related News Releases
No news release information available for 1R43GM154526-01
History
No Historical information available for 1R43GM154526-01
Similar Projects
No Similar Projects information available for 1R43GM154526-01