Background And Mission
Between 2009 and 2015, U.S. employment in STEM (Science, Technology, Engineering, and Math) occupations grew by 10.5% to 8.6 million -more than double the rate of growth in non-STEM fields. Although 80% of the fastest-growing U.S. occupations require STEM skills, our country is unable to produce enough qualified graduates to fill these jobs. For every 1.9 available STEM jobs, there is only one STEM professional available for hire, and as many as 2.4 million unfilled STEM job vacancies are estimated for the coming years. This gap is partly due to a “leaky” STEM talent development pipeline –about 60% of students who enroll in a STEM field switch to a non-STEM field or drop out of college. Among women and minorities, that number rises to 80%. There is a critical need to implement approaches that attract and retain students interested in the fields of science and mathematics.
The CODE Project’s primary goal is to increase access to research-based learning experiences, particularly for students at small institutions with fewer undergraduate research opportunities. CODE introduces a broad range of students to bioinformatics in a mentored project-based format to capture interest, build confidence, and encourage continued participation in science education and a STEM career path like bioinformatics.
Historically, access to this type of project has been restricted to larger universities with academic medical centers or extensive research programs. However, many undergraduates attend smaller two-or four-year colleges where research-intensive experiences are rare. When compared to their peers at larger research-level institutions, students from smaller schools are at a significant disadvantage for acceptance into professional, graduate, or other advanced STEM-based training. This further decreases the pool of eligible STEM employees.
HudsonAlpha seeks to lower these barriers to access, placing genomic analysis within the grasp of a wider, more diverse audience. We believe these experiences will lead more Alabama students to complete the STEM workforce pipeline and successfully obtain careers in high demand fields such as genomics, modeling and simulation, and informatics.
Project Description
Current sequencing technologies make it possible to obtain the entire genetic code of an individual or agriculturally important plant in a matter of days. Typically, millions of genomic changes are identified, each of which must be examined and interpreted to detect the handful of variants with significant clinical or agricultural consequences. Often the process detects DNA variants that are not well understood because they have not been studied. A DNA variant of uncertain significance, or VUS, undergoes extensive analysis and testing to determine whether it has a role in the development of a trait or disease.
The VUS interpretation process is critically important yet, at times, painstakingly slow. Enormous amounts of genomic data are being generated in clinical and research laboratories like HudsonAlpha, meaning the number of variants awaiting deeper categorization continues to grow.
During a CODE project, trained faculty facilitators mentor their students in research projects to characterize DNA variants of uncertain significance previously identified at HudsonAlpha or curated from public databases. Analyzing a VUS involves both computational and experimental research to explain how a specific change in the DNA sequence might affect the function of a protein and, in turn, the health of a human, plant, or animal. Students are encouraged to study genes that are of interest to them. CODE facilitators have taught these research projects as labs, units in a related class, individual assignments, or full-semester courses.
Recognizing that many universities and colleges have minimal budgets for lab-based research, CODE utilizes computer-based toolsets that require minimal startup expenses but allow for real and significant contributions to science. Facilitators from participating schools will be trained in the DNA variant analysis pipeline during a two-day workshop led by HudsonAlpha researchers and educators.
HudsonAlpha provides facilitators and students with sequence data (gathered from de-identified patient samples) and any relevant background information. In addition, a series of online video and print tutorials developed at HudsonAlpha support facilitators as they guide students through the analysis pipeline. Students use publicly available databases and specialized software to examine each DNA variant and its surrounding sequence, align multiple DNA and protein sequences to study evolutionary conservation and structural comparison, model the protein’s structure, and run molecular dynamics simulations. Finally, they use this information to make predictions regarding the functional consequence of the DNA variant. Some schools may extend the research beyond CODE and combine computational analyses with biochemical, molecular, and physiological techniques to experimentally test a student’s predictions.
Students are encouraged to document their work and findings related to their genetic variant in a written report to be submitted to and discussed with the HudsonAlpha team. All student and faculty participants are invited to attend an annual symposium at HudsonAlpha to share their work, interact with other participating institutions, and learn from researchers in the field. Participating colleges and universities are encouraged to incorporate this project into their course curriculum, providing students with credit hours towards their degree and authentic research experience. A clearly defined policy of authorship sharing for resulting scientific publications and intellectual property rights has been established, designed to recognize the important role students and advisors will play in contributing to the scientific body of knowledge.
GOAL
The CODE project's primary goal is to increase access to research-based learning experiences, particularly for students at small institutions with fewer undergraduate research opportunities. CODE will introduce a broad range of students to bioinformatics in a mentored project-based format to capture interest, build confidence, and encourage continued participation in science education and a STEM career path like bioinformatics.
Objectives
-
Create a robust community of college faculty facilitators trained to mentor undergraduate students in bioinformatics research projects.
-
Broaden the awareness, interest, and skills of undergraduate students in bioinformatics careers and practices.
- HudsonAlpha provides training to facilitators and students from each participating college, along with online video and print training and tutorial resources.
- Student and faculty participants attend the annual CODE Student Symposium at HudsonAlpha to share their work, interact with other participating institutions and learn from researchers in the field.
Colleges and universities interested in participating in CODE can contact Michele Morris, Project Director, for more information at mmorris@hudsonalpha.org.
Participant Institution Benefits and Requirements
The CODE project is building a regional network of faculty from smaller colleges and universities to offer entry-level bioinformatics and protein modeling experiences to their students. We are growing our network and welcome educators interested in exploring cutting edge computational biology research with their students. No bioinformatics experience is required, but a sense of adventure and an excitement about learning new techniques is a must!
Participating schools receive training, a group license for modeling software, and are invited to the annual CODE Student Symposium at HudsonAlpha Institute to share their work and learn from others in the bioinformatics community.
Colleges and universities are invited to participate in CODE at no expense due to the generous funding of the NSF IUSE-EHR grant award No. 2120918.
Facilitator Training Workshops are held twice a year in July and December.
- Applicants must be an instructor of undergraduate students at an accredited college or university in the United States
- Applicants must be willing to commit two (2) days in training in Huntsville, AL
- Applicants must be willing to mentor a CODE project at their institution and participate in the program evaluation surveys
Apply for a Facilitator Training Workshop at https://bit.ly/hudsonalphacodetraining
If you would like more information about the CODE program contact Michele Morris, CODE Director, at mmorris@hudsonalpha.org.
The CODE project is actively recruiting new institutions to introduce genomic research and explore the field of computational biology with their students. Participating schools receive training, modeling software and are invited to a Student Symposium to share their work and learn from others in the bioinformatics community.
CODE Participating Institutions






