Skip to Main Content

Big Data Summer Immersion at Yale

Transforming Analytical Learning in the Era of Big Data: A Summer Institute in Biostatistics (SIBS) program

The Big Data Summer Immersion at Yale (BDSY), is an intensive, interdisciplinary program that equips undergraduate students with the skills to utilize the power of big data for human health. Combining expertise from biostatistics, statistics, epidemiology, engineering, and computer science, with pressing challenges in human health, BDSY offers a truly unique learning experience.

Big Data Summer Immersion at Yale

The Big Data Summer Immersion at Yale (BDSY) is an interdisciplinary training and research program in biostatistics that introduces undergraduate students to the intersection of big data and human health.

Watch video

What is special about the Big Data Summer Immersion at Yale?

Entering its second year, the Big Data Summer Immersion at the Yale School of Public Health continues to strengthen its foundation as a premier undergraduate training experience in data science and public health. Yale University, with its rich landscape of teaching, research and practice in data science and health, provides a perfect setting to launch this program in the summer of 2025. The students will stay in the beautiful residential colleges at Yale University campus. Lectures and project work will largely take place in the Yale School of Public Health.

Undergraduate trainees joining this six-week immersion program will go through preparatory bootcamp in R, Python and Statistical learning in the first two weeks. Each morning, they will attend didactic lectures by stellar faculty in biostatistics, epidemiology, statistics, computer science, biomedical data science and each afternoon they will work in small groups on mentored research projects, supported by a graduate student and a faculty mentor. The projects will be in the areas of infectious disease modeling, causal inference, and genomics. Public health, medicine, social science and policy researchers will share their perspectives on using data science for improving human health. Throughout the program, the students will attend professional development workshops and have an opportunity to bond over social events. At the end of six weeks, they will present their research in a concluding symposium through posters and oral presentation. In our morning lectures, we will focus on responsible and ethical use of data science and artificial intelligence in this iteration of the program.

BDSY 2026

The 2026 program runs from June 15 - July 24 in New Haven, CT.

  • Application opens on December 15, 2025.
  • Application closes on March 13, 2026.
  • Admissions decisions by April 1, 2026.

Sign up for application notifications here.