The goal of this program is to bring together mathematical and computational scientists, sequencing technology developers in both industry and academia, and the biologists who use the instruments for particular research applications. This presents a unique opportunity to foster interactions between these three communities over an extended period of time and advance the mathematical foundations of this exciting field.

Biological sciences have been transformed over the past two decades by the development of technologies capable of performing large-scale measurements of cellular states. In particular, DNA sequencing instruments have undergone an extraordinary increase in efficiency during the past few years that has reduced the time and cost required to sequence billions of bases by several orders of magnitude.

Sequencing advancements are revolutionizing the scale and potential applications of genomic studies. There is an enormous need to develop mathematical and computational infrastructures to meet emerging data analysis challenges. To name just a few examples, applications requiring the development of novel mathematical and statistical frameworks include the reconstruction of RNA transcript populations, identifying sequence variations (both single-nucleotide and segmental) and exploring their disease associations, locating the sites of protein-DNA interactions, elucidating population histories, and reconstructing microbial communities that colonize particular hosts or environmental niches.graphic

Specifically, the innovative aspects of our program are:

  1. Research mentoring during the long program.
  2. Open problems from mentors and intro-workshop.
  3. Research skills training.
  4. Development of course plans for participants.
  5. Co-located conference to increase participant investment.
  6. Available computational resources for collaborations.
  7. Leveraging innovative programs in genomics education at UCLA.

CGSI is made possible by National Institutes of Health grant GM112625.