CS  584   Topics in Computational and Life Science

 

---

 

Semester:                 Spring 2007

Meeting Date:           Friday

Meeting Time:          12:50 -- 2PM; 3 – 4PM; others scheduled as necessary

Coordinators:            Dr. James Lu (jlu@mathcs.emory.edu; 404-712-8638)

                                        Dr. Kim Gernert (gernert@emory.edu; 404-727-3501)

 

Objective:

Our objective is to learn. 

 

You want more detail?  Well, alright: Computer Science plays an increasingly important role in the research study of life sciences (and, to be sure, in nearly all intellectual inquiries[1]).  The development and application of computing techniques, broadly construed, to the study of biology, medicine, and healthcare carry particular significance due to their obvious societal benefits.  The scholarly inquiry of a discipline X based on computing can be divided roughly into two types: computational-X, and X-informatics.  The former focuses on the construction of mathematical models and numerical solution techniques to analyze and solve problems in X, while the latter is concerned with the management and processing of data, information, and knowledge in X.  Examples include computational biology, computational economics, bioinformatics, and medical informatics.

 

Our goal in the course is to sample and to learn about a few modern computational and informatics techniques in the context of current life science research studies.  Towards that end, the course will be organized around a series of case studies.  Upon completing the course, students (and at least one of the instructors) will have an understanding of how computing techniques are assisting the advances of science and medicine.  Equally important, we hope to learn and understand how these interdisciplinary inquiries are shaping, directing, and synthesizing new Computer Science research.  Students will also be in a position to study and explore related topics on their own after the course.

 

Organization:

Each case study is based on an ongoing project directed by a researcher (both Emory and from outside).  The researcher will give a seminar on the project.  We will provide background lectures prior to each project seminar, and follow up with a discussion.

 

 

Grading:

Grading will be on a S/U basis.  Students will be given written assignments that will organize, summarize, or analyze each case study.  Questions to stimulate discussion will be distributed after each seminar.  We hope to use the collective efforts of students and instructors to prepare materials that can be useful for future offerings of the course.  Class attendance and participation will also be important.


---

 

Introduction

 

19 January: Course Overview, J. Lu

 

We give a brief overview of how some related courses are organized elsewhere, and discuss our course objectives and meta-objectives (in context).  We discuss course organization, administration, and the roles that students will play in the course.  Keep in mind that this is trial run of what we hope to be a regular course in the future, so even the best laid plan will not necessarily work out exactly – but we will do our best.

Reading: Molecular Biology for Computer Scientist, Lawrence Hunter, 1993.

 

26 January: Current Biological Research Overview, K. Gernert

 

We will give an overview of biological research and its application towards predictive health studies.  Historically, scientists studied a single molecule or a single biological process at a time.  However, technology now allows researcher to use high-throughput methods in the study of cells, organisms and disease processes.  This ability has overwhelmed the scientist with experimental data that requires new computational resources for storage, querying and analysis.  Also, as the computational tools are developed, scientists are now able to evaluate multiple experimental data types in concert and thus visualize more and more complicated systems.  Such applications have opened the door to predictive health and related fields.

Reading: http://www.systemsbiology.org/Systems_Biology_in_Depth

Homework 1  

 

 

---

 

Protein Function Prediction

 

2 February: Biology Preparation, K. Gernert

 

We will cover the basic concepts of biology which are measured in many of the case studies to be presented in this course.  Biological concepts will include: 

cells – the building blocks of life,

genomics:  DNA – the genetic material, its variability and regulation,

transcriptomics:  RNA – the template for protein synthesis, its variable expression

proteomics:  proteins – polymers which are instrumental in biological structures and processes, their expression, active state, and regulation.

metabolomics:  monitoring of the substrates and end-products of biological reactions,

clinical studies and clinical labs:  multiple questionnaires and lab samples monitoring the health of the patient. 

 

9 February: Open mathematics and statistics discussion with Dr. Sun

Reading:

 

9 February (Seminar): An Integrated Approach for Protein Function Prediction

 

Abstract:

Assigning functions to novel proteins is one of the most important problems in the post-genomic era. Many different sources of genomic data, such as protein-protein physical interactions, genetic interactions, gene expression, and domain structure, contain information about protein function. We developed a novel approach that employs the theory of kernel based Markov random fields to infer a protein's functions using an integrated approach combining various sources of biological data. The model is flexible in that other protein pairwise relationship information and features of individual proteins can be easily incorporated. We apply our integrated approach to predict functions of yeast proteins and in mouse.

 

 

Time: 3PM

Room: W201

Speaker: Dr. Fengzhu Sun

Biosketch: Dr. Sun is an Associate Professor of Molecular and Computational Biology and Mathematics at USC. His bachelors in Mathematics is from Shandong University, Masters in Probability and Statistics is from Peking University, and PhD in Applied Mathematics is from University of Southern California. Dr. Sun was an assistant professor of genetics and biostatistics at Emory University from 1995 to 2000.

 

16 February: Discussion and Summary

 

---

Pattern Generation and Homestasis in Small Neural Networks

 

23 February: Biology Preparation, A. Prinz

 

I would like to spend the class time introducing the students to the problem of high-dimensional model parameter spaces, the approach of exploring them with computational brute force, and some of the things we have learned about the biology of neuronal systems form it. In the course of that, the students will get exposed to the neuronal model database

Reading:

 

Speaker: Dr. Astrid Prinz

Biosketch: Dr. Prinz is Assistant Professor of Biology at Emory.  She received her Diploma in Physics from the University of Ulm, Germany in 1996, and her Ph.D. from the Munich Technical University in 2000.

 

2 March: Mathematics/CS open discussion with Dr. Hickey

 

 

2 March (Seminar): NeuroVis: combining dimensional stacking and pixelization to visually

explore multidimensional data

 

Abstract: Give a description here.

 

Time: 3PM

Room: W201

Speaker: Dr. Timothy Hickey

Biosketch: Dr. Hickey is Professor and Chair of Computer Science at Brandeis University.  He received his Ph.D. in Mathematics from the University of Chicago in 1986.

 

 

9 March: Discussion and Summary

 

---

 

The Protective Immunity Project (PIP)

 

23 March: Biological Preparation, C. Martens and V. Hertzberg

The Protective Immunity Project (PIP) is examining immune function in renal transplantation. In particular, the temporal effects of immuno-suppressive drug regimens as well as to challenges with vaccination will be characterized. Flow cytometry (FACS) and gene expression microarray (Affymetrix) data will be collected longitudinally in humans as well as in animal models. Computational challenges include storage of these large, complex datasets, data reduction techniques, and numerical algorithms in modelling. (slides)

 

30 March: Computational open discussion, Li Xiong

A discussion of possible data mining techniques to analyze the PIP data. (slides)

 

---

 

Cancer Biomedical Informatics

 

6 April: Patrick McConnell

An Introduction to the Cancer Biomedical Informatics Grid (caBIG) (slides)


13 April: Christopher Flowers

Developing Information Systems for Cancer Research (slides)


20 April: James Lu

Querying and Browsing Heterogeneous Data Sources


27 April: Student Presentations


CS  584   Topics in Computational and Life Science

 

---

 

 

1/19

Introduction

 

J. Lu

1/26

Biology introduction

 

K. Gernert

 

 

 

 

 

 

2/2

Biological prep

 

K. Gernert

2/8

 

Seminar time at GA Tech

Dr. Fengzhu Sun, USC

2/9

Mathematics and Statistics

prep

 

Dr. Fengzhu Sun, USC

2/9

 

Seminar 3:00

Dr. Fengzhu Sun, USC

2/16

Discussion on Dr. Sun

 

Lu, Gernert

 

 

 

 

 

 

2/23

Biological prep

 

Dr. Astrid Prinz

Biology, Emory

3/2

Mathematical prep

 

Dr. Timothy Hickey

3/2

 

Seminar 3:00

Dr. Timothy Hickey

Brandeis University

3/9

Discussion on Dr. Hickey

 

Lu, Gernert

 

 

 

 

 

 

3/16

Spring break

 

 

3/23

Biological prep

 

Dr. Christine Martens, Surgery

Dr. Vicki Hertzberg, Biostatistics

3/30

CS

 

Dr. Li Xiong, CS

 

 

 

 

4/6

CS Development

caBIG data structure

Patrick McConnell,

Duke University

 

 

 

 

 

 

4/13

Biological prep

 

Dr. Christopher Flowers

4/20

CS

 

Dr. James Lu

 

 

 

 

 

 

4/27

Student Presentations

 

 

 

 

 

 

 



[1] Case in point, the inaugural issue of the International Journal of Social and Humanistic Computing is scheduled to launch in 2007.