Pierre Baldi, University of California, Irvine, USA

From Genomes to Drug Leads: Integrative Systems Biology Approaches
These three lectures will be organized around three themes at three different scales: Genomes and Omic Data, Integrative Systems Biology, and Chemoinformatics.

In the Genomics and Omic data theme, we will provide a brief historical overview of genomics and survey our understanding of the human genome and current challenges and opportunities in relation to human diseases and P4 (preventive, personalized, predictive, participatory) medicine. This will be followed by an overview of the different kinds of omic data that are available today.

In the Integrative Systems Biology theme, we will describe how omic and other data can be efficiently integrated. We will demonstrate the Crick expert system which uses databases of transcription factors and Bayesian statistical approaches to derive comprehensive, genome-wide, maps of regulatory elements. These maps are used to infer core regulatory circuits and loops. These inferences in turn are augmented and refined by integrating them with other data, such as: (1) Gene ontology; (2) Protein-protein interaction; (3) RNA; (4) Gene expression; (5) Epigenetic modifications; (6) Chromatin and DNA 3D structure; (7) SNPs; (8) Drugs; (9) Metabolites; and (10) Scientific literature. This approach enables the identification of new regulatory mechanisms and targets. Examples of collaborative projects based on the predictions made by Crick will be described together with a relatively new high-throughput technology for probing the response of the immune system, with applications to antigen discovery and vaccines.

In the Chemoinformatics theme, we will provide an overview of the area, including the available data on small molecules and their similarity measures, and how to build databases of small molecules with the underlying efficient compression, storage, search, and statistically significant retrieval algorithms. We will also present machine learning methods for the prediction of the physical, chemical, and biological properties of small molecules. Finally, we will tie the three themes together and show how computational methods can support the identification of novel drug leads.

Sample of references by our group: