Network Analysis in Systems Biology

Instructor: Avi Ma’ayan, PhD

Intermediate Level • 3 weeks at 10 hours a week • Flexible Schedule

Skills You'll Gain

Data Processing
R Programming
Unix Commands
Statistical Analysis
Biology
Unsupervised Learning
Bioinformatics
Data Integration
Network Analysis
Molecular Biology
Data analysis
Machine Learning Algorithms

Shareable Certificate

Earn a shareable certificate to add to your LinkedIn profile

Outcomes

                    
                                Learn new concepts from industry experts
                            
                                Gain a foundational understanding of a subject or tool
                            
                                Develop job-relevant skills with hands-on projects
                            
                                Earn a shareable career certificate

There are 10 modules in this course

The 'Introduction to Complex Systems' module discusses complex systems and leads to the idea that a cell can be considered a complex system or a complex agent living in a complex environment just like us. The 'Introduction to Biology for Engineers' module provides an introduction to some central topics in cell and molecular biology for those who do not have the background in the field. This is not a comprehensive coverage of cell and molecular biology. The goal is to provide an entry point to motivate those who are interested in this field, coming from other disciplines, to begin studying biology.

In the 'Topological and Network Evolution Models' module, we provide several lectures about a historical perspective of network analysis in systems biology. The focus is on in-silico network evolution models. These are simple computational models that, based of few rules, can create networks that have a similar topology to the molecular networks observed in biological systems.

The 'Types of Biological Networks' module is about the various types of networks that are typically constructed and analyzed in systems biology and systems pharmacology. This lecture ends with the idea of functional association networks (FANs). Following this lecture are lectures that discuss how to construct FANs and how to use these networks for analyzing gene lists.

This set of lectures in the 'Data Processing and Identifying Differentially Expressed Genes' module first discusses data normalization methods, and then several lectures are devoted to explaining the problem of identifying differentially expressed genes with the focus on understanding the inner workings of a new method developed by the Ma'ayan Laboratory called the Characteristic Direction.

In the 'Gene Set Enrichment and Network Analyses' module the emphasis is on tools developed by the Ma'ayan Laboratory to analyze gene sets. Several tools will be discussed including: Enrichr, GEO2Enrichr, Expression2Kinases and DrugPairSeeker. In addition, one lecture will be devoted to a method we call enrichment vector clustering we developed, and two lectures will describe the popular gene set enrichment analysis (GSEA) method and an improved method we developed called principal angle enrichment analysis (PAEA).

A set of lectures in the 'Deep Sequencing Data Processing and Analysis' module will cover the basic steps and popular pipelines to analyze RNA-seq and ChIP-seq data going from the raw data to gene lists to figures. These lectures also cover UNIX/Linux commands and some programming elements of R, a popular freely available statistical software. Note that since these lectures were developed and recorded during the Fall of 2013, it is possible that there are better tools that should be used now since the field is rapidly advancing.

This module is devoted to various method of clustering: principal component analysis, self-organizing maps, network-based clustering and hierarchical clustering. The theory behind these methods of analysis are covered in detail, and this is followed by some practical demonstration of the methods for applications using R and MATLAB.

The lectures in the 'Resources for Data Integration' module are about the various types of networks that are typically constructed and analyzed in systems biology and systems pharmacology. These lectures start with the idea of functional association networks (FANs). Following this lecture are several lectures that discuss how to construct FANs from various resources and how to use these networks for analyzing gene lists as well as to construct a puzzle that can be used to connect genomic data with phenotypic data.

The final set of lectures presents the idea of crowdsourcing. MOOCs provide the opportunity to work together on projects that are difficult to complete alone (microtasks) or compete for implementing the best algorithms to solve hard problems (megatasks). You will have the opportunity to participate in various crowdsourcing projects: microtasks and megatasks. These projects are designed specifically for this course.

The final exam consists of multiple choice questions from topics covered in all of modules of the course. Some of the questions may require you to perform some of the analysis methods you learned throughout the course on new datasets.

Network Analysis in Systems Biology

Instructor: Avi Ma’ayan, PhD

Skills You'll Gain

Shareable Certificate

Outcomes

There are 10 modules in this course

Course Overview and Introductions 2 hours to complete

Topological and Network Evolution Models 2 hours to complete

Types of Biological Networks 2 hours to complete

Data Processing and Identifying Differentially Expressed Genes 1 hour to complete

Gene Set Enrichment and Network Analyses 6 hours to complete

Deep Sequencing Data Processing and Analysis 5 hours to complete

Principal Component Analysis, Self-Organizing Maps, Network-Based Clustering and Hierarchical Clustering 4 hours to complete

Resources for Data Integration 1 hour to complete

Crowdsourcing: Microtasks and Megatasks 49 minutes to complete

Final Exam 30 minutes to complete