clusterProfiler Tutorial: GO Enrichment Analysis in R
For an overview of the whole pipeline, see RNA-Seq Data Analysis Workflow.
What is clusterProfiler?
clusterProfiler is a software package for running functional enrichment analyses on gene lists, including GO enrichment analysis and pathway analysis.
Gene lists are commonly obtained by performing differential expression analysis (DEG analysis) on data from experiments such as RNA-Seq.
This page explains how to install and use clusterProfiler.
Installation
First, install R if it is not already on your system. With Homebrew on macOS, for example, run brew install r in a terminal.
Launch R and run the following commands to install BiocManager and clusterProfiler.
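The installation step can be sketched as follows. This is the standard Bioconductor route rather than necessarily the exact commands from the original page; the requireNamespace guard is an addition that skips reinstalling BiocManager when it is already present.

```r
# Install BiocManager from CRAN only if it is missing.
if (!requireNamespace("BiocManager", quietly = TRUE)) {
  install.packages("BiocManager")
}

# Install clusterProfiler from Bioconductor.
BiocManager::install("clusterProfiler")
```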
If the following command runs without errors, the installation was successful.
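The check is presumably a plain package load, along these lines:

```r
# Loading the package fails with an error if it is not installed correctly.
library(clusterProfiler)
```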
You also need to install the appropriate annotation package. The example below is for human.
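For human, the Bioconductor annotation package is org.Hs.eg.db. A minimal sketch of the install, assuming BiocManager is already available:

```r
# Human genome-wide annotation package (maps gene IDs to GO terms).
# For other organisms, substitute the matching package, e.g. org.Mm.eg.db for mouse.
BiocManager::install("org.Hs.eg.db")
```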
If the following command runs without errors, the installation was successful.
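As with clusterProfiler, the check is presumably a package load:

```r
# Loading the annotation package fails with an error if it is not installed.
library(org.Hs.eg.db)
```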
Preparing the Data
Prepare your gene lists using a tool such as edgeR. You will need both a list of all genes (all_genes) and a list of differentially expressed genes (degs), as shown below.
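As an illustration of the shape of these two objects, the sketch below derives them from a small stand-in for an edgeR result table. The table res, its numbers, and the cutoffs (FDR < 0.05 and |logFC| > 1) are assumptions made for this example, not values from the original page. In a real analysis, res would come from something like topTags(fit, n = Inf)$table.

```r
# Toy stand-in for an edgeR result table; row names are gene IDs.
# The gene symbols are real human genes, but the numbers are made-up placeholders.
res <- data.frame(
  logFC = c(2.1, -1.8, 0.1, 0.3, -2.5),
  FDR   = c(0.001, 0.01, 0.80, 0.45, 0.0005),
  row.names = c("TP53", "BRCA1", "GAPDH", "ACTB", "MYC")
)

# Background set: every gene that was tested.
all_genes <- rownames(res)

# Differentially expressed genes under the assumed cutoffs.
degs <- rownames(res)[res$FDR < 0.05 & abs(res$logFC) > 1]

degs
```

Both objects are plain character vectors, and every entry of degs should also appear in all_genes.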
For more details on how to use edgeR, see this page.
Running GO Enrichment Analysis
The following command performs enrichment analysis on the Biological Process category of Gene Ontology.
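A minimal sketch of such a call is shown here; it is not necessarily the exact command from the original page. So that it runs on its own, it builds all_genes and degs from the geneList example dataset bundled with DOSE (a clusterProfiler dependency), which uses Entrez IDs. With your own data, skip those lines, use the vectors prepared earlier, and set keyType to match your IDs (for example "SYMBOL" or "ENSEMBL"). The cutoff values are assumptions.

```r
library(clusterProfiler)
library(org.Hs.eg.db)

# Example data only: named vector of fold changes, names are Entrez gene IDs.
data(geneList, package = "DOSE")
all_genes <- names(geneList)
degs <- names(geneList)[abs(geneList) > 2]

# Over-representation test of GO Biological Process terms among the DEGs,
# using all tested genes as the background (universe).
ego <- enrichGO(
  gene          = degs,
  universe      = all_genes,
  OrgDb         = org.Hs.eg.db,
  keyType       = "ENTREZID",  # must match the ID type of degs and all_genes
  ont           = "BP",        # "BP", "MF", "CC", or "ALL"
  pAdjustMethod = "BH",
  pvalueCutoff  = 0.05,        # assumed cutoff
  qvalueCutoff  = 0.2          # assumed cutoff
)

# Show the top enriched terms as a data frame.
head(as.data.frame(ego))
```

Passing universe explicitly matters: without it, enrichGO falls back to all annotated genes in the OrgDb as the background, which can inflate significance when only a subset of genes was actually tested.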
The output looks like this.
The enrichment analysis has produced a list of Gene Ontology terms that are significantly overrepresented among the differentially expressed genes compared to the full set of genes.
About the Author
BxINFO LLC
A research support company specializing in bioinformatics.
We provide tools and information to support life science research, with a focus on RNA-Seq analysis.