Tutorial: Exploring All Enrichr Gene Set Libraries

A complete guide to understanding the structure, scope, and use-cases of all predefined gene set libraries in Enrichr.

1. Introduction

Enrichr provides a comprehensive collection of over 240 predefined gene set libraries compiled from diverse biological resources, ranging from gene ontologies and curated pathways to experimental perturbation datasets and disease associations. Each library is a collection of gene sets representing biological knowledge: pathways, transcription factor targets, drug perturbation responses, disease signatures, and more.

These libraries allow you to perform enrichment analysis by testing whether your input gene list significantly overlaps with known biological categories. Understanding what each library represents, and when to use it, is key to interpreting results correctly.

2. Gene Ontology (GO) Libraries

Gene Ontology (GO) libraries categorize genes based on biological process, molecular function, and cellular component annotations. They are ideal for broad biological interpretation.

When to use: For broad, unbiased exploration of gene function, especially in discovery-driven projects.

3. Pathway Libraries

These libraries contain curated gene sets representing canonical biological pathways from multiple databases.

When to use: For mechanistic interpretation and pathway-level insights in gene expression or differential analysis studies.

4. Transcription Factor Targets & Regulatory Networks

These libraries collect genes regulated by transcription factors based on ChIP-seq, motif prediction, or co-expression.

When to use: To identify upstream regulators of your gene list or infer transcriptional programs driving your data.

5. Perturbation Libraries

These collections capture transcriptional responses to various perturbations, such as drugs, ligands, gene knockouts, and environmental conditions.

When to use: To identify compounds, genetic perturbations, or stimuli that mimic or reverse your gene signature.

6. Disease and Phenotype Libraries

These libraries group genes by disease association, phenotype, or clinical evidence.

When to use: For linking gene lists to disease contexts, interpreting clinical relevance, or exploring phenotype-driven biology.

7. GWAS and Genetic Association Libraries

When to use: To connect gene lists to genetic evidence and identify traits or diseases with shared molecular signatures.

8. Cell Type and Tissue Expression Libraries

When to use: For identifying tissue or cell-type specificity of your gene list.

9. Protein-Protein Interactions, Complexes & PTMs

When to use: For studying network topology or contextualizing your gene set within interaction maps.

10. Specialized and Emerging Libraries

When to use: For niche analyses focused on specific regulatory, structural, or post-translational features.

11. Choosing the right library

With hundreds of libraries available, it's crucial to select those that match your biological question. For example:

Often, the most powerful analyses combine results from multiple complementary libraries, for example, using GO for functional context, ChEA for regulatory insights, and DSigDB for drug associations.

← Back to Tutorials