Awesome collection of HiC-seq analysis software
The eukaryotic genome has a hierarchical three-dimensional conformation, in which physical contacts between chromatin regions have been shown to regulate gene expression. High-throughput chromosome conformation capture sequencing (HiC-seq) has been one of the most important methods for investigating the 3D genome since its first publication in 2009. Since then, numerous software tools and packages have been developed to facilitate the analysis of HiC-seq data.
Here is a curated awesome list of HiC-seq software, packages and introductory papers that are recommended for HiC data analysis and research.
Your contributions are always welcome!
- Pipelines
- Toolkits
- Mapping
- Normalisation
- Visualisation
- Significant contacts identification
- Topologically-Associated Domains identification
- Other awesome HiC software
- HiC-seq introduction papers
Pipelines to process HiC-seq data from raw/trimmed data input to interactions/matrices output
- HiC-Pro - An easily configured pipeline that processes raw or trimmed data all the way to HiC interaction matrices. It is one of the most popular pipelines, has user-friendly documentation and works well on HPC systems.
- Juicer - Comes with a comprehensive wiki and provides several other tools for downstream analysis, such as the JuiceBox browser for visualisation. It is very fast and works well on HPC systems, but it is not designed for data with relatively low resolution.
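To make the "interaction matrix" output of these pipelines concrete, here is a minimal sketch of how mapped read pairs can be binned at a fixed resolution. It is an illustration only, not how HiC-Pro or Juicer are implemented; the function names `bin_contacts` and `to_dense` and the single-chromosome input format are assumptions made for this example.

```python
from collections import Counter


def bin_contacts(read_pairs, resolution):
    """Count read pairs per pair of genomic bins on a single chromosome.

    read_pairs: iterable of (position_a, position_b) in base pairs (hypothetical format).
    resolution: bin size in base pairs.
    """
    counts = Counter()
    for pos_a, pos_b in read_pairs:
        bin_a, bin_b = pos_a // resolution, pos_b // resolution
        # Store the smaller bin first so (a, b) and (b, a) are counted together.
        counts[(min(bin_a, bin_b), max(bin_a, bin_b))] += 1
    return counts


def to_dense(counts, n_bins):
    """Expand sparse bin-pair counts into a symmetric n_bins x n_bins matrix."""
    matrix = [[0] * n_bins for _ in range(n_bins)]
    for (i, j), count in counts.items():
        matrix[i][j] = count
        matrix[j][i] = count
    return matrix


pairs = [(100, 25_000), (24_000, 900), (15_000, 16_000), (5_000, 31_000)]
sparse = bin_contacts(pairs, resolution=10_000)
dense = to_dense(sparse, n_bins=4)
```

A smaller `resolution` value gives a finer matrix but needs more reads per bin pair, which is why sequencing depth limits the usable resolution.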
Toolkits containing multiple tools to analyse HiC-seq data
- diffHiC - Has a well-documented user's guide and multiple functions for filtering, normalisation and identifying differential HiC interactions across multiple datasets. It is an R package, so it works well with many other R packages.
- Homer - Recently added a new HiC workflow that covers alignment, quality control, filtering, interaction matrix generation and normalisation, and identification of TADs and loops. It can also generate configuration files for Circos plots.
- TADbit - A Python library with multiple functions for analysing HiC data, including its own TAD calling algorithm, and a well-documented tutorial.
- HiTC - Developed by the author of HiC-Pro.
Software/packages for alignment and filtering of HiC-seq data
- HiCUP - Provides well-documented tutorials, including YouTube videos, and generates an interactive HTML report as output.
- Other mapping strategies can be found under Pipelines and Toolkits.
Software/packages for matrix normalisation of HiC-seq data
- Explicit fashion
  - HiCNorm - Included in the HiTC R package.
- Implicit fashion
  - Iterative Correction and Eigenvector decomposition (ICE) - Included in the HiC-Pro pipeline.
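As a rough illustration of the implicit approach, the iterative correction step of ICE can be sketched as matrix balancing: each bin's row sum is repeatedly divided out until all non-empty bins have equal coverage. This is a minimal pure-Python sketch under that assumption, not the HiC-Pro implementation; the function name `ice_balance` and its parameters are hypothetical, and it omits the eigenvector decomposition step and the filtering of sparse bins that real tools apply.

```python
def ice_balance(matrix, max_iter=200, tol=1e-6):
    """Balance a symmetric contact matrix so that non-empty rows have equal sums.

    Returns the balanced matrix and the accumulated per-bin bias factors.
    """
    n = len(matrix)
    balanced = [[float(value) for value in row] for row in matrix]
    bias = [1.0] * n
    for _ in range(max_iter):
        row_sums = [sum(row) for row in balanced]
        nonzero = [s for s in row_sums if s > 0]
        if not nonzero:
            break
        mean = sum(nonzero) / len(nonzero)
        # Per-bin correction factor relative to the mean coverage;
        # empty bins get a factor of 1 and stay unchanged.
        scale = [s / mean if s > 0 else 1.0 for s in row_sums]
        for i in range(n):
            for j in range(n):
                balanced[i][j] /= scale[i] * scale[j]
        bias = [b * s for b, s in zip(bias, scale)]
        # Stop once every correction factor is close to 1.
        if max(abs(s - 1.0) for s in scale) < tol:
            break
    return balanced, bias


# Toy matrix: uniform contacts distorted by per-bin biases of 1, 2 and 4.
raw = [[1, 2, 4], [2, 4, 8], [4, 8, 16]]
balanced, bias = ice_balance(raw)
```

In this toy case the recovered bias factors are proportional to 1, 2 and 4 and the balanced matrix is uniform. Explicit methods such as HiCNorm instead model known bias sources directly rather than inferring one combined factor per bin.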
Software/packages for visualisation of HiC-seq data
- JuiceBox - Part of the Juicer toolkit. It supports multiple types of custom data tracks and can plot loops on heatmaps.
- HiCPlotter - Compatible with the HiC-Pro pipeline and easy to use.
- Sushi - An R package; one of its functions takes a common HiC matrix as input and plots a heatmap.
- HiGlass - A web-based browser that can also be run locally within a Docker container.
Algorithms to identify significant contacts
- FitHiC - Provides a two-step spline-fitting procedure and binomial model to identify significant interactions.
- CHiCAGO - Works only with capture HiC data. It uses a negative binomial model for read counts and a Poisson model for sequencing errors and artefacts.
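The binomial idea behind significance calling can be sketched as follows: estimate the probability of a contact at a given genomic distance, then ask how unlikely the observed count is under a binomial model. This is a simplified illustration, not FitHiC's code. It replaces the spline fit with a plain per-distance average, applies no multiple-testing correction, and only suits small toy counts; the names `binomial_sf` and `contact_pvalues` and the input format are assumptions for this example.

```python
from math import comb


def binomial_sf(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p) by direct summation."""
    if k <= 0:
        return 1.0
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def contact_pvalues(contacts):
    """Assign a binomial p-value to each bin pair.

    contacts: dict mapping (bin_i, bin_j), with bin_i < bin_j, to an observed count.
    """
    total = sum(contacts.values())
    # Group counts by genomic distance, measured in bins.
    by_distance = {}
    for (i, j), count in contacts.items():
        by_distance.setdefault(j - i, []).append(count)
    pvalues = {}
    for (i, j), count in contacts.items():
        counts = by_distance[j - i]
        # Assumed null model: expected share of all contacts for one pair at this distance.
        p = (sum(counts) / len(counts)) / total
        pvalues[(i, j)] = binomial_sf(count, total, p)
    return pvalues


toy = {(0, 1): 10, (1, 2): 10, (2, 3): 40, (0, 2): 5, (1, 3): 5}
pvals = contact_pvalues(toy)
```

Here the pair `(2, 3)` has far more contacts than other pairs at the same distance, so it receives a very small p-value, while `(0, 1)` does not.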
Algorithms to identify Topologically-Associated Domains (TADs)
- rGMAP - An R package with TAD calling and plotting functions for visualisation; it can also identify sub-TADs.
- HiTAD - A Python library with comprehensive documentation for all its functions.
- TopDom - An easy to use R package.
Other software/packages
- HiC-QC - QC for preliminary HiC libraries.
- Boost-HiC - HiC pattern detection from low-resolution HiC data.
- HiCPlus - Resolution enhancement of HiC interaction heatmaps.
Introductory papers/tutorials on HiC-seq
- Comprehensive mapping of long-range interactions reveals folding principles of the human genome - The first paper describing HiC-seq.
- Technical review: a Hitchhiker's guide to chromosome conformation capture. In Plant Chromatin Dynamics - Awesome review introducing all chromosome conformation capture assays.
- Genome-wide mapping and analysis of chromosome architecture - Awesome review of HiC-seq normalisation methods and some of the TAD calling approaches.
- A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping - The highest-resolution HiC-seq data available, with an awesome video explaining the 3D genome.
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.