>
HISAT2 Tutorial: RNA-Seq Read Mapping & Splice-Aware Alignment

HISAT2 Tutorial: RNA-Seq Read Mapping & Splice-Aware Alignment

Last updated: March 13, 2026

📖 RNA-Seq Data Analysis Workflow — check it out for an overview.

Introduction

Quantifying gene expression from RNA-Seq sequencing data typically requires a mapping step. Mapping is the process of aligning read sequences (FASTQ files) to their corresponding positions on a reference sequence. Popular mapping tools for RNA-Seq include HISAT2, STAR, and Bowtie2. This page walks through how to use HISAT2.

For an overview of the entire RNA-Seq data analysis workflow, see the RNA-Seq analysis workflow guide.

Installation

Precompiled binaries are available from the official download page. Download the version that matches your operating system.

HISAT2 download

Extract the archive and make sure the binary is on your PATH (the example below is for macOS).

$ unzip hisat2-2.2.1-OSX_x86_64.zip

Verify the installation by displaying the help message:

$ hisat2 -h

If you see output similar to the following, HISAT2 has been installed successfully.

HISAT2 version 2.2.1 by Daehwan Kim (infphilo@gmail.com, www.ccb.jhu.edu/people/infphilo) Usage: hisat2 [options]* -x <ht2-idx> {-1 <m1> -2 <m2> | -U <r> | --sra-acc <SRA accession number>} [-S <sam>] <ht2-idx> Index filename prefix (minus trailing .X.ht2). <m1> Files with #1 mates, paired with files in <m2>. Could be gzip'ed (extension: .gz) or bzip2'ed (extension: .bz2). <m2> Files with #2 mates, paired with files in <m1>. Could be gzip'ed (extension: .gz) or bzip2'ed (extension: .bz2). <r> Files with unpaired reads. Could be gzip'ed (extension: .gz) or bzip2'ed (extension: .bz2). <SRA accession number> Comma-separated list of SRA accession numbers, e.g. --sra-acc SRR353653,SRR353654. <sam> File for SAM output (default: stdout) <m1>, <m2>, <r> can be comma-separated lists (no whitespace) and can be specified many times. E.g. '-U file1.fq,file2.fq -U file3.fq'. ...

Index construction (build)

Before mapping, you need to build an index from the reference sequence with the following command:

$ hisat2-build genome.fa genome

Here, genome.fa is the FASTA file of the reference genome you want to align reads against.

This produces eight files named genome.1.ht2 through genome.8.ht2. Index files enable fast sequence lookup and must be generated ahead of time for virtually all mapping tools, not just HISAT2.

Mapping

Now you can align the reads to the reference genome:

$ hisat2 -x genome -1 reads1.fastq.gz -2 reads2.fastq.gz -S output.sam

This produces a SAM file containing the alignment results. In practice, it is convenient to convert the SAM file to BAM format and sort it for downstream analysis:

$ samtools view -bS output.sam > output.bam $ samtools sort output.bam > output.sorted.bam

You can inspect the results in a genome browser such as IGV. The aligned reads will appear as shown below.

Mapping result

RNA-Seq Data Analysis Software

This is an RNA-Seq Data Analysis Software recommended for those who:

✔︎ Seeking to avoid outsourcing or collaboration for RNA-Seq data analysis.

✔︎ Lacking time to learn RNA-Seq data analysis.

✔︎ Frustrated by the complexity of existing tools.

overview

Users can perform gene expression quantification, identification of differentially expressed genes, gene ontology(GO) analysis, pathway analysis, as well as drawing volcano plots, MA plots, and heatmaps.

BxINFO LLC logo

BxINFO LLC

A research support company specializing in bioinformatics.

We provide tools and information to support life science research, with a focus on RNA-Seq analysis.

→ Learn more