FASTA Format: File Structure & Examples

Last updated: March 13, 2026

Introduction

A FASTA file is a text file in FASTA format used to store nucleotide or amino acid sequences. Common file extensions include ".fasta", ".fa", ".fna", and ".fas".

Lines beginning with ">" serve as headers, containing the sequence name, ID, or description. Everything from the header line to the next ">" line (or end of file) constitutes a single sequence entry.
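The record structure described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference parser: the function name parse_fasta and the toy records are hypothetical, and treating the first whitespace-delimited word of the header as the sequence ID is a common convention assumed here rather than something the format itself requires.

```python
import io
from typing import Iterable, Iterator, Tuple


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an iterable of text lines.

    The header is returned without the leading ">". All lines between
    one header and the next are joined into a single sequence string.
    """
    header = None
    chunks = []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous entry, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:].strip()
            chunks = []
        elif header is not None and line:
            # Sequence line belonging to the current entry.
            chunks.append(line)
    # The end of the file closes the last entry.
    if header is not None:
        yield header, "".join(chunks)


# Toy input (hypothetical records, not real data).
example = io.StringIO(
    ">seq1 first toy record\n"
    "ACGTAC\n"
    "GTAC\n"
    ">seq2 second toy record\n"
    "TTGACA\n"
)
records = list(parse_fasta(example))

for header, sequence in records:
    # Assumption: the ID is the first word of the header line.
    parts = header.split(maxsplit=1)
    seq_id = parts[0] if parts else ""
    print(seq_id, len(sequence))
```

For real projects, an established library such as Biopython is usually a safer choice than a hand-written parser.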

Line breaks can appear anywhere within a sequence. When present, lines are typically wrapped to a fixed width of around 60-80 characters. Be aware that if you search for a sequence in a standard text editor, line breaks within the sequence may prevent matches.
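The search problem mentioned above can be illustrated with a short Python sketch. The helper names unwrap and wrap, the toy sequence, and the width of 10 are assumptions chosen to keep the example small; they are not part of the FASTA format.

```python
def unwrap(sequence_block: str) -> str:
    """Remove all line breaks and other whitespace from a wrapped sequence."""
    return "".join(sequence_block.split())


def wrap(sequence: str, width: int = 70) -> str:
    """Break a sequence into lines of at most `width` characters."""
    return "\n".join(
        sequence[i:i + width] for i in range(0, len(sequence), width)
    )


# Toy sequence wrapped at 10 characters per line (hypothetical data).
wrapped = "ACGTACGTAA\nGGCCTTAACC\nGGTT"

# This motif spans the break between the first and second lines.
motif = "AAGGCC"

found_in_wrapped = motif in wrapped      # the line break hides the match
position = unwrap(wrapped).find(motif)   # 0-based index after joining lines

print(found_in_wrapped, position)
```

Joining the lines before searching avoids the missed match, and wrap can be used to restore a fixed line width afterwards.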

A related file format, FASTQ, stores both nucleotide sequences and their quality scores.

Example of a FASTA File

>NC_003070.9 Arabidopsis thaliana chromosome 1 sequence
CCCTAAACCCTAAACCCTAAACCCTAAACCTCTGAATCCTTAATCCCTAAATCCCTAAATCTTTAAATCC
TACATCCATGAATCCCTAAATACCTAATTCCCTAAACCCGAAACCGGTTTCTCTGGTTGAAAATCATTGT
GTATATAATGATAATTTTATCGTTTTTATGTAATTGCTTATTGTTGTGTGTAGATTTTTTAAAAATATCA
...
>NC_003076.8 Arabidopsis thaliana chromosome 5 sequence
TATACCATGTACCCTCAACCTTAAAACCCTAAAACCTATACTATAAATCTTTAAAACCTATACTCTAAAC
CATAGGGTTTGTGAGTTTGCATAAAGTGTCACGTATAAGTGTTTCTAACATGTGAGTTTGCATAAGAGTC
TCGACTATGTGTTTGTTCAAAAGTGACGTAAGTGTTTAGACTAGAGCCGGCCGTGAGCACAAGCGGGCCA
...

Nucleic Acid Bases Used in FASTA Files

Character  Nucleic Acid Base
G          Guanine
C          Cytosine
A          Adenine
T          Thymine
M          Adenine or Cytosine
R          Adenine or Guanine
W          Adenine or Thymine
S          Cytosine or Guanine
Y          Cytosine or Thymine
K          Guanine or Thymine
V          Adenine or Cytosine or Guanine
H          Adenine or Cytosine or Thymine
D          Adenine or Guanine or Thymine
B          Cytosine or Guanine or Thymine
N          Adenine or Cytosine or Guanine or Thymine
-          Gap
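The character table above maps naturally onto a lookup dictionary. The following Python sketch uses it to check a nucleotide sequence for unexpected characters and to test whether an ambiguity code covers a given base. The names IUPAC_BASES, invalid_characters, and matches are hypothetical, and converting input to uppercase before checking is an assumption; only the characters listed in the table are treated as valid here.

```python
# Each character from the table, mapped to the bases it can stand for.
IUPAC_BASES = {
    "G": "G", "C": "C", "A": "A", "T": "T",
    "M": "AC", "R": "AG", "W": "AT",
    "S": "CG", "Y": "CT", "K": "GT",
    "V": "ACG", "H": "ACT", "D": "AGT", "B": "CGT",
    "N": "ACGT",
}
GAP = "-"


def invalid_characters(sequence: str) -> list:
    """Return the sorted characters in `sequence` that are not in the table."""
    allowed = set(IUPAC_BASES) | {GAP}
    # Assumption: case is ignored, so "acgt" is treated like "ACGT".
    return sorted(set(sequence.upper()) - allowed)


def matches(code: str, base: str) -> bool:
    """Return True if the table character `code` can stand for `base`."""
    return base.upper() in IUPAC_BASES.get(code.upper(), "")


print(invalid_characters("ACGTNRY-"))  # every character is in the table
print(invalid_characters("ACGU*X"))    # U, * and X are not in the table
print(matches("R", "G"), matches("Y", "A"))
```

A check like this is only appropriate for nucleotide files; amino acid sequences use a different alphabet.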
