(compressed version) is used to store NGS data.
A fastq file has at least one record, each record consists of four lines.
- ID, starts with
- End of sequence, starts with
- Sequencing quality information. One ASCII encoded quality score per base.
A record’s sequence is called read
Quality scores can be represented using three different encodings which use a different range of ASCII characters:
|Name||ASCII character range
|Sanger, Illumina >= v1.8||33-126
|Solexa, Illumina < v1.3||59-126
|Illumina v1.3 - v1.7||64-126
- Bioinformatics Data Skills
- Galaxy Wiki