fasta.gz(compressed version) is a very generic text format that is used to store sequence data. A fasta file has at least one record, each record consists of a minimum of two lines.
- ID, starts with
- Sequence, typically wrapped to multiple lines at a fixed maximum line witdh.
>gi|30212|emb|X56692.1| H.sapiens mRNA for C-reactive protein GGACTTCTAGCCCCTGAACTTTCAGCCGAATACATCTTTTCCAAAGGAGTGAATTCAGGCCCTTGT CTGGCAGCAGGACGTGACCReferences: