HiSeq results

FASTQ file name

FASTQ files use the following naming scheme: 

<SampleName>_<BarcodeSequence>_L<LaneNumber>_R<ReadNumber>_001.fastq.gz

Example: NA10831_ATCACG_L002_R1_001.fastq.gz

  • SampleName: In order to avoid conflict with sample names from other customers, some user submitted sample names might have been slightly modified. However, you should still be able to recognize them.
  • BarcodeSequence: Index sequence used for this sample. 'NoIndex' will be used if the samples were not multiplexed in this lane.
  • Lane: Lane in which the sample was sequenced.
  • ReadNumber: R1 for single-read runs and R1/R2 for paired-end runs (first/second read). 

 

FASTQ format

FASTQ files are delivered in GNU zip format with .gz file extension. The quality score is encoded in the standard way (Sanger fastq). For more information on the FASTQ format refer to wikipedia.

FASTQ format uses four lines per sequence.

  • Line 1 begins with a '@' character and is followed by a sequence identifier and an optional description (header).
  • Line 2 is the raw sequence letters.
  • Line 3 begins with a '+' character and is optionally followed by the same sequence identifier (and any description) again.
  • Line 4 encodes the quality values for the sequence in Line 2, and must contain the same number of symbols as letters in the sequence.

FASTQ header (line 1) contains various information separated either by ':' or a space:

An example from a HiSeq FASTQ header:

      @EAS139:136:FC706VJ:2:5:1000:12850 1:N:18:ATCACG

  • @ - Each sequence identifier line starts with @.
  • InstrumentID - unique identifier of the sequencer (EAS139)
  • RunNumber - Run number on instrument (136).
  • Flowcell_ID - ID of flowcell (FC706VJ).
  • LaneNumber - positive integer, currently 1-8 (2)
  • TileNumber - positive integer (5)
  • X - x coordinate of the spot. Integer which can be negative (1000)
  • Y - y coordinate of the spot. Integer which can be negative (12850)
  • ReadNumber - 1 for single reads; 1 or 2 for paired ends (1)
  • whether it is filtered - NB: Y if the read is filtered out, not in the delivered fastq file, N otherwise (N)
  • ControlNumber - 0 when none of the control bits are on, otherwise it is an even number (18)
  • IndexSequence - Index sequence for this read (ATCACG)

 

Published June 20, 2012 11:07 AM - Last modified Oct. 1, 2014 11:49 AM