2015-02-12 Marina Lizio (marina.lizio@riken.jp)
Inquiries to fantom-help@gsc.riken.jp

HeliscopeCAGE sequencing, delve mapping, CTSS aggregation.

This folder contains all the snapshot and time course primary data generated by the FANTOM5 project.

Files are arranged in sub-folders whose names follow a simple scheme of .. .
Technology is either hCAGE (CAGE sequencing on the Heliscope single-molecule sequencer) or LQhCAGE (Low Quantity hCAGE). For details on the protocols used, please see [http://fantom.gsc.riken.jp/sstar/Protocols].
The biological category is one of primary_cell, cell line, timecourse, fractionation or tissue.

Within each of these sub-folders, the following types of files are provided for each sample:

  00_*.assay_sdrf.txt  is a tab-delimited flat file describing the experimental details for each sample.
  *.bam                is the indexed mapping file containing all the alignments.
  *.bam.bai            is the index file for the corresponding BAM file.
  *.ctss.bed.gz        is a CAGE TSS (CTSS) file. It is obtained by converting the BAM alignments into BED and aggregating the resulting sequences into CAGE tags. In the conversion, only sequence tags with an alignment quality score above 20 are retained.
  *.rdna.fa.gz         is a FASTA file containing all the ribosomal DNA sequences.

We have chosen the file name scheme carefully to provide as much information about the samples as we could. The structure follows a scheme where ..... is used.
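
The description of the *.ctss.bed.gz files can be illustrated with a minimal Python sketch. It is not the pipeline used to produce the files. It assumes the alignments have already been converted to BED6 lines with the mapping quality in the score column, reads "above 20" as strictly greater than 20, and takes the 5' end of each alignment as the CTSS position. The function name aggregate_ctss and the format of the output name column are hypothetical choices made for this example.

```python
from collections import Counter

# Assumption: "above 20" in the text is read as strictly greater than 20.
MIN_MAPQ = 20


def aggregate_ctss(bed_lines, min_mapq=MIN_MAPQ):
    """Count alignment 5' ends per position and strand.

    bed_lines: iterable of tab-separated BED6 lines
               (chrom, start, end, name, score, strand), where the score
               column is assumed to hold the mapping quality.
    Returns a sorted list of BED6-like tuples, one per CTSS, with the tag
    count in the score column.
    """
    counts = Counter()
    for line in bed_lines:
        line = line.rstrip("\n")
        # Skip blank lines and optional BED header lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end, _name, score, strand = line.split("\t")[:6]
        if int(score) <= min_mapq:
            continue  # drop low-quality alignments
        # BED is 0-based, half-open: the 5' end is `start` on the plus
        # strand and `end - 1` on the minus strand.
        five_prime = int(start) if strand == "+" else int(end) - 1
        counts[(chrom, five_prime, strand)] += 1

    return [
        # The name column format is illustrative only.
        (chrom, pos, pos + 1, f"{chrom}:{pos}..{pos + 1},{strand}", n, strand)
        for (chrom, pos, strand), n in sorted(counts.items())
    ]


if __name__ == "__main__":
    example = [
        "chr1\t100\t127\tread1\t30\t+",
        "chr1\t100\t130\tread2\t25\t+",
        "chr1\t90\t120\tread3\t40\t-",
        "chr1\t100\t127\tread4\t20\t+",  # quality 20 is not above 20
    ]
    for row in aggregate_ctss(example):
        print("\t".join(str(field) for field in row))
```

In this toy input, read1 and read2 share a 5' end and collapse into one CTSS with a count of 2, read3 contributes a minus-strand CTSS at its last aligned base, and read4 is discarded by the quality filter.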