Annotation files

= Transcript association

* (Human phase1+2 set) hg38_liftover_CAGE_peaks_phase1+2_trans.txt
* (Mouse phase1+2 set) mm10_liftover_CAGE_peaks_phase1+2_trans.txt


(Format)
1) CAGE peak ID
   "(Old FANTOM5 CAGE Peak ID);(new TSS ID)"
2) Associated transcript ID
3) Distance between CAGE peak and associated transcript
4) Number of transcripts within 500bp (polII transcripts) or 50bp (non-polII transcripts)
5) All transcripts within the above cutoffs (500bp or 50bp)
    (Transcript ID:distance)


= Gene/protein association

* (Human phase1+2 set) hg38_liftover_CAGE_peaks_phase1+2_annot.txt
* (Mouse phase1+2 set) mm10_liftover_CAGE_peaks_phase1+2_annot.txt


(Format)
1) CAGE peak ID
   "(Old FANTOM5 CAGE Peak ID);(new TSS ID)"
2) Associated transcript ID
3) Distance between CAGE peak and associated transcript
4) EntrezGene GeneID
5) HGNC/MGI ID
6) UniProt ID
7) Associated gene name
8) Associated gene symbol
9) Associated gene synonym
10) Source of associated gene

= CAGE peak name

* (Human phase1+2 set) human_phase1+2_new_CAGE_Peak_name.txt

(Format)
1) CAGE peak ID
   "(Old FANTOM5 CAGE Peak ID);(new TSS ID)"
2) New CAGE peak name
3) Old (Obsoleted) CAGE peak name


= Annotation sources

Jun. 16th, 2015 snapshots of the following databases

* EntrezGene
* Gencode (Human v22, Mouse vM4)
* HGNC
* MGI
* UCSC Genome Browser (hg38/mm10)
* UniProt