Annotation files

= Transcript association

* (Human phase1+2 set) hg38_liftover+new_CAGE_peaks_phase1and2_trans.txt
* (Mouse phase1+2 set) mm10_liftover+new_CAGE_peaks_phase1and2_trans.txt


(Format)
1) CAGE peak ID
   "(Old FANTOM5 CAGE Peak ID);(new TSS ID)"
2) Associated transcript ID
3) Distance between CAGE peak and associated transcript
4) Number of transcripts within 500bp (polII transcripts) or 50bp (non-polII transcripts)
5) All transcripts within the above cutoffs (500bp or 50bp)
    (Transcript ID:distance)


= Gene/protein association

* (Human phase1+2 set) hg38_liftover+new_CAGE_peaks_phase1and2_annot.txt
* (Mouse phase1+2 set) mm10_liftover+new_CAGE_peaks_phase1and2_annot.txt


(Format)
1) CAGE peak ID
   "(Old FANTOM5 CAGE Peak ID);(new TSS ID)"
2) Associated transcript ID
3) Distance between CAGE peak and associated transcript
4) EntrezGene GeneID
5) HGNC/MGI ID
6) UniProt ID
7) Associated gene name
8) Associated gene symbol
9) Associated gene synonym
10) Source of associated gene

= CAGE peak name

* (Human phase1+2 set) human_phase1and2_CAGE_Peak_name.txt
* (Mouse phase1+2 set) mouse_phase1and2_CAGE_Peak_name.txt

(Format)
1) CAGE peak ID
   "(Old FANTOM5 CAGE Peak ID);(new TSS ID)"
2) New CAGE peak name
3) Old (Obsoleted) CAGE peak name


= Annotation sources

Apr. 2nd, 2021 snapshots of the following databases

* EntrezGene
* Gencode (Human v37, Mouse vM25)
* HGNC
* MGI
* UCSC Genome Browser (hg38/mm10)
* UniProt