FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

FusionGeneSummary

leaf

FusionProtFeature

leaf

FusionGeneSequence

leaf

FusionGenePPI

leaf

RelatedDrugs

leaf

RelatedDiseases

Fusion gene ID: 37113

FusionGeneSummary for TAF4_GATA5

check button Fusion gene summary
Fusion gene informationFusion gene name: TAF4_GATA5
Fusion gene ID: 37113
HgeneTgene
Gene symbol

TAF4

GATA5

Gene ID

6874

140628

Gene nameTATA-box binding protein associated factor 4GATA binding protein 5
SynonymsTAF2C|TAF2C1|TAF4A|TAFII130|TAFII135CHTD5|GATAS|bB379O24.1
Cytomap

20q13.33

20q13.33

Type of geneprotein-codingprotein-coding
Descriptiontranscription initiation factor TFIID subunit 4RNA polymerase II TBP-associated factor subunit CTAF(II)130TAF(II)135TAF4 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 135kDaTAF4A RNA polymerase II, TATA box binding protein (TBPtranscription factor GATA-5GATA binding factor-5
Modification date2018052320180523
UniProtAcc

O00268

Q9BWX5

Ensembl transtripts involved in fusion geneENST00000252996, ENST00000609045, 
ENST00000252997, 
Fusion gene scores* DoF score10 X 5 X 5=2502 X 2 X 1=4
# samples 132
** MAII scorelog2(13/250*10)=-0.943416471633632
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(2/4*10)=2.32192809488736
Context

PubMed: TAF4 [Title/Abstract] AND GATA5 [Title/Abstract] AND fusion [Title/Abstract]

Functional or gene categories assigned by FusionGDB annotationTumor suppressor gene involved fusion gene, in-frame but not retained their domain.
Tumor suppressor gene involved fusion gene, retained protein feature but frameshift.
DDR (DNA damage repair) gene involved fusion gene, in-frame but not retained their domain.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneTAF4

GO:0006352

DNA-templated transcription, initiation

9603525

HgeneTAF4

GO:0006367

transcription initiation from RNA polymerase II promoter

9603525

HgeneTAF4

GO:0045893

positive regulation of transcription, DNA-templated

12771217

TgeneGATA5

GO:0045944

positive regulation of transcription by RNA polymerase II

14986113

TgeneGATA5

GO:0060575

intestinal epithelial cell differentiation

9566909


check button Fusion gene information from three resources
(ChiTars (NAR, 2018), tumorfusions (NAR, 2018), Gao et al. (Cell, 2018))
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Data typeSourceCancer typeSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
TCGARVBRCATCGA-A7-A13D-01ATAF4chr20

60639507

-GATA5chr20

61040977

-
* LD: Li Ding group's fusion gene list
  RV: Roel Verhaak group's fusion gene list
  ChiTaRs fusion database

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
In-frameENST00000252996ENST00000252997TAF4chr20

60639507

-GATA5chr20

61040977

-
intron-3CDSENST00000609045ENST00000252997TAF4chr20

60639507

-GATA5chr20

61040977

-

Top

FusionProtFeatures for TAF4_GATA5


check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
TAF4

O00268

GATA5

Q9BWX5

Part of the TFIID complex, a multimeric protein complexthat plays a central role in mediating promoter responses tovarious activators and repressors. Potentiates transcriptionalactivation by the AF-2S of the retinoic acid, vitamin D3 andthyroid hormone. Transcription factor required during cardiovasculardevelopment (PubMed:23289003). Plays an important role in thetranscriptional program(s) that underlies smooth muscle celldiversity (By similarity). Binds to the functionally importantCEF-1 nuclear protein binding site in the cardiac-specificslow/cardiac troponin C transcriptional enhancer(PubMed:25543888). {ECO:0000250|UniProtKB:P97489,ECO:0000269|PubMed:23289003, ECO:0000269|PubMed:25543888}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page

.

* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-115142_1484531086Compositional biasNote=Poly-Ala
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-115270_2774531086Compositional biasNote=Poly-Pro
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-115333_3394531086Compositional biasNote=Poly-Ala
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-11539_424531086Compositional biasNote=Poly-His
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-11552_574531086Compositional biasNote=Poly-Ala
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-11598_1014531086Compositional biasNote=Poly-Gly

- In-frame and not-retained protein feature among the 13 regional features.
>>>>
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-115682_6854531086Compositional biasNote=Poly-Pro
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-115810_8154531086Compositional biasNote=Poly-Ala
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-115830_8334531086Compositional biasNote=Poly-Asp
HgeneTAF4chr20:60639507chr20:61040977ENST00000252996-115590_6874531086DomainTAFH
TgeneGATA5chr20:60639507chr20:61040977ENST00000252997-37189_213275398Zinc fingerGATA-type 1
TgeneGATA5chr20:60639507chr20:61040977ENST00000252997-37243_267275398Zinc fingerGATA-type 2


Top

FusionGeneSequence for TAF4_GATA5


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences.
(nt: nucleotides, aa: amino acids)

* Fusion amino acid sequences.
>In-frame_TAF4_ENST00000252996_chr20_60639507_-_GATA5_ENST00000252997_chr20_61040977_-_576aa
MAAGSDLLDEVFFNSEVDEKVVSDLVGSLESQLAASAAHHHHLAPRTPEVRAAAAGALGNHVVSGSPAGAAGAGPAAPAEGAPGAAPEPP
PAGRARPGGGGPQRPGPPSPRRPLVPAGPAPPAAKLRPPPEGSAGSCAPVPAAAAVAAGPEPAPAGPAKPAGPAALAARAGPGPGPGPGP
GPGPGPGKPAGPGAAQTLNGSAALLNSHHAAAPAVSLVNNGPAALLPLPKPAAPGTVIQTPPFVGAAAPPAPAAPSPPAAPAPAAPAAAP
PPPPPAPATLARPPGHPAGPPTAAPAVPPPAAAQNGGSAGAAPAPAPAAGGPAGVSGQPGPGAAAAAPAPGVKAESPKRVVQAAPPAAQT
LAASGPASTAASMVIGPTMQGALPSPAAVPPPAPGTPTGLPKGAAGAVTQSLSRTPTATTSGIRATLTPTVLAPRLPQPPQNPTNIQNFQ
LPPGAAASGYEEGKHPDTEAEAKDHRQGQGLLRIHKECLGLPICCRQHXQLSSHFESQAQPGVPSVPWAQHGPPGLWPGGXLSCPRPLGV


* Fusion transcript sequences (only coding sequence (CDS) region).
>In-frame_TAF4_ENST00000252996_chr20_60639507_-_GATA5_ENST00000252997_chr20_61040977_-_1729nt
ATGGCGGCGGGCTCGGATCTGCTGGACGAGGTCTTCTTCAACAGCGAGGTGGACGAGAAAGTGGTGAGCGACCTGGTGGGCTCGCTGGAG
TCGCAGCTGGCGGCCAGCGCGGCCCACCACCACCACCTCGCGCCGCGCACGCCCGAGGTGCGGGCCGCGGCCGCCGGCGCGCTCGGGAAC
CATGTTGTGAGCGGCAGCCCGGCCGGAGCCGCGGGCGCAGGGCCGGCCGCCCCCGCCGAGGGCGCGCCCGGAGCGGCGCCGGAGCCGCCC
CCCGCAGGTAGAGCGCGGCCGGGGGGCGGGGGGCCGCAGCGCCCGGGCCCCCCCTCACCGCGCCGCCCCCTTGTCCCCGCAGGGCCCGCG
CCGCCCGCCGCGAAGCTGAGGCCGCCGCCCGAGGGCAGCGCGGGGTCCTGCGCCCCGGTGCCCGCCGCCGCCGCCGTCGCCGCGGGGCCC
GAGCCCGCCCCCGCCGGCCCCGCCAAGCCCGCCGGCCCCGCCGCGCTGGCCGCCCGCGCCGGCCCCGGCCCCGGGCCCGGCCCCGGCCCC
GGCCCCGGCCCTGGCCCTGGCAAGCCCGCCGGCCCCGGCGCCGCGCAAACTTTGAATGGGAGCGCCGCGCTGCTGAACTCGCACCACGCC
GCCGCACCTGCTGTCAGCCTGGTCAACAACGGGCCCGCCGCGCTGCTGCCGCTGCCCAAGCCCGCCGCCCCCGGCACTGTCATCCAGACG
CCCCCCTTCGTGGGCGCCGCCGCGCCCCCCGCGCCCGCCGCGCCCTCGCCCCCCGCCGCCCCCGCGCCCGCCGCCCCCGCCGCCGCCCCG
CCCCCGCCACCCCCCGCGCCCGCCACTCTGGCCCGGCCGCCCGGCCACCCCGCCGGACCCCCGACCGCCGCGCCCGCCGTGCCGCCCCCC
GCCGCCGCCCAGAACGGGGGCAGCGCCGGGGCAGCCCCCGCCCCCGCCCCGGCCGCCGGGGGCCCCGCGGGGGTCAGCGGCCAACCCGGG
CCCGGCGCGGCGGCTGCGGCGCCGGCGCCGGGGGTCAAGGCCGAGTCGCCCAAGAGGGTGGTGCAGGCGGCGCCCCCGGCGGCGCAGACC
CTGGCGGCCAGCGGCCCGGCCAGCACGGCGGCCAGCATGGTCATCGGGCCAACTATGCAAGGGGCGCTGCCCAGCCCGGCCGCCGTCCCG
CCGCCCGCCCCCGGGACCCCCACCGGGCTGCCCAAAGGCGCGGCCGGCGCAGTGACCCAGAGCCTGTCCCGGACGCCCACGGCCACCACC
AGCGGGATTCGGGCCACCCTGACGCCCACCGTGCTGGCCCCCCGCTTGCCGCAGCCGCCTCAGAACCCGACCAACATCCAGAACTTCCAG
CTGCCCCCAGGTGCCGCGGCCTCTGGCTATGAAGAAGGAAAGCATCCAGACACGGAAGCGGAAGCCAAAGACCATCGCCAAGGCCAGGGG
CTCCTCAGGATCCACAAGGAATGCCTCGGCCTCCCCATCTGCTGTCGCCAGCACTGACAGCTCAGCAGCCACTTCGAAAGCCAAGCCCAG
CCTGGCGTCCCCAGTGTGCCCTGGGCCCAGCATGGCCCCCCAGGCCTCTGGCCAGGAGGATGACTCTCTTGCCCCCGGCCACTTGGAGTT
CAAGTTCGAGCCTGAGGACTTTGCCTTCCCCTCCACGGCCCCAAGCCCCCAGGCTGGCCTCAGGGGGGCTCTGCGCCAAGAGGCCTGGTG


* Fusion transcript sequences (Full-length transcript).
>In-frame_TAF4_ENST00000252996_chr20_60639507_-_GATA5_ENST00000252997_chr20_61040977_-_3068nt
ATGGCGGCGGGCTCGGATCTGCTGGACGAGGTCTTCTTCAACAGCGAGGTGGACGAGAAAGTGGTGAGCGACCTGGTGGGCTCGCTGGAG
TCGCAGCTGGCGGCCAGCGCGGCCCACCACCACCACCTCGCGCCGCGCACGCCCGAGGTGCGGGCCGCGGCCGCCGGCGCGCTCGGGAAC
CATGTTGTGAGCGGCAGCCCGGCCGGAGCCGCGGGCGCAGGGCCGGCCGCCCCCGCCGAGGGCGCGCCCGGAGCGGCGCCGGAGCCGCCC
CCCGCAGGTAGAGCGCGGCCGGGGGGCGGGGGGCCGCAGCGCCCGGGCCCCCCCTCACCGCGCCGCCCCCTTGTCCCCGCAGGGCCCGCG
CCGCCCGCCGCGAAGCTGAGGCCGCCGCCCGAGGGCAGCGCGGGGTCCTGCGCCCCGGTGCCCGCCGCCGCCGCCGTCGCCGCGGGGCCC
GAGCCCGCCCCCGCCGGCCCCGCCAAGCCCGCCGGCCCCGCCGCGCTGGCCGCCCGCGCCGGCCCCGGCCCCGGGCCCGGCCCCGGCCCC
GGCCCCGGCCCTGGCCCTGGCAAGCCCGCCGGCCCCGGCGCCGCGCAAACTTTGAATGGGAGCGCCGCGCTGCTGAACTCGCACCACGCC
GCCGCACCTGCTGTCAGCCTGGTCAACAACGGGCCCGCCGCGCTGCTGCCGCTGCCCAAGCCCGCCGCCCCCGGCACTGTCATCCAGACG
CCCCCCTTCGTGGGCGCCGCCGCGCCCCCCGCGCCCGCCGCGCCCTCGCCCCCCGCCGCCCCCGCGCCCGCCGCCCCCGCCGCCGCCCCG
CCCCCGCCACCCCCCGCGCCCGCCACTCTGGCCCGGCCGCCCGGCCACCCCGCCGGACCCCCGACCGCCGCGCCCGCCGTGCCGCCCCCC
GCCGCCGCCCAGAACGGGGGCAGCGCCGGGGCAGCCCCCGCCCCCGCCCCGGCCGCCGGGGGCCCCGCGGGGGTCAGCGGCCAACCCGGG
CCCGGCGCGGCGGCTGCGGCGCCGGCGCCGGGGGTCAAGGCCGAGTCGCCCAAGAGGGTGGTGCAGGCGGCGCCCCCGGCGGCGCAGACC
CTGGCGGCCAGCGGCCCGGCCAGCACGGCGGCCAGCATGGTCATCGGGCCAACTATGCAAGGGGCGCTGCCCAGCCCGGCCGCCGTCCCG
CCGCCCGCCCCCGGGACCCCCACCGGGCTGCCCAAAGGCGCGGCCGGCGCAGTGACCCAGAGCCTGTCCCGGACGCCCACGGCCACCACC
AGCGGGATTCGGGCCACCCTGACGCCCACCGTGCTGGCCCCCCGCTTGCCGCAGCCGCCTCAGAACCCGACCAACATCCAGAACTTCCAG
CTGCCCCCAGGTGCCGCGGCCTCTGGCTATGAAGAAGGAAAGCATCCAGACACGGAAGCGGAAGCCAAAGACCATCGCCAAGGCCAGGGG
CTCCTCAGGATCCACAAGGAATGCCTCGGCCTCCCCATCTGCTGTCGCCAGCACTGACAGCTCAGCAGCCACTTCGAAAGCCAAGCCCAG
CCTGGCGTCCCCAGTGTGCCCTGGGCCCAGCATGGCCCCCCAGGCCTCTGGCCAGGAGGATGACTCTCTTGCCCCCGGCCACTTGGAGTT
CAAGTTCGAGCCTGAGGACTTTGCCTTCCCCTCCACGGCCCCAAGCCCCCAGGCTGGCCTCAGGGGGGCTCTGCGCCAAGAGGCCTGGTG
TGCGCTGGCCTTGGCCTAGGTCCCCAGGCCAGCCCATGTCAGGGGAACAGCCTGGAACAGACCACCCACTGAGTCACCTCCGTGCCTGCT
TTGCTCCAGCACAGCAGAGACCAGCAGGCCCCCCAACCCAGAGACTGGGTCTGCTGGAGTCTCCACACAGTGGTGGGGAGGCCTTCTGGA
CAGACGGCAGTCGGGCCCCAGAGCAAGAAGGCTGGTGAGGGAAGGGCTCAGCTTCCCACCCCACGTACAGCAAGGGACTCCCCAGGTGCG
GCCCAAGGCTCCGGACCACACTGGCCCCCTGCGGCGGAGGCCAACGCAGGGCACCACCACCACCAACTTGAATTCCGTCATCAATGCTCA
CCGTCAATATGTTTACAAGTTGTAGCAGTTGGGGGAAAACAGTCAACCTCCCAGTGTAAAACCAAGATTCCCAGTGAAGCACCTGAGGCC
AAGCAGGGGAGAGGAATGAGGGGAGCAGCTGGACATGGGCCTCCTGAGGCCTCGGGGCTGTCCTTCATTGCCCACATGGATAGACGGAGC
TGTGGTGCAGAGAACTTTTCCCGCAACAGGTGCAGGACTGCCAGGGATCGGAGTGCGGGCCGCGCACGGTGCCAGGATTCCGCCGAGGGG
AAGCCGCTCACATTGCAGTCATCACAGACTTACGCACTTGTTTGGACAGTTTTTCCAGAGGGGATGGGAAAGGGCCTTGTTCTAGCTGAA
TCTGTGTATCATGACCATTTCTGACAGGCAGAATGAATTGTCTGGTAGCCCTGTCCTGACCCATCCAAGCGCTGTTGGGGCTGGTGGTGA
CGTGGTCACATGTCCTGGCATATCTGGGGCCACGCAGTTTAGTCTCTTGTCCCAGGAGAATTGTTAGTGACCCCTCTTTCTCTTGCAAGC
CCCCTCCACACTGGGTTGGATGATACCTTAATGAGTGACGCTGGCGAGAGGCACCCTACCCGACGCAGCTGTGAATGGCCGGTGATGTAT
GTCAGGAGGCCACAGGGAGCAGAGGAGCGGGGCAGGCAGCCACAGGGCCCTGCGGGGAGCACATCCTCGCCTCCGTCCGGCTGCTGCCCT
TCAACAACAAGCCCTGATTTTTCCAGCAATGCCAGAAACCTGGATTTTAAGTCTTCCAATTTGATTCAAAAATATTTTTAACATTGTGAG
CCAGCTAGACCCCCAGTGCACCACCCCATATTGAAAAACAGTTGTCTGGCATCAGCTTCAGGAGCGGGTCCGGTCATTCTGAAACTGTCC
CTCCAGAGGTTCTTCCAGCCCCACTTCTATGCGATGTCATCTTTTCTAAAAGAGACAAATGAAGCCACAGGGAAAGTGAAATAAAGCCTT


Top

FusionGenePPI for TAF4_GATA5


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page

.

check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors
TAF4TBP, CBX5, TAF1, GTF2A2, TAF4, TAF12, TAF8, TAF5, KAT2A, SUPT3H, TAF10, CREM, SP1, TAF9, TAF13, TBPL1, GTF2A1, ATF7, CREB1, TCF12, ELAVL1, TAF6, TAF7, TRRAP, TAF3, TAF9B, TAF1L, AHR, HTT, JUN, GTF2F2, HIST3H3, MED26, FBXO6, SOX6, EGFR, NUP50, ANAPC1, ANAPC7, CDC16, NAP1L4, ANAPC13, CDC23, CDC27, NCL, SF3B3, EWSR1, HIST1H3E, KIFAP3, TAF7L, TAF11GATA5KLF2


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

RelatedDrugs for TAF4_GATA5


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.0 2018-04-02)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

RelatedDiseases for TAF4_GATA5


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
TgeneGATA5C0009404Colorectal Neoplasms1CTD_human