Center for Computational Systems Medicine
Fusion gene ID: 37988

FusionGeneSummary for THOC1_THADA

Fusion gene summary
Fusion gene information
Fusion gene name: THOC1_THADA
Fusion gene ID: 37988

Field | Hgene | Tgene
Gene symbol | THOC1 | THADA
Gene ID | 9984 | 63892
Gene name | THO complex 1 | THADA, armadillo repeat containing
Synonyms | HPR1, P84, P84N5 | ARMC13, GITA
Cytomap | 18p11.32 | 2p21
Type of gene | protein-coding | protein-coding
Description | THO complex subunit 1; hTREX84; nuclear matrix protein p84; tho1 | thyroid adenoma-associated protein; death receptor-interacting protein; gene inducing thyroid adenomas protein
Modification date | 20180523 | 20180523
UniProtAcc | Q96FV9 | Q6YHU6
Ensembl transcripts involved in fusion gene | ENST00000261600, ENST00000582313 | ENST00000330266, ENST00000405975, ENST00000415080, ENST00000405006, ENST00000485353, ENST00000402360, ENST00000404790, ENST00000403856

Fusion gene scores | Hgene | Tgene
* DoF score | 4 X 5 X 3 = 60 | 9 X 9 X 7 = 567
# samples | 7 | 10
** MAII score | log2(7/60*10) = 0.222392421336448 | log2(10/567*10) = -2.5033487351675
Category | effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs): DoF>8 and MAII>0 | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0
Context

PubMed: THOC1 [Title/Abstract] AND THADA [Title/Abstract] AND fusion [Title/Abstract]

Functional or gene categories assigned by FusionGDB annotation
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
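
The two score definitions above can be checked against the values reported for THOC1 and THADA. The following is a minimal sketch, not FusionGDB's own code: the function names are hypothetical, and the category thresholds are taken from the "DoF>8 and MAII>0" and "DoF>8 and MAII<0" labels on this page. Note that the MAII expression is evaluated left to right, i.e. log2((# samples / DoF score) * 10).

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency: # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index: log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def category(dof: int, maii: float):
    """Category labels as printed on this page (hypothetical helper)."""
    if dof > 8 and maii > 0:
        return "eGinPCFGs"
    if dof > 8 and maii < 0:
        return "peGinPCFGs"
    return None  # the page does not define other cases


# Inputs copied from the summary table on this page.
thoc1_dof = dof_score(4, 5, 3)
thada_dof = dof_score(9, 9, 7)
thoc1_maii = maii_score(7, thoc1_dof)
thada_maii = maii_score(10, thada_dof)

print("THOC1", thoc1_dof, thoc1_maii, category(thoc1_dof, thoc1_maii))
print("THADA", thada_dof, thada_maii, category(thada_dof, thada_maii))
```

With the inputs from this page, the sketch reproduces DoF scores of 60 and 567 and the reported MAII signs (positive for THOC1, negative for THADA).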

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | THOC1 | GO:0006406 | mRNA export from nucleus | 17190602
Hgene | THOC1 | GO:0006915 | apoptotic process | 10512864
Hgene | THOC1 | GO:0032784 | regulation of DNA-templated transcription, elongation | 15870275
Hgene | THOC1 | GO:0046784 | viral mRNA export from host cell nucleus | 18974867


Fusion gene information from three resources
(ChiTaRs (NAR, 2018), tumorfusions (NAR, 2018), Gao et al. (Cell, 2018))
* All genome coordinates were lifted over to hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Data type | Source | Cancer type | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
TCGA | RV | UCS | TCGA-N8-A4PO-01A | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
* LD: Li Ding group's fusion gene list
  RV: Roel Verhaak group's fusion gene list
  ChiTaRs fusion database

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
In-frame | ENST00000261600 | ENST00000330266 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
In-frame | ENST00000261600 | ENST00000405975 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
In-frame | ENST00000261600 | ENST00000415080 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
In-frame | ENST00000261600 | ENST00000405006 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5CDS-5UTR | ENST00000261600 | ENST00000485353 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5CDS-intron | ENST00000261600 | ENST00000402360 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5CDS-intron | ENST00000261600 | ENST00000404790 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5CDS-intron | ENST00000261600 | ENST00000403856 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-3CDS | ENST00000582313 | ENST00000330266 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-3CDS | ENST00000582313 | ENST00000405975 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-3CDS | ENST00000582313 | ENST00000415080 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-3CDS | ENST00000582313 | ENST00000405006 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-5UTR | ENST00000582313 | ENST00000485353 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-intron | ENST00000582313 | ENST00000402360 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-intron | ENST00000582313 | ENST00000404790 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -
5UTR-intron | ENST00000582313 | ENST00000403856 | THOC1 | chr18 | 252539 | - | THADA | chr2 | 43655370 | -


FusionProtFeatures for THOC1_THADA


Main function of each fusion partner protein (from UniProt).
Hgene: THOC1 (Q96FV9)
Tgene: THADA (Q6YHU6)

THOC1: Required for efficient export of polyadenylated RNA. Acts as component of the THO subcomplex of the TREX complex which is thought to couple mRNA transcription, processing and nuclear export, and which specifically associates with spliced mRNA and not with unspliced pre-mRNA. TREX is recruited to spliced mRNAs by a transcription-independent mechanism, binds to mRNA upstream of the exon-junction complex (EJC) and is recruited in a splicing- and cap-dependent manner to a region near the 5' end of the mRNA where it functions in mRNA export to the cytoplasm via the TAP/NFX1 pathway. The TREX complex is essential for the export of Kaposi's sarcoma-associated herpesvirus (KSHV) intronless mRNAs and infectious virus production. Regulates transcriptional elongation of a subset of genes. Involved in genome stability by preventing co-transcriptional R-loop formation. Participates in an apoptotic pathway which is characterized by activation of caspase-6, increases in the expression of BAK1 and BCL2L1 and activation of NF-kappa-B. This pathway does not require p53/TP53, nor does the presence of p53/TP53 affect the efficiency of cell killing. Activates a G2/M cell cycle checkpoint prior to the onset of apoptosis. Apoptosis is inhibited by association with RB1.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt, comprising six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the break point is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000402360 | - | 0 | 17 | 886_918 | -10 | 938 | Coiled coil | Ontology_term=ECO:0000255
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000402360 | - | 0 | 17 | 1322_1327 | -10 | 938 | Compositional bias | Note=Poly-Leu
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000402360 | - | 0 | 17 | 1529_1532 | -10 | 938 | Compositional bias | Note=Poly-Ala
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000402360 | - | 0 | 17 | 4_7 | -10 | 938 | Compositional bias | Note=Poly-Lys
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405006 | - | 26 | 38 | 1322_1327 | 1308 | 1954 | Compositional bias | Note=Poly-Leu
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405006 | - | 26 | 38 | 1529_1532 | 1308 | 1954 | Compositional bias | Note=Poly-Ala
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405975 | - | 26 | 38 | 1322_1327 | 1308 | 1954 | Compositional bias | Note=Poly-Leu
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405975 | - | 26 | 38 | 1529_1532 | 1308 | 1954 | Compositional bias | Note=Poly-Ala

- In-frame and not-retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | THOC1 | chr18:252539 | chr2:43655370 | ENST00000261600 | - | 9 | 21 | 570_653 | 225 | 658 | Domain | Death
Hgene | THOC1 | chr18:252539 | chr2:43655370 | ENST00000261600 | - | 9 | 21 | 414_430 | 225 | 658 | Motif | Nuclear localization signal
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405006 | - | 26 | 38 | 886_918 | 1308 | 1954 | Coiled coil | Ontology_term=ECO:0000255
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405975 | - | 26 | 38 | 886_918 | 1308 | 1954 | Coiled coil | Ontology_term=ECO:0000255
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405006 | - | 26 | 38 | 4_7 | 1308 | 1954 | Compositional bias | Note=Poly-Lys
Tgene | THADA | chr18:252539 | chr2:43655370 | ENST00000405975 | - | 26 | 38 | 4_7 | 1308 | 1954 | Compositional bias | Note=Poly-Lys
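
The retained and not-retained rows above are consistent with a simple positional rule, sketched below. This is an assumption inferred from the rows on this page, not a documented FusionGDB algorithm: a feature of the 5' partner (Hgene) is treated as retained when it ends at or before BPloci, and a feature of the 3' partner (Tgene) is treated as retained when it starts at or after BPloci. The function name and the handling of the exact boundary are hypothetical.

```python
def is_retained(partner: str, feature_start: int, feature_end: int, bp_loci: int) -> bool:
    """Guess whether a protein feature survives the fusion.

    Assumed rule (inferred from the tables, not from FusionGDB documentation):
    the Hgene contributes the residues before the break point, and the Tgene
    contributes the residues after it.
    """
    if partner == "Hgene":
        return feature_end <= bp_loci
    if partner == "Tgene":
        return feature_start >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# Rows copied from the tables above: (partner, feature loci, BPloci).
rows = [
    ("Tgene", (1322, 1327), 1308),  # Poly-Leu, listed as retained
    ("Tgene", (1529, 1532), 1308),  # Poly-Ala, listed as retained
    ("Tgene", (886, 918), 1308),    # Coiled coil, listed as not retained
    ("Tgene", (4, 7), 1308),        # Poly-Lys, listed as not retained
    ("Hgene", (570, 653), 225),     # Death domain, listed as not retained
    ("Hgene", (414, 430), 225),     # NLS, listed as not retained
]

results = [is_retained(p, start, end, bp) for p, (start, end), bp in rows]
print(results)
```

Under this rule, a negative Tgene BPloci (break point before the CDS) leaves every Tgene feature retained, which matches the ENST00000402360 rows in the retained table.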



FusionGeneSequence for THOC1_THADA


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences.
(nt: nucleotides, aa: amino acids)

* Fusion amino acid sequences.
>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000330266_chr2_43655370_-_305aa
MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTA
STPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ
FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCDMGEPNRHPSMFLLLLVLERLYASPMDGTSSALSMGPFVPFIMR

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000405975_chr2_43655370_-_871aa
MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTA
STPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ
FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCDMGEPNRHPSMFLLLLVLERLYASPMDGTSSALSMGPFVPFIMR
CGHSPVYHSREMAARALVPFVMIDHIPNTIRTLLSTLPSCTDQCFRQNHIHGTLLQVFHLLQAYSDSKHGTNSDFQHELTDITVCTKAKL
WLAKRQNPCLVTRAVYIDILFLLTCCLNRSAKDNQPVLESLGFWEEVRGIISGSELITGFPWAFKVPGLPQYLQSLTRLAIAAVWAAAAK
SGERETNVPISFSQLLESAFPEVRSLTLEALLEKFLAAASGLGEKGVPPLLCNMGEKFLLLAMKENHPECFCKILKILHCMDPGEWLPQT
EHCVHLTPKEFLIWTMDIASNERSEIQSVALRLASKVISHHMQTCVENRELIAAELKQWVQLVILSCEDHLPTESRLAVVEVLTSTTPLF
LTNPHPILELQDTLALWKCVLTLLQSEEQAVRDAATETVTTAMSQENTCQSTEFAFCQVDASIALALALAVLCDLLQQWDQLAPGLPILL
GWLLGESDDLVACVESMHQVEEDYLFEKAEVNFWAETLIFVKYLCKHLFCLLSKSGWRPPSPEMLCHLQRMVSEQCHLLSQFFRELPPAA

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000415080_chr2_43655370_-_871aa
MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTA
STPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ
FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCDMGEPNRHPSMFLLLLVLERLYASPMDGTSSALSMGPFVPFIMR
CGHSPVYHSREMAARALVPFVMIDHIPNTIRTLLSTLPSCTDQCFRQNHIHGTLLQVFHLLQAYSDSKHGTNSDFQHELTDITVCTKAKL
WLAKRQNPCLVTRAVYIDILFLLTCCLNRSAKDNQPVLESLGFWEEVRGIISGSELITGFPWAFKVPGLPQYLQSLTRLAIAAVWAAAAK
SGERETNVPISFSQLLESAFPEVRSLTLEALLEKFLAAASGLGEKGVPPLLCNMGEKFLLLAMKENHPECFCKILKILHCMDPGEWLPQT
EHCVHLTPKEFLIWTMDIASNERSEIQSVALRLASKVISHHMQTCVENRELIAAELKQWVQLVILSCEDHLPTESRLAVVEVLTSTTPLF
LTNPHPILELQDTLALWKCVLTLLQSEEQAVRDAATETVTTAMSQENTCQSTEFAFCQVDASIALALALAVLCDLLQQWDQLAPGLPILL
GWLLGESDDLVACVESMHQVEEDYLFEKAEVNFWAETLIFVKYLCKHLFCLLSKSGWRPPSPEMLCHLQRMVSEQCHLLSQFFRELPPAA

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000405006_chr2_43655370_-_871aa
MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTA
STPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ
FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCDMGEPNRHPSMFLLLLVLERLYASPMDGTSSALSMGPFVPFIMR
CGHSPVYHSREMAARALVPFVMIDHIPNTIRTLLSTLPSCTDQCFRQNHIHGTLLQVFHLLQAYSDSKHGTNSDFQHELTDITVCTKAKL
WLAKRQNPCLVTRAVYIDILFLLTCCLNRSAKDNQPVLESLGFWEEVRGIISGSELITGFPWAFKVPGLPQYLQSLTRLAIAAVWAAAAK
SGERETNVPISFSQLLESAFPEVRSLTLEALLEKFLAAASGLGEKGVPPLLCNMGEKFLLLAMKENHPECFCKILKILHCMDPGEWLPQT
EHCVHLTPKEFLIWTMDIASNERSEIQSVALRLASKVISHHMQTCVENRELIAAELKQWVQLVILSCEDHLPTESRLAVVEVLTSTTPLF
LTNPHPILELQDTLALWKCVLTLLQSEEQAVRDAATETVTTAMSQENTCQSTEFAFCQVDASIALALALAVLCDLLQQWDQLAPGLPILL
GWLLGESDDLVACVESMHQVEEDYLFEKAEVNFWAETLIFVKYLCKHLFCLLSKSGWRPPSPEMLCHLQRMVSEQCHLLSQFFRELPPAA


* Fusion transcript sequences (only coding sequence (CDS) region).
>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000330266_chr2_43655370_-_915nt
ATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACAAAAACATC
AAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTATTCTAGAA
GAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTTGTACCGCA
TCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAAAAAATGTT
GCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGTCTAAATCC
CAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGCAGAGTCAG
TTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAGAAGGAATG
GATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGTTTCTCTTA
CTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCATTATGAGA
GTCTTGCTCTGTTGTCCAGGCTGGAGTGCAGTGGCACGATCTCAGCTCACTGCAACCTCCACCTCCTGGGTTCAAGCAATTCTCCTGCCT

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000405975_chr2_43655370_-_2613nt
ATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACAAAAACATC
AAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTATTCTAGAA
GAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTTGTACCGCA
TCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAAAAAATGTT
GCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGTCTAAATCC
CAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGCAGAGTCAG
TTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAGAAGGAATG
GATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGTTTCTCTTA
CTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCATTATGAGG
TGTGGTCACTCACCTGTCTACCACTCCCGTGAAATGGCAGCTCGTGCCTTGGTCCCATTTGTTATGATAGATCACATTCCTAATACCATT
CGAACTCTGTTGTCCACACTCCCCAGCTGCACTGACCAGTGTTTCCGGCAAAACCACATTCATGGGACACTTCTCCAGGTTTTTCATTTG
TTGCAAGCCTACTCAGACTCCAAACACGGAACGAATTCAGACTTCCAGCACGAGCTGACTGACATCACTGTTTGTACCAAAGCCAAACTC
TGGCTGGCCAAGAGGCAAAATCCATGTTTGGTGACCAGAGCTGTATATATTGATATTCTCTTCCTATTGACTTGCTGCCTCAACAGATCT
GCAAAGGACAACCAGCCAGTTCTGGAGAGTCTTGGCTTCTGGGAGGAAGTCAGAGGGATTATCTCAGGATCAGAGCTGATAACGGGATTC
CCTTGGGCCTTCAAGGTGCCAGGCCTGCCCCAGTACCTCCAGAGCCTCACCAGACTAGCCATTGCTGCAGTGTGGGCCGCGGCAGCCAAG
AGTGGAGAGCGGGAGACGAATGTCCCCATCTCTTTCTCTCAGCTGTTAGAATCTGCCTTCCCTGAAGTGCGCTCACTAACACTGGAAGCC
CTCTTGGAAAAGTTCTTAGCAGCAGCCTCTGGACTTGGAGAGAAGGGCGTGCCACCCTTGCTGTGCAACATGGGAGAGAAGTTCTTATTG
TTGGCCATGAAGGAAAATCACCCAGAATGCTTCTGCAAGATACTGAAAATTCTCCACTGCATGGACCCTGGTGAGTGGCTTCCCCAGACG
GAGCACTGTGTCCATCTGACCCCAAAGGAGTTCTTGATCTGGACGATGGATATTGCTTCCAATGAAAGATCTGAAATTCAGAGTGTAGCT
CTGAGACTTGCTTCCAAAGTCATTTCCCACCACATGCAGACATGTGTGGAGAACAGGGAATTGATAGCTGCTGAGCTGAAGCAGTGGGTT
CAGCTGGTCATCTTGTCATGTGAAGACCATCTTCCTACAGAGTCTAGGCTGGCCGTCGTTGAAGTCCTCACCAGTACTACACCACTTTTC
CTCACCAACCCCCATCCTATTCTTGAGTTGCAGGATACACTTGCTCTCTGGAAGTGTGTCCTTACCCTTCTGCAGAGTGAGGAGCAAGCT
GTTAGAGATGCAGCCACGGAAACCGTGACAACTGCCATGTCACAAGAAAATACCTGCCAGTCAACAGAGTTTGCCTTCTGCCAGGTGGAT
GCCTCCATCGCTCTGGCCCTGGCCCTGGCCGTCCTGTGTGATCTGCTCCAGCAGTGGGACCAGTTGGCCCCTGGACTGCCCATCCTGCTG
GGATGGCTGTTGGGAGAGAGTGATGACCTCGTGGCCTGTGTGGAGAGCATGCATCAGGTGGAAGAAGACTACCTGTTTGAAAAAGCAGAA
GTCAACTTTTGGGCCGAGACCCTGATCTTTGTGAAATACCTCTGCAAGCACCTCTTCTGTCTCCTCTCAAAGTCCGGCTGGCGTCCCCCA
AGCCCTGAGATGCTCTGTCACCTTCAAAGGATGGTGTCAGAGCAGTGCCACCTCCTGTCTCAGTTCTTCAGAGAGCTTCCACCAGCTGCT
GAGTTTGTGAAGACAGTGGAGTTCACAAGACTACGCATTCAAGAGGAAAGGACTTTGGCTTGCTTGAGGCTGCTGGCCTTTTTGGAAGGA
AAGGAAGGGGAAGACACCCTAGTTCTCAGTGTTTGGGACTCTTATGCAGAATCGAGGCAGTTAACTCTTCCAAGAACAGAAGCGGCATGT

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000415080_chr2_43655370_-_2613nt
ATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACAAAAACATC
AAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTATTCTAGAA
GAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTTGTACCGCA
TCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAAAAAATGTT
GCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGTCTAAATCC
CAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGCAGAGTCAG
TTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAGAAGGAATG
GATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGTTTCTCTTA
CTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCATTATGAGG
TGTGGTCACTCACCTGTCTACCACTCCCGTGAAATGGCAGCTCGTGCCTTGGTCCCATTTGTTATGATAGATCACATTCCTAATACCATT
CGAACTCTGTTGTCCACACTCCCCAGCTGCACTGACCAGTGTTTCCGGCAAAACCACATTCATGGGACACTTCTCCAGGTTTTTCATTTG
TTGCAAGCCTACTCAGACTCCAAACACGGAACGAATTCAGACTTCCAGCACGAGCTGACTGACATCACTGTTTGTACCAAAGCCAAACTC
TGGCTGGCCAAGAGGCAAAATCCATGTTTGGTGACCAGAGCTGTATATATTGATATTCTCTTCCTATTGACTTGCTGCCTCAACAGATCT
GCAAAGGACAACCAGCCAGTTCTGGAGAGTCTTGGCTTCTGGGAGGAAGTCAGAGGGATTATCTCAGGATCAGAGCTGATAACGGGATTC
CCTTGGGCCTTCAAGGTGCCAGGCCTGCCCCAGTACCTCCAGAGCCTCACCAGACTAGCCATTGCTGCAGTGTGGGCCGCGGCAGCCAAG
AGTGGAGAGCGGGAGACGAATGTCCCCATCTCTTTCTCTCAGCTGTTAGAATCTGCCTTCCCTGAAGTGCGCTCACTAACACTGGAAGCC
CTCTTGGAAAAGTTCTTAGCAGCAGCCTCTGGACTTGGAGAGAAGGGCGTGCCACCCTTGCTGTGCAACATGGGAGAGAAGTTCTTATTG
TTGGCCATGAAGGAAAATCACCCAGAATGCTTCTGCAAGATACTGAAAATTCTCCACTGCATGGACCCTGGTGAGTGGCTTCCCCAGACG
GAGCACTGTGTCCATCTGACCCCAAAGGAGTTCTTGATCTGGACGATGGATATTGCTTCCAATGAAAGATCTGAAATTCAGAGTGTAGCT
CTGAGACTTGCTTCCAAAGTCATTTCCCACCACATGCAGACATGTGTGGAGAACAGGGAATTGATAGCTGCTGAGCTGAAGCAGTGGGTT
CAGCTGGTCATCTTGTCATGTGAAGACCATCTTCCTACAGAGTCTAGGCTGGCCGTCGTTGAAGTCCTCACCAGTACTACACCACTTTTC
CTCACCAACCCCCATCCTATTCTTGAGTTGCAGGATACACTTGCTCTCTGGAAGTGTGTCCTTACCCTTCTGCAGAGTGAGGAGCAAGCT
GTTAGAGATGCAGCCACGGAAACCGTGACAACTGCCATGTCACAAGAAAATACCTGCCAGTCAACAGAGTTTGCCTTCTGCCAGGTGGAT
GCCTCCATCGCTCTGGCCCTGGCCCTGGCCGTCCTGTGTGATCTGCTCCAGCAGTGGGACCAGTTGGCCCCTGGACTGCCCATCCTGCTG
GGATGGCTGTTGGGAGAGAGTGATGACCTCGTGGCCTGTGTGGAGAGCATGCATCAGGTGGAAGAAGACTACCTGTTTGAAAAAGCAGAA
GTCAACTTTTGGGCCGAGACCCTGATCTTTGTGAAATACCTCTGCAAGCACCTCTTCTGTCTCCTCTCAAAGTCCGGCTGGCGTCCCCCA
AGCCCTGAGATGCTCTGTCACCTTCAAAGGATGGTGTCAGAGCAGTGCCACCTCCTGTCTCAGTTCTTCAGAGAGCTTCCACCAGCTGCT
GAGTTTGTGAAGACAGTGGAGTTCACAAGACTACGCATTCAAGAGGAAAGGACTTTGGCTTGCTTGAGGCTGCTGGCCTTTTTGGAAGGA
AAGGAAGGGGAAGACACCCTAGTTCTCAGTGTTTGGGACTCTTATGCAGAATCGAGGCAGTTAACTCTTCCAAGAACAGAAGCGGCATGT

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000405006_chr2_43655370_-_2613nt
ATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACAAAAACATC
AAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTATTCTAGAA
GAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTTGTACCGCA
TCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAAAAAATGTT
GCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGTCTAAATCC
CAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGCAGAGTCAG
TTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAGAAGGAATG
GATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGTTTCTCTTA
CTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCATTATGAGG
TGTGGTCACTCACCTGTCTACCACTCCCGTGAAATGGCAGCTCGTGCCTTGGTCCCATTTGTTATGATAGATCACATTCCTAATACCATT
CGAACTCTGTTGTCCACACTCCCCAGCTGCACTGACCAGTGTTTCCGGCAAAACCACATTCATGGGACACTTCTCCAGGTTTTTCATTTG
TTGCAAGCCTACTCAGACTCCAAACACGGAACGAATTCAGACTTCCAGCACGAGCTGACTGACATCACTGTTTGTACCAAAGCCAAACTC
TGGCTGGCCAAGAGGCAAAATCCATGTTTGGTGACCAGAGCTGTATATATTGATATTCTCTTCCTATTGACTTGCTGCCTCAACAGATCT
GCAAAGGACAACCAGCCAGTTCTGGAGAGTCTTGGCTTCTGGGAGGAAGTCAGAGGGATTATCTCAGGATCAGAGCTGATAACGGGATTC
CCTTGGGCCTTCAAGGTGCCAGGCCTGCCCCAGTACCTCCAGAGCCTCACCAGACTAGCCATTGCTGCAGTGTGGGCCGCGGCAGCCAAG
AGTGGAGAGCGGGAGACGAATGTCCCCATCTCTTTCTCTCAGCTGTTAGAATCTGCCTTCCCTGAAGTGCGCTCACTAACACTGGAAGCC
CTCTTGGAAAAGTTCTTAGCAGCAGCCTCTGGACTTGGAGAGAAGGGCGTGCCACCCTTGCTGTGCAACATGGGAGAGAAGTTCTTATTG
TTGGCCATGAAGGAAAATCACCCAGAATGCTTCTGCAAGATACTGAAAATTCTCCACTGCATGGACCCTGGTGAGTGGCTTCCCCAGACG
GAGCACTGTGTCCATCTGACCCCAAAGGAGTTCTTGATCTGGACGATGGATATTGCTTCCAATGAAAGATCTGAAATTCAGAGTGTAGCT
CTGAGACTTGCTTCCAAAGTCATTTCCCACCACATGCAGACATGTGTGGAGAACAGGGAATTGATAGCTGCTGAGCTGAAGCAGTGGGTT
CAGCTGGTCATCTTGTCATGTGAAGACCATCTTCCTACAGAGTCTAGGCTGGCCGTCGTTGAAGTCCTCACCAGTACTACACCACTTTTC
CTCACCAACCCCCATCCTATTCTTGAGTTGCAGGATACACTTGCTCTCTGGAAGTGTGTCCTTACCCTTCTGCAGAGTGAGGAGCAAGCT
GTTAGAGATGCAGCCACGGAAACCGTGACAACTGCCATGTCACAAGAAAATACCTGCCAGTCAACAGAGTTTGCCTTCTGCCAGGTGGAT
GCCTCCATCGCTCTGGCCCTGGCCCTGGCCGTCCTGTGTGATCTGCTCCAGCAGTGGGACCAGTTGGCCCCTGGACTGCCCATCCTGCTG
GGATGGCTGTTGGGAGAGAGTGATGACCTCGTGGCCTGTGTGGAGAGCATGCATCAGGTGGAAGAAGACTACCTGTTTGAAAAAGCAGAA
GTCAACTTTTGGGCCGAGACCCTGATCTTTGTGAAATACCTCTGCAAGCACCTCTTCTGTCTCCTCTCAAAGTCCGGCTGGCGTCCCCCA
AGCCCTGAGATGCTCTGTCACCTTCAAAGGATGGTGTCAGAGCAGTGCCACCTCCTGTCTCAGTTCTTCAGAGAGCTTCCACCAGCTGCT
GAGTTTGTGAAGACAGTGGAGTTCACAAGACTACGCATTCAAGAGGAAAGGACTTTGGCTTGCTTGAGGCTGCTGGCCTTTTTGGAAGGA
AAGGAAGGGGAAGACACCCTAGTTCTCAGTGTTTGGGACTCTTATGCAGAATCGAGGCAGTTAACTCTTCCAAGAACAGAAGCGGCATGT


* Fusion transcript sequences (Full-length transcript).
>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000330266_chr2_43655370_-_923nt
CCGAGAAGATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACA
AAAACATCAAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTA
TTCTAGAAGAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTT
GTACCGCATCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAA
AAAATGTTGCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGT
CTAAATCCCAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGC
AGAGTCAGTTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAG
AAGGAATGGATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGT
TTCTCTTACTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCA
TTATGAGAGTCTTGCTCTGTTGTCCAGGCTGGAGTGCAGTGGCACGATCTCAGCTCACTGCAACCTCCACCTCCTGGGTTCAAGCAATTC

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000405975_chr2_43655370_-_2733nt
CCGAGAAGATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACA
AAAACATCAAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTA
TTCTAGAAGAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTT
GTACCGCATCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAA
AAAATGTTGCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGT
CTAAATCCCAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGC
AGAGTCAGTTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAG
AAGGAATGGATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGT
TTCTCTTACTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCA
TTATGAGGTGTGGTCACTCACCTGTCTACCACTCCCGTGAAATGGCAGCTCGTGCCTTGGTCCCATTTGTTATGATAGATCACATTCCTA
ATACCATTCGAACTCTGTTGTCCACACTCCCCAGCTGCACTGACCAGTGTTTCCGGCAAAACCACATTCATGGGACACTTCTCCAGGTTT
TTCATTTGTTGCAAGCCTACTCAGACTCCAAACACGGAACGAATTCAGACTTCCAGCACGAGCTGACTGACATCACTGTTTGTACCAAAG
CCAAACTCTGGCTGGCCAAGAGGCAAAATCCATGTTTGGTGACCAGAGCTGTATATATTGATATTCTCTTCCTATTGACTTGCTGCCTCA
ACAGATCTGCAAAGGACAACCAGCCAGTTCTGGAGAGTCTTGGCTTCTGGGAGGAAGTCAGAGGGATTATCTCAGGATCAGAGCTGATAA
CGGGATTCCCTTGGGCCTTCAAGGTGCCAGGCCTGCCCCAGTACCTCCAGAGCCTCACCAGACTAGCCATTGCTGCAGTGTGGGCCGCGG
CAGCCAAGAGTGGAGAGCGGGAGACGAATGTCCCCATCTCTTTCTCTCAGCTGTTAGAATCTGCCTTCCCTGAAGTGCGCTCACTAACAC
TGGAAGCCCTCTTGGAAAAGTTCTTAGCAGCAGCCTCTGGACTTGGAGAGAAGGGCGTGCCACCCTTGCTGTGCAACATGGGAGAGAAGT
TCTTATTGTTGGCCATGAAGGAAAATCACCCAGAATGCTTCTGCAAGATACTGAAAATTCTCCACTGCATGGACCCTGGTGAGTGGCTTC
CCCAGACGGAGCACTGTGTCCATCTGACCCCAAAGGAGTTCTTGATCTGGACGATGGATATTGCTTCCAATGAAAGATCTGAAATTCAGA
GTGTAGCTCTGAGACTTGCTTCCAAAGTCATTTCCCACCACATGCAGACATGTGTGGAGAACAGGGAATTGATAGCTGCTGAGCTGAAGC
AGTGGGTTCAGCTGGTCATCTTGTCATGTGAAGACCATCTTCCTACAGAGTCTAGGCTGGCCGTCGTTGAAGTCCTCACCAGTACTACAC
CACTTTTCCTCACCAACCCCCATCCTATTCTTGAGTTGCAGGATACACTTGCTCTCTGGAAGTGTGTCCTTACCCTTCTGCAGAGTGAGG
AGCAAGCTGTTAGAGATGCAGCCACGGAAACCGTGACAACTGCCATGTCACAAGAAAATACCTGCCAGTCAACAGAGTTTGCCTTCTGCC
AGGTGGATGCCTCCATCGCTCTGGCCCTGGCCCTGGCCGTCCTGTGTGATCTGCTCCAGCAGTGGGACCAGTTGGCCCCTGGACTGCCCA
TCCTGCTGGGATGGCTGTTGGGAGAGAGTGATGACCTCGTGGCCTGTGTGGAGAGCATGCATCAGGTGGAAGAAGACTACCTGTTTGAAA
AAGCAGAAGTCAACTTTTGGGCCGAGACCCTGATCTTTGTGAAATACCTCTGCAAGCACCTCTTCTGTCTCCTCTCAAAGTCCGGCTGGC
GTCCCCCAAGCCCTGAGATGCTCTGTCACCTTCAAAGGATGGTGTCAGAGCAGTGCCACCTCCTGTCTCAGTTCTTCAGAGAGCTTCCAC
CAGCTGCTGAGTTTGTGAAGACAGTGGAGTTCACAAGACTACGCATTCAAGAGGAAAGGACTTTGGCTTGCTTGAGGCTGCTGGCCTTTT
TGGAAGGAAAGGAAGGGGAAGACACCCTAGTTCTCAGTGTTTGGGACTCTTATGCAGAATCGAGGCAGTTAACTCTTCCAAGAACAGAAG
CGGCATGTTGAAGAAAATCTGGGGGATTGGGATGGGGGTATGTGTGGATTTTTCCTCCACTAAATCTGCAGGAAACATGTTGAACATAAA

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000415080_chr2_43655370_-_2732nt
CCGAGAAGATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACA
AAAACATCAAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTA
TTCTAGAAGAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTT
GTACCGCATCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAA
AAAATGTTGCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGT
CTAAATCCCAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGC
AGAGTCAGTTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAG
AAGGAATGGATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGT
TTCTCTTACTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCA
TTATGAGGTGTGGTCACTCACCTGTCTACCACTCCCGTGAAATGGCAGCTCGTGCCTTGGTCCCATTTGTTATGATAGATCACATTCCTA
ATACCATTCGAACTCTGTTGTCCACACTCCCCAGCTGCACTGACCAGTGTTTCCGGCAAAACCACATTCATGGGACACTTCTCCAGGTTT
TTCATTTGTTGCAAGCCTACTCAGACTCCAAACACGGAACGAATTCAGACTTCCAGCACGAGCTGACTGACATCACTGTTTGTACCAAAG
CCAAACTCTGGCTGGCCAAGAGGCAAAATCCATGTTTGGTGACCAGAGCTGTATATATTGATATTCTCTTCCTATTGACTTGCTGCCTCA
ACAGATCTGCAAAGGACAACCAGCCAGTTCTGGAGAGTCTTGGCTTCTGGGAGGAAGTCAGAGGGATTATCTCAGGATCAGAGCTGATAA
CGGGATTCCCTTGGGCCTTCAAGGTGCCAGGCCTGCCCCAGTACCTCCAGAGCCTCACCAGACTAGCCATTGCTGCAGTGTGGGCCGCGG
CAGCCAAGAGTGGAGAGCGGGAGACGAATGTCCCCATCTCTTTCTCTCAGCTGTTAGAATCTGCCTTCCCTGAAGTGCGCTCACTAACAC
TGGAAGCCCTCTTGGAAAAGTTCTTAGCAGCAGCCTCTGGACTTGGAGAGAAGGGCGTGCCACCCTTGCTGTGCAACATGGGAGAGAAGT
TCTTATTGTTGGCCATGAAGGAAAATCACCCAGAATGCTTCTGCAAGATACTGAAAATTCTCCACTGCATGGACCCTGGTGAGTGGCTTC
CCCAGACGGAGCACTGTGTCCATCTGACCCCAAAGGAGTTCTTGATCTGGACGATGGATATTGCTTCCAATGAAAGATCTGAAATTCAGA
GTGTAGCTCTGAGACTTGCTTCCAAAGTCATTTCCCACCACATGCAGACATGTGTGGAGAACAGGGAATTGATAGCTGCTGAGCTGAAGC
AGTGGGTTCAGCTGGTCATCTTGTCATGTGAAGACCATCTTCCTACAGAGTCTAGGCTGGCCGTCGTTGAAGTCCTCACCAGTACTACAC
CACTTTTCCTCACCAACCCCCATCCTATTCTTGAGTTGCAGGATACACTTGCTCTCTGGAAGTGTGTCCTTACCCTTCTGCAGAGTGAGG
AGCAAGCTGTTAGAGATGCAGCCACGGAAACCGTGACAACTGCCATGTCACAAGAAAATACCTGCCAGTCAACAGAGTTTGCCTTCTGCC
AGGTGGATGCCTCCATCGCTCTGGCCCTGGCCCTGGCCGTCCTGTGTGATCTGCTCCAGCAGTGGGACCAGTTGGCCCCTGGACTGCCCA
TCCTGCTGGGATGGCTGTTGGGAGAGAGTGATGACCTCGTGGCCTGTGTGGAGAGCATGCATCAGGTGGAAGAAGACTACCTGTTTGAAA
AAGCAGAAGTCAACTTTTGGGCCGAGACCCTGATCTTTGTGAAATACCTCTGCAAGCACCTCTTCTGTCTCCTCTCAAAGTCCGGCTGGC
GTCCCCCAAGCCCTGAGATGCTCTGTCACCTTCAAAGGATGGTGTCAGAGCAGTGCCACCTCCTGTCTCAGTTCTTCAGAGAGCTTCCAC
CAGCTGCTGAGTTTGTGAAGACAGTGGAGTTCACAAGACTACGCATTCAAGAGGAAAGGACTTTGGCTTGCTTGAGGCTGCTGGCCTTTT
TGGAAGGAAAGGAAGGGGAAGACACCCTAGTTCTCAGTGTTTGGGACTCTTATGCAGAATCGAGGCAGTTAACTCTTCCAAGAACAGAAG
CGGCATGTTGAAGAAAATCTGGGGGATTGGGATGGGGGTATGTGTGGATTTTTCCTCCACTAAATCTGCAGGAAACATGTTGAACATAAA

>In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000405006_chr2_43655370_-_2717nt
CCGAGAAGATGTCTCCGACGCCGCCGCTCTTCAGTTTGCCCGAAGCGCGGACGCGGTTTACGAAGTCTACCAGAGAGGCCTTGAACAACA
AAAACATCAAGCCATTGTTAAGTACCTTCAGCCAGGTACCTGGCAGTGAAAATGAAAAAAAATGTACCCTTGACCAAGCTTTCAGAGGTA
TTCTAGAAGAAGAAATTATAAATCATTCATCATGTGAAAACGTTTTAGCTATTATTTCTCTTGCTATTGGGGGAGTAACTGAAGGTATTT
GTACCGCATCTACACCTTTTGTATTGTTGGGAGATGTTTTGGATTGTCTTCCTTTGGATCAGTGTGACACAATATTCACTTTTGTGGAAA
AAAATGTTGCTACTTGGAAATCAAATACATTCTATTCTGCTGGGAAAAATTACTTACTACGTATGTGCAATGATCTCCTAAGAAGATTGT
CTAAATCCCAGAATACAGTCTTCTGTGGACGGATTCAGCTCTTTTTGGCCAGGCTTTTCCCTCTGTCTGAGAAATCAGGTCTTAACTTGC
AGAGTCAGTTTAATCTGGAAAATGTCACTGTTTTCAATACAAATGAGCAGGAAAGCACCCTGGGTCAGAAGCACACTGAAGATAGAGAAG
AAGGAATGGATGTAGAAGAAGGCGAAATGGGAGACGAGGAAGCTCCAACAACGTGTGATATGGGAGAACCAAATCGTCATCCAAGCATGT
TTCTCTTACTTTTGGTGTTGGAGAGACTCTACGCTTCCCCGATGGATGGTACTTCTTCTGCTCTCAGCATGGGACCTTTTGTTCCCTTCA
TTATGAGGTGTGGTCACTCACCTGTCTACCACTCCCGTGAAATGGCAGCTCGTGCCTTGGTCCCATTTGTTATGATAGATCACATTCCTA
ATACCATTCGAACTCTGTTGTCCACACTCCCCAGCTGCACTGACCAGTGTTTCCGGCAAAACCACATTCATGGGACACTTCTCCAGGTTT
TTCATTTGTTGCAAGCCTACTCAGACTCCAAACACGGAACGAATTCAGACTTCCAGCACGAGCTGACTGACATCACTGTTTGTACCAAAG
CCAAACTCTGGCTGGCCAAGAGGCAAAATCCATGTTTGGTGACCAGAGCTGTATATATTGATATTCTCTTCCTATTGACTTGCTGCCTCA
ACAGATCTGCAAAGGACAACCAGCCAGTTCTGGAGAGTCTTGGCTTCTGGGAGGAAGTCAGAGGGATTATCTCAGGATCAGAGCTGATAA
CGGGATTCCCTTGGGCCTTCAAGGTGCCAGGCCTGCCCCAGTACCTCCAGAGCCTCACCAGACTAGCCATTGCTGCAGTGTGGGCCGCGG
CAGCCAAGAGTGGAGAGCGGGAGACGAATGTCCCCATCTCTTTCTCTCAGCTGTTAGAATCTGCCTTCCCTGAAGTGCGCTCACTAACAC
TGGAAGCCCTCTTGGAAAAGTTCTTAGCAGCAGCCTCTGGACTTGGAGAGAAGGGCGTGCCACCCTTGCTGTGCAACATGGGAGAGAAGT
TCTTATTGTTGGCCATGAAGGAAAATCACCCAGAATGCTTCTGCAAGATACTGAAAATTCTCCACTGCATGGACCCTGGTGAGTGGCTTC
CCCAGACGGAGCACTGTGTCCATCTGACCCCAAAGGAGTTCTTGATCTGGACGATGGATATTGCTTCCAATGAAAGATCTGAAATTCAGA
GTGTAGCTCTGAGACTTGCTTCCAAAGTCATTTCCCACCACATGCAGACATGTGTGGAGAACAGGGAATTGATAGCTGCTGAGCTGAAGC
AGTGGGTTCAGCTGGTCATCTTGTCATGTGAAGACCATCTTCCTACAGAGTCTAGGCTGGCCGTCGTTGAAGTCCTCACCAGTACTACAC
CACTTTTCCTCACCAACCCCCATCCTATTCTTGAGTTGCAGGATACACTTGCTCTCTGGAAGTGTGTCCTTACCCTTCTGCAGAGTGAGG
AGCAAGCTGTTAGAGATGCAGCCACGGAAACCGTGACAACTGCCATGTCACAAGAAAATACCTGCCAGTCAACAGAGTTTGCCTTCTGCC
AGGTGGATGCCTCCATCGCTCTGGCCCTGGCCCTGGCCGTCCTGTGTGATCTGCTCCAGCAGTGGGACCAGTTGGCCCCTGGACTGCCCA
TCCTGCTGGGATGGCTGTTGGGAGAGAGTGATGACCTCGTGGCCTGTGTGGAGAGCATGCATCAGGTGGAAGAAGACTACCTGTTTGAAA
AAGCAGAAGTCAACTTTTGGGCCGAGACCCTGATCTTTGTGAAATACCTCTGCAAGCACCTCTTCTGTCTCCTCTCAAAGTCCGGCTGGC
GTCCCCCAAGCCCTGAGATGCTCTGTCACCTTCAAAGGATGGTGTCAGAGCAGTGCCACCTCCTGTCTCAGTTCTTCAGAGAGCTTCCAC
CAGCTGCTGAGTTTGTGAAGACAGTGGAGTTCACAAGACTACGCATTCAAGAGGAAAGGACTTTGGCTTGCTTGAGGCTGCTGGCCTTTT
TGGAAGGAAAGGAAGGGGAAGACACCCTAGTTCTCAGTGTTTGGGACTCTTATGCAGAATCGAGGCAGTTAACTCTTCCAAGAACAGAAG
CGGCATGTTGAAGAAAATCTGGGGGATTGGGATGGGGGTATGTGTGGATTTTTCCTCCACTAAATCTGCAGGAAACATGTTGAACATAAA
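
The FASTA headers in this section follow a fixed underscore-delimited pattern (ORF type, Hgene, Hgene transcript, chromosome, break point, strand, then the same for Tgene, then a length with an `aa` or `nt` suffix). The following is a minimal sketch of a parser for that pattern. The regular expression and function names are hypothetical, and the pattern is inferred only from the headers shown on this page, so it assumes gene symbols contain no underscores.

```python
import re

# Pattern inferred from the headers on this page; not an official format spec.
HEADER_RE = re.compile(
    r"^>(?P<orf>[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<henst>ENST\d+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<hstrand>[+-])"
    r"_(?P<tgene>[^_]+)_(?P<tenst>ENST\d+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tstrand>[+-])"
    r"_(?P<length>\d+)(?P<unit>aa|nt)$"
)


def parse_header(line: str) -> dict:
    """Split one fusion FASTA header into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields so they can be compared directly.
    for key in ("hbp", "tbp", "length"):
        fields[key] = int(fields[key])
    return fields


def cds_matches_protein(aa_length: int, cds_nt_length: int) -> bool:
    """True when the CDS length is exactly three nucleotides per residue."""
    return cds_nt_length == 3 * aa_length


aa = parse_header(
    ">In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000330266_chr2_43655370_-_305aa"
)
cds = parse_header(
    ">In-frame_THOC1_ENST00000261600_chr18_252539_-_THADA_ENST00000330266_chr2_43655370_-_915nt"
)
print(aa["hgene"], aa["tgene"], aa["length"], cds["length"])
print(cds_matches_protein(aa["length"], cds["length"]))
```

For the headers on this page, the stated CDS lengths are three times the stated amino acid lengths (915 nt for 305 aa, 2613 nt for 871 aa), so a check like this can flag mismatched records before downstream use.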



FusionGenePPI for THOC1_THADA


Go to the ChiPPI (Chimeric Protein-Protein Interactions) page to see the chimeric PPI interactions.

Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene: THOC1
Hgene's interactors: RB1, DDX39B, ALYREF, THOC2, THOC5, NCBP1, THOC3, THOC7, THOC6, IK, SF3B2, SF3B1, ZCCHC8, UTRN, ZC3H15, UBR7, RRP9, XPO4, VPS53, TRMT112, TPM3, TPM1, USP13, ZNF830, DHX8, THOC1, FRA10AC1, SAP30BP, WDR77, RPA3, RPA2, RPA1, RABGEF1, TRIM54, MOAP1, USHBP1, OBSL1, RNF2, LUZP4, DSCC1, KRAS, ADAR, DHX9, FMR1, GSN, PRMT1, SP110, NFKBIL1, FXR1, DUSP11, TOP3B, BCLAF1, EIF4A3, ZC3H11A, THRAP3, C17orf85, DDX39A, KIF1C, B4GALT7, ZC3H13, ZZEF1, ZC3H3, CHTOP, RBM15B, MRPS17, GTSE1, C14orf166, RBM27, BCCIP, SCAF1, ZC3H14, CAAP1, TDRD3, FYTTD1, POLDIP3, SARNP, PWWP2A, ATG4A, TARSL2, C19orf47, RBM33, ZFC3H1, C15orf52, NEDD4, MRPL12, MTNR1B, DLST, PDHA1
Tgene: THADA
Tgene's interactors: CUL3, UBA2, AHCYL1, SRXN1, SSSCA1, SEC24A, SEC23A, SULT1A1, NUBP2, STAM, IGBP1, PSAT1, SIN3A, TES, EGFR, USHBP1, CD274, VSIG2, CA14, LYPD3, SCN2B, ALDH3A2, NTRK1, VASN, TMEM206, TNFRSF1A, OPRM1, SIGLECL1, CHRM4, FZD10, ILVBL, EDNRB, NAA10, CCKBR, CD83, APLNR, MLST8, MTOR, XPO6, XPO4


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



RelatedDrugs for THOC1_THADA


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.0 2018-04-02)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


RelatedDiseases for THOC1_THADA


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Tgene | THADA | C0011860 | Diabetes Mellitus, Non-Insulin-Dependent | 1 | CTD_human