Center for Computational Systems Medicine
Fusion gene ID: 15640

FusionGeneSummary for GTF2IRD1_ALK

Fusion gene summary

Fusion gene information
Fusion gene name: GTF2IRD1_ALK
Fusion gene ID: 15640

Hgene
  Gene symbol: GTF2IRD1
  Gene ID: 9569
  Gene name: GTF2I repeat domain containing 1
  Synonyms: BEN|CREAM1|GTF3|MUSTRD1|RBAP2|WBS|WBSCR11|WBSCR12|hMusTRD1alpha1
  Cytomap: 7q11.23
  Type of gene: protein-coding
  Description: general transcription factor II-I repeat domain-containing protein 1; USE B1-binding protein; Williams-Beuren syndrome chromosome region 11; binding factor for early enhancer; general transcription factor 3; general transcription factor III; muscle TFII-I repea
  Modification date: 20180522
  UniProtAcc: Q9UHL9
  Ensembl transcripts involved in fusion gene: ENST00000265755, ENST00000455841, ENST00000424337, ENST00000476977, ENST00000489094
  DoF score*: 9 X 5 X 8 = 360
  # samples: 10
  MAII score**: log2(10/360*10) = -1.84799690655495
  Category: possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs); DoF>8 and MAII<0

Tgene
  Gene symbol: ALK
  Gene ID: 238
  Gene name: ALK receptor tyrosine kinase
  Synonyms: CD246|NBLST3
  Cytomap: 2p23.2-p23.1
  Type of gene: protein-coding
  Description: ALK tyrosine kinase receptor; CD246 antigen; anaplastic lymphoma receptor tyrosine kinase; mutant anaplastic lymphoma kinase
  Modification date: 20180527
  UniProtAcc: Q9UM73
  Ensembl transcripts involved in fusion gene: ENST00000389048, ENST00000431873, ENST00000498037
  DoF score*: 24 X 23 X 10 = 5520
  # samples: 46
  MAII score**: log2(46/5520*10) = -3.58496250072116
  Category: possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs); DoF>8 and MAII<0
Context

PubMed: GTF2IRD1 [Title/Abstract] AND ALK [Title/Abstract] AND fusion [Title/Abstract]

Functional or gene categories assigned by FusionGDB annotation:
Oncogene-involved fusion gene, in-frame and retained their domain.
Kinase-involved fusion gene, in-frame and retained kinase domain.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
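
The two footnote formulas can be checked against the values reported for this fusion. The following is a minimal sketch, not FusionGDB's own code: the function names are hypothetical, and the sample counts (10 for GTF2IRD1, 46 for ALK) are read from the arguments of the log2 expressions shown in the summary.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Rule printed on the page for peGinPCFGs: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Values taken from the summary for each partner gene.
hgene_dof = dof_score(9, 5, 8)        # GTF2IRD1
tgene_dof = dof_score(24, 23, 10)     # ALK
hgene_maii = maii_score(10, hgene_dof)
tgene_maii = maii_score(46, tgene_dof)

print(hgene_dof, round(hgene_maii, 4), is_peginpcfg(hgene_dof, hgene_maii))
print(tgene_dof, round(tgene_maii, 4), is_peginpcfg(tgene_dof, tgene_maii))
```

Both partners satisfy the printed rule, which agrees with the peGinPCFGs label given for each gene.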

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner  Gene  GO ID       GO term                      PubMed ID
Tgene    ALK   GO:0016310  phosphorylation              9174053
Tgene    ALK   GO:0046777  protein autophosphorylation  9174053


Fusion gene information from three resources
(ChiTaRs (NAR, 2018), tumorfusions (NAR, 2018), Gao et al. (Cell, 2018))
* All genome coordinates were lifted over to hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Data type  Source  Cancer type  Sample            Hgene     Hchr  Hbp       Hstrand  Tgene  Tchr  Tbp       Tstrand
TCGA       RV      THCA         TCGA-EL-A4KD-01A  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
* LD: Li Ding group's fusion gene list
  RV: Roel Verhaak group's fusion gene list
  ChiTaRs fusion database
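
Because the break points are given as hg19 coordinates, a browser link for the region around each one can be built by hand. The sketch below is an illustration only: it uses the public UCSC `hgTracks` endpoint with its `db` and `position` parameters, while the helper name and the 500 bp flank are assumptions, not values taken from FusionGDB.

```python
from urllib.parse import urlencode

UCSC_TRACKS = "https://genome.ucsc.edu/cgi-bin/hgTracks"


def ucsc_breakpoint_url(chrom: str, pos: int, flank: int = 500, db: str = "hg19") -> str:
    """Return a UCSC Genome Browser URL centred on a break point.

    `flank` (bases shown on each side) is an arbitrary choice for illustration.
    """
    start = max(1, pos - flank)
    end = pos + flank
    query = urlencode({"db": db, "position": f"{chrom}:{start}-{end}"})
    return f"{UCSC_TRACKS}?{query}"


# Break points reported for the TCGA-EL-A4KD-01A sample.
hgene_url = ucsc_breakpoint_url("chr7", 73935627)
tgene_url = ucsc_breakpoint_url("chr2", 29446394)
print(hgene_url)
print(tgene_url)
```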

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORF          Henst            Tenst            Hgene     Hchr  Hbp       Hstrand  Tgene  Tchr  Tbp       Tstrand
In-frame     ENST00000265755  ENST00000389048  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000265755  ENST00000431873  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000265755  ENST00000498037  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
In-frame     ENST00000455841  ENST00000389048  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000455841  ENST00000431873  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000455841  ENST00000498037  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
In-frame     ENST00000424337  ENST00000389048  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000424337  ENST00000431873  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000424337  ENST00000498037  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
In-frame     ENST00000476977  ENST00000389048  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000476977  ENST00000431873  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
5CDS-intron  ENST00000476977  ENST00000498037  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
3UTR-3CDS    ENST00000489094  ENST00000389048  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
3UTR-intron  ENST00000489094  ENST00000431873  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -
3UTR-intron  ENST00000489094  ENST00000498037  GTF2IRD1  chr7  73935627  +        ALK    chr2  29446394  -


FusionProtFeatures for GTF2IRD1_ALK


Main function of each fusion partner protein (from UniProt)

Hgene: GTF2IRD1 (Q9UHL9)
May be a transcription regulator involved in cell-cycle progression and skeletal muscle differentiation. May repress GTF2I transcriptional functions, by preventing its nuclear residency, or by inhibiting its transcriptional activation. May contribute to slow-twitch fiber type specificity during myogenesis and in regenerating muscles. Binds troponin I slow-muscle fiber enhancer (USE B1). Binds specifically and with high affinity to the EFG sequences derived from the early enhancer of HOXC8 (By similarity). {ECO:0000250, ECO:0000269|PubMed:11438732}.

Tgene: ALK (Q9UM73)
Neuronal receptor tyrosine kinase that is essentially and transiently expressed in specific regions of the central and peripheral nervous systems and plays an important role in the genesis and differentiation of the nervous system. Transduces signals from ligands at the cell surface, through specific activation of the mitogen-activated protein kinase (MAPK) pathway. Phosphorylates almost exclusively at the first tyrosine of the Y-x-x-x-Y-Y motif. Following activation by ligand, ALK induces tyrosine phosphorylation of CBL, FRS2, IRS1 and SHC1, as well as of the MAP kinases MAPK1/ERK2 and MAPK3/ERK1. Acts as a receptor for ligands pleiotrophin (PTN), a secreted growth factor, and midkine (MDK), a PTN-related factor, thus participating in PTN and MDK signal transduction. PTN-binding induces MAPK pathway activation, which is important for the anti-apoptotic signaling of PTN and regulation of cell proliferation. MDK-binding induces phosphorylation of the ALK target insulin receptor substrate (IRS1), activates mitogen-activated protein kinases (MAPKs) and PI3-kinase, resulting also in cell proliferation induction. Drives NF-kappa-B activation, probably through IRS1 and the activation of the AKT serine/threonine kinase. Recruitment of IRS1 to activated ALK and the activation of NF-kappa-B are essential for the autocrine growth and survival signaling of MDK. {ECO:0000269|PubMed:11121404, ECO:0000269|PubMed:11278720, ECO:0000269|PubMed:11387242, ECO:0000269|PubMed:11809760, ECO:0000269|PubMed:12107166, ECO:0000269|PubMed:12122009, ECO:0000269|PubMed:15226403, ECO:0000269|PubMed:15908427, ECO:0000269|PubMed:16317043, ECO:0000269|PubMed:16878150, ECO:0000269|PubMed:17274988}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the break point is located before the CDS.
- In-frame and retained protein features among the 13 regional features.
Partner  Gene      Hbp            Tbp            ENST             Strand  BPexon  TotalExon  Protein feature loci  BPloci*  TotalLen  Protein feature     Protein feature note
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000265755  +       7       27         119_213               335      960       Repeat              Note=GTF2I-like 1
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000424337  +       7       27         119_213               335      945       Repeat              Note=GTF2I-like 1
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000455841  +       7       27         119_213               367      977       Repeat              Note=GTF2I-like 1
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         1116_1392             1057     1621      Domain              Protein kinase
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         1197_1199             1057     1621      Region              Note=Inhibitor binding
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         1060_1620             1057     1621      Topological domain  Cytoplasmic

- In-frame and not-retained protein features among the 13 regional features.
Partner  Gene      Hbp            Tbp            ENST             Strand  BPexon  TotalExon  Protein feature loci  BPloci*  TotalLen  Protein feature     Protein feature note
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000265755  +       7       27         906_930               335      960       Compositional bias  Note=Ser-rich
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000424337  +       7       27         906_930               335      945       Compositional bias  Note=Ser-rich
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000455841  +       7       27         906_930               367      977       Compositional bias  Note=Ser-rich
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000265755  +       7       27         898_905               335      960       Motif               Nuclear localization signal
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000424337  +       7       27         898_905               335      945       Motif               Nuclear localization signal
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000455841  +       7       27         898_905               367      977       Motif               Nuclear localization signal
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000265755  +       7       27         342_436               335      960       Repeat              Note=GTF2I-like 2
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000265755  +       7       27         556_650               335      960       Repeat              Note=GTF2I-like 3
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000265755  +       7       27         696_790               335      960       Repeat              Note=GTF2I-like 4
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000265755  +       7       27         793_887               335      960       Repeat              Note=GTF2I-like 5
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000424337  +       7       27         342_436               335      945       Repeat              Note=GTF2I-like 2
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000424337  +       7       27         556_650               335      945       Repeat              Note=GTF2I-like 3
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000424337  +       7       27         696_790               335      945       Repeat              Note=GTF2I-like 4
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000424337  +       7       27         793_887               335      945       Repeat              Note=GTF2I-like 5
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000455841  +       7       27         342_436               367      977       Repeat              Note=GTF2I-like 2
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000455841  +       7       27         556_650               367      977       Repeat              Note=GTF2I-like 3
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000455841  +       7       27         696_790               367      977       Repeat              Note=GTF2I-like 4
Hgene    GTF2IRD1  chr7:73935627  chr2:29446394  ENST00000455841  +       7       27         793_887               367      977       Repeat              Note=GTF2I-like 5
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         816_940               1057     1621      Compositional bias  Note=Gly-rich
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         264_427               1057     1621      Domain              MAM 1
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         437_473               1057     1621      Domain              Note=LDL-receptor class A
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         478_636               1057     1621      Domain              MAM 2
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         19_1038               1057     1621      Topological domain  Extracellular
Tgene    ALK       chr7:73935627  chr2:29446394  ENST00000389048  -       18      29         1039_1059             1057     1621      Transmembrane       Helical
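
The retained and not-retained rows above are consistent with a simple positional rule: the 5' partner (Hgene) keeps features that end before its break point, and the 3' partner (Tgene) keeps features that start after it. The sketch below encodes that reading. The rule is inferred from the listed rows rather than taken from FusionGDB's documentation, the function names are hypothetical, and the handling of a feature that touches the break point exactly is an assumption.

```python
def parse_loci(loci: str) -> tuple[int, int]:
    """Split a 'start_end' protein feature locus such as '119_213'."""
    start, end = loci.split("_")
    return int(start), int(end)


def is_retained(partner: str, loci: str, bp_loci: int) -> bool:
    """Guess whether a protein feature survives in the fusion protein.

    Hgene contributes residues up to the break point, so a feature must
    end at or before BPloci. Tgene contributes residues from the break
    point onward, so a feature must start at or after BPloci.
    """
    start, end = parse_loci(loci)
    if partner == "Hgene":
        return end <= bp_loci
    if partner == "Tgene":
        return start >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# Rows taken from the two tables above.
print(is_retained("Hgene", "119_213", 335))    # GTF2I-like 1
print(is_retained("Hgene", "342_436", 335))    # GTF2I-like 2
print(is_retained("Tgene", "1116_1392", 1057)) # Protein kinase domain
print(is_retained("Tgene", "1039_1059", 1057)) # Transmembrane helix
```

Under this reading the fusion protein keeps only the first GTF2I-like repeat of GTF2IRD1 and the full cytoplasmic portion of ALK, including the kinase domain.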



FusionGeneSequence for GTF2IRD1_ALK


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences.
(nt: nucleotides, aa: amino acids)
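
Each sequence record in this section starts with a FASTA-style header that packs the ORF type, both partner genes, their transcripts, break points, strands and the sequence length into one underscore-separated string. A minimal parsing sketch follows. The field order is read off the headers on this page, the class and function names are hypothetical, and the code assumes that no gene symbol contains an underscore.

```python
import re
from typing import NamedTuple


class FusionHeader(NamedTuple):
    orf: str
    hgene: str
    henst: str
    hchr: str
    hbp: int
    hstrand: str
    tgene: str
    tenst: str
    tchr: str
    tbp: int
    tstrand: str
    length: int
    unit: str  # "aa" or "nt"


def parse_fusion_header(line: str) -> FusionHeader:
    """Parse a header such as '>In-frame_GENE_ENST..._chr7_123_+_..._899aa'."""
    fields = line.strip().lstrip(">").split("_")
    if len(fields) != 12:
        raise ValueError(f"expected 12 fields, got {len(fields)}")
    (orf, hgene, henst, hchr, hbp, hstrand,
     tgene, tenst, tchr, tbp, tstrand, size) = fields
    match = re.fullmatch(r"(\d+)(aa|nt)", size)
    if match is None:
        raise ValueError(f"unrecognised length field: {size!r}")
    return FusionHeader(orf, hgene, henst, hchr, int(hbp), hstrand,
                        tgene, tenst, tchr, int(tbp), tstrand,
                        int(match.group(1)), match.group(2))


header = parse_fusion_header(
    ">In-frame_GTF2IRD1_ENST00000265755_chr7_73935627_+_"
    "ALK_ENST00000389048_chr2_29446394_-_899aa"
)
print(header.hgene, header.hbp, header.tgene, header.tbp, header.length, header.unit)
```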

* Fusion amino acid sequences.
>In-frame_GTF2IRD1_ENST00000265755_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_899aa
MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCRG
PPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAE
YDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIREL
KQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRRKHQELQAMQMELQSPEYKLSK
LRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIIS
KFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGP
GRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPK
NCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPP
PLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGN

>In-frame_GTF2IRD1_ENST00000455841_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_931aa
MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCLS
AAQHRAATSQLEGRVVRRVLTVASRALCPTGGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRAS
VVPLPYERLLREPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPK
VPPQDLPPTATSSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILF
SIVHDKSVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPN
DPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVA
RDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLW
EIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEK
VPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPT
SLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQG

>In-frame_GTF2IRD1_ENST00000424337_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_899aa
MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCRG
PPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAE
YDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIREL
KQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRRKHQELQAMQMELQSPEYKLSK
LRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIIS
KFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGP
GRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPK
NCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPP
PLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGN

>In-frame_GTF2IRD1_ENST00000476977_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_899aa
MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCRG
PPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAE
YDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIREL
KQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRRKHQELQAMQMELQSPEYKLSK
LRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIIS
KFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGP
GRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPK
NCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPP
PLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGN


* Fusion transcript sequences (only coding sequence (CDS) region).
>In-frame_GTF2IRD1_ENST00000265755_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_2697nt
ATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATC
ATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAG
AGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGG
CCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAA
CATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTG
CCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAG
TATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGAC
TCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCA
CCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTC
AAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGAC
TTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATC
GTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAG
CTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCG
CGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCA
AGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGC
AAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGA
GACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGAC
ATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCT
GGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCA
GTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATC
TTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAG
AACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGG
ATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCT
GTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCA
CCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAA
GGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTG
TGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAAC
CTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTG
ACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCC

>In-frame_GTF2IRD1_ENST00000455841_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_2793nt
ATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATC
ATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAG
AGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCTCTCC
GCAGCTCAGCACAGGGCAGCGACATCCCAGCTCGAAGGCCGGGTGGTGAGACGGGTGCTCACTGTGGCCTCGCGTGCTCTGTGTCCCACA
GGAGGGCCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCC
CTGGAACATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGT
GTGGTGCCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCA
GCCGAGTATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGG
CGGGACTCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAG
GTGCCACCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGA
GAGCTCAAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCC
CAAGACTTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTC
TCCATCGTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTG
AGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAG
GTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAAC
GACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATC
ATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCG
GGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCT
CGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCA
GGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATG
CTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGG
GAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCA
CCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTG
GAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAA
GTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCC
CCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCC
GTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACC
AGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGG
GGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCT
TCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGC
TTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCC

>In-frame_GTF2IRD1_ENST00000424337_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_2697nt
ATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATC
ATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAG
AGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGG
CCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAA
CATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTG
CCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAG
TATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGAC
TCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCA
CCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTC
AAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGAC
TTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATC
GTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAG
CTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCG
CGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCA
AGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGC
AAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGA
GACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGAC
ATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCT
GGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCA
GTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATC
TTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAG
AACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGG
ATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCT
GTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCA
CCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAA
GGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTG
TGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAAC
CTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTG
ACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCC

>In-frame_GTF2IRD1_ENST00000476977_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_2697nt
ATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATC
ATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAG
AGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGG
CCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAA
CATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTG
CCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAG
TATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGAC
TCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCA
CCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTC
AAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGAC
TTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATC
GTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAG
CTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCG
CGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCA
AGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGC
AAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGA
GACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGAC
ATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCT
GGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCA
GTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATC
TTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAG
AACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGG
ATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCT
GTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCA
CCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAA
GGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTG
TGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAAC
CTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTG
ACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCC
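
The CDS records above and the amino acid records earlier in this section describe the same fusion products: the header lengths agree (2697 nt = 3 X 899 aa, 2793 nt = 3 X 931 aa), and translating the start of a CDS reproduces the start of the matching protein. The sketch below illustrates this with the standard genetic code. The helper name is hypothetical, and only the first 30 nucleotides of the ENST00000265755 record are used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the ENST00000265755 / ENST00000389048 CDS record.
cds_start = "ATGGCCTTGCTGGGTAAGCGCTGTGACGTC"
print(translate(cds_start))  # MALLGKRCDV, the first ten residues of the fusion protein
```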


* Fusion transcript sequences (Full-length transcript).
>In-frame_GTF2IRD1_ENST00000265755_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_3540nt
TAAATGGCAGCCAATGGAGGGTGGTGTTGCGCGGGGCTGGGATTAGGGCCGGGGCGAATGGCTGGCAATCTTACTGGGATTACAGAACAA
AGAGCCTCCCCGCGCTCCCGCTCTCCGCTCCTCTCCCCGCGCCGCCCCGCCCTCCGCCGCAGCCCGCGCCGGGGGTGGGGGCCGCCGAGC
GCCAGCCCCCCGGCCGGCCGATTCCCCCCCCGCGCCCCCTCCCCGCGCCTCCCTCCCCGCCCTCGCCGCGCCGCCGTCCTCGCCTCCCTC
TGCCTCTCCTTCCCCCATTCTCCCGGATTAATTAAGGAGGCAGCGGCAGGAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCT
GGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGC
TGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAAC
GCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAG
CTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGT
GGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTT
TATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTG
CCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAG
CTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGAC
TGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGC
ACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCC
ATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAAC
GTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATG
GAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAG
ACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTAT
GAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAA
CTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCC
CGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTG
GCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCT
GCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGC
TACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGAC
ACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTT
GTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAA
GACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATAT
GGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAA
CGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATC
TCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAG
GTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCT
ATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCG
GGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAAT
GTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGC
AAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGG
CAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGA
AAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGAAAATATCATAAAAATGAGTGATAAATACAAGGCCCA
GATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAG
AATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGTTGATGTGGACATGAGCCATTTGAGGGGAGAGGGAAC

>In-frame_GTF2IRD1_ENST00000455841_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_3456nt
GCCAGCCCCCCGGCCGGCCGATTCCCCCCCCGCGCCCCCTCCCCGCGCCTCCCTCCCCGCCCTCGCCGCGCCGCCGTCCTCGCCTCCCTC
TGCCTCTCCTTCCCCCATTCTCCCGGATTAATTAAGGAGGCAGCGGCAGGAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCT
GGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGC
TGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAAC
GCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAG
CTACAGTCAGACTTCCTCAGGTTCTGCCTCTCCGCAGCTCAGCACAGGGCAGCGACATCCCAGCTCGAAGGCCGGGTGGTGAGACGGGTG
CTCACTGTGGCCTCGCGTGCTCTGTGTCCCACAGGAGGGCCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGC
GAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGAT
GTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAG
GGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGC
TTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCA
CGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTG
TACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGT
CGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATC
CAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATG
CAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCT
GGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAG
GTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAG
GACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCC
CTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCC
TCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGAC
ATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGG
GCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAA
ACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTG
GAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAG
CCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATA
GAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAG
GCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCA
GAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTG
CACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAAT
AATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGA
CTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGT
GGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATTCTG
AAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGA
GAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAACCTATTTTGAAGTACCACCAAAAAAGCTGTA
TTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGAAAATATCATAAAAATGAGTGATAAATACAA
GGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGT
AGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGTTGATGTGGACATGAGCCATTTGAGGGGAGA

>In-frame_GTF2IRD1_ENST00000424337_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_3221nt
GAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCTGGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTA
AGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGT
CTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGG
GCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATC
CGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGT
ACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGA
GGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCC
TCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGG
AGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCC
CAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTT
CCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTG
GACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAG
TGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCA
TCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCC
TCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGG
CTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGA
ACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCC
TCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTC
AGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGA
TTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCC
CAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATA
TGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTG
TATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCC
AGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACC
CTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCT
CCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATA
TGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACG
GCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAA
GCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGG
AGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTG
CCCCTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACT
TCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTG
CCAACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAA
GAAGAAAATATCATAAAAATGAGTGATAAATACAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTAT
GCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGC

>In-frame_GTF2IRD1_ENST00000476977_chr7_73935627_+_ALK_ENST00000389048_chr2_29446394_-_4838nt
AACATTTAGCAGCAAACTCAACATACATTTTGGCCAAACGCCCACCGGCCAGTTGTTTAAATCAATATTTATCCACAGCCAAAAAGCAGA
GGAGAGCCAGAACTGAGGCCAGAGCCGAGCTCTGATGCATCTCTCATTTCTCGGGATGTTTCTGTCCCTGTGGTTGGACACCTCTGGCCT
TGTGAAGTGTGATGCACTGTCACATCTCCTGTTCTGTGTCATTGGCCAATGATCATATATCATGACCTGCTGGAAGGCCTGTCTGTGGCT
GGGACCACACGCCTTGGGCCTTATGCACACTGGGCACTGGTCGGGATCCTGGGGTGCAACAGTGGCAGGCAGACCTGGTTTCTGCCCTCA
AAGAGCTTACAGATGGCAGGGGCACCCATGGCGGCAGAAGACACCCCAGGCCTGGTGCCCTTTGTGGTGCCAGCCCCATGGTCCTCCTGC
CTGGGCCTTCCCTACCCCATTGGGTGCAGAAACTCCCTGTCTGCAGGTAGGAGACAGAGGGTAGGTTTTCAGGCTCCCTTGGGAACTGCA
GCCCTGCTCTCTGCCATCAACACCCAGCAGGGGCCACACAGAGAGCACCGGGACTGAGCCCATAGAGGGGAACCGAGAGGCCCCTGCCTC
TAGTCTCTGCCTTCTTTGCTTGGATTGGTGCAGGGACAGCTGCCTTGAGGGCAGGCCCTGGCACTGGGGCAGCCTGTGGGTGCCCCCTGG
GTCAAGAAGGAGAGGGGCAGGGTAGAACCAGGAGCCAAAGGAGGCTGATCTTTGCATCTCATGGGTGCCCAGCTGGACACTGTCATACCC
AGGAAGCCTGTGCCATGCCATGGGGACCCACAACTGGGGGCCCTGGACTTGAGGGGGAGGATGCAGCTCTGTCCCCCAGGAACCCCATTG
CAACAGGACACAGTCCTGCCCTGGGGAGCCCCTGACCTGAGACAAAGCAGCCTCGGCCCTGCTGTATCTTTCCATACCCCTGATGCCAAG
TCTCCTGGCTAGGAGGGAAACTGAGGCTGGAAGGCCTCGGCGGGGGTGGCATTGGCCTCGGGAGCATGTGGCTTGATGCAGAAATGTGAC
GGCAGAGCTCAGAGGCATGCGGAAGGGAGGGGAGGACATCACCGGCTCCTGACCCAGCTGGGCTTCAGGTTGGGGGTACAGGAGGTGGGC
AAGCAGGTTGGACAATTAAAAGCTTCGATGAGGCTGGGTGAGTGGCTTATGCCTGTATTTCCAACACTTTGGGAGGCTGAGGTGGGCAGA
TCACCTGAGGCCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGACGTGTGGT
GGCACCTGTAATCCCAGCTACCCGGGAGGCTGAGGCAGGAGAATCACTCGAACCCAGGAAGGGGAGGTTGCAGTGAGCCAAGATTGCACC
ACTGCACTATAGCCTGGGCAACAGAGTGAGACTCTGTCTCGAAATAAAATTAAATTTAAAATTTAAAAAAGCTTCAAGGACAACCAGCAG
ATGATGGCAGGACCAGGAAGGGTGCTTCAGGCAGCGGGAACTGAACCTGCACAGACTAGGGAAGCATGAACATCGGTACATCTGGGAGTG
GCACAGCTTAGAGGGCCTGGAGCATGGAGTGTGAGGGGGAACTGGAGTGAGGGACAGGAGACCAGGCGACCATGGCCTTGCTGGGTAAGC
GCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTG
CCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCA
CAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATCCGG
AGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACC
TTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGC
TGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCA
TGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGC
TGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAA
CCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCT
GCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGAC
AGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGT
ACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCA
TGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCA
TTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTG
TGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACA
TTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCC
GAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGT
ATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTG
GAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAG
AGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGC
CATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTAT
ACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGG
ACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTG
AGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCT
CTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGG
CATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCT
CCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCT
GTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGG
TACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCC
CTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCT
CTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCA
ACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAA
GAAAATATCATAAAAATGAGTGATAAATACAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCT
TCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTT
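
The FASTA headers in this section pack several fields into one underscore-separated string. Below is a minimal Python sketch of how such a header might be split into its parts. The field names (`hgene`, `h_breakpoint`, `t_strand`, and so on) and the function `parse_fusion_header` are hypothetical. The meaning given to each field (frame status, partner gene, Ensembl transcript, chromosome, breakpoint coordinate, strand, transcript length) is inferred from the header layout shown above, not from a documented FusionGDB specification.

```python
import re
from typing import NamedTuple


class FusionHeader(NamedTuple):
    """Fields inferred from the header layout; names are illustrative only."""
    frame: str
    hgene: str
    h_transcript: str
    h_chrom: str
    h_breakpoint: int
    h_strand: str
    tgene: str
    t_transcript: str
    t_chrom: str
    t_breakpoint: int
    t_strand: str
    length_nt: int


# Assumed layout:
# >FRAME_HGENE_ENST..._chrN_POS_STRAND_TGENE_ENST..._chrN_POS_STRAND_LENnt
HEADER_RE = re.compile(
    r"^>(?P<frame>[^_]+)_"
    r"(?P<hgene>.+?)_(?P<h_transcript>ENST\d+)_"
    r"(?P<h_chrom>chr[^_]+)_(?P<h_breakpoint>\d+)_(?P<h_strand>[+-])_"
    r"(?P<tgene>.+?)_(?P<t_transcript>ENST\d+)_"
    r"(?P<t_chrom>chr[^_]+)_(?P<t_breakpoint>\d+)_(?P<t_strand>[+-])_"
    r"(?P<length_nt>\d+)nt$"
)


def parse_fusion_header(line: str) -> FusionHeader:
    """Split one fusion FASTA header line into typed fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized fusion header: {line!r}")
    fields = match.groupdict()
    # Coordinates and the declared length are numeric; convert them.
    for key in ("h_breakpoint", "t_breakpoint", "length_nt"):
        fields[key] = int(fields[key])
    return FusionHeader(**fields)


header = parse_fusion_header(
    ">In-frame_GTF2IRD1_ENST00000424337_chr7_73935627_+_"
    "ALK_ENST00000389048_chr2_29446394_-_3221nt"
)
print(header.hgene, header.h_breakpoint, header.tgene, header.t_breakpoint)
```

The lazy `.+?` for the gene symbols relies on the `ENST` transcript ID that follows each symbol, so symbols containing digits or hyphens are still captured whole.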



FusionGenePPI for GTF2IRD1_ALK


Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in the ChiPPI page.

Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors
GTF2IRD1 | HDAC3, PIAS2, SMAD2, USF1, UBC, EXOSC4, BRCA1, MAGEA10, GTF2IRD2B, FBXW11, HNRNPD, SORT1, XRCC1, SYNCRIP, TOR1AIP1, SGTB, BAG6, WASF3, ZNF23, LRRIQ1, PPP2R2D, NOLC1, KIFAP3, AKNAD1, TRIM25, CBX5, CBX1, CBX3, SP1, ALMS1, ATF7IP, ATP2C1, HTRA4, INTS12, KPNA1, KPNA3, KPNA4, MBD3L1, PKP2, SPTLC1, TMEM55A, USP20, VIMP, DCAF6, KPNA2, PIAS1, ZC4H2, ZMYM5, PKP1, ZMYM2, ZMYM3 | ALK | PTN, SHC1, PLCG1, JAK3, SHC3, RAB35, HSPD1, MAP3K4, GRB2, SOCS1, PIK3CB, RAD17, IRF7, EPHB2, EIF4B, MAPK8IP3, CENPF, PLCB2, PDX1, STAT3, MAP2K7, JAK2, MAP3K5, MEP1B, GAK, EPHA1, UBASH3A, MTIF2, MYLK, MAP3K1, MAPK1, SOCS5, CDK13, SMC6, IRS4, KRT18, KRT74, IRS1, IKBKG, ZC3HC1, TNK2, ERRFI1, PIK3R1, HSP90AA1, BCAR1, ACTB, PDLIM3, VIM, TUBB4B, MYO6, MYH10, MYH9, TUBGCP2, ACTN4, CORO1C, FLII, BICD2, GNB2L1, SRC, PXN, PRKCQ, MYBPC2, CDK9, PPM1A, DUSP19


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



RelatedDrugs for GTF2IRD1_ALK


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.0 2018-04-02)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status
Tgene | ALK | Q9UM73 | DB08865 | Crizotinib | ALK tyrosine kinase receptor | small molecule | approved
Tgene | ALK | Q9UM73 | DB09063 | Ceritinib | ALK tyrosine kinase receptor | small molecule | approved


RelatedDiseases for GTF2IRD1_ALK


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | GTF2IRD1 | C0175702 | Williams Syndrome | 1 | CTD_human;ORPHANET
Hgene | GTF2IRD1 | C0376634 | Craniofacial Abnormalities | 1 | CTD_human
Tgene | ALK | C0007131 | Non-Small Cell Lung Carcinoma | 28 | CTD_human
Tgene | ALK | C0027819 | Neuroblastoma | 12 | CTD_human;ORPHANET
Tgene | ALK | C0152013 | Adenocarcinoma of lung (disorder) | 8 | CTD_human
Tgene | ALK | C0206180 | Ki-1+ Anaplastic Large Cell Lymphoma | 6 | CTD_human
Tgene | ALK | C2751681 | NEUROBLASTOMA, SUSCEPTIBILITY TO, 3 | 4 | UNIPROT
Tgene | ALK | C0018199 | Granuloma, Plasma Cell | 3 | CTD_human
Tgene | ALK | C0007621 | Neoplastic Cell Transformation | 2 | CTD_human
Tgene | ALK | C0027627 | Neoplasm Metastasis | 2 | CTD_human
Tgene | ALK | C0001973 | Alcoholic Intoxication, Chronic | 1 | PSYGENET
Tgene | ALK | C0006118 | Brain Neoplasms | 1 | CTD_human
Tgene | ALK | C0007134 | Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C0011570 | Mental Depression | 1 | PSYGENET
Tgene | ALK | C0011581 | Depressive disorder | 1 | PSYGENET
Tgene | ALK | C0027643 | Neoplasm Recurrence, Local | 1 | CTD_human
Tgene | ALK | C0036341 | Schizophrenia | 1 | PSYGENET
Tgene | ALK | C0079744 | Diffuse Large B-Cell Lymphoma | 1 | CTD_human
Tgene | ALK | C0085269 | Plasma Cell Granuloma, Pulmonary | 1 | CTD_human
Tgene | ALK | C0278601 | Inflammatory Breast Carcinoma | 1 | CTD_human