Center for Computational Systems Medicine
Fusion gene ID: 8373

FusionGeneSummary for CP_SOX5

Fusion gene summary

Fusion gene information
Fusion gene name: CP_SOX5
Fusion gene ID: 8373

| | Hgene | Tgene |
|---|---|---|
| Gene symbol | CP | SOX5 |
| Gene ID | 1356 | 6660 |
| Gene name | ceruloplasmin | SRY-box 5 |
| Synonyms | CP-2 | L-SOX5, L-SOX5B, L-SOX5F, LAMSHF |
| Cytomap | 3q24-q25.1 | 12p12.1 |
| Type of gene | protein-coding | protein-coding |
| Description | ceruloplasmin; ceruloplasmin (ferroxidase) | transcription factor SOX-5; SRY (sex determining region Y)-box 5 |
| Modification date | 20180523 | 20180523 |
| UniProtAcc | P00450 | P35711 |
| Ensembl transcripts involved in fusion gene | ENST00000264613, ENST00000462336 | ENST00000546136, ENST00000381381, ENST00000309359, ENST00000451604, ENST00000537393, ENST00000541536, ENST00000396007, ENST00000545921, ENST00000541847, ENST00000441133, ENST00000536850 |

Fusion gene scores

| | Hgene (CP) | Tgene (SOX5) |
|---|---|---|
| * DoF score | 4 X 3 X 3 = 36 | 7 X 7 X 5 = 245 |
| # samples | 7 | 9 |
| ** MAII score | log2(7/36*10) = 0.959358015502654 | log2(9/245*10) = -1.4447848426729 |
| Category | effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs): DoF>8 and MAII>0 | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0 |
Context: PubMed: CP [Title/Abstract] AND SOX5 [Title/Abstract] AND fusion [Title/Abstract]

Functional or gene categories assigned by FusionGDB annotation: Transcription factor involved fusion gene, in-frame and retained DNA-binding domain.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2((# samples / DoF score) X 10)
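
The two score definitions above can be reproduced in a few lines of Python. This is only an illustrative sketch: the function names and the `classify` helper are hypothetical and not part of FusionGDB, and the input counts are the ones listed for CP and SOX5 on this page. The page does not say how a gene with a DoF of 8 or less, or an MAII of exactly 0, is labelled, so the sketch returns `None` in those cases.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2((# samples / DoF score) X 10)."""
    return math.log2(n_samples / dof * 10)


def classify(dof: int, maii: float):
    """Apply the two thresholds quoted on the page (hypothetical helper)."""
    if dof > 8 and maii > 0:
        return "eGinPCFG"   # effective Gene in Pan-Cancer Fusion Genes
    if dof > 8 and maii < 0:
        return "peGinPCFG"  # possibly effective Gene in Pan-Cancer Fusion Genes
    return None             # assumption: the page gives no label for other cases


# Counts as listed on this page: (partners, break points, cancer types, samples)
cp_dof = dof_score(4, 3, 3)
cp_maii = maii_score(7, cp_dof)
sox5_dof = dof_score(7, 7, 5)
sox5_maii = maii_score(9, sox5_dof)

print("CP  ", cp_dof, cp_maii, classify(cp_dof, cp_maii))
print("SOX5", sox5_dof, sox5_maii, classify(sox5_dof, sox5_maii))
```

With the page's counts, this gives a DoF of 36 for CP and 245 for SOX5, and the MAII values agree with the ones listed in the score table.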

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez

| Partner | Gene | GO ID | GO term | PubMed ID |
|---|---|---|---|---|
| Tgene | SOX5 | GO:0032332 | positive regulation of chondrocyte differentiation | 21401405 |
| Tgene | SOX5 | GO:0061036 | positive regulation of cartilage development | 21401405 |
| Tgene | SOX5 | GO:0071560 | cellular response to transforming growth factor beta stimulus | 21401405 |
| Tgene | SOX5 | GO:2000741 | positive regulation of mesenchymal stem cell differentiation | 21401405 |

Fusion gene information from three resources
(ChiTaRs (NAR, 2018), tumorfusions (NAR, 2018), Gao et al. (Cell, 2018))
* All genome coordinates were lifted over to hg19.

| Data type | Source | Cancer type | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
|---|---|---|---|---|---|---|---|---|---|---|---|
| TCGA | RV | BRCA | TCGA-BH-A18V-06A | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| TCGA | LD | BRCA | TCGA-A7-A3J0-01A | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| TCGA | LD | BRCA | TCGA-BH-A18V-01A | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |

* LD: Li Ding group's fusion gene list
  RV: Roel Verhaak group's fusion gene list
  ChiTaRs fusion database

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.

| ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
|---|---|---|---|---|---|---|---|---|---|---|
| In-frame | ENST00000264613 | ENST00000546136 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| In-frame | ENST00000264613 | ENST00000381381 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| In-frame | ENST00000264613 | ENST00000309359 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| In-frame | ENST00000264613 | ENST00000451604 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| In-frame | ENST00000264613 | ENST00000537393 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| In-frame | ENST00000264613 | ENST00000541536 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| 5CDS-intron | ENST00000264613 | ENST00000396007 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| 5CDS-intron | ENST00000264613 | ENST00000545921 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| 5CDS-intron | ENST00000264613 | ENST00000541847 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| 5CDS-intron | ENST00000264613 | ENST00000441133 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| 5CDS-intron | ENST00000264613 | ENST00000536850 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-3CDS | ENST00000462336 | ENST00000546136 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-3CDS | ENST00000462336 | ENST00000381381 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-3CDS | ENST00000462336 | ENST00000309359 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-3CDS | ENST00000462336 | ENST00000451604 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-3CDS | ENST00000462336 | ENST00000537393 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-3CDS | ENST00000462336 | ENST00000541536 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-intron | ENST00000462336 | ENST00000396007 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-intron | ENST00000462336 | ENST00000545921 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-intron | ENST00000462336 | ENST00000541847 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-intron | ENST00000462336 | ENST00000441133 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |
| intron-intron | ENST00000462336 | ENST00000536850 | CP | chr3 | 148939434 | - | SOX5 | chr12 | 23999127 | - |


FusionProtFeatures for CP_SOX5


Main function of each fusion partner protein (from UniProt).

Hgene: CP (P00450)
Tgene: SOX5 (P35711)

Ceruloplasmin is a blue, copper-binding (6-7 atoms per molecule) glycoprotein. It has ferroxidase activity oxidizing Fe(2+) to Fe(3+) without releasing radical oxygen species. It is involved in iron transport across the cell membrane. Provides Cu(2+) ions for the ascorbate-mediated deaminase degradation of the heparan sulfate chains of GPC1. May also play a role in fetal lung development or pulmonary antioxidant defense (By similarity). {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 region features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the break point is located before the CDS.
- In-frame and retained protein features among the 13 region features.
| Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | BPloci* | TotalLen | Protein feature | Protein feature note |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000309359 | - | 1 | 15 | 193_274 | 77 | 751 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000309359 | - | 1 | 15 | 448_515 | 77 | 751 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000381381 | - | 1 | 13 | 193_274 | 77 | 643 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000381381 | - | 1 | 13 | 448_515 | 77 | 643 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000396007 | - | 0 | 7 | 193_274 | -20 | 378 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000396007 | - | 0 | 7 | 448_515 | -20 | 378 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000451604 | - | 1 | 15 | 193_274 | 90 | 764 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000451604 | - | 1 | 15 | 448_515 | 90 | 764 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000541536 | - | 0 | 12 | 193_274 | 77 | 643 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000541536 | - | 0 | 12 | 448_515 | 77 | 643 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000545921 | - | 1 | 15 | 193_274 | 80 | 754 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000545921 | - | 1 | 15 | 448_515 | 80 | 754 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000546136 | - | 0 | 14 | 193_274 | 77 | 751 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000546136 | - | 0 | 14 | 448_515 | 77 | 751 | Coiled coil | Ontology_term=ECO:0000255 |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000309359 | - | 1 | 15 | 556_624 | 77 | 751 | DNA binding | HMG box |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000381381 | - | 1 | 13 | 556_624 | 77 | 643 | DNA binding | HMG box |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000396007 | - | 0 | 7 | 556_624 | -20 | 378 | DNA binding | HMG box |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000451604 | - | 1 | 15 | 556_624 | 90 | 764 | DNA binding | HMG box |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000541536 | - | 0 | 12 | 556_624 | 77 | 643 | DNA binding | HMG box |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000545921 | - | 1 | 15 | 556_624 | 80 | 754 | DNA binding | HMG box |
| Tgene | SOX5 | chr3:148939434 | chr12:23999127 | ENST00000546136 | - | 0 | 14 | 556_624 | 77 | 751 | DNA binding | HMG box |

- In-frame and not-retained protein features among the 13 region features.

| Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | BPloci* | TotalLen | Protein feature | Protein feature note |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 209_357 | 48 | 1066 | Domain | Note=Plastocyanin-like 2 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 20_200 | 48 | 1066 | Domain | Note=Plastocyanin-like 1 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 20_357 | 48 | 1066 | Domain | Note=F5/8 type A 1 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 370_560 | 48 | 1066 | Domain | Note=Plastocyanin-like 3 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 370_718 | 48 | 1066 | Domain | Note=F5/8 type A 2 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 570_718 | 48 | 1066 | Domain | Note=Plastocyanin-like 4 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 730_1061 | 48 | 1066 | Domain | Note=F5/8 type A 3 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 730_900 | 48 | 1066 | Domain | Note=Plastocyanin-like 5 |
| Hgene | CP | chr3:148939434 | chr12:23999127 | ENST00000264613 | - | 1 | 19 | 908_1061 | 48 | 1066 | Domain | Note=Plastocyanin-like 6 |
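
The retained and not-retained listings are consistent with a simple positional rule, sketched below in Python. The rule is an assumption inferred from the rows on this page, not FusionGDB's documented procedure: a 5' partner (Hgene) is taken to contribute the residues before its break point, a 3' partner (Tgene) the residues from its break point onward, and a feature counts as retained only if it lies entirely inside the contributed part. The function name `feature_retained` and the exact boundary handling (`<=` and `>=`) are hypothetical.

```python
def feature_retained(partner: str, feat_start: int, feat_end: int, bp_loci: int) -> bool:
    """Return True if a protein feature lies entirely on the retained side.

    partner    -- "Hgene" (5' partner) or "Tgene" (3' partner)
    feat_start -- first residue of the feature (e.g. 193 for "193_274")
    feat_end   -- last residue of the feature (e.g. 274 for "193_274")
    bp_loci    -- break point position in protein coordinates (BPloci column);
                  a negative value means the break point is before the CDS
    """
    if partner == "Hgene":
        # Assumed: the 5' partner keeps everything up to the break point.
        return feat_end <= bp_loci
    if partner == "Tgene":
        # Assumed: the 3' partner keeps everything from the break point on.
        return feat_start >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# Rows taken from the listings on this page.
print(feature_retained("Tgene", 556, 624, 77))   # SOX5 HMG box
print(feature_retained("Tgene", 193, 274, -20))  # break point before the CDS
print(feature_retained("Hgene", 20, 200, 48))    # CP Plastocyanin-like 1
```

Under these assumptions the SOX5 coiled-coil and HMG-box rows come out as retained and every CP domain row as not retained, matching how the page groups them.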



FusionGeneSequence for CP_SOX5


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences.
(nt: nucleotides, aa: amino acids)
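
Each sequence record in this section starts with a FASTA-style header that packs the ORF class, both partners, and the sequence length into one underscore-separated string. The Python sketch below splits such a header into named fields. The field names, the `FusionHeader` type, and the `parse_fusion_header` function are hypothetical, and the parser assumes exactly twelve underscore-separated fields, so it would fail on a gene symbol that itself contains an underscore.

```python
from typing import NamedTuple


class FusionHeader(NamedTuple):
    orf: str        # e.g. "In-frame"
    hgene: str      # 5' partner gene symbol
    henst: str      # 5' partner Ensembl transcript
    hchr: str
    hbp: int        # 5' break point (hg19, per this page)
    hstrand: str
    tgene: str      # 3' partner gene symbol
    tenst: str      # 3' partner Ensembl transcript
    tchr: str
    tbp: int        # 3' break point
    tstrand: str
    length: int     # declared sequence length
    unit: str       # "aa" or "nt"


def parse_fusion_header(line: str) -> FusionHeader:
    """Parse a header such as '>In-frame_CP_ENST..._chr3_..._-_SOX5_..._722aa'."""
    fields = line.strip().lstrip(">").split("_")
    if len(fields) != 12:
        raise ValueError(f"expected 12 fields, got {len(fields)}")
    size = fields[11]
    unit = size[-2:]
    if unit not in ("aa", "nt"):
        raise ValueError(f"unexpected length unit in {size!r}")
    return FusionHeader(
        fields[0], fields[1], fields[2], fields[3], int(fields[4]), fields[5],
        fields[6], fields[7], fields[8], int(fields[9]), fields[10],
        int(size[:-2]), unit,
    )


header = parse_fusion_header(
    ">In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000546136_chr12_23999127_-_722aa"
)
print(header.hgene, header.hbp, header.tgene, header.tbp, header.length, header.unit)
```

The same function handles the nucleotide records, whose headers end in `nt` instead of `aa`.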

* Fusion amino acid sequences.
>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000546136_chr12_23999127_-_722aa
MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKLISVDTKLMAIKLCLHLPHTTHLPHLRRQKKVGDRVASPCLVQPWEL
LNGARAVXLMLLTPXSRGKWKSSSKTSRKKPPVLKNYSQRTGKTSFLQWDRGTLAKXKGLPRAXLRKKGNSWVXSTSXPASESSCWLPTM
SRRNXLPLRLRNSVSKWSWPSSNKNKLQDSSSSFYSNNTKSICSSNRSRFKVSCRHXXFPYSLLINGHWLQLPSKDSSSLQASAIRLDVV
TLTLFSXSQLPWQLLPQQHQAXAHSNCSSYMLPSXLQCRYLQEGSCQAYPKATLVLLYLLPAFTQTRAQTAHHPKARMKWHSHXTYQLNP
RPLMANHPHHPPLPICQLXEXTVGQAPSKPLSQQRXLVLQPELAQXVTXMTMMLSPRQSKKLGKXRSNSDGNNRCLMGRWLLXIVWVSIT
AEQKRKKQHWRVXLSNWQLNRMKKENLAMQXWISIXVEILMEVLESQSQEFIGNPEGVVAMNPTXSVQXMPSWCGLKMNGERSFKPFLTC
TTPTSARYWDLAGKLXQTXRNSHIMRSKPVSASSTWRSTLTISTSPGQSAPAWWMAKSCALVNTRQSCATGGRKCGSTSMLGNKHRSPLP
LLVLCTLEPSPWLGCPPLTCPRSTQACLAAQSLGCLLSRALTVXKERSHISKKRYRPRTSMEKFMMSTTRKRMIQMXIMGVTVKTILQDK

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000381381_chr12_23999127_-_614aa
MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKLISVDTKLMAIKLCLHLPHTTHLPHLRRQKKVGDRVASPCLVQPWEL
LNGARAVXLMLLTPXSRGKWKSSSKTSRKKPPVLKNYSQRTGKTSFLQWDRGTLAKXKGLPRAXLRKKGNSWVXSTSXPASESSCWLPTM
SRRNXLPLRLRNSVSKWSWPSSNKNKLQDSSSSFYSNNTKSICSSNRSRFKVSCRHXXFPYSLLINGHWLQLPSKDSSSLQASAIRLDVV
TLTLFSXSQLPWQLLPQQHQAXAHSNCSSYMLPSXLQCRYLQEGSCQAYPKATLVLLYLLPAFTQTRAQTAHHPKARKKQHWRVXLSNWQ
LNRMKKENLAMQXWISIXVEILMEVLESQSQEFIGNPEGVVAMNPTXSVQXMPSWCGLKMNGERSFKPFLTCTTPTSARYWDLAGKLXQT
XRNSHIMRSKPVSASSTWRSTLTISTSPGQSAPAWWMAKSCALVNTRQSCATGGRKCGSTSMLGNKHRSPLPLLVLCTLEPSPWLGCPPL

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000309359_chr12_23999127_-_722aa
MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKLISVDTKLMAIKLCLHLPHTTHLPHLRRQKKVGDRVASPCLVQPWEL
LNGARAVXLMLLTPXSRGKWKSSSKTSRKKPPVLKNYSQRTGKTSFLQWDRGTLAKXKGLPRAXLRKKGNSWVXSTSXPASESSCWLPTM
SRRNXLPLRLRNSVSKWSWPSSNKNKLQDSSSSFYSNNTKSICSSNRSRFKVSCRHXXFPYSLLINGHWLQLPSKDSSSLQASAIRLDVV
TLTLFSXSQLPWQLLPQQHQAXAHSNCSSYMLPSXLQCRYLQEGSCQAYPKATLVLLYLLPAFTQTRAQTAHHPKARMKWHSHXTYQLNP
RPLMANHPHHPPLPICQLXEXTVGQAPSKPLSQQRXLVLQPELAQXVTXMTMMLSPRQSKKLGKXRSNSDGNNRCLMGRWLLXIVWVSIT
AEQKRKKQHWRVXLSNWQLNRMKKENLAMQXWISIXVEILMEVLESQSQEFIGNPEGVVAMNPTXSVQXMPSWCGLKMNGERSFKPFLTC
TTPTSARYWDLAGKLXQTXRNSHIMRSKPVSASSTWRSTLTISTSPGQSAPAWWMAKSCALVNTRQSCATGGRKCGSTSMLGNKHRSPLP
LLVLCTLEPSPWLGCPPLTCPRSTQACLAAQSLGCLLSRALTVXKERSHISKKRYRPRTSMEKFMMSTTRKRMIQMXIMGVTVKTILQDK

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000451604_chr12_23999127_-_722aa
MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKLISVDTKLMAIKLCLHLPHTTHLPHLRRQKKVGDRVASPCLVQPWEL
LNGARAVXLMLLTPXSRGKWKSSSKTSRKKPPVLKNYSQRTGKTSFLQWDRGTLAKXKGLPRAXLRKKGNSWVXSTSXPASESSCWLPTM
SRRNXLPLRLRNSVSKWSWPSSNKNKLQDSSSSFYSNNTKSICSSNRSRFKVSCRHXXFPYSLLINGHWLQLPSKDSSSLQASAIRLDVV
TLTLFSXSQLPWQLLPQQHQAXAHSNCSSYMLPSXLQCRYLQEGSCQAYPKATLVLLYLLPAFTQTRAQTAHHPKARMKWHSHXTYQLNP
RPLMANHPHHPPLPICQLXEXTVGQAPSKPLSQQRXLVLQPELAQXVTXMTMMLSPRQSKKLGKXRSNSDGNNRCLMGRWLLXIVWVSIT
AEQKRKKQHWRVXLSNWQLNRMKKENLAMQXWISIXVEILMEVLESQSQEFIGNPEGVVAMNPTXSVQXMPSWCGLKMNGERSFKPFLTC
TTPTSARYWDLAGKLXQTXRNSHIMRSKPVSASSTWRSTLTISTSPGQSAPAWWMAKSCALVNTRQSCATGGRKCGSTSMLGNKHRSPLP
LLVLCTLEPSPWLGCPPLTCPRSTQACLAAQSLGCLLSRALTVXKERSHISKKRYRPRTSMEKFMMSTTRKRMIQMXIMGVTVKTILQDK

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000537393_chr12_23999127_-_722aa
MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKLISVDTKLMAIKLCLHLPHTTHLPHLRRQKKVGDRVASPCLVQPWEL
LNGARAVXLMLLTPXSRGKWKSSSKTSRKKPPVLKNYSQRTGKTSFLQWDRGTLAKXKGLPRAXLRKKGNSWVXSTSXPASESSCWLPTM
SRRNXLPLRLRNSVSKWSWPSSNKNKLQDSSSSFYSNNTKSICSSNRSRFKVSCRHXXFPYSLLINGHWLQLPSKDSSSLQASAIRLDVV
TLTLFSXSQLPWQLLPQQHQAXAHSNCSSYMLPSXLQCRYLQEGSCQAYPKATLVLLYLLPAFTQTRAQTAHHPKARMKWHSHXTYQLNP
RPLMANHPHHPPLPICQLXEXTVGQAPSKPLSQQRXLVLQPELAQXVTXMTMMLSPRQSKKLGKXRSNSDGNNRCLMGRWLLXIVWVSIT
AEQKRKKQHWRVXLSNWQLNRMKKENLAMQXWISIXVEILMEVLESQSQEFIGNPEGVVAMNPTXSVQXMPSWCGLKMNGERSFKPFLTC
TTPTSARYWDLAGKLXQTXRNSHIMRSKPVSASSTWRSTLTISTSPGQSAPAWWMAKSCALVNTRQSCATGGRKCGSTSMLGNKHRSPLP
LLVLCTLEPSPWLGCPPLTCPRSTQACLAAQSLGCLLSRALTVXKERSHISKKRYRPRTSMEKFMMSTTRKRMIQMXIMGVTVKTILQDK

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000541536_chr12_23999127_-_614aa
MKILILGIFLFLCSTPAWAKEKHYYIGIIETTWDYASDHGEKKLISVDTKLMAIKLCLHLPHTTHLPHLRRQKKVGDRVASPCLVQPWEL
LNGARAVXLMLLTPXSRGKWKSSSKTSRKKPPVLKNYSQRTGKTSFLQWDRGTLAKXKGLPRAXLRKKGNSWVXSTSXPASESSCWLPTM
SRRNXLPLRLRNSVSKWSWPSSNKNKLQDSSSSFYSNNTKSICSSNRSRFKVSCRHXXFPYSLLINGHWLQLPSKDSSSLQASAIRLDVV
TLTLFSXSQLPWQLLPQQHQAXAHSNCSSYMLPSXLQCRYLQEGSCQAYPKATLVLLYLLPAFTQTRAQTAHHPKARKKQHWRVXLSNWQ
LNRMKKENLAMQXWISIXVEILMEVLESQSQEFIGNPEGVVAMNPTXSVQXMPSWCGLKMNGERSFKPFLTCTTPTSARYWDLAGKLXQT
XRNSHIMRSKPVSASSTWRSTLTISTSPGQSAPAWWMAKSCALVNTRQSCATGGRKCGSTSMLGNKHRSPLPLLVLCTLEPSPWLGCPPL


* Fusion transcript sequences (only coding sequence (CDS) region).
>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000546136_chr12_23999127_-_2168nt
ATGAAGATTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAA
ACGACTTGGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTG
CCCCACACAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTC
CTGAACGGCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAA
CCCCCAGTATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTC
CCGAGAGCTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATG
AGCAGAAGAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGC
AGCAGCAGCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCG
TATTCCCTCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTG
ACCCTTACCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTAT
ATGCTGCCCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTA
CCAGCATTCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCA
AGACCTCTGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCT
CTGTCCCAGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAG
AAGCTCGGCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACT
GCCGAACAGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAA
TGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCA
ATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGC
ACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTC
TCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCA
TTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCA
CTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCC
CAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCA
ATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAG

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000381381_chr12_23999127_-_1844nt
ATGAAGATTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAA
ACGACTTGGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTG
CCCCACACAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTC
CTGAACGGCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAA
CCCCCAGTATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTC
CCGAGAGCTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATG
AGCAGAAGAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGC
AGCAGCAGCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCG
TATTCCCTCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTG
ACCCTTACCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTAT
ATGCTGCCCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTA
CCAGCATTCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAG
TTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAATGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGT
CAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATG
AACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACC
TAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAA
AGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACT
TCAATGTTGGGCAACAAGCACAGATCCCCATTGCCACTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTC
ACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGC
CACATATCAAAGAAGAGATACAGGCCGAGGACATCAATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATT

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000309359_chr12_23999127_-_2168nt
ATGAAGATTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAA
ACGACTTGGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTG
CCCCACACAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTC
CTGAACGGCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAA
CCCCCAGTATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTC
CCGAGAGCTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATG
AGCAGAAGAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGC
AGCAGCAGCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCG
TATTCCCTCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTG
ACCCTTACCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTAT
ATGCTGCCCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTA
CCAGCATTCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCA
AGACCTCTGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCT
CTGTCCCAGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAG
AAGCTCGGCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACT
GCCGAACAGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAA
TGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCA
ATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGC
ACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTC
TCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCA
TTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCA
CTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCC
CAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCA
ATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAG

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000451604_chr12_23999127_-_2168nt
ATGAAGATTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAA
ACGACTTGGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTG
CCCCACACAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTC
CTGAACGGCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAA
CCCCCAGTATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTC
CCGAGAGCTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATG
AGCAGAAGAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGC
AGCAGCAGCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCG
TATTCCCTCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTG
ACCCTTACCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTAT
ATGCTGCCCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTA
CCAGCATTCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCA
AGACCTCTGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCT
CTGTCCCAGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAG
AAGCTCGGCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACT
GCCGAACAGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAA
TGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCA
ATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGC
ACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTC
TCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCA
TTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCA
CTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCC
CAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCA
ATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAG

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000537393_chr12_23999127_-_2168nt
ATGAAGATTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAA
ACGACTTGGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTG
CCCCACACAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTC
CTGAACGGCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAA
CCCCCAGTATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTC
CCGAGAGCTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATG
AGCAGAAGAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGC
AGCAGCAGCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCG
TATTCCCTCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTG
ACCCTTACCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTAT
ATGCTGCCCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTA
CCAGCATTCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCA
AGACCTCTGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCT
CTGTCCCAGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAG
AAGCTCGGCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACT
GCCGAACAGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAA
TGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCA
ATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGC
ACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTC
TCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCA
TTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCA
CTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCC
CAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCA
ATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAG

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000541536_chr12_23999127_-_1844nt
ATGAAGATTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAA
ACGACTTGGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTG
CCCCACACAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTC
CTGAACGGCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAA
CCCCCAGTATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTC
CCGAGAGCTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATG
AGCAGAAGAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGC
AGCAGCAGCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCG
TATTCCCTCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTG
ACCCTTACCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTAT
ATGCTGCCCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTA
CCAGCATTCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAG
TTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAATGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGT
CAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATG
AACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACC
TAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAA
AGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACT
TCAATGTTGGGCAACAAGCACAGATCCCCATTGCCACTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTC
ACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGC
CACATATCAAAGAAGAGATACAGGCCGAGGACATCAATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATT
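
The CDS records above and the amino acid records earlier in this section can be related by ordinary codon translation. The Python sketch below uses the standard genetic code and is not FusionGDB's own pipeline; the function name `translate` and the `stop_symbol` parameter are hypothetical. In the listings on this page, stop codons appear to be written as `X`, so that is the default here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str, stop_symbol: str = "X") -> str:
    """Translate a nucleotide string codon by codon, starting at position 0.

    Stop codons are emitted as `stop_symbol` rather than ending translation;
    a trailing partial codon is ignored.
    """
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 90 nt of the CP_SOX5 CDS records listed above.
first_line = (
    "ATGAAGATTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCC"
    "TGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAA"
)
print(translate(first_line))
```

Translating the first 90 nucleotides this way gives `MKILILGIFLFLCSTPAWAKEKHYYIGIIE`, which matches the start of the amino acid records.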


* Fusion transcript sequences (Full-length transcript).
>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000546136_chr12_23999127_-_7144nt
AGAGTTATGCACACCCTAATGCCTCCAACAATAACTGTTGACTTTTTATTTTCAGTCAGAGAAGCCTGGCAACCAAGAACTGTTTTTTTG
GTGGTTTACGAGAACTTAACTGAATTGGAAAATATTTGCTTTAATGAAACAATTTACTCTTGTGCAACACTAAATTGTGTCAATCAAGCA
AATAAGGAAGAAAGTCTTATTTATAAAATTGCCTGCTCCTGATTTTACTTCATTTCTTCTCAGGCTCCAAGAAGGGGAAAAAAATGAAGA
TTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAAACGACTT
GGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTGCCCCACA
CAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTCCTGAACG
GCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAACCCCCAG
TATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTCCCGAGAG
CTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATGAGCAGAA
GAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGCAGCAGCA
GCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCGTATTCCC
TCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTGACCCTTA
CCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTATATGCTGC
CCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTACCAGCAT
TCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCAAGACCTC
TGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCTCTGTCCC
AGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAGAAGCTCG
GCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACTGCCGAAC
AGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAATGATGGA
TTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACC
CCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGCACAACTC
CAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAA
GCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGA
ATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCACTGCTGG
TGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCC
TGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCAATGGAGA
AATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAGCCAACTG
ATAAGGGTCAAAAGATTGTTGTGACCTTAGGACTTAAAGAAGCCCTAACTGGTTCATCCTTACCAGTGGCCAAGCACATTAACTTTCTCA
TACACTGACTGTTACTTTAACTGTTAGTCTTAAATAGTTGGGACATCAGCTGACTAATAGACCTCAGCCTCAAAAGGCTTGGAAAGAAAA
AACAAATACAACAAGCAAACAACAATATCAACAACAAGAGATTGAAATAAGCTATGGGTAAAATAATGCCAGTAATTCAGCTGCTACATC
CAAGCACTGAAGTCTTACCCGTCAACTTTTTTTTTTTTTTAAATAAACTTTATGGCTGTTTGTTCTACAATGTTCTAGAAATTCTCACTC
AGGTACACAGTGCCAACAAGTGGCTTGTGAATGTGTTTTGTTGTTTTGTGCTACAATTTTTAAAAAGAAAAAAGTTTTGTTTTGTTTTTT
GGGGTTTCTGGGTTTTTTCCTTTTCTTTTTCTTTCCTTTCATTTTTTTTCTTTGTAATGCACCTGACAGAAAAAAAAGAAAAATGAATTT
CTCTTTACTTCTCTCCACCTTCTCCATCTCTCTACTTTAAAGATGGAAGTCTGTGCATGAGGGGAAAGAGGGAAAAAGAGCCTGTTTTTA
ACTTCCTTGCTATCCACCACAAAATAAGCAATTATTTTCTTTAGAGGACTTTATCTATTGCACACCACACTACATCTTTGAGCAAGTGCC
AAATTTGTACTGAAGTGTTGACCAAGTTCATTTTTTCTCTTTACTTTTTCCTTTTCCTTCTTAAGTTAGGACAGTGTTAAATCTTAGACA
ATCCCTTGAAAAACCTGAAATACCAGCAGCTGGTGAGATTTGACTTTTTTTTTTAATGGAAACTGTAGGTGCTGTTCTCAGGTGAAAAGA
GAGAGAGAGAGAGAGACATAAGAAATTTAGAGAAAAATATTTTCTGATCTTGGATTTTTGTGTGTATGTATGTATGTGATTATGGTACTA
ATAATAGGAATAACGTTGGACCATTGTGAGTTAAACCCACATCTGGGGATGAAATCCCACATCCTCCCAAGTGACTGGTCTAGAAATAAT
CTTGACCTTGACTTTGCACTTCAAATGACAACTTAACCAAGTATAGGGCTCAGAAATTATATTTTTAAATGTCTGATTATTATTGGATGG
ATCAGGTGGCCCTGTGTAATAGAGGTGTGCATGTATAACATGGAAGCTACTAGCAAACTGCTCCCAGATGTCCTTTCTCCCTGGTCAGTT
GGTTCCATTAACGTTTGCTACTTAGTGATTTTTGTTTTTCCTGTTGATATTTTGAGCAAAACAATCATTGTTTTCATTGAATATATTTGG
CCATTTTTTCAGACAAATAGAATTAGCTTATTTCTTCAACATTCCATCCTTTCCCGATCAGGAAATGAAACTGATGATTTTATAAGGTAT
TTTTCACCCCTCCATGAAGTGAGGTGGAGGCCTTTAGCATTTCAGAAGTGTGGGCCATATGTAGTTCATGCCATAAAAAGTAGGATTTAA
TTAAAAGTCATTGCAGCCCAATAAAATGGAGCCTGGCTGCACCCAGGGATCCTTGCCACTGCTCTTCCCTTGCTGTCAGATTAATCCACT
GAAGTCCAACTTTGGTTCAAGCAGAGTATTTGCAAAGAGCAACAACTGAATGTGATGGGACTGCTTATGTAGATTTTGCCAGCCAAATGC
CAAGGCAGTTGTAGGGCCTGTACAAATAAATGCAAAATCATTTCAAGTCAATTGCCATTATTTGTATTGAAGTATCAGATAGATAGTAAA
TACTGCAACTAGTAGCTTGATGTGCTATAGTTTTCACTCCAGTCATCATTTTCCTATCTCACCCCCCGAAACACCACCCTAAAGTTGGAT
TTTTACATATAAATAAAAAAAGAATCCCTTTTATTTTTCTCTCTCTCAGAAGACTTGCTCTGGGGTTTTGCTTGGAGGAGCTGATCATAT
CCTGCATGTCAGAGATTTCATTTTGTTTGTTTTTCTGTTTGTTTGTTTTTGATGACTTTATTTACATTTGAACTGCCTCTTTTTGCTAGT
GTTGTAATGATGATGATGATAAAATAATAATAATAATAGTAATAATAATAATAATGGTCAGCTGTTCCCAGATTAGTAAGTTATGTATAA
TTTATTTTATTTCTCCCTTTCTATTTTATTCTGCTCTGATTTGGAAGTGCAGCCAATAAGCTGTTAGATATTTAAAACAAATTTCCATCT
CCAACTCATATATGTATGTATATGTATATATATATATACATATATATACACACATATACACACACACACACACATTTACCCACAATTTTA
CTTCTCTTTCCGTAAGTTTTCTCTTTTTTAAGAATTTTGGCCCCAACAATATCAGTTCTGATGTTTAAAACAAAGGAGTGAATTAATAGG
AGGGTCGTGTTCAATCTTAATTAACTTATGCTCTCATCACTCATTTTAGTTATTTAATTGCCACAGTCATCTAGAGCCAATTTTTTTTTG
TTTTTGTTTGTTTTGTTTTTAAGCACTCTGGATAAGGAAAAAATAGACACTTGTTTTTATATCATTTTTATGTACAATGTGAAAACAGAT
TTTTAAAAAATTTGTTCTCTGTCTTTGTTAACAAATTCCTTCCGATTTATGTGGGTTCACCTCTCACGTTTGCTGGATACTTACTTTACA
CCATGTTATGAAAGATGCTTTGTTTTCATTTCATGAGTTTGGGCTACAAGAATACTTTGTTTTAGCTCCAGGTAGATGCCATTGAAGGCA
TTCATGGCTGAAATCAATTCTAACACACTTACCGATGAATTGACAACACTTTGGTGTTTGGTCTTTTTTTTGTTATATTTTGTTTTATTT
GAAGGGGAAAACAAAATTGTTAATGGTGACTTTTAACTGCTGAAATTCAAAATTTGATTTTAGTTAGGTTTATGTCCGAGACTAAACTGT
GCATGAACCGAAACAAAAAGCAAGTTCATACCGCTGCATTCTGTTCTCTTGTTCTGACGGTTAATTCACTGGACATCAAATTTTCTTTGC
CCTAGTTTTCCCCAAAAGAAAACAAAAGCAAAAATAGTTATATTAATGTTGAAAGTGCTTTGAAGTCACTTGATAAAAGGTGCTATGAAT
AGCAACTTACTGTGTTACTACATTGCCTTTTTGGCATTGGTTGATTTATAATTTTGGTAAAGGATCCTAAAACCCTGTATTCTTTTTCTG
TGCCCCATAATGACCAAAAAAGATAAAGAAAATGAGAGTAAGATCAAAAGATCAAACTAAAACATAAACAGTCTCCTTCCTGTCTGCTTT
CTTACCTATCTGTTCACACTTTATCTCTTTCCCTTTCTCTCTCTCTCTCACATGCGCGCGCGCGCGCGCGCACACACACACACACACACA
CTTGCCCCAACACTGAGAAATAATCTTGCCAGAATTCTGGTTTTTATAGTTGTTGTAGTCCGTTTCTCTAAAAATAGTCCTGTTTTCATA
TGAATCTAAGAGAACATTACCGGAGAAAATAAAGTTACTCATTTATAGTTACTAATAATAAGTTAGTGATTAACTTACCACAATTGGATA
AAATATAAAACTATTCTTCAGCACATTTATACCTAAATTAGTGAAGTCCATTTGTGAAGTTCATTTGGCATCTGCAGTTAAGAGTTAACC
ATGTATAAAATCCCTCCCAAAGAACATAATCATCGTTAGAACCAAATCCTCTTATAAAGAACAAATATAAAAAAGTTTTCTGTGGTGAAG
CTATTTTATAGATTTGATTTCCGGAGTAGAATTTTACCTTAATTCACTTAAAGAAAATTAACCATTTTTTGGTCCAACTTCAGATATTTT
CTTAATCCAGAAGCTGTGACTGTTCCCAGAAGAAACCTCTTATATCACATTTAAAACATAAAAAATAATTACACTTTCACTCATTCTAAT
GGGTAAATTGTTGAATTGTAATAAACAACTGTATATTTACAAACAGGTTTATAAGGTATTTGTCTATCAAATTAATAATAAAATAAAACT
TACGGGGTCTGCAAGGAAAACTATGAAGGCCACTTAAAGTGGTTCAGGTATCCCTTTGTCTTGTAGGGCATAGCTGTTGGTTTAGCCAGG
CTTTTACAGTATTTAATTTTAAAGTATCCTATAACTATCAACTTCCCAGGTTGAAGACGATGTGTTGAGTTTCCTACTGATTGATTGATT
CCTGTCCTCCCCAGTGTTTCCGTCACTGGTTCACTAAAACAGTATTTATATAGCTCCACTGGCTCTAAAGCTCTTAGTCCTTCTAATATT
TTGGATTTTACAAGTAAAAATGGAAAAAAAATAGAAAAGAGACAATCAAATGCCTGGAGCTTAAAACAAAGTATGTGCAACCTACCATCT
CACTTGAAATTTAATAAAATAATAAGTAATTATGTAAATATAACATAGAGTTATAGATTTATATTTTGTTCATAACACATAGTGTAATAT
AAGTTGTATATTTTCATGTTTTTGGTTTTATGTTATCATTCATGCCACAATAAAAATAAAACAGGAGTTTATGTGCTCTTAAAAAAAAGA
TGTGGGTTGCCACCAACCTGTTTTTCGTTTTTGTTTTTTGTTTATTTTATTTTATTTTTTTGCATTCTCCTTTTTCAGTATTACTGCCAT

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000381381_chr12_23999127_-_4028nt
AGAGTTATGCACACCCTAATGCCTCCAACAATAACTGTTGACTTTTTATTTTCAGTCAGAGAAGCCTGGCAACCAAGAACTGTTTTTTTG
GTGGTTTACGAGAACTTAACTGAATTGGAAAATATTTGCTTTAATGAAACAATTTACTCTTGTGCAACACTAAATTGTGTCAATCAAGCA
AATAAGGAAGAAAGTCTTATTTATAAAATTGCCTGCTCCTGATTTTACTTCATTTCTTCTCAGGCTCCAAGAAGGGGAAAAAAATGAAGA
TTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAAACGACTT
GGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTGCCCCACA
CAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTCCTGAACG
GCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAACCCCCAG
TATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTCCCGAGAG
CTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATGAGCAGAA
GAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGCAGCAGCA
GCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCGTATTCCC
TCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTGACCCTTA
CCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTATATGCTGC
CCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTACCAGCAT
TCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACA
GAATGAAGAAGGAAAATTTAGCCATGCAATGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAAT
TTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAG
AAAGATCCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAA
ACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCAC
CTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGT
TGGGCAACAAGCACAGATCCCCATTGCCACTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCC
CTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATAT
CAAAGAAGAGATACAGGCCGAGGACATCAATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAG
TGACAGTGAAAACCATATTGCAGGACAAGCCAACTGATAAGGGTCAAAAGATTGTTGTGACCTTAGGACTTAAAGAAGCCCTAACTGGTT
CATCCTTACCAGTGGCCAAGCACATTAACTTTCTCATACACTGACTGTTACTTTAACTGTTAGTCTTAAATAGTTGGGACATCAGCTGAC
TAATAGACCTCAGCCTCAAAAGGCTTGGAAAGAAAAAACAAATACAACAAGCAAACAACAATATCAACAACAAGAGATTGAAATAAGCTA
TGGGTAAAATAATGCCAGTAATTCAGCTGCTACATCCAAGCACTGAAGTCTTACCCGTCAACTTTTTTTTTTTTTTAAATAAACTTTATG
GCTGTTTGTTCTACAATGTTCTAGAAATTCTCACTCAGGTACACAGTGCCAACAAGTGGCTTGTGAATGTGTTTTGTTGTTTTGTGCTAC
AATTTTTAAAAAGAAAAAAGTTTTGTTTTGTTTTTTGGGGTTTCTGGGTTTTTTCCTTTTCTTTTTCTTTCCTTTCATTTTTTTTCTTTG
TAATGCACCTGACAGAAAAAAAAGAAAAATGAATTTCTCTTTACTTCTCTCCACCTTCTCCATCTCTCTACTTTAAAGATGGAAGTCTGT
GCATGAGGGGAAAGAGGGAAAAAGAGCCTGTTTTTAACTTCCTTGCTATCCACCACAAAATAAGCAATTATTTTCTTTAGAGGACTTTAT
CTATTGCACACCACACTACATCTTTGAGCAAGTGCCAAATTTGTACTGAAGTGTTGACCAAGTTCATTTTTTCTCTTTACTTTTTCCTTT
TCCTTCTTAAGTTAGGACAGTGTTAAATCTTAGACAATCCCTTGAAAAACCTGAAATACCAGCAGCTGGTGAGATTTGACTTTTTTTTTT
AATGGAAACTGTAGGTGCTGTTCTCAGGTGAAAAGAGAGAGAGAGAGAGAGACATAAGAAATTTAGAGAAAAATATTTTCTGATCTTGGA
TTTTTGTGTGTATGTATGTATGTGATTATGGTACTAATAATAGGAATAACGTTGGACCATTGTGAGTTAAACCCACATCTGGGGATGAAA
TCCCACATCCTCCCAAGTGACTGGTCTAGAAATAATCTTGACCTTGACTTTGCACTTCAAATGACAACTTAACCAAGTATAGGGCTCAGA
AATTATATTTTTAAATGTCTGATTATTATTGGATGGATCAGGTGGCCCTGTGTAATAGAGGTGTGCATGTATAACATGGAAGCTACTAGC
AAACTGCTCCCAGATGTCCTTTCTCCCTGGTCAGTTGGTTCCATTAACGTTTGCTACTTAGTGATTTTTGTTTTTCCTGTTGATATTTTG
AGCAAAACAATCATTGTTTTCATTGAATATATTTGGCCATTTTTTCAGACAAATAGAATTAGCTTATTTCTTCAACATTCCATCCTTTCC
CGATCAGGAAATGAAACTGATGATTTTATAAGGTATTTTTCACCCCTCCATGAAGTGAGGTGGAGGCCTTTAGCATTTCAGAAGTGTGGG
CCATATGTAGTTCATGCCATAAAAAGTAGGATTTAATTAAAAGTCATTGCAGCCCAATAAAATGGAGCCTGGCTGCACCCAGGGATCCTT
GCCACTGCTCTTCCCTTGCTGTCAGATTAATCCACTGAAGTCCAACTTTGGTTCAAGCAGAGTATTTGCAAAGAGCAACAACTGAATGTG
ATGGGACTGCTTATGTAGATTTTGCCAGCCAAATGCCAAGGCAGTTGTAGGGCCTGTACAAATAAATGCAAAATCATTTCAAGTCAATTG
CCATTATTTGTATTGAAGTATCAGATAGATAGTAAATACTGCAACTAGTAGCTTGATGTGCTATAGTTTTCACTCCAGTCATCATTTTCC

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000309359_chr12_23999127_-_4352nt
AGAGTTATGCACACCCTAATGCCTCCAACAATAACTGTTGACTTTTTATTTTCAGTCAGAGAAGCCTGGCAACCAAGAACTGTTTTTTTG
GTGGTTTACGAGAACTTAACTGAATTGGAAAATATTTGCTTTAATGAAACAATTTACTCTTGTGCAACACTAAATTGTGTCAATCAAGCA
AATAAGGAAGAAAGTCTTATTTATAAAATTGCCTGCTCCTGATTTTACTTCATTTCTTCTCAGGCTCCAAGAAGGGGAAAAAAATGAAGA
TTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAAACGACTT
GGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTGCCCCACA
CAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTCCTGAACG
GCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAACCCCCAG
TATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTCCCGAGAG
CTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATGAGCAGAA
GAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGCAGCAGCA
GCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCGTATTCCC
TCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTGACCCTTA
CCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTATATGCTGC
CCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTACCAGCAT
TCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCAAGACCTC
TGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCTCTGTCCC
AGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAGAAGCTCG
GCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACTGCCGAAC
AGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAATGATGGA
TTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACC
CCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGCACAACTC
CAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAA
GCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGA
ATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCACTGCTGG
TGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCC
TGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCAATGGAGA
AATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAGCCAACTG
ATAAGGGTCAAAAGATTGTTGTGACCTTAGGACTTAAAGAAGCCCTAACTGGTTCATCCTTACCAGTGGCCAAGCACATTAACTTTCTCA
TACACTGACTGTTACTTTAACTGTTAGTCTTAAATAGTTGGGACATCAGCTGACTAATAGACCTCAGCCTCAAAAGGCTTGGAAAGAAAA
AACAAATACAACAAGCAAACAACAATATCAACAACAAGAGATTGAAATAAGCTATGGGTAAAATAATGCCAGTAATTCAGCTGCTACATC
CAAGCACTGAAGTCTTACCCGTCAACTTTTTTTTTTTTTTAAATAAACTTTATGGCTGTTTGTTCTACAATGTTCTAGAAATTCTCACTC
AGGTACACAGTGCCAACAAGTGGCTTGTGAATGTGTTTTGTTGTTTTGTGCTACAATTTTTAAAAAGAAAAAAGTTTTGTTTTGTTTTTT
GGGGTTTCTGGGTTTTTTCCTTTTCTTTTTCTTTCCTTTCATTTTTTTTCTTTGTAATGCACCTGACAGAAAAAAAAGAAAAATGAATTT
CTCTTTACTTCTCTCCACCTTCTCCATCTCTCTACTTTAAAGATGGAAGTCTGTGCATGAGGGGAAAGAGGGAAAAAGAGCCTGTTTTTA
ACTTCCTTGCTATCCACCACAAAATAAGCAATTATTTTCTTTAGAGGACTTTATCTATTGCACACCACACTACATCTTTGAGCAAGTGCC
AAATTTGTACTGAAGTGTTGACCAAGTTCATTTTTTCTCTTTACTTTTTCCTTTTCCTTCTTAAGTTAGGACAGTGTTAAATCTTAGACA
ATCCCTTGAAAAACCTGAAATACCAGCAGCTGGTGAGATTTGACTTTTTTTTTTAATGGAAACTGTAGGTGCTGTTCTCAGGTGAAAAGA
GAGAGAGAGAGAGAGACATAAGAAATTTAGAGAAAAATATTTTCTGATCTTGGATTTTTGTGTGTATGTATGTATGTGATTATGGTACTA
ATAATAGGAATAACGTTGGACCATTGTGAGTTAAACCCACATCTGGGGATGAAATCCCACATCCTCCCAAGTGACTGGTCTAGAAATAAT
CTTGACCTTGACTTTGCACTTCAAATGACAACTTAACCAAGTATAGGGCTCAGAAATTATATTTTTAAATGTCTGATTATTATTGGATGG
ATCAGGTGGCCCTGTGTAATAGAGGTGTGCATGTATAACATGGAAGCTACTAGCAAACTGCTCCCAGATGTCCTTTCTCCCTGGTCAGTT
GGTTCCATTAACGTTTGCTACTTAGTGATTTTTGTTTTTCCTGTTGATATTTTGAGCAAAACAATCATTGTTTTCATTGAATATATTTGG
CCATTTTTTCAGACAAATAGAATTAGCTTATTTCTTCAACATTCCATCCTTTCCCGATCAGGAAATGAAACTGATGATTTTATAAGGTAT
TTTTCACCCCTCCATGAAGTGAGGTGGAGGCCTTTAGCATTTCAGAAGTGTGGGCCATATGTAGTTCATGCCATAAAAAGTAGGATTTAA
TTAAAAGTCATTGCAGCCCAATAAAATGGAGCCTGGCTGCACCCAGGGATCCTTGCCACTGCTCTTCCCTTGCTGTCAGATTAATCCACT
GAAGTCCAACTTTGGTTCAAGCAGAGTATTTGCAAAGAGCAACAACTGAATGTGATGGGACTGCTTATGTAGATTTTGCCAGCCAAATGC
CAAGGCAGTTGTAGGGCCTGTACAAATAAATGCAAAATCATTTCAAGTCAATTGCCATTATTTGTATTGAAGTATCAGATAGATAGTAAA
TACTGCAACTAGTAGCTTGATGTGCTATAGTTTTCACTCCAGTCATCATTTTCCTATCTCACCCCCCGAAACACCACCCTAAAGTTGGAT

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000451604_chr12_23999127_-_4298nt
AGAGTTATGCACACCCTAATGCCTCCAACAATAACTGTTGACTTTTTATTTTCAGTCAGAGAAGCCTGGCAACCAAGAACTGTTTTTTTG
GTGGTTTACGAGAACTTAACTGAATTGGAAAATATTTGCTTTAATGAAACAATTTACTCTTGTGCAACACTAAATTGTGTCAATCAAGCA
AATAAGGAAGAAAGTCTTATTTATAAAATTGCCTGCTCCTGATTTTACTTCATTTCTTCTCAGGCTCCAAGAAGGGGAAAAAAATGAAGA
TTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAAACGACTT
GGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTGCCCCACA
CAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTCCTGAACG
GCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAACCCCCAG
TATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTCCCGAGAG
CTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATGAGCAGAA
GAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGCAGCAGCA
GCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCGTATTCCC
TCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTGACCCTTA
CCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTATATGCTGC
CCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTACCAGCAT
TCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCAAGACCTC
TGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCTCTGTCCC
AGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAGAAGCTCG
GCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACTGCCGAAC
AGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAATGATGGA
TTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACC
CCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGCACAACTC
CAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAA
GCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGA
ATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCACTGCTGG
TGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCC
TGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCAATGGAGA
AATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAGCCAACTG
ATAAGGGTCAAAAGATTGTTGTGACCTTAGGACTTAAAGAAGCCCTAACTGGTTCATCCTTACCAGTGGCCAAGCACATTAACTTTCTCA
TACACTGACTGTTACTTTAACTGTTAGTCTTAAATAGTTGGGACATCAGCTGACTAATAGACCTCAGCCTCAAAAGGCTTGGAAAGAAAA
AACAAATACAACAAGCAAACAACAATATCAACAACAAGAGATTGAAATAAGCTATGGGTAAAATAATGCCAGTAATTCAGCTGCTACATC
CAAGCACTGAAGTCTTACCCGTCAACTTTTTTTTTTTTTTAAATAAACTTTATGGCTGTTTGTTCTACAATGTTCTAGAAATTCTCACTC
AGGTACACAGTGCCAACAAGTGGCTTGTGAATGTGTTTTGTTGTTTTGTGCTACAATTTTTAAAAAGAAAAAAGTTTTGTTTTGTTTTTT
GGGGTTTCTGGGTTTTTTCCTTTTCTTTTTCTTTCCTTTCATTTTTTTTCTTTGTAATGCACCTGACAGAAAAAAAAGAAAAATGAATTT
CTCTTTACTTCTCTCCACCTTCTCCATCTCTCTACTTTAAAGATGGAAGTCTGTGCATGAGGGGAAAGAGGGAAAAAGAGCCTGTTTTTA
ACTTCCTTGCTATCCACCACAAAATAAGCAATTATTTTCTTTAGAGGACTTTATCTATTGCACACCACACTACATCTTTGAGCAAGTGCC
AAATTTGTACTGAAGTGTTGACCAAGTTCATTTTTTCTCTTTACTTTTTCCTTTTCCTTCTTAAGTTAGGACAGTGTTAAATCTTAGACA
ATCCCTTGAAAAACCTGAAATACCAGCAGCTGGTGAGATTTGACTTTTTTTTTTAATGGAAACTGTAGGTGCTGTTCTCAGGTGAAAAGA
GAGAGAGAGAGAGAGACATAAGAAATTTAGAGAAAAATATTTTCTGATCTTGGATTTTTGTGTGTATGTATGTATGTGATTATGGTACTA
ATAATAGGAATAACGTTGGACCATTGTGAGTTAAACCCACATCTGGGGATGAAATCCCACATCCTCCCAAGTGACTGGTCTAGAAATAAT
CTTGACCTTGACTTTGCACTTCAAATGACAACTTAACCAAGTATAGGGCTCAGAAATTATATTTTTAAATGTCTGATTATTATTGGATGG
ATCAGGTGGCCCTGTGTAATAGAGGTGTGCATGTATAACATGGAAGCTACTAGCAAACTGCTCCCAGATGTCCTTTCTCCCTGGTCAGTT
GGTTCCATTAACGTTTGCTACTTAGTGATTTTTGTTTTTCCTGTTGATATTTTGAGCAAAACAATCATTGTTTTCATTGAATATATTTGG
CCATTTTTTCAGACAAATAGAATTAGCTTATTTCTTCAACATTCCATCCTTTCCCGATCAGGAAATGAAACTGATGATTTTATAAGGTAT
TTTTCACCCCTCCATGAAGTGAGGTGGAGGCCTTTAGCATTTCAGAAGTGTGGGCCATATGTAGTTCATGCCATAAAAAGTAGGATTTAA
TTAAAAGTCATTGCAGCCCAATAAAATGGAGCCTGGCTGCACCCAGGGATCCTTGCCACTGCTCTTCCCTTGCTGTCAGATTAATCCACT
GAAGTCCAACTTTGGTTCAAGCAGAGTATTTGCAAAGAGCAACAACTGAATGTGATGGGACTGCTTATGTAGATTTTGCCAGCCAAATGC
CAAGGCAGTTGTAGGGCCTGTACAAATAAATGCAAAATCATTTCAAGTCAATTGCCATTATTTGTATTGAAGTATCAGATAGATAGTAAA

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000537393_chr12_23999127_-_2967nt
AGAGTTATGCACACCCTAATGCCTCCAACAATAACTGTTGACTTTTTATTTTCAGTCAGAGAAGCCTGGCAACCAAGAACTGTTTTTTTG
GTGGTTTACGAGAACTTAACTGAATTGGAAAATATTTGCTTTAATGAAACAATTTACTCTTGTGCAACACTAAATTGTGTCAATCAAGCA
AATAAGGAAGAAAGTCTTATTTATAAAATTGCCTGCTCCTGATTTTACTTCATTTCTTCTCAGGCTCCAAGAAGGGGAAAAAAATGAAGA
TTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAAACGACTT
GGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTGCCCCACA
CAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTCCTGAACG
GCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAACCCCCAG
TATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTCCCGAGAG
CTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATGAGCAGAA
GAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGCAGCAGCA
GCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCGTATTCCC
TCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTGACCCTTA
CCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTATATGCTGC
CCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTACCAGCAT
TCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGATGAAGTGGCACAGCCACTGAACCTATCAGCTAAACCCAAGACCTC
TGATGGCAAATCACCCACATCACCCACCTCTCCCCATATGCCAGCTCTGAGAATAAACAGTGGGGCAGGCCCCCTCAAAGCCTCTGTCCC
AGCAGCGTTAGCTAGTCCTTCAGCCAGAGTTAGCACAATAGGTTACTTAAATGACCATGATGCTGTCACCAAGGCAATCCAAGAAGCTCG
GCAAATGAAGGAGCAACTCCGACGGGAACAACAGGTGCTTGATGGGAAGGTGGCTGTTGTGAATAGTCTGGGTCTCAATAACTGCCGAAC
AGAAAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACAGAATGAAGAAGGAAAATTTAGCCATGCAATGATGGA
TTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAATTTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACC
CCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAGAAAGATCCTTCAAGCCTTTCCTGACATGCACAACTC
CAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAAACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAA
GCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCACCTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGA
ATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGTTGGGCAACAAGCACAGATCCCCATTGCCACTGCTGG
TGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCCCTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCC
TGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATATCAAAGAAGAGATACAGGCCGAGGACATCAATGGAGA
AATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAGTGACAGTGAAAACCATATTGCAGGACAAGCCAACTG
ATAAGGGTCAAAAGATTGTTGTGACCTTAGGACTTAAAGAAGCCCTAACTGGTTCATCCTTACCAGTGGCCAAGCACATTAACTTTCTCA
TACACTGACTGTTACTTTAACTGTTAGTCTTAAATAGTTGGGACATCAGCTGACTAATAGACCTCAGCCTCAAAAGGCTTGGAAAGAAAA
AACAAATACAACAAGCAAACAACAATATCAACAACAAGAGATTGAAATAAGCTATGGGTAAAATAATGCCAGTAATTCAGCTGCTACATC
CAAGCACTGAAGTCTTACCCGTCAACTTTTTTTTTTTTTTAAATAAACTTTATGGCTGTTTGTTCTACAATGTTCTAGAAATTCTCACTC
AGGTACACAGTGCCAACAAGTGGCTTGTGAATGTGTTTTGTTGTTTTGTGCTACAATTTTTAAAAAGAAAAAAGTTTTGTTTTGTTTTTT

>In-frame_CP_ENST00000264613_chr3_148939434_-_SOX5_ENST00000541536_chr12_23999127_-_2539nt
AGAGTTATGCACACCCTAATGCCTCCAACAATAACTGTTGACTTTTTATTTTCAGTCAGAGAAGCCTGGCAACCAAGAACTGTTTTTTTG
GTGGTTTACGAGAACTTAACTGAATTGGAAAATATTTGCTTTAATGAAACAATTTACTCTTGTGCAACACTAAATTGTGTCAATCAAGCA
AATAAGGAAGAAAGTCTTATTTATAAAATTGCCTGCTCCTGATTTTACTTCATTTCTTCTCAGGCTCCAAGAAGGGGAAAAAAATGAAGA
TTTTGATACTTGGTATTTTTCTGTTTTTATGTAGTACCCCAGCCTGGGCGAAAGAAAAGCATTATTACATTGGAATTATTGAAACGACTT
GGGATTATGCCTCTGACCATGGGGAAAAGAAACTTATTTCTGTTGACACGAAGTTGATGGCAATAAAGTTATGTCTTCATTTGCCCCACA
CAACTCATCTACCTCACCTCAGAAGGCAGAAGAAGGTGGGCGACAGAGTGGCGAGTCCTTGTCTAGTACAGCCCTGGGAACTCCTGAACG
GCGCAAGGGCAGTTTAGCTGATGTTGTTGACACCTTGAAGCAGAGGAAAATGGAAGAGCTCATCAAAAACGAGCCGGAAGAAACCCCCAG
TATTGAAAAACTACTCTCAAAGGACTGGAAAGACAAGCTTCTTGCAATGGGATCGGGGAACTTTGGCGAAATAAAAGGGACTCCCGAGAG
CTTAGCTGAGAAAGAAAGGCAACTCATGGGTATGATCAACCAGCTGACCAGCCTCCGAGAGCAGCTGTTGGCTGCCCACGATGAGCAGAA
GAAACTAGCTGCCTCTCAGATTGAGAAACAGCGTCAGCAAATGGAGCTGGCCAAGCAGCAACAAGAACAAATTGCAAGACAGCAGCAGCA
GCTTCTACAGCAACAACACAAAATCAATTTGCTCCAGCAACAGATCCAGGTTCAAGGTCAGCTGCCGCCATTAATGATTCCCGTATTCCC
TCCTGATCAACGGACACTGGCTGCAGCTGCCCAGCAAGGATTCCTCCTCCCTCCAGGCTTCAGCTATAAGGCTGGATGTAGTGACCCTTA
CCCTGTTCAGCTGATCCCAACTACCATGGCAGCTGCTGCCGCAGCAACACCAGGCTTAGGCCCACTCCAACTGCAGCAGTTATATGCTGC
CCAGCTAGCTGCAATGCAGGTATCTCCAGGAGGGAAGCTGCCAGGCATACCCCAAGGCAACCTTGGTGCTGCTGTATCTCCTACCAGCAT
TCACACAGACAAGAGCACAAACAGCCCACCACCCAAAAGCAAGGAAAAAACAACACTGGAGAGTCTGACTCAGCAACTGGCAGTTAAACA
GAATGAAGAAGGAAAATTTAGCCATGCAATGATGGATTTCAATCTGAGTGGAGATTCTGATGGAAGTGCTGGAGTCTCAGAGTCAAGAAT
TTATAGGGAATCCCGAGGGCGTGGTAGCAATGAACCCCACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGATGAACGGAG
AAAGATCCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAGATATTGGGATCTCGCTGGAAAGCTATGACAAACCTAGAGAA
ACAGCCATATTATGAGGAGCAAGCCCGTCTCAGCAAGCAGCACCTGGAGAAGTACCCTGACTATAAGTACAAGCCCAGGCCAAAGCGCAC
CTGCCTGGTGGATGGCAAAAAGCTGCGCATTGGTGAATACAAGGCAATCATGCGCAACAGGCGGCAGGAAATGCGGCAGTACTTCAATGT
TGGGCAACAAGCACAGATCCCCATTGCCACTGCTGGTGTTGTGTACCCTGGAGCCATCGCCATGGCTGGGATGCCCTCCCCTCACCTGCC
CTCGGAGCACTCAAGCGTGTCTAGCAGCCCAGAGCCTGGGATGCCTGTTATCCAGAGCACTTACGGTGTGAAAGGAGAGGAGCCACATAT
CAAAGAAGAGATACAGGCCGAGGACATCAATGGAGAAATTTATGATGAGTACGACGAGGAAGAGGATGATCCAGATGTAGATTATGGGAG
TGACAGTGAAAACCATATTGCAGGACAAGCCAACTGATAAGGGTCAAAAGATTGTTGTGACCTTAGGACTTAAAGAAGCCCTAACTGGTT
CATCCTTACCAGTGGCCAAGCACATTAACTTTCTCATACACTGACTGTTACTTTAACTGTTAGTCTTAAATAGTTGGGACATCAGCTGAC
TAATAGACCTCAGCCTCAAAAGGCTTGGAAAGAAAAAACAAATACAACAAGCAAACAACAATATCAACAACAAGAGATTGAAATAAGCTA
TGGGTAAAATAATGCCAGTAATTCAGCTGCTACATCCAAGCACTGAAGTCTTACCCGTCAACTTTTTTTTTTTTTTAAATAAACTTTATG
GCTGTTTGTTCTACAATGTTCTAGAAATTCTCACTCAGGTACACAGTGCCAACAAGTGGCTTGTGAATGTGTTTTGTTGTTTTGTGCTAC
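
The sequence headers above appear to follow one underscore-separated layout: frame label, head gene, head transcript, head chromosome, head breakpoint, head strand, then the same five fields for the tail gene, and finally the transcript length in nucleotides. The following is a minimal sketch of how such a header could be split into named fields. The field names and the `parse_fusion_header` helper are hypothetical and not part of FusionGDB, and the pattern assumes that neither gene symbol contains an underscore.

```python
import re
from typing import NamedTuple


class FusionHeader(NamedTuple):
    # Field names are illustrative; they are not defined by FusionGDB.
    frame: str
    hgene: str
    h_transcript: str
    h_chrom: str
    h_breakpoint: int
    h_strand: str
    tgene: str
    t_transcript: str
    t_chrom: str
    t_breakpoint: int
    t_strand: str
    length_nt: int


# Assumption: gene symbols and the frame label contain no underscore.
HEADER_RE = re.compile(
    r"^>?(?P<frame>[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<h_transcript>ENST\d+)"
    r"_(?P<h_chrom>chr[^_]+)_(?P<h_breakpoint>\d+)_(?P<h_strand>[+-])"
    r"_(?P<tgene>[^_]+)_(?P<t_transcript>ENST\d+)"
    r"_(?P<t_chrom>chr[^_]+)_(?P<t_breakpoint>\d+)_(?P<t_strand>[+-])"
    r"_(?P<length_nt>\d+)nt$"
)


def parse_fusion_header(line: str) -> FusionHeader:
    """Split one fusion-transcript header line into its named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header layout: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields from text to integers.
    for key in ("h_breakpoint", "t_breakpoint", "length_nt"):
        fields[key] = int(fields[key])
    return FusionHeader(**fields)


if __name__ == "__main__":
    header = (
        ">In-frame_CP_ENST00000264613_chr3_148939434_-"
        "_SOX5_ENST00000541536_chr12_23999127_-_2539nt"
    )
    print(parse_fusion_header(header))
```

A regular expression is used rather than a plain `split("_")` so that a header with an unexpected layout raises an error instead of silently shifting fields.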



FusionGenePPI for CP_SOX5


Go to the ChiPPI (Chimeric Protein-Protein interactions) page to see the chimeric PPI interactions.

Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene	Hgene's interactors	Tgene	Tgene's interactors
CP	LTF, RAD21, APOA1, BTRC, GDPD1, MED4, MED20, DDX31, SNX27	SOX5	SOX2, APP, CDK6, SMAD7, SMAD1, SMAD5, FTH1, CDC25A, RPS2, TTC1, UQCRFS1, MED27, TAF6, AES, CRX, KIFC3, LMO1, LMO2, SOX5, CDC23, KAT5, ARID5A, ZNF581, CBX8, FAM46B, PRR20A, MORN3, SUMO1P1, CEP85, SOX6, SOX13, SOX1, ANKRD40, SOX11


Retained PPIs in in-frame fusion.
Partner	Gene	Hbp	Tbp	ENST	Strand	BPexon	TotalExon	Protein feature loci	*BPloci	TotalLen	Still interaction with


Lost PPIs in in-frame fusion.
Partner	Gene	Hbp	Tbp	ENST	Strand	BPexon	TotalExon	Protein feature loci	*BPloci	TotalLen	Interaction lost with


Retained PPIs, but lost function due to frame-shift fusion.
Partner	Gene	Hbp	Tbp	ENST	Strand	BPexon	TotalExon	Protein feature loci	*BPloci	TotalLen	Interaction lost with



RelatedDrugs for CP_SOX5


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.0 2018-04-02)
Partner	Gene	UniProtAcc	DrugBank ID	Drug name	Drug activity	Drug type	Drug status
Hgene	CP	P00450	DB01592	Iron	Ceruloplasmin	small molecule	approved
Hgene	CP	P00450	DB01593	Zinc	Ceruloplasmin	small molecule	approved|investigational
Hgene	CP	P00450	DB00055	Drotrecogin alfa	Ceruloplasmin	biotech	approved|investigational|withdrawn
Hgene	CP	P00450	DB01373	Calcium	Ceruloplasmin	small molecule	approved|nutraceutical


RelatedDiseases for CP_SOX5


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner	Gene	Disease ID	Disease name	# pubmeds	Source
Hgene	CP	C0019202	Hepatolenticular Degeneration	3	CTD_human
Hgene	CP	C2931082	Familial apoceruloplasmin deficiency	3	CTD_human;ORPHANET
Hgene	CP	C0022116	Ischemia	2	CTD_human
Hgene	CP	C0023890	Liver Cirrhosis	2	CTD_human
Hgene	CP	C0030567	Parkinson Disease	2	CTD_human
Hgene	CP	C0001925	Albuminuria	1	CTD_human
Hgene	CP	C0003873	Rheumatoid Arthritis	1	CTD_human
Hgene	CP	C0004134	Ataxia	1	CTD_human
Hgene	CP	C0004352	Autistic Disorder	1	CTD_human
Hgene	CP	C0006111	Brain Diseases	1	CTD_human
Hgene	CP	C0009375	Colonic Neoplasms	1	CTD_human
Hgene	CP	C0011849	Diabetes Mellitus	1	CTD_human;HPO
Hgene	CP	C0011854	Diabetes Mellitus, Insulin-Dependent	1	CTD_human
Hgene	CP	C0012715	Iron Metabolism Disorders	1	CTD_human
Hgene	CP	C0013384	Dyskinetic syndrome	1	CTD_human
Hgene	CP	C0018995	Hemochromatosis	1	CTD_human
Hgene	CP	C0019189	Hepatitis, Chronic	1	CTD_human
Hgene	CP	C0022716	Menkes Kinky Hair Syndrome	1	CTD_human
Hgene	CP	C0023904	Liver Neoplasms, Experimental	1	CTD_human
Hgene	CP	C0025202	melanoma	1	CTD_human
Hgene	CP	C0027746	Nerve Degeneration	1	CTD_human
Hgene	CP	C0032914	Pre-Eclampsia	1	CTD_human
Hgene	CP	C0033860	Psoriasis	1	CTD_human
Hgene	CP	C0035304	Retinal Degeneration	1	CTD_human;HPO
Hgene	CP	C0036341	Schizophrenia	1	CTD_human
Hgene	CP	C0085397	Pasteurellaceae Infections	1	CTD_human
Hgene	CP	C0282193	Iron Overload	1	CTD_human
Hgene	CP	C0497327	Dementia	1	CTD_human;HPO
Hgene	CP	C0993582	Arthritis, Experimental	1	CTD_human
Hgene	CP	C2239176	Liver carcinoma	1	CTD_human
Hgene	CP	C4277682	Chemical and Drug Induced Liver Injury	1	CTD_human
Tgene	SOX5	C0004238	Atrial Fibrillation	1	CTD_human
Tgene	SOX5	C0036341	Schizophrenia	1	PSYGENET
Tgene	SOX5	C1510586	Autism Spectrum Disorders	1	CTD_human