|
Fusion gene ID: 28636 |
FusionGeneSummary for PREP_SIM1 |
Fusion gene summary |
Fusion gene information | Fusion gene name: PREP_SIM1 | Fusion gene ID: 28636 | Hgene | Tgene | Gene symbol | PREP | SIM1 | Gene ID | 5550 | 6492 |
Gene name | prolyl endopeptidase | SIM bHLH transcription factor 1 | |
Synonyms | PE|PEP | bHLHe14 | |
Cytomap | 6q21 | 6q16.3 | |
Type of gene | protein-coding | protein-coding | |
Description | prolyl endopeptidasedJ355L5.1 (prolyl endopeptidase)post-proline cleaving enzymeprolyl oligopeptidase | single-minded homolog 1class E basic helix-loop-helix protein 14single-minded family bHLH transcription factor 1 | |
Modification date | 20180523 | 20180523 | |
UniProtAcc | P48147 | P81133 | |
Ensembl transtripts involved in fusion gene | ENST00000369110, | ENST00000369208, ENST00000262901, | |
Fusion gene scores | * DoF score | 12 X 7 X 5=420 | 4 X 4 X 3=48 |
# samples | 12 | 4 | |
** MAII score | log2(12/420*10)=-1.8073549220576 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | log2(4/48*10)=-0.263034405833794 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | |
Context | PubMed: PREP [Title/Abstract] AND SIM1 [Title/Abstract] AND fusion [Title/Abstract] | ||
Functional or gene categories assigned by FusionGDB annotation |
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types ** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10) |
Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez |
Partner | Gene | GO ID | GO term | PubMed ID |
Fusion gene information from three resources (ChiTars (NAR, 2018), tumorfusions (NAR, 2018), Gao et al. (Cell, 2018)) * All genome coordinats were lifted-over on hg19. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
Data type | Source | Cancer type | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
TCGA | LD | LUSC | TCGA-46-6026-01A | PREP | chr6 | 105736633 | - | SIM1 | chr6 | 100868834 | - |
* LD: Li Ding group's fusion gene list RV: Roel Verhaak group's fusion gene list ChiTaRs fusion database |
Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
In-frame | ENST00000369110 | ENST00000369208 | PREP | chr6 | 105736633 | - | SIM1 | chr6 | 100868834 | - |
In-frame | ENST00000369110 | ENST00000262901 | PREP | chr6 | 105736633 | - | SIM1 | chr6 | 100868834 | - |
Top |
FusionProtFeatures for PREP_SIM1 |
Main function of each fusion partner protein. (from UniProt) |
Hgene | Tgene |
PREP | SIM1 |
Cleaves peptide bonds on the C-terminal side of prolylresidues within peptides that are up to approximately 30 aminoacids long. | Transcriptional factor that may have pleiotropic effectsduring embryogenesis and in the adult. |
Retention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at . * Minus value of BPloci means that the break pointn is located before the CDS. |
- In-frame and retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Tgene | >SIM1 | chr6:105736633 | chr6:100868834 | ENST00000262901 | - | 7 | 11 | 336_766 | 332 | 767 | Domain | Single-minded C-terminal |
Tgene | >SIM1 | chr6:105736633 | chr6:100868834 | ENST00000369208 | - | 8 | 12 | 336_766 | 332 | 767 | Domain | Single-minded C-terminal |
Tgene | >SIM1 | chr6:105736633 | chr6:100868834 | ENST00000262901 | - | 7 | 11 | 368_387 | 332 | 767 | Motif | Nuclear localization signal |
Tgene | >SIM1 | chr6:105736633 | chr6:100868834 | ENST00000369208 | - | 8 | 12 | 368_387 | 332 | 767 | Motif | Nuclear localization signal |
- In-frame and not-retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000262901 | - | 7 | 11 | 1_53 | 332 | 767 | Domain | bHLH |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000262901 | - | 7 | 11 | 218_288 | 332 | 767 | Domain | PAS 2 |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000262901 | - | 7 | 11 | 292_335 | 332 | 767 | Domain | Note=PAC |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000262901 | - | 7 | 11 | 77_147 | 332 | 767 | Domain | PAS 1 |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000369208 | - | 8 | 12 | 1_53 | 332 | 767 | Domain | bHLH |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000369208 | - | 8 | 12 | 218_288 | 332 | 767 | Domain | PAS 2 |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000369208 | - | 8 | 12 | 292_335 | 332 | 767 | Domain | Note=PAC |
Tgene | SIM1 | chr6:105736633 | chr6:100868834 | ENST00000369208 | - | 8 | 12 | 77_147 | 332 | 767 | Domain | PAS 1 |
Top |
FusionGeneSequence for PREP_SIM1 |
For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. (nt: nucleotides, aa: amino acids) |
* Fusion amino acid sequences. |
>In-frame_PREP_ENST00000369110_chr6_105736633_-_SIM1_ENST00000369208_chr6_100868834_-_919aa MLSLQYPDVYRDETAVQDYHGHKICDPYAWLEDPDSEQTKAFVEAQNKITVPFLEQCPIRGLYKERMTELYDYPKYSCHFKKGKRYFYFY NTGLQNQRVLYVQDSLEGEARVFLDPNILSDDGTVALRGYAFSEDGEYFAYGLSASGSDWVTIKFMKVDGAKELPDVLERVKFSCMAWTH DGKGMFYNSYPQQDGKSDGTETSTNLHQKLYYHVLGTDQSEDILCAEFPDEPKWMGGAELSDDGRYVLLSIREGCDPVNRLWYCDLQQES SGIAGILKWVKLIDNFEGEYDYVTNEGTVFTFKTNRQSPNYRVINIDFRDPEESKWKVLVPEHEKDVLEWIACVRSNFLVLCYLHDVKNI LQLHDLTTGALLKTFPLDVGSIVGYSGQKKDTEIFYQFTSFLSPGIIYHCDLTKEELEPRVFREVTVKGIDASDYQTVQIFYPSKDGTKI PMFIVHKKGIKLDGSHPAFLYGYGGFNISITPNYRDTEYKGLQLSLDQISASKPAFSYTSSSTPTMTDNRKGAKSRLSSSKSKSRTSPYP QYSGFHTERSESDHDSQWGGSPLTDTASPQLLDPADRPGSQHDASCAYRQFSDRSSLCYGFALDHSRLVEERHFHTQACEGGRCEAGRYF LGTPQAGREPWWGSRAALPLTKASPESREAYENSMPHIASVHRIHGRGHWDEDSVVSSPDPGSASESGDRYRTEQYQSSPHEPSKIETLI RATQQMIKEEENRLQLRKAPSDQLASINGAGKKHSLCFANYQQPPPTGEVCHGSALANTSPCDHIQQREGKMLSPHENDYDNSPTALSRI SSPNSDRISKSSLILAKDYLHSDISPHQTAGDHPTVSPNCFGSHRQYFDKHAYTLTGYALEHLYDSETIRNYSLGCNGSHFDVTSHLRMQ >In-frame_PREP_ENST00000369110_chr6_105736633_-_SIM1_ENST00000262901_chr6_100868834_-_919aa MLSLQYPDVYRDETAVQDYHGHKICDPYAWLEDPDSEQTKAFVEAQNKITVPFLEQCPIRGLYKERMTELYDYPKYSCHFKKGKRYFYFY NTGLQNQRVLYVQDSLEGEARVFLDPNILSDDGTVALRGYAFSEDGEYFAYGLSASGSDWVTIKFMKVDGAKELPDVLERVKFSCMAWTH DGKGMFYNSYPQQDGKSDGTETSTNLHQKLYYHVLGTDQSEDILCAEFPDEPKWMGGAELSDDGRYVLLSIREGCDPVNRLWYCDLQQES SGIAGILKWVKLIDNFEGEYDYVTNEGTVFTFKTNRQSPNYRVINIDFRDPEESKWKVLVPEHEKDVLEWIACVRSNFLVLCYLHDVKNI LQLHDLTTGALLKTFPLDVGSIVGYSGQKKDTEIFYQFTSFLSPGIIYHCDLTKEELEPRVFREVTVKGIDASDYQTVQIFYPSKDGTKI PMFIVHKKGIKLDGSHPAFLYGYGGFNISITPNYRDTEYKGLQLSLDQISASKPAFSYTSSSTPTMTDNRKGAKSRLSSSKSKSRTSPYP QYSGFHTERSESDHDSQWGGSPLTDTASPQLLDPADRPGSQHDASCAYRQFSDRSSLCYGFALDHSRLVEERHFHTQACEGGRCEAGRYF LGTPQAGREPWWGSRAALPLTKASPESREAYENSMPHIASVHRIHGRGHWDEDSVVSSPDPGSASESGDRYRTEQYQSSPHEPSKIETLI RATQQMIKEEENRLQLRKAPSDQLASINGAGKKHSLCFANYQQPPPTGEVCHGSALANTSPCDHIQQREGKMLSPHENDYDNSPTALSRI SSPNSDRISKSSLILAKDYLHSDISPHQTAGDHPTVSPNCFGSHRQYFDKHAYTLTGYALEHLYDSETIRNYSLGCNGSHFDVTSHLRMQ |
* Fusion transcript sequences (only coding sequence (CDS) region). |
>In-frame_PREP_ENST00000369110_chr6_105736633_-_SIM1_ENST00000369208_chr6_100868834_-_2757nt ATGCTGTCCCTTCAGTACCCCGACGTGTACCGCGACGAGACCGCCGTACAGGATTATCATGGTCATAAAATTTGTGACCCTTACGCCTGG CTTGAAGACCCCGACAGTGAACAGACTAAGGCCTTTGTGGAGGCCCAGAATAAGATTACTGTGCCATTTCTTGAGCAGTGTCCCATCAGA GGTTTATACAAAGAGAGAATGACTGAACTATATGATTATCCCAAGTATAGTTGCCACTTCAAGAAAGGAAAACGGTATTTTTATTTTTAC AATACAGGTTTGCAGAACCAGCGAGTATTATATGTACAGGATTCCTTAGAGGGTGAGGCCAGAGTGTTCCTGGACCCCAACATACTGTCT GACGATGGCACAGTGGCACTCCGAGGTTATGCGTTCAGCGAAGATGGTGAATATTTTGCCTATGGTCTGAGTGCCAGTGGCTCAGACTGG GTGACAATCAAGTTCATGAAAGTTGATGGTGCCAAAGAGCTTCCAGATGTGCTTGAAAGAGTCAAGTTCAGCTGTATGGCCTGGACCCAT GATGGGAAGGGAATGTTCTACAACTCATACCCTCAACAGGATGGAAAAAGTGATGGCACAGAGACATCTACCAATCTCCACCAAAAGCTC TACTACCATGTCTTGGGAACCGATCAGTCAGAAGATATTTTGTGTGCTGAGTTTCCTGATGAACCTAAATGGATGGGTGGAGCTGAGTTA TCTGATGATGGCCGCTATGTCTTGTTATCAATAAGGGAAGGATGTGATCCAGTAAACCGACTCTGGTACTGTGACCTACAGCAGGAATCC AGTGGCATCGCGGGAATCCTGAAGTGGGTAAAACTGATTGACAACTTTGAAGGGGAATATGACTACGTGACCAATGAGGGGACGGTGTTC ACATTCAAGACGAATCGCCAGTCTCCCAACTATCGCGTGATCAACATTGACTTCAGGGATCCTGAAGAGTCTAAGTGGAAAGTACTTGTT CCTGAGCATGAGAAAGATGTCTTAGAATGGATAGCTTGTGTCAGGTCCAACTTCTTGGTCTTATGCTACCTCCATGACGTCAAGAACATT CTGCAGCTCCATGACCTGACTACTGGTGCTCTCCTTAAGACCTTCCCGCTCGATGTCGGCAGCATTGTAGGGTACAGCGGTCAGAAGAAG GACACTGAAATCTTCTATCAGTTTACTTCCTTTTTATCTCCAGGTATCATTTATCACTGTGATCTTACCAAAGAGGAGCTGGAGCCAAGA GTTTTCCGAGAGGTGACCGTAAAAGGAATTGATGCTTCTGATTACCAGACAGTCCAGATTTTCTACCCTAGCAAGGATGGTACGAAGATT CCAATGTTCATTGTGCATAAAAAAGGCATAAAATTGGATGGCTCTCATCCAGCTTTCTTATATGGCTATGGCGGCTTCAACATATCCATC ACACCCAACTACAGAGACACAGAATACAAAGGGCTGCAGCTCTCCCTGGATCAGATCTCAGCCTCCAAACCAGCCTTCTCCTATACCAGC AGCTCCACCCCCACCATGACTGACAACAGAAAGGGGGCCAAATCCCGGCTCTCCAGCTCAAAGTCAAAATCCAGGACTTCCCCATACCCT CAGTATTCGGGATTTCACACAGAAAGATCGGAATCTGATCATGACAGCCAGTGGGGCGGAAGTCCCTTGACCGACACGGCCTCTCCGCAG CTTCTGGACCCCGCCGATAGGCCTGGCTCCCAGCACGACGCATCGTGCGCCTACAGACAGTTTTCGGACCGCAGCTCTCTCTGCTATGGC TTTGCGCTTGACCACTCGAGGCTGGTGGAAGAGAGGCATTTCCATACCCAGGCCTGTGAAGGAGGCCGATGTGAGGCAGGCAGGTACTTC CTGGGAACGCCGCAGGCCGGGAGGGAGCCCTGGTGGGGCTCTCGCGCAGCCTTGCCCCTGACAAAGGCCTCCCCAGAAAGCAGAGAAGCC TATGAAAACAGCATGCCTCACATCGCTTCAGTCCACAGGATCCATGGGCGAGGTCATTGGGATGAAGATAGTGTGGTCAGTTCTCCAGAC CCTGGGTCGGCCAGTGAATCAGGTGACCGATATCGTACTGAGCAGTATCAAAGTAGCCCACATGAACCCAGCAAAATTGAAACTCTTATA AGAGCCACTCAGCAAATGATTAAAGAAGAAGAGAACAGATTACAGCTAAGGAAAGCCCCCTCAGACCAACTGGCTTCCATTAATGGGGCT GGGAAAAAACACTCCCTGTGTTTTGCAAACTACCAACAGCCCCCACCAACAGGTGAAGTCTGCCATGGCTCTGCTCTTGCCAACACTTCA CCATGTGACCATATCCAGCAGAGAGAGGGAAAAATGTTGAGCCCCCATGAAAATGACTATGACAACAGTCCCACCGCACTATCTCGGATA AGTAGTCCCAATTCGGATCGCATTTCAAAATCCAGTTTGATCCTAGCTAAAGACTATCTGCATTCGGATATATCTCCTCATCAGACAGCA GGAGACCACCCTACTGTCTCTCCAAACTGCTTTGGCTCTCACCGGCAGTATTTTGACAAGCATGCTTACACATTAACTGGATATGCCCTG GAGCACTTATATGACAGCGAAACCATTAGAAACTATTCCTTGGGCTGTAATGGCTCACACTTTGATGTAACTTCCCATCTGAGGATGCAA >In-frame_PREP_ENST00000369110_chr6_105736633_-_SIM1_ENST00000262901_chr6_100868834_-_2757nt ATGCTGTCCCTTCAGTACCCCGACGTGTACCGCGACGAGACCGCCGTACAGGATTATCATGGTCATAAAATTTGTGACCCTTACGCCTGG CTTGAAGACCCCGACAGTGAACAGACTAAGGCCTTTGTGGAGGCCCAGAATAAGATTACTGTGCCATTTCTTGAGCAGTGTCCCATCAGA GGTTTATACAAAGAGAGAATGACTGAACTATATGATTATCCCAAGTATAGTTGCCACTTCAAGAAAGGAAAACGGTATTTTTATTTTTAC AATACAGGTTTGCAGAACCAGCGAGTATTATATGTACAGGATTCCTTAGAGGGTGAGGCCAGAGTGTTCCTGGACCCCAACATACTGTCT GACGATGGCACAGTGGCACTCCGAGGTTATGCGTTCAGCGAAGATGGTGAATATTTTGCCTATGGTCTGAGTGCCAGTGGCTCAGACTGG GTGACAATCAAGTTCATGAAAGTTGATGGTGCCAAAGAGCTTCCAGATGTGCTTGAAAGAGTCAAGTTCAGCTGTATGGCCTGGACCCAT GATGGGAAGGGAATGTTCTACAACTCATACCCTCAACAGGATGGAAAAAGTGATGGCACAGAGACATCTACCAATCTCCACCAAAAGCTC TACTACCATGTCTTGGGAACCGATCAGTCAGAAGATATTTTGTGTGCTGAGTTTCCTGATGAACCTAAATGGATGGGTGGAGCTGAGTTA TCTGATGATGGCCGCTATGTCTTGTTATCAATAAGGGAAGGATGTGATCCAGTAAACCGACTCTGGTACTGTGACCTACAGCAGGAATCC AGTGGCATCGCGGGAATCCTGAAGTGGGTAAAACTGATTGACAACTTTGAAGGGGAATATGACTACGTGACCAATGAGGGGACGGTGTTC ACATTCAAGACGAATCGCCAGTCTCCCAACTATCGCGTGATCAACATTGACTTCAGGGATCCTGAAGAGTCTAAGTGGAAAGTACTTGTT CCTGAGCATGAGAAAGATGTCTTAGAATGGATAGCTTGTGTCAGGTCCAACTTCTTGGTCTTATGCTACCTCCATGACGTCAAGAACATT CTGCAGCTCCATGACCTGACTACTGGTGCTCTCCTTAAGACCTTCCCGCTCGATGTCGGCAGCATTGTAGGGTACAGCGGTCAGAAGAAG GACACTGAAATCTTCTATCAGTTTACTTCCTTTTTATCTCCAGGTATCATTTATCACTGTGATCTTACCAAAGAGGAGCTGGAGCCAAGA GTTTTCCGAGAGGTGACCGTAAAAGGAATTGATGCTTCTGATTACCAGACAGTCCAGATTTTCTACCCTAGCAAGGATGGTACGAAGATT CCAATGTTCATTGTGCATAAAAAAGGCATAAAATTGGATGGCTCTCATCCAGCTTTCTTATATGGCTATGGCGGCTTCAACATATCCATC ACACCCAACTACAGAGACACAGAATACAAAGGGCTGCAGCTCTCCCTGGATCAGATCTCAGCCTCCAAACCAGCCTTCTCCTATACCAGC AGCTCCACCCCCACCATGACTGACAACAGAAAGGGGGCCAAATCCCGGCTCTCCAGCTCAAAGTCAAAATCCAGGACTTCCCCATACCCT CAGTATTCGGGATTTCACACAGAAAGATCGGAATCTGATCATGACAGCCAGTGGGGCGGAAGTCCCTTGACCGACACGGCCTCTCCGCAG CTTCTGGACCCCGCCGATAGGCCTGGCTCCCAGCACGACGCATCGTGCGCCTACAGACAGTTTTCGGACCGCAGCTCTCTCTGCTATGGC TTTGCGCTTGACCACTCGAGGCTGGTGGAAGAGAGGCATTTCCATACCCAGGCCTGTGAAGGAGGCCGATGTGAGGCAGGCAGGTACTTC CTGGGAACGCCGCAGGCCGGGAGGGAGCCCTGGTGGGGCTCTCGCGCAGCCTTGCCCCTGACAAAGGCCTCCCCAGAAAGCAGAGAAGCC TATGAAAACAGCATGCCTCACATCGCTTCAGTCCACAGGATCCATGGGCGAGGTCATTGGGATGAAGATAGTGTGGTCAGTTCTCCAGAC CCTGGGTCGGCCAGTGAATCAGGTGACCGATATCGTACTGAGCAGTATCAAAGTAGCCCACATGAACCCAGCAAAATTGAAACTCTTATA AGAGCCACTCAGCAAATGATTAAAGAAGAAGAGAACAGATTACAGCTAAGGAAAGCCCCCTCAGACCAACTGGCTTCCATTAATGGGGCT GGGAAAAAACACTCCCTGTGTTTTGCAAACTACCAACAGCCCCCACCAACAGGTGAAGTCTGCCATGGCTCTGCTCTTGCCAACACTTCA CCATGTGACCATATCCAGCAGAGAGAGGGAAAAATGTTGAGCCCCCATGAAAATGACTATGACAACAGTCCCACCGCACTATCTCGGATA AGTAGTCCCAATTCGGATCGCATTTCAAAATCCAGTTTGATCCTAGCTAAAGACTATCTGCATTCGGATATATCTCCTCATCAGACAGCA GGAGACCACCCTACTGTCTCTCCAAACTGCTTTGGCTCTCACCGGCAGTATTTTGACAAGCATGCTTACACATTAACTGGATATGCCCTG GAGCACTTATATGACAGCGAAACCATTAGAAACTATTCCTTGGGCTGTAATGGCTCACACTTTGATGTAACTTCCCATCTGAGGATGCAA |
* Fusion transcript sequences (Full-length transcript). |
>In-frame_PREP_ENST00000369110_chr6_105736633_-_SIM1_ENST00000369208_chr6_100868834_-_8296nt CTGCTTTCTGCACGTCCTCGCCGCCGCGCCGCCAGTCCGTTTGTGCTAGCTCTGGCCGTGAGCCGGCCGCCCGCTGCCGCCGGCCGCCCC GCAGCTGCCTGCGCCCCAGCCGCGCCTCGCGGCCAGCCCGGCTAGCTCAGGTCCGCTCCCGGAGCCCGCGCCCCTCCACGCTGCCCCCTG CCTGTCCCCGGCCATGCTGTCCCTTCAGTACCCCGACGTGTACCGCGACGAGACCGCCGTACAGGATTATCATGGTCATAAAATTTGTGA CCCTTACGCCTGGCTTGAAGACCCCGACAGTGAACAGACTAAGGCCTTTGTGGAGGCCCAGAATAAGATTACTGTGCCATTTCTTGAGCA GTGTCCCATCAGAGGTTTATACAAAGAGAGAATGACTGAACTATATGATTATCCCAAGTATAGTTGCCACTTCAAGAAAGGAAAACGGTA TTTTTATTTTTACAATACAGGTTTGCAGAACCAGCGAGTATTATATGTACAGGATTCCTTAGAGGGTGAGGCCAGAGTGTTCCTGGACCC CAACATACTGTCTGACGATGGCACAGTGGCACTCCGAGGTTATGCGTTCAGCGAAGATGGTGAATATTTTGCCTATGGTCTGAGTGCCAG TGGCTCAGACTGGGTGACAATCAAGTTCATGAAAGTTGATGGTGCCAAAGAGCTTCCAGATGTGCTTGAAAGAGTCAAGTTCAGCTGTAT GGCCTGGACCCATGATGGGAAGGGAATGTTCTACAACTCATACCCTCAACAGGATGGAAAAAGTGATGGCACAGAGACATCTACCAATCT CCACCAAAAGCTCTACTACCATGTCTTGGGAACCGATCAGTCAGAAGATATTTTGTGTGCTGAGTTTCCTGATGAACCTAAATGGATGGG TGGAGCTGAGTTATCTGATGATGGCCGCTATGTCTTGTTATCAATAAGGGAAGGATGTGATCCAGTAAACCGACTCTGGTACTGTGACCT ACAGCAGGAATCCAGTGGCATCGCGGGAATCCTGAAGTGGGTAAAACTGATTGACAACTTTGAAGGGGAATATGACTACGTGACCAATGA GGGGACGGTGTTCACATTCAAGACGAATCGCCAGTCTCCCAACTATCGCGTGATCAACATTGACTTCAGGGATCCTGAAGAGTCTAAGTG GAAAGTACTTGTTCCTGAGCATGAGAAAGATGTCTTAGAATGGATAGCTTGTGTCAGGTCCAACTTCTTGGTCTTATGCTACCTCCATGA CGTCAAGAACATTCTGCAGCTCCATGACCTGACTACTGGTGCTCTCCTTAAGACCTTCCCGCTCGATGTCGGCAGCATTGTAGGGTACAG CGGTCAGAAGAAGGACACTGAAATCTTCTATCAGTTTACTTCCTTTTTATCTCCAGGTATCATTTATCACTGTGATCTTACCAAAGAGGA GCTGGAGCCAAGAGTTTTCCGAGAGGTGACCGTAAAAGGAATTGATGCTTCTGATTACCAGACAGTCCAGATTTTCTACCCTAGCAAGGA TGGTACGAAGATTCCAATGTTCATTGTGCATAAAAAAGGCATAAAATTGGATGGCTCTCATCCAGCTTTCTTATATGGCTATGGCGGCTT CAACATATCCATCACACCCAACTACAGAGACACAGAATACAAAGGGCTGCAGCTCTCCCTGGATCAGATCTCAGCCTCCAAACCAGCCTT CTCCTATACCAGCAGCTCCACCCCCACCATGACTGACAACAGAAAGGGGGCCAAATCCCGGCTCTCCAGCTCAAAGTCAAAATCCAGGAC TTCCCCATACCCTCAGTATTCGGGATTTCACACAGAAAGATCGGAATCTGATCATGACAGCCAGTGGGGCGGAAGTCCCTTGACCGACAC GGCCTCTCCGCAGCTTCTGGACCCCGCCGATAGGCCTGGCTCCCAGCACGACGCATCGTGCGCCTACAGACAGTTTTCGGACCGCAGCTC TCTCTGCTATGGCTTTGCGCTTGACCACTCGAGGCTGGTGGAAGAGAGGCATTTCCATACCCAGGCCTGTGAAGGAGGCCGATGTGAGGC AGGCAGGTACTTCCTGGGAACGCCGCAGGCCGGGAGGGAGCCCTGGTGGGGCTCTCGCGCAGCCTTGCCCCTGACAAAGGCCTCCCCAGA AAGCAGAGAAGCCTATGAAAACAGCATGCCTCACATCGCTTCAGTCCACAGGATCCATGGGCGAGGTCATTGGGATGAAGATAGTGTGGT CAGTTCTCCAGACCCTGGGTCGGCCAGTGAATCAGGTGACCGATATCGTACTGAGCAGTATCAAAGTAGCCCACATGAACCCAGCAAAAT TGAAACTCTTATAAGAGCCACTCAGCAAATGATTAAAGAAGAAGAGAACAGATTACAGCTAAGGAAAGCCCCCTCAGACCAACTGGCTTC CATTAATGGGGCTGGGAAAAAACACTCCCTGTGTTTTGCAAACTACCAACAGCCCCCACCAACAGGTGAAGTCTGCCATGGCTCTGCTCT TGCCAACACTTCACCATGTGACCATATCCAGCAGAGAGAGGGAAAAATGTTGAGCCCCCATGAAAATGACTATGACAACAGTCCCACCGC ACTATCTCGGATAAGTAGTCCCAATTCGGATCGCATTTCAAAATCCAGTTTGATCCTAGCTAAAGACTATCTGCATTCGGATATATCTCC TCATCAGACAGCAGGAGACCACCCTACTGTCTCTCCAAACTGCTTTGGCTCTCACCGGCAGTATTTTGACAAGCATGCTTACACATTAAC TGGATATGCCCTGGAGCACTTATATGACAGCGAAACCATTAGAAACTATTCCTTGGGCTGTAATGGCTCACACTTTGATGTAACTTCCCA TCTGAGGATGCAACCAGACCCAGCACAAGGACACAAGGGAACATCTGTTATAATAACCAACGGAAGCTGATGTTTTGCTGAAATATTTTG TTCTTTAAGGATCTCTGAAACATATTTATAGTTTAATACCCCATTACCAGCATTTACTATGCCACAGATTGTTAGAGAGTATAACTTAAG TTACTGGGTATTTGATACGTGTTCCTATAAAATCAAAGAAAACATAGCACTAGCATTCAGGGTTATACACAGAAAAGGGAGCTAAATTGA ATACACAAATTTCCCCTCTAATTATATGGGAACCAGAATAGATAAATTTTGACTTGAAAAATATTCATGTAGATCAAGTGTGCATATATA CTACATGAGAGGACTGATGAATGACAACATTGCATTGTGACTATCCAGTGATCCTCAAACACACAAACTATTACTTACAAACTGCGGTAT ACATTTTACATATGGAAATATAGGCTATGTAATGTAAATACATCAAAAATGGGTAATTTTCTTTGACTCTGTCACACTAAACTTCTTAAC GAAATTTCCATTCCCAAAATAACTGAGAAAGAGAGAGATACATCTTATAAACTGACTTCTTTGTGGTTTCAAATCAGCCAGCTCATTTGG TTCAGGCATAAATTAGAGAAATGGTTCTGGATATGGTGCAAAAATGAGTTTTCACCTGGTATCCATTATAAACAATCAGGAAGAGGTAAT TTTTCACCTTGCTTTTCAGTTAGACAAGGACCAGGATTGCACTGACATGGCGCTGAGGGTTTTTCTAAGTAAGAACACTGAGATATTGGG ACACACATCAAAAACCTGGAGTGCTCAATTGGAAGTAGTTCTATGAATATGGAAAGGCCAGAGGCAGAGTGAAATAAAATGCTATCTCAA AGTTTAACACAATTTAAGGGCTCAGCATAAGTAAACAACATATTTGGGGTTTGCTTGTAAAACCAACTAAATAAAAAATTCAAACCAATT CACCCAGAAAAAAGACCAATAGGTGCAAAAATAAAAGGAAAACCAGTGAAGTGCCACATGACAGCAGTGTTAAGTGTTTGAAAACGTTTC AAAGCACATATGTGCCAATGTGACAACATGTGGAAAGCCTCAGGAGAGAGTCTAAGATAAAAGCTTAGGCTGATAGACAAGTAGTTAAGA GCTAAGAGCAGTACTCTGAAGGAATAGGCAAAATGTTTATTTTCCTTATTGTTTGTAAACAACAAACTTGGTCTTACATCTGTGTGGTAT AGTAGAAAGGCCAGCTGACTAGATCTCTGGATTCTAATTTTGGCCCTACCTGTAACTTAATTTTGTGACCACAGTTGTACCATTCACCGT GCCTGGGCTCTAGTTTCCTGGTTTGTAAGGCAGCCCCAGCGTTCATGTTCTGTGATAGAGCAGAACTGAACTTATTACCTAATTAACTCT CTGCTATGAGTTGTCAAGACTGATCATTCTGTTTTTTCTGTACACAGAAGTTTAGATGCTTTGTGACTTAAGCAGGTGTGTGGGCTCCTT TAGGCAGGTTACAGTTAACTTTCTAGATTTAGCCAAAATGTAGTTGGTAGAAATTTATGACAGAGAAGAAAAATAGCAAGATTTATTTAA TGCTCTATTCTCTAGTAATTAAAGCAAAATGATATAAGTGATTAGAGAGATTTCATAGATAATATTTCCTAGGAGTCCCGTGCCCAAAAC TCTTGCCATATTTAAGGAAGGGAATGTACTGGCAGAACTTGACTTACAATTTTGGAAACAGAACATGAAATTGAATACTCAGGGATTAGT AGTATATTTTCATCATTTAACACAGTTATTTCACAGTCCATAGGTAGGTGACAATCTGAACGATAATATGAAAGATGAGTACTGGGTTGT TTGTATATGAAAGGAGAGGCTACATAAAATTAAATAATGTTAACATTTGTGCTGTAAATGCAAAAAAGTGCAAGTAAGTGAGAAAGTTAG TGGTTATATATATTTTCAAAAAATTCAAAGTGCAGTTGACCCTTGAATAACATGGGTTTGAACTGTGCAGATTCACTTATGTGTGGTTTT TTTTCCAATCAAATGCAGATAAAAAACCCAGTATTTGGAGAAATGTGAAACCTGAGTATACAGAGGGCGATGTTTTGTATAAGTGAGCGC CACAGGGCCAACAGGCAGGACTTGAGTATGTGAGAATTTGATTATATGGGGGTAATCCTGGAACCAACTCCCTGTTTATACCAAGGGGTG ACCATACAATAAAAACTAAGTTTTGAATTATACAAAAAAGAGTATTTAGTAATTTTGAGAAAATTATTTATGAGAAATCCTGGAATACAT AAAATGGAGTTTTGATCAATTTGCAACATGAAGTAATGTCTTAAGAATGCTAAAGGTTTTGTAGAATTCTGTAGATTGATAGATGTTTGG AAGAAAAAATATTTGATGAATATGGTGTATTTTGGAAAGGTAATTTTCACAGATGGGTATTTCTGGTGGCTATATATTCTACCAAATGTG CCAGGGACAGAAGCTAAAAGAAATGGAAACTTAAGAACCAGGGAAAATGAACTGATATTAAAGAAAAGGCTGTCACTAAAGGAATTTCAG TGGAATTTTTATATGGAAGATAGTTAACTGGTGAGAAGAAGAAATTATGAATTTCACAGATGAGAGTAATCAGAAGGAATAATGCTCTTT AAAATGACATGACCAATATTTTTAAGAGAATACAGAATTAACTAAATTATATATTGATTCAAAATTTTGGTAGATTGACCCAGTTTTAAG TACATAACCATTTTAAAATTTACTGAATGTCAATAATTTTATAGTAGCAGTTTTATTACATTACATAAAATGTCATATTGACTCACTATA TTGTATAGCTTACATTTTTTGCAATATGGCATGATTATCACAAAATGACCCAAGGTAAGTTAAGACATTTGATGACAATGTACTTTCTTA ATATACATGTGAATGTGTTAAAATGTGTTTATAATCTACATTATTTTGTTCTGTGCAGCAACATATTTCCTCATAATGAAGATGTCATCT GTGTCTTTTAAAAATAGTATATAAAATAACTACGTTCCCGATAATTTAATCAGTTAAACTTAATATGGCTTTAGTTTTAATTCTTGATTT CCATAACTTTGCCAAATGTTGAGAAACATGAAACTTGATTGAAATAGATACAGAAGTATGATTTTCATATCACCTTGGGGATACATAAAA ATGTGCTCTATCCAATTACTGGTATAATATGTATCAATTCATTAGATGATACATAGCTCATGGTTTTGTAGAATTCTGGTTTCTAGTGAT AATAAGTATCTTTATTGCCCATTTCACCTGCCATACCTACTTTAAAAATTTGGATGACCAAAGAAGTGAATGAAGTTTCCCCAATCTTTC TTGGGAAAATTCTGTTTAAATTGTTATTGGGAATATATAGATTTGATGTGTTATTTTGGGATGTGTAATATTAATGTCACAATCACTACA GGTCAGATATTTAGGTGTGCTTCGGTCCACAAGTCTTTTGTCATATTGTGCTCGCTTACCTAAACTGTAATTGCCTGATAAAAGATAATA TGACAGTGCTGAGGTGACATTCAAAATGAGACGGGATGCAATAAATGATTGCCTTCAGTATTTACACAGAAAAACCACTTGCTCTTTCTT GGTTATATTTCATAGGGCTAGATTGTTTCCATCGGGCACTATTCTTAATTCACTCTATACCGTGTTGTCACATGCTGCCTCTGGGCAGAG AATCCCACTTAAAATAAGCTTTAGGGGATCTTCTAGAAAAGGGTAGTGTGCTCCAATTTTTATTTCTCCTTCTTATGGGGGATGAGGATC AATTATAGTACAAGACCATTTGGAAAATATGCATATATATTTTAAAATGAAATACAAGATTTCCGCCCTCAAGATCTAAACCAAGTAAGT TTGATGGATGGCAGCACAGATAAAGAAATCTTCTGAATACACATTTGATTTTTATAGCTCTCTCTTTCAAGAGTAGGATATTACTAAGAT ATTCAGTGCAATCTATCAGTCCTAGTAGTATTTAGAAGTGACAGAGTGAATGCTGATATTTGGAAAATTGAGCATTTGACCATTTTGGGG TCACTCACCTGGCCTAGTGGAATAGCTTACACTTTAATACAATCAAACACAGTCTTTCTGATATAAAATGTTTGAATGCCCAGATTTGGT TCCCTAGTTCACTCAAATCATAGATGTTGACATACATCCTGTTTTCATAATGATTTTAAAAAGAATGTAAATCCTATGGTCCTCCAAATA AATGATTCAACGTTTATGTAAATGTCAAGTTTTTGTGAATAGGGATTTATGTCTAAGACAGAAAAATCTACTGAACTAGTTCTTACTGTG GAACTAATTGTGGATAAAACATTCGTTATCATCTAAATTTTAAATGAATGACAACAGTTATGGACACATGCAAAAGGTATCAGACATAGA AAATATTGTTGGGGGAAATTTCTGTGTGGTGCTAATTTCTTGTAAGGGTTGTACAGATAGTGCTTAACTCAAAAAGTGAGAGTGTGGTCA TTGCCAAATAGGGGTTTGCTCAGAGTTCCTTCTCTAGGACTGTATGCAGAAAGCATACACACACACATACGCACACAGAAAAATATGATT TTTATATCATATTCATGAAAATGTGCCATATCCAATTATAACACACACACACACACACACACACACACACACACACACACATAAATAAAT AAATGATGGTTCACATAAGAAATCCTAATTGCTAATTTTAAACCAAACATTTGTAGTTTTGTTTATTGCAACTTTGCTGCATGGGACTTT GCTTTCATAAATCTATATGGGGTTGGGGTTAATTTGCCCTAATTTGCTGACCTGGGACACATGTAATCACTGTTAAACTTACACCCGGTA ACCCTGATGTGTTTACATTTCAAAAGAAATGAAATTGGCCTGGAAAAAAATTTTGGAAGTACTGTAAGTCTTTTTTCTTTTTTTTTCCGA AGGGAAATATTTCAAAAAAGGAAACATTATGAGTAGACACTTCAAAAAAGATAAAATATTTTACATTTGTTTTTTGACTAATGTGCTATA AAAAGGATTATATTTGTGAGAAAAGATACTGATCGCCAATATTTCAAATACCGTCTTGCAATGTATAGTTTTTAGTGACATTGTAGTATA AAGCTGTAATTTGAAATTTTACTTTGGAATGTAAAGTAGAAAATATTAGCTATGTCAATGATATCTTGCAAAGTGTTCCCATTTATAATT ATTTATATTGTAAATAGCTTTCTGAAGTAAATTCGAAGTTAATGTGCATAAAATGTATTTATTATGTGAGGAATTTTTTGGTTTAAAATA >In-frame_PREP_ENST00000369110_chr6_105736633_-_SIM1_ENST00000262901_chr6_100868834_-_4439nt CTGCTTTCTGCACGTCCTCGCCGCCGCGCCGCCAGTCCGTTTGTGCTAGCTCTGGCCGTGAGCCGGCCGCCCGCTGCCGCCGGCCGCCCC GCAGCTGCCTGCGCCCCAGCCGCGCCTCGCGGCCAGCCCGGCTAGCTCAGGTCCGCTCCCGGAGCCCGCGCCCCTCCACGCTGCCCCCTG CCTGTCCCCGGCCATGCTGTCCCTTCAGTACCCCGACGTGTACCGCGACGAGACCGCCGTACAGGATTATCATGGTCATAAAATTTGTGA CCCTTACGCCTGGCTTGAAGACCCCGACAGTGAACAGACTAAGGCCTTTGTGGAGGCCCAGAATAAGATTACTGTGCCATTTCTTGAGCA GTGTCCCATCAGAGGTTTATACAAAGAGAGAATGACTGAACTATATGATTATCCCAAGTATAGTTGCCACTTCAAGAAAGGAAAACGGTA TTTTTATTTTTACAATACAGGTTTGCAGAACCAGCGAGTATTATATGTACAGGATTCCTTAGAGGGTGAGGCCAGAGTGTTCCTGGACCC CAACATACTGTCTGACGATGGCACAGTGGCACTCCGAGGTTATGCGTTCAGCGAAGATGGTGAATATTTTGCCTATGGTCTGAGTGCCAG TGGCTCAGACTGGGTGACAATCAAGTTCATGAAAGTTGATGGTGCCAAAGAGCTTCCAGATGTGCTTGAAAGAGTCAAGTTCAGCTGTAT GGCCTGGACCCATGATGGGAAGGGAATGTTCTACAACTCATACCCTCAACAGGATGGAAAAAGTGATGGCACAGAGACATCTACCAATCT CCACCAAAAGCTCTACTACCATGTCTTGGGAACCGATCAGTCAGAAGATATTTTGTGTGCTGAGTTTCCTGATGAACCTAAATGGATGGG TGGAGCTGAGTTATCTGATGATGGCCGCTATGTCTTGTTATCAATAAGGGAAGGATGTGATCCAGTAAACCGACTCTGGTACTGTGACCT ACAGCAGGAATCCAGTGGCATCGCGGGAATCCTGAAGTGGGTAAAACTGATTGACAACTTTGAAGGGGAATATGACTACGTGACCAATGA GGGGACGGTGTTCACATTCAAGACGAATCGCCAGTCTCCCAACTATCGCGTGATCAACATTGACTTCAGGGATCCTGAAGAGTCTAAGTG GAAAGTACTTGTTCCTGAGCATGAGAAAGATGTCTTAGAATGGATAGCTTGTGTCAGGTCCAACTTCTTGGTCTTATGCTACCTCCATGA CGTCAAGAACATTCTGCAGCTCCATGACCTGACTACTGGTGCTCTCCTTAAGACCTTCCCGCTCGATGTCGGCAGCATTGTAGGGTACAG CGGTCAGAAGAAGGACACTGAAATCTTCTATCAGTTTACTTCCTTTTTATCTCCAGGTATCATTTATCACTGTGATCTTACCAAAGAGGA GCTGGAGCCAAGAGTTTTCCGAGAGGTGACCGTAAAAGGAATTGATGCTTCTGATTACCAGACAGTCCAGATTTTCTACCCTAGCAAGGA TGGTACGAAGATTCCAATGTTCATTGTGCATAAAAAAGGCATAAAATTGGATGGCTCTCATCCAGCTTTCTTATATGGCTATGGCGGCTT CAACATATCCATCACACCCAACTACAGAGACACAGAATACAAAGGGCTGCAGCTCTCCCTGGATCAGATCTCAGCCTCCAAACCAGCCTT CTCCTATACCAGCAGCTCCACCCCCACCATGACTGACAACAGAAAGGGGGCCAAATCCCGGCTCTCCAGCTCAAAGTCAAAATCCAGGAC TTCCCCATACCCTCAGTATTCGGGATTTCACACAGAAAGATCGGAATCTGATCATGACAGCCAGTGGGGCGGAAGTCCCTTGACCGACAC GGCCTCTCCGCAGCTTCTGGACCCCGCCGATAGGCCTGGCTCCCAGCACGACGCATCGTGCGCCTACAGACAGTTTTCGGACCGCAGCTC TCTCTGCTATGGCTTTGCGCTTGACCACTCGAGGCTGGTGGAAGAGAGGCATTTCCATACCCAGGCCTGTGAAGGAGGCCGATGTGAGGC AGGCAGGTACTTCCTGGGAACGCCGCAGGCCGGGAGGGAGCCCTGGTGGGGCTCTCGCGCAGCCTTGCCCCTGACAAAGGCCTCCCCAGA AAGCAGAGAAGCCTATGAAAACAGCATGCCTCACATCGCTTCAGTCCACAGGATCCATGGGCGAGGTCATTGGGATGAAGATAGTGTGGT CAGTTCTCCAGACCCTGGGTCGGCCAGTGAATCAGGTGACCGATATCGTACTGAGCAGTATCAAAGTAGCCCACATGAACCCAGCAAAAT TGAAACTCTTATAAGAGCCACTCAGCAAATGATTAAAGAAGAAGAGAACAGATTACAGCTAAGGAAAGCCCCCTCAGACCAACTGGCTTC CATTAATGGGGCTGGGAAAAAACACTCCCTGTGTTTTGCAAACTACCAACAGCCCCCACCAACAGGTGAAGTCTGCCATGGCTCTGCTCT TGCCAACACTTCACCATGTGACCATATCCAGCAGAGAGAGGGAAAAATGTTGAGCCCCCATGAAAATGACTATGACAACAGTCCCACCGC ACTATCTCGGATAAGTAGTCCCAATTCGGATCGCATTTCAAAATCCAGTTTGATCCTAGCTAAAGACTATCTGCATTCGGATATATCTCC TCATCAGACAGCAGGAGACCACCCTACTGTCTCTCCAAACTGCTTTGGCTCTCACCGGCAGTATTTTGACAAGCATGCTTACACATTAAC TGGATATGCCCTGGAGCACTTATATGACAGCGAAACCATTAGAAACTATTCCTTGGGCTGTAATGGCTCACACTTTGATGTAACTTCCCA TCTGAGGATGCAACCAGACCCAGCACAAGGACACAAGGGAACATCTGTTATAATAACCAACGGAAGCTGATGTTTTGCTGAAATATTTTG TTCTTTAAGGATCTCTGAAACATATTTATAGTTTAATACCCCATTACCAGCATTTACTATGCCACAGATTGTTAGAGAGTATAACTTAAG TTACTGGGTATTTGATACGTGTTCCTATAAAATCAAAGAAAACATAGCACTAGCATTCAGGGTTATACACAGAAAAGGGAGCTAAATTGA ATACACAAATTTCCCCTCTAATTATATGGGAACCAGAATAGATAAATTTTGACTTGAAAAATATTCATGTAGATCAAGTGTGCATATATA CTACATGAGAGGACTGATGAATGACAACATTGCATTGTGACTATCCAGTGATCCTCAAACACACAAACTATTACTTACAAACTGCGGTAT ACATTTTACATATGGAAATATAGGCTATGTAATGTAAATACATCAAAAATGGGTAATTTTCTTTGACTCTGTCACACTAAACTTCTTAAC GAAATTTCCATTCCCAAAATAACTGAGAAAGAGAGAGATACATCTTATAAACTGACTTCTTTGTGGTTTCAAATCAGCCAGCTCATTTGG TTCAGGCATAAATTAGAGAAATGGTTCTGGATATGGTGCAAAAATGAGTTTTCACCTGGTATCCATTATAAACAATCAGGAAGAGGTAAT TTTTCACCTTGCTTTTCAGTTAGACAAGGACCAGGATTGCACTGACATGGCGCTGAGGGTTTTTCTAAGTAAGAACACTGAGATATTGGG ACACACATCAAAAACCTGGAGTGCTCAATTGGAAGTAGTTCTATGAATATGGAAAGGCCAGAGGCAGAGTGAAATAAAATGCTATCTCAA AGTTTAACACAATTTAAGGGCTCAGCATAAGTAAACAACATATTTGGGGTTTGCTTGTAAAACCAACTAAATAAAAAATTCAAACCAATT CACCCAGAAAAAAGACCAATAGGTGCAAAAATAAAAGGAAAACCAGTGAAGTGCCACATGACAGCAGTGTTAAGTGTTTGAAAACGTTTC AAAGCACATATGTGCCAATGTGACAACATGTGGAAAGCCTCAGGAGAGAGTCTAAGATAAAAGCTTAGGCTGATAGACAAGTAGTTAAGA GCTAAGAGCAGTACTCTGAAGGAATAGGCAAAATGTTTATTTTCCTTATTGTTTGTAAACAACAAACTTGGTCTTACATCTGTGTGGTAT AGTAGAAAGGCCAGCTGACTAGATCTCTGGATTCTAATTTTGGCCCTACCTGTAACTTAATTTTGTGACCACAGTTGTACCATTCACCGT GCCTGGGCTCTAGTTTCCTGGTTTGTAAGGCAGCCCCAGCGTTCATGTTCTGTGATAGAGCAGAACTGAACTTATTACCTAATTAACTCT CTGCTATGAGTTGTCAAGACTGATCATTCTGTTTTTTCTGTACACAGAAGTTTAGATGCTTTGTGACTTAAGCAGGTGTGTGGGCTCCTT |
Top |
FusionGenePPI for PREP_SIM1 |
Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in . |
Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160) |
Hgene | Hgene's interactors | Tgene | Tgene's interactors |
PREP | SIRT7, TARS, BRCA1, SGOL2, AIP, GLO1, HSPA4, HSPD1, HSPH1, KCNAB2, PABPC4, ALDOA, ALDOC, CTH, GDI2, ISYNA1, ME1, OAT, PABPC1, QDPR, NTRK1, SUCO, MTG2, NSMAF, RHPN1, KIF3A, SNCA | SIM1 | ARNT, HSP90AA1 |
- Retained PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with |
- Lost PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
- Retained PPIs, but lost function due to frame-shift fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
Top |
RelatedDrugs for PREP_SIM1 |
Drugs targeting genes involved in this fusion gene. (DrugBank Version 5.1.0 2018-04-02) |
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status |
Top |
RelatedDiseases for PREP_SIM1 |
Diseases associated with fusion partners. (DisGeNet 4.0) |
Partner | Gene | Disease ID | Disease name | # pubmeds | Source |
Hgene | PREP | C0002622 | Amnesia | 1 | CTD_human |
Hgene | PREP | C0002624 | Retrograde amnesia | 1 | CTD_human |
Hgene | PREP | C0005586 | Bipolar Disorder | 1 | PSYGENET |
Hgene | PREP | C0038356 | Stomach Neoplasms | 1 | CTD_human |
Hgene | PREP | C0338831 | Manic | 1 | PSYGENET |
Tgene | SIM1 | C0020456 | Hyperglycemia | 1 | CTD_human |
Tgene | SIM1 | C1458155 | Mammary Neoplasms | 1 | CTD_human |