KEGG   Homo sapiens (human): 84033
Entry
84033             CDS       T01001                                 
Symbol
OBSCN, ARHGEF30, RHABDO1, UNC89
Name
(RefSeq) obscurin, cytoskeletal calmodulin and titin-interacting RhoGEF
  KO
K17531  obscurin-RhoGEF [EC:2.7.11.1]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    84033 (OBSCN)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01001 Protein kinases [BR:hsa01001]
    84033 (OBSCN)
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:hsa04131]
    84033 (OBSCN)
Enzymes [BR:hsa01000]
 2. Transferases
  2.7  Transferring phosphorus-containing groups
   2.7.11  Protein-serine/threonine kinases
    2.7.11.1  non-specific serine/threonine protein kinase
     84033 (OBSCN)
Protein kinases [BR:hsa01001]
 Serine/threonine kinases: CAMK group
  Trio family [OT]
   84033 (OBSCN)
Membrane trafficking [BR:hsa04131]
 Others
  Rho GTPase associated proteins
   Rho guanine nucleotide exchange factors (GEFs)
    84033 (OBSCN)
SSDB
Motif
Pfam: I-set Ig_3 ig Ig_2 V-set Pkinase Ig_5 PK_Tyr_Ser-Thr Ig_4 C1-set C2-set_2 Izumo-Ig SOS1_NGEF_PH fn3 RhoGEF PH Ig_DSCAM-DSCAML_C ABC1 IQ IQ_SEC7_PH
Other DBs
NCBI-GeneID: 84033
NCBI-ProteinID: NP_001373054
OMIM: 608616
HGNC: 15719
Ensembl: ENSG00000154358
UniProt: Q5VST9
Structure
LinkDB
Position
1:228208044..228378876
AA seq 8925 aa
MDQPQFSGAPRFLTRPKAFVVSVGKDATLSCQIVGNPTPQVSWEKDQQPVAAGARFRLAQ
DGDLYRLTILDLALGDSGQYVCRARNAIGEAFAAVGLQVDAEAACAEQAPHFLLRPTSIR
VREGSEATFRCRVGGSPRPAVSWSKDGRRLGEPDGPRVRVEELGEASALRIRAARPRDGG
TYEVRAENPLGAASAAAALVVDSDAADTASRPGTSTAALLAHLQRRREAMRAEGAPASPP
STGTRTCTVTEGKHARLSCYVTGEPKPETVWKKDGQLVTEGRRHVVYEDAQENFVLKILF
CKQSDRGLYTCTASNLVGQTYSSVLVVVREPAVPFKKRLQDLEVREKESATFLCEVPQPS
TEAAWFKEETRLWASAKYGIEEEGTERRLTVRNVSADDDAVYICETPEGSRTVAELAVQG
NLLRKLPRKTAVRVGDTAMFCVELAVPVGPVHWLRNQEEVVAGGRVAISAEGTRHTLTIS
QCCLEDVGQVAFMAGDCQTSTQFCVSAPRKPPLQPPVDPVVKARMESSVILSWSPPPHGE
RPVTIDGYLVEKKKLGTYTWIRCHEAEWVATPELTVADVAEEGNFQFRVSALNSFGQSPY
LEFPGTVHLAPKLAVRTPLKAVQAVEGGEVTFSVDLTVASAGEWFLDGQALKASSVYEIH
CDRTRHTLTIREVPASLHGAQLKFVANGIESSIRMEVRAAPGLTANKPPAAAAREVLARL
HEEAQLLAELSDQAAAVTWLKDGRTLSPGPKYEVQASAGRRVLLVRDVARDDAGLYECVS
RGGRIAYQLSVQGLARFLHKDMAGSCVDAVAGGPAQFECETSEAHVHVHWYKDGMELGHS
GERFLQEDVGTRHRLVAATVTRQDEGTYSCRVGEDSVDFRLRVSEPKAVFAKEQPACREV
QAEVGASATLSCEVAQDQMEVTWYKDGKKLSSSSKVHVEAVGCMRRLVVQQVGQADSGEY
SCEARGQRVSFRLDVAEPKVVFAKEQLARRKLQAEAGASATLSCEVAQAQTEVTWYKDGK
KLSSSSKVCMEATGCTRRLVVQQAGQADAGEYSCEAGGQRLSFHLDVKEPKVVFAKDQVA
HSEVQAEAGASATLSCEVAQAQTEVMWYKDGKKLSSSLKVHVEAKGCRRRLVVQQAGKTD
AGDYSCEARGQRVSFRLHITEPKMMFAKEQSVHNEVQAEAGASAMLSCEVAQAQTEVTWY
KDGKKLSSSSKVGMEVKGCTRRLVLPQAGKADAGEYSCEAGGQRVSFHLHITEPKGVFAK
EQSVHNEVQAEAGTTAMLSCEVAQPQTEVTWYKDGKKLSSSSKVRMEVKGCTRRLVVQQV
GKADAGEYSCEAGGQRVSFQLHITEPKAVFAKEQLVHNEVRTEAGASATLSCEVAQAQTE
VTWYKDGKKLSSSSKVRIEAAGCMRQLVVQQAGQADAGEYTCEAGGQRLSFHLDVSEPKA
VFAKEQLAHRKVQAEAGAIATLSCEVAQAQTEVTWYKDGKKLSSSSKVRMEAVGCTRRLV
VQQACQADTGEYSCEAGGQRLSFSLDVAEPKVVFAKEQPVHREVQAQAGASTTLSCEVAQ
AQTEVMWYKDGKKLSFSSKVRMEAVGCTRRLVVQQAGQAVAGEYSCEAGSQRLSFHLHVA
EPKAVFAKEQPAHREVQAEAGASATLSCEVAQAQTEVTWYKDGKKLSSSLKVHVEAAGCT
RRLVVQQAGQADTGEYSCEAGGQQLSFRLQVAEPKAVFAKEQPASREVQAEAGTSATLSC
EVAQAQTEVTWYKDGKKLSSSSKVRMEAVGCTRRLVVQEAGQADAGEYSCKAGDQRLSFH
LHVAEPKVVFAKEQPAHREVQAEAGASATLSCEVAQAQTEVTWYKDGKKLSSSSKVRVEA
VGCTRRLVVQQAGQAEAGEYSCEAGGQQLSFRLQVAEPKAVFAKEQVVFAKDQPVHREVQ
AEAGTSTMLSCEVAQAQTEVMWYKDGKKLSSSSKMRVEAVGCTRRLVVQEAGQADAGEYS
CEAGGQRLSFHLHVAEPKVVFAKEQPACREVQAEAGASATLSCEVAQGQMEVTWYKDGKK
LSSSSKVHMEASGYTRRLVVQQAGQADAGEYSCEAGGQRLSFRLHVAELEPQISERPCRR
EPLVVKEHEDIILTATLATPSAATVTWLKDGVEIRRSKRHETASQGDTHTLTVHGAQVLD
SAIYSCRVGAEGQDFPVQVEEVAAKFCRLLEPVCGELGGTVTLACELSPACAEVVWRCGN
TQLRVGKRFQMVAEGPVRSLTVLGLRAEDAGEYVCESRDDHTSAQLTVSVPRVVKFMSGL
STVVAEEGGEATFQCVVSPSDVAVVWFRDGALLQPSEKFAISQSGASHSLTISDLVLEDA
GQITVEAEGASSSAALRVREAPVLFKKKLEPQTVEERSSVTLEVELTRPWPELRWTRNAT
ALAPGKNVEIHAEGARHRLVLHNVGFADRGFFGCETPDDKTQAKLTVEMRQVRLVRGLQA
VEAREQGTATMEVQLSHADVDGSWTRDGLRFQQGPTCHLAVRGPMHTLTLSGLRPEDSGL
MVFKAEGVHTSARLVVTELPVSFSRPLQDVVTTEKEKVTLECELSRPNVDVRWLKDGVEL
RAGKTMAIAAQGACRSLTIYRCEFADQGVYVCDAHDAQSSASVKVQGRNIQIVRPLEDVE
VMEKDGATFSCEVSHDEVPGQWFWEGSKLRPTDNVRIRQEGRTYTLIYRRVLAEDAGEIQ
FVAENAESRAQLRVKELPVTLVRPLRDKIAMEKHRGVLECQVSRASAQVRWFKGSQELQP
GPKYELVSDGLYRKLIISDVHAEDEDTYTCDAGDVKTSAQFFVEEQSITIVRGLQDVTVM
EPAPAWFECETSIPSVRPPKWLLGKTVLQAGGNVGLEQEGTVHRLMLRRTCSTMTGPVHF
TVGKSRSSARLVVSDIPVVLTRPLEPKTGRELQSVVLSCDFRPAPKAVQWYKDDTPLSPS
EKFKMSLEGQMAELRILRLMPADAGVYRCQAGSAHSSTEVTVEAREVTVTGPLQDAEATE
EGWASFSCELSHEDEEVEWSLNGMPLYNDSFHEISHKGRRHTLVLKSIQRADAGIVRASS
LKVSTSARLEVRVKPVVFLKALDDLSAEERGTLALQCEVSDPEAHVVWRKDGVQLGPSDK
YDFLHTAGTRGLVVHDVSPEDAGLYTCHVGSEETRARVRVHDLHVGITKRLKTMEVLEGE
SCSFECVLSHESASDPAMWTVGGKTVGSSSRFQATRQGRKYILVVREAAPSDAGEVVFSV
RGLTSKASLIVRERPAAIIKPLEDQWVAPGEDVELRCELSRAGTPVHWLKDRKAIRKSQK
YDVVCEGTMAMLVIRGASLKDAGEYTCEVEASKSTASLHVEEKANCFTEELTNLQVEEKG
TAVFTCKTEHPAATVTWRKGLLELRASGKHQPSQEGLTLRLTISALEKADSDTYTCDIGQ
AQSRAQLLVQGRRVHIIEDLEDVDVQEGSSATFRCRISPANYEPVHWFLDKTPLHANELN
EIDAQPGGYHVLTLRQLALKDSGTIYFEAGDQRASAALRVTEKPSVFSRELTDATITEGE
DLTLVCETSTCDIPVCWTKDGKTLRGSARCQLSHEGHRAQLLITGATLQDSGRYKCEAGG
ACSSSIVRVHARPVRFQEALKDLEVLEGGAATLRCVLSSVAAPVKWCYGNNVLRPGDKYS
LRQEGAMLELVVRNLRPQDSGRYSCSFGDQTTSATLTVTALPAQFIGKLRNKEATEGATA
TLRCELSKAAPVEWRKGSETLRDGDRYCLRQDGAMCELQIRGLAMVDAAEYSCVCGEERT
SASLTIRPMPAHFIGRLRHQESIEGATATLRCELSKAAPVEWRKGRESLRDGDRHSLRQD
GAVCELQICGLAVADAGEYSCVCGEERTSATLTVKALPAKFTEGLRNEEAVEGATAMLWC
ELSKVAPVEWRKGPENLRDGDRYILRQEGTRCELQICGLAMADAGEYLCVCGQERTSATL
TIRALPARFIEDVKNQEAREGATAVLQCELNSAAPVEWRKGSETLRDGDRYSLRQDGTKC
ELQIRGLAMADTGEYSCVCGQERTSAMLTVRALPIKFTEGLRNEEATEGATAVLRCELSK
MAPVEWWKGHETLRDGDRHSLRQDGARCELQIRGLVAEDAGEYLCMCGKERTSAMLTVRA
MPSKFIEGLRNEEATEGDTATLWCELSKAAPVEWRKGHETLRDGDRHSLRQDGSRCELQI
RGLAVVDAGEYSCVCGQERTSATLTVRALPARFIEDVKNQEAREGATAVLQCELSKAAPV
EWRKGSETLRGGDRYSLRQDGTRCELQIHGLSVADTGEYSCVCGQERTSATLTVRALPAR
FTQDLKTKEASEGATATLQCELSKVAPVEWKKGPETLRDGGRYSLKQDGTRCELQIHDLS
VADAGEYSCMCGQERTSATLTVRALPARFTEGLRNEEAMEGATATLQCELSKAAPVEWRK
GLEALRDGDKYSLRQDGAVCELQIHGLAMADNGVYSCVCGQERTSATLTVRALPARFIED
MRNQKATEGATVTLQCKLRKAAPVEWRKGPNTLKDGDRYSLKQDGTSCELQIRGLVIADA
GEYSCICEQERTSATLTVRALPARFIEDVRNHEATEGATAVLQCELSKAAPVEWRKGSET
LRDGDRYSLRQDGTRCELQIRGLAVEDTGEYLCVCGQERTSATLTVRALPARFIDNMTNQ
EAREGATATLHCELSKVAPVEWRKGPETLRDGDRHSLRQDGTRCELQIRGLSVADAGEYS
CVCGQERTSATLTIRALPAKFTKGLRNEEATEGATAMLQCELSKVAPVEWRKGPETLRDG
DRYNLRQDGTRCELQIHGLSVADTGEYSCVCGQEKTSATLTVKAPQPVFREPLQSLQAEE
GSTATLQCELSEPTATVVWSKGGLQLQANGRREPRLQGCTAELVLQDLQREDTGEYTCTC
GSQATSATLTVTAAPVRFLRELQHQEVDEGGTAHLCCELSRAGASVEWRKGSLQLFPCAK
YQMVQDGAAAELLVRGVEQEDAGDYTCDTGHTQSMASLSVRVPRPKFKTRLQSLEQETGD
IARLCCQLSDAESGAVVQWLKEGVELHAGPKYEMRSQGATRELLIHQLEAKDTGEYACVT
GGQKTAASLRVTEPEVTIVRGLVDAEVTADEDVEFSCEVSRAGATGVQWCLQGLPLQSNE
VTEVAVRDGRIHTLRLKGVTPEDAGTVSFHLGNHASSAQLTVRAPEVTILEPLQDVQLSE
GQDASFQCRLSRASGQEARWALGGVPLQANEMNDITVEQGTLHLLTLHKVTLEDAGTVSF
HVGTCSSEAQLKVTAKNTVVRGLENVEALEGGEALFECQLSQPEVAAHTWLLDDEPVHTS
ENAEVVFFENGLRHLLLLKNLRPQDSCRVTFLAGDMVTSAFLTVRGWRLEILEPLKNAAV
RAGAQACFTCTLSEAVPVGEASWYINGAAVQPDDSDWTVTADGSHHALLLRSAQPHHAGE
VTFACRDAVASARLTVLGLPDPPEDAEVVARSSHTVTLSWAAPMSDGGGGLCGYRVEVKE
GATGQWRLCHELVPGPECVVDGLAPGETYRFRVAAVGPVGAGEPVHLPQTVRLAEPPKPV
PPQPSAPESRQVAAGEDVSLELEVVAEAGEVIWHKGMERIQPGGRFEVVSQGRQQMLVIK
GFTAEDQGEYHCGLAQGSICPAAATFQVALSPASVDEAPQPSLPPEAAQEGDLHLLWEAL
ARKRRMSREPTLDSISELPEEDGRSQRLPQEAEEVAPDLSEGYSTADELARTGDADLSHT
SSDDESRAGTPSLVTYLKKAGRPGTSPLASKVGAPAAPSVKPQQQQEPLAAVRPPLGDLS
TKDLGDPSMDKAAVKIQAAFKGYKVRKEMKQQEGPMFSHTFGDTEAQVGDALRLECVVAS
KADVRARWLKDGVELTDGRHHHIDQLGDGTCSLLITGLDRADAGCYTCQVSNKFGQVTHS
ACVVVSGSESEAESSSGGELDDAFRRAARRLHRLFRTKSPAEVSDEELFLSADEGPAEPE
EPADWQTYREDEHFICIRFEALTEARQAVTRFQEMFATLGIGVEIKLVEQGPRRVEMCIS
KETPAPVVPPEPLPSLLTSDAAPVFLTELQNQEVQDGYPVSFDCVVTGQPMPSVRWFKDG
KLLEEDDHYMINEDQQGGHQLIITAVVPADMGVYRCLAENSMGVSSTKAELRVDLTSTDY
DTAADATESSSYFSAQGYLSSREQEGTESTTDEGQLPQVVEELRDLQVAPGTRLAKFQLK
VKGYPAPRLYWFKDGQPLTASAHIRMTDKKILHTLEIISVTREDSGQYAAYISNAMGAAY
SSARLLVRGPDEPEEKPASDVHEQLVPPRMLERFTPKKVKKGSSITFSVKVEGRPVPTVH
WLREEAERGVLWIGPDTPGYTVASSAQQHSLVLLDVGRQHQGTYTCIASNAAGQALCSAS
LHVSGLPKVEEQEKVKEALISTFLQGTTQAISAQGLETASFADLGGQRKEEPLAAKEALG
HLSLAEVGTEEFLQKLTSQITEMVSAKITQAKLQVPGGDSDEDSKTPSASPRHGRSRPSS
SIQESSSESEDGDARGEIFDIYVVTADYLPLGAEQDAITLREGQYVEVLDAAHPLRWLVR
TKPTKSSPSRQGWVSPAYLDRRLKLSPEWGAAEAPEFPGEAVSEDEYKARLSSVIQELLS
SEQAFVEELQFLQSHHLQHLERCPHVPIAVAGQKAVIFRNVRDIGRFHSSFLQELQQCDT
DDDVAMCFIKNQAAFEQYLEFLVGRVQAESVVVSTAIQEFYKKYAEEALLAGDPSQPPPP
PLQHYLEQPVERVQRYQALLKELIRNKARNRQNCALLEQAYAVVSALPQRAENKLHVSLM
ENYPGTLQALGEPIRQGHFIVWEGAPGARMPWKGHNRHVFLFRNHLVICKPRRDSRTDTV
SYVFRNMMKLSSIDLNDQVEGDDRAFEVWQEREDSVRKYLLQARTAIIKSSWVKEICGIQ
QRLALPVWRPPDFEEELADCTAELGETVKLACRVTGTPKPVISWYKDGKAVQVDPHHILI
EDPDGSCALILDSLTGVDSGQYMCFAASAAGNCSTLGKILVQVPPRFVNKVRASPFVEGE
DAQFTCTIEGAPYPQIRWYKDGALLTTGNKFQTLSEPRSGLLVLVIRAASKEDLGLYECE
LVNRLGSARASAELRIQSPMLQAQEQCHREQLVAAVEDTTLERADQEVTSVLKRLLGPKA
PGPSTGDLTGPGPCPRGAPALQETGSQPPVTGTSEAPAVPPRVPQPLLHEGPEQEPEAIA
RAQEWTVPIRMEGAAWPGAGTGELLWDVHSHVVRETTQRTYTYQAIDTHTARPPSMQVTI
EDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLSQQQEGTTYSLVLRHVASKD
AGVYTCLAQNTGGQVLCKAELLVLGGDNEPDSEKQSHRRKLHSFYEVKEEIGRGVFGFVK
RVQHKGNKILCAAKFIPLRSRTRAQAYRERDILAALSHPLVTGLLDQFETRKTLILILEL
CSSEELLDRLYRKGVVTEAEVKVYIQQLVEGLHYLHSHGVLHLDIKPSNILMVHPAREDI
KICDFGFAQNITPAELQFSQYGSPEFVSPEIIQQNPVSEASDIWAMGVISYLSLTCSSPF
AGESDRATLLNVLEGRVSWSSPMAAHLSEDAKDFIKATLQRAPQARPSAAQCLSHPWFLK
SMPAEEAHFINTKQLKFLLARSRWQRSLMSYKSILVMRSIPELLRGPPDSPSLGVARHLC
RDTGGSSSSSSSSDNELAPFARAKSLPPSPVTHSPLLHPRGFLRPSASLPEEAEASERST
EAPAPPASPEGAGPPAAQGCVPRHSVIRSLFYHQAGESPEHGALAPGSRRHPARRRHLLK
GGYIAGALPGLREPLMEHRVLEEEAAREEQATLLAKAPSFETALRLPASGTHLAPGHSHS
LEHDSPSTPRPSSEACGEAQRLPSAPSGGAPIRDMGHPQGSKQLPSTGGHPGTAQPERPS
PDSPWGQPAPFCHPKQGSAPQEGCSPHPAVAPCPPGSFPPGSCKEAPLVPSSPFLGQPQA
PPAPAKASPPLDSKMGPGDISLPGRPKPGPCSSPGSASQASSSQVSSLRVGSSQVGTEPG
PSLDAEGWTQEAEDLSDSTPTLQRPQEQATMRKFSLGGRGGYAGVAGYGTFAFGGDAGGM
LGQGPMWARIAWAVSQSEEEEQEEARAESQSEEQQEARAESPLPQVSARPVPEVGRAPTR
SSPEPTPWEDIGQVSLVQIRDLSGDAEAADTISLDISEVDPAYLNLSDLYDIKYLPFEFM
IFRKVPKSAQPEPPSPMAEEELAEFPEPTWPWPGELGPHAGLEITEESEDVDALLAEAAV
GRKRKWSSPSRSLFHFPGRHLPLDEPAELGLRERVKASVEHISRILKGRPEGLEKEGPPR
KKPGLASFRLSGLKSWDRAPTFLRELSDETVVLGQSVTLACQVSAQPAAQATWSKDGAPL
ESSSRVLISATLKNFQLLTILVVVAEDLGVYTCSVSNALGTVTTTGVLRKAERPSSSPCP
DIGEVYADGVLLVWKPVESYGPVTYIVQCSLEGGSWTTLASDIFDCCYLTSKLSRGGTYT
FRTACVSKAGMGPYSSPSEQVLLGGPSHLASEEESQGRSAQPLPSTKTFAFQTQIQRGRF
SVVRQCWEKASGRALAAKIIPYHPKDKTAVLREYEALKGLRHPHLAQLHAAYLSPRHLVL
ILELCSGPELLPCLAERASYSESEVKDYLWQMLSATQYLHNQHILHLDLRSENMIITEYN
LLKVVDLGNAQSLSQEKVLPSDKFKDYLETMAPELLEGQGAVPQTDIWAIGVTAFIMLSA
EYPVSSEGARDLQRGLRKGLVRLSRCYAGLSGGAVAFLRSTLCAQPWGRPCASSCLQCPW
LTEEGPACSRPAPVTFPTARLRVFVRNREKRRALLYKRHNLAQVR
NT seq 26778 nt   +upstreamnt  +downstreamnt
atggatcagccacagttcagcggggcgccccgctttctcacccggcccaaggccttcgtg
gtgtcggtgggcaaggacgccaccctcagctgccagatcgtgggtaatcccacgccacag
gtgagctgggagaaggaccagcagccggtggcggccggcgcgcgcttccgtctggcccag
gacggcgacctctaccgcctcactatcctggacctggcgctgggcgacagtgggcaatac
gtgtgccgcgcgcgcaatgccataggcgaggccttcgctgctgtgggcctgcaggtggac
gcggaggccgcgtgcgccgagcaggcgccgcacttcctgctgcggcccacgtccatccgc
gtgcgcgagggctcagaggccaccttccgctgccgcgtgggtggctccccgaggccggca
gtgagctggtccaaggacgggcggcgcctgggtgagcccgacggcccccgcgtgcgcgtg
gaggagctcggcgaggcaagtgcgctgcgcattcgggcggcgcggccgcgcgacggcggc
acttacgaggtccgcgccgagaacccgctgggcgctgccagcgccgccgcggcgctagtg
gtggactcggacgccgcggacacggccagccggcccgggacctccacggccgcgctcctg
gcgcacctgcagcggcggcgcgaggctatgcgcgccgagggcgcccccgcctcaccgccc
agcaccggcacgcgcacctgcacggtgactgaaggcaagcacgcgcgcctcagctgctac
gtgaccggcgagcccaagcccgagacggtgtggaagaaggacggccagctggtgaccgag
ggccggcgccacgtggtgtacgaggacgcgcaggagaacttcgtgctcaagatcctcttc
tgcaagcagtcggaccgcggcctctacacctgcacggcgtccaacctcgtgggccagacc
tacagctctgtgctggtcgtagtgcgcgagcccgcggttcccttcaaaaagcggctgcaa
gatctggaggtgcgggagaaggagtcggccacgttcctatgtgaggtgccccagccgtcc
actgaggccgcgtggttcaaggaggagacgcggttgtgggcgagcgccaagtacggcatc
gaggaggagggcaccgagcgccgcctgaccgtgcgcaatgtctcggccgacgacgacgcg
gtgtacatctgcgagacgccagagggcagccgcacggtggcggagctcgcagtccaagga
aacctcctccgaaagctccctcggaagacggcggtgcgcgtgggcgacacggctatgttt
tgcgtggagctggcggtcccggtgggccccgtccactggctgcggaaccaggaggaagtg
gtggcggggggccgcgtggccatctccgcggagggcacgcgccacacactaaccatctcc
cagtgctgcctggaggatgtgggccaggtggcctttatggctggcgactgccagacgtcc
acccagttctgcgtgtcggcccccaggaagcctcccctgcaaccccctgtggatcctgtg
gtaaaggccaggatggagagttccgtgattctcagctggtccccaccaccccatggggaa
cgccctgtcactatcgacggctacctggtagagaagaagaagcttggcacctacacctgg
atcaggtgccacgaggctgaatgggtggctacacctgagctgaccgtggctgatgtggcg
gaggaggggaacttccagttccgagtgtccgctctcaacagctttggtcagagtccctac
ctcgagttcccggggactgtccacctggcccccaagctggccgtgaggacaccgctgaag
gcggtgcaggcggtagagggtggcgaggtcactttctccgtggacctcacggtggcctca
gcgggtgagtggttcctggatgggcaggccctgaaggccagcagtgtgtatgagatccac
tgtgatcgcacccggcacacgctcaccatccgggaggtgcccgccagcctgcacggggcg
cagctgaagttcgtggccaacggcattgagagcagcatccggatggaggtccgggcggcc
ccagggctgactgccaacaagccgccagccgcagctgcccgggaggtgctggctcggctg
cacgaggaggcgcagctgctggctgagctgtcagatcaggctgcggctgtgacgtggctg
aaggatggtcgcacactgtccccaggccccaagtatgaggtgcaggcatcggccgggcgg
cgggtgctccttgtgcgagatgtggcccgggacgatgcaggcctctacgagtgcgtcagc
cgcgggggccgcatcgcctaccagctctccgtgcaaggcctcgcgcgctttctgcacaag
gacatggcgggcagctgtgtggatgccgtggctgggggcccggcgcagtttgagtgtgag
acctccgaagcccacgtccacgtgcactggtacaaggatggcatggagctgggccactcc
ggtgagcgcttcttgcaggaggatgtggggacgcggcaccggctggtggcagccacagtc
accaggcaggatgaaggcacctactcctgccgcgtgggcgaggactctgtggacttccgg
ctccgcgtctctgagcccaaggcggtgtttgccaaggagcagccggcgtgcagggaggtg
caggctgaggtgggggccagcgccacgctgagctgtgaggtggcccaggaccagatggag
gtgacgtggtacaaggacgggaagaagttgagctccagctcgaaagtgcacgtggaggct
gtgggctgtatgcggaggctggtggtgcagcaggtgggccaggcagactccggagagtac
agctgtgaggccaggggacagagggtctccttccgcctggacgttgcagagcccaaggtg
gtgtttgctaaggagcagctggcacgcaggaagctgcaggcagaggcaggagccagtgcc
acactgagctgcgaggtggcccaggcccagacggaggtgacgtggtacaaggatgggaag
aagctgagctccagctcgaaagtgtgcatggaggccacaggctgcacgcgcaggctggtt
gtgcagcaggcaggccaggcggatgccggggagtatagctgcgaggctgggggccagcgg
ctctccttccatctggatgtcaaagagcccaaggtggtgtttgccaaggaccaggtggca
cacagtgaggtgcaggctgaggcaggggccagtgccacgctgagctgcgaggtggcccag
gcccagacggaggtgatgtggtacaaagatgggaagaagctgagctccagcttgaaagtg
catgtagaggccaagggctgcagacggaggctggtggtgcagcaggcaggcaagacagat
gccggggactacagctgcgaggccaggggccagagggtctccttccgcctgcacatcaca
gagcccaagatgatgtttgcaaaggagcagtcagtgcataatgaggtgcaggctgaggcg
ggggccagtgccatgctgagctgtgaggtggcccaggcccagacggaggtgacgtggtac
aaggatgggaagaagctgagctccagctcaaaagtgggcatggaggtcaaagggtgcaca
cggaggctggtgctgccacaggcgggcaaagcagatgctggggagtacagctgtgaggct
gggggccagagagtctccttccacctgcacatcacagagcccaagggggtgtttgcgaag
gagcagtcagtgcataatgaggtgcaggctgaggcggggaccactgccatgctgagctgt
gaggtggcccagccccagacagaggtgacgtggtacaaggacgggaagaagctgagctcc
agctcaaaagtacgcatggaggtcaagggctgcacacgaaggctggtagtgcagcaggtg
ggcaaagcagatgctggggagtacagctgcgaggctgggggccagagagtctcctttcaa
ctgcacatcacagagcccaaggcagtgtttgccaaggagcagttggtgcataatgaggtg
cggactgaggcaggggccagtgccacactgagctgtgaggtggcccaggcccagacagag
gtgacgtggtacaaggatgggaagaagctgagctccagttcgaaagtgcgcatagaggct
gcgggctgcatgcggcagctggtggtgcagcaggcaggccaggcagatgctggggagtac
acctgtgaggctgggggccagcggctctccttccacctggatgtttcagagcccaaggcg
gtgtttgcaaaggagcagctggcacacaggaaggtgcaggccgaggcgggggccattgcc
acgctgagctgcgaggtggcccaggcccagacagaggtgacgtggtacaaggacgggaag
aagctgagctccagctcgaaagttcgaatggaggctgtgggctgcacacggaggctggtg
gtgcagcaggcatgccaggcggacaccggggagtatagctgcgaggccgggggccagcgg
ctctccttcagcctggacgtggcagagcccaaggtggtgtttgccaaggagcagccagtg
cacagggaggtgcaggcccaggcgggggccagcaccacactcagctgcgaggtggctcag
gcccagacggaggtgatgtggtacaaggacgggaagaagctgagcttcagctcgaaagtg
cgcatggaggctgtgggctgcacacggaggctggtggtgcagcaggcgggccaggcggtc
gccggggagtacagctgcgaggcggggagccagcggctctccttccacctgcacgtggca
gagcccaaggcggtgtttgctaaggagcagccagcacacagggaggtgcaggctgaagca
ggggctagcgccacactgagctgtgaggtggcccaggcccagacagaggtgacttggtac
aaggacgggaagaagctgagctccagcttgaaagtgcatgtggaggctgcaggctgtaca
cggaggctggtggtgcagcaggcaggccaggcagacaccggagagtatagctgtgaggct
gggggccagcagctctccttccgcctgcaagtggcagagcccaaggcggtgtttgccaag
gagcagccagcgagcagggaggtgcaggctgaggcggggaccagtgccacgctgagctgc
gaggtggcccaggcccagacagaggtgacgtggtacaaggacgggaagaaactgagctcc
agctcgaaagtgcgaatggaggccgtgggctgcacacggaggctggtggtgcaggaggca
ggccaggcggacgccggggagtacagctgcaaggccggggatcagcggctgtccttccac
ctgcacgtggcagagcccaaggtggtgtttgccaaggagcagccagcacacagggaggtg
caggctgaggcgggggccagtgccacgctgagctgcgaggtggcccaggcccagacagag
gtgacgtggtacaaggatgggaagaagctgagttccagctcgaaagtgcgcgtggaggcc
gtgggctgcacacggaggctggtggtgcagcaggcgggccaggcagaggccggggagtac
agctgcgaggcagggggtcagcagctctccttccgcctgcaggtggcagagcccaaggcg
gtgtttgccaaggagcaggtggtgtttgccaaggatcagccggtgcacagggaggtgcag
gctgaggcagggaccagcaccatgctgagctgcgaggtggcccaagcccagacggaggtt
atgtggtacaaggacgggaagaagctgagctccagctcgaaaatgcgtgtggaggccgtg
ggctgcacacggaggctggtggtgcaggaggcaggccaggcggacgccggggagtacagc
tgcgaggctgggggccagcggctctccttccatctgcatgtggctgagcccaaggtggtg
tttgccaaggagcagccggcatgcagggaggtgcaggcggaggcgggggccagtgccacg
ctgagctgtgaggtggcccagggccagatggaggtgacatggtacaaggacgggaagaag
ctgagctccagctcaaaagtgcacatggaggccagcggctacacacggaggctggtagtg
cagcaggcgggccaggcggatgctggggagtacagctgtgaggcggggggccaacggctg
tccttccgcctgcacgtggcagagctggagccccaaatttcagagagaccctgccgcagg
gagcctctggtggtcaaggagcatgaagacatcatcctgaccgccacactggccacaccc
tctgcggccacggtgacctggctcaaggatggtgtggagattcgccgcagcaagcggcat
gagacagccagccagggggacacccacaccctgaccgtgcatggcgcccaggttctggac
agcgccatctacagctgccgtgtgggcgcagaggggcaggacttcccagtgcaggtggaa
gaggtggccgccaagttctgccggctgctggagcctgtgtgcggcgagctgggtggcacg
gtgacactggcctgcgagctaagcccagcgtgtgcagaggtggtgtggcgctgcggcaac
acgcagcttcgggtgggcaagcgcttccagatggtggccgaggggcccgtgcgctcactc
actgtgttggggctgcgcgcagaggacgcaggggagtacgtgtgtgagagccgtgatgac
cacaccagtgcgcagctcaccgtcagtgtgccccgagtggtgaagtttatgtctgggctg
agcaccgtggtcgcagaggagggcggcgaggccaccttccagtgcgtggtgtcccccagt
gatgtggcagtcgtgtggttccgggacggtgccctgcttcagcccagcgagaagtttgcc
atatcacagagtggcgccagccacagcctgaccatctcagacctggtgctggaggacgcg
ggccagatcaccgtggaggctgagggcgcctcatcctctgctgccctgagggtccgagag
gcgcctgtgctgttcaaaaagaagctggagccgcagacggtggaggagcggagctcggtg
accctggaggtggagctgacgcggccgtggccggagctgaggtggacacggaacgcgacg
gccctggcgccgggaaagaacgtggagatccacgccgagggcgcccgccaccgcctggtt
ctgcacaacgtaggttttgccgaccgtggcttctttggctgcgagacgccggatgacaag
acacaggccaaactcaccgtggagatgcgccaggtacggctcgtacggggcctgcaggca
gtggaggcacgggagcagggcacggctaccatggaggtgcagctgtcgcatgcggacgtg
gatggcagctggactcgtgacggtctgcggttccagcaggggcccacgtgccacctggct
gtgcggggccccatgcacaccctcacactctcggggctgcggccagaggatagtggcctt
atggtcttcaaggccgaaggagtgcacacgtcggcgcggctcgtggtcaccgagcttccc
gtgagcttcagccgcccgctgcaggacgtggtgaccactgagaaggagaaggttaccctg
gagtgcgagctgtcgcgtcctaatgtggatgtgcgctggctgaaggacggtgtggagctg
cgggcaggcaagacgatggccatcgcagcccagggcgcctgcaggagcctcaccatttac
cggtgcgagttcgcggatcagggagtgtatgtgtgtgatgcccatgatgcccagagctct
gcctccgtgaaggtacaaggccgcaacatccagatcgtgaggcccctggaggatgtggaa
gtgatggagaaggacggtgccaccttctcctgtgaggtctcccacgacgaagtgcctggc
cagtggttctgggagggcagtaaactgcggcccactgacaacgtgcgcatccgccaggaa
ggaaggacatacactctcatctaccggagagtcctggcggaagatgcaggagagatccaa
tttgtagccgaaaatgcagaatcgcgagcccagctccgagtgaaggagctgccagtgacc
ctcgtgcgcccgctgcgggacaagattgccatggagaagcaccgcggtgtgctggagtgt
caggtgtcccgggccagcgcccaggtgcggtggttcaagggcagtcaggagctgcagccc
gggcccaagtacgagctggtcagtgatggcctctaccgcaagctgatcatcagtgatgtc
cacgcagaggacgaggacacctacacctgtgacgccggtgatgtcaagaccagtgcacag
ttcttcgtggaagagcaatccatcaccattgtgcggggtctgcaggacgtgacagtgatg
gagcccgctcctgcctggtttgagtgtgagacctccatcccctcagtgcggccacctaag
tggctcctggggaagacggtgttgcaggctggggggaacgtgggcctggagcaggagggc
acggtgcaccggctgatgctgcggcggacctgctccaccatgaccgggcccgtgcacttc
accgttggcaagtcgcgctcctctgcccgcctggtggtctcagacatccccgtagtcctc
acacggccgttggagcccaagacagggcgtgagctgcagtcagtggtcctgtcctgcgac
ttccggccagcccccaaggctgtgcagtggtacaaggatgacacgcccctgtctccctct
gagaagtttaagatgagcctggagggtcagatggctgagctgcgcatcctccggctcatg
cctgctgatgctggtgtctaccggtgccaggcgggcagtgcccacagcagcactgaggtc
actgtggaagcgcgggaggtgacagtgacagggccgctacaggatgcagaggccacggag
gagggctgggccagcttctcctgtgagctgtcccacgaggatgaggaggtcgagtggtcg
ctcaacgggatgcccctgtacaacgacagcttccatgagatctcacacaagggccggcgc
cacacgctggtactgaagagcatccagcgggctgatgcgggcatagtacgcgcctcctcc
ctgaaggtgtcgacctctgcccgcctggaggtccgagtgaagccggtggtgttcctgaag
gcgctggatgacctgtccgcagaggagcgcggcaccctggccctgcagtgtgaagtctct
gaccccgaggcccatgtggtgtggcgcaaagatggcgtgcagctgggccccagtgacaag
tatgacttcctgcacacggcgggcacgcgggggctcgtggtgcatgacgtgagccctgaa
gacgccggcctgtacacctgccacgtgggctccgaggagacccgggcccgggtccgcgtg
cacgatctgcacgtgggcatcaccaagaggctgaagacaatggaggtgctggaaggggaa
agctgcagctttgagtgcgtcctgtcccacgagagtgccagcgacccggccatgtggaca
gtcggtgggaagacagtgggcagctccagccgcttccaggccacacgtcagggccgaaaa
tacatcctggtggtccgggaggctgcaccaagtgatgccggggaggtggtcttctctgtg
cggggcctcacctccaaggcctcactcattgtcagagagaggccggccgccatcatcaag
cccctggaagaccagtgggtggcgccaggggaggacgtggagctgcgctgtgagctgtca
cgggcgggaacgcccgtgcactggctgaaggacaggaaggccatccgcaagagccagaag
tatgatgtggtctgcgagggcacgatggccatgctggtcatccgcggggcctcgctcaag
gacgcgggcgagtacacgtgtgaggtggaggcttccaagagcacagccagcctccatgtg
gaagaaaaagcaaactgcttcacagaggagctgaccaatctgcaggtggaggagaaaggc
acagctgtgttcacgtgcaagacggagcaccccgcggccacagtgacctggcgcaagggc
ctcttggagctacgggcctcagggaagcaccagcccagccaggagggcctgaccctgcgg
ctcaccatcagtgccctggagaaggcagacagcgacacctatacctgcgacattggccag
gcccagtcccgggcccagctcctagtgcaaggccggagagtgcacatcatcgaggacctg
gaggatgtggatgtgcaggagggctcctcggccaccttccgttgccggatctccccggcc
aactacgagcctgtgcactggttcctggacaagacacccctgcatgccaacgagctcaat
gagatcgatgcccagcccgggggctaccacgtgctgaccctgcggcagctggcgctcaag
gactcgggcaccatctactttgaggcgggtgaccagcgggcctcggccgccctgcgggtc
actgagaagccaagcgtcttctcccgggagctcacagatgccaccatcacagagggtgag
gacttgaccctggtgtgcgagaccagcacctgcgacattcctgtgtgctggaccaaggat
gggaagaccctgcgggggtctgcccggtgccagctgagccatgagggccaccgggcccag
ctgctcatcactggggccaccctgcaggacagtggacgctacaagtgtgaggctgggggc
gcctgcagcagctccattgtcagggtgcatgcgcggccagtgcggttccaggaggccctg
aaggacctggaggtgctggagggtggtgctgccacactgcgctgtgtgctgtcatctgtg
gctgcgcccgtgaagtggtgctatggaaacaacgtcctgaggccaggtgacaaatacagc
ctacgccaggagggtgccatgctggagctggtggtccggaacctccggccgcaggacagc
gggcggtactcatgctccttcggggaccagactacttctgccaccctcacagtgactgcc
ctgcctgcccagttcatcgggaaactgagaaacaaggaggccacagaaggggccacggcc
acgctgcggtgtgagctgagcaaggcagcccctgtggagtggagaaaggggtccgagacc
ctcagagatggggacagatactgtctgaggcaggacggggccatgtgtgagctgcagatc
cgtggcctggccatggtggatgccgcggagtactcgtgtgtgtgtggagaggagaggacc
tcagcctcactcaccatcaggcccatgcctgcccacttcataggaagactgagacaccaa
gagagcatagaaggggccacagccacgctgcggtgtgagctgagcaaggcggcccccgtg
gagtggaggaaggggcgtgagagcctcagagatggggacagacatagcctgaggcaggac
ggggctgtgtgcgagctgcagatctgtggcctggctgtggcagatgctggggagtactcc
tgtgtgtgtggggaggagaggacctctgccactctcaccgtgaaggccctgccagccaag
ttcacagagggtctgaggaatgaagaggccgtggaaggggccacagccatgttgtggtgt
gaactgagcaaggtggcccctgtggagtggaggaaggggcccgagaacctcagagatggg
gacagatacatcctgaggcaggaggggaccaggtgtgagctgcagatctgtggcctggcc
atggcggacgccggggagtacttgtgtgtgtgcgggcaggagaggacctcagccacgctc
accatcagggctctgcctgccaggttcatagaagatgtgaaaaaccaggaggccagagaa
ggggccacagctgtgctgcagtgtgagctgaacagtgcagcccctgtggagtggagaaag
gggtctgagacccttagagatggggacagatacagcctgaggcaggacgggactaaatgt
gagctgcagattcgtggcctggccatggcagacactggggagtactcgtgcgtgtgcggg
caggagaggacctcggctatgctcaccgtcagggctctacccatcaagttcacagagggt
ctgaggaacgaagaggccacagaaggggcaacagccgtgctgcggtgtgagctgagcaag
atggcccccgtggagtggtggaaggggcatgagaccctcagagatggagacagacacagc
ctgaggcaggacggggccaggtgtgagctgcagatccgcggcctcgtggcagaggacgct
ggggagtacctgtgcatgtgcgggaaggagaggacctcagccatgctcaccgtcagggcc
atgccttccaagttcatagagggtctgaggaatgaagaggccacagaaggggacacggcc
acgctgtggtgtgagctgagcaaggcggcaccggtggagtggaggaaggggcatgagacc
ctcagagatggggacagacacagcctgaggcaggacgggtccaggtgtgagctgcagatc
cgtggcctggctgtggtggatgccggggagtactcgtgtgtgtgcgggcaggagaggacc
tcagccacactcactgtcagggccctgcctgccagattcatagaagatgtgaaaaaccag
gaggccagagaaggggccacggccgtgctgcaatgtgagctgagcaaggcggcccccgtg
gagtggaggaaggggtctgagaccctcagaggtggggacagatacagcctgaggcaggat
gggaccagatgtgagctgcagattcatggcctgtctgtggcagacactggggagtactcg
tgtgtgtgcgggcaggagaggacctcggccacactcaccgtcagggccctgcctgcacga
ttcactcaagatctgaagaccaaggaggcctcagaaggggccacagctacactgcagtgt
gagctgagcaaggtggcccctgtggaatggaagaagggtcctgagaccctcagagatggg
ggcagatacagcctgaagcaggatgggacgaggtgtgagctgcagatccatgacctgtct
gtggcggatgctggggaatactcatgcatgtgtggacaagagaggacctcggccacgctc
actgtcagggccctgcctgccaggttcacagagggtctgaggaatgaagaggccatggaa
ggggccacagccacactgcaatgtgagctgagcaaggcagcccctgtggagtggaggaaa
ggccttgaggctctcagagatggggacaaatacagcctgagacaagacggggctgtgtgt
gagctgcagattcatggcctggctatggcagataacggggtgtactcatgtgtgtgtggg
caggagaggacctcagctacactcactgtcagggccctgcctgccagattcatagaggat
atgagaaaccagaaggccacagaaggggctacagtcacattgcaatgtaagctgagaaag
gcggcccccgtggagtggagaaaggggcccaacaccctcaaagatggggacaggtacagc
ctgaagcaggatgggaccagttgtgagctgcagattcgtggcctggtcatagcagatgct
ggagaatactcgtgcatatgtgagcaggagaggacctcggccacgctcactgtcagggcc
ctgccggccagattcatagaagatgtgagaaatcacgaggccacagaaggggccacagct
gtgctgcagtgtgagctgagcaaggcggcccccgtggagtggcggaaggggtctgagacc
ctcagagatggggacagatatagcctgaggcaggacgggacgaggtgtgagctgcagatt
cgtggcctggctgtggaggacactggagagtatttgtgtgtgtgcgggcaggagagaacc
tcagctacactcactgtcagggccctgccagccagattcatagacaacatgacaaaccag
gaggccagagaaggggccacggccacactgcactgtgaactgagcaaggtggcccctgtg
gagtggaggaagggacctgaaaccctccgagatggggacagacacagcctgaggcaggat
gggaccaggtgtgagctgcagattcgtggcctgtctgtggcagatgccggggagtactcg
tgcgtgtgtgggcaggagaggacctcagccacactcacgatcagggccctgcccgccaag
ttcacaaagggtctgaggaatgaagaggccacagaaggggccacggctatgttgcagtgt
gagctgagcaaggtggcccctgttgagtggaggaagggacctgaaaccctcagagatggg
gacagatacaacctgaggcaggatgggaccagatgtgagctgcagattcatggcctgtcc
gtggcagacactggggagtactcatgtgtatgtggtcaggagaagacgtcggccactctc
actgtcaaggccccacagccagtgttccgggagccgctgcagagtctgcaggcggaggag
ggctccacggccaccctgcagtgtgagctgtctgagcccactgctacagtggtctggagc
aagggtggcctgcagctgcaggccaatgggcgccgggagccacggcttcagggctgcacc
gcggagctggtgttacaggacctacaacgtgaagacactggcgaatacacttgcacctgt
ggctcccaggccaccagtgccaccctcactgtcacagctgcgcctgtgcggttcctccga
gagctgcagcaccaggaggtggatgagggaggcaccgcacacttatgctgcgagctgagc
cgggcgggtgcgagcgtggagtggcgcaagggctccctacagctcttcccttgtgccaag
taccagatggtgcaggatggtgcagctgcagagctgctggtacgcggagtggagcaggag
gatgcgggtgactacacgtgtgacacgggccacacgcagagcatggccagcctctctgtc
cgtgtccccaggcccaagttcaagacccggcttcagagtctggagcaggagacaggtgac
atagcccggctgtgctgtcagctgagtgatgcagagtcgggggccgtggtgcaatggctc
aaggagggcgtggagctgcatgcgggccccaagtacgagatgcggagccagggggccacg
cgggagctgctgatccaccaactggaggccaaggacacgggcgagtatgcctgtgtgaca
ggcggccagaaaaccgctgcctccctcagggtcacagagcctgaggtgaccattgtacgg
gggctggttgatgcggaggtgacggccgatgaggatgttgagttcagctgtgaggtgtcc
agggctggagccacaggcgtgcagtggtgcctacagggcctgccactgcaaagcaatgag
gtgacagaggtggctgtgcgggatggccgcatccacaccctgcggctgaagggcgtgacg
cccgaggacgctggcactgtctccttccatttgggaaaccatgcttcctctgcccagctc
accgtcagagctcctgaggtgaccatcctggagcccctgcaggacgtgcagctcagtgag
ggccaggatgccagcttccagtgccggctatccagagcttcaggccaggaggcccgctgg
gctttaggaggggtgcccctgcaggccaacgagatgaatgacatcactgtggagcagggc
acactccacctgctcaccctgcacaaggtgacccttgaggatgctggaactgtcagtttc
cacgtgggcacgtgtagctctgaggcccagctgaaagtcacagccaagaacacggtggtg
cgggggctggagaatgtggaggcgctggagggcggcgaggcgctgttcgagtgccagctg
tcccagcccgaggtggccgcccacacctggctgctggacgacgaacccgtgcacacctcg
gagaacgccgaggtggtcttcttcgagaacggcctgcgccacctgctgctgctcaaaaac
ttgcggccacaagacagctgccgggtgaccttcctggctggggatatggtgacgtccgca
ttcctcacggtccgaggctggcgcctggagatcctggagcctctgaaaaacgcggcggtc
cgggccggcgcacaggcatgcttcacctgcacgctcagcgaggcggtgcccgtgggagag
gcgtcctggtacatcaatggcgcggcagtgcagccggatgacagcgactggactgtcacc
gccgacggcagtcaccacgccctactgctgcgcagcgcccagccccaccacgccggggag
gtcaccttcgcttgccgcgacgccgtggcctctgcgcggctcaccgtgctgggcctccct
gatcccccagaggatgctgaggtggtggctcgcagcagccacactgtgacactgtcttgg
gcagctcccatgagtgatggaggcggtggtctctgtggctaccgcgtggaggtgaaggag
ggggccacaggccagtggcggctgtgccacgagctggtgcctggacccgagtgtgtggtg
gatggcctggcccccggggagacctaccgcttccgtgtggcagctgtgggccctgtgggt
gctggggaaccggttcacctgccccagacagtgcggcttgcagagccaccgaagcctgtg
cctccccagccctcagcccctgagagccggcaggtggcagctggtgaagatgtctctctg
gagcttgaggtggtggctgaggctggtgaggtcatctggcacaagggaatggagcgcatc
cagcccggtgggcggttcgaggtggtctcccagggtcggcaacagatgctggtgatcaag
ggcttcacggcagaagaccagggcgagtaccactgtggcctggctcagggctccatctgc
cctgcggctgccaccttccaggtggcactgagcccagcctctgtggatgaggcccctcag
cccagcttgccccccgaggcagcccaggagggtgacctgcacctactgtgggaggccctg
gctcggaaacgtcgcatgagccgtgagcccacgctggactccattagcgagctgccagag
gaggacggccgctcgcagcgcctgccacaggaggcagaggaggtggcacctgatctctct
gaaggctactccacggccgatgagctggcccgcactggagatgctgacctctcacacacc
agctctgatgatgagtcccgggcaggcaccccttccctggtcacctacctcaagaaggct
gggaggccaggcacctcaccactggccagcaaggttggggccccagcagccccctctgtg
aagccacagcagcagcaggagccactggctgctgtgcgcccaccactgggagacctgagc
accaaagacctgggtgatccctcaatggacaaggcagctgtgaagatccaggctgccttt
aagggctacaaggtccggaaggagatgaagcagcaggaagggcccatgttctcccacaca
tttggggacaccgaggcacaggtgggggatgccctgcggctggagtgtgtcgtggccagc
aaggcagatgtgcgagcccgctggctgaaggatggtgtggagctgaccgatgggcggcac
catcacatcgaccagcttggggatggcacctgctctctgctgatcactggcctggaccgt
gctgatgctggctgctacacctgtcaggtgagcaacaagtttggccaggtgacccacagt
gcctgtgtggtggtcagtgggtcagagagtgaagccgagagctcctctgggggtgagctg
gacgatgccttccgccgggctgcccgtcggctgcaccggctcttccgcaccaaaagtccg
gctgaagtttcagatgaggagctcttcctgagtgcagacgagggccctgcagagccagag
gagcccgcggactggcagacataccgcgaagatgagcatttcatctgcatccgttttgag
gcgctcactgaggcccgccaggcggtaactcgcttccaggagatgtttgccacactgggc
attggggtggagatcaagctggtggaacaggggcctcggagggtagagatgtgcatcagc
aaagagactcctgcccctgtggtgcctccagagccattgcccagcctactgacttctgac
gctgccccagtgttcctgactgagttgcagaaccaagaagtgcaggatgggtatcctgtg
agctttgactgcgtggtgacaggtcagcccatgcccagtgtgcgctggttcaaggatggg
aagttgttggaggaggatgatcactacatgattaatgaagaccaacagggtggccatcag
ctcatcatcacagccgtggtgccagcagacatgggcgtctaccgctgcctggccgagaac
agcatgggtgtctcctccaccaaggctgagctccgtgtggacttgacaagcacagactat
gacactgcagcagatgccacggagtcctcatcctacttcagtgcccaaggctacctgtcc
agccgggagcaggagggaacagagtccaccactgatgagggccagctgccccaggtggtg
gaggagctgagagacctccaggtggcccctggcacacgcctggccaagttccagctcaag
gtgaaaggctaccctgctcccagattatactggttcaaagatggccagcccctgaccgca
tctgcccacatccgcatgactgacaagaagatcctgcacaccctggagatcatctccgtc
acccgggaggactctggccagtatgcagcctatatcagcaatgccatgggtgctgcctac
tcgtctgcccggctgctggttcgaggccctgatgagccagaagagaagcctgcatcagat
gtgcatgagcagctggtgccgccccgaatgctggagaggttcacccccaagaaagtgaag
aaaggctccagcatcaccttctctgtgaaggtagaaggacgcccggtgcccaccgtgcac
tggctcagggaggaggctgagagaggcgtgctgtggattggccctgacacaccgggctac
accgtggccagctctgcgcagcagcacagcctggtcctgctggacgtgggccggcagcac
cagggcacctacacatgcattgccagcaacgctgccggccaggccctctgctccgccagc
ctgcacgtctcgggcctgcctaaggtggaggagcaggagaaagtgaaggaagcgctgatt
tccactttcctgcaggggaccacacaagccatctcagcacaggggttggaaactgcgagt
tttgctgaccttggtgggcagaggaaagaagagcctctggctgccaaggaggccctcggc
cacctgtccctcgctgaggtgggcacagaggagttcctgcagaaactgacctcccagatc
actgagatggtatcggccaagatcacgcaggccaagctgcaggtgcccggaggtgacagt
gatgaggactccaagacaccatctgcatccccccgccatggccgatcacggccatcctcc
agcatccaggagtcttcctcagagtcagaggacggcgatgcccgaggcgagatctttgac
atctacgtggtcaccgctgactacctgcccctaggggctgagcaggatgccatcacgctg
cgggaaggccagtatgtggaggtcctggatgcagcccacccactgcgctggcttgtccgc
accaagcccaccaagtccagcccctcacggcagggctgggtgtcaccagcctacctggac
aggaggctcaagctgtcacctgagtggggggccgctgaggcccctgagttccctggggag
gctgtgtctgaagacgaatacaaggcaaggctgagctctgtgatccaggagctgctgagt
tctgagcaggccttcgtggaggagctgcagttcctgcagagccaccacctgcagcacctg
gagcgctgcccccacgtgcccatagctgtggccggccagaaggcagtcatcttccgcaat
gtgcgggacatcggccgcttccacagcagcttcctgcaggagttgcagcagtgcgacacg
gacgacgacgtggccatgtgcttcatcaagaaccaggcggcctttgagcagtacctggag
ttcctggtggggcgtgtgcaggctgagtcggtggtcgtcagcacggccatccaggagttc
tacaagaaatacgcggaggaggccctgttggcaggggacccctctcagcccccgccacca
cctctgcagcactacctggagcagccagtggagcgggtgcagcgctaccaggccttgctg
aaggagctgatccgcaacaaggcgcggaacagacagaactgcgcgctgctggagcaggcc
tatgccgtggtgtctgccctgccacagcgcgctgagaacaagctgcacgtgtccctcatg
gagaactacccaggcaccctgcaggccctgggcgagcccatccgccagggccacttcatc
gtgtgggagggtgcaccgggggcccgcatgccctggaagggccacaaccgtcacgtgttc
ctcttccgcaaccacctggtaatctgcaagccccggcgagactcccgcaccgataccgtc
agctacgtgttccggaacatgatgaagctgagcagcatcgacctgaacgaccaggtggag
ggggatgaccgcgccttcgaggtgtggcaggagcgggaggactcggtgcgcaagtacctg
ctgcaggcacggacagccattatcaagagctcgtgggtgaaggagatctgtggcatccag
cagcgtctggccctgcctgtgtggcggcccccggactttgaagaggagctggccgactgc
acagccgagctgggtgagacagtcaagctggcctgccgcgtgacgggcacacccaagcct
gtcatcagctggtacaaagatgggaaagcagtgcaggtggacccccaccacatcctcatt
gaagaccctgatggctcgtgtgcactcatcctggacagcctgaccggtgtggactctggc
cagtacatgtgcttcgcggccagcgccgctggcaactgcagtaccctgggcaagatcctg
gtgcaagtcccaccacggttcgtgaacaaggtccgggcctcaccctttgtggagggagag
gacgcccagttcacctgcaccatcgaaggcgccccgtacccgcagatcaggtggtacaag
gacggggccctgctgaccactggcaacaagttccagacactgagtgagcctcgcagcggc
ctgctagtgctggtgatccgggcggccagcaaggaggacctggggctctacgagtgtgag
ctggtgaaccggctgggctccgcgcgggctagtgcggagctgcgcattcagagccccatg
ctgcaggcccaggagcagtgtcacagggagcagctcgtggctgcagtggaagacaccacc
ctggagcgagcggaccaggaggtcacatctgtcctgaagagactgctgggccccaaggcg
ccaggcccctccacaggggacctcactggccctggcccctgccccaggggggcacccgca
ctccaggaaaccggctcccagcccccagtcaccggaacttcggaggcacctgccgtgccc
ccgagggtgccacagcccctcctccacgaaggcccagagcaggagccggaggccattgcc
agagcccaggaatggactgtgcccattcggatggagggtgcagcctggcccggggcaggc
acaggggagctgctctgggacgtccacagccacgtggtcagagagaccacacagaggacc
tacacataccaggccatcgacacgcacaccgcacggcccccatccatgcaggtaaccatc
gaggatgtgcaggcacagacaggcggaacggcccaattcgaggctatcattgagggcgac
ccacagccctcggtgacctggtacaaggacagcgtccagctggtggacagcacccggctt
agccagcagcaagaaggcaccacatactccctggtgctgaggcatgtggcctcgaaggat
gccggcgtttacacctgcctggcccaaaacactggtggccaggtgctctgcaaggcagag
ctgctggtgcttgggggggacaatgagccggactcagagaagcaaagccaccggaggaag
ctgcactccttctatgaggtcaaggaggagattggaaggggcgtgtttggcttcgtaaaa
agagtgcagcacaaaggaaacaagatcttgtgcgctgccaagttcatccccctacggagc
agaactcgggcccaggcatacagggagcgagacatcctggccgcgctgagccacccgctg
gtcacggggctgctggaccagtttgagacccgcaagaccctcatcctcatcctggagctg
tgctcatccgaggagctgctggaccgcctgtacaggaagggcgtggtgacggaggccgag
gtcaaggtctacatccagcagctggtggaggggctgcactacctgcacagccatggcgtt
ctccacctggacataaagccctctaacatcctgatggtgcatcctgcccgggaagacatt
aaaatctgcgactttggctttgcccagaacatcaccccagcagagctgcagttcagccag
tacggctcccctgagttcgtctcccccgagatcatccagcagaaccctgtgagcgaagcc
tccgacatttgggccatgggtgtcatctcctacctcagcctgacctgctcatccccattt
gccggcgagagtgaccgtgccaccctcctgaacgtcctggaggggcgcgtgtcatggagc
agccccatggctgcccacctcagcgaagacgccaaagacttcatcaaggctacgctgcag
agagcccctcaggcccggcctagtgcggcccagtgcctctcccacccctggttcctgaaa
tccatgcctgcggaggaggcccacttcatcaacaccaagcagctcaagttcctcctggcc
cgaagtcgctggcagcgttccctgatgagctacaagtccatcctggtgatgcgctccatc
cctgagctgctgcggggcccacccgacagcccctccctcggcgtagcccggcacctctgc
agggacactggtggctcctccagttcctcctcctcctctgacaacgagctcgccccattt
gcccgggctaagtcactgccaccctccccggtgacacactcaccactgctgcacccccgg
ggcttcctgcggccctcggccagcctgcctgaggaagccgaggccagtgagcgctccacc
gaggccccagctccgcctgcatctcccgagggtgccgggccaccggccgcccagggctgc
gtgccccggcacagcgtcatccgcagcctgttctaccaccaggcgggtgagagccctgag
cacggggccctggccccggggagcaggcggcacccggcccggcggcggcacctgctgaag
ggcgggtacattgcgggggcgctgccaggcctgcgcgagccactgatggagcaccgcgtg
ctggaggaggaggccgccagggaggagcaggccaccctcctggccaaagccccctcattc
gagactgccctccggctgcctgcctctggcacccacttggcccctggccacagccactcc
ctggaacatgactctccgagcaccccccgcccctcctcggaggcctgcggtgaggcacag
cgactgccttcagccccctccgggggggcccctatcagggacatggggcaccctcagggc
tccaagcagcttccatccactggtggccacccaggcactgctcagccagagaggccatcc
ccggacagcccttgggggcagccagcccctttctgccaccccaagcagggttctgccccc
caggagggctgcagcccccacccagcagttgccccatgccctcctggctccttccctcca
ggatcttgcaaagaggcccccttagtaccctcaagccccttcttgggacagccccaggca
ccccctgcccctgccaaagcaagccccccattggactctaagatggggcctggagacatc
tctcttcctgggaggccaaaacccggcccctgcagttccccagggtcagcctcccaggcg
agctcttcccaagtgagctccctcagggtgggctcctcccaggtgggcacagagcctggc
ccctccctggatgcggagggctggacccaggaggctgaggatctgtccgactccacaccc
accttgcagcggcctcaggaacaggcgaccatgcgcaagttctccctgggtggtcgcggg
ggctacgcaggcgtggctggctatggcacctttgcctttggtggagatgcagggggcatg
ctggggcaggggcccatgtgggccaggatagcctgggctgtgtcccagtcagaggaggag
gagcaggaggaggccagggctgagtcccagtcggaggagcagcaggaggccagggctgag
agcccactgccccaggtcagtgcaaggcctgtgcctgaggtcggcagggctcccaccagg
agctctccagagcccaccccatgggaggacatcgggcaggtctccctggtgcagatccgg
gacctgtcaggtgatgcggaggcggccgacacaatatccctggacatttccgaggtggac
cccgcctacctcaacctctcagacctgtacgatatcaagtacctcccattcgagtttatg
atcttcaggaaagtccccaagtccgctcagccagagccgccctcccccatggctgaggag
gagctggccgagttcccggagcccacgtggccctggccaggtgaactgggcccccacgca
ggcctggagatcacagaggagtcagaggatgtggacgcgctgctggcagaggctgccgtg
ggcaggaagcgcaagtggtcctcgccgtcacgcagcctcttccacttccctgggaggcac
ctgccgctggatgagcctgcagagctggggctgcgtgagagagtgaaggcctccgtggag
cacatctcccggatcctgaagggcaggccggaaggtctggagaaggaggggccccccagg
aagaagccaggccttgcttccttccggctctcaggtctgaagagctgggaccgagcgccg
acattcctaagggagctctcagatgagactgtggtcctgggccagtcagtgacactggcc
tgccaggtgtcagcccagccagctgcccaggccacctggagcaaagacggagcccccctg
gagagcagcagccgtgtcctcatctctgccaccctcaagaacttccagcttctgaccatc
ctggtggtggtggctgaggacctgggtgtgtacacctgcagcgtgagcaatgcgctgggg
acagtgaccaccacgggcgtcctccggaaggcagagcgcccctcatcttcgccatgcccg
gatatcggggaggtgtacgcggatggggtgctgctggtctggaagcccgtggaatcctac
ggccctgtgacctacattgtgcagtgcagcctagaaggcggcagctggaccacactggcc
tccgacatctttgactgctgctacctgaccagcaagctctcccggggtggcacctacacc
ttccgcacggcatgtgtcagcaaggcaggaatgggtccctacagcagcccctcggagcaa
gtcctcctgggagggcccagccacctggcctctgaggaggagagccaggggcggtcagcc
caacccctgcccagcacaaagaccttcgcattccagacacagatccagaggggccgcttc
agcgtggtgcggcaatgctgggagaaggccagcgggcgggcgctggccgccaagatcatc
ccctaccaccccaaggacaagacagcagtgctgcgcgaatacgaggccctcaagggcctg
cgccacccgcacctggcccagctgcacgcagcctacctcagcccccggcacctggtgctc
atcttggagctgtgctctgggcccgagctgctcccctgcctggccgagagggcctcctac
tcagaatccgaggtgaaggactacctgtggcagatgttgagtgccacccagtacctgcac
aaccagcacatcctgcacctggacctgaggtccgagaacatgatcatcaccgaatacaac
ctgctcaaggtcgtggacctgggcaatgcacagagcctcagccaggagaaggtgctgccc
tcagacaagttcaaggactacctagagaccatggctccagagctcctggagggccagggg
gctgttccacagacagacatctgggccatcggtgtgacagccttcatcatgctgagcgcc
gagtacccggtgagcagcgagggtgcacgcgacctgcagagaggactgcgcaaggggctg
gtccggctgagccgctgctacgcggggctgtccgggggcgccgtggccttcctgcgcagc
actctgtgcgcccagccctggggccggccctgcgcgtccagctgcctgcagtgcccgtgg
ctaacagaggagggcccggcctgttcgcggcccgcgcccgtgaccttccctaccgcgcgg
ctgcgcgtcttcgtgcgcaatcgcgagaagagacgcgcgctgctgtacaagaggcacaac
ctggcccaggtgcgctga

KEGG   Homo sapiens (human): 2023
Entry
2023              CDS       T01001                                 
Symbol
ENO1, ENO1-IT1, ENO1L1, HEL-S-17, MPB1, NNE, PPH
Name
(RefSeq) enolase 1
  KO
K01689  enolase 1/2/3 [EC:4.2.1.11]
Organism
hsa  Homo sapiens (human)
Pathway
hsa00010  Glycolysis / Gluconeogenesis
hsa01100  Metabolic pathways
hsa01200  Carbon metabolism
hsa01230  Biosynthesis of amino acids
hsa03018  RNA degradation
hsa04066  HIF-1 signaling pathway
hsa04820  Cytoskeleton in muscle cells
Module
hsa_M00001  Glycolysis (Embden-Meyerhof pathway), glucose => pyruvate
hsa_M00002  Glycolysis, core module involving three-carbon compounds
hsa_M00003  Gluconeogenesis, oxaloacetate => fructose-6P
Network
nt06017  Glycogen metabolism
nt06539  Cytoskeleton in muscle cells
  Element
N00731  Glycolysis
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09100 Metabolism
  09101 Carbohydrate metabolism
   00010 Glycolysis / Gluconeogenesis
    2023 (ENO1)
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   03018 RNA degradation
    2023 (ENO1)
 09130 Environmental Information Processing
  09132 Signal transduction
   04066 HIF-1 signaling pathway
    2023 (ENO1)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    2023 (ENO1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:hsa03019]
    2023 (ENO1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    2023 (ENO1)
Enzymes [BR:hsa01000]
 4. Lyases
  4.2  Carbon-oxygen lyases
   4.2.1  Hydro-lyases
    4.2.1.11  phosphopyruvate hydratase
     2023 (ENO1)
Messenger RNA biogenesis [BR:hsa03019]
 Prokaryotic type
  Bacterial mRNA degradation factors
   RNA degradosome components
    Other RNA degradosome components
     2023 (ENO1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Proteins found in most exosomes
   2023 (ENO1)
SSDB
Motif
Pfam: Enolase_C Enolase_N MR_MLE_C MAAL_C
Other DBs
NCBI-GeneID: 2023
NCBI-ProteinID: NP_001419
OMIM: 172430
HGNC: 3350
Ensembl: ENSG00000074800
UniProt: P06733 A0A024R4F1
Structure
LinkDB
Position
1:complement(8861000..8878686)
AA seq 434 aa
MSILKIHAREIFDSRGNPTVEVDLFTSKGLFRAAVPSGASTGIYEALELRDNDKTRYMGK
GVSKAVEHINKTIAPALVSKKLNVTEQEKIDKLMIEMDGTENKSKFGANAILGVSLAVCK
AGAVEKGVPLYRHIADLAGNSEVILPVPAFNVINGGSHAGNKLAMQEFMILPVGAANFRE
AMRIGAEVYHNLKNVIKEKYGKDATNVGDEGGFAPNILENKEGLELLKTAIGKAGYTDKV
VIGMDVAASEFFRSGKYDLDFKSPDDPSRYISPDQLADLYKSFIKDYPVVSIEDPFDQDD
WGAWQKFTASAGIQVVGDDLTVTNPKRIAKAVNEKSCNCLLLKVNQIGSVTESLQACKLA
QANGWGVMVSHRSGETEDTFIADLVVGLCTGQIKTGAPCRSERLAKYNQLLRIEEELGSK
AKFAGRNFRNPLAK
NT seq 1305 nt   +upstreamnt  +downstreamnt
atgtctattctcaagatccatgccagggagatctttgactctcgcgggaatcccactgtt
gaggttgatctcttcacctcaaaaggtctcttcagagctgctgtgcccagtggtgcttca
actggtatctatgaggccctagagctccgggacaatgataagactcgctatatggggaag
ggtgtctcaaaggctgttgagcacatcaataaaactattgcgcctgccctggttagcaag
aaactgaacgtcacagaacaagagaagattgacaaactgatgatcgagatggatggaaca
gaaaataaatctaagtttggtgcgaacgccattctgggggtgtcccttgccgtctgcaaa
gctggtgccgttgagaagggggtccccctgtaccgccacatcgctgacttggctggcaac
tctgaagtcatcctgccagtcccggcgttcaatgtcatcaatggcggttctcatgctggc
aacaagctggccatgcaggagttcatgatcctcccagtcggtgcagcaaacttcagggaa
gccatgcgcattggagcagaggtttaccacaacctgaagaatgtcatcaaggagaaatat
gggaaagatgccaccaatgtgggggatgaaggcgggtttgctcccaacatcctggagaat
aaagaaggcctggagctgctgaagactgctattgggaaagctggctacactgataaggtg
gtcatcggcatggacgtagcggcctccgagttcttcaggtctgggaagtatgacctggac
ttcaagtctcccgatgaccccagcaggtacatctcgcctgaccagctggctgacctgtac
aagtccttcatcaaggactacccagtggtgtctatcgaagatccctttgaccaggatgac
tggggagcttggcagaagttcacagccagtgcaggaatccaggtagtgggggatgatctc
acagtgaccaacccaaagaggatcgccaaggccgtgaacgagaagtcctgcaactgcctc
ctgctcaaagtcaaccagattggctccgtgaccgagtctcttcaggcgtgcaagctggcc
caggccaatggttggggcgtcatggtgtctcatcgttcgggggagactgaagataccttc
atcgctgacctggttgtggggctgtgcactgggcagatcaagactggtgccccttgccga
tctgagcgcttggccaagtacaaccagctcctcagaattgaagaggagctgggcagcaag
gctaagtttgccggcaggaacttcagaaaccccttggccaagtaa

KEGG   Homo sapiens (human): 2027
Entry
2027              CDS       T01001                                 
Symbol
ENO3, GSD13, MSE
Name
(RefSeq) enolase 3
  KO
K01689  enolase 1/2/3 [EC:4.2.1.11]
Organism
hsa  Homo sapiens (human)
Pathway
hsa00010  Glycolysis / Gluconeogenesis
hsa01100  Metabolic pathways
hsa01200  Carbon metabolism
hsa01230  Biosynthesis of amino acids
hsa03018  RNA degradation
hsa04066  HIF-1 signaling pathway
hsa04820  Cytoskeleton in muscle cells
Module
hsa_M00001  Glycolysis (Embden-Meyerhof pathway), glucose => pyruvate
hsa_M00002  Glycolysis, core module involving three-carbon compounds
hsa_M00003  Gluconeogenesis, oxaloacetate => fructose-6P
Network
nt06017  Glycogen metabolism
nt06539  Cytoskeleton in muscle cells
  Element
N00731  Glycolysis
N01821  Sarcomere, M-band
Disease
H00069  Glycogen storage disease
H01762  Muscle glycogen storage disease
H01953  Glycogen storage disease type XIII
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09100 Metabolism
  09101 Carbohydrate metabolism
   00010 Glycolysis / Gluconeogenesis
    2027 (ENO3)
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   03018 RNA degradation
    2027 (ENO3)
 09130 Environmental Information Processing
  09132 Signal transduction
   04066 HIF-1 signaling pathway
    2027 (ENO3)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    2027 (ENO3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:hsa03019]
    2027 (ENO3)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    2027 (ENO3)
Enzymes [BR:hsa01000]
 4. Lyases
  4.2  Carbon-oxygen lyases
   4.2.1  Hydro-lyases
    4.2.1.11  phosphopyruvate hydratase
     2027 (ENO3)
Messenger RNA biogenesis [BR:hsa03019]
 Prokaryotic type
  Bacterial mRNA degradation factors
   RNA degradosome components
    Other RNA degradosome components
     2027 (ENO3)
Exosome [BR:hsa04147]
 Exosomal proteins
  Proteins found in most exosomes
   2027 (ENO3)
SSDB
Motif
Pfam: Enolase_C Enolase_N MR_MLE_C MAAL_C
Other DBs
NCBI-GeneID: 2027
NCBI-ProteinID: NP_001361452
OMIM: 131370
HGNC: 3354
Ensembl: ENSG00000108515
UniProt: P13929
Structure
LinkDB
Position
17:4948710..4957129
AA seq 434 aa
MAMQKIFAREILDSRGNPTVEVDLHTAKGRFRAAVPSGASTGIYEALELRDGDKGRYLGK
GVLKAVENINNTLGPALLQKKLSVVDQEKVDKFMIELDGTENKSKFGANAILGVSLAVCK
AGAAEKGVPLYRHIADLAGNPDLILPVPAFNVINGGSHAGNKLAMQEFMILPVGASSFKE
AMRIGAEVYHHLKGVIKAKYGKDATNVGDEGGFAPNILENNEALELLKTAIQAAGYPDKV
VIGMDVAASEFYRNGKYDLDFKSPDDPARHITGEKLGELYKSFIKNYPVVSIEDPFDQDD
WATWTSFLSGVNIQIVGDDLTVTNPKRIAQAVEKKACNCLLLKVNQIGSVTESIQACKLA
QSNGWGVMVSHRSGETEDTFIADLVVGLCTGQIKTGAPCRSERLAKYNQLMRIEEALGDK
AIFAGRKFRNPKAK
NT seq 1305 nt   +upstreamnt  +downstreamnt
atggccatgcagaaaatctttgcccgggaaatcttggactccaggggcaaccccacggtg
gaggtggacctgcacacggccaagggccgattccgagcagctgtgcccagtggggcttcc
acgggtatctatgaggctctggaactaagagacggagacaaaggccgctacctggggaaa
ggagtcctgaaggctgtggagaacatcaacaatactctgggccctgctctgctgcaaaag
aaactaagcgttgtggatcaagaaaaagttgacaaatttatgattgagctagatgggacc
gagaataagtccaagtttggggccaatgccatcctgggcgtgtccttggccgtgtgtaag
gcgggagcagctgagaagggggtccccctgtaccgccacatcgcagatctcgctgggaac
cctgacctcatactcccagtgccagccttcaatgtgatcaacgggggctcccatgctgga
aacaagctggccatgcaggagttcatgattctgcctgtgggagccagctccttcaaggaa
gccatgcgcattggcgccgaggtctaccaccacctcaagggggtcatcaaggccaagtat
gggaaggatgccaccaatgtgggtgatgaaggtggcttcgcacccaacatcctggagaac
aatgaggccctggagctgctgaagacggccatccaggcggctggttacccagacaaggtg
gtgatcggcatggatgtggcagcatctgagttctatcgcaatgggaagtacgatcttgac
ttcaagtcgcctgatgatcccgcacggcacatcactggggagaagctcggagagctgtat
aagagctttatcaagaactatcctgtggtctccatcgaagacccctttgaccaggatgac
tgggccacttggacctccttcctctcgggggtgaacatccagattgtgggggatgacttg
acagtcaccaaccccaagaggattgcccaggccgttgagaagaaggcctgcaactgtctg
ctgctgaaggtcaaccagatcggctcggtgaccgaatcgatccaggcgtgcaaactggct
cagtctaatggctggggggtgatggtgagccaccgctctggggagactgaggacacattc
attgctgaccttgtggtggggctctgcacaggacagatcaagactggcgccccctgccgc
tcggagcgtctggccaaatacaaccaactcatgaggatcgaggaggctcttggggacaag
gcaatctttgctggacgcaagttccgtaacccgaaggccaagtga

KEGG   Homo sapiens (human): 84676
Entry
84676             CDS       T01001                                 
Symbol
TRIM63, CMH31, IRF, MURF1, MURF2, RNF28, SMRZ
Name
(RefSeq) tripartite motif containing 63
  KO
K10655  tripartite motif-containing protein 63 [EC:2.3.2.27]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Disease
H00292  Hypertrophic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    84676 (TRIM63)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04121 Ubiquitin system [BR:hsa04121]
    84676 (TRIM63)
Enzymes [BR:hsa01000]
 2. Transferases
  2.3  Acyltransferases
   2.3.2  Aminoacyltransferases
    2.3.2.27  RING-type E3 ubiquitin transferase
     84676 (TRIM63)
Ubiquitin system [BR:hsa04121]
 Ubiquitin ligases (E3)
  Single Ring-finger type E3
   TRIM/RBCC proteins
    84676 (TRIM63)
SSDB
Motif
Pfam: zf-RING_UBOX zf-C3HC4 zf-C3HC4_2 zf-B_box RING_XB3-XBAT31 zf-RING_5 zf-RING_2 zf-C3HC4_3 zf-C3HC4_4 zf-RING_6 Sina_RING Fzo_mitofusin DUF1664 ATP-synt_E_2 ANCHR-like_BBOX Phasin Spt4 Allexi_40kDa Zn_ribbon_FGT1_2
Other DBs
NCBI-GeneID: 84676
NCBI-ProteinID: NP_115977
OMIM: 606131
HGNC: 16007
Ensembl: ENSG00000158022
UniProt: Q969Q1
Structure
LinkDB
Position
1:complement(26051301..26067630)
AA seq 353 aa
MDYKSSLIQDGNPMENLEKQLICPICLEMFTKPVVILPCQHNLCRKCANDIFQAANPYWT
SRGSSVSMSGGRFRCPTCRHEVIMDRHGVYGLQRNLLVENIIDIYKQECSSRPLQKGSHP
MCKEHEDEKINIYCLTCEVPTCSMCKVFGIHKACEVAPLQSVFQGQKTELNNCISMLVAG
NDRVQTIITQLEDSRRVTKENSHQVKEELSQKFDTLYAILDEKKSELLQRITQEQEKKLS
FIEALIQQYQEQLDKSTKLVETAIQSLDEPGGATFLLTAKQLIKSIVEASKGCQLGKTEQ
GFENMDFFTLDLEHIADALRAIDFGTDEEEEEFIEEEDQEEEESTEGKEEGHQ
NT seq 1062 nt   +upstreamnt  +downstreamnt
atggattataagtcgagcctgatccaggatgggaatcccatggagaacttggagaagcag
ctgatctgccctatctgcctggagatgtttaccaagccagtggtcatcttgccgtgccag
cacaacctgtgccggaagtgtgccaatgacatcttccaggctgcaaatccctactggacc
agccggggcagctcagtgtccatgtctggaggccgtttccgctgccccacctgccgccac
gaggtgatcatggatcgtcacggagtgtacggcctgcagaggaacctgctggtggagaac
atcatcgacatctacaaacaggagtgctccagtcggccgctgcagaagggcagtcacccc
atgtgcaaggagcacgaagatgagaaaatcaacatctactgtctcacgtgtgaggtgccc
acctgctccatgtgcaaggtgtttgggatccacaaggcctgcgaggtggccccattgcag
agtgtcttccagggacaaaagactgaactgaataactgtatctccatgctggtggcgggg
aatgaccgtgtgcagaccatcatcactcagctggaggattcccgtcgagtgaccaaggag
aacagtcaccaggtaaaggaagagctgagccagaagtttgacacgttgtatgccatcctg
gatgagaagaaaagtgagttgctgcagcggatcacgcaggagcaggagaaaaagcttagc
ttcatcgaggccctcatccagcagtaccaggagcagctggacaagtccacaaagctggtg
gaaactgccatccagtccctggacgagcctgggggagccaccttcctcttgactgccaag
caactcatcaaaagcattgtggaagcttccaagggctgccagctggggaagacagagcag
ggctttgagaacatggacttctttactttggatttagagcacatagcagacgccctgaga
gccattgactttgggacagatgaggaagaggaagaattcattgaagaagaagatcaggaa
gaggaagagtccacagaagggaaggaagaaggacaccagtaa

KEGG   Homo sapiens (human): 84675
Entry
84675             CDS       T01001                                 
Symbol
TRIM55, MURF-2, RNF29, muRF2
Name
(RefSeq) tripartite motif containing 55
  KO
K10654  tripartite motif-containing protein 55
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    84675 (TRIM55)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04121 Ubiquitin system [BR:hsa04121]
    84675 (TRIM55)
Ubiquitin system [BR:hsa04121]
 Ubiquitin ligases (E3)
  Single Ring-finger type E3
   TRIM/RBCC proteins
    84675 (TRIM55)
SSDB
Motif
Pfam: zf-RING_UBOX zf-C3HC4 zf-C3HC4_2 RING_XB3-XBAT31 zf-B_box zf-RING_2 zf-RING_5 zf-C3HC4_4 zf-C3HC4_3 zf-RING_6 Sina_RING Cbl_N
Other DBs
NCBI-GeneID: 84675
NCBI-ProteinID: NP_908973
OMIM: 606469
HGNC: 14215
Ensembl: ENSG00000147573
UniProt: Q9BYV6
LinkDB
Position
8:66113351..66175485
AA seq 548 aa
MSASLNYKSFSKEQQTMDNLEKQLICPICLEMFTKPVVILPCQHNLCRKCASDIFQASNP
YLPTRGGTTMASGGRFRCPSCRHEVVLDRHGVYGLQRNLLVENIIDIYKQESTRPEKKSD
QPMCEEHEEERINIYCLNCEVPTCSLCKVFGAHKDCQVAPLTHVFQRQKSELSDGIAILV
GSNDRVQGVISQLEDTCKTIEECCRKQKQELCEKFDYLYGILEERKNEMTQVITRTQEEK
LEHVRALIKKYSDHLENVSKLVESGIQFMDEPEMAVFLQNAKTLLKKISEASKAFQMEKI
EHGYENMNHFTVNLNREEKIIREIDFYREDEDEEEEEGGEGEKEGEGEVGGEAVEVEEVE
NVQTEFPGEDENPEKASELSQVELQAAPGALPVSSPEPPPALPPAADAPVTQGEVVPTGS
EQTTESETPVPAAAETADPLFYPSWYKGQTRKATTNPPCTPGSEGLGQIGPPGSEDSNVR
KAEVAAAAASERAAVSGKETSAPAATSQIGFEAPPLQGQAAAPASGSGADSEPARHIFSF
SWLNSLNE
NT seq 1647 nt   +upstreamnt  +downstreamnt
atgagcgcatctctgaattacaaatctttttccaaagagcagcagaccatggataactta
gagaagcaactcatctgtcccatctgcttagagatgttcacgaaacctgtggtgattctc
ccttgtcagcacaacctgtgtaggaaatgtgccagtgatattttccaggcctctaacccg
tatttgcccacaagaggaggtaccaccatggcatcagggggccgattccgctgcccatcc
tgtagacatgaagtggttttggatagacatggggtatatggacttcagaggaacctgctg
gtggaaaatatcattgacatctacaagcaggagtccaccaggccagaaaagaaatccgac
cagcccatgtgcgaggaacatgaagaggagcgcatcaacatctactgtctgaactgcgaa
gtacccacctgctctctgtgcaaggtgtttggtgcacacaaagactgccaggtggctccc
ctcactcatgtgttccagagacagaagtctgagctcagtgatggcatcgccatcctcgtg
ggcagcaacgatcgagtccagggagtgatcagccagctggaagacacctgcaaaactatc
gaggaatgttgcagaaaacagaaacaagagctttgtgagaagtttgattacctgtatggc
attttggaggagaggaagaatgaaatgacccaagtcattacccgaacccaagaggagaaa
ctggaacatgtccgtgctctgatcaaaaagtattctgatcatttggagaacgtctcaaag
ttggttgagtcaggaattcagtttatggatgagccagaaatggcagtgtttctgcagaat
gccaaaaccctgctaaaaaaaatctcggaagcatcaaaggcatttcagatggagaaaata
gaacatggctatgagaacatgaaccacttcacagtcaacctcaatagagaagaaaagata
atacgtgaaattgacttttacagagaagatgaagatgaagaagaagaagaaggcggagaa
ggagaaaaagaaggagaaggagaagtgggaggagaagcagtagaagtggaagaggtagaa
aatgttcaaacagagtttccaggagaagatgaaaacccagaaaaagcttcagagctctct
caggtggagctgcaggctgcccctggggcacttccagtttcctctccagagccacctcca
gccctgccacctgctgcggatgcccctgtgacacagggggaggttgtacccactggctct
gagcagaccacagagtctgaaactccagtccctgcagcagcagaaactgcggatcccttg
ttttaccctagttggtataaaggccaaacccggaaagccaccaccaacccaccttgcacc
ccagggagcgaaggtctggggcaaatagggcctccaggttctgaggattcgaatgtacgg
aaggcagaagtggcagcagccgcagcgagtgagagggcagctgtgagtggtaaggaaact
agtgcacctgcagctacttctcagattggatttgaggctcctcccctccagggacaggct
gcagctccagcgagtggcagtggagctgattctgagccagctcgccatatcttctccttt
tcctggttgaactccctaaatgaatga

KEGG   Homo sapiens (human): 57159
Entry
57159             CDS       T01001                                 
Symbol
TRIM54, MURF, MURF-3, RNF30, muRF3
Name
(RefSeq) tripartite motif containing 54
  KO
K10653  tripartite motif-containing protein 54
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    57159 (TRIM54)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04121 Ubiquitin system [BR:hsa04121]
    57159 (TRIM54)
Ubiquitin system [BR:hsa04121]
 Ubiquitin ligases (E3)
  Single Ring-finger type E3
   TRIM/RBCC proteins
    57159 (TRIM54)
SSDB
Motif
Pfam: zf-RING_UBOX zf-C3HC4_2 zf-C3HC4 RING_XB3-XBAT31 zf-B_box zf-RING_5 zf-RING_2 zf-C3HC4_4 zf-C3HC4_3 zf-RING_6 PHP Sina_RING UvrD_C DUF349 COG2_C MIF4G_like_2 FlgN DUF1421 DUF5799 Zn_ribbon_FGT1_2 FUSC zf-RING_11
Other DBs
NCBI-GeneID: 57159
NCBI-ProteinID: NP_912730
OMIM: 606474
HGNC: 16008
Ensembl: ENSG00000138100
UniProt: Q9BYV2
Structure
LinkDB
Position
2:27282429..27307435
AA seq 358 aa
MNFTVGFKPLLGDAHSMDNLEKQLICPICLEMFSKPVVILPCQHNLCRKCANDVFQASNP
LWQSRGSTTVSSGGRFRCPSCRHEVVLDRHGVYGLQRNLLVENIIDIYKQESSRPLHSKA
EQHLMCEEHEEEKINIYCLSCEVPTCSLCKVFGAHKDCEVAPLPTIYKRQKSELSDGIAM
LVAGNDRVQAVITQMEEVCQTIEDNSRRQKQLLNQRFESLCAVLEERKGELLQALAREQE
EKLQRVRGLIRQYGDHLEASSKLVESAIQSMEEPQMALYLQQAKELINKVGAMSKVELAG
RPEPGYESMEQFTVRVEHVAEMLRTIDFQPGASGEEEEVAPDGEEGSAGPEEERPDGP
NT seq 1077 nt   +upstreamnt  +downstreamnt
atgaacttcacagtgggtttcaagccgctgctaggggatgcacacagcatggacaacctg
gagaagcagctcatctgccccatctgcctggagatgttctccaaaccagtggtgatcctg
ccctgccaacacaacctgtgccgcaaatgtgccaacgacgtcttccaggcctcgaatcct
ctatggcagtcccggggctccaccactgtgtcttcaggaggccgtttccgctgcccatcg
tgcaggcatgaggttgtcctggacagacacggtgtctacggcctgcagcgaaacctgcta
gtggagaacattatcgacatttacaagcaggagtcatccaggccgctgcactccaaggct
gagcagcacctcatgtgcgaggagcatgaagaagagaagatcaatatttactgcctgagc
tgtgaggtgcccacctgctctctctgcaaggtcttcggtgcccacaaggactgtgaggtg
gccccactgcccaccatttacaaacgccagaagagtgagctcagcgatggcatcgcgatg
ctggtggcaggcaatgaccgcgtgcaagcagtgatcacacagatggaggaggtgtgccag
actatcgaggacaatagccggaggcagaagcagttgttaaaccagaggtttgagagcctg
tgcgcagtgctggaggagcgcaagggtgagctgctgcaggcgctggcccgggagcaagag
gagaagctgcagcgcgtccgcggcctcatccgtcagtatggcgaccacctggaggcctcc
tctaagctggtggagtctgccatccagtccatggaagagccacaaatggcgctgtatctc
cagcaggccaaggagctgatcaataaggtcggggccatgtcgaaggtggagctggcaggg
cggccggagccaggctatgagagcatggagcaattcaccgtaagggtggagcacgtggcc
gaaatgctgcggaccatcgacttccagccaggcgcttccggggaggaagaggaggtggcc
ccagacggagaggagggcagcgcggggccggaggaagagcggccggatgggccttaa

KEGG   Homo sapiens (human): 823
Entry
823               CDS       T01001                                 
Symbol
CAPN1, CANP, CANP1, CANPL1, SPG76, muCANP, muCL
Name
(RefSeq) calpain 1
  KO
K01367  calpain-1 [EC:3.4.22.52]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04141  Protein processing in endoplasmic reticulum
hsa04210  Apoptosis
hsa04217  Necroptosis
hsa04218  Cellular senescence
hsa04382  Cornified envelope formation
hsa05010  Alzheimer disease
hsa05022  Pathways of neurodegeneration - multiple diseases
hsa05131  Shigellosis
Network
nt06460  Alzheimer disease
nt06466  Pathways of neurodegeneration
nt06539  Cytoskeleton in muscle cells
nt06545  Cornified envelope formation
  Element
N00997  Mutation-caused aberrant Abeta to electron transfer in Complex I
N01018  Mutation-caused aberrant Abeta to anterograde axonal transport
N01821  Sarcomere, M-band
N01931  Processing of filaggrin to amino acids
Disease
H00266  Hereditary spastic paraplegia
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   04141 Protein processing in endoplasmic reticulum
    823 (CAPN1)
 09140 Cellular Processes
  09143 Cell growth and death
   04210 Apoptosis
    823 (CAPN1)
   04217 Necroptosis
    823 (CAPN1)
   04218 Cellular senescence
    823 (CAPN1)
 09150 Organismal Systems
  09158 Development and regeneration
   04382 Cornified envelope formation
    823 (CAPN1)
 09160 Human Diseases
  09171 Infectious disease: bacterial
   05131 Shigellosis
    823 (CAPN1)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    823 (CAPN1)
   05022 Pathways of neurodegeneration - multiple diseases
    823 (CAPN1)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    823 (CAPN1)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.22  Cysteine endopeptidases
    3.4.22.52  calpain-1
     823 (CAPN1)
Peptidases and inhibitors [BR:hsa01002]
 Cysteine peptidases
  Family C2: calpain family
   823 (CAPN1)
SSDB
Motif
Pfam: Peptidase_C2 Calpain_III EF-hand_6 EF-hand_1 EF-hand_8 EF-hand_7 CAPN13-like_C_EFh
Other DBs
NCBI-GeneID: 823
NCBI-ProteinID: NP_001185797
OMIM: 114220
HGNC: 1476
Ensembl: ENSG00000014216
UniProt: P07384 B2RDI5
Structure
LinkDB
Position
11:65181373..65212006
AA seq 714 aa
MSEEIITPVYCTGVSAQVQKQRARELGLGRHENAIKYLGQDYEQLRVRCLQSGTLFRDEA
FPPVPQSLGYKDLGPNSSKTYGIKWKRPTELLSNPQFIVDGATRTDICQGALGDCWLLAA
IASLTLNDTLLHRVVPHGQSFQNGYAGIFHFQLWQFGEWVDVVVDDLLPIKDGKLVFVHS
AEGNEFWSALLEKAYAKVNGSYEALSGGSTSEGFEDFTGGVTEWYELRKAPSDLYQIILK
ALERGSLLGCSIDISSVLDMEAITFKKLVKGHAYSVTGAKQVNYRGQVVSLIRMRNPWGE
VEWTGAWSDSSSEWNNVDPYERDQLRVKMEDGEFWMSFRDFMREFTRLEICNLTPDALKS
RTIRKWNTTLYEGTWRRGSTAGGCRNYPATFWVNPQFKIRLDETDDPDDYGDRESGCSFV
LALMQKHRRRERRFGRDMETIGFAVYEVPPELVGQPAVHLKRDFFLANASRARSEQFINL
REVSTRFRLPPGEYVVVPSTFEPNKEGDFVLRFFSEKSAGTVELDDQIQANLPDEQVLSE
EEIDENFKALFRQLAGEDMEISVKELRTILNRIISKHKDLRTKGFSLESCRSMVNLMDRD
GNGKLGLVEFNILWNRIRNYLSIFRKFDLDKSGSMSAYEMRMAIESAGFKLNKKLYELII
TRYSEPDLAVDFDNFVCCLVRLETMFRFFKTLDTDLDGVVTFDLFKWLQLTMFA
NT seq 2145 nt   +upstreamnt  +downstreamnt
atgtcggaggagatcatcacgccggtgtactgcactggggtgtcagcccaagtgcagaag
cagcgggccagggagctgggcctgggccgccatgagaatgccatcaagtacctgggccag
gattatgagcagctgcgggtgcgatgcctgcagagtgggaccctcttccgtgatgaggcc
ttccccccggtaccccagagcctgggttacaaggacctgggtcccaattcctccaagacc
tatggcatcaagtggaagcgtcccacggaactgctgtcaaacccccagttcattgtggat
ggagctacccgcacagacatctgccagggagcactgggggactgctggctcttggcggcc
atcgcctccctcactctcaacgacaccctcctgcaccgagtggttccgcacggccagagc
ttccagaatggctatgccggcatcttccatttccagctgtggcaatttggggagtgggtg
gacgtggtcgtggatgacctgctgcccatcaaggacgggaagctagtgttcgtgcactct
gccgaaggcaacgagttctggagcgccctgcttgagaaggcctatgccaaggtaaatggc
agctacgaggccctgtcagggggcagcacctcagagggctttgaggacttcacaggcggg
gttaccgagtggtacgagttgcgcaaggctcccagtgacctctaccagatcatcctcaag
gcgctggagcggggctccctgctgggctgctccatagacatctccagcgttctagacatg
gaggccatcactttcaagaagttggtgaagggccatgcctactctgtgaccggggccaag
caggtgaactaccgaggccaggtggtgagcctgatccggatgcggaacccctggggcgag
gtggagtggacgggagcctggagcgacagctcctcagagtggaacaacgtggacccatat
gaacgggaccagctccgggtcaagatggaggacggggagttctggatgtcattccgagac
ttcatgcgggagttcacccgcctggagatctgcaacctcacacccgacgccctcaagagc
cggaccatccgcaaatggaacaccacactctacgaaggcacctggcggcgggggagcacc
gcggggggctgccgaaactacccagccaccttctgggtgaaccctcagttcaagatccgg
ctggatgagacggatgacccggacgactacggggaccgcgagtcaggctgcagcttcgtg
ctcgcccttatgcagaagcaccgtcgccgcgagcgccgcttcggccgcgacatggagact
attggcttcgcggtctacgaggtccctccggagctggtgggccagccggccgtacacttg
aagcgtgacttcttcctggccaatgcgtctcgggcgcgctcagagcagttcatcaacctg
cgagaggtcagcacccgcttccgcctgccacccggggagtatgtggtggtgccctccacc
ttcgagcccaacaaggagggcgacttcgtgctgcgcttcttctcagagaagagtgctggg
actgtggagctggatgaccagatccaggccaatctccccgatgagcaagtgctctcagaa
gaggagattgacgagaacttcaaggccctcttcaggcagctggcaggggaggacatggag
atcagcgtgaaggagttgcggacaatcctcaataggatcatcagcaaacacaaagacctg
cggaccaagggcttcagcctagagtcgtgccgcagcatggtgaacctcatggatcgtgat
ggcaatgggaagctgggcctggtggagttcaacatcctgtggaaccgcatccggaattac
ctgtccatcttccggaagtttgacctggacaagtcgggcagcatgagtgcctacgagatg
cggatggccattgagtcggcaggcttcaagctcaacaagaagctgtacgagctcatcatc
acccgctactcggagcccgacctggcggtcgactttgacaatttcgtttgctgcctggtg
cggctagagaccatgttccgatttttcaaaactctggacacagatctggatggagttgtg
acctttgacttgtttaagtggttgcagctgaccatgtttgcatga

KEGG   Homo sapiens (human): 824
Entry
824               CDS       T01001                                 
Symbol
CAPN2, CANP2, CANPL2, CANPml, mCANP
Name
(RefSeq) calpain 2
  KO
K03853  calpain-2 [EC:3.4.22.53]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04141  Protein processing in endoplasmic reticulum
hsa04210  Apoptosis
hsa04217  Necroptosis
hsa04218  Cellular senescence
hsa04510  Focal adhesion
hsa05010  Alzheimer disease
hsa05022  Pathways of neurodegeneration - multiple diseases
hsa05131  Shigellosis
Network
nt06460  Alzheimer disease
nt06466  Pathways of neurodegeneration
nt06539  Cytoskeleton in muscle cells
  Element
N00997  Mutation-caused aberrant Abeta to electron transfer in Complex I
N01018  Mutation-caused aberrant Abeta to anterograde axonal transport
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   04141 Protein processing in endoplasmic reticulum
    824 (CAPN2)
 09140 Cellular Processes
  09143 Cell growth and death
   04210 Apoptosis
    824 (CAPN2)
   04217 Necroptosis
    824 (CAPN2)
   04218 Cellular senescence
    824 (CAPN2)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    824 (CAPN2)
 09160 Human Diseases
  09171 Infectious disease: bacterial
   05131 Shigellosis
    824 (CAPN2)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    824 (CAPN2)
   05022 Pathways of neurodegeneration - multiple diseases
    824 (CAPN2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    824 (CAPN2)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.22  Cysteine endopeptidases
    3.4.22.53  calpain-2
     824 (CAPN2)
Peptidases and inhibitors [BR:hsa01002]
 Cysteine peptidases
  Family C2: calpain family
   824 (CAPN2)
SSDB
Motif
Pfam: Peptidase_C2 Calpain_III EF-hand_6 EF-hand_1 EF-hand_8 EF-hand_7
Other DBs
NCBI-GeneID: 824
NCBI-ProteinID: NP_001739
OMIM: 114230
HGNC: 1479
Ensembl: ENSG00000162909
UniProt: P17655 Q59EF6
Structure
LinkDB
Position
1:223701597..223776018
AA seq 700 aa
MAGIAAKLAKDREAAEGLGSHDRAIKYLNQDYEALRNECLEAGTLFQDPSFPAIPSALGF
KELGPYSSKTRGIEWKRPTEICADPQFIIGGATRTDICQGALGDCWLLAAIASLTLNEEI
LARVVPLNQSFQENYAGIFHFQFWQYGEWVEVVVDDRLPTKDGELLFVHSAEGSEFWSAL
LEKAYAKINGCYEALSGGATTEGFEDFTGGIAEWYELKKPPPNLFKIIQKALQKGSLLGC
SIDITSAADSEAITFQKLVKGHAYSVTGAEEVESNGSLQKLIRIRNPWGEVEWTGRWNDN
CPSWNTIDPEERERLTRRHEDGEFWMSFSDFLRHYSRLEICNLTPDTLTSDTYKKWKLTK
MDGNWRRGSTAGGCRNYPNTFWMNPQYLIKLEEEDEDEEDGESGCTFLVGLIQKHRRRQR
KMGEDMHTIGFGIYEVPEELSGQTNIHLSKNFFLTNRARERSDTFINLREVLNRFKLPPG
EYILVPSTFEPNKDGDFCIRVFSEKKADYQAVDDEIEANLEEFDISEDDIDDGFRRLFAQ
LAGEDAEISAFELQTILRRVLAKRQDIKSDGFSIETCKIMVDMLDSDGSGKLGLKEFYIL
WTKIQKYQKIYREIDVDRSGTMNSYEMRKALEEAGFKMPCQLHQVIVARFADDQLIIDFD
NFVRCLVRLETLFKIFKQLDPENTGTIELDLISWLCFSVL
NT seq 2103 nt   +upstreamnt  +downstreamnt
atggcgggcatcgcggccaagctggcgaaggaccgggaggcggccgaggggctgggctcc
cacgacagggccatcaagtacctcaaccaggactacgaggcgctgcggaacgagtgcctg
gaggccgggacgctcttccaggacccgtccttcccggccatcccctcggccctgggcttc
aaggagttggggccctactccagcaaaacccggggcatcgagtggaagcgccccacggag
atctgcgctgacccccagtttatcattggaggagccacccgcacagacatctgccaagga
gccctgggtgactgctggctgctggcagccattgcctccctcaccttgaatgaagaaatc
ctggctcgagtcgtccccctaaaccagagcttccaggaaaactatgcagggatctttcac
ttccagttctggcaatacggcgagtgggtggaggtggtggtggatgacaggctgcccacc
aaggacggggagctgctctttgtgcattcagccgaagggagcgagttctggagcgccctg
ctggagaaggcatacgccaagatcaacggatgctatgaagcgctatcagggggtgccacc
actgagggcttcgaagacttcaccggaggcattgctgagtggtatgagttgaagaagccc
cctcccaacctgttcaagatcatccagaaagctctgcaaaaaggctctctccttggctgc
tccatcgacatcaccagcgccgcggactcggaggccatcacgtttcagaagctggtgaag
gggcacgcgtactcggtcaccggagccgaggaggttgaaagtaacggaagcctacagaaa
ctgatccgcatccgaaatccctggggagaagtggagtggacagggcggtggaatgacaac
tgcccaagctggaacactatagacccagaggagagggaaaggctgaccagacggcatgaa
gatggagaattctggatgtctttcagtgacttcctgaggcactattcccgcctggagatc
tgtaacctgaccccagacactctcaccagcgatacctacaagaagtggaaactcaccaaa
atggatgggaactggaggcggggctccaccgcgggaggttgcaggaactacccgaacaca
ttctggatgaaccctcagtacctgatcaagctggaggaggaggatgaggacgaggaggat
ggggagagcggctgcaccttcctggtggggctcattcagaagcaccgacggcggcagagg
aagatgggcgaggacatgcacaccatcggctttggcatctatgaggttccagaggagtta
agtgggcagaccaacatccacctcagcaaaaacttcttcctgacgaatcgcgccagggag
cgctcagacaccttcatcaacctccgggaggtgctcaaccgcttcaagctgccgccagga
gagtacattctcgtgccttccaccttcgaacccaacaaggatggggatttctgcatccgg
gtcttttctgaaaagaaagctgactaccaagctgtcgatgatgaaatcgaggccaatctt
gaagagttcgacatcagcgaggatgacattgatgatggattcaggagactgtttgcccag
ttggcaggagaggatgcggagatctctgcctttgagctgcagaccatcctgagaagggtt
ctagcaaagcgccaagatatcaagtcagatggcttcagcatcgagacatgcaaaattatg
gttgacatgctagattcggacgggagtggcaagctggggctgaaggagttctacattctc
tggacgaagattcaaaaataccaaaaaatttaccgagaaatcgacgttgacaggtctggt
accatgaattcctatgaaatgcggaaggcattagaagaagcaggtttcaagatgccctgt
caactccaccaagtcatcgttgctcggtttgcagatgaccagctcatcatcgattttgat
aattttgttcggtgtttggttcggctggaaacgctattcaagatatttaagcagctggat
cccgagaatactggaacaatagagctcgaccttatctcttggctctgtttctcagtactt
tga

KEGG   Homo sapiens (human): 825
Entry
825               CDS       T01001                                 
Symbol
CAPN3, CANP3, CANPL3, LGMD2, LGMD2A, LGMDD4, LGMDR1, nCL-1, p94
Name
(RefSeq) calpain 3
  KO
K08573  calpain-3 [EC:3.4.22.54]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Disease
H00592  Calpainopathy
H00593  Limb-girdle muscular dystrophy
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    825 (CAPN3)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    825 (CAPN3)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.22  Cysteine endopeptidases
    3.4.22.54  calpain-3
     825 (CAPN3)
Peptidases and inhibitors [BR:hsa01002]
 Cysteine peptidases
  Family C2: calpain family
   825 (CAPN3)
SSDB
Motif
Pfam: Peptidase_C2 Calpain_III Calpain_u2 EF-hand_1 EF-hand_6 EF-hand_7 EF-hand_8 EF-hand_5 CAPN13-like_C_EFh SURF6
Other DBs
NCBI-GeneID: 825
NCBI-ProteinID: NP_000061
OMIM: 114240
HGNC: 1480
Ensembl: ENSG00000092529
UniProt: P20807
Structure
LinkDB
Position
15:42359501..42412317
AA seq 821 aa
MPTVISASVAPRTAAEPRSPGPVPHPAQSKATEAGGGNPSGIYSAIISRNFPIIGVKEKT
FEQLHKKCLEKKVLYVDPEFPPDETSLFYSQKFPIQFVWKRPPEICENPRFIIDGANRTD
ICQGELGDCWFLAAIACLTLNQHLLFRVIPHDQSFIENYAGIFHFQFWRYGEWVDVVIDD
CLPTYNNQLVFTKSNHRNEFWSALLEKAYAKLHGSYEALKGGNTTEAMEDFTGGVAEFFE
IRDAPSDMYKIMKKAIERGSLMGCSIDDGTNMTYGTSPSGLNMGELIARMVRNMDNSLLQ
DSDLDPRGSDERPTRTIIPVQYETRMACGLVRGHAYSVTGLDEVPFKGEKVKLVRLRNPW
GQVEWNGSWSDRWKDWSFVDKDEKARLQHQVTEDGEFWMSYEDFIYHFTKLEICNLTADA
LQSDKLQTWTVSVNEGRWVRGCSAGGCRNFPDTFWTNPQYRLKLLEEDDDPDDSEVICSF
LVALMQKNRRKDRKLGASLFTIGFAIYEVPKEMHGNKQHLQKDFFLYNASKARSKTYINM
REVSQRFRLPPSEYVIVPSTYEPHQEGEFILRVFSEKRNLSEEVENTISVDRPVKKKKTK
PIIFVSDRANSNKELGVDQESEEGKGKTSPDKQKQSPQPQPGSSDQESEEQQQFRNIFKQ
IAGDDMEICADELKKVLNTVVNKHKDLKTHGFTLESCRSMIALMDTDGSGKLNLQEFHHL
WNKIKAWQKIFKHYDTDQSGTINSYEMRNAVNDAGFHLNNQLYDIITMRYADKHMNIDFD
SFICCFVRLEGMFRAFHAFDKDGDGIIKLNVLEWLQLTMYA
NT seq 2466 nt   +upstreamnt  +downstreamnt
atgccgaccgtcattagcgcatctgtggctccaaggacagcggctgagccccggtcccca
gggccagttcctcacccggcccagagcaaggccactgaggctgggggtggaaacccaagt
ggcatctattcagccatcatcagccgcaattttcctattatcggagtgaaagagaagaca
ttcgagcaacttcacaagaaatgtctagaaaagaaagttctttatgtggaccctgagttc
ccaccggatgagacctctctcttttatagccagaagttccccatccagttcgtctggaag
agacctccggaaatttgcgagaatccccgatttatcattgatggagccaacagaactgac
atctgtcaaggagagctaggggactgctggtttctcgcagccattgcctgcctgaccctg
aaccagcaccttcttttccgagtcataccccatgatcaaagtttcatcgaaaactacgca
gggatcttccacttccagttctggcgctatggagagtgggtggacgtggttatagatgac
tgcctgccaacgtacaacaatcaactggttttcaccaagtccaaccaccgcaatgagttc
tggagtgctctgctggagaaggcttatgctaagctccatggttcctacgaagctctgaaa
ggtgggaacaccacagaggccatggaggacttcacaggaggggtggcagagttttttgag
atcagggatgctcctagtgacatgtacaagatcatgaagaaagccatcgagagaggctcc
ctcatgggctgctccattgatgatggcacgaacatgacctatggaacctctccttctggt
ctgaacatgggggagttgattgcacggatggtaaggaatatggataactcactgctccag
gactcagacctcgaccccagaggctcagatgaaagaccgacccggacaatcattccggtt
cagtatgagacaagaatggcctgcgggctggtcagaggtcacgcctactctgtcacgggg
ctggatgaggtcccgttcaaaggtgagaaagtgaagctggtgcggctgcggaatccgtgg
ggccaggtggagtggaacggttcttggagtgatagatggaaggactggagctttgtggac
aaagatgagaaggcccgtctgcagcaccaggtcactgaggatggagagttctggatgtcc
tatgaggatttcatctaccatttcacaaagttggagatctgcaacctcacggccgatgct
ctgcagtctgacaagcttcagacctggacagtgtctgtgaacgagggccgctgggtacgg
ggttgctctgccggaggctgccgcaacttcccagatactttctggaccaaccctcagtac
cgtctgaagctcctggaggaggacgatgaccctgatgactcggaggtgatttgcagcttc
ctggtggccctgatgcagaagaaccggcggaaggaccggaagctaggggccagtctcttc
accattggcttcgccatctacgaggttcccaaagagatgcacgggaacaagcagcacctg
cagaaggacttcttcctgtacaacgcctccaaggccaggagcaaaacctacatcaacatg
cgggaggtgtcccagcgcttccgcctgcctcccagcgagtacgtcatcgtgccctccacc
tacgagccccaccaggagggggaattcatcctccgggtcttctctgaaaagaggaacctc
tctgaggaagttgaaaataccatctccgtggatcggccagtgaaaaagaaaaaaaccaag
cccatcatcttcgtttcggacagagcaaacagcaacaaggagctgggtgtggaccaggag
tcagaggagggcaaaggcaaaacaagccctgataagcaaaagcagtccccacagccacag
cctggcagctctgatcaggaaagtgaggaacagcaacaattccggaacattttcaagcag
atagcaggagatgacatggagatctgtgcagatgagctcaagaaggtccttaacacagtc
gtgaacaaacacaaggacctgaagacacacgggttcacactggagtcctgccgtagcatg
attgcgctcatggatacagatggctctggaaagctcaacctgcaggagttccaccacctc
tggaacaagattaaggcctggcagaaaattttcaaacactatgacacagaccagtccggc
accatcaacagctacgagatgcgaaatgcagtcaacgacgcaggattccacctcaacaac
cagctctatgacatcattaccatgcggtacgcagacaaacacatgaacatcgactttgac
agtttcatctgctgcttcgttaggctggagggcatgttcagagcttttcatgcatttgac
aaggatggagatggtatcatcaagctcaacgttctggagtggctgcagctcaccatgtat
gcctga

KEGG   Homo sapiens (human): 1073
Entry
1073              CDS       T01001                                 
Symbol
CFL2, NEM7
Name
(RefSeq) cofilin 2
  KO
K05765  cofilin
Organism
hsa  Homo sapiens (human)
Pathway
hsa04360  Axon guidance
hsa04666  Fc gamma R-mediated phagocytosis
hsa04810  Regulation of actin cytoskeleton
hsa05133  Pertussis
hsa05170  Human immunodeficiency virus 1 infection
Network
nt06161  Human immunodeficiency virus 1 (HIV-1)
nt06539  Cytoskeleton in muscle cells
nt06541  Cytoskeleton in neurons
  Element
N00433  CXCR4-GNB/G-RAC signaling pathway
N01821  Sarcomere, M-band
N01838  Regulation of synaptic plasticity, p140Cap
Disease
H00698  Nemaline myopathy
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    1073 (CFL2)
 09150 Organismal Systems
  09151 Immune system
   04666 Fc gamma R-mediated phagocytosis
    1073 (CFL2)
  09158 Development and regeneration
   04360 Axon guidance
    1073 (CFL2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05170 Human immunodeficiency virus 1 infection
    1073 (CFL2)
  09171 Infectious disease: bacterial
   05133 Pertussis
    1073 (CFL2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:hsa04131]
    1073 (CFL2)
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:hsa04812]
    1073 (CFL2)
   04147 Exosome [BR:hsa04147]
    1073 (CFL2)
Membrane trafficking [BR:hsa04131]
 Others
  Actin-binding proteins
   Others
    1073 (CFL2)
Cytoskeleton proteins [BR:hsa04812]
 Eukaryotic cytoskeleton proteins
  Actin filaments / Microfilaments
   Actin-binding proteins
    ADF / Cofilin
     1073 (CFL2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Proteins found in most exosomes
   1073 (CFL2)
SSDB
Motif
Pfam: Cofilin_ADF
Other DBs
NCBI-GeneID: 1073
NCBI-ProteinID: NP_068733
OMIM: 601443
HGNC: 1875
Ensembl: ENSG00000165410
UniProt: Q9Y281 Q549N0
Structure
LinkDB
Position
14:complement(34709113..34714593)
AA seq 166 aa
MASGVTVNDEVIKVFNDMKVRKSSTQEEIKKRKKAVLFCLSDDKRQIIVEEAKQILVGDI
GDTVEDPYTSFVKLLPLNDCRYALYDATYETKESKKEDLVFIFWAPESAPLKSKMIYASS
KDAIKKKFTGIKHEWQVNGLDDIKDRSTLGEKLGGNVVVSLEGKPL
NT seq 501 nt   +upstreamnt  +downstreamnt
atggcttctggagttacagtgaatgatgaagtcatcaaagtttttaatgatatgaaagta
aggaaatcttctacacaagaggagatcaaaaagagaaagaaagcagttctcttctgttta
agcgatgacaaaagacaaataattgtagaggaagcaaagcagatcttggtgggtgacatt
ggtgatactgtagaggacccctacacatcttttgtgaagttgctacctctgaatgattgc
cgatatgctttgtacgatgccacatacgaaacaaaagagtctaagaaagaagacctagta
tttatattctgggctcctgaaagtgcacctttaaaaagcaagatgatttatgctagctct
aaagatgccattaaaaagaaatttacaggtattaaacatgagtggcaagtaaatggcttg
gatgatattaaggaccgttcgacacttggagagaaattgggaggcaatgtagtagtttca
cttgaaggaaaaccattataa

DBGET integrated database retrieval system