KEGG   Homo sapiens (human): 84033
Entry
84033             CDS       T01001                                 
Symbol
OBSCN, ARHGEF30, RHABDO1, UNC89
Name
(RefSeq) obscurin, cytoskeletal calmodulin and titin-interacting RhoGEF
  KO
K17531  obscurin-RhoGEF [EC:2.7.11.1]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    84033 (OBSCN)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01001 Protein kinases [BR:hsa01001]
    84033 (OBSCN)
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:hsa04131]
    84033 (OBSCN)
Enzymes [BR:hsa01000]
 2. Transferases
  2.7  Transferring phosphorus-containing groups
   2.7.11  Protein-serine/threonine kinases
    2.7.11.1  non-specific serine/threonine protein kinase
     84033 (OBSCN)
Protein kinases [BR:hsa01001]
 Serine/threonine kinases: CAMK group
  Trio family [OT]
   84033 (OBSCN)
Membrane trafficking [BR:hsa04131]
 Others
  Rho GTPase associated proteins
   Rho guanine nucleotide exchange factors (GEFs)
    84033 (OBSCN)
SSDB
Motif
Pfam: I-set Ig_3 ig Ig_2 V-set Pkinase PK_Tyr_Ser-Thr Ig_5 Ig_4 C1-set C2-set_2 Izumo-Ig SOS1_NGEF_PH fn3 RhoGEF PH ABC1 IQ Pkinase_fungal IQ_SEC7_PH
Other DBs
NCBI-GeneID: 84033
NCBI-ProteinID: NP_001092093
OMIM: 608616
HGNC: 15719
Ensembl: ENSG00000154358
UniProt: Q5VST9
Structure
LinkDB
Position
1:228208044..228378876
AA seq 7968 aa
MDQPQFSGAPRFLTRPKAFVVSVGKDATLSCQIVGNPTPQVSWEKDQQPVAAGARFRLAQ
DGDLYRLTILDLALGDSGQYVCRARNAIGEAFAAVGLQVDAEAACAEQAPHFLLRPTSIR
VREGSEATFRCRVGGSPRPAVSWSKDGRRLGEPDGPRVRVEELGEASALRIRAARPRDGG
TYEVRAENPLGAASAAAALVVDSDAADTASRPGTSTAALLAHLQRRREAMRAEGAPASPP
STGTRTCTVTEGKHARLSCYVTGEPKPETVWKKDGQLVTEGRRHVVYEDAQENFVLKILF
CKQSDRGLYTCTASNLVGQTYSSVLVVVREPAVPFKKRLQDLEVREKESATFLCEVPQPS
TEAAWFKEETRLWASAKYGIEEEGTERRLTVRNVSADDDAVYICETPEGSRTVAELAVQG
NLLRKLPRKTAVRVGDTAMFCVELAVPVGPVHWLRNQEEVVAGGRVAISAEGTRHTLTIS
QCCLEDVGQVAFMAGDCQTSTQFCVSAPRKPPLQPPVDPVVKARMESSVILSWSPPPHGE
RPVTIDGYLVEKKKLGTYTWIRCHEAEWVATPELTVADVAEEGNFQFRVSALNSFGQSPY
LEFPGTVHLAPKLAVRTPLKAVQAVEGGEVTFSVDLTVASAGEWFLDGQALKASSVYEIH
CDRTRHTLTIREVPASLHGAQLKFVANGIESSIRMEVRAAPGLTANKPPAAAAREVLARL
HEEAQLLAELSDQAAAVTWLKDGRTLSPGPKYEVQASAGRRVLLVRDVARDDAGLYECVS
RGGRIAYQLSVQGLARFLHKDMAGSCVDAVAGGPAQFECETSEAHVHVHWYKDGMELGHS
GERFLQEDVGTRHRLVAATVTRQDEGTYSCRVGEDSVDFRLRVSEPKVVFAKEQLARRKL
QAEAGASATLSCEVAQAQTEVTWYKDGKKLSSSSKVCMEATGCTRRLVVQQAGQADAGEY
SCEAGGQRLSFHLDVKEPKVVFAKDQVAHSEVQAEAGASATLSCEVAQAQTEVMWYKDGK
KLSSSLKVHVEAKGCRRRLVVQQAGKTDAGDYSCEARGQRVSFRLHITEPKMMFAKEQSV
HNEVQAEAGASAMLSCEVAQAQTEVTWYKDGKKLSSSSKVGMEVKGCTRRLVLPQAGKAD
AGEYSCEAGGQRVSFHLHITEPKGVFAKEQSVHNEVQAEAGTTAMLSCEVAQPQTEVTWY
KDGKKLSSSSKVRMEVKGCTRRLVVQQVGKADAGEYSCEAGGQRVSFQLHITEPKAVFAK
EQLVHNEVRTEAGASATLSCEVAQAQTEVTWYKDGKKLSSSSKVRIEAAGCMRQLVVQQA
GQADAGEYTCEAGGQRLSFHLDVSEPKAVFAKEQLAHRKVQAEAGAIATLSCEVAQAQTE
VTWYKDGKKLSSSSKVRMEAVGCTRRLVVQQACQADTGEYSCEAGGQRLSFSLDVAEPKV
VFAKEQPVHREVQAQAGASTTLSCEVAQAQTEVMWYKDGKKLSFSSKVRMEAVGCTRRLV
VQQAGQAVAGEYSCEAGSQRLSFHLHVAEPKAVFAKEQPASREVQAEAGTSATLSCEVAQ
AQTEVTWYKDGKKLSSSSKVRMEAVGCTRRLVVQEAGQADAGEYSCKAGDQRLSFHLHVA
EPKVVFAKEQPAHREVQAEAGASATLSCEVAQAQTEVTWYKDGKKLSSSSKVRVEAVGCT
RRLVVQQAGQAEAGEYSCEAGGQQLSFRLQVAELEPQISERPCRREPLVVKEHEDIILTA
TLATPSAATVTWLKDGVEIRRSKRHETASQGDTHTLTVHGAQVLDSAIYSCRVGAEGQDF
PVQVEEVAAKFCRLLEPVCGELGGTVTLACELSPACAEVVWRCGNTQLRVGKRFQMVAEG
PVRSLTVLGLRAEDAGEYVCESRDDHTSAQLTVSVPRVVKFMSGLSTVVAEEGGEATFQC
VVSPSDVAVVWFRDGALLQPSEKFAISQSGASHSLTISDLVLEDAGQITVEAEGASSSAA
LRVREAPVLFKKKLEPQTVEERSSVTLEVELTRPWPELRWTRNATALAPGKNVEIHAEGA
RHRLVLHNVGFADRGFFGCETPDDKTQAKLTVEMRQVRLVRGLQAVEAREQGTATMEVQL
SHADVDGSWTRDGLRFQQGPTCHLAVRGPMHTLTLSGLRPEDSGLMVFKAEGVHTSARLV
VTELPVSFSRPLQDVVTTEKEKVTLECELSRPNVDVRWLKDGVELRAGKTMAIAAQGACR
SLTIYRCEFADQGVYVCDAHDAQSSASVKVQGRTYTLIYRRVLAEDAGEIQFVAENAESR
AQLRVKELPVTLVRPLRDKIAMEKHRGVLECQVSRASAQVRWFKGSQELQPGPKYELVSD
GLYRKLIISDVHAEDEDTYTCDAGDVKTSAQFFVEEQSITIVRGLQDVTVMEPAPAWFEC
ETSIPSVRPPKWLLGKTVLQAGGNVGLEQEGTVHRLMLRRTCSTMTGPVHFTVGKSRSSA
RLVVSDIPVVLTRPLEPKTGRELQSVVLSCDFRPAPKAVQWYKDDTPLSPSEKFKMSLEG
QMAELRILRLMPADAGVYRCQAGSAHSSTEVTVEAREVTVTGPLQDAEATEEGWASFSCE
LSHEDEEVEWSLNGMPLYNDSFHEISHKGRRHTLVLKSIQRADAGIVRASSLKVSTSARL
EVRVKPVVFLKALDDLSAEERGTLALQCEVSDPEAHVVWRKDGVQLGPSDKYDFLHTAGT
RGLVVHDVSPEDAGLYTCHVGSEETRARVRVHDLHVGITKRLKTMEVLEGESCSFECVLS
HESASDPAMWTVGGKTVGSSSRFQATRQGRKYILVVREAAPSDAGEVVFSVRGLTSKASL
IVRERPAAIIKPLEDQWVAPGEDVELRCELSRAGTPVHWLKDRKAIRKSQKYDVVCEGTM
AMLVIRGASLKDAGEYTCEVEASKSTASLHVEEKANCFTEELTNLQVEEKGTAVFTCKTE
HPAATVTWRKGLLELRASGKHQPSQEGLTLRLTISALEKADSDTYTCDIGQAQSRAQLLV
QGRRVHIIEDLEDVDVQEGSSATFRCRISPANYEPVHWFLDKTPLHANELNEIDAQPGGY
HVLTLRQLALKDSGTIYFEAGDQRASAALRVTEKPSVFSRELTDATITEGEDLTLVCETS
TCDIPVCWTKDGKTLRGSARCQLSHEGHRAQLLITGATLQDSGRYKCEAGGACSSSIVRV
HARPVRFQEALKDLEVLEGGAATLRCVLSSVAAPVKWCYGNNVLRPGDKYSLRQEGAMLE
LVVRNLRPQDSGRYSCSFGDQTTSATLTVTALPAQFIGKLRNKEATEGATATLRCELSKA
APVEWRKGSETLRDGDRYCLRQDGAMCELQIRGLAMVDAAEYSCVCGEERTSASLTIRPM
PAHFIGRLRHQESIEGATATLRCELSKAAPVEWRKGRESLRDGDRHSLRQDGAVCELQIC
GLAVADAGEYSCVCGEERTSATLTVKALPAKFTEGLRNEEAVEGATAMLWCELSKVAPVE
WRKGPENLRDGDRYILRQEGTRCELQICGLAMADAGEYLCVCGQERTSATLTIRALPARF
IEDVKNQEAREGATAVLQCELNSAAPVEWRKGSETLRDGDRYSLRQDGTKCELQIRGLAM
ADTGEYSCVCGQERTSAMLTVRALPIKFTEGLRNEEATEGATAVLRCELSKMAPVEWWKG
HETLRDGDRHSLRQDGARCELQIRGLVAEDAGEYLCMCGKERTSAMLTVRAMPSKFIEGL
RNEEATEGDTATLWCELSKAAPVEWRKGHETLRDGDRHSLRQDGSRCELQIRGLAVVDAG
EYSCVCGQERTSATLTVRALPARFIEDVKNQEAREGATAVLQCELSKAAPVEWRKGSETL
RGGDRYSLRQDGTRCELQIHGLSVADTGEYSCVCGQERTSATLTVRAPQPVFREPLQSLQ
AEEGSTATLQCELSEPTATVVWSKGGLQLQANGRREPRLQGCTAELVLQDLQREDTGEYT
CTCGSQATSATLTVTAAPVRFLRELQHQEVDEGGTAHLCCELSRAGASVEWRKGSLQLFP
CAKYQMVQDGAAAELLVRGVEQEDAGDYTCDTGHTQSMASLSVRVPRPKFKTRLQSLEQE
TGDIARLCCQLSDAESGAVVQWLKEGVELHAGPKYEMRSQGATRELLIHQLEAKDTGEYA
CVTGGQKTAASLRVTEPEVTIVRGLVDAEVTADEDVEFSCEVSRAGATGVQWCLQGLPLQ
SNEVTEVAVRDGRIHTLRLKGVTPEDAGTVSFHLGNHASSAQLTVRAPEVTILEPLQDVQ
LSEGQDASFQCRLSRASGQEARWALGGVPLQANEMNDITVEQGTLHLLTLHKVTLEDAGT
VSFHVGTCSSEAQLKVTAKNTVVRGLENVEALEGGEALFECQLSQPEVAAHTWLLDDEPV
HTSENAEVVFFENGLRHLLLLKNLRPQDSCRVTFLAGDMVTSAFLTVRGWRLEILEPLKN
AAVRAGAQACFTCTLSEAVPVGEASWYINGAAVQPDDSDWTVTADGSHHALLLRSAQPHH
AGEVTFACRDAVASARLTVLGLPDPPEDAEVVARSSHTVTLSWAAPMSDGGGGLCGYRVE
VKEGATGQWRLCHELVPGPECVVDGLAPGETYRFRVAAVGPVGAGEPVHLPQTVRLAEPP
KPVPPQPSAPESRQVAAGEDVSLELEVVAEAGEVIWHKGMERIQPGGRFEVVSQGRQQML
VIKGFTAEDQGEYHCGLAQGSICPAAATFQVALSPASVDEAPQPSLPPEAAQEGDLHLLW
EALARKRRMSREPTLDSISELPEEDGRSQRLPQEAEEVAPDLSEGYSTADELARTGDADL
SHTSSDDESRAGTPSLVTYLKKAGRPGTSPLASKVGAPAAPSVKPQQQQEPLAAVRPPLG
DLSTKDLGDPSMDKAAVKIQAAFKGYKVRKEMKQQEGPMFSHTFGDTEAQVGDALRLECV
VASKADVRARWLKDGVELTDGRHHHIDQLGDGTCSLLITGLDRADAGCYTCQVSNKFGQV
THSACVVVSGSESEAESSSGGELDDAFRRAARRLHRLFRTKSPAEVSDEELFLSADEGPA
EPEEPADWQTYREDEHFICIRFEALTEARQAVTRFQEMFATLGIGVEIKLVEQGPRRVEM
CISKETPAPVVPPEPLPSLLTSDAAPVFLTELQNQEVQDGYPVSFDCVVTGQPMPSVRWF
KDGKLLEEDDHYMINEDQQGGHQLIITAVVPADMGVYRCLAENSMGVSSTKAELRVDLTS
TDYDTAADATESSSYFSAQGYLSSREQEGTESTTDEGQLPQVVEELRDLQVAPGTRLAKF
QLKVKGYPAPRLYWFKDGQPLTASAHIRMTDKKILHTLEIISVTREDSGQYAAYISNAMG
AAYSSARLLVRGPDEPEEKPASDVHEQLVPPRMLERFTPKKVKKGSSITFSVKVEGRPVP
TVHWLREEAERGVLWIGPDTPGYTVASSAQQHSLVLLDVGRQHQGTYTCIASNAAGQALC
SASLHVSGLPKVEEQEKVKEALISTFLQGTTQAISAQGLETASFADLGGQRKEEPLAAKE
ALGHLSLAEVGTEEFLQKLTSQITEMVSAKITQAKLQVPGGDSDEDSKTPSASPRHGRSR
PSSSIQESSSESEDGDARGEIFDIYVVTADYLPLGAEQDAITLREGQYVEVLDAAHPLRW
LVRTKPTKSSPSRQGWVSPAYLDRRLKLSPEWGAAEAPEFPGEAVSEDEYKARLSSVIQE
LLSSEQAFVEELQFLQSHHLQHLERCPHVPIAVAGQKAVIFRNVRDIGRFHSSFLQELQQ
CDTDDDVAMCFIKNQAAFEQYLEFLVGRVQAESVVVSTAIQEFYKKYAEEALLAGDPSQP
PPPPLQHYLEQPVERVQRYQALLKELIRNKARNRQNCALLEQAYAVVSALPQRAENKLHV
SLMENYPGTLQALGEPIRQGHFIVWEGAPGARMPWKGHNRHVFLFRNHLVICKPRRDSRT
DTVSYVFRNMMKLSSIDLNDQVEGDDRAFEVWQEREDSVRKYLLQARTAIIKSSWVKEIC
GIQQRLALPVWRPPDFEEELADCTAELGETVKLACRVTGTPKPVISWYKDGKAVQVDPHH
ILIEDPDGSCALILDSLTGVDSGQYMCFAASAAGNCSTLGKILVQVPPRFVNKVRASPFV
EGEDAQFTCTIEGAPYPQIRWYKDGALLTTGNKFQTLSEPRSGLLVLVIRAASKEDLGLY
ECELVNRLGSARASAELRIQSPMLQAQEQCHREQLVAAVEDTTLERADQEVTSVLKRLLG
PKAPGPSTGDLTGPGPCPRGAPALQETGSQPPVTGTSEAPAVPPRVPQPLLHEGPEQEPE
AIARAQEWTVPIRMEGAAWPGAGTGELLWDVHSHVVRETTQRTYTYQAIDTHTARPPSMQ
VTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLSQQQEGTTYSLVLRHVA
SKDAGVYTCLAQNTGGQVLCKAELLVLGGDNEPDSEKQSHRRKLHSFYEVKEEIGRGVFG
FVKRVQHKGNKILCAAKFIPLRSRTRAQAYRERDILAALSHPLVTGLLDQFETRKTLILI
LELCSSEELLDRLYRKGVVTEAEVKVYIQQLVEGLHYLHSHGVLHLDIKPSNILMVHPAR
EDIKICDFGFAQNITPAELQFSQYGSPEFVSPEIIQQNPVSEASDIWAMGVISYLSLTCS
SPFAGESDRATLLNVLEGRVSWSSPMAAHLSEDAKDFIKATLQRAPQARPSAAQCLSHPW
FLKSMPAEEAHFINTKQLKFLLARSRWQRSLMSYKSILVMRSIPELLRGPPDSPSLGVAR
HLCRDTGGSSSSSSSSDNELAPFARAKSLPPSPVTHSPLLHPRGFLRPSASLPEEAEASE
RSTEAPAPPASPEGAGPPAAQGCVPRHSVIRSLFYHQAGESPEHGALAPGSRRHPARRRH
LLKGGYIAGALPGLREPLMEHRVLEEEAAREEQATLLAKAPSFETALRLPASGTHLAPGH
SHSLEHDSPSTPRPSSEACGEAQRLPSAPSGGAPIRDMGHPQGSKQLPSTGGHPGTAQPE
RPSPDSPWGQPAPFCHPKQGSAPQEGCSPHPAVAPCPPGSFPPGSCKEAPLVPSSPFLGQ
PQAPPAPAKASPPLDSKMGPGDISLPGRPKPGPCSSPGSASQASSSQVSSLRVGSSQVGT
EPGPSLDAEGWTQEAEDLSDSTPTLQRPQEQATMRKFSLGGRGGYAGVAGYGTFAFGGDA
GGMLGQGPMWARIAWAVSQSEEEEQEEARAESQSEEQQEARAESPLPQVSARPVPEVGRA
PTRSSPEPTPWEDIGQVSLVQIRDLSGDAEAADTISLDISEVDPAYLNLSDLYDIKYLPF
EFMIFRKVPKSAQPEPPSPMAEEELAEFPEPTWPWPGELGPHAGLEITEESEDVDALLAE
AAVGRKRKWSSPSRSLFHFPGRHLPLDEPAELGLRERVKASVEHISRILKGRPEGLEKEG
PPRKKPGLASFRLSGLKSWDRAPTFLRELSDETVVLGQSVTLACQVSAQPAAQATWSKDG
APLESSSRVLISATLKNFQLLTILVVVAEDLGVYTCSVSNALGTVTTTGVLRKAERPSSS
PCPDIGEVYADGVLLVWKPVESYGPVTYIVQCSLEGGSWTTLASDIFDCCYLTSKLSRGG
TYTFRTACVSKAGMGPYSSPSEQVLLGGPSHLASEEESQGRSAQPLPSTKTFAFQTQIQR
GRFSVVRQCWEKASGRALAAKIIPYHPKDKTAVLREYEALKGLRHPHLAQLHAAYLSPRH
LVLILELCSGPELLPCLAERASYSESEVKDYLWQMLSATQYLHNQHILHLDLRSENMIIT
EYNLLKVVDLGNAQSLSQEKVLPSDKFKDYLETMAPELLEGQGAVPQTDIWAIGVTAFIM
LSAEYPVSSEGARDLQRGLRKGLVRLSRCYAGLSGGAVAFLRSTLCAQPWGRPCASSCLQ
CPWLTEEGPACSRPAPVTFPTARLRVFVRNREKRRALLYKRHNLAQVR
NT seq 23907 nt   +upstreamnt  +downstreamnt
atggatcagccacagttcagcggggcgccccgctttctcacccggcccaaggccttcgtg
gtgtcggtgggcaaggacgccaccctcagctgccagatcgtgggtaatcccacgccacag
gtgagctgggagaaggaccagcagccggtggcggccggcgcgcgcttccgtctggcccag
gacggcgacctctaccgcctcactatcctggacctggcgctgggcgacagtgggcaatac
gtgtgccgcgcgcgcaatgccataggcgaggccttcgctgctgtgggcctgcaggtggac
gcggaggccgcgtgcgccgagcaggcgccgcacttcctgctgcggcccacgtccatccgc
gtgcgcgagggctcagaggccaccttccgctgccgcgtgggtggctccccgaggccggca
gtgagctggtccaaggacgggcggcgcctgggtgagcccgacggcccccgcgtgcgcgtg
gaggagctcggcgaggcaagtgcgctgcgcattcgggcggcgcggccgcgcgacggcggc
acttacgaggtccgcgccgagaacccgctgggcgctgccagcgccgccgcggcgctagtg
gtggactcggacgccgcggacacggccagccggcccgggacctccacggccgcgctcctg
gcgcacctgcagcggcggcgcgaggctatgcgcgccgagggcgcccccgcctcaccgccc
agcaccggcacgcgcacctgcacggtgactgaaggcaagcacgcgcgcctcagctgctac
gtgaccggcgagcccaagcccgagacggtgtggaagaaggacggccagctggtgaccgag
ggccggcgccacgtggtgtacgaggacgcgcaggagaacttcgtgctcaagatcctcttc
tgcaagcagtcggaccgcggcctctacacctgcacggcgtccaacctcgtgggccagacc
tacagctctgtgctggtcgtagtgcgcgagcccgcggttcccttcaaaaagcggctgcaa
gatctggaggtgcgggagaaggagtcggccacgttcctatgtgaggtgccccagccgtcc
actgaggccgcgtggttcaaggaggagacgcggttgtgggcgagcgccaagtacggcatc
gaggaggagggcaccgagcgccgcctgaccgtgcgcaatgtctcggccgacgacgacgcg
gtgtacatctgcgagacgccagagggcagccgcacggtggcggagctcgcagtccaagga
aacctcctccgaaagctccctcggaagacggcggtgcgcgtgggcgacacggctatgttt
tgcgtggagctggcggtcccggtgggccccgtccactggctgcggaaccaggaggaagtg
gtggcggggggccgcgtggccatctccgcggagggcacgcgccacacactaaccatctcc
cagtgctgcctggaggatgtgggccaggtggcctttatggctggcgactgccagacgtcc
acccagttctgcgtgtcggcccccaggaagcctcccctgcaaccccctgtggatcctgtg
gtaaaggccaggatggagagttccgtgattctcagctggtccccaccaccccatggggaa
cgccctgtcactatcgacggctacctggtagagaagaagaagcttggcacctacacctgg
atcaggtgccacgaggctgaatgggtggctacacctgagctgaccgtggctgatgtggcg
gaggaggggaacttccagttccgagtgtccgctctcaacagctttggtcagagtccctac
ctcgagttcccggggactgtccacctggcccccaagctggccgtgaggacaccgctgaag
gcggtgcaggcggtagagggtggcgaggtcactttctccgtggacctcacggtggcctca
gcgggtgagtggttcctggatgggcaggccctgaaggccagcagtgtgtatgagatccac
tgtgatcgcacccggcacacgctcaccatccgggaggtgcccgccagcctgcacggggcg
cagctgaagttcgtggccaacggcattgagagcagcatccggatggaggtccgggcggcc
ccagggctgactgccaacaagccgccagccgcagctgcccgggaggtgctggctcggctg
cacgaggaggcgcagctgctggctgagctgtcagatcaggctgcggctgtgacgtggctg
aaggatggtcgcacactgtccccaggccccaagtatgaggtgcaggcatcggccgggcgg
cgggtgctccttgtgcgagatgtggcccgggacgatgcaggcctctacgagtgcgtcagc
cgcgggggccgcatcgcctaccagctctccgtgcaaggcctcgcgcgctttctgcacaag
gacatggcgggcagctgtgtggatgccgtggctgggggcccggcgcagtttgagtgtgag
acctccgaagcccacgtccacgtgcactggtacaaggatggcatggagctgggccactcc
ggtgagcgcttcttgcaggaggatgtggggacgcggcaccggctggtggcagccacagtc
accaggcaggatgaaggcacctactcctgccgcgtgggcgaggactctgtggacttccgg
ctccgcgtctctgagcccaaggtggtgtttgctaaggagcagctggcacgcaggaagctg
caggcagaggcaggagccagtgccacactgagctgcgaggtggcccaggcccagacggag
gtgacgtggtacaaggatgggaagaagctgagctccagctcgaaagtgtgcatggaggcc
acaggctgcacgcgcaggctggttgtgcagcaggcaggccaggcggatgccggggagtat
agctgcgaggctgggggccagcggctctccttccatctggatgtcaaagagcccaaggtg
gtgtttgccaaggaccaggtggcacacagtgaggtgcaggctgaggcaggggccagtgcc
acgctgagctgcgaggtggcccaggcccagacggaggtgatgtggtacaaagatgggaag
aagctgagctccagcttgaaagtgcatgtagaggccaagggctgcagacggaggctggtg
gtgcagcaggcaggcaagacagatgccggggactacagctgcgaggccaggggccagagg
gtctccttccgcctgcacatcacagagcccaagatgatgtttgcaaaggagcagtcagtg
cataatgaggtgcaggctgaggcgggggccagtgccatgctgagctgtgaggtggcccag
gcccagacggaggtgacgtggtacaaggatgggaagaagctgagctccagctcaaaagtg
ggcatggaggtcaaagggtgcacacggaggctggtgctgccacaggcgggcaaagcagat
gctggggagtacagctgtgaggctgggggccagagagtctccttccacctgcacatcaca
gagcccaagggggtgtttgcgaaggagcagtcagtgcataatgaggtgcaggctgaggcg
gggaccactgccatgctgagctgtgaggtggcccagccccagacagaggtgacgtggtac
aaggacgggaagaagctgagctccagctcaaaagtacgcatggaggtcaagggctgcaca
cgaaggctggtagtgcagcaggtgggcaaagcagatgctggggagtacagctgcgaggct
gggggccagagagtctcctttcaactgcacatcacagagcccaaggcagtgtttgccaag
gagcagttggtgcataatgaggtgcggactgaggcaggggccagtgccacactgagctgt
gaggtggcccaggcccagacagaggtgacgtggtacaaggatgggaagaagctgagctcc
agttcgaaagtgcgcatagaggctgcgggctgcatgcggcagctggtggtgcagcaggca
ggccaggcagatgctggggagtacacctgtgaggctgggggccagcggctctccttccac
ctggatgtttcagagcccaaggcggtgtttgcaaaggagcagctggcacacaggaaggtg
caggccgaggcgggggccattgccacgctgagctgcgaggtggcccaggcccagacagag
gtgacgtggtacaaggacgggaagaagctgagctccagctcgaaagttcgaatggaggct
gtgggctgcacacggaggctggtggtgcagcaggcatgccaggcggacaccggggagtat
agctgcgaggccgggggccagcggctctccttcagcctggacgtggcagagcccaaggtg
gtgtttgccaaggagcagccagtgcacagggaggtgcaggcccaggcgggggccagcacc
acactcagctgcgaggtggctcaggcccagacggaggtgatgtggtacaaggacgggaag
aagctgagcttcagctcgaaagtgcgcatggaggctgtgggctgcacacggaggctggtg
gtgcagcaggcgggccaggcggtcgccggggagtacagctgcgaggcggggagccagcgg
ctctccttccacctgcacgtggcagagcccaaggcggtgtttgccaaggagcagccagcg
agcagggaggtgcaggctgaggcggggaccagtgccacgctgagctgcgaggtggcccag
gcccagacagaggtgacgtggtacaaggacgggaagaaactgagctccagctcgaaagtg
cgaatggaggccgtgggctgcacacggaggctggtggtgcaggaggcaggccaggcggac
gccggggagtacagctgcaaggccggggatcagcggctgtccttccacctgcacgtggca
gagcccaaggtggtgtttgccaaggagcagccagcacacagggaggtgcaggctgaggcg
ggggccagtgccacgctgagctgcgaggtggcccaggcccagacagaggtgacgtggtac
aaggatgggaagaagctgagttccagctcgaaagtgcgcgtggaggccgtgggctgcaca
cggaggctggtggtgcagcaggcgggccaggcagaggccggggagtacagctgcgaggca
gggggtcagcagctctccttccgcctgcaggtggcagagctggagccccaaatttcagag
agaccctgccgcagggagcctctggtggtcaaggagcatgaagacatcatcctgaccgcc
acactggccacaccctctgcggccacggtgacctggctcaaggatggtgtggagattcgc
cgcagcaagcggcatgagacagccagccagggggacacccacaccctgaccgtgcatggc
gcccaggttctggacagcgccatctacagctgccgtgtgggcgcagaggggcaggacttc
ccagtgcaggtggaagaggtggccgccaagttctgccggctgctggagcctgtgtgcggc
gagctgggtggcacggtgacactggcctgcgagctaagcccagcgtgtgcagaggtggtg
tggcgctgcggcaacacgcagcttcgggtgggcaagcgcttccagatggtggccgagggg
cccgtgcgctcactcactgtgttggggctgcgcgcagaggacgcaggggagtacgtgtgt
gagagccgtgatgaccacaccagtgcgcagctcaccgtcagtgtgccccgagtggtgaag
tttatgtctgggctgagcaccgtggtcgcagaggagggcggcgaggccaccttccagtgc
gtggtgtcccccagtgatgtggcagtcgtgtggttccgggacggtgccctgcttcagccc
agcgagaagtttgccatatcacagagtggcgccagccacagcctgaccatctcagacctg
gtgctggaggacgcgggccagatcaccgtggaggctgagggcgcctcatcctctgctgcc
ctgagggtccgagaggcgcctgtgctgttcaaaaagaagctggagccgcagacggtggag
gagcggagctcggtgaccctggaggtggagctgacgcggccgtggccggagctgaggtgg
acacggaacgcgacggccctggcgccgggaaagaacgtggagatccacgccgagggcgcc
cgccaccgcctggttctgcacaacgtaggttttgccgaccgtggcttctttggctgcgag
acgccggatgacaagacacaggccaaactcaccgtggagatgcgccaggtacggctcgta
cggggcctgcaggcagtggaggcacgggagcagggcacggctaccatggaggtgcagctg
tcgcatgcggacgtggatggcagctggactcgtgacggtctgcggttccagcaggggccc
acgtgccacctggctgtgcggggccccatgcacaccctcacactctcggggctgcggcca
gaggatagtggccttatggtcttcaaggccgaaggagtgcacacgtcggcgcggctcgtg
gtcaccgagcttcccgtgagcttcagccgcccgctgcaggacgtggtgaccactgagaag
gagaaggttaccctggagtgcgagctgtcgcgtcctaatgtggatgtgcgctggctgaag
gacggtgtggagctgcgggcaggcaagacgatggccatcgcagcccagggcgcctgcagg
agcctcaccatttaccggtgcgagttcgcggatcagggagtgtatgtgtgtgatgcccat
gatgcccagagctctgcctccgtgaaggtacaaggaaggacatacactctcatctaccgg
agagtcctggcggaagatgcaggagagatccaatttgtagccgaaaatgcagaatcgcga
gcccagctccgagtgaaggagctgccagtgaccctcgtgcgcccgctgcgggacaagatt
gccatggagaagcaccgcggtgtgctggagtgtcaggtgtcccgggccagcgcccaggtg
cggtggttcaagggcagtcaggagctgcagcccgggcccaagtacgagctggtcagtgat
ggcctctaccgcaagctgatcatcagtgatgtccacgcagaggacgaggacacctacacc
tgtgacgccggtgatgtcaagaccagtgcacagttcttcgtggaagagcaatccatcacc
attgtgcggggtctgcaggacgtgacagtgatggagcccgctcctgcctggtttgagtgt
gagacctccatcccctcagtgcggccacctaagtggctcctggggaagacggtgttgcag
gctggggggaacgtgggcctggagcaggagggcacggtgcaccggctgatgctgcggcgg
acctgctccaccatgaccgggcccgtgcacttcaccgttggcaagtcgcgctcctctgcc
cgcctggtggtctcagacatccccgtagtcctcacacggccgttggagcccaagacaggg
cgtgagctgcagtcagtggtcctgtcctgcgacttccggccagcccccaaggctgtgcag
tggtacaaggatgacacgcccctgtctccctctgagaagtttaagatgagcctggagggt
cagatggctgagctgcgcatcctccggctcatgcctgctgatgctggtgtctaccggtgc
caggcgggcagtgcccacagcagcactgaggtcactgtggaagcgcgggaggtgacagtg
acagggccgctacaggatgcagaggccacggaggagggctgggccagcttctcctgtgag
ctgtcccacgaggatgaggaggtcgagtggtcgctcaacgggatgcccctgtacaacgac
agcttccatgagatctcacacaagggccggcgccacacgctggtactgaagagcatccag
cgggctgatgcgggcatagtacgcgcctcctccctgaaggtgtcgacctctgcccgcctg
gaggtccgagtgaagccggtggtgttcctgaaggcgctggatgacctgtccgcagaggag
cgcggcaccctggccctgcagtgtgaagtctctgaccccgaggcccatgtggtgtggcgc
aaagatggcgtgcagctgggccccagtgacaagtatgacttcctgcacacggcgggcacg
cgggggctcgtggtgcatgacgtgagccctgaagacgccggcctgtacacctgccacgtg
ggctccgaggagacccgggcccgggtccgcgtgcacgatctgcacgtgggcatcaccaag
aggctgaagacaatggaggtgctggaaggggaaagctgcagctttgagtgcgtcctgtcc
cacgagagtgccagcgacccggccatgtggacagtcggtgggaagacagtgggcagctcc
agccgcttccaggccacacgtcagggccgaaaatacatcctggtggtccgggaggctgca
ccaagtgatgccggggaggtggtcttctctgtgcggggcctcacctccaaggcctcactc
attgtcagagagaggccggccgccatcatcaagcccctggaagaccagtgggtggcgcca
ggggaggacgtggagctgcgctgtgagctgtcacgggcgggaacgcccgtgcactggctg
aaggacaggaaggccatccgcaagagccagaagtatgatgtggtctgcgagggcacgatg
gccatgctggtcatccgcggggcctcgctcaaggacgcgggcgagtacacgtgtgaggtg
gaggcttccaagagcacagccagcctccatgtggaagaaaaagcaaactgcttcacagag
gagctgaccaatctgcaggtggaggagaaaggcacagctgtgttcacgtgcaagacggag
caccccgcggccacagtgacctggcgcaagggcctcttggagctacgggcctcagggaag
caccagcccagccaggagggcctgaccctgcggctcaccatcagtgccctggagaaggca
gacagcgacacctatacctgcgacattggccaggcccagtcccgggcccagctcctagtg
caaggccggagagtgcacatcatcgaggacctggaggatgtggatgtgcaggagggctcc
tcggccaccttccgttgccggatctccccggccaactacgagcctgtgcactggttcctg
gacaagacacccctgcatgccaacgagctcaatgagatcgatgcccagcccgggggctac
cacgtgctgaccctgcggcagctggcgctcaaggactcgggcaccatctactttgaggcg
ggtgaccagcgggcctcggccgccctgcgggtcactgagaagccaagcgtcttctcccgg
gagctcacagatgccaccatcacagagggtgaggacttgaccctggtgtgcgagaccagc
acctgcgacattcctgtgtgctggaccaaggatgggaagaccctgcgggggtctgcccgg
tgccagctgagccatgagggccaccgggcccagctgctcatcactggggccaccctgcag
gacagtggacgctacaagtgtgaggctgggggcgcctgcagcagctccattgtcagggtg
catgcgcggccagtgcggttccaggaggccctgaaggacctggaggtgctggagggtggt
gctgccacactgcgctgtgtgctgtcatctgtggctgcgcccgtgaagtggtgctatgga
aacaacgtcctgaggccaggtgacaaatacagcctacgccaggagggtgccatgctggag
ctggtggtccggaacctccggccgcaggacagcgggcggtactcatgctccttcggggac
cagactacttctgccaccctcacagtgactgccctgcctgcccagttcatcgggaaactg
agaaacaaggaggccacagaaggggccacggccacgctgcggtgtgagctgagcaaggca
gcccctgtggagtggagaaaggggtccgagaccctcagagatggggacagatactgtctg
aggcaggacggggccatgtgtgagctgcagatccgtggcctggccatggtggatgccgcg
gagtactcgtgtgtgtgtggagaggagaggacctcagcctcactcaccatcaggcccatg
cctgcccacttcataggaagactgagacaccaagagagcatagaaggggccacagccacg
ctgcggtgtgagctgagcaaggcggcccccgtggagtggaggaaggggcgtgagagcctc
agagatggggacagacatagcctgaggcaggacggggctgtgtgcgagctgcagatctgt
ggcctggctgtggcagatgctggggagtactcctgtgtgtgtggggaggagaggacctct
gccactctcaccgtgaaggccctgccagccaagttcacagagggtctgaggaatgaagag
gccgtggaaggggccacagccatgttgtggtgtgaactgagcaaggtggcccctgtggag
tggaggaaggggcccgagaacctcagagatggggacagatacatcctgaggcaggagggg
accaggtgtgagctgcagatctgtggcctggccatggcggacgccggggagtacttgtgt
gtgtgcgggcaggagaggacctcagccacgctcaccatcagggctctgcctgccaggttc
atagaagatgtgaaaaaccaggaggccagagaaggggccacagctgtgctgcagtgtgag
ctgaacagtgcagcccctgtggagtggagaaaggggtctgagacccttagagatggggac
agatacagcctgaggcaggacgggactaaatgtgagctgcagattcgtggcctggccatg
gcagacactggggagtactcgtgcgtgtgcgggcaggagaggacctcggctatgctcacc
gtcagggctctacccatcaagttcacagagggtctgaggaacgaagaggccacagaaggg
gcaacagccgtgctgcggtgtgagctgagcaagatggcccccgtggagtggtggaagggg
catgagaccctcagagatggagacagacacagcctgaggcaggacggggccaggtgtgag
ctgcagatccgcggcctcgtggcagaggacgctggggagtacctgtgcatgtgcgggaag
gagaggacctcagccatgctcaccgtcagggccatgccttccaagttcatagagggtctg
aggaatgaagaggccacagaaggggacacggccacgctgtggtgtgagctgagcaaggcg
gcaccggtggagtggaggaaggggcatgagaccctcagagatggggacagacacagcctg
aggcaggacgggtccaggtgtgagctgcagatccgtggcctggctgtggtggatgccggg
gagtactcgtgtgtgtgcgggcaggagaggacctcagccacactcactgtcagggccctg
cctgccagattcatagaagatgtgaaaaaccaggaggccagagaaggggccacggccgtg
ctgcaatgtgagctgagcaaggcggcccccgtggagtggaggaaggggtctgagaccctc
agaggtggggacagatacagcctgaggcaggatgggaccagatgtgagctgcagattcat
ggcctgtctgtggcagacactggggagtactcgtgtgtgtgcgggcaggagaggacctcg
gccacactcaccgtcagggccccacagccagtgttccgggagccgctgcagagtctgcag
gcggaggagggctccacggccaccctgcagtgtgagctgtctgagcccactgctacagtg
gtctggagcaagggtggcctgcagctgcaggccaatgggcgccgggagccacggcttcag
ggctgcaccgcggagctggtgttacaggacctacaacgtgaagacactggcgaatacact
tgcacctgtggctcccaggccaccagtgccaccctcactgtcacagctgcgcctgtgcgg
ttcctccgagagctgcagcaccaggaggtggatgagggaggcaccgcacacttatgctgc
gagctgagccgggcgggtgcgagcgtggagtggcgcaagggctccctacagctcttccct
tgtgccaagtaccagatggtgcaggatggtgcagctgcagagctgctggtacgcggagtg
gagcaggaggatgcgggtgactacacgtgtgacacgggccacacgcagagcatggccagc
ctctctgtccgtgtccccaggcccaagttcaagacccggcttcagagtctggagcaggag
acaggtgacatagcccggctgtgctgtcagctgagtgatgcagagtcgggggccgtggtg
caatggctcaaggagggcgtggagctgcatgcgggccccaagtacgagatgcggagccag
ggggccacgcgggagctgctgatccaccaactggaggccaaggacacgggcgagtatgcc
tgtgtgacaggcggccagaaaaccgctgcctccctcagggtcacagagcctgaggtgacc
attgtacgggggctggttgatgcggaggtgacggccgatgaggatgttgagttcagctgt
gaggtgtccagggctggagccacaggcgtgcagtggtgcctacagggcctgccactgcaa
agcaatgaggtgacagaggtggctgtgcgggatggccgcatccacaccctgcggctgaag
ggcgtgacgcccgaggacgctggcactgtctccttccatttgggaaaccatgcttcctct
gcccagctcaccgtcagagctcctgaggtgaccatcctggagcccctgcaggacgtgcag
ctcagtgagggccaggatgccagcttccagtgccggctatccagagcttcaggccaggag
gcccgctgggctttaggaggggtgcccctgcaggccaacgagatgaatgacatcactgtg
gagcagggcacactccacctgctcaccctgcacaaggtgacccttgaggatgctggaact
gtcagtttccacgtgggcacgtgtagctctgaggcccagctgaaagtcacagccaagaac
acggtggtgcgggggctggagaatgtggaggcgctggagggcggcgaggcgctgttcgag
tgccagctgtcccagcccgaggtggccgcccacacctggctgctggacgacgaacccgtg
cacacctcggagaacgccgaggtggtcttcttcgagaacggcctgcgccacctgctgctg
ctcaaaaacttgcggccacaagacagctgccgggtgaccttcctggctggggatatggtg
acgtccgcattcctcacggtccgaggctggcgcctggagatcctggagcctctgaaaaac
gcggcggtccgggccggcgcacaggcatgcttcacctgcacgctcagcgaggcggtgccc
gtgggagaggcgtcctggtacatcaatggcgcggcagtgcagccggatgacagcgactgg
actgtcaccgccgacggcagtcaccacgccctactgctgcgcagcgcccagccccaccac
gccggggaggtcaccttcgcttgccgcgacgccgtggcctctgcgcggctcaccgtgctg
ggcctccctgatcccccagaggatgctgaggtggtggctcgcagcagccacactgtgaca
ctgtcttgggcagctcccatgagtgatggaggcggtggtctctgtggctaccgcgtggag
gtgaaggagggggccacaggccagtggcggctgtgccacgagctggtgcctggacccgag
tgtgtggtggatggcctggcccccggggagacctaccgcttccgtgtggcagctgtgggc
cctgtgggtgctggggaaccggttcacctgccccagacagtgcggcttgcagagccaccg
aagcctgtgcctccccagccctcagcccctgagagccggcaggtggcagctggtgaagat
gtctctctggagcttgaggtggtggctgaggctggtgaggtcatctggcacaagggaatg
gagcgcatccagcccggtgggcggttcgaggtggtctcccagggtcggcaacagatgctg
gtgatcaagggcttcacggcagaagaccagggcgagtaccactgtggcctggctcagggc
tccatctgccctgcggctgccaccttccaggtggcactgagcccagcctctgtggatgag
gcccctcagcccagcttgccccccgaggcagcccaggagggtgacctgcacctactgtgg
gaggccctggctcggaaacgtcgcatgagccgtgagcccacgctggactccattagcgag
ctgccagaggaggacggccgctcgcagcgcctgccacaggaggcagaggaggtggcacct
gatctctctgaaggctactccacggccgatgagctggcccgcactggagatgctgacctc
tcacacaccagctctgatgatgagtcccgggcaggcaccccttccctggtcacctacctc
aagaaggctgggaggccaggcacctcaccactggccagcaaggttggggccccagcagcc
ccctctgtgaagccacagcagcagcaggagccactggctgctgtgcgcccaccactggga
gacctgagcaccaaagacctgggtgatccctcaatggacaaggcagctgtgaagatccag
gctgcctttaagggctacaaggtccggaaggagatgaagcagcaggaagggcccatgttc
tcccacacatttggggacaccgaggcacaggtgggggatgccctgcggctggagtgtgtc
gtggccagcaaggcagatgtgcgagcccgctggctgaaggatggtgtggagctgaccgat
gggcggcaccatcacatcgaccagcttggggatggcacctgctctctgctgatcactggc
ctggaccgtgctgatgctggctgctacacctgtcaggtgagcaacaagtttggccaggtg
acccacagtgcctgtgtggtggtcagtgggtcagagagtgaagccgagagctcctctggg
ggtgagctggacgatgccttccgccgggctgcccgtcggctgcaccggctcttccgcacc
aaaagtccggctgaagtttcagatgaggagctcttcctgagtgcagacgagggccctgca
gagccagaggagcccgcggactggcagacataccgcgaagatgagcatttcatctgcatc
cgttttgaggcgctcactgaggcccgccaggcggtaactcgcttccaggagatgtttgcc
acactgggcattggggtggagatcaagctggtggaacaggggcctcggagggtagagatg
tgcatcagcaaagagactcctgcccctgtggtgcctccagagccattgcccagcctactg
acttctgacgctgccccagtgttcctgactgagttgcagaaccaagaagtgcaggatggg
tatcctgtgagctttgactgcgtggtgacaggtcagcccatgcccagtgtgcgctggttc
aaggatgggaagttgttggaggaggatgatcactacatgattaatgaagaccaacagggt
ggccatcagctcatcatcacagccgtggtgccagcagacatgggcgtctaccgctgcctg
gccgagaacagcatgggtgtctcctccaccaaggctgagctccgtgtggacttgacaagc
acagactatgacactgcagcagatgccacggagtcctcatcctacttcagtgcccaaggc
tacctgtccagccgggagcaggagggaacagagtccaccactgatgagggccagctgccc
caggtggtggaggagctgagagacctccaggtggcccctggcacacgcctggccaagttc
cagctcaaggtgaaaggctaccctgctcccagattatactggttcaaagatggccagccc
ctgaccgcatctgcccacatccgcatgactgacaagaagatcctgcacaccctggagatc
atctccgtcacccgggaggactctggccagtatgcagcctatatcagcaatgccatgggt
gctgcctactcgtctgcccggctgctggttcgaggccctgatgagccagaagagaagcct
gcatcagatgtgcatgagcagctggtgccgccccgaatgctggagaggttcacccccaag
aaagtgaagaaaggctccagcatcaccttctctgtgaaggtagaaggacgcccggtgccc
accgtgcactggctcagggaggaggctgagagaggcgtgctgtggattggccctgacaca
ccgggctacaccgtggccagctctgcgcagcagcacagcctggtcctgctggacgtgggc
cggcagcaccagggcacctacacatgcattgccagcaacgctgccggccaggccctctgc
tccgccagcctgcacgtctcgggcctgcctaaggtggaggagcaggagaaagtgaaggaa
gcgctgatttccactttcctgcaggggaccacacaagccatctcagcacaggggttggaa
actgcgagttttgctgaccttggtgggcagaggaaagaagagcctctggctgccaaggag
gccctcggccacctgtccctcgctgaggtgggcacagaggagttcctgcagaaactgacc
tcccagatcactgagatggtatcggccaagatcacgcaggccaagctgcaggtgcccgga
ggtgacagtgatgaggactccaagacaccatctgcatccccccgccatggccgatcacgg
ccatcctccagcatccaggagtcttcctcagagtcagaggacggcgatgcccgaggcgag
atctttgacatctacgtggtcaccgctgactacctgcccctaggggctgagcaggatgcc
atcacgctgcgggaaggccagtatgtggaggtcctggatgcagcccacccactgcgctgg
cttgtccgcaccaagcccaccaagtccagcccctcacggcagggctgggtgtcaccagcc
tacctggacaggaggctcaagctgtcacctgagtggggggccgctgaggcccctgagttc
cctggggaggctgtgtctgaagacgaatacaaggcaaggctgagctctgtgatccaggag
ctgctgagttctgagcaggccttcgtggaggagctgcagttcctgcagagccaccacctg
cagcacctggagcgctgcccccacgtgcccatagctgtggccggccagaaggcagtcatc
ttccgcaatgtgcgggacatcggccgcttccacagcagcttcctgcaggagttgcagcag
tgcgacacggacgacgacgtggccatgtgcttcatcaagaaccaggcggcctttgagcag
tacctggagttcctggtggggcgtgtgcaggctgagtcggtggtcgtcagcacggccatc
caggagttctacaagaaatacgcggaggaggccctgttggcaggggacccctctcagccc
ccgccaccacctctgcagcactacctggagcagccagtggagcgggtgcagcgctaccag
gccttgctgaaggagctgatccgcaacaaggcgcggaacagacagaactgcgcgctgctg
gagcaggcctatgccgtggtgtctgccctgccacagcgcgctgagaacaagctgcacgtg
tccctcatggagaactacccaggcaccctgcaggccctgggcgagcccatccgccagggc
cacttcatcgtgtgggagggtgcaccgggggcccgcatgccctggaagggccacaaccgt
cacgtgttcctcttccgcaaccacctggtaatctgcaagccccggcgagactcccgcacc
gataccgtcagctacgtgttccggaacatgatgaagctgagcagcatcgacctgaacgac
caggtggagggggatgaccgcgccttcgaggtgtggcaggagcgggaggactcggtgcgc
aagtacctgctgcaggcacggacagccattatcaagagctcgtgggtgaaggagatctgt
ggcatccagcagcgtctggccctgcctgtgtggcggcccccggactttgaagaggagctg
gccgactgcacagccgagctgggtgagacagtcaagctggcctgccgcgtgacgggcaca
cccaagcctgtcatcagctggtacaaagatgggaaagcagtgcaggtggacccccaccac
atcctcattgaagaccctgatggctcgtgtgcactcatcctggacagcctgaccggtgtg
gactctggccagtacatgtgcttcgcggccagcgccgctggcaactgcagtaccctgggc
aagatcctggtgcaagtcccaccacggttcgtgaacaaggtccgggcctcaccctttgtg
gagggagaggacgcccagttcacctgcaccatcgaaggcgccccgtacccgcagatcagg
tggtacaaggacggggccctgctgaccactggcaacaagttccagacactgagtgagcct
cgcagcggcctgctagtgctggtgatccgggcggccagcaaggaggacctggggctctac
gagtgtgagctggtgaaccggctgggctccgcgcgggctagtgcggagctgcgcattcag
agccccatgctgcaggcccaggagcagtgtcacagggagcagctcgtggctgcagtggaa
gacaccaccctggagcgagcggaccaggaggtcacatctgtcctgaagagactgctgggc
cccaaggcgccaggcccctccacaggggacctcactggccctggcccctgccccaggggg
gcacccgcactccaggaaaccggctcccagcccccagtcaccggaacttcggaggcacct
gccgtgcccccgagggtgccacagcccctcctccacgaaggcccagagcaggagccggag
gccattgccagagcccaggaatggactgtgcccattcggatggagggtgcagcctggccc
ggggcaggcacaggggagctgctctgggacgtccacagccacgtggtcagagagaccaca
cagaggacctacacataccaggccatcgacacgcacaccgcacggcccccatccatgcag
gtaaccatcgaggatgtgcaggcacagacaggcggaacggcccaattcgaggctatcatt
gagggcgacccacagccctcggtgacctggtacaaggacagcgtccagctggtggacagc
acccggcttagccagcagcaagaaggcaccacatactccctggtgctgaggcatgtggcc
tcgaaggatgccggcgtttacacctgcctggcccaaaacactggtggccaggtgctctgc
aaggcagagctgctggtgcttgggggggacaatgagccggactcagagaagcaaagccac
cggaggaagctgcactccttctatgaggtcaaggaggagattggaaggggcgtgtttggc
ttcgtaaaaagagtgcagcacaaaggaaacaagatcttgtgcgctgccaagttcatcccc
ctacggagcagaactcgggcccaggcatacagggagcgagacatcctggccgcgctgagc
cacccgctggtcacggggctgctggaccagtttgagacccgcaagaccctcatcctcatc
ctggagctgtgctcatccgaggagctgctggaccgcctgtacaggaagggcgtggtgacg
gaggccgaggtcaaggtctacatccagcagctggtggaggggctgcactacctgcacagc
catggcgttctccacctggacataaagccctctaacatcctgatggtgcatcctgcccgg
gaagacattaaaatctgcgactttggctttgcccagaacatcaccccagcagagctgcag
ttcagccagtacggctcccctgagttcgtctcccccgagatcatccagcagaaccctgtg
agcgaagcctccgacatttgggccatgggtgtcatctcctacctcagcctgacctgctca
tccccatttgccggcgagagtgaccgtgccaccctcctgaacgtcctggaggggcgcgtg
tcatggagcagccccatggctgcccacctcagcgaagacgccaaagacttcatcaaggct
acgctgcagagagcccctcaggcccggcctagtgcggcccagtgcctctcccacccctgg
ttcctgaaatccatgcctgcggaggaggcccacttcatcaacaccaagcagctcaagttc
ctcctggcccgaagtcgctggcagcgttccctgatgagctacaagtccatcctggtgatg
cgctccatccctgagctgctgcggggcccacccgacagcccctccctcggcgtagcccgg
cacctctgcagggacactggtggctcctccagttcctcctcctcctctgacaacgagctc
gccccatttgcccgggctaagtcactgccaccctccccggtgacacactcaccactgctg
cacccccggggcttcctgcggccctcggccagcctgcctgaggaagccgaggccagtgag
cgctccaccgaggccccagctccgcctgcatctcccgagggtgccgggccaccggccgcc
cagggctgcgtgccccggcacagcgtcatccgcagcctgttctaccaccaggcgggtgag
agccctgagcacggggccctggccccggggagcaggcggcacccggcccggcggcggcac
ctgctgaagggcgggtacattgcgggggcgctgccaggcctgcgcgagccactgatggag
caccgcgtgctggaggaggaggccgccagggaggagcaggccaccctcctggccaaagcc
ccctcattcgagactgccctccggctgcctgcctctggcacccacttggcccctggccac
agccactccctggaacatgactctccgagcaccccccgcccctcctcggaggcctgcggt
gaggcacagcgactgccttcagccccctccgggggggcccctatcagggacatggggcac
cctcagggctccaagcagcttccatccactggtggccacccaggcactgctcagccagag
aggccatccccggacagcccttgggggcagccagcccctttctgccaccccaagcagggt
tctgccccccaggagggctgcagcccccacccagcagttgccccatgccctcctggctcc
ttccctccaggatcttgcaaagaggcccccttagtaccctcaagccccttcttgggacag
ccccaggcaccccctgcccctgccaaagcaagccccccattggactctaagatggggcct
ggagacatctctcttcctgggaggccaaaacccggcccctgcagttccccagggtcagcc
tcccaggcgagctcttcccaagtgagctccctcagggtgggctcctcccaggtgggcaca
gagcctggcccctccctggatgcggagggctggacccaggaggctgaggatctgtccgac
tccacacccaccttgcagcggcctcaggaacaggcgaccatgcgcaagttctccctgggt
ggtcgcgggggctacgcaggcgtggctggctatggcacctttgcctttggtggagatgca
gggggcatgctggggcaggggcccatgtgggccaggatagcctgggctgtgtcccagtca
gaggaggaggagcaggaggaggccagggctgagtcccagtcggaggagcagcaggaggcc
agggctgagagcccactgccccaggtcagtgcaaggcctgtgcctgaggtcggcagggct
cccaccaggagctctccagagcccaccccatgggaggacatcgggcaggtctccctggtg
cagatccgggacctgtcaggtgatgcggaggcggccgacacaatatccctggacatttcc
gaggtggaccccgcctacctcaacctctcagacctgtacgatatcaagtacctcccattc
gagtttatgatcttcaggaaagtccccaagtccgctcagccagagccgccctcccccatg
gctgaggaggagctggccgagttcccggagcccacgtggccctggccaggtgaactgggc
ccccacgcaggcctggagatcacagaggagtcagaggatgtggacgcgctgctggcagag
gctgccgtgggcaggaagcgcaagtggtcctcgccgtcacgcagcctcttccacttccct
gggaggcacctgccgctggatgagcctgcagagctggggctgcgtgagagagtgaaggcc
tccgtggagcacatctcccggatcctgaagggcaggccggaaggtctggagaaggagggg
ccccccaggaagaagccaggccttgcttccttccggctctcaggtctgaagagctgggac
cgagcgccgacattcctaagggagctctcagatgagactgtggtcctgggccagtcagtg
acactggcctgccaggtgtcagcccagccagctgcccaggccacctggagcaaagacgga
gcccccctggagagcagcagccgtgtcctcatctctgccaccctcaagaacttccagctt
ctgaccatcctggtggtggtggctgaggacctgggtgtgtacacctgcagcgtgagcaat
gcgctggggacagtgaccaccacgggcgtcctccggaaggcagagcgcccctcatcttcg
ccatgcccggatatcggggaggtgtacgcggatggggtgctgctggtctggaagcccgtg
gaatcctacggccctgtgacctacattgtgcagtgcagcctagaaggcggcagctggacc
acactggcctccgacatctttgactgctgctacctgaccagcaagctctcccggggtggc
acctacaccttccgcacggcatgtgtcagcaaggcaggaatgggtccctacagcagcccc
tcggagcaagtcctcctgggagggcccagccacctggcctctgaggaggagagccagggg
cggtcagcccaacccctgcccagcacaaagaccttcgcattccagacacagatccagagg
ggccgcttcagcgtggtgcggcaatgctgggagaaggccagcgggcgggcgctggccgcc
aagatcatcccctaccaccccaaggacaagacagcagtgctgcgcgaatacgaggccctc
aagggcctgcgccacccgcacctggcccagctgcacgcagcctacctcagcccccggcac
ctggtgctcatcttggagctgtgctctgggcccgagctgctcccctgcctggccgagagg
gcctcctactcagaatccgaggtgaaggactacctgtggcagatgttgagtgccacccag
tacctgcacaaccagcacatcctgcacctggacctgaggtccgagaacatgatcatcacc
gaatacaacctgctcaaggtcgtggacctgggcaatgcacagagcctcagccaggagaag
gtgctgccctcagacaagttcaaggactacctagagaccatggctccagagctcctggag
ggccagggggctgttccacagacagacatctgggccatcggtgtgacagccttcatcatg
ctgagcgccgagtacccggtgagcagcgagggtgcacgcgacctgcagagaggactgcgc
aaggggctggtccggctgagccgctgctacgcggggctgtccgggggcgccgtggccttc
ctgcgcagcactctgtgcgcccagccctggggccggccctgcgcgtccagctgcctgcag
tgcccgtggctaacagaggagggcccggcctgttcgcggcccgcgcccgtgaccttccct
accgcgcggctgcgcgtcttcgtgcgcaatcgcgagaagagacgcgcgctgctgtacaag
aggcacaacctggcccaggtgcgctga

KEGG   Homo sapiens (human): 2023
Entry
2023              CDS       T01001                                 
Symbol
ENO1, ENO1-IT1, ENO1L1, HEL-S-17, MPB1, NNE, PPH
Name
(RefSeq) enolase 1
  KO
K01689  enolase 1/2/3 [EC:4.2.1.11]
Organism
hsa  Homo sapiens (human)
Pathway
hsa00010  Glycolysis / Gluconeogenesis
hsa01100  Metabolic pathways
hsa01200  Carbon metabolism
hsa01230  Biosynthesis of amino acids
hsa03018  RNA degradation
hsa04066  HIF-1 signaling pathway
hsa04820  Cytoskeleton in muscle cells
Module
hsa_M00001  Glycolysis (Embden-Meyerhof pathway), glucose => pyruvate
hsa_M00002  Glycolysis, core module involving three-carbon compounds
hsa_M00003  Gluconeogenesis, oxaloacetate => fructose-6P
Network
nt06017  Glycogen metabolism
nt06539  Cytoskeleton in muscle cells
  Element
N00731  Glycolysis
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09100 Metabolism
  09101 Carbohydrate metabolism
   00010 Glycolysis / Gluconeogenesis
    2023 (ENO1)
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   03018 RNA degradation
    2023 (ENO1)
 09130 Environmental Information Processing
  09132 Signal transduction
   04066 HIF-1 signaling pathway
    2023 (ENO1)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    2023 (ENO1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:hsa03019]
    2023 (ENO1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    2023 (ENO1)
Enzymes [BR:hsa01000]
 4. Lyases
  4.2  Carbon-oxygen lyases
   4.2.1  Hydro-lyases
    4.2.1.11  phosphopyruvate hydratase
     2023 (ENO1)
Messenger RNA biogenesis [BR:hsa03019]
 Prokaryotic type
  Bacterial mRNA degradation factors
   RNA degradosome components
    Other RNA degradosome components
     2023 (ENO1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Proteins found in most exosomes
   2023 (ENO1)
SSDB
Motif
Pfam: Enolase_C Enolase_N MR_MLE_C MAAL_C
Other DBs
NCBI-GeneID: 2023
NCBI-ProteinID: NP_001419
OMIM: 172430
HGNC: 3350
Ensembl: ENSG00000074800
UniProt: P06733 A0A024R4F1
Structure
LinkDB
Position
1:complement(8861000..8878686)
AA seq 434 aa
MSILKIHAREIFDSRGNPTVEVDLFTSKGLFRAAVPSGASTGIYEALELRDNDKTRYMGK
GVSKAVEHINKTIAPALVSKKLNVTEQEKIDKLMIEMDGTENKSKFGANAILGVSLAVCK
AGAVEKGVPLYRHIADLAGNSEVILPVPAFNVINGGSHAGNKLAMQEFMILPVGAANFRE
AMRIGAEVYHNLKNVIKEKYGKDATNVGDEGGFAPNILENKEGLELLKTAIGKAGYTDKV
VIGMDVAASEFFRSGKYDLDFKSPDDPSRYISPDQLADLYKSFIKDYPVVSIEDPFDQDD
WGAWQKFTASAGIQVVGDDLTVTNPKRIAKAVNEKSCNCLLLKVNQIGSVTESLQACKLA
QANGWGVMVSHRSGETEDTFIADLVVGLCTGQIKTGAPCRSERLAKYNQLLRIEEELGSK
AKFAGRNFRNPLAK
NT seq 1305 nt   +upstreamnt  +downstreamnt
atgtctattctcaagatccatgccagggagatctttgactctcgcgggaatcccactgtt
gaggttgatctcttcacctcaaaaggtctcttcagagctgctgtgcccagtggtgcttca
actggtatctatgaggccctagagctccgggacaatgataagactcgctatatggggaag
ggtgtctcaaaggctgttgagcacatcaataaaactattgcgcctgccctggttagcaag
aaactgaacgtcacagaacaagagaagattgacaaactgatgatcgagatggatggaaca
gaaaataaatctaagtttggtgcgaacgccattctgggggtgtcccttgccgtctgcaaa
gctggtgccgttgagaagggggtccccctgtaccgccacatcgctgacttggctggcaac
tctgaagtcatcctgccagtcccggcgttcaatgtcatcaatggcggttctcatgctggc
aacaagctggccatgcaggagttcatgatcctcccagtcggtgcagcaaacttcagggaa
gccatgcgcattggagcagaggtttaccacaacctgaagaatgtcatcaaggagaaatat
gggaaagatgccaccaatgtgggggatgaaggcgggtttgctcccaacatcctggagaat
aaagaaggcctggagctgctgaagactgctattgggaaagctggctacactgataaggtg
gtcatcggcatggacgtagcggcctccgagttcttcaggtctgggaagtatgacctggac
ttcaagtctcccgatgaccccagcaggtacatctcgcctgaccagctggctgacctgtac
aagtccttcatcaaggactacccagtggtgtctatcgaagatccctttgaccaggatgac
tggggagcttggcagaagttcacagccagtgcaggaatccaggtagtgggggatgatctc
acagtgaccaacccaaagaggatcgccaaggccgtgaacgagaagtcctgcaactgcctc
ctgctcaaagtcaaccagattggctccgtgaccgagtctcttcaggcgtgcaagctggcc
caggccaatggttggggcgtcatggtgtctcatcgttcgggggagactgaagataccttc
atcgctgacctggttgtggggctgtgcactgggcagatcaagactggtgccccttgccga
tctgagcgcttggccaagtacaaccagctcctcagaattgaagaggagctgggcagcaag
gctaagtttgccggcaggaacttcagaaaccccttggccaagtaa

KEGG   Homo sapiens (human): 2027
Entry
2027              CDS       T01001                                 
Symbol
ENO3, GSD13, MSE
Name
(RefSeq) enolase 3
  KO
K01689  enolase 1/2/3 [EC:4.2.1.11]
Organism
hsa  Homo sapiens (human)
Pathway
hsa00010  Glycolysis / Gluconeogenesis
hsa01100  Metabolic pathways
hsa01200  Carbon metabolism
hsa01230  Biosynthesis of amino acids
hsa03018  RNA degradation
hsa04066  HIF-1 signaling pathway
hsa04820  Cytoskeleton in muscle cells
Module
hsa_M00001  Glycolysis (Embden-Meyerhof pathway), glucose => pyruvate
hsa_M00002  Glycolysis, core module involving three-carbon compounds
hsa_M00003  Gluconeogenesis, oxaloacetate => fructose-6P
Network
nt06017  Glycogen metabolism
nt06539  Cytoskeleton in muscle cells
  Element
N00731  Glycolysis
N01821  Sarcomere, M-band
Disease
H00069  Glycogen storage disease
H01762  Muscle glycogen storage disease
H01953  Glycogen storage disease type XIII
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09100 Metabolism
  09101 Carbohydrate metabolism
   00010 Glycolysis / Gluconeogenesis
    2027 (ENO3)
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   03018 RNA degradation
    2027 (ENO3)
 09130 Environmental Information Processing
  09132 Signal transduction
   04066 HIF-1 signaling pathway
    2027 (ENO3)
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    2027 (ENO3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03019 Messenger RNA biogenesis [BR:hsa03019]
    2027 (ENO3)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    2027 (ENO3)
Enzymes [BR:hsa01000]
 4. Lyases
  4.2  Carbon-oxygen lyases
   4.2.1  Hydro-lyases
    4.2.1.11  phosphopyruvate hydratase
     2027 (ENO3)
Messenger RNA biogenesis [BR:hsa03019]
 Prokaryotic type
  Bacterial mRNA degradation factors
   RNA degradosome components
    Other RNA degradosome components
     2027 (ENO3)
Exosome [BR:hsa04147]
 Exosomal proteins
  Proteins found in most exosomes
   2027 (ENO3)
SSDB
Motif
Pfam: Enolase_C Enolase_N MR_MLE_C MAAL_C
Other DBs
NCBI-GeneID: 2027
NCBI-ProteinID: NP_001967
OMIM: 131370
HGNC: 3354
Ensembl: ENSG00000108515
UniProt: P13929
Structure
LinkDB
Position
17:4948710..4957129
AA seq 434 aa
MAMQKIFAREILDSRGNPTVEVDLHTAKGRFRAAVPSGASTGIYEALELRDGDKGRYLGK
GVLKAVENINNTLGPALLQKKLSVVDQEKVDKFMIELDGTENKSKFGANAILGVSLAVCK
AGAAEKGVPLYRHIADLAGNPDLILPVPAFNVINGGSHAGNKLAMQEFMILPVGASSFKE
AMRIGAEVYHHLKGVIKAKYGKDATNVGDEGGFAPNILENNEALELLKTAIQAAGYPDKV
VIGMDVAASEFYRNGKYDLDFKSPDDPARHITGEKLGELYKSFIKNYPVVSIEDPFDQDD
WATWTSFLSGVNIQIVGDDLTVTNPKRIAQAVEKKACNCLLLKVNQIGSVTESIQACKLA
QSNGWGVMVSHRSGETEDTFIADLVVGLCTGQIKTGAPCRSERLAKYNQLMRIEEALGDK
AIFAGRKFRNPKAK
NT seq 1305 nt   +upstreamnt  +downstreamnt
atggccatgcagaaaatctttgcccgggaaatcttggactccaggggcaaccccacggtg
gaggtggacctgcacacggccaagggccgattccgagcagctgtgcccagtggggcttcc
acgggtatctatgaggctctggaactaagagacggagacaaaggccgctacctggggaaa
ggagtcctgaaggctgtggagaacatcaacaatactctgggccctgctctgctgcaaaag
aaactaagcgttgtggatcaagaaaaagttgacaaatttatgattgagctagatgggacc
gagaataagtccaagtttggggccaatgccatcctgggcgtgtccttggccgtgtgtaag
gcgggagcagctgagaagggggtccccctgtaccgccacatcgcagatctcgctgggaac
cctgacctcatactcccagtgccagccttcaatgtgatcaacgggggctcccatgctgga
aacaagctggccatgcaggagttcatgattctgcctgtgggagccagctccttcaaggaa
gccatgcgcattggcgccgaggtctaccaccacctcaagggggtcatcaaggccaagtat
gggaaggatgccaccaatgtgggtgatgaaggtggcttcgcacccaacatcctggagaac
aatgaggccctggagctgctgaagacggccatccaggcggctggttacccagacaaggtg
gtgatcggcatggatgtggcagcatctgagttctatcgcaatgggaagtacgatcttgac
ttcaagtcgcctgatgatcccgcacggcacatcactggggagaagctcggagagctgtat
aagagctttatcaagaactatcctgtggtctccatcgaagacccctttgaccaggatgac
tgggccacttggacctccttcctctcgggggtgaacatccagattgtgggggatgacttg
acagtcaccaaccccaagaggattgcccaggccgttgagaagaaggcctgcaactgtctg
ctgctgaaggtcaaccagatcggctcggtgaccgaatcgatccaggcgtgcaaactggct
cagtctaatggctggggggtgatggtgagccaccgctctggggagactgaggacacattc
attgctgaccttgtggtggggctctgcacaggacagatcaagactggcgccccctgccgc
tcggagcgtctggccaaatacaaccaactcatgaggatcgaggaggctcttggggacaag
gcaatctttgctggacgcaagttccgtaacccgaaggccaagtga

KEGG   Homo sapiens (human): 84676
Entry
84676             CDS       T01001                                 
Symbol
TRIM63, IRF, MURF1, MURF2, RNF28, SMRZ
Name
(RefSeq) tripartite motif containing 63
  KO
K10655  tripartite motif-containing protein 63 [EC:2.3.2.27]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    84676 (TRIM63)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04121 Ubiquitin system [BR:hsa04121]
    84676 (TRIM63)
Enzymes [BR:hsa01000]
 2. Transferases
  2.3  Acyltransferases
   2.3.2  Aminoacyltransferases
    2.3.2.27  RING-type E3 ubiquitin transferase
     84676 (TRIM63)
Ubiquitin system [BR:hsa04121]
 Ubiquitin ligases (E3)
  Single Ring-finger type E3
   TRIM/RBCC proteins
    84676 (TRIM63)
SSDB
Motif
Pfam: zf-RING_UBOX zf-C3HC4 zf-C3HC4_2 zf-B_box zf-RING_5 zf-RING_2 zf-C3HC4_3 zf-C3HC4_4 zf-RING_6 Sina_RING Fzo_mitofusin DUF1664 ATP-synt_E_2 ANCHR-like_BBOX Phasin Spt4 Allexi_40kDa
Other DBs
NCBI-GeneID: 84676
NCBI-ProteinID: NP_115977
OMIM: 606131
HGNC: 16007
Ensembl: ENSG00000158022
UniProt: Q969Q1
Structure
LinkDB
Position
1:complement(26051301..26067630)
AA seq 353 aa
MDYKSSLIQDGNPMENLEKQLICPICLEMFTKPVVILPCQHNLCRKCANDIFQAANPYWT
SRGSSVSMSGGRFRCPTCRHEVIMDRHGVYGLQRNLLVENIIDIYKQECSSRPLQKGSHP
MCKEHEDEKINIYCLTCEVPTCSMCKVFGIHKACEVAPLQSVFQGQKTELNNCISMLVAG
NDRVQTIITQLEDSRRVTKENSHQVKEELSQKFDTLYAILDEKKSELLQRITQEQEKKLS
FIEALIQQYQEQLDKSTKLVETAIQSLDEPGGATFLLTAKQLIKSIVEASKGCQLGKTEQ
GFENMDFFTLDLEHIADALRAIDFGTDEEEEEFIEEEDQEEEESTEGKEEGHQ
NT seq 1062 nt   +upstreamnt  +downstreamnt
atggattataagtcgagcctgatccaggatgggaatcccatggagaacttggagaagcag
ctgatctgccctatctgcctggagatgtttaccaagccagtggtcatcttgccgtgccag
cacaacctgtgccggaagtgtgccaatgacatcttccaggctgcaaatccctactggacc
agccggggcagctcagtgtccatgtctggaggccgtttccgctgccccacctgccgccac
gaggtgatcatggatcgtcacggagtgtacggcctgcagaggaacctgctggtggagaac
atcatcgacatctacaaacaggagtgctccagtcggccgctgcagaagggcagtcacccc
atgtgcaaggagcacgaagatgagaaaatcaacatctactgtctcacgtgtgaggtgccc
acctgctccatgtgcaaggtgtttgggatccacaaggcctgcgaggtggccccattgcag
agtgtcttccagggacaaaagactgaactgaataactgtatctccatgctggtggcgggg
aatgaccgtgtgcagaccatcatcactcagctggaggattcccgtcgagtgaccaaggag
aacagtcaccaggtaaaggaagagctgagccagaagtttgacacgttgtatgccatcctg
gatgagaagaaaagtgagttgctgcagcggatcacgcaggagcaggagaaaaagcttagc
ttcatcgaggccctcatccagcagtaccaggagcagctggacaagtccacaaagctggtg
gaaactgccatccagtccctggacgagcctgggggagccaccttcctcttgactgccaag
caactcatcaaaagcattgtggaagcttccaagggctgccagctggggaagacagagcag
ggctttgagaacatggacttctttactttggatttagagcacatagcagacgccctgaga
gccattgactttgggacagatgaggaagaggaagaattcattgaagaagaagatcaggaa
gaggaagagtccacagaagggaaggaagaaggacaccagtaa

KEGG   Homo sapiens (human): 84675
Entry
84675             CDS       T01001                                 
Symbol
TRIM55, MURF-2, RNF29, muRF2
Name
(RefSeq) tripartite motif containing 55
  KO
K10654  tripartite motif-containing protein 55
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    84675 (TRIM55)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04121 Ubiquitin system [BR:hsa04121]
    84675 (TRIM55)
Ubiquitin system [BR:hsa04121]
 Ubiquitin ligases (E3)
  Single Ring-finger type E3
   TRIM/RBCC proteins
    84675 (TRIM55)
SSDB
Motif
Pfam: zf-RING_UBOX zf-C3HC4 zf-C3HC4_2 zf-B_box zf-RING_2 zf-RING_5 zf-C3HC4_4 zf-C3HC4_3 zf-RING_6 Sina_RING Cbl_N
Other DBs
NCBI-GeneID: 84675
NCBI-ProteinID: NP_908973
OMIM: 606469
HGNC: 14215
Ensembl: ENSG00000147573
UniProt: Q9BYV6
LinkDB
Position
8:66113351..66175485
AA seq 548 aa
MSASLNYKSFSKEQQTMDNLEKQLICPICLEMFTKPVVILPCQHNLCRKCASDIFQASNP
YLPTRGGTTMASGGRFRCPSCRHEVVLDRHGVYGLQRNLLVENIIDIYKQESTRPEKKSD
QPMCEEHEEERINIYCLNCEVPTCSLCKVFGAHKDCQVAPLTHVFQRQKSELSDGIAILV
GSNDRVQGVISQLEDTCKTIEECCRKQKQELCEKFDYLYGILEERKNEMTQVITRTQEEK
LEHVRALIKKYSDHLENVSKLVESGIQFMDEPEMAVFLQNAKTLLKKISEASKAFQMEKI
EHGYENMNHFTVNLNREEKIIREIDFYREDEDEEEEEGGEGEKEGEGEVGGEAVEVEEVE
NVQTEFPGEDENPEKASELSQVELQAAPGALPVSSPEPPPALPPAADAPVTQGEVVPTGS
EQTTESETPVPAAAETADPLFYPSWYKGQTRKATTNPPCTPGSEGLGQIGPPGSEDSNVR
KAEVAAAAASERAAVSGKETSAPAATSQIGFEAPPLQGQAAAPASGSGADSEPARHIFSF
SWLNSLNE
NT seq 1647 nt   +upstreamnt  +downstreamnt
atgagcgcatctctgaattacaaatctttttccaaagagcagcagaccatggataactta
gagaagcaactcatctgtcccatctgcttagagatgttcacgaaacctgtggtgattctc
ccttgtcagcacaacctgtgtaggaaatgtgccagtgatattttccaggcctctaacccg
tatttgcccacaagaggaggtaccaccatggcatcagggggccgattccgctgcccatcc
tgtagacatgaagtggttttggatagacatggggtatatggacttcagaggaacctgctg
gtggaaaatatcattgacatctacaagcaggagtccaccaggccagaaaagaaatccgac
cagcccatgtgcgaggaacatgaagaggagcgcatcaacatctactgtctgaactgcgaa
gtacccacctgctctctgtgcaaggtgtttggtgcacacaaagactgccaggtggctccc
ctcactcatgtgttccagagacagaagtctgagctcagtgatggcatcgccatcctcgtg
ggcagcaacgatcgagtccagggagtgatcagccagctggaagacacctgcaaaactatc
gaggaatgttgcagaaaacagaaacaagagctttgtgagaagtttgattacctgtatggc
attttggaggagaggaagaatgaaatgacccaagtcattacccgaacccaagaggagaaa
ctggaacatgtccgtgctctgatcaaaaagtattctgatcatttggagaacgtctcaaag
ttggttgagtcaggaattcagtttatggatgagccagaaatggcagtgtttctgcagaat
gccaaaaccctgctaaaaaaaatctcggaagcatcaaaggcatttcagatggagaaaata
gaacatggctatgagaacatgaaccacttcacagtcaacctcaatagagaagaaaagata
atacgtgaaattgacttttacagagaagatgaagatgaagaagaagaagaaggcggagaa
ggagaaaaagaaggagaaggagaagtgggaggagaagcagtagaagtggaagaggtagaa
aatgttcaaacagagtttccaggagaagatgaaaacccagaaaaagcttcagagctctct
caggtggagctgcaggctgcccctggggcacttccagtttcctctccagagccacctcca
gccctgccacctgctgcggatgcccctgtgacacagggggaggttgtacccactggctct
gagcagaccacagagtctgaaactccagtccctgcagcagcagaaactgcggatcccttg
ttttaccctagttggtataaaggccaaacccggaaagccaccaccaacccaccttgcacc
ccagggagcgaaggtctggggcaaatagggcctccaggttctgaggattcgaatgtacgg
aaggcagaagtggcagcagccgcagcgagtgagagggcagctgtgagtggtaaggaaact
agtgcacctgcagctacttctcagattggatttgaggctcctcccctccagggacaggct
gcagctccagcgagtggcagtggagctgattctgagccagctcgccatatcttctccttt
tcctggttgaactccctaaatgaatga

KEGG   Homo sapiens (human): 57159
Entry
57159             CDS       T01001                                 
Symbol
TRIM54, MURF, MURF-3, RNF30, muRF3
Name
(RefSeq) tripartite motif containing 54
  KO
K10653  tripartite motif-containing protein 54
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    57159 (TRIM54)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04121 Ubiquitin system [BR:hsa04121]
    57159 (TRIM54)
Ubiquitin system [BR:hsa04121]
 Ubiquitin ligases (E3)
  Single Ring-finger type E3
   TRIM/RBCC proteins
    57159 (TRIM54)
SSDB
Motif
Pfam: zf-RING_UBOX zf-C3HC4_2 zf-C3HC4 zf-B_box zf-RING_5 zf-RING_2 zf-C3HC4_4 zf-C3HC4_3 zf-RING_6 PHP Sina_RING UvrD_C DUF349 COG2_C Fmp27_WPPW MIF4G_like_2 FlgN DUF1421 DUF5799 FUSC zf-RING_11
Other DBs
NCBI-GeneID: 57159
NCBI-ProteinID: NP_912730
OMIM: 606474
HGNC: 16008
Ensembl: ENSG00000138100
UniProt: Q9BYV2
Structure
LinkDB
Position
2:27282429..27307435
AA seq 358 aa
MNFTVGFKPLLGDAHSMDNLEKQLICPICLEMFSKPVVILPCQHNLCRKCANDVFQASNP
LWQSRGSTTVSSGGRFRCPSCRHEVVLDRHGVYGLQRNLLVENIIDIYKQESSRPLHSKA
EQHLMCEEHEEEKINIYCLSCEVPTCSLCKVFGAHKDCEVAPLPTIYKRQKSELSDGIAM
LVAGNDRVQAVITQMEEVCQTIEDNSRRQKQLLNQRFESLCAVLEERKGELLQALAREQE
EKLQRVRGLIRQYGDHLEASSKLVESAIQSMEEPQMALYLQQAKELINKVGAMSKVELAG
RPEPGYESMEQFTVRVEHVAEMLRTIDFQPGASGEEEEVAPDGEEGSAGPEEERPDGP
NT seq 1077 nt   +upstreamnt  +downstreamnt
atgaacttcacagtgggtttcaagccgctgctaggggatgcacacagcatggacaacctg
gagaagcagctcatctgccccatctgcctggagatgttctccaaaccagtggtgatcctg
ccctgccaacacaacctgtgccgcaaatgtgccaacgacgtcttccaggcctcgaatcct
ctatggcagtcccggggctccaccactgtgtcttcaggaggccgtttccgctgcccatcg
tgcaggcatgaggttgtcctggacagacacggtgtctacggcctgcagcgaaacctgcta
gtggagaacattatcgacatttacaagcaggagtcatccaggccgctgcactccaaggct
gagcagcacctcatgtgcgaggagcatgaagaagagaagatcaatatttactgcctgagc
tgtgaggtgcccacctgctctctctgcaaggtcttcggtgcccacaaggactgtgaggtg
gccccactgcccaccatttacaaacgccagaagagtgagctcagcgatggcatcgcgatg
ctggtggcaggcaatgaccgcgtgcaagcagtgatcacacagatggaggaggtgtgccag
actatcgaggacaatagccggaggcagaagcagttgttaaaccagaggtttgagagcctg
tgcgcagtgctggaggagcgcaagggtgagctgctgcaggcgctggcccgggagcaagag
gagaagctgcagcgcgtccgcggcctcatccgtcagtatggcgaccacctggaggcctcc
tctaagctggtggagtctgccatccagtccatggaagagccacaaatggcgctgtatctc
cagcaggccaaggagctgatcaataaggtcggggccatgtcgaaggtggagctggcaggg
cggccggagccaggctatgagagcatggagcaattcaccgtaagggtggagcacgtggcc
gaaatgctgcggaccatcgacttccagccaggcgcttccggggaggaagaggaggtggcc
ccagacggagaggagggcagcgcggggccggaggaagagcggccggatgggccttaa

KEGG   Homo sapiens (human): 823
Entry
823               CDS       T01001                                 
Symbol
CAPN1, CANP, CANP1, CANPL1, SPG76, muCANP, muCL
Name
(RefSeq) calpain 1
  KO
K01367  calpain-1 [EC:3.4.22.52]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04141  Protein processing in endoplasmic reticulum
hsa04210  Apoptosis
hsa04217  Necroptosis
hsa04218  Cellular senescence
hsa05010  Alzheimer disease
hsa05022  Pathways of neurodegeneration - multiple diseases
hsa05131  Shigellosis
Network
nt06460  Alzheimer disease
nt06466  Pathways of neurodegeneration
nt06539  Cytoskeleton in muscle cells
  Element
N00997  Mutation-caused aberrant Abeta to electron transfer in Complex I
N01018  Mutation-caused aberrant Abeta to anterograde axonal transport
N01821  Sarcomere, M-band
Disease
H00266  Hereditary spastic paraplegia
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   04141 Protein processing in endoplasmic reticulum
    823 (CAPN1)
 09140 Cellular Processes
  09143 Cell growth and death
   04210 Apoptosis
    823 (CAPN1)
   04217 Necroptosis
    823 (CAPN1)
   04218 Cellular senescence
    823 (CAPN1)
 09160 Human Diseases
  09171 Infectious disease: bacterial
   05131 Shigellosis
    823 (CAPN1)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    823 (CAPN1)
   05022 Pathways of neurodegeneration - multiple diseases
    823 (CAPN1)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    823 (CAPN1)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.22  Cysteine endopeptidases
    3.4.22.52  calpain-1
     823 (CAPN1)
Peptidases and inhibitors [BR:hsa01002]
 Cysteine peptidases
  Family C2: calpain family
   823 (CAPN1)
SSDB
Motif
Pfam: Peptidase_C2 Calpain_III EF-hand_6 EF-hand_1 EF-hand_8 EF-hand_7 CAPN13-like_C_EFh EF-hand_5
Other DBs
NCBI-GeneID: 823
NCBI-ProteinID: NP_001185797
OMIM: 114220
HGNC: 1476
Ensembl: ENSG00000014216
UniProt: P07384 B2RDI5
Structure
LinkDB
Position
11:65181373..65212006
AA seq 714 aa
MSEEIITPVYCTGVSAQVQKQRARELGLGRHENAIKYLGQDYEQLRVRCLQSGTLFRDEA
FPPVPQSLGYKDLGPNSSKTYGIKWKRPTELLSNPQFIVDGATRTDICQGALGDCWLLAA
IASLTLNDTLLHRVVPHGQSFQNGYAGIFHFQLWQFGEWVDVVVDDLLPIKDGKLVFVHS
AEGNEFWSALLEKAYAKVNGSYEALSGGSTSEGFEDFTGGVTEWYELRKAPSDLYQIILK
ALERGSLLGCSIDISSVLDMEAITFKKLVKGHAYSVTGAKQVNYRGQVVSLIRMRNPWGE
VEWTGAWSDSSSEWNNVDPYERDQLRVKMEDGEFWMSFRDFMREFTRLEICNLTPDALKS
RTIRKWNTTLYEGTWRRGSTAGGCRNYPATFWVNPQFKIRLDETDDPDDYGDRESGCSFV
LALMQKHRRRERRFGRDMETIGFAVYEVPPELVGQPAVHLKRDFFLANASRARSEQFINL
REVSTRFRLPPGEYVVVPSTFEPNKEGDFVLRFFSEKSAGTVELDDQIQANLPDEQVLSE
EEIDENFKALFRQLAGEDMEISVKELRTILNRIISKHKDLRTKGFSLESCRSMVNLMDRD
GNGKLGLVEFNILWNRIRNYLSIFRKFDLDKSGSMSAYEMRMAIESAGFKLNKKLYELII
TRYSEPDLAVDFDNFVCCLVRLETMFRFFKTLDTDLDGVVTFDLFKWLQLTMFA
NT seq 2145 nt   +upstreamnt  +downstreamnt
atgtcggaggagatcatcacgccggtgtactgcactggggtgtcagcccaagtgcagaag
cagcgggccagggagctgggcctgggccgccatgagaatgccatcaagtacctgggccag
gattatgagcagctgcgggtgcgatgcctgcagagtgggaccctcttccgtgatgaggcc
ttccccccggtaccccagagcctgggttacaaggacctgggtcccaattcctccaagacc
tatggcatcaagtggaagcgtcccacggaactgctgtcaaacccccagttcattgtggat
ggagctacccgcacagacatctgccagggagcactgggggactgctggctcttggcggcc
atcgcctccctcactctcaacgacaccctcctgcaccgagtggttccgcacggccagagc
ttccagaatggctatgccggcatcttccatttccagctgtggcaatttggggagtgggtg
gacgtggtcgtggatgacctgctgcccatcaaggacgggaagctagtgttcgtgcactct
gccgaaggcaacgagttctggagcgccctgcttgagaaggcctatgccaaggtaaatggc
agctacgaggccctgtcagggggcagcacctcagagggctttgaggacttcacaggcggg
gttaccgagtggtacgagttgcgcaaggctcccagtgacctctaccagatcatcctcaag
gcgctggagcggggctccctgctgggctgctccatagacatctccagcgttctagacatg
gaggccatcactttcaagaagttggtgaagggccatgcctactctgtgaccggggccaag
caggtgaactaccgaggccaggtggtgagcctgatccggatgcggaacccctggggcgag
gtggagtggacgggagcctggagcgacagctcctcagagtggaacaacgtggacccatat
gaacgggaccagctccgggtcaagatggaggacggggagttctggatgtcattccgagac
ttcatgcgggagttcacccgcctggagatctgcaacctcacacccgacgccctcaagagc
cggaccatccgcaaatggaacaccacactctacgaaggcacctggcggcgggggagcacc
gcggggggctgccgaaactacccagccaccttctgggtgaaccctcagttcaagatccgg
ctggatgagacggatgacccggacgactacggggaccgcgagtcaggctgcagcttcgtg
ctcgcccttatgcagaagcaccgtcgccgcgagcgccgcttcggccgcgacatggagact
attggcttcgcggtctacgaggtccctccggagctggtgggccagccggccgtacacttg
aagcgtgacttcttcctggccaatgcgtctcgggcgcgctcagagcagttcatcaacctg
cgagaggtcagcacccgcttccgcctgccacccggggagtatgtggtggtgccctccacc
ttcgagcccaacaaggagggcgacttcgtgctgcgcttcttctcagagaagagtgctggg
actgtggagctggatgaccagatccaggccaatctccccgatgagcaagtgctctcagaa
gaggagattgacgagaacttcaaggccctcttcaggcagctggcaggggaggacatggag
atcagcgtgaaggagttgcggacaatcctcaataggatcatcagcaaacacaaagacctg
cggaccaagggcttcagcctagagtcgtgccgcagcatggtgaacctcatggatcgtgat
ggcaatgggaagctgggcctggtggagttcaacatcctgtggaaccgcatccggaattac
ctgtccatcttccggaagtttgacctggacaagtcgggcagcatgagtgcctacgagatg
cggatggccattgagtcggcaggcttcaagctcaacaagaagctgtacgagctcatcatc
acccgctactcggagcccgacctggcggtcgactttgacaatttcgtttgctgcctggtg
cggctagagaccatgttccgatttttcaaaactctggacacagatctggatggagttgtg
acctttgacttgtttaagtggttgcagctgaccatgtttgcatga

KEGG   Homo sapiens (human): 824
Entry
824               CDS       T01001                                 
Symbol
CAPN2, CANP2, CANPL2, CANPml, mCANP
Name
(RefSeq) calpain 2
  KO
K03853  calpain-2 [EC:3.4.22.53]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04141  Protein processing in endoplasmic reticulum
hsa04210  Apoptosis
hsa04217  Necroptosis
hsa04218  Cellular senescence
hsa04510  Focal adhesion
hsa05010  Alzheimer disease
hsa05022  Pathways of neurodegeneration - multiple diseases
hsa05131  Shigellosis
Network
nt06460  Alzheimer disease
nt06466  Pathways of neurodegeneration
nt06539  Cytoskeleton in muscle cells
  Element
N00997  Mutation-caused aberrant Abeta to electron transfer in Complex I
N01018  Mutation-caused aberrant Abeta to anterograde axonal transport
N01821  Sarcomere, M-band
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09123 Folding, sorting and degradation
   04141 Protein processing in endoplasmic reticulum
    824 (CAPN2)
 09140 Cellular Processes
  09143 Cell growth and death
   04210 Apoptosis
    824 (CAPN2)
   04217 Necroptosis
    824 (CAPN2)
   04218 Cellular senescence
    824 (CAPN2)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    824 (CAPN2)
 09160 Human Diseases
  09171 Infectious disease: bacterial
   05131 Shigellosis
    824 (CAPN2)
  09164 Neurodegenerative disease
   05010 Alzheimer disease
    824 (CAPN2)
   05022 Pathways of neurodegeneration - multiple diseases
    824 (CAPN2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    824 (CAPN2)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.22  Cysteine endopeptidases
    3.4.22.53  calpain-2
     824 (CAPN2)
Peptidases and inhibitors [BR:hsa01002]
 Cysteine peptidases
  Family C2: calpain family
   824 (CAPN2)
SSDB
Motif
Pfam: Peptidase_C2 Calpain_III EF-hand_6 EF-hand_1 EF-hand_8 EF-hand_7
Other DBs
NCBI-GeneID: 824
NCBI-ProteinID: NP_001739
OMIM: 114230
HGNC: 1479
Ensembl: ENSG00000162909
UniProt: P17655 Q59EF6
Structure
LinkDB
Position
1:223701597..223776018
AA seq 700 aa
MAGIAAKLAKDREAAEGLGSHDRAIKYLNQDYEALRNECLEAGTLFQDPSFPAIPSALGF
KELGPYSSKTRGIEWKRPTEICADPQFIIGGATRTDICQGALGDCWLLAAIASLTLNEEI
LARVVPLNQSFQENYAGIFHFQFWQYGEWVEVVVDDRLPTKDGELLFVHSAEGSEFWSAL
LEKAYAKINGCYEALSGGATTEGFEDFTGGIAEWYELKKPPPNLFKIIQKALQKGSLLGC
SIDITSAADSEAITFQKLVKGHAYSVTGAEEVESNGSLQKLIRIRNPWGEVEWTGRWNDN
CPSWNTIDPEERERLTRRHEDGEFWMSFSDFLRHYSRLEICNLTPDTLTSDTYKKWKLTK
MDGNWRRGSTAGGCRNYPNTFWMNPQYLIKLEEEDEDEEDGESGCTFLVGLIQKHRRRQR
KMGEDMHTIGFGIYEVPEELSGQTNIHLSKNFFLTNRARERSDTFINLREVLNRFKLPPG
EYILVPSTFEPNKDGDFCIRVFSEKKADYQAVDDEIEANLEEFDISEDDIDDGFRRLFAQ
LAGEDAEISAFELQTILRRVLAKRQDIKSDGFSIETCKIMVDMLDSDGSGKLGLKEFYIL
WTKIQKYQKIYREIDVDRSGTMNSYEMRKALEEAGFKMPCQLHQVIVARFADDQLIIDFD
NFVRCLVRLETLFKIFKQLDPENTGTIELDLISWLCFSVL
NT seq 2103 nt   +upstreamnt  +downstreamnt
atggcgggcatcgcggccaagctggcgaaggaccgggaggcggccgaggggctgggctcc
cacgacagggccatcaagtacctcaaccaggactacgaggcgctgcggaacgagtgcctg
gaggccgggacgctcttccaggacccgtccttcccggccatcccctcggccctgggcttc
aaggagttggggccctactccagcaaaacccggggcatcgagtggaagcgccccacggag
atctgcgctgacccccagtttatcattggaggagccacccgcacagacatctgccaagga
gccctgggtgactgctggctgctggcagccattgcctccctcaccttgaatgaagaaatc
ctggctcgagtcgtccccctaaaccagagcttccaggaaaactatgcagggatctttcac
ttccagttctggcaatacggcgagtgggtggaggtggtggtggatgacaggctgcccacc
aaggacggggagctgctctttgtgcattcagccgaagggagcgagttctggagcgccctg
ctggagaaggcatacgccaagatcaacggatgctatgaagcgctatcagggggtgccacc
actgagggcttcgaagacttcaccggaggcattgctgagtggtatgagttgaagaagccc
cctcccaacctgttcaagatcatccagaaagctctgcaaaaaggctctctccttggctgc
tccatcgacatcaccagcgccgcggactcggaggccatcacgtttcagaagctggtgaag
gggcacgcgtactcggtcaccggagccgaggaggttgaaagtaacggaagcctacagaaa
ctgatccgcatccgaaatccctggggagaagtggagtggacagggcggtggaatgacaac
tgcccaagctggaacactatagacccagaggagagggaaaggctgaccagacggcatgaa
gatggagaattctggatgtctttcagtgacttcctgaggcactattcccgcctggagatc
tgtaacctgaccccagacactctcaccagcgatacctacaagaagtggaaactcaccaaa
atggatgggaactggaggcggggctccaccgcgggaggttgcaggaactacccgaacaca
ttctggatgaaccctcagtacctgatcaagctggaggaggaggatgaggacgaggaggat
ggggagagcggctgcaccttcctggtggggctcattcagaagcaccgacggcggcagagg
aagatgggcgaggacatgcacaccatcggctttggcatctatgaggttccagaggagtta
agtgggcagaccaacatccacctcagcaaaaacttcttcctgacgaatcgcgccagggag
cgctcagacaccttcatcaacctccgggaggtgctcaaccgcttcaagctgccgccagga
gagtacattctcgtgccttccaccttcgaacccaacaaggatggggatttctgcatccgg
gtcttttctgaaaagaaagctgactaccaagctgtcgatgatgaaatcgaggccaatctt
gaagagttcgacatcagcgaggatgacattgatgatggattcaggagactgtttgcccag
ttggcaggagaggatgcggagatctctgcctttgagctgcagaccatcctgagaagggtt
ctagcaaagcgccaagatatcaagtcagatggcttcagcatcgagacatgcaaaattatg
gttgacatgctagattcggacgggagtggcaagctggggctgaaggagttctacattctc
tggacgaagattcaaaaataccaaaaaatttaccgagaaatcgacgttgacaggtctggt
accatgaattcctatgaaatgcggaaggcattagaagaagcaggtttcaagatgccctgt
caactccaccaagtcatcgttgctcggtttgcagatgaccagctcatcatcgattttgat
aattttgttcggtgtttggttcggctggaaacgctattcaagatatttaagcagctggat
cccgagaatactggaacaatagagctcgaccttatctcttggctctgtttctcagtactt
tga

KEGG   Homo sapiens (human): 825
Entry
825               CDS       T01001                                 
Symbol
CAPN3, CANP3, CANPL3, LGMD2, LGMD2A, LGMDD4, LGMDR1, nCL-1, p94
Name
(RefSeq) calpain 3
  KO
K08573  calpain-3 [EC:3.4.22.54]
Organism
hsa  Homo sapiens (human)
Pathway
hsa04820  Cytoskeleton in muscle cells
Network
nt06539  Cytoskeleton in muscle cells
  Element
N01821  Sarcomere, M-band
Disease
H00592  Calpainopathy
H00593  Limb-girdle muscular dystrophy
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    825 (CAPN3)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    825 (CAPN3)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.22  Cysteine endopeptidases
    3.4.22.54  calpain-3
     825 (CAPN3)
Peptidases and inhibitors [BR:hsa01002]
 Cysteine peptidases
  Family C2: calpain family
   825 (CAPN3)
SSDB
Motif
Pfam: Peptidase_C2 Calpain_III Calpain_u2 EF-hand_1 EF-hand_6 EF-hand_7 EF-hand_8 EF-hand_5 CAPN13-like_C_EFh SURF6
Other DBs
NCBI-GeneID: 825
NCBI-ProteinID: NP_000061
OMIM: 114240
HGNC: 1480
Ensembl: ENSG00000092529
UniProt: P20807
Structure
LinkDB
Position
15:42359501..42412317
AA seq 821 aa
MPTVISASVAPRTAAEPRSPGPVPHPAQSKATEAGGGNPSGIYSAIISRNFPIIGVKEKT
FEQLHKKCLEKKVLYVDPEFPPDETSLFYSQKFPIQFVWKRPPEICENPRFIIDGANRTD
ICQGELGDCWFLAAIACLTLNQHLLFRVIPHDQSFIENYAGIFHFQFWRYGEWVDVVIDD
CLPTYNNQLVFTKSNHRNEFWSALLEKAYAKLHGSYEALKGGNTTEAMEDFTGGVAEFFE
IRDAPSDMYKIMKKAIERGSLMGCSIDDGTNMTYGTSPSGLNMGELIARMVRNMDNSLLQ
DSDLDPRGSDERPTRTIIPVQYETRMACGLVRGHAYSVTGLDEVPFKGEKVKLVRLRNPW
GQVEWNGSWSDRWKDWSFVDKDEKARLQHQVTEDGEFWMSYEDFIYHFTKLEICNLTADA
LQSDKLQTWTVSVNEGRWVRGCSAGGCRNFPDTFWTNPQYRLKLLEEDDDPDDSEVICSF
LVALMQKNRRKDRKLGASLFTIGFAIYEVPKEMHGNKQHLQKDFFLYNASKARSKTYINM
REVSQRFRLPPSEYVIVPSTYEPHQEGEFILRVFSEKRNLSEEVENTISVDRPVKKKKTK
PIIFVSDRANSNKELGVDQESEEGKGKTSPDKQKQSPQPQPGSSDQESEEQQQFRNIFKQ
IAGDDMEICADELKKVLNTVVNKHKDLKTHGFTLESCRSMIALMDTDGSGKLNLQEFHHL
WNKIKAWQKIFKHYDTDQSGTINSYEMRNAVNDAGFHLNNQLYDIITMRYADKHMNIDFD
SFICCFVRLEGMFRAFHAFDKDGDGIIKLNVLEWLQLTMYA
NT seq 2466 nt   +upstreamnt  +downstreamnt
atgccgaccgtcattagcgcatctgtggctccaaggacagcggctgagccccggtcccca
gggccagttcctcacccggcccagagcaaggccactgaggctgggggtggaaacccaagt
ggcatctattcagccatcatcagccgcaattttcctattatcggagtgaaagagaagaca
ttcgagcaacttcacaagaaatgtctagaaaagaaagttctttatgtggaccctgagttc
ccaccggatgagacctctctcttttatagccagaagttccccatccagttcgtctggaag
agacctccggaaatttgcgagaatccccgatttatcattgatggagccaacagaactgac
atctgtcaaggagagctaggggactgctggtttctcgcagccattgcctgcctgaccctg
aaccagcaccttcttttccgagtcataccccatgatcaaagtttcatcgaaaactacgca
gggatcttccacttccagttctggcgctatggagagtgggtggacgtggttatagatgac
tgcctgccaacgtacaacaatcaactggttttcaccaagtccaaccaccgcaatgagttc
tggagtgctctgctggagaaggcttatgctaagctccatggttcctacgaagctctgaaa
ggtgggaacaccacagaggccatggaggacttcacaggaggggtggcagagttttttgag
atcagggatgctcctagtgacatgtacaagatcatgaagaaagccatcgagagaggctcc
ctcatgggctgctccattgatgatggcacgaacatgacctatggaacctctccttctggt
ctgaacatgggggagttgattgcacggatggtaaggaatatggataactcactgctccag
gactcagacctcgaccccagaggctcagatgaaagaccgacccggacaatcattccggtt
cagtatgagacaagaatggcctgcgggctggtcagaggtcacgcctactctgtcacgggg
ctggatgaggtcccgttcaaaggtgagaaagtgaagctggtgcggctgcggaatccgtgg
ggccaggtggagtggaacggttcttggagtgatagatggaaggactggagctttgtggac
aaagatgagaaggcccgtctgcagcaccaggtcactgaggatggagagttctggatgtcc
tatgaggatttcatctaccatttcacaaagttggagatctgcaacctcacggccgatgct
ctgcagtctgacaagcttcagacctggacagtgtctgtgaacgagggccgctgggtacgg
ggttgctctgccggaggctgccgcaacttcccagatactttctggaccaaccctcagtac
cgtctgaagctcctggaggaggacgatgaccctgatgactcggaggtgatttgcagcttc
ctggtggccctgatgcagaagaaccggcggaaggaccggaagctaggggccagtctcttc
accattggcttcgccatctacgaggttcccaaagagatgcacgggaacaagcagcacctg
cagaaggacttcttcctgtacaacgcctccaaggccaggagcaaaacctacatcaacatg
cgggaggtgtcccagcgcttccgcctgcctcccagcgagtacgtcatcgtgccctccacc
tacgagccccaccaggagggggaattcatcctccgggtcttctctgaaaagaggaacctc
tctgaggaagttgaaaataccatctccgtggatcggccagtgaaaaagaaaaaaaccaag
cccatcatcttcgtttcggacagagcaaacagcaacaaggagctgggtgtggaccaggag
tcagaggagggcaaaggcaaaacaagccctgataagcaaaagcagtccccacagccacag
cctggcagctctgatcaggaaagtgaggaacagcaacaattccggaacattttcaagcag
atagcaggagatgacatggagatctgtgcagatgagctcaagaaggtccttaacacagtc
gtgaacaaacacaaggacctgaagacacacgggttcacactggagtcctgccgtagcatg
attgcgctcatggatacagatggctctggaaagctcaacctgcaggagttccaccacctc
tggaacaagattaaggcctggcagaaaattttcaaacactatgacacagaccagtccggc
accatcaacagctacgagatgcgaaatgcagtcaacgacgcaggattccacctcaacaac
cagctctatgacatcattaccatgcggtacgcagacaaacacatgaacatcgactttgac
agtttcatctgctgcttcgttaggctggagggcatgttcagagcttttcatgcatttgac
aaggatggagatggtatcatcaagctcaacgttctggagtggctgcagctcaccatgtat
gcctga

KEGG   Homo sapiens (human): 1073
Entry
1073              CDS       T01001                                 
Symbol
CFL2, NEM7
Name
(RefSeq) cofilin 2
  KO
K05765  cofilin
Organism
hsa  Homo sapiens (human)
Pathway
hsa04360  Axon guidance
hsa04666  Fc gamma R-mediated phagocytosis
hsa04810  Regulation of actin cytoskeleton
hsa05133  Pertussis
hsa05170  Human immunodeficiency virus 1 infection
Network
nt06161  Human immunodeficiency virus 1 (HIV-1)
nt06539  Cytoskeleton in muscle cells
  Element
N00433  CXCR4-GNB/G-RAC signaling pathway
N01821  Sarcomere, M-band
Disease
H00698  Nemaline myopathy
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09140 Cellular Processes
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    1073 (CFL2)
 09150 Organismal Systems
  09151 Immune system
   04666 Fc gamma R-mediated phagocytosis
    1073 (CFL2)
  09158 Development and regeneration
   04360 Axon guidance
    1073 (CFL2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05170 Human immunodeficiency virus 1 infection
    1073 (CFL2)
  09171 Infectious disease: bacterial
   05133 Pertussis
    1073 (CFL2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:hsa04131]
    1073 (CFL2)
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:hsa04812]
    1073 (CFL2)
   04147 Exosome [BR:hsa04147]
    1073 (CFL2)
Membrane trafficking [BR:hsa04131]
 Others
  Actin-binding proteins
   Others
    1073 (CFL2)
Cytoskeleton proteins [BR:hsa04812]
 Eukaryotic cytoskeleton proteins
  Actin filaments / Microfilaments
   Actin-binding proteins
    ADF / Cofilin
     1073 (CFL2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Proteins found in most exosomes
   1073 (CFL2)
SSDB
Motif
Pfam: Cofilin_ADF
Other DBs
NCBI-GeneID: 1073
NCBI-ProteinID: NP_068733
OMIM: 601443
HGNC: 1875
Ensembl: ENSG00000165410
UniProt: Q9Y281 Q549N0
Structure
LinkDB
Position
14:complement(34709113..34714593)
AA seq 166 aa
MASGVTVNDEVIKVFNDMKVRKSSTQEEIKKRKKAVLFCLSDDKRQIIVEEAKQILVGDI
GDTVEDPYTSFVKLLPLNDCRYALYDATYETKESKKEDLVFIFWAPESAPLKSKMIYASS
KDAIKKKFTGIKHEWQVNGLDDIKDRSTLGEKLGGNVVVSLEGKPL
NT seq 501 nt   +upstreamnt  +downstreamnt
atggcttctggagttacagtgaatgatgaagtcatcaaagtttttaatgatatgaaagta
aggaaatcttctacacaagaggagatcaaaaagagaaagaaagcagttctcttctgttta
agcgatgacaaaagacaaataattgtagaggaagcaaagcagatcttggtgggtgacatt
ggtgatactgtagaggacccctacacatcttttgtgaagttgctacctctgaatgattgc
cgatatgctttgtacgatgccacatacgaaacaaaagagtctaagaaagaagacctagta
tttatattctgggctcctgaaagtgcacctttaaaaagcaagatgatttatgctagctct
aaagatgccattaaaaagaaatttacaggtattaaacatgagtggcaagtaaatggcttg
gatgatattaaggaccgttcgacacttggagagaaattgggaggcaatgtagtagtttca
cttgaaggaaaaccattataa

DBGET integrated database retrieval system