KEGG   Homo sapiens (human): 11091
Entry
11091             CDS       T01001                                 
Symbol
WDR5, BIG-3, BIG3, CFAP89, SWD3
Name
(RefSeq) WD repeat domain 5
  KO
K14963  COMPASS component SWD3
Organism
hsa  Homo sapiens (human)
Pathway
hsa04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    11091 (WDR5)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    11091 (WDR5)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HAT complexes
    ATAC complex
     11091 (WDR5)
    NSL complex
     11091 (WDR5)
   HMT complexes
    COMPASS/SET1 complex
     11091 (WDR5)
    COMPASS/SET1 complex (yeast)
     11091 (WDR5)
    MLL-HCF complex
     11091 (WDR5)
    MLL3/MLL4 complex
     11091 (WDR5)
SSDB
Motif
Pfam: WD40 ANAPC4_WD40 NBCH_WD40 Nup160 eIF2A Ge1_WD40 WD40_like VID27 CDtoxinA Nucleoporin_N
Other DBs
NCBI-GeneID: 11091
NCBI-ProteinID: NP_060058
OMIM: 609012
HGNC: 12757
Ensembl: ENSG00000196363
Vega: OTTHUMG00000131707
Pharos: P61964(Tchem)
UniProt: P61964
Structure
LinkDB
Position
9:134135199..134159968
AA seq 334 aa
MATEEKKPETEAARAQPTPSSSATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSPNGEWL
ASSSADKLIKIWGAYDGKFEKTISGHKLGISDVAWSSDSNLLVSASDDKTLKIWDVSSGK
CLKTLKGHSNYVFCCNFNPQSNLIVSGSFDESVRIWDVKTGKCLKTLPAHSDPVSAVHFN
RDGSLIVSSSYDGLCRIWDTASGQCLKTLIDDDNPPVSFVKFSPNGKYILAATLDNTLKL
WDYSKGKCLKTYTGHKNEKYCIFANFSVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGH
TDVVISTACHPTENIIASAALENDKTIKLWKSDC
NT seq 1005 nt   +upstreamnt  +downstreamnt
atggcgacggaggagaagaagcccgagaccgaggccgccagagcacagccaaccccttcg
tcatccgccactcagagcaagcctacacctgtgaagccaaactatgctctaaagttcacc
cttgctggccacaccaaagcagtgtcctccgtgaaattcagcccgaatggagagtggctg
gcaagttcatctgctgataaacttattaaaatttggggcgcgtatgatgggaaatttgag
aaaaccatatctggtcacaagctgggaatatccgatgtagcctggtcgtcagattctaac
cttcttgtttctgcctcagatgacaaaaccttgaagatatgggacgtgagctcgggcaag
tgtctgaaaaccctgaagggacacagtaattatgtcttttgctgcaacttcaatccccag
tccaaccttattgtctcaggatcctttgacgaaagcgtgaggatatgggatgtgaaaaca
gggaagtgcctcaagactttgccagctcactcggatccagtctcggccgttcattttaat
cgtgatggatccttgatagtttcaagtagctatgatggtctctgtcgcatctgggacacc
gcctcaggccagtgcctgaagacgctcatcgatgacgacaacccccccgtgtcttttgtg
aagttctccccgaacggcaaatacatcctggccgccacgctggacaacactctgaagctc
tgggactacagcaaggggaagtgcctgaagacgtacactggccacaagaatgagaaatac
tgcatatttgccaatttctctgttactggtgggaagtggattgtgtctggctcagaggat
aaccttgtttacatctggaaccttcagacgaaagagattgtacagaaactacaaggccac
acagatgtcgtgatctcaacagcttgtcacccaacagaaaacatcatcgcctctgctgcg
ctagaaaatgacaaaacaattaaactgtggaagagtgactgctaa

KEGG   Homo sapiens (human): 4297
Entry
4297              CDS       T01001                                 
Symbol
KMT2A, ALL-1, ALL1, CXXC7, HRX, HTRX, HTRX1, MLL, MLL1, MLL1A, TRX1, WDSTS
Name
(RefSeq) lysine methyltransferase 2A
  KO
K09186  [histone H3]-lysine4 N-trimethyltransferase MLL1 [EC:2.1.1.354]
Organism
hsa  Homo sapiens (human)
Pathway
hsa00310  Lysine degradation
hsa01100  Metabolic pathways
hsa04934  Cushing syndrome
hsa05202  Transcriptional misregulation in cancer
Network
nt06240  Transcription
nt06360  Cushing syndrome
  Element
N00119  MLL-AF4 fusion to transcriptional activation
N00120  MLL-ENL fusion to transcriptional activation
N00290  Mutation-inactivated MEN1 to transcription
Disease
H00001  B-cell acute lymphoblastic leukemia
H00002  T-cell acute lymphoblastic leukemia
H01879  Wiedemann-Steiner syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09100 Metabolism
  09105 Amino acid metabolism
   00310 Lysine degradation
    4297 (KMT2A)
 09160 Human Diseases
  09161 Cancer: overview
   05202 Transcriptional misregulation in cancer
    4297 (KMT2A)
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    4297 (KMT2A)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:hsa03000]
    4297 (KMT2A)
   03036 Chromosome and associated proteins [BR:hsa03036]
    4297 (KMT2A)
Enzymes [BR:hsa01000]
 2. Transferases
  2.1  Transferring one-carbon groups
   2.1.1  Methyltransferases
    2.1.1.354  [histone H3]-lysine4 N-trimethyltransferase
     4297 (KMT2A)
Transcription factors [BR:hsa03000]
 Eukaryotic type
  Zinc finger
   CXXC CpG-binding proteins
    4297 (KMT2A)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HMTs (histone methyltransferases)
    HKMTs (histone lysine methyltransferases)
     4297 (KMT2A)
   HMT complexes
    MLL-HCF complex
     4297 (KMT2A)
SSDB
Motif
Pfam: SET FYRC PHD FYRN zf-CXXC zf-HC5HC2H Bromodomain PHD_2 PHD_4
Other DBs
NCBI-GeneID: 4297
NCBI-ProteinID: NP_005924
OMIM: 159555
HGNC: 7132
Ensembl: ENSG00000118058
Vega: OTTHUMG00000166337
Pharos: Q03164(Tbio)
UniProt: Q03164
Structure
LinkDB
Position
11:118436492..118526832
AA seq 3969 aa
MAHSCRWRFPARPGTTGGGGGGGRRGLGGAPRQRVPALLLPPGPPVGGGGPGAPPSPPAV
AAAAAAAGSSGAGVPGGAAAASAASSSSASSSSSSSSSASSGPALLRVGPGFDAALQVSA
AIGTNLRRFRAVFGESGGGGGSGEDEQFLGFGSDEEVRVRSPTRSPSVKTSPRKPRGRPR
SGSDRNSAILSDPSVFSPLNKSETKSGDKIKKKDSKSIEKKRGRPPTFPGVKIKITHGKD
ISELPKGNKEDSLKKIKRTPSATFQQATKIKKLRAGKLSPLKSKFKTGKLQIGRKGVQIV
RRRGRPPSTERIKTPSGLLINSELEKPQKVRKDKEGTPPLTKEDKTVVRQSPRRIKPVRI
IPSSKRTDATIAKQLLQRAKKGAQKKIEKEAAQLQGRKVKTQVKNIRQFIMPVVSAISSR
IIKTPRRFIEDEDYDPPIKIARLESTPNSRFSAPSCGSSEKSSAASQHSSQMSSDSSRSS
SPSVDTSTDSQASEEIQVLPEERSDTPEVHPPLPISQSPENESNDRRSRRYSVSERSFGS
RTTKKLSTLQSAPQQQTSSSPPPPLLTPPPPLQPASSISDHTPWLMPPTIPLASPFLPAS
TAPMQGKRKSILREPTFRWTSLKHSRSEPQYFSSAKYAKEGLIRKPIFDNFRPPPLTPED
VGFASGFSASGTAASARLFSPLHSGTRFDMHKRSPLLRAPRFTPSEAHSRIFESVTLPSN
RTSAGTSSSGVSNRKRKRKVFSPIRSEPRSPSHSMRTRSGRLSSSELSPLTPPSSVSSSL
SISVSPLATSALNPTFTFPSHSLTQSGESAEKNQRPRKQTSAPAEPFSSSSPTPLFPWFT
PGSQTERGRNKDKAPEELSKDRDADKSVEKDKSRERDREREKENKRESRKEKRKKGSEIQ
SSSALYPVGRVSKEKVVGEDVATSSSAKKATGRKKSSSHDSGTDITSVTLGDTTAVKTKI
LIKKGRGNLEKTNLDLGPTAPSLEKEKTLCLSTPSSSTVKHSTSSIGSMLAQADKLPMTD
KRVASLLKKAKAQLCKIEKSKSLKQTDQPKAQGQESDSSETSVRGPRIKHVCRRAAVALG
RKRAVFPDDMPTLSALPWEEREKILSSMGNDDKSSIAGSEDAEPLAPPIKPIKPVTRNKA
PQEPPVKKGRRSRRCGQCPGCQVPEDCGVCTNCLDKPKFGGRNIKKQCCKMRKCQNLQWM
PSKAYLQKQAKAVKKKEKKSKTSEKKDSKESSVVKNVVDSSQKPTPSAREDPAPKKSSSE
PPPRKPVEEKSEEGNVSAPGPESKQATTPASRKSSKQVSQPALVIPPQPPTTGPPRKEVP
KTTPSEPKKKQPPPPESGPEQSKQKKVAPRPSIPVKQKPKEKEKPPPVNKQENAGTLNIL
STLSNGNSSKQKIPADGVHRIRVDFKEDCEAENVWEMGGLGILTSVPITPRVVCFLCASS
GHVEFVYCQVCCEPFHKFCLEENERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNK
CRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCA
KLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPESVAYT
CVNCTERHPAEWRLALEKELQISLKQVLTALLNSRTTSHLLRYRQAAKPPDLNPETEESI
PSRSSPEGPDPPVLTEVSKQDDQQPLDLEGVKRKMDQGNYTSVLEFSDDIVKIIQAAINS
DGGQPEIKKANSMVKSFFIRQMERVFPWFSVKKSRFWEPNKVSSNSGMLPNAVLPPSLDH
NYAQWQEREENSHTEQPPLMKKIIPAPKPKGPGEPDSPTPLHPPTPPILSTDRSREDSPE
LNPPPGIEDNRQCALCLTYGDDSANDAGRLLYIGQNEWTHVNCALWSAEVFEDDDGSLKN
VHMAVIRGKQLRCEFCQKPGATVGCCLTSCTSNYHFMCSRAKNCVFLDDKKVYCQRHRDL
IKGEVVPENGFEVFRRVFVDFEGISLRRKFLNGLEPENIHMMIGSMTIDCLGILNDLSDC
EDKLFPIGYQCSRVYWSTTDARKRCVYTCKIVECRPPVVEPDINSTVEHDENRTIAHSPT
SFTESSSKESQNTAEIISPPSPDRPPHSQTSGSCYYHVISKVPRIRTPSYSPTQRSPGCR
PLPSAGSPTPTTHEIVTVGDPLLSSGLRSIGSRRHSTSSLSPQRSKLRIMSPMRTGNTYS
RNNVSSVSTTGTATDLESSAKVVDHVLGPLNSSTSLGQNTSTSSNLQRTVVTVGNKNSHL
DGSSSSEMKQSSASDLVSKSSSLKGEKTKVLSSKSSEGSAHNVAYPGIPKLAPQVHNTTS
RELNVSKIGSFAEPSSVSFSSKEALSFPHLHLRGQRNDRDQHTDSTQSANSSPDEDTEVK
TLKLSGMSNRSSIINEHMGSSSRDRRQKGKKSCKETFKEKHSSKSFLEPGQVTTGEEGNL
KPEFMDEVLTPEYMGQRPCNNVSSDKIGDKGLSMPGVPKAPPMQVEGSAKELQAPRKRTV
KVTLTPLKMENESQSKNALKESSPASPLQIESTSPTEPISASENPGDGPVAQPSPNNTSC
QDSQSNNYQNLPVQDRNLMLPDGPKPQEDGSFKRRYPRRSARARSNMFFGLTPLYGVRSY
GEEDIPFYSSSTGKKRGKRSAEGQVDGADDLSTSDEDDLYYYNFTRTVISSGGEERLASH
NLFREEEQCDLPKISQLDGVDDGTESDTSVTATTRKSSQIPKRNGKENGTENLKIDRPED
AGEKEHVTKSSVGHKNEPKMDNCHSVSRVKTQGQDSLEAQLSSLESSRRVHTSTPSDKNL
LDTYNTELLKSDSDNNNSDDCGNILPSDIMDFVLKNTPSMQALGESPESSSSELLNLGEG
LGLDSNREKDMGLFEVFSQQLPTTEPVDSSVSSSISAEEQFELPLELPSDLSVLTTRSPT
VPSQNPSRLAVISDSGEKRVTITEKSVASSESDPALLSPGVDPTPEGHMTPDHFIQGHMD
ADHISSPPCGSVEQGHGNNQDLTRNSSTPGLQVPVSPTVPIQNQKYVPNSTDSPGPSQIS
NAAVQTTPPHLKPATEKLIVVNQNMQPLYVLQTLPNGVTQKIQLTSSVSSTPSVMETNTS
VLGPMGGGLTLTTGLNPSLPTSQSLFPSASKGLLPMSHHQHLHSFPAATQSSFPPNISNP
PSGLLIGVQPPPDPQLLVSESSQRTDLSTTVATPSSGLKKRPISRLQTRKNKKLAPSSTP
SNIAPSDVVSNMTLINFTPSQLPNHPSLLDLGSLNTSSHRTVPNIIKRSKSSIMYFEPAP
LLPQSVGGTAATAAGTSTISQDTSHLTSGSVSGLASSSSVLNVVSMQTTTTPTSSASVPG
HVTLTNPRLLGTPDIGSISNLLIKASQQSLGIQDQPVALPPSSGMFPQLGTSQTPSTAAI
TAASSICVLPSTQTTGITAASPSGEADEHYQLQHVNQLLASKTGIHSSQRDLDSASGPQV
SNFTQTVDAPNSMGLEQNKALSSAVQASPTSPGGSPSSPSSGQRSASPSVPGPTKPKPKT
KRFQLPLDKGNGKKHKVSHLRTSSSEAHIPDQETTSLTSGTGTPGAEAEQQDTASVEQSS
QKECGQPAGQVAVLPEVQVTQNPANEQESAEPKTVEEEESNFSSPLMLWLQQEQKRKESI
TEKKPKKGLVFEISSDDGFQICAESIEDAWKSLTDKVQEARSNARLKQLSFAGVNGLRML
GILHDAVVFLIEQLSGAKHCRNYKFRFHKPEEANEPPLNPHGSARAEVHLRKSAFDMFNF
LASKHRQPPEYNPNDEEEEEVQLKSARRATSMDLPMPMRFRHLKKTSKEAVGVYRSPIHG
RGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGCYMFRIDDSEVVDATMHGN
AARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDASNKLPCNCG
AKKCRKFLN
NT seq 11910 nt   +upstreamnt  +downstreamnt
atggcgcacagctgtcggtggcgcttccccgcccgacccgggaccaccgggggcggcggc
ggcggggggcgccggggcctagggggcgccccgcggcaacgcgtcccggccctgctgctt
ccccccgggcccccggtcggcggtggcggccccggggcgcccccctcccccccggctgtg
gcggccgcggcggcggcggcgggaagcagcggggctggggttccagggggagcggccgcc
gcctcagcagcctcctcgtcgtccgcctcgtcttcgtcttcgtcatcgtcctcagcctct
tcagggccggccctgctccgggtgggcccgggcttcgacgcggcgctgcaggtctcggcc
gccatcggcaccaacctgcgccggttccgggccgtgtttggggagagcggcgggggaggc
ggcagcggagaggatgagcaattcttaggttttggctcagatgaagaagtcagagtgcga
agtcccacaaggtctccttcagttaaaactagtcctcgaaaacctcgtgggagacctaga
agtggctctgaccgaaattcagctatcctctcagatccatctgtgttttcccctctaaat
aaatcagagaccaaatctggagataagatcaagaagaaagattctaaaagtatagaaaag
aagagaggaagacctcccaccttccctggagtaaaaatcaaaataacacatggaaaggac
atttcagagttaccaaagggaaacaaagaagatagcctgaaaaaaattaaaaggacacct
tctgctacgtttcagcaagccacaaagattaaaaaattaagagcaggtaaactctctcct
ctcaagtctaagtttaagacagggaagcttcaaataggaaggaagggggtacaaattgta
cgacggagaggaaggcctccatcaacagaaaggataaagaccccttcgggtctcctcatt
aattctgaactggaaaagccccagaaagtccggaaagacaaggaaggaacacctccactt
acaaaagaagataagacagttgtcagacaaagccctcgaaggattaagccagttaggatt
attccttcttcaaaaaggacagatgcaaccattgctaagcaactcttacagagggcaaaa
aagggggctcaaaagaaaattgaaaaagaagcagctcagctgcagggaagaaaggtgaag
acacaggtcaaaaatattcgacagttcatcatgcctgttgtcagtgctatctcctcgcgg
atcattaagacccctcggcggtttatagaggatgaggattatgaccctccaattaaaatt
gcccgattagagtctacaccgaatagtagattcagtgccccgtcctgtggatcttctgaa
aaatcaagtgcagcttctcagcactcctctcaaatgtcttcagactcctctcgatctagt
agccccagtgttgatacctccacagactctcaggcttctgaggagattcaggtacttcct
gaggagcggagcgatacccctgaagttcatcctccactgcccatttcccagtccccagaa
aatgagagtaatgataggagaagcagaaggtattcagtgtcggagagaagttttggatct
agaacgacgaaaaaattatcaactctacaaagtgccccccagcagcagacctcctcgtct
ccacctccacctctgctgactccaccgccaccactgcagccagcctccagtatctctgac
cacacaccttggcttatgcctccaacaatccccttagcatcaccatttttgcctgcttcc
actgctcctatgcaagggaagcgaaaatctattttgcgagaaccgacatttaggtggact
tctttaaagcattctaggtcagagccacaatacttttcctcagcaaagtatgccaaagaa
ggtcttattcgcaaaccaatatttgataatttccgaccccctccactaactcccgaggac
gttggctttgcatctggtttttctgcatctggtaccgctgcttcagcccgattgttttcg
ccactccattctggaacaaggtttgatatgcacaaaaggagccctcttctgagagctcca
agatttactccaagtgaggctcactctagaatatttgagtctgtaaccttgcctagtaat
cgaacttctgctggaacatcttcttcaggagtatccaatagaaaaaggaaaagaaaagtg
tttagtcctattcgatctgaaccaagatctccttctcactccatgaggacaagaagtgga
aggcttagtagttctgagctctcacctctcacccccccgtcttctgtctcttcctcgtta
agcatttctgttagtcctcttgccactagtgccttaaacccaacttttacttttccttct
cattccctgactcagtctggggaatctgcagagaaaaatcagagaccaaggaagcagact
agtgctccggcagagccattttcatcaagtagtcctactcctctcttcccttggtttacc
ccaggctctcagactgaaagagggagaaataaagacaaggcccccgaggagctgtccaaa
gatcgagatgctgacaagagcgtggagaaggacaagagtagagagagagaccgggagaga
gaaaaggagaataagcgggagtcaaggaaagagaaaaggaaaaagggatcagaaattcag
agtagttctgctttgtatcctgtgggtagggtttccaaagagaaggttgttggtgaagat
gttgccacttcatcttctgccaaaaaagcaacagggcggaagaagtcttcatcacatgat
tctgggactgatattacttctgtgactcttggggatacaacagctgtcaaaaccaaaata
cttataaagaaagggagaggaaatctggaaaaaaccaacttggacctcggcccaactgcc
ccatccctggagaaggagaaaaccctctgcctttccactccttcatctagcactgttaaa
cattccacttcctccataggctccatgttggctcaggcagacaagcttccaatgactgac
aagagggttgccagcctcctaaaaaaggccaaagctcagctctgcaagattgagaagagt
aagagtcttaaacaaaccgaccagcccaaagcacagggtcaagaaagtgactcatcagag
acctctgtgcgaggaccccggattaaacatgtctgcagaagagcagctgttgcccttggc
cgaaaacgagctgtgtttcctgatgacatgcccaccctgagtgccttaccatgggaagaa
cgagaaaagattttgtcttccatggggaatgatgacaagtcatcaattgctggctcagaa
gatgctgaacctcttgctccacccatcaaaccaattaaacctgtcactagaaacaaggca
ccccaggaacctccagtaaagaaaggacgtcgatcgaggcggtgtgggcagtgtcccggc
tgccaggtgcctgaggactgtggtgtttgtactaattgcttagataagcccaagtttggt
ggtcgcaatataaagaagcagtgctgcaagatgagaaaatgtcagaatctacaatggatg
ccttccaaagcctacctgcagaagcaagctaaagctgtgaaaaagaaagagaaaaagtct
aagaccagtgaaaagaaagacagcaaagagagcagtgttgtgaagaacgtggtggactct
agtcagaaacctaccccatcagcaagagaggatcctgccccaaagaaaagcagtagtgag
cctcctccacgaaagcccgtcgaggaaaagagtgaagaagggaatgtctcggcccctggg
cctgaatccaaacaggccaccactccagcttccaggaagtcaagcaagcaggtctcccag
ccagcactggtcatcccgcctcagccacctactacaggaccgccaagaaaagaagttccc
aaaaccactcctagtgagcccaagaaaaagcagcctccaccaccagaatcaggtccagag
cagagcaaacagaaaaaagtggctccccgcccaagtatccctgtaaaacaaaaaccaaaa
gaaaaggaaaaaccacctccggtcaataagcaggagaatgcaggcactttgaacatcctc
agcactctctccaatggcaatagttctaagcaaaaaattccagcagatggagtccacagg
atcagagtggactttaaggaggattgtgaagcagaaaatgtgtgggagatgggaggctta
ggaatcttgacttctgttcctataacacccagggtggtttgctttctctgtgccagtagt
gggcatgtagagtttgtgtattgccaagtctgttgtgagcccttccacaagttttgttta
gaggagaacgagcgccctctggaggaccagctggaaaattggtgttgtcgtcgttgcaaa
ttctgtcacgtttgtggaaggcaacatcaggctacaaagcagctgctggagtgtaataag
tgccgaaacagctatcaccctgagtgcctgggaccaaactaccccaccaaacccacaaag
aagaagaaagtctggatctgtaccaagtgtgttcgctgtaagagctgtggatccacaact
ccaggcaaagggtgggatgcacagtggtctcatgatttctcactgtgtcatgattgcgcc
aagctctttgctaaaggaaacttctgccctctctgtgacaaatgttatgatgatgatgac
tatgagagtaagatgatgcaatgtggaaagtgtgatcgctgggtccattccaaatgtgag
aatctttcagatgagatgtatgagattctatctaatctgccagaaagtgtggcctacact
tgtgtgaactgtactgagcggcaccctgcagagtggcgactggcccttgaaaaagagctg
cagatttctctgaagcaagttctgacagctttgttgaattctcggactaccagccatttg
ctacgctaccggcaggctgccaagcctccagacttaaatcccgagacagaggagagtata
ccttcccgcagctcccccgaaggacctgatccaccagttcttactgaggtcagcaaacag
gatgatcagcagcctttagatctagaaggagtcaagaggaagatggaccaagggaattac
acatctgtgttggagttcagtgatgatattgtgaagatcattcaagcagccattaattca
gatggaggacagccagaaattaaaaaagccaacagcatggtcaagtccttcttcattcgg
caaatggaacgtgtttttccatggttcagtgtcaaaaagtccaggttttgggagccaaat
aaagtatcaagcaacagtgggatgttaccaaacgcagtgcttccaccttcacttgaccat
aattatgctcagtggcaggagcgagaggaaaacagccacactgagcagcctcctttaatg
aagaaaatcattccagctcccaaacccaaaggtcctggagaaccagactcaccaactcct
ctgcatcctcctacaccaccaattttgagtactgataggagtcgagaagacagtccagag
ctgaacccacccccaggcatagaagacaatagacagtgtgcgttatgtttgacttatggt
gatgacagtgctaatgatgctggtcgtttactatatattggccaaaatgagtggacacat
gtaaattgtgctttgtggtcagcggaagtgtttgaagatgatgacggatcactaaagaat
gtgcatatggctgtgatcaggggcaagcagctgagatgtgaattctgccaaaagccagga
gccaccgtgggttgctgtctcacatcctgcaccagcaactatcacttcatgtgttcccga
gccaagaactgtgtctttctggatgataaaaaagtatattgccaacgacatcgggatttg
atcaaaggcgaagtggttcctgagaatggatttgaagttttcagaagagtgtttgtggac
tttgaaggaatcagcttgagaaggaagtttctcaatggcttggaaccagaaaatatccac
atgatgattgggtctatgacaatcgactgcttaggaattctaaatgatctctccgactgt
gaagataagctctttcctattggatatcagtgttccagggtatactggagcaccacagat
gctcgcaagcgctgtgtatatacatgcaagatagtggagtgccgtcctccagtcgtagag
ccggatatcaacagcactgttgaacatgatgaaaacaggaccattgcccatagtccaaca
tcttttacagaaagttcatcaaaagagagtcaaaacacagctgaaattataagtcctcca
tcaccagaccgacctcctcattcacaaacctctggctcctgttattatcatgtcatctca
aaggtccccaggattcgaacacccagttattctccaacacagagatcccctggctgtcga
ccgttgccttctgcaggaagtcctaccccaaccactcatgaaatagtcacagtaggtgat
cctttactctcctctggacttcgaagcattggctccaggcgtcacagtacctcttcctta
tcaccccagcggtccaaactccggataatgtctccaatgagaactgggaatacttactct
aggaataatgtttcctcagtctccaccaccgggaccgctactgatcttgaatcaagtgcc
aaagtagttgatcatgtcttagggccactgaattcaagtactagtttagggcaaaacact
tccacctcttcaaatttgcaaaggacagtggttactgtaggcaataaaaacagtcacttg
gatggatcttcatcttcagaaatgaagcagtccagtgcttcagacttggtgtccaagagc
tcctctttaaagggagagaagaccaaagtgctgagttccaagagctcagagggatctgca
cataatgtggcttaccctggaattcctaaactggccccacaggttcataacacaacatct
agagaactgaatgttagtaaaatcggctcctttgctgaaccctcttcagtgtcgttttct
tctaaagaggccctctccttcccacacctccatttgagagggcaaaggaatgatcgagac
caacacacagattctacccaatcagcaaactcctctccagatgaagatactgaagtcaaa
accttgaagctatctggaatgagcaacagatcatccattatcaacgaacatatgggatct
agttccagagataggagacagaaagggaaaaaatcctgtaaagaaactttcaaagaaaag
cattccagtaaatcttttttggaacctggtcaggtgacaactggtgaggaaggaaacttg
aagccagagtttatggatgaggttttgactcctgagtatatgggccaacgaccatgtaac
aatgtttcttctgataagattggtgataaaggcctttctatgccaggagtccccaaagct
ccacccatgcaagtagaaggatctgccaaggaattacaggcaccacggaaacgcacagtc
aaagtgacactgacacctctaaaaatggaaaatgagagtcaatccaaaaatgccctgaaa
gaaagtagtcctgcttcccctttgcaaatagagtcaacatctcccacagaaccaatttca
gcctctgaaaatccaggagatggtccagtggcccaaccaagccccaataatacctcatgc
caggattctcaaagtaacaactatcagaatcttccagtacaggacagaaacctaatgctt
ccagatggccccaaacctcaggaggatggctcttttaaaaggaggtatccccgtcgcagt
gcccgtgcacgttctaacatgttttttgggcttaccccactctatggagtaagatcctat
ggtgaagaagacattccattctacagcagctcaactgggaagaagcgaggcaagagatca
gctgaaggacaggtggatggggccgatgacttaagcacttcagatgaagacgacttatac
tattacaacttcactagaacagtgatttcttcaggtggagaggaacgactggcatcccat
aatttatttcgggaggaggaacagtgtgatcttccaaaaatctcacagttggatggtgtt
gatgatgggacagagagtgatactagtgtcacagccacaacaaggaaaagcagccagatt
ccaaaaagaaatggtaaagaaaatggaacagagaacttaaagattgatagacctgaagat
gctggggagaaagaacatgtcactaagagttctgttggccacaaaaatgagccaaagatg
gataactgccattctgtaagcagagttaaaacacagggacaagattccttggaagctcag
ctcagctcattggagtcaagccgcagagtccacacaagtaccccctccgacaaaaattta
ctggacacctataatactgagctcctgaaatcagattcagacaataacaacagtgatgac
tgtgggaatatcctgccttcagacattatggactttgtactaaagaatactccatccatg
caggctttgggtgagagcccagagtcatcttcatcagaactcctgaatcttggtgaagga
ttgggtcttgacagtaatcgtgaaaaagacatgggtctttttgaagtattttctcagcag
ctgcctacaacagaacctgtggatagtagtgtctcttcctctatctcagcagaggaacag
tttgagttgcctctagagctaccatctgatctgtctgtcttgaccacccggagtcccact
gtccccagccagaatcccagtagactagctgttatctcagactcaggggagaagagagta
accatcacagaaaaatctgtagcctcctctgaaagtgacccagcactgctgagcccagga
gtagatccaactcctgaaggccacatgactcctgatcattttatccaaggacacatggat
gcagaccacatctctagccctccttgtggttcagtagagcaaggtcatggcaacaatcag
gatttaactaggaacagtagcacccctggccttcaggtacctgtttccccaactgttccc
atccagaaccagaagtatgtgcccaattctactgatagtcctggcccgtctcagatttcc
aatgcagctgtccagaccactccaccccacctgaagccagccactgagaaactcatagtt
gttaaccagaacatgcagccactttatgttctccaaactcttccaaatggagtgacccaa
aaaatccaattgacctcttctgttagttctacacccagtgtgatggagacaaatacttca
gtattgggacccatgggaggtggtctcacccttaccacaggactaaatccaagcttgcca
acttctcaatctttgttcccttctgctagcaaaggattgctacccatgtctcatcaccag
cacttacattccttccctgcagctactcaaagtagtttcccaccaaacatcagcaatcct
ccttcaggcctgcttattggggttcagcctcctccggatccccaacttttggtttcagaa
tccagccagaggacagacctcagtaccacagtagccactccatcctctggactcaagaaa
agacccatatctcgtctacagacccgaaagaataaaaaacttgctccctctagtacccct
tcaaacattgccccttctgatgtggtttctaatatgacattgattaacttcacaccctcc
cagcttcctaatcatccaagtctgttagatttggggtcacttaatacttcatctcaccga
actgtccccaacatcataaaaagatctaaatctagcatcatgtattttgaaccggcaccc
ctgttaccacagagtgtgggaggaactgctgccacagcggcaggcacatcaacaataagc
caggatactagccacctcacatcagggtctgtgtctggcttggcatccagttcctctgtc
ttgaatgttgtatccatgcaaactaccacaacccctacaagtagtgcgtcagttccagga
cacgtcaccttaaccaacccaaggttgcttggtaccccagatattggctcaataagcaat
cttttaatcaaagctagccagcagagcctggggattcaggaccagcctgtggctttaccg
ccaagttcaggaatgtttccacaactggggacatcacagaccccctctactgctgcaata
acagcggcatctagcatctgtgtgctcccctccactcagactacgggcataacagccgct
tcaccttctggggaagcagacgaacactatcagcttcagcatgtgaaccagctccttgcc
agcaaaactgggattcattcttcccagcgtgatcttgattctgcttcagggccccaggta
tccaactttacccagacggtagacgctcctaatagcatgggactggagcagaacaaggct
ttatcctcagctgtgcaagccagccccacctctcctgggggttctccatcctctccatct
tctggacagcggtcagcaagcccttcagtgccgggtcccactaaacccaaaccaaaaacc
aaacggtttcagctgcctctagacaaagggaatggcaagaagcacaaagtttcccatttg
cggaccagttcttctgaagcacacattccagaccaagaaacgacatccctgacctcaggc
acagggactccaggagcagaggctgagcagcaggatacagctagcgtggagcagtcctcc
cagaaggagtgtgggcaacctgcagggcaagtcgctgttcttccggaagttcaggtgacc
caaaatccagcaaatgaacaagaaagtgcagaacctaaaacagtggaagaagaggaaagt
aatttcagctccccactgatgctttggcttcagcaagaacaaaagcggaaggaaagcatt
actgagaaaaaacccaagaaaggacttgtttttgaaatttccagtgatgatggctttcag
atctgtgcagaaagtattgaagatgcctggaagtcattgacagataaagtccaggaagct
cgatcaaatgcccgcctaaagcagctctcatttgcaggtgttaacggtttgaggatgctg
gggattctccatgatgcagttgtgttcctcattgagcagctgtctggtgccaagcactgt
cgaaattacaaattccgtttccacaagccagaggaggccaatgaaccccccttgaaccct
cacggctcagccagggctgaagtccacctcaggaagtcagcatttgacatgtttaacttc
ctggcttctaaacatcgtcagcctcctgaatacaaccccaatgatgaagaagaggaggag
gtacagctgaagtcagctcggagggcaactagcatggatctgccaatgcccatgcgcttc
cggcacttaaaaaagacttctaaggaggcagttggtgtctacaggtctcccatccatggc
cggggtcttttctgtaagagaaacattgatgcaggtgagatggtgattgagtatgccggc
aacgtcatccgctccatccagactgacaagcgggaaaagtattacgacagcaagggcatt
ggttgctatatgttccgaattgatgactcagaggtagtggatgccaccatgcatggaaat
gctgcacgcttcatcaatcactcgtgtgagcctaactgctattctcgggtcatcaatatt
gatgggcagaagcacattgtcatctttgccatgcgtaagatctaccgaggagaggaactc
acttacgactataagttccccattgaggatgccagcaacaagctgccctgcaactgtggc
gccaagaaatgccggaagttcctaaactaa

KEGG   Homo sapiens (human): 54554
Entry
54554             CDS       T01001                                 
Symbol
WDR5B
Name
(RefSeq) WD repeat domain 5B
  KO
K14963  COMPASS component SWD3
Organism
hsa  Homo sapiens (human)
Pathway
hsa04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    54554 (WDR5B)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    54554 (WDR5B)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HAT complexes
    ATAC complex
     54554 (WDR5B)
    NSL complex
     54554 (WDR5B)
   HMT complexes
    COMPASS/SET1 complex
     54554 (WDR5B)
    COMPASS/SET1 complex (yeast)
     54554 (WDR5B)
    MLL-HCF complex
     54554 (WDR5B)
    MLL3/MLL4 complex
     54554 (WDR5B)
SSDB
Motif
Pfam: WD40 ANAPC4_WD40 NBCH_WD40 eIF2A Nup160 Ge1_WD40 WD40_like Ricin_B_lectin
Other DBs
NCBI-GeneID: 54554
NCBI-ProteinID: NP_061942
HGNC: 17826
Ensembl: ENSG00000196981
Vega: OTTHUMG00000159489
Pharos: Q86VZ2(Tdark)
UniProt: Q86VZ2
LinkDB
Position
3:complement(122411846..122416062)
AA seq 330 aa
MATKESRDAKAQLALSSSANQSKEVPENPNYALKCTLVGHTEAVSSVKFSPNGEWLASSS
ADRLIIIWGAYDGKYEKTLYGHNLEISDVAWSSDSSRLVSASDDKTLKLWDVRSGKCLKT
LKGHSNYVFCCNFNPPSNLIISGSFDETVKIWEVKTGKCLKTLSAHSDPVSAVHFNCSGS
LIVSGSYDGLCRIWDAASGQCLKTLVDDDNPPVSFVKFSPNGKYILTATLDNTLKLWDYS
RGRCLKTYTGHKNEKYCIFANFSVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVV
ISAACHPTENLIASAALENDKTIKLWMSNH
NT seq 993 nt   +upstreamnt  +downstreamnt
atggcaaccaaggagtcaagagacgccaaagcacagttggccctctcctcatcggccaat
cagagcaaggaagtgcctgaaaacccaaactatgctctcaaatgtactcttgtgggacac
acggaagcagtgtcatcagttaagtttagtcctaatggagaatggctagcaagttcttct
gctgataggctaatcataatttggggagcatatgatggaaaatatgagaaaacactctat
ggtcataatttggaaatatcggatgttgcctggtcatcagattccagtcgtcttgtttct
gcctcagatgataaaactctaaaattatgggatgtgagatctggaaaatgtttgaaaaca
ctgaaggggcacagtaattatgtcttttgttgtaacttcaatccgccatccaaccttata
atctcgggatcttttgatgagactgtaaaaatatgggaggtgaaaacaggaaagtgtctc
aagactttgtctgctcattctgacccagtttctgctgttcattttaattgtagtgggtcc
ttgatagtgtcaggtagctatgatggcctctgtagaatctgggatgctgcatcaggtcag
tgtttaaaaacgctcgttgatgacgataaccctcctgtctcttttgtaaaattttctcca
aatggtaaatacattctcactgcaactttggacaacactcttaaactatgggattatagc
agaggcaggtgcctgaaaacatacactggtcataagaatgagaaatattgcatatttgcc
aatttttcagttactggtggaaagtggattgtgtctggttccgaggataacctggtttac
atttggaaccttcagactaaagagattgtgcagaaattacaaggccatacagatgttgtg
atctcagcagcttgtcatcctacagaaaacctcatcgcatcagcagcattagaaaatgac
aaaacaattaaactgtggatgagtaaccactaa

KEGG   Homo sapiens (human): 5929
Entry
5929              CDS       T01001                                 
Symbol
RBBP5, RBQ3, SWD1
Name
(RefSeq) RB binding protein 5, histone lysine methyltransferase complex subunit
  KO
K14961  COMPASS component SWD1
Organism
hsa  Homo sapiens (human)
Pathway
hsa04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    5929 (RBBP5)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    5929 (RBBP5)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HMT complexes
    COMPASS/SET1 complex
     5929 (RBBP5)
    COMPASS/SET1 complex (yeast)
     5929 (RBBP5)
    MLL-HCF complex
     5929 (RBBP5)
    MLL3/MLL4 complex
     5929 (RBBP5)
SSDB
Motif
Pfam: WD40 ANAPC4_WD40 DUF2457
Other DBs
NCBI-GeneID: 5929
NCBI-ProteinID: NP_005048
OMIM: 600697
HGNC: 9888
Ensembl: ENSG00000117222
Vega: OTTHUMG00000037104
Pharos: Q15291(Tbio)
UniProt: Q15291 A0A024R9B5
Structure
LinkDB
Position
1:complement(205086142..205121978)
AA seq 538 aa
MNLELLESFGQNYPEEADGTLDCISMALTCTFNRWGTLLAVGCNDGRIVIWDFLTRGIAK
IISAHIHPVCSLCWSRDGHKLVSASTDNIVSQWDVLSGDCDQRFRFPSPILKVQYHPRDQ
NKVLVCPMKSAPVMLTLSDSKHVVLPVDDDSDLNVVASFDRRGEYIYTGNAKGKILVLKT
DSQDLVASFRVTTGTSNTTAIKSIEFARKGSCFLINTADRIIRVYDGREILTCGRDGEPE
PMQKLQDLVNRTPWKKCCFSGDGEYIVAGSARQHALYIWEKSIGNLVKILHGTRGELLLD
VAWHPVRPIIASISSGVVSIWAQNQVENWSAFAPDFKELDENVEYEERESEFDIEDEDKS
EPEQTGADAAEDEEVDVTSVDPIAAFCSSDEELEDSKALLYLPIAPEVEDPEENPYGPPP
DAVQTSLMDEGASSEKKRQSSADGSQPPKKKPKTTNIELQGVPNDEVHPLLGVKGDGKSK
KKQAGRPKGSKGKEKDSPFKPKLYKGDRGLPLEGSAKGKVQAELSQPLTAGGAISELL
NT seq 1617 nt   +upstreamnt  +downstreamnt
atgaacctcgagttgctggagtcctttgggcagaactatccagaggaagctgatggaact
ttggattgtatcagcatggctttgacttgcacctttaacaggtggggcacactgcttgca
gttggctgtaatgatggccgaattgtcatctgggatttcttgacaagaggcattgctaaa
ataattagtgcacacatccatccagtgtgttctttatgctggagccgagatggtcataaa
ctcgtgagtgcttccactgataacatagtgtcacagtgggatgttctttcaggcgactgt
gaccagaggtttcgattcccttcacccatcttaaaagtccaatatcatccacgagatcag
aacaaggttctcgtgtgtcccatgaaatctgctcctgtcatgttgaccctttcagattcc
aaacatgttgttctgccggtggacgatgactccgatttgaacgttgtggcatcttttgat
aggcgaggggaatatatttatacgggaaacgcaaaaggcaagattttggtcctaaaaaca
gattctcaggatcttgttgcttccttcagagtgacaactggaacaagcaataccacagcc
attaagtcaattgagtttgcccggaaggggagttgctttttaattaacacggcagatcga
ataatcagagtttatgatggcagagaaatcttaacatgtggaagagatggagagcctgaa
cctatgcagaaattgcaggatttggtgaataggaccccatggaagaaatgttgtttctct
ggggatggggaatacatcgtggcaggttctgcccggcagcatgccctgtacatctgggag
aagagcattggcaacctggtgaagattctccatgggacgagaggagaactcctcttggat
gtagcttggcatcctgttcgacccatcatagcatccatttccagtggagtggtatctatc
tgggcacagaatcaagtagaaaactggagtgcatttgcaccagacttcaaagaattggat
gaaaatgtagaatacgaagaaagggaatcagagtttgatattgaagatgaagataagagt
gagcctgagcagacaggggctgatgctgcagaagatgaggaagtggatgtcaccagcgtg
gaccctattgctgccttctgtagcagtgatgaagagctggaagattcaaaggctctattg
tatttacccattgcccctgaggtagaagacccagaagaaaatccttacggccccccaccg
gatgcagtccaaacctccttgatggatgaaggggctagttcagagaagaagaggcagtcc
tcagcagatgggtcccagccacctaagaagaaacccaaaacaaccaatatagaacttcaa
ggagtaccaaatgatgaagtccatccactactgggtgtgaagggggatggcaaatccaag
aagaagcaagcaggccggcctaaaggatcaaaaggtaaagagaaagattctccatttaaa
ccgaaactctacaaaggggacagaggtttacctctggaaggatcagcgaagggtaaagtg
caggcggaactcagccagcccttgacagcaggaggagcaatctcagaactgttatga

KEGG   Homo sapiens (human): 8085
Entry
8085              CDS       T01001                                 
Symbol
KMT2D, AAD10, ALR, CAGL114, KABUK1, KMS, MLL2, MLL4, TNRC21
Name
(RefSeq) lysine methyltransferase 2D
  KO
K09187  [histone H3]-lysine4 N-trimethyltransferase MLL2 [EC:2.1.1.354]
Organism
hsa  Homo sapiens (human)
Pathway
hsa00310  Lysine degradation
hsa01100  Metabolic pathways
hsa04934  Cushing syndrome
Disease
H00570  Kabuki syndrome
H01613  Follicular lymphoma
H02434  Diffuse large B-cell lymphoma, not otherwise specified
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09100 Metabolism
  09105 Amino acid metabolism
   00310 Lysine degradation
    8085 (KMT2D)
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    8085 (KMT2D)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    8085 (KMT2D)
Enzymes [BR:hsa01000]
 2. Transferases
  2.1  Transferring one-carbon groups
   2.1.1  Methyltransferases
    2.1.1.354  [histone H3]-lysine4 N-trimethyltransferase
     8085 (KMT2D)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HMTs (histone methyltransferases)
    HKMTs (histone lysine methyltransferases)
     8085 (KMT2D)
   HMT complexes
    MLL-HCF complex
     8085 (KMT2D)
SSDB
Motif
Pfam: PHD FYRC FYRN SET zf-HC5HC2H zf-HC5HC2H_2 HMG_box RCC1
Other DBs
NCBI-GeneID: 8085
NCBI-ProteinID: NP_003473
OMIM: 602113
HGNC: 7133
Ensembl: ENSG00000167548
Vega: OTTHUMG00000166524
Pharos: O14686(Tbio)
UniProt: O14686 Q59FG6 Q6PIA1
Structure
LinkDB
Position
12:complement(49018978..49060794)
AA seq 5537 aa
MDSQKLAGEDKDSEPAADGPAASEDPSATESDLPNPHVGEVSVLSSGSPRLQETPQDCSG
GPVRRCALCNCGEPSLHGQRELRRFELPFDWPRCPVVSPGGSPGPNEAVLPSEDLSQIGF
PEGLTPAHLGEPGGSCWAHHWCAAWSAGVWGQEGPELCGVDKAIFSGISQRCSHCTRLGA
SIPCRSPGCPRLYHFPCATASGSFLSMKTLQLLCPEHSEGAAYLEEARCAVCEGPGELCD
LFFCTSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHT
FCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWFENYSLCHRCHKAQGGQTIRS
VAEQHTPVCSRFSPPEPGDTPTDEPDALYVACQGQPKGGHVTSMQPKEPGPLQCEAKPLG
KAGVQLEPQLEAPLNEEMPLLPPPEESPLSPPPEESPTSPPPEASRLSPPPEELPASPLP
EALHLSRPLEESPLSPPPEESPLSPPPESSPFSPLEESPLSPPEESPPSPALETPLSPPP
EASPLSPPFEESPLSPPPEELPTSPPPEASRLSPPPEESPMSPPPEESPMSPPPEASRLF
PPFEESPLSPPPEESPLSPPPEASRLSPPPEDSPMSPPPEESPMSPPPEVSRLSPLPVVS
RLSPPPEESPLSPPPEESPTSPPPEASRLSPPPEDSPTSPPPEDSPASPPPEDSLMSLPL
EESPLLPLPEEPQLCPRSEGPHLSPRPEEPHLSPRPEEPHLSPQAEEPHLSPQPEEPCLC
AVPEEPHLSPQAEGPHLSPQPEELHLSPQTEEPHLSPVPEEPCLSPQPEESHLSPQSEEP
CLSPRPEESHLSPELEKPPLSPRPEKPPEEPGQCPAPEELPLFPPPGEPSLSPLLGEPAL
SEPGEPPLSPLPEELPLSPSGEPSLSPQLMPPDPLPPPLSPIITAAAPPALSPLGELEYP
FGAKGDSDPESPLAAPILETPISPPPEANCTDPEPVPPMILPPSPGSPVGPASPILMEPL
PPQCSPLLQHSLVPQNSPPSQCSPPALPLSVPSPLSPIGKVVGVSDEAELHEMETEKVSE
PECPALEPSATSPLPSPMGDLSCPAPSPAPALDDFSGLGEDTAPLDGIDAPGSQPEPGQT
PGSLASELKGSPVLLDPEELAPVTPMEVYPECKQTAGQGSPCEEQEEPRAPVAPTPPTLI
KSDIVNEISNLSQGDASASFPGSEPLLGSPDPEGGGSLSMELGVSTDVSPARDEGSLRLC
TDSLPETDDSLLCDAGTAISGGKAEGEKGRRRSSPARSRIKQGRSSSFPGRRRPRGGAHG
GRGRGRARLKSTASSIETLVVADIDSSPSKEEEEEDDDTMQNTVVLFSNTDKFVLMQDMC
VVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECIVCEVCGQASDPS
RLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFHCEWQNSYTHCGP
CASLVTCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQAADEGFDCVSCQPYV
VKPVAPVAPPELVPMKVKEPEPQYFRFEGVWLTETGMALLRNLTMSPLHKRRQRRGRLGL
PGEAGLEGSEPSDALGPDDKKDGDLDTDELLKGEGGVEHMECEIKLEGPVSPDVEPGKEE
TEESKKRKRKPYRPGIGGFMVRQRKSHTRTKKGPAAQAEVLSGDGQPDEVIPADLPAEGA
VEQSLAEGDEKKKQQRRGRKKSKLEDMFPAYLQEAFFGKELLDLSRKALFAVGVGRPSFG
LGTPKAKGDGGSERKELPTSQKGDDGPDIADEESRGLEGKADTPGPEDGGVKASPVPSDP
EKPGTPGEGMLSSDLDRISTEELPKMESKDLQQLFKDVLGSEREQHLGCGTPGLEGSRTP
LQRPFLQGGLPLGNLPSSSPMDSYPGLCQSPFLDSRERGGFFSPEPGEPDSPWTGSGGTT
PSTPTTPTTEGEGDGLSYNQRSLQRWEKDEELGQLSTISPVLYANINFPNLKQDYPDWSS
RCKQIMKLWRKVPAADKAPYLQKAKDNRAAHRINKVQKQAESQINKQTKVGDIARKTDRP
ALHLRIPPQPGALGSPPPAAAPTIFIGSPTTPAGLSTSADGFLKPPAGSVPGPDSPGELF
LKLPPQVPAQVPSQDPFGLAPAYPLEPRFPTAPPTYPPYPSPTGAPAQPPMLGASSRPGA
GQPGEFHTTPPGTPRHQPSTPDPFLKPRCPSLDNLAVPESPGVGGGKASEPLLSPPPFGE
SRKALEVKKEELGASSPSYGPPNLGFVDSPSSGTHLGGLELKTPDVFKAPLTPRASQVEP
QSPGLGLRPQEPPPAQALAPSPPSHPDIFRPGSYTDPYAQPPLTPRPQPPPPESCCALPP
RSLPSDPFSRVPASPQSQSSSQSPLTPRPLSAEAFCPSPVTPRFQSPDPYSRPPSRPQSR
DPFAPLHKPPRPQPPEVAFKAGSLAHTSLGAGGFPAALPAGPAGELHAKVPSGQPPNFVR
SPGTGAFVGTPSPMRFTFPQAVGEPSLKPPVPQPGLPPPHGINSHFGPGPTLGKPQSTNY
TVATGNFHPSGSPLGPSSGSTGESYGLSPLRPPSVLPPPAPDGSLPYLSHGASQRSGITS
PVEKREDPGTGMGSSLATAELPGTQDPGMSGLSQTELEKQRQRQRLRELLIRQQIQRNTL
RQEKETAAAAAGAVGPPGSWGAEPSSPAFEQLSRGQTPFAGTQDKSSLVGLPPSKLSGPI
LGPGSFPSDDRLSRPPPPATPSSMDVNSRQLVGGSQAFYQRAPYPGSLPLQQQQQQLWQQ
QQATAATSMRFAMSARFPSTPGPELGRQALGSPLAGISTRLPGPGEPVPGPAGPAQFIEL
RHNVQKGLGPGGTPFPGQGPPQRPRFYPVSEDPHRLAPEGLRGLAVSGLPPQKPSAPPAP
ELNNSLHPTPHTKGPTLPTGLELVNRPPSSTELGRPNPLALEAGKLPCEDPELDDDFDAH
KALEDDEELAHLGLGVDVAKGDDELGTLENLETNDPHLDDLLNGDEFDLLAYTDPELDTG
DKKDIFNEHLRLVESANEKAEREALLRGVEPGPLGPEERPPPAADASEPRLASVLPEVKP
KVEEGGRHPSPCQFTIATPKVEPAPAANSLGLGLKPGQSMMGSRDTRMGTGPFSSSGHTA
EKASFGATGGPPAHLLTPSPLSGPGGSSLLEKFELESGALTLPGGPAASGDELDKMESSL
VASELPLLIEDLLEHEKKELQKKQQLSAQLQPAQQQQQQQQQHSLLSAPGPAQAMSLPHE
GSSPSLAGSQQQLSLGLAGARQPGLPQPLMPTQPPAHALQQRLAPSMAMVSNQGHMLSGQ
HGGQAGLVPQQSSQPVLSQKPMGTMPPSMCMKPQQLAMQQQLANSFFPDTDLDKFAAEDI
IDPIAKAKMVALKGIKKVMAQGSIGVAPGMNRQQVSLLAQRLSGGPSSDLQNHVAAGSGQ
ERSAGDPSQPRPNPPTFAQGVINEADQRQYEEWLFHTQQLLQMQLKVLEEQIGVHRKSRK
ALCAKQRTAKKAGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKEHTNLMAEYRNKQQ
QQQQQQQQQQQQHSAVLALSPSQSPRLLTKLPGQLLPGHGLQPPQGPPGGQAGGLRLTPG
GMALPGQPGGPFLNTALAQQQQQQHSGGAGSLAGPSGGFFPGNLALRSLGPDSRLLQERQ
LQLQQQRMQLAQKLQQQQQQQQQQQHLLGQVAIQQQQQQGPGVQTNQALGPKPQGLMPPS
SHQGLLVQQLSPQPPQGPQGMLGPAQVAVLQQQHPGALGPQGPHRQVLMTQSRVLSSPQL
AQQGQGLMGHRLVTAQQQQQQQQHQQQGSMAGLSHLQQSLMSHSGQPKLSAQPMGSLQQL
QQQQQLQQQQQLQQQQQQQLQQQQQLQQQQLQQQQQQQQLQQQQQQQLQQQQQQLQQQQQ
QQQQQFQQQQQQQQMGLLNQSRTLLSPQQQQQQQVALGPGMPAKPLQHFSSPGALGPTLL
LTGKEQNTVDPAVSSEATEGPSTHQGGPLAIGTTPESMATEPGEVKPSLSGDSQLLLVQP
QPQPQPSSLQLQPPLRLPGQQQQQVSLLHTAGGGSHGQLGSGSSSEASSVPHLLAQPSVS
LGDQPGSMTQNLLGPQQPMLERPMQNNTGPQPPKPGPVLQSGQGLPGVGIMPTVGQLRAQ
LQGVLAKNPQLRHLSPQQQQQLQALLMQRQLQQSQAVRQTPPYQEPGTQTSPLQGLLGCQ
PQLGGFPGPQTGPLQELGAGPRPQGPPRLPAPPGALSTGPVLGPVHPTPPPSSPQEPKRP
SQLPSPSSQLPTEAQLPPTHPGTPKPQGPTLEPPPGRVSPAAAQLADTLFSKGLGPWDPP
DNLAETQKPEQSSLVPGHLDQVNGQVVPEASQLSIKQEPREEPCALGAQSVKREANGEPI
GAPGTSNHLLLAGPRSEAGHLLLQKLLRAKNVQLSTGRGSEGLRAEINGHIDSKLAGLEQ
KLQGTPSNKEDAAARKPLTPKPKRVQKASDRLVSSRKKLRKEDGVRASEALLKQLKQELS
LLPLTEPAITANFSLFAPFGSGCPVNGQSQLRGAFGSGALPTGPDYYSQLLTKNNLSNPP
TPPSSLPPTPPPSVQQKMVNGVTPSEELGEHPKDAASARDSERALRDTSEVKSLDLLAAL
PTPPHNQTEDVRMESDEDSDSPDSIVPASSPESILGEEAPRFPHLGSGRWEQEDRALSPV
IPLIPRASIPVFPDTKPYGALGLEVPGKLPVTTWEKGKGSEVSVMLTVSAAAAKNLNGVM
VAVAELLSMKIPNSYEVLFPESPARAGTEPKKGEAEGPGGKEKGLEGKSPDTGPDWLKQF
DAVLPGYTLKSQLDILSLLKQESPAPEPPTQHSYTYNVSNLDVRQLSAPPPEEPSPPPSP
LAPSPASPPTEPLVELPTEPLAEPPVPSPLPLASSPESARPKPRARPPEEGEDSRPPRLK
KWKGVRWKRLRLLLTIQKGSGRQEDEREVAEFMEQLGTALRPDKVPRDMRRCCFCHEEGD
GATDGPARLLNLDLDLWVHLNCALWSTEVYETQGGALMNVEVALHRGLLTKCSLCQRTGA
TSSCNRMRCPNVYHFACAIRAKCMFFKDKTMLCPMHKIKGPCEQELSSFAVFRRVYIERD
EVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQMADFHSATALYPVGYEATRIYWSLR
TNNRRCCYRCSIGENNGRPEFVIKVIEQGLEDLVFTDASPQAVWNRIIEPVAAMRKEADM
LRLFPEYLKGEELFGLTVHAVLRIAESLPGVESCQNYLFRYGRHPLMELPLMINPTGCAR
SEPKILTHYKRPHTLNSTSMSKAYQSTFTGETNTPYSKQFVHSKSSQYRRLRTEWKNNVY
LARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNEHV
IDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQ
HKIPCHCGAWNCRKWMN
NT seq 16614 nt   +upstreamnt  +downstreamnt
atggacagccagaagctggctggtgaggataaagattcagaaccggcagctgatggacct
gcagcttctgaggacccaagtgccactgagtcagacctgcccaacccacatgtgggagag
gtctctgtccttagttctgggagtcccaggcttcaggagactcctcaggactgcagtggg
ggtccggtgcggcgttgtgctctctgtaactgcggggagcccagtctacacgggcagcgg
gagctacggcgctttgagttgccatttgattggccccggtgtccagtggtgtcccctggg
gggagcccagggcccaatgaggcagtgctgcccagtgaggacctatcacagattggtttc
cctgagggccttacacctgcccacctaggagaacctggagggtcctgctgggctcaccat
tggtgtgctgcatggtcggcaggcgtatgggggcaggagggcccagaactatgtggtgtg
gacaaggccatcttctcagggatctcacagcgctgctcccactgcaccaggctcggtgcc
tccatcccttgccgctcacctggatgtccacggctttaccacttcccctgcgcgactgcc
agcggttccttcctatccatgaaaacactgcagctgctatgcccagagcacagtgagggg
gctgcatatctggaggaggctcgctgtgcagtgtgtgaggggccaggggagttgtgtgac
ctgttcttctgtaccagctgtgggcatcactatcacggggcctgcctggacactgctctg
actgcccgcaaacgtgctggctggcagtgccctgaatgcaaagtgtgccaagcctgcagg
aaacctgggaatgactctaagatgttggtttgtgagacgtgtgacaaaggataccatact
ttctgcctaaaaccacccatggaggaactgcctgctcactcttggaagtgcaaggcgtgc
cgggtgtgccgggcctgtggggcgggctcagcagaactgaatcccaactcggagtggttt
gagaactactctctctgtcaccgctgtcacaaagcccagggaggtcagactatccgctcc
gttgctgagcagcataccccggtgtgtagcagattttcacccccagagcctggcgatacc
cccactgacgagcccgatgctctgtacgttgcatgccaagggcagccaaagggtgggcac
gtgacctctatgcaacccaaggaaccagggcccctgcaatgtgaagccaaaccactaggg
aaagcaggggtccaacttgagccccagttggaggcccccctaaacgaggagatgccactg
ctgcccccacctgaggagtcacccctgtccccaccacctgaggaatcacccacgtcccca
ccacctgaggcatcacgcctgtcaccaccacctgaggaattgcccgcatccccacttcct
gaggcattgcacctgtcccggccgctggaggaatcgcccctctctccgccgcctgaggag
tctcctctgtctcccccacctgaatcatcacctttttctccactggaggagtcgcccttg
tctccaccggaagagtcacccccatctcctgcacttgagacgcctctatccccaccacct
gaagcatcgcccctgtccccaccatttgaagaatctcctttgtccccgccacctgaggaa
ttgcccacttccccgccacctgaagcatctcgcctgtctccaccacctgaggagtcaccc
atgtcccctccacctgaagagtcacccatgtctccaccaccggaggcatctcgtctgttc
ccaccatttgaagagtctcctctgtcccctccacctgaggagtctcccctttccccacca
cctgaggcatcacgcctgtccccaccacctgaggactcgcctatgtccccaccacctgaa
gaatcacctatgtcccccccacctgaggtatcgcgcctatcccccctgcctgtggtgtca
cgcctgtctccaccgcctgaggaatctcccttgtccccaccgcctgaggagtctcccacg
tcccctccacctgaggcttcacgcctctccccaccacctgaggactcccccacatcccca
ccacctgaggactcacctgcttccccaccaccggaggactcgctcatgtccctgccgctg
gaggagtcacccctgttgccactacctgaggagccgcaactctgcccccggtccgagggg
ccgcacctgtcaccccggcctgaggagccgcacctgtccccccggcctgaggagccacac
ctatctccgcaggctgaggagccacacctgtccccccagcctgaggagccatgcctatgc
gctgtgcctgaggagccacacttgtccccccaggctgagggaccacatctgtcccctcag
cctgaggaattgcacctgtccccccagactgaggagccgcacctgtctcctgtgcctgag
gagccatgcttgtccccccaacctgaggaatcacacctgtccccccagtctgaggagcca
tgcctgtccccccggcctgaggaatcgcatctgtcccctgagcttgagaagccacccctg
tcccctcggcctgaaaagccccctgaggagccaggccaatgccctgcacctgaggagctg
cccttgttccctccccctggggaaccatccttatctcccttgcttggagagccagccctg
tctgagcctggggaaccacctctgtcccctctgcccgaggagctgccgttgtccccatct
ggggagccatccttgtcgcctcagctgatgccaccagatccccttcctcctccactctca
cccatcatcacagctgcggccccaccggccctgtctcctttgggggagttagagtacccc
tttggtgccaaaggggacagtgaccctgagtcaccgttggctgcccccatcctggagaca
cccatcagccctccaccagaagctaactgcactgaccctgagcctgtcccccctatgatc
cttcccccatctccaggctccccagtggggccggcttctcccatcctgatggagcccctt
cctcctcagtgttcgccactccttcagcattccctggttccccaaaactcccctccttcc
cagtgctctcctcctgccctaccactgtccgttccctccccgttgagtcccatagggaag
gtagtgggggtctcagatgaggctgagctgcacgagatggagactgagaaagtttcagaa
cctgaatgcccagccttggaacccagtgccaccagtcctctcccttccccaatgggggac
ctttcctgccccgcccccagccctgccccagccctggatgacttctctggcctaggggaa
gacacagcccctctggatgggattgatgctccgggttcacagccagagcctggacagacc
cctggcagtttggctagtgaacttaaaggctcccctgtgctcctggaccccgaggagctg
gcccctgtgacccctatggaggtctaccccgaatgcaagcagacagcagggcagggctca
ccatgtgaagaacaggaagagccacgtgcaccggtggcccccacaccacccactctcatc
aaatccgacatcgttaacgagatctctaatctgagccagggtgatgccagtgccagtttt
cctggctcagagcccctcctgggctctccagacccggaggggggtggctccctgtccatg
gagttgggggtctctacggatgttagtccagcccgagatgagggctccctacggctctgt
actgactcactgccagagactgatgactcactattgtgcgatgctgggacagctatcagc
ggaggcaaagctgagggggagaaggggcggcggcgcagctccccagcccgttcccgcatc
aaacagggtcgcagcagcagtttcccaggaagacgccggcctcgtggaggagcccatgga
ggacgtggtagaggacgggcccggctaaagtcaactgcttcttccattgagactctggta
gttgctgacattgatagctctcccagtaaggaggaggaggaagaagatgatgacaccatg
cagaataccgtggttctcttctccaacacagacaaatttgtcctaatgcaggacatgtgt
gtggtatgtggcagctttggccggggggcagagggccacctccttgcctgttcgcagtgc
tctcagtgctatcacccttactgtgtcaacagcaagatcaccaaggtgatgctgctcaag
ggctggcgttgtgtggagtgtattgtgtgtgaggtgtgtggccaggcctccgacccctca
cgcctgctgctctgtgatgactgtgatattagctaccacacatactgcctggacccccca
ctgctcaccgtccccaagggcggctggaagtgcaagtggtgtgtgtcctgtatgcagtgt
ggggctgcttcccctggcttccactgtgaatggcagaatagttacacacactgtgggccc
tgtgccagcctggtgacctgccctatctgtcatgctccttacgtagaagaggacctacta
atccagtgccgccactgtgaacggtggatgcatgcaggctgtgagagcctcttcacagag
gacgatgtggagcaggcagccgatgaaggctttgactgtgtctcctgccagccctacgtg
gtaaagcctgtggcgcctgttgcacctccagagctggtgcccatgaaggtgaaagagcca
gagccccagtactttcgcttcgaaggtgtgtggctgacagaaactggcatggccttgctg
cgtaacctgaccatgtcaccactgcacaagcggcgccaacggcgaggacggcttggcctc
ccaggcgaggcaggattggagggttctgagccctcagatgcccttggccctgatgacaag
aaggatggggacctggacaccgatgagctgctcaagggtgaaggtggtgtggagcacatg
gagtgcgaaattaaactggagggccccgtcagccctgatgtggagcctggcaaagaggag
accgaggaaagcaaaaaacgcaagcgtaaaccatatcggcctggcattggtggtttcatg
gtgcgacagcggaaatcccacacacgcacgaaaaaggggcctgctgcacaggcggaggtg
ttgagtggggatgggcagcccgacgaggtgatacctgctgacctgcctgcagagggcgcc
gtggagcagagcttagctgaaggggatgagaagaagaagcaacagcggcgagggcgcaag
aagagcaaactggaggacatgttccctgcttacttgcaggaagccttctttgggaaggag
ctgctggacctgagccgtaaggccctttttgcagttggggtgggccggccaagctttgga
ctagggaccccaaaagccaagggagatggaggctcagaaaggaaggaactccccacatcg
cagaaaggagatgatggtccagatattgcagatgaagaatcccgtggcctcgagggcaaa
gccgatacaccaggacctgaggatgggggcgtgaaggcatccccagtgcccagtgaccct
gagaagccaggcaccccaggtgaagggatgcttagctctgacttagacaggatttccaca
gaagaactgcccaagatggaatccaaggacctgcagcagctcttcaaggatgttctgggc
tctgaacgagaacagcatctgggttgtggaacccctggcctagaaggcagccgtacgcca
ctgcagaggccctttcttcaaggtggactccctttgggcaatctgccctccagcagccca
atggactcctacccaggcctctgccagtccccgttcctggattctagggagcgcgggggc
ttctttagcccggaacccggtgagcccgacagcccctggacgggctcaggtggcaccacg
ccctccacccccacaacccccaccacggagggtgagggcgacggactctcctataaccag
cggagtcttcagcgctgggagaaggatgaggagttgggccagctgtccaccatctcacct
gtgctctatgccaacattaattttcctaatctcaagcaagactacccagactggtcaagc
cgttgcaaacaaatcatgaagctctggagaaaggttccagcagctgacaaagccccctac
ctgcaaaaggccaaagataaccgggcagctcaccgcatcaacaaggtgcagaagcaggct
gagagccagatcaacaagcagaccaaggtgggcgacatagcccgtaagactgaccgaccg
gccctacatctccgcattcccccgcagccaggggcactgggcagcccgccccccgctgct
gcccccaccattttcattggcagccccactacccccgccggcttgtctacctctgcggac
gggttcctgaagccgccggcgggctcggtgcctggccctgactcgcctggtgagctcttc
ctcaagctcccaccccaggtgcccgcccaagtgccttcgcaggacccctttggactggcc
cctgcctatcccctggagccccgcttccccacggcaccgcccacctatcccccctatcct
agtcctacgggggcccctgcgcagcccccgatgctgggcgcctcatctcgtcctggggct
ggccagccaggggaattccacactaccccacctggcacccccagacaccagccctccaca
cctgacccattcctcaaaccccgctgcccctcgctggataacttggctgtgcctgagagc
cctggggtagggggaggcaaagcttccgagcccctgctctcgcccccaccttttggggag
tcccggaaggccctagaggtgaagaaggaagagcttggggcatcctctcctagctatggg
cccccaaacctgggctttgttgactcaccctcctcaggcacccacctgggtggcctggag
ttaaagacacctgatgtcttcaaagcccccctgacccctcgggcatctcaggtagagccc
cagagcccgggcttgggcctaaggccccaggagccaccccctgcccaggctttggcacct
tctcctccaagtcacccagacatctttcgccctggctcctacactgacccatatgctcag
cccccattgactcctcggccccaacctccgccccctgagagctgctgtgctctgccccct
cgctcactgccctccgaccctttctcccgagtgcctgccagtcctcagtcccagtccagc
tcccagtctccactgacaccccggcctctgtctgctgaagctttttgcccatcacccgtt
acccctcgcttccagtcccctgacccttattctcgcccaccctcacgccctcagtcccgt
gacccatttgccccattgcataagccaccccgaccccagccccctgaagttgcctttaag
gctgggtctctagcccacacttcgctgggggctggggggttcccagcagccctgcccgcg
gggccagcaggtgagctccatgccaaggtcccaagtgggcagccccccaattttgtccgg
tcccctgggacgggtgcatttgtgggcaccccctctcccatgcgtttcactttccctcag
gcagtaggggagccttccctaaagccccctgtccctcagcctggtctcccgccaccccat
gggatcaacagccattttgggcccggccccaccttgggcaagcctcaaagcacaaactac
acagtagccacagggaacttccacccatcgggcagccccctggggcccagcagcgggtcc
acaggggagagctatgggctgtccccactacgccctccgtcggttctgccaccacctgca
cccgacggatccctcccctacctgtcccatggagcctcacagcgatcaggcatcacctct
cctgtcgaaaagcgagaagacccagggactggaatgggtagctctttggcgacagctgaa
ctcccaggtacccaggacccaggcatgtccggccttagccaaacagagctggagaagcaa
cggcagcgccagcgactacgagagctgctgattcggcagcagatccagcgcaacaccctg
cggcaggagaaggaaacagctgcagcagctgcaggagcagtggggcctccaggcagctgg
ggtgctgagcccagcagccctgcctttgagcagctgagtcgaggccagaccccctttgct
gggacacaggacaagagcagccttgtggggttgcccccaagcaagctgagtggccccatc
ctggggccagggtccttccctagcgatgaccgactctcccggccacctccaccagccacg
ccttcctctatggatgtgaacagccggcaactggtaggaggctcccaagctttctatcag
cgagcaccctatcctgggtccctgcccttacagcagcaacagcaacaactgtggcagcaa
caacaggcaacagcagcaacctccatgcgatttgccatgtcagctcgctttccatcaact
cctggacctgaacttggccgccaagccctaggttccccgttggcgggaatttccacccgt
ctgccaggccctggtgagccagtgcctggtccagctggtcctgcccagttcattgagctg
cggcacaatgtacagaaaggactgggacctgggggcactccgtttcctggtcagggccca
cctcagagaccccgtttttaccctgtaagtgaggacccccaccgactggctcctgaaggg
cttcggggcctggcggtatcaggtcttcccccacagaaaccctcagccccaccggcccct
gaattgaacaacagtcttcatccaacaccccacaccaagggtcctaccctgccaactggt
ttggagctggtcaaccggcccccgtcgagcactgagcttggccgccccaatcctctggcc
ctggaagctgggaagttgccctgtgaggatcccgagctggatgacgattttgatgcccac
aaggccctagaggatgatgaagagcttgctcacctgggtctgggtgtggatgtggccaag
ggtgatgatgaacttggcaccttagaaaacctggagaccaatgacccccacttggatgac
ctgctcaatggagacgagtttgacctgctggcatatactgatcctgagctggacactggg
gacaagaaggatatcttcaatgagcacctgaggctggtagaatcggctaatgagaaggct
gaacgggaggccctgctgcggggggtggagccaggacccttgggccctgaggagcgccct
ccccctgctgctgatgcctctgaaccccgcctggcatctgtgctccctgaggtgaagccc
aaggtggaggagggtggacgccacccttctccttgccaattcaccattgctacccccaag
gtagagcccgcacctgctgccaattcccttggcctggggctaaagccaggacagagcatg
atgggcagccgggatacccggatgggcacagggccattttctagcagtgggcacacagct
gagaaggcctcctttggggccacgggaggaccaccagctcacctgctgacccccagccca
ctgagtggcccaggaggatcctccctgctggaaaagtttgagctcgagagtggggctttg
accttgcctggtggacctgcagcatctggggatgagctagacaagatggagagctcactg
gtagccagcgagttacccctgctcattgaggacctgttggagcatgagaagaaggagctg
cagaagaagcagcagctttcagcacagttgcagcctgcccagcagcagcagcaacagcag
cagcagcattccctactgtctgcaccaggccctgcccaggccatgtctttgccacatgag
ggctcttctcccagtttggctgggtcccaacagcagctttccctgggtcttgcaggtgcc
cgacagccaggtttgccccagccactgatgcccacccagccaccagctcatgccctccag
caacgcctggctccatccatggctatggtgtccaatcaagggcatatgctaagtgggcag
catggagggcaggcaggcttggtaccccagcagagctcacagccagtgctatcacagaag
cccatgggcaccatgccaccttccatgtgcatgaagccgcagcaattggcaatgcagcag
cagctggcaaacagcttcttcccagatacagacctggacaaatttgctgcagaagatatc
attgatcccattgcaaaggccaagatggtggctttgaaaggcatcaagaaagtgatggct
cagggcagcattggggtggcacctggtatgaacagacagcaagtgtctctgctagcccag
aggctctcggggggacctagcagtgatctgcagaaccatgtggcagctgggagtggccag
gagcggagtgctggtgatccctcccagcctcgtcccaacccgcccacttttgctcaggga
gtgatcaatgaagctgaccagcggcagtatgaggagtggctgttccatacccagcagctc
ctacagatgcagctgaaggtgctagaggagcagattggtgtacaccgcaagtcccggaag
gctctgtgtgccaagcagcgcactgccaaaaaagctggccgtgagttcccagaagctgat
gctgagaagctcaagctggttacagagcagcagagcaagatccagaaacaactggatcag
gtccggaaacagcagaaggagcacactaatctcatggcagaatatcggaacaagcagcag
caacaacagcagcagcagcagcaacaacagcaacagcactcagctgtgctggctctcagc
ccttcccagagtccccggctgctcaccaagctccctggtcagctgctccctggccatggg
ctgcagccaccacaggggcctccgggtgggcaagccggaggtcttcgcctgacccctggg
ggtatggcactacctggacagcctggtggccccttccttaatacagctctggcccaacag
cagcaacagcaacattctggtggggctggatccctggctggcccttcagggggcttcttc
cctggcaaccttgctcttcgaagcctcggacctgattcaaggcttttacaggaaaggcag
ctgcagctgcagcagcaacgtatgcagctggcccagaaactgcagcagcagcagcagcag
caacagcagcagcagcaccttctaggacaggtggcaatccagcagcaacagcagcagggt
cctggagtacagacaaaccaagctctgggtcccaagccccagggccttatgcctcccagc
agccaccaaggcctcctggtccagcagctgtcccctcaaccaccccaggggccccagggc
atgctgggccctgcccaggtggctgtgttgcagcagcagcaccctggagctttgggcccc
cagggccctcacagacaggtgcttatgacccagtcccgggtgctcagttccccccagctg
gcacagcagggtcagggccttatgggacacaggctggtcacagcccagcagcagcagcag
caacaacagcaccaacagcaagggtccatggcagggctgtcccatcttcagcagagtctg
atgtcacacagtgggcagcccaaactgagcgctcagcccatgggctctttacagcagctt
cagcagcagcagcagctgcaacagcaacagcaacttcagcagcagcagcagcagcagcta
caacagcaacagcaacttcagcagcaacagcttcaacagcagcaacagcagcagcagctt
caacaacagcagcagcaacagcttcaacagcagcaacagcagctacaacagcaacagcaa
caacaacagcagcagtttcaacagcagcagcaacagcagcagatgggccttttaaaccag
agtcgaactttactgtctcctcagcaacaacagcagcagcaagtggcacttggccctggc
atgccagcaaagcctcttcaacacttttctagccctggagccctgggtccaaccctcctc
ctgacgggcaaggaacaaaacaccgtagacccagccgtttcttcagaggccactgagggg
ccctctacacatcagggagggccgttagcaataggaactacccctgagtcaatggccact
gaaccaggagaggtaaagccctcactctctggggactcacaactcctgcttgtccaaccc
cagccccagcctcagcccagctctctgcagctgcagccacctctgaggcttccaggacaa
cagcagcagcaagttagcctgctccacacagcaggtggaggaagccatgggcagctaggc
agtggatcatcttctgaggcctcatctgtgccccacctgctggctcagccctctgtttcc
ttaggggatcagcctgggtccatgacccagaaccttctgggcccccaacagcccatgcta
gagcggcccatgcaaaataatacagggccacaacctcccaaaccaggacctgtcctccag
tctgggcagggtctgcctggggttggaatcatgcctacggtgggtcagcttcgagcacag
ctccaaggagtcctggccaaaaacccacagctgcggcacttaagtcctcagcagcagcag
cagctacaggcactcctcatgcagcggcagctgcagcagagtcaggcagtacgccagacc
ccaccctaccaggagcctgggacccagacctctcccctccagggcctcctgggctgccaa
cctcaacttgggggcttccctggaccacagacaggccccctccaggagctaggggcaggg
cctcgacctcagggcccaccccggctccctgccccaccaggagccttatctacaggacca
gtccttggccctgtccatcccacacctccaccatccagccctcaagagccaaagagacct
tcacaattaccttcccccagctcccagcttcccactgaggcccagctccctcccacccat
ccagggacccccaaacctcaggggccaaccttggagccgcctcctgggagggtctcacct
gctgctgcccagcttgcagataccttgtttagcaagggtctgggaccttgggatccccca
gacaacctagcagaaacccagaagccagagcagagcagcctggtacctgggcatctggac
caggtgaatggacaggtggtgcctgaggcatcccaactcagcatcaagcaggaacctcgg
gaagagccatgtgccctgggagcccagtcagtgaagagggaggccaatggggagccaata
ggggcaccaggaaccagcaaccacctcctgctggcaggccctcgctcagaagctgggcat
ctgctcttgcagaagctactccgggcaaagaatgtgcaactcagcactgggcgggggtcc
gaggggctgcgagctgagatcaacgggcacattgacagcaagctggctgggctggagcag
aaactacagggtacccccagcaacaaggaggatgcagcagcaaggaagcctttgacaccg
aagcccaagcgggtacagaaggcaagcgacaggttggtgagctcccgaaagaagctgcgg
aaggaggacggggtcagggccagcgaggccttgctgaaacagctgaaacaggagctgtcc
ctgctgcccctaacggagcctgctatcaccgccaattttagcctctttgccccctttggc
agtggctgcccagtcaatgggcagagccagctgaggggggcctttggaagtggggcgctg
cccactggccctgactactattcccagctgcttaccaagaataacctgagtaacccgccg
acaccaccctcgtcgctgccccccaccccacccccatcggtgcagcagaagatggtgaat
ggcgtcaccccatctgaagagctgggggagcaccccaaggatgctgcctctgcccgggat
agtgaaagggcactgagggatacttcagaggtgaagagtctagacctgctggctgccttg
cctacaccccctcacaatcagactgaggatgtcaggatggagagtgatgaggatagcgat
tctcctgacagcattgtgccagcttcatcccctgagagcatcttgggggaggaggcccct
cgtttccctcatctgggctcaggccggtgggagcaagaggaccgggccctctcccctgtc
atccccctcattcctcgggccagcatcccagtcttcccagataccaaaccttatggggcc
cttggcctggaggtccctggaaagctgcctgtcacaacttgggaaaagggcaaaggaagt
gaggtgtcagtcatgctcacagtctctgctgctgcagccaagaacctgaatggcgtgatg
gtggcagtggcggagctgctgagcatgaagatccccaactcctatgaggtgctgttccca
gagagccccgcccgggcaggcactgagccaaagaagggggaagctgagggtcctggtggg
aaggaaaagggtctggaaggcaagagcccagacactggccctgattggctgaagcagttt
gatgcagtgttgcctggctataccctgaagagccaactagacatcttgagcctcctgaaa
caggagagccccgccccagagccacccactcagcacagctatacctacaatgtctccaat
ctggatgtgcgacagctctcggccccacctcctgaagaaccctccccgcccccttccccc
ttggcaccttctcctgccagtccccctactgagcccttggttgaacttcccaccgaaccc
ttggctgagccacccgtcccctcacctctgccactggcctcatcccctgaatcagcccga
cccaagccccgtgcccggccccctgaagaaggtgaagattcccgtcctcctcgcctcaag
aaatggaaaggagtgcgctggaagcggcttcggctgctgctgaccatccagaagggcagt
gggcggcaggaggatgagcgggaagtggcagagtttatggagcagcttggcacagccttg
cgacctgacaaggtaccgcgagacatgcgtcgctgctgtttctgtcatgaggagggtgac
ggggccactgatgggcctgcccgtctgctgaacctggacctggacctgtgggtgcacctc
aactgtgccctttggtccacggaggtgtatgagacccagggcggggcactgatgaatgtg
gaggttgccctgcaccgaggactgctaaccaagtgctccctgtgccagcgaactggtgcc
accagcagctgcaatcgcatgcgttgccccaatgtctaccattttgcttgtgccatccgt
gccaagtgcatgttcttcaaggacaagaccatgctgtgtccaatgcataagatcaagggg
ccctgtgagcaagagctgagctcttttgctgtcttccggcgggtctacattgagcgggac
gaggtgaagcaaatcgctagcatcattcagcggggagaacggctgcacatgttccgtgtg
gggggccttgtgttccacgccatcggacagctgctgcctcaccagatggctgactttcat
agtgccactgccctctatcccgtgggctacgaggccacgcgcatctattggagcctccgc
accaacaatcgtcgctgctgctatcgctgttctattggtgagaacaacgggcggccggag
tttgtaatcaaagtcatcgagcagggcctggaggacctggtcttcactgacgcctctccc
caggccgtgtggaatcgcatcattgagcctgtggctgccatgagaaaagaggctgacatg
ctgcgactcttccctgagtatctgaagggcgaggagctctttgggctgacggtgcatgcc
gtgcttcgcatagctgaatcactgcccggggtggagagctgtcaaaactatttattccgc
tatgggcgccacccccttatggagctgccactcatgatcaaccccactggctgtgcccga
tcagagcctaaaatcctcacacactacaaacggccccataccctgaacagcaccagcatg
tctaaggcatatcagagcaccttcacaggcgagaccaacaccccctacagcaagcagttt
gtgcactccaagtcatctcagtaccggcggctgcgcaccgaatggaagaacaacgtgtac
ctggctcgctcccgtatccagggcctggggctctatgcagccaaggacctagaaaagcac
acaatggttatcgagtacattggcaccatcattcggaacgaggtggccaaccggcgggag
aaaatctacgaagagcagaatcgaggcatctacatgttccgaataaacaatgaacatgtg
attgatgctacgttgaccggcggccctgccaggtacattaaccattcctgtgcccctaac
tgtgtggccgaagtcgtgacatttgacaaagaggacaaaatcatcatcatctccagccgg
cgaatccccaaaggagaggagctaacctatgactatcagtttgattttgaggacgatcag
cacaagatcccctgccactgtggagcctggaattgtcggaaatggatgaactaa

KEGG   Homo sapiens (human): 9070
Entry
9070              CDS       T01001                                 
Symbol
ASH2L, ASH2, ASH2L1, ASH2L2, Bre2
Name
(RefSeq) ASH2 like, histone lysine methyltransferase complex subunit
  KO
K14964  Set1/Ash2 histone methyltransferase complex subunit ASH2
Organism
hsa  Homo sapiens (human)
Pathway
hsa04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    9070 (ASH2L)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    9070 (ASH2L)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HMT complexes
    COMPASS/SET1 complex
     9070 (ASH2L)
    MLL-HCF complex
     9070 (ASH2L)
    MLL3/MLL4 complex
     9070 (ASH2L)
SSDB
Motif
Pfam: SPRY PHD20L1_u1
Other DBs
NCBI-GeneID: 9070
NCBI-ProteinID: NP_004665
OMIM: 604782
HGNC: 744
Ensembl: ENSG00000129691
Vega: OTTHUMG00000164016
Pharos: Q9UBL3(Tbio)
UniProt: Q9UBL3
Structure
LinkDB
Position
8:38105493..38140080
AA seq 628 aa
MAAAGAGPGQEAGAGPGPGAVANATGAEEGEMKPVAAGAAAPPGEGISAAPTVEPSSGEA
EGGEANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGIC
TKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQS
RTQDEHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHP
DPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKR
KQQDGGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLEL
DCWAGKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGA
WYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGY
GQGDVLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSE
IIFYKNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMGWG
AVVEHTLADVLYHVETEVDGRRSPPWEP
NT seq 1887 nt   +upstreamnt  +downstreamnt
atggcggcggcaggagcaggacctggccaggaagcgggtgccgggcctggcccaggagcg
gtcgcaaatgcaacaggggcagaagagggggagatgaagccggtggcagcgggagcagcc
gctcctcctggagaggggatctctgctgctccgacagttgagcccagttccggggaggct
gaaggcggggaggcaaacttggtcgatgtaagcggtggcttggagacagaatcatctaat
ggaaaagatacactagaaggtgctggggatacatcagaggtgatggatactcaggcgggc
tccgtggatgaagagaatggccgacagttgggtgaggtagagctgcaatgtgggatttgt
acaaaatggttcacggctgacacatttggcatagatacctcatcctgtctacctttcatg
accaactacagttttcattgcaacgtctgccatcacagtgggaatacctatttcctccgg
aagcaagcaaacttgaaggaaatgtgccttagtgctttggccaacctgacatggcagtcc
cgaacacaggatgaacatccgaagacaatgttctccaaagataaggatattataccattt
attgataaatactgggagtgcatgacaaccagacagagacctgggaaaatgacttggcca
aataacattgttaaaacaatgagtaaagaaagagatgtattcttggtaaaggaacaccca
gatccaggcagtaaagatccagaagaagattaccccaaatttggacttttggatcaggac
cttagtaacattggtcctgcttatgacaaccaaaaacagagcagtgctgtgtctactagt
gggaatttaaatgggggaattgcagcaggaagcagcggaaaaggacgaggagccaagcgc
aaacagcaggatggagggaccacagggaccaccaagaaggcccggagtgaccctttgttt
tctgctcagcgccttccccctcatggctacccattggaacacccgtttaacaaagatggc
tatcggtatattctagctgagcctgatccgcacgcccctgaccccgagaagctggaactt
gactgctgggcaggaaaacctattcctggagacctctacagagcctgcttgtatgaacgg
gttttgttagccctacatgatcgagctccccagttaaagatctcagatgaccggctgact
gtggttggagagaagggctactctatggtgagggcctctcatggagtacggaaaggtgcc
tggtattttgaaatcactgtggatgagatgccaccagataccgctgccagactgggttgg
tcccagcccctaggaaaccttcaagctcctttaggttatgataaatttagctattcttgg
cggagcaaaaagggaaccaagttccaccagtccattggcaaacactactcttctggctat
ggacagggagacgtcctgggattttatattaatcttcctgaagacacagagacagccaag
tcattgccagacacatacaaagataaggctttgataaaattcaagagttatttgtatttt
gaggaaaaagactttgtggataaagcagagaagagcctgaagcagactccccatagtgag
ataatattttataaaaatggtgtcaatcaaggtgtggcttacaaagatatttttgagggg
gtttacttcccagccatctcactgtacaagagctgcacggtttccattaactttggacca
tgcttcaagtatcctccgaaggatctcacttaccgccctatgagtgacatgggctggggc
gccgtggtagagcacaccctggctgacgtcttgtatcacgtggagacagaagtggatggg
aggcgcagtcccccatgggaaccctga

DBGET integrated database retrieval system