KEGG   Loxodonta africana (African savanna elephant): 100653681
Entry
100653681         CDS       T04351                                 

Gene name
KMT2A
Definition
(RefSeq) LOW QUALITY PROTEIN: histone-lysine N-methyltransferase 2A
KO
K09186  [histone H3]-lysine4 N-trimethyltransferase MLL1 [EC:2.1.1.354]
Organism
lav  Loxodonta africana (African savanna elephant)
Pathway
lav00310  Lysine degradation
lav01100  Metabolic pathways
lav04934  Cushing syndrome
lav05202  Transcriptional misregulation in cancer
Brite
KEGG Orthology (KO) [BR:lav00001]
 09100 Metabolism
  09105 Amino acid metabolism
   00310 Lysine degradation
    100653681 (KMT2A)
 09160 Human Diseases
  09161 Cancer: overview
   05202 Transcriptional misregulation in cancer
    100653681 (KMT2A)
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    100653681 (KMT2A)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:lav03000]
    100653681 (KMT2A)
   03036 Chromosome and associated proteins [BR:lav03036]
    100653681 (KMT2A)
Enzymes [BR:lav01000]
 2. Transferases
  2.1  Transferring one-carbon groups
   2.1.1  Methyltransferases
    2.1.1.354  [histone H3]-lysine4 N-trimethyltransferase
     100653681 (KMT2A)
Transcription factors [BR:lav03000]
 Eukaryotic type
  Zinc finger
   CXXC CpG-binding proteins
    100653681 (KMT2A)
Chromosome and associated proteins [BR:lav03036]
 Eukaryotic type
  Histone modification proteins
   HMTs (histone methyltransferases)
    HKMTs (histone lysine methyltransferases)
     100653681 (KMT2A)
   HMT complexes
    MLL-HCF complex
     100653681 (KMT2A)
Motif
Pfam: FYRC SET FYRN PHD zf-CXXC zf-HC5HC2H Bromodomain PHD_2 PHD_4
Other DBs
NCBI-GeneID: 100653681
NCBI-ProteinID: XP_010596216
Position
Unknown
AA seq 3963 aa
MAHSCRWRFPARPGTTGGGGGGGRRGLGGAPRQRVPALLLPPGPPVGGGGPGAPPSPPAV
AAAAAAAGSSGAGVPGGAAAASAASSSSSSSSSSSSSASSGPALLRVGPGFDAALQVSAA
IGTNLRRFRAVFGESGGGGGSGEDEQFLGFGSDEEVRVRSPTRSPSVKTSPRKPRGRPRS
GSDRNSAILSDPAVFSPLNKSETKSGDKIKKKDSKSIEKKRGRPPTFPGVKIKITHGKDI
SELPKGNKEDSLKKIKRTPSATFQQATKIKKLRAGKLSPLKSKFKTGKLQIGRKGVQIVR
RRGRPPSTERVKTPSGLLINSELEKPQKVRKDKEGTPPLTKEDKTVVRQSPRRIKPVRII
PSSKRTDATIAKQLLQRAKKGAQKKIEKEAAQLQGRKVKTQVKNIRQFIMPVVSAISSRI
IKTPRRFIEDEDYDPPIKIARLESTPNSRFSAASCGSSEKSSAASQHSSQLSSDSSRSSS
PSVDTSTDSQASEEIQVLPEERSNTPEVHTPLPISQSPENDSNDRRSRRYSVSERSFGSR
TTKKLSTLQSAPQQQTSSSPPPPLLTPPPPLQPASSISDHTPWLMPPTIPLASPFLPASA
APMQEKRKSILREPTFRWTSLKHSRSEPQYFSSAKYAKEGLIRKPIFDNFRPPPLTPEDV
GFASGFSTSGTAASARLFSPLHSGTRFDMHKRSPLLRAPRFTPSEAHSRIFESVTLPGNR
NSAGTSSGVSNRKRKRKVFSPIRSEPRSPSHSMRTRSGRLSTSELSPLTPPSSVSSSLSI
SVSPLATSALNPTFTFPSHSLTQSGESAEKNQRPRKQTSAPAEPFSSSSPTPLFPWFTPG
SQTERGRNKDKAPEELSKDRDADKSVEKDKSRERDREREKENKRESRKEKRKKGSEIQSS
PALYPVGRVSKEKVLVGEDVATSSSAKKTTGRKKSSSLDSGTDIASVTLGDTTAVKTKIL
IKKGRGNLEKTNLDLGPTAPSLEKEKTLCLSTPSSSTVKHSTSSIGSMLAQADKLPMTDK
RVASLLKKAKAQLCKIEKSKSLKQTDQPKAQGQESDSSETSVRGPRIKHVCRRAAVALGR
KRAVFPDDMPTLSALPWEEREKILSSMGNDDKSSVAGSEDAEPLAPPIKPIKPVTRNKAP
QEPPVKKGRRSRRCGQCSGCQVPEDCGVCTNCLDKPKFGGRNIKKQCCKMRKCQNLQWMP
SKAYLQKQAKAVKKKEKKSKTSEKKESNVVKNVVDSSQKPTPSTREDPAPKKSGSEPPPR
KPTEEKSEDGSMSVPGPESKQVTTPASRKSSKQVSQPAPVTPPQPPSTGPLKKEVPRSTP
SEPKKKQPPPPESGPEQSKQKKVAPRPSIPVKQKPKEKEKPPPVNKQENAGTLNILSTLS
NGTSSKQKIPADGVHRIRVDFKEDCEAENVWEMGGLGILTSIPITPRVVCFLCASSGHVE
FVYCQVCCEPFHKFCLEENERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKCRNS
YHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFA
KGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPESVAYTCVNC
TERHPAEWRLALEKELQISLKQVLTALLNSRTTSHLLRYRQAAKPPDLNPETEESIPSRS
SPEGPDPPVLTEVSKQEDQQPLDLEGVKRKMDQGSYTSVLEFSDDIVKIIQAAINSDGGQ
PEIKKANSMVKSFFIRQMERVFPWFSVKKSRFWEPNKVSSNSGMLPNAVLPPSLDHNYAQ
WQEREENSHTEQPPLMKKIIPAPKPKGPGEPDSPTPLHPPTPPILNTDRSREDSPELNPP
PGIEDNRQCALCLTYGDDSANDAGRLLYIGQNEWTHVNCALWSAEVFEDDDGSLKNVHMA
VIRGKQLRCEFCQKPGATVGCCLTSCTSNYHFMCSRAKNCVFLDDKKVYCQRHRDLIKGE
VVPENGFEVFRRVFVDFEGISLRRKFLNGLEPENIHMMIGSMTIDCLGILNDLSDCEDKL
FPIGYQCSRVYWSTTDARKRCVYTCKIVECRPPVVEPDINSTVEHDENRTIAHSPTSFTE
ISSKESQNTAEIVSPPSPDRPPHSQTSSSCFYHVISKVPRIRTPSYSATQRSPGCRPLPS
AGSPTPTTHEIVTVGDPLLSSGLRSIGSRRHSTSSLSPQRSKLRIMSPMRTGNTYSRSSL
SSVSTTGTVSDLEKSAKAVDHVLGSLNSSTNLGQNTLTSSNLQRTVVTVGTKTSHLAGSS
SSEMKHTSASDLGSKSSSLKGEKTKMLSSKGLEGSAHNVAYPGIPKLAPQVHNTASGELN
VSKIGTFTEPSSVPYSSKEALSFPSLHSRGQRRDRDQHTDLSQPADLSSEADTEVKTLKL
SGVSNRSSIINEHVGSSSRDRRQKGKKSCKETFKEKHSTKSFLEPGQVATGEEGNLKPEF
VDEVLAPELIGQPPCNNVSSDKTGDKVLSIPGVPKASSMQVEGSAKELQTPRKRTVKVTL
TPLKMESENQSKNTLKESSPASPLQGESASPTEPMSASESPGDGPVAQPSPNDTSSQDSQ
SNNYQNLPVQDRNLMLPDGPKPQEDGSFKRRYPRRSARARSNMFFGLTPLYGVRSYGEED
IPFYSSSTGKKRGKRSAEGQVDGADDLSTSDEDDLYYYNFTRTVISSSGEERLASHNLFR
EEEQCDLPKISQLDGVDDGTESDTSVTATTRKSSQIPKRNGKENGTESLKIDRPEDAGEK
EHVIKSSAGHKNEPKMDNCHSVSRVKTQGQDSLEAQLSSLESSRRVHTSTPSDKNLLDTY
NTELLKSDSDNNNSDDCGNILPSDIMDFVLKNTPSMQALGESPESSSSELLNLGEGLGLD
SNRGKDMGLFEVFSQQLPTTEPVDSSVSSSISAEEQFELPLELPSDLSVLTTRSPTVPSQ
NPNRLAVISDSGEKRVTITEKSVASSEGDPALLSPGVDPTPEGHMTPDHFIQGHMDADHI
SSPPCSVEQGHGSSQDLTRNSSTPGLQVPVSPSVPIQNQKYVPNSTDSPGPSQISNAAVQ
TTPPHLKPATEKLIVVNQNMQPLYVLQTLPNGVTQKIQLTSSVSSTPSVMETNTSVLGPM
GTGLTLTTGLNPSLPTSQSLFPPASKGLLPMPHHQHLHSFSAATQSSFPPNISSPPSGLL
IGVQPPPDPQLLVSEASQRTDLSTTVATPSSGLKKRPISRLQTRKNKKLAPSSTPSNVAP
SDVVSNMTLINFTPSQLSNHPNLLDLGSLNTSSHRTVPNIIKRSKSGIMYFEQAPLLPPS
MGGTAAPAAGTSTISQDTSNLASGPVSGLASGSSVLNVVSMQTTTTPTSSASGPGHVTLT
NPRLLGNPDIGSISNLLIKASQQSLGIQDQPVALPPSSGMFPQLGTSQTPSAAAMTAASS
ICVLPSTQTTGITAASPSGEAEEHYQLQHVNQLLASKTGILSSQCDLDSASGTQVSNFTQ
TVDAPNSTGLEQNKALPSAMQASSASPGGSPPSGQQSASPSVPGPTKPKPKIKRIQLPLD
KGNGKKHKVSHLRTSSSEAHIPDQEASAAPLTSSTGTPGTEAEQQDTANVEQSSQKECGQ
PARQVAVLPEAQATRNPASEQESTEPKTVEEEESNFSSPLMLWLQQEQKRKESIAEKKPK
KGLVFEISSDDGFQICAESIEDAWKSLTDKVQEARSNARLKQLSFAGVNGVRMLGILHDA
VVFLIEQLSGAKHCRNYKFRFHKPEEANEPPLNPHGSARAEVHLRKSAFDMFNFLASKHR
QPPEYNPNDEEEEEVQLKSARRATSMDLPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCK
RNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGCYMFRIDDSEVVDATMHGNAARFIN
HSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRK
FLN
NT seq 11892 nt
atggcgcacagctgtcggtggcgcttccccgcccgacccgggaccaccgggggcggcggc
ggtggggggcgccggggcctagggggcgccccgcggcaacgcgtcccggctctgctgctt
ccccccgggcccccggtcggcggtggcggccccggggcgcccccctcccccccggctgtg
gcggcggcggcggcggcggcgggaagcagcggggccggggttccagggggagcggccgcc
gcctcagcagcctcttcgtcctcctcgtcttcgtcttcgtcatcgtcctcagcctcctcc
gggccggccctgctccgggtgggcccgggcttcgacgcggcgctgcaggtctcggccgcc
atcggcaccaacctgcgccggttccgggccgtgtttggggagagcggcgggggaggcggc
agcggagaggatgagcagttcttaggttttggctcagatgaagaagtcagagtgcgaagt
cccacaaggtctccttcagttaaaactagtcctcgaaaacctcgtgggagacctagaagt
ggctctgaccgaaattcagctatcctctcagatccagctgtattttcccctctaaataaa
tcagagaccaaatctggagataaaatcaagaagaaagattctaaaagtatagaaaagaag
agaggaagacctcccaccttccctggagtaaaaatcaaaataacacacggaaaggacatt
tcagagttaccaaagggaaataaagaagatagcctgaaaaaaattaaacggacaccttct
gctacatttcagcaagccacaaagattaaaaaattaagagcaggtaaactctctcctctc
aagtctaaatttaagacagggaagcttcaaataggaaggaagggggtgcagattgtacgc
cggcgaggaaggcctccatcaacagagagggtaaagaccccttcaggtctcctcattaac
tctgaactggagaagcctcagaaggtccggaaagacaaggaaggtacacctccgcttaca
aaagaagataagacagttgtcagacaaagccctcgaaggattaagccagttaggattatt
ccttcttcaaaaaggacagatgcaacaattgctaagcaactcttgcaaagggcaaaaaag
ggggctcaaaagaaaattgaaaaagaagcagctcagctgcaaggaagaaaagtgaaaaca
caggtcaaaaatattcgacagttcatcatgcctgttgtcagtgctatctcctcacggatc
attaaaacccctcgtcgatttatagaggatgaggactatgacccaccgattaaaatcgcc
cgactcgagtctaccccgaacagtagattcagtgccgcatcctgtggatcttctgaaaag
tcaagcgcagcttctcagcactcctctcagctgtcctcagattcctcccgatctagtagc
cccagtgtcgatacctccacagactctcaggcctctgaggagattcaggtacttcctgaa
gagcggagcaacacccctgaagttcatactccactgcctatttcccagtccccagaaaat
gatagtaacgataggagaagcagaaggtattcagtgtcagaaagaagttttggatctaga
acaactaaaaaattatcgactctacaaagcgccccccagcagcagacctcctcctctcca
cctccacctctgctcactccacccccaccactgcagccagcctccagtatctctgaccac
acaccttggcttatgcctccaacaatccccttagcatcaccatttttgcctgcttctgct
gctcccatgcaagagaagcgaaaatctattttgcgagaaccaacatttaggtggacttct
ttgaagcattctaggtcagagccacaatacttttcctcagcaaagtatgccaaagaaggt
ctcattcgcaaacccatatttgataatttccgaccccctccgctgactcctgaggatgtt
ggcttcgcatctggtttttctacatctggtactgctgcttcagcccgattgttttcacca
ctccattctggaactaggtttgatatgcacaaaaggagccctcttctgagagctccaaga
tttactccaagtgaggctcactctagaatatttgagtctgtaaccttgcctggtaatcga
aattctgctggaacatcttcaggagtatctaatagaaaaaggaaaagaaaagtgtttagc
cctattcgatctgaaccaagatctccttctcactccatgaggacaagaagtggaaggctt
agtacttctgagctatcacctctcaccccaccgtcttctgtctcttcctcattaagcatt
tctgttagtcctcttgccactagtgccttaaacccaacttttactttcccttctcattcc
ctgactcagtctggggaatctgcagagaaaaatcagagaccaaggaagcagactagtgct
ccagcagagccattttcatcaagcagtcctactcctctcttcccttggtttaccccaggc
tctcagacggaaagagggagaaataaagacaaggcccctgaggaactgtccaaagatcga
gatgctgataagagcgtggagaaggacaagagtagagagagagaccgggagagagaaaaa
gaaaataagcgggagtcaaggaaagagaaaagaaaaaagggatcagaaattcagagtagt
cctgctttgtatcctgtgggtagggtttccaaagagaaggttcttgttggtgaagatgtt
gccacttcatcttctgccaaaaaaacaacagggcggaagaagtcttcatcacttgattct
gggactgatattgcttctgtgactcttggggatacaacagctgtcaaaaccaaaatactt
ataaagaaagggagaggaaatctggaaaaaaccaacttggacctcggcccaactgcccca
tccctggagaaggagaaaaccctctgcctttccactccttcatctagcactgttaaacat
tccacttcctccataggctccatgttggctcaggcagacaagcttccaatgactgacaag
agggttgccagcctcctaaaaaaggccaaagcccagctctgcaagattgagaagagtaag
agtcttaagcaaactgaccaacccaaagcacagggtcaagaaagcgattcatcagagact
tctgtgcgaggaccccggattaaacatgtctgcagaagagctgctgttgcccttggccga
aaacgagctgtatttcctgatgacatgcccaccctgagtgccttaccatgggaagaacgg
gaaaagattctgtcttccatggggaatgacgacaagtcatcagttgctggctcagaagat
gctgaacctcttgctccacccatcaaaccaattaaacctgtcaccagaaacaaggcgcct
caggaacctccagtaaagaaaggacggcgatcaaggcggtgtgggcagtgttctggctgc
caggtgcctgaggactgtggtgtttgtactaattgcttagacaagcccaagtttggtggc
cgcaatataaagaagcagtgctgcaagatgagaaaatgccagaatctacagtggatgcct
tcgaaagcctaccttcagaaacaagctaaagctgtgaaaaagaaagagaaaaagtctaag
accagtgaaaagaaagagagcaatgttgtgaagaacgtagtggactccagtcagaaacct
accccatcaacaagagaggatcctgccccaaagaagagcggcagtgagcctcccccacga
aagcccactgaggagaagagtgaagatggaagtatgtctgtcccagggcccgaatccaaa
caagtcaccaccccagcttccaggaagtccagcaaacaggtctctcagccagcaccagtt
acccccccacagccgccaagcacaggaccactgaaaaaagaagttcccaggtccactcct
agtgagcccaagaaaaagcagcctccaccgccggaatcaggtccagagcaaagcaagcag
aagaaagtggctccccgcccaagtatccctgtaaaacaaaaaccaaaagaaaaggaaaaa
ccacctccagtcaataagcaggagaatgcaggcactttgaatatcctcagcactctctcc
aatggcactagttctaagcaaaaaatcccagcagatggagtccacaggatcagagtggac
tttaaggaggactgtgaagcagagaatgtgtgggagatgggaggcttgggtatcttgacc
tccattcctataacacctagggtggtttgctttctctgtgccagcagtgggcatgtcgag
tttgtttactgccaagtctgttgtgagcccttccacaagttttgtttagaggagaacgag
cgccctctggaggaccagctggaaaattggtgttgtcgtcgttgcaagttctgtcacgtt
tgtggaaggcagcatcaggctacaaagcagctgctggagtgtaataagtgccgaaacagc
tatcaccctgagtgcctgggaccaaactaccccaccaaacccacgaagaaaaagaaagtt
tggatctgtaccaagtgtgttcgctgcaaaagctgtggatccaccactccaggcaaaggg
tgggatgcgcagtggtctcatgatttctcgctgtgccatgattgtgccaaactctttgct
aaaggaaacttctgtcctctctgtgataagtgttacgatgatgatgactatgagagtaag
atgatgcagtgtgggaagtgtgatcgctgggtccattccaaatgtgagaatctttcagat
gaaatgtatgagattctatctaatctgccagaaagtgtggcctacacttgtgtgaactgt
actgagcggcaccctgcagagtggcgactagcccttgaaaaagagctacagatttctctg
aagcaagttctgacagccttgttgaattctcggaccaccagtcacttgctacgctaccgg
caggctgccaagcctccagacttaaatcctgagacagaggagagcataccctctcgcagc
tccccagaagggcctgacccaccagttcttactgaggtcagcaaacaggaagatcagcag
cctctagatctggagggagtcaagagaaaaatggaccaagggagctacacatctgtgttg
gagttcagtgatgatatcgtgaagatcattcaagcagccattaattcagatggagggcag
ccagaaattaaaaaagccaacagcatggtcaagtccttcttcattcggcaaatggaacgt
gtttttccatggttcagtgtcaaaaagtccaggttttgggagccaaataaagtatcaagc
aacagtgggatgttaccaaacgcagtgcttccaccttcacttgaccataattatgctcag
tggcaggagcgagaggaaaacagccacactgagcagcctcctttaatgaagaaaatcatt
ccagctcccaaacccaaagggcctggagaaccagactcaccaactcctctacaccctcct
acaccaccaattttgaatactgacaggagccgagaagacagtccggagctgaacccaccc
ccaggcatagaagataacagacagtgtgcattatgtttgacatatggcgatgacagtgct
aatgatgctggtcgtttgctatacattggccagaatgagtggacacatgtaaattgtgct
ttgtggtcagcggaagtgttcgaagatgatgatggatcgctgaagaatgtgcatatggct
gtgatcaggggcaagcaattgagatgtgaattctgccaaaagccgggagccaccgtgggt
tgctgtctcacatcctgcaccagcaactatcacttcatgtgttcccgagccaagaactgc
gtctttctagatgataaaaaggtgtattgtcaacgacatcgggatttgatcaaaggagag
gtggttcctgagaacggatttgaagtttttagaagagtgtttgtggactttgaaggaatc
agcttgagaaggaagtttctcaatggcttggaaccagaaaatatccacatgatgattggc
tcgatgacgattgactgcttgggaattctgaatgatctctctgactgtgaagataagctc
tttcctattggctatcagtgttccagggtatactggagcaccacagatgctcgcaagcgc
tgtgtatatacgtgcaagatagtggaatgccgtcctccagttgtagagccagatatcaac
agcactgttgaacatgatgaaaataggaccattgcccatagtccaacatcttttacagaa
atttcatctaaagagagtcaaaacacagctgaaattgtaagtcctccatcaccagaccga
cctcctcattctcaaacctccagctcctgtttttatcatgtcatctcaaaggtccctcgg
attcgaacacccagttattctgcaacacagagatcccctggctgtcggccattgccttct
gcaggaagtcctaccccaaccactcatgaaatagtcacagtgggtgatcctttactctcc
tctggacttcgaagcattggctccaggcgtcatagtacttcttccttgtcacctcagcgg
tccaaactccggataatgtctccaatgagaactggaaatacttactccaggagcagtctt
tcctcagtttccaccactgggactgtttcagatcttgagaaaagtgccaaagcagttgat
catgtattagggtcactgaattcaagtactaatttagggcaaaacactctcacctcttca
aatttacaaaggacagtggttactgtaggcactaaaaccagccacttggctggatcttct
tcttcggaaatgaagcataccagtgcctcagacttggggtccaagagctcctctttgaag
ggagagaagaccaaaatgctgagttccaagggcttagagggatctgcacataatgtggct
taccctggaattcctaaactggccccacaggttcataatacagcatctggagaattaaat
gttagtaaaattggaacctttacagaaccgtcttcggtgccatattcttctaaagaggcc
ctctcctttccatcactccattcgagagggcagaggcgcgatcgagaccaacacacagat
cttagccaaccagcagacctctcttcagaggcagatactgaagtcaaaaccttaaagctg
tctggagtgagcaacagatcgtctattatcaatgaacatgtgggatctagttccagagac
aggagacaaaaagggaaaaaatcttgtaaagaaactttcaaagaaaagcattccactaaa
tcttttttggaacctggtcaggtagcaactggtgaggaaggaaacttaaagccagagttt
gttgatgaggttttggctcctgagcttattgggcaaccgccatgtaataatgtttcttct
gataagactggggataaagtcctttctattccgggagtccccaaagcttcatccatgcaa
gtggaaggatctgccaaggaattacagacaccccggaaacgcacagtcaaagtaacactg
acacctctaaaaatggaaagtgaaaaccagtccaaaaatactttgaaagaaagtagtcct
gcttcccctctgcaaggagagtcagcatctccaacagaaccaatgtcagcctctgaaagt
ccgggagatggtccagtggcccagccaagccccaatgacacctcatcccaagattctcaa
agtaacaactatcagaatcttccagttcaggacagaaacctaatgcttccagatggcccc
aaacctcaggaagatggttcttttaagaggagatatcctcgccgcagtgcaagagcacgt
tctaatatgttctttgggctcaccccactctatggagtaagatcctatggtgaagaagac
attccattctacagcagctcaactgggaagaaacgaggcaagagatcagctgaaggacag
gtggatggggccgatgaccttagcacatcagatgaagatgacttatactactacaatttc
actagaacagtgatttcctcaagtggagaggaacggctggcatctcataatttatttcgg
gaggaggaacagtgtgatcttccaaaaatttcacagttggatggtgttgatgatgggaca
gagagtgatactagtgttacagccacaacaaggaaaagcagccagattccaaaaagaaat
ggtaaagagaatggaacagagagcttaaagattgatcgacctgaggatgctggtgaaaaa
gaacatgtcattaagagttctgctggccacaaaaatgagccaaagatggataactgccac
tctgtcagcagggttaaaacacagggacaggattccttggaagctcaactcagctcattg
gagtcaagccgcagagtccacacaagcaccccctcagacaaaaatttactggacacctat
aatactgagctcctgaaatctgattccgataataacaacagtgatgactgtggaaacatc
ttgccttcagacattatggactttgtgctaaagaatactccatccatgcaggctttgggt
gagagcccagagtcatcttcatcagaactcctgaatcttggtgaaggtttgggtcttgat
agtaatcgtgggaaagacatgggtctttttgaagtattttcccagcagctgccaacaaca
gaacctgtggacagtagtgtctcttcctcaatctcagctgaggagcagtttgagttgccg
ctagagctaccgtctgatctctcagtcctgaccacccggagtcccactgtccctagccag
aaccccaatagactagctgtgatctcagactcaggggagaagagagtaaccatcactgaa
aaatctgtggcctcctctgaaggtgacccggcactgttgagtccaggggtagatccaacc
cctgaaggccacatgactcctgatcattttatccaaggacacatggatgcagatcacatc
tccagccctccttgttcagtggaacaaggtcatggcagcagtcaggatttaactagaaac
agtagcacccctggccttcaggtacctgtttccccttctgttcctatccagaaccagaaa
tatgtgcccaattctactgacagtcctggcccatctcagatttctaatgcagctgtccag
accactccaccccacctgaaaccagccactgagaaactaattgttgttaaccagaacatg
cagccactttatgttctccaaactcttccaaatggagtgacccagaaaatccaattgacc
tcttctgttagttctacacccagtgtgatggagacaaatacttcagtattggggcccatg
ggaactggtctcaccctaaccacaggactaaatccaagcttgccaacttctcaatctttg
ttccctcctgctagcaaaggactgctccccatgccacatcaccagcacttacattccttc
tctgcagctactcaaagtagtttcccacccaacatcagcagtcctccttcaggcctactg
attggggttcagcctcctccagacccccaacttctggtttcagaagccagccagaggaca
gacctcagtaccacagtagccactccatcctctggactcaagaaaagacccatatctcgt
ctacagacccgaaagaataaaaaacttgctccctctagtaccccttcaaacgttgcccct
tccgatgtggtttctaatatgacattgatcaacttcacaccctcccagctttcaaaccac
cccaatctattagatttggggtcacttaatacttcatctcaccgaactgtccccaacatc
ataaaaaggtctaaatctggcatcatgtattttgaacaggcgcccctgttaccaccgagt
atgggaggaactgctgccccagcggcgggcacatcaaccataagccaggatactagcaac
ctcgcatcagggcccgtgtctggcttggcatctggttcctccgttttgaatgttgtatcc
atgcaaaccacaacaacccctacaagtagtgcatcaggtccaggacatgtcactttgacc
aacccaaggttgcttggtaacccagatattggttcaataagcaatcttttaatcaaagct
agccagcagagcctagggattcaggaccagcctgtggctttaccgccaagttcaggaatg
tttccacagctggggacatcgcagactccctctgctgctgcaatgacagcagcatctagc
atctgtgtgctcccctcaactcaaactacgggcataacagctgcttcaccttccggggaa
gcagaagagcactaccagcttcagcacgtgaaccagctccttgccagcaaaactgggatt
ctctcttcccagtgtgatctggattctgcttccgggacccaggtgtctaattttacccag
acagtagatgctcccaacagcacggggctagagcagaacaaggctttaccctcagctatg
caagccagctcagcctctcctgggggctctccaccctcgggacagcagtccgcaagcccg
tcagtgccgggtcccactaaacccaaaccaaaaatcaaacggattcagctgcctttggac
aaagggaatggcaagaagcacaaagtttcccatttgcggaccagttcttcggaagcacac
attccagaccaagaagccagcgcagcacccctgacgtcatccacagggactccaggaaca
gaggctgagcagcaggatactgctaacgtggaacagtcatcacagaaggagtgtggacag
cctgcaaggcaagtggctgttcttccagaggctcaggccacacgaaatccagcaagtgaa
caggagagtacagaacctaaaacggtggaagaagaagaaagtaatttcagctctccactg
atgctttggctccagcaagaacaaaaacggaaggagagcattgctgagaagaagccaaag
aaaggacttgtgtttgaaatttcaagcgatgatggctttcagatctgtgctgaaagtatt
gaagatgcctggaagtcattgacagacaaagtccaggaagctcgatctaacgcccgccta
aagcaactctcatttgcaggtgttaatggtgtgaggatgctggggattctccacgatgca
gttgtgttcctgattgagcagctctctggtgccaagcactgtaggaattacaaattccgc
ttccacaaaccagaggaggccaatgaaccccccctgaaccctcatggctcagccagggct
gaagtccacctgaggaagtcagcatttgacatgtttaatttcctggcttctaaacatcgg
cagcctcctgaatataaccccaatgatgaggaagaggaggaggtacagctgaagtccgct
cggagggcaactagcatggatctgccaatgcccatgcgtttccggcacttaaaaaagact
tctaaggaggcagttggtgtctacaggtctcccatccatggccggggtctgttctgtaag
agaaacattgatgcaggtgagatggtgattgagtatgcaggcaatgtcatccgttccatc
cagactgacaaacgagagaagtattatgacagcaagggcattggttgctacatgttccga
atcgatgactcagaggtagtggatgccaccatgcatggaaacgctgcacgcttcatcaat
cactcttgtgagcctaactgctattctcgggtcatcaatattgatggacagaagcacatt
gtcatctttgccatgcgtaagatctaccgaggggaggaactcacttacgactataagttc
cctattgaggatgccagcaacaagctgccttgcaactgtggcgccaagaaatgccggaag
ttcctaaactaa

KEGG   Loxodonta africana (African savanna elephant): 100659816
Entry
100659816         CDS       T04351                                 

Gene name
RBBP5
Definition
(RefSeq) retinoblastoma-binding protein 5 isoform X2
KO
K14961  COMPASS component SWD1
Organism
lav  Loxodonta africana (African savanna elephant)
Pathway
lav04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:lav00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    100659816 (RBBP5)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:lav03036]
    100659816 (RBBP5)
Chromosome and associated proteins [BR:lav03036]
 Eukaryotic type
  Histone modification proteins
   HMT complexes
    COMPASS/SET1 complex
     100659816 (RBBP5)
    COMPASS/SET1 complex (yeast)
     100659816 (RBBP5)
    MLL-HCF complex
     100659816 (RBBP5)
    MLL3/MLL4 complex
     100659816 (RBBP5)
Motif
Pfam: WD40 ANAPC4_WD40 Frtz
Other DBs
NCBI-GeneID: 100659816
NCBI-ProteinID: XP_010588553
Position
Unknown
AA seq 538 aa
MNLELLESFGQNYPEEADGTLDCISMALTCTFNRWGTLLAVGCNDGRIVIWDFLTRGIAK
IISAHIHPVCSLCWSRDGHKLVSASTDNIVSQWDVLSGDCDQRFRFPSPILKVQYHPRDQ
NKVLVCPMKSAPVMLTLSDSKHVVLPVDDDSDLNVVASFDRRGEYIYTGNAKGKILVLKT
DSQDLVASFRVTTGTSNTTAIKSIEFARKGSCFLINTADRIIRVYDGREILTCGRDGEPE
PMQKLQDLVNRTPWKKCCFSGDGEYIVAGSARQHALYIWEKSIGNLVKILHGTRGELLLD
VAWHPVRPIIASISSGVVSIWAQNQVENWSAFAPDFKELDENVEYEERESEFDIEDEDKS
EPEQTGADAAEDEEVDVTSVDPIAAFCSSDEELEDSKALLYLPIAPEVEDPEENPYGPPP
DAVQTSLMDEGASSEKKRQSSADGSQPPKKKPKTTNIELQGVPNDEVHPLLGVKGDGKSK
KKQAGRPKGSKGKEKDSPFKPKLYKGDRGLPLEGSAKGRVQAELSQPLTAGGAISELL
NT seq 1617 nt
atgaacctcgagttgctggagtcattcgggcagaactatccagaggaagctgatggcact
ttggactgtatcagcatggccctgacttgcacctttaacaggtggggcacactgcttgca
gttggctgtaatgatggccgaattgtcatctgggattttttgacaagaggaattgctaaa
ataatcagtgcacacatccatcccgtctgttctttatgctggagtcgagatggtcataag
cttgtgagtgcttccacggataacatagtgtcacagtgggatgttctttcaggagactgt
gaccagaggttccggttcccttcgcccattttaaaagtccagtatcacccacgagatcag
aacaaggtgctcgtgtgtcccatgaaatccgctcctgtcatgttgaccctttcagattcc
aaacacgttgttctgccggtagatgacgactccgatttgaacgtggttgcatcttttgat
aggcgaggagaatatatctatacgggaaatgcaaaaggcaagattttggtcctaaaaaca
gactctcaggatcttgtggcctccttcagagtaacaactggaaccagcaataccacagcc
attaagtccatagagtttgcccggaaggggagttgctttttaattaacacagcagatcga
ataatcagagtctatgatggcagagaaatcttaacctgtgggagagatggagaaccggaa
cctatgcagaaattgcaggacttggtgaataggaccccgtggaagaagtgctgtttctcc
ggggatggggagtacatagtggcggggtcagcccggcagcacgccctgtacatctgggag
aagagtattggcaacctggtgaagattctccatgggaccaggggagagctcctcctggat
gtggcttggcatcctgttcgacccatcatagcatccatttctagtggagtggtgtctatc
tgggcacaaaatcaagtagaaaattggagtgcatttgcaccagatttcaaggaattggac
gagaacgtggaatatgaggagagggaatcagagtttgatattgaagatgaagataagagt
gagcctgagcagacaggagctgatgctgccgaggatgaagaagtggatgtcaccagtgtg
gatcctatcgctgccttctgtagcagtgatgaagagctggaagattcaaaggctctattg
tatttacccattgcccctgaggtagaagacccggaagaaaatccttacggccccccgccg
gatgcagtccaaacgtctctgatggacgaaggggctagttcagagaagaagaggcagtct
tcagcagatgggtcccagccacctaagaagaaacccaaaacaaccaatatagaacttcaa
ggagtaccaaatgatgaagtccatccactactgggtgtgaagggggatggcaaatccaag
aagaagcaagcaggccgacctaaaggatcaaaaggtaaagagaaagattctccatttaaa
ccgaaactctacaaaggggacagaggtttacctctggaaggatccgcgaagggtagagtg
caggcggagctcagccagcccctgacagcagggggagcaatctcagaactgttgtga

KEGG   Loxodonta africana (African savanna elephant): 100665231
Entry
100665231         CDS       T04351                                 

Gene name
ASH2L
Definition
(RefSeq) set1/Ash2 histone methyltransferase complex subunit ASH2 isoform X2
KO
K14964  Set1/Ash2 histone methyltransferase complex subunit ASH2
Organism
lav  Loxodonta africana (African savanna elephant)
Pathway
lav04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:lav00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    100665231 (ASH2L)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:lav03036]
    100665231 (ASH2L)
Chromosome and associated proteins [BR:lav03036]
 Eukaryotic type
  Histone modification proteins
   HMT complexes
    COMPASS/SET1 complex
     100665231 (ASH2L)
    MLL-HCF complex
     100665231 (ASH2L)
    MLL3/MLL4 complex
     100665231 (ASH2L)
Motif
Pfam: SPRY
Other DBs
NCBI-GeneID: 100665231
NCBI-ProteinID: XP_003412583
UniProt: G3SPT1
Position
Unknown
AA seq 628 aa
MAAAGTGPGSGAGAGPGPTAASNAAAAEEGETKPVAAVAAAPAGEGTSAAPAAEPSSGEA
DSGDANLVDVSGGLETESSNGKDTLEGAGDTSEAMDTQAGSVDEENGRQLGEVELQCGIC
TKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQS
RTQDEHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHP
DPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQKQSGAVSTSGNLNGGIAAGSSGKGRGAKR
KQQDGGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLEL
DCWAGKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGA
WYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGY
GQGDVLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSE
IIFYKNGVNQGVAYRDIFEGVYFPAISLYKSCAVSINFGPCFKCPPKDLSYRPMSDMGWG
AVVEHTLADVLYHVETEVDGRRSPPWEP
NT seq 1887 nt
atggcggcggcgggaaccggccccgggtcgggagcgggtgccggaccgggcccgacagcg
gcctcaaatgcagccgcagcggaagagggagagacgaaaccggtggcggcggtagcagct
gctccagccggagaggggacgtctgctgctccagcagcggagcccagttctggagaggcc
gatagtggggatgcaaacttggttgatgtaagtggaggtttggagacagaatcctcgaat
gggaaagatacactagaaggtgctggggatacttcagaagcgatggacacccaggcaggc
tctgtggatgaagagaatggccggcagttgggagaggtggagctgcagtgcggaatatgt
acaaagtggttcacagcagacacctttggcatagatacctcatcgtgtttacctttcatg
accaactacagctttcactgtaatgtgtgccatcatagtggaaatacctatttccttcgg
aagcaagcaaacttgaaggaaatgtgcctcagtgctttggccaacctgacatggcagtcc
cgaacacaggatgaacacccgaaaaccatgttctccaaagataaggatattataccattt
attgataaatactgggagtgcatgacgaccagacagagacctgggaaaatgacgtggcca
aataacattgttaaaacgatgagtaaagaaagagatgtattcttggtaaaggaacaccct
gatcccgggagtaaagacccagaagaagattaccccaaatttggacttttggatcaggat
cttagtaacattggtcctgcttatgacaaccaaaaacagagcggtgctgtgtctaccagt
gggaatctaaatggaggaattgcagcaggaagcagtggaaaaggaagaggagctaagcgc
aagcagcaggacggagggaccactgggaccaccaagaaggcccggagtgaccctctgttt
tctgctcagcgcctcccccctcatggctaccccttggaacacccgtttaacaaagacggc
tatcggtatattctagccgagcctgatccccacgcccctgaccctgagaagcttgaactt
gactgctgggcaggaaaacctattcctggagacctctacagagcctgcttatatgaacgg
gttttgttagccctccatgatcgagctccccagctgaagatctcagatgaccgactgact
gtggttggagagaagggctactctatggtgcgggcttctcatggcgtacggaaaggtgcc
tggtactttgaaatcactgtggatgagatgcccccagacactgctgcgagactgggttgg
tcccagcccttaggtaaccttcaagctcccttgggttatgataaatttagctattcttgg
cggagcaaaaagggaaccaaattccaccagtccattggcaaacactactcttctggctat
ggacagggggatgtcctgggattttatatcaatcttcctgaagacacagagacagccaag
tcactgcctgatacttacaaagataaggctttgataaagttcaaaagttatttgtatttt
gaagaaaaagactttgtggataaggcagagaagagcctaaagcagactccccatagtgag
ataatattttataaaaatggtgtcaatcaaggtgtggcttacagagatatttttgagggc
gtttacttcccagccatctcactgtacaagagctgcgcagtttccattaactttggaccg
tgcttcaagtgtcctccaaaggatctctcttaccgccctatgagtgacatgggctggggc
gccgtggtagagcatactctggccgatgtcttatatcatgtggaaacagaagtggatggg
aggaggagccccccgtgggaaccctga

KEGG   Loxodonta africana (African savanna elephant): 100670013
Entry
100670013         CDS       T04351                                 

Gene name
KMT2D
Definition
(RefSeq) LOW QUALITY PROTEIN: histone-lysine N-methyltransferase 2D
KO
K09187  [histone H3]-lysine4 N-trimethyltransferase MLL2 [EC:2.1.1.354]
Organism
lav  Loxodonta africana (African savanna elephant)
Pathway
lav00310  Lysine degradation
lav01100  Metabolic pathways
lav04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:lav00001]
 09100 Metabolism
  09105 Amino acid metabolism
   00310 Lysine degradation
    100670013 (KMT2D)
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    100670013 (KMT2D)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:lav03036]
    100670013 (KMT2D)
Enzymes [BR:lav01000]
 2. Transferases
  2.1  Transferring one-carbon groups
   2.1.1  Methyltransferases
    2.1.1.354  [histone H3]-lysine4 N-trimethyltransferase
     100670013 (KMT2D)
Chromosome and associated proteins [BR:lav03036]
 Eukaryotic type
  Histone modification proteins
   HMTs (histone methyltransferases)
    HKMTs (histone lysine methyltransferases)
     100670013 (KMT2D)
   HMT complexes
    MLL-HCF complex
     100670013 (KMT2D)
Motif
Pfam: PHD FYRC FYRN SET zf-HC5HC2H zf-HC5HC2H_2 HMG_box
Other DBs
NCBI-GeneID: 100670013
NCBI-ProteinID: XP_023394755
Position
Unknown
AA seq 5321 aa
MDSQKPSGEDKDSEPAADGPAASEESGTAEPDLPNPHVGEVSVSSSGDPRLQEPSQDCSG
SPVRRCALCNCGEPSVHGQRELRCFELPFDWPRRPLVSPGGDPGPSEPAQPSEDLSQIGF
PEGLTPAHLGEPGGPCWAHHWCAAWSAGVWGQEGPELCGVDKAIFSGISQRCSHCTRLGA
SIPCRSPGCPRLYHFPCATASGSFLSMKTLQLLCPEHSEGAAHLEEARCAVCEGPGELYD
LFFCTSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSEMLVCETCDKGYHT
FCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWFENYSLCHRCHKAQGGQPIGP
VAEQRPSICSRFSPPEPGDTPTVEPDALYVACQGQPKGGHVTSMQSKEPGPLQCEAKPLG
RTGAQLEPQLEAPINEEMPLLPPPEESPLSPPPEESPTSPPPEASRLSPPPEESPLSPPP
EESPLSPPPESSPFSPLEESPFSPPEESPPSPPVETPLSPPSEASPLSPPFEESPLSPPP
EELPSSPPPEASRLSPPPEESPMSPPPEESPMSPPPEASRLDPPFEESPLSPPPEESPLS
PPPEASRLFPPPEDSPMSPPPEDSPMSPPPEVSRLSPPPEESPLSPPPEESPTSPPPEAS
RLSPPPEDSSPSLPPEDLPASSPPENLLTCLPPEESPLSPLPEEPCLCPRPEESESQLCP
QPEEVQFCYRPGEPHLSPRPEEPRLSPRPEEPRLSPRPEEPRLSPRPEEPRLSPGPEEPR
LSVQPEELRLFPVSEEPCLSPVPEEPHLSPQPEEPPEEPGLCPVPEELPLLPPHGEPPMS
PLLREPALSEPGEPPLSPLPEDLSMSPSGEPSLSPQLMPPDPLPPPLSPIITAVAPPALS
PLGELEYPFGAKGDSDPESPLAAPILETPISPPPEANCTDPEPVPPMILPPSPGSPMGPA
SPILMEPLPSQCSPFLQPSLPPESFPHSQCSPPALPLSIRSPLSPMEKTAEVSDEAEQHE
METEKVPEPECPALEPSPTSPLPSPVGDLSCPAPSPAPALDDFSGLGEDTAPLDGTDTPG
SQPEAGQTPGSSTACELKGSPVLLDPEELAPVTPMEVYGPECKQAGQGSPCEMQEEPHAP
VAPTPPTLIKSDIVNEISNLSQGDASASFPGSEPLLGSPDPEGGGSLSMELGVSTDVSPA
RDEGSLRLCTDSLPETDDSLLCEAGTTVSGGKADGDKGRRRSSPARSRIKQGRSSSFPGR
RRPRGGAHGGRGRGRARLKSTTSSIETLVVADIDSSPSKEEEEEDDDTMQNTVVLFSNTD
KFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECIVCE
VCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFHCEW
QNSYTHCGPCASLVTCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEEEVEQAADEGF
DCVSCQPYVIKPAAPVAPPELVPIKVKEPEPQYFRFEGVWLTETGMAVLRNLTMSPLHKR
RQRRGRLGLPGEAGLEGSEPSDALGPDDKKDGDLDTDELIKGEGGVEHMECEIKLEGPVS
PDMEPGKEETEESKKRKRKPYRPGIGGFMVRQRKSHTRVKKGPAAQTEVLSGDGQPDEVM
PADLPAESSVEQSLADGDEKKKQQRRGRKKSKLEDMFPAYLQEAFFGKELLDLSRKALFA
VGVGRPSFGLGTPKLKGDGGPERKDPPTLQKGDDGPDVADEESRGLEGKADTPGPEDGGV
KASPVPSDPEKPGTPGEGMLSSDLDRIPTEELPKMESKDLQQLFKDVLGSEREQHLGCGT
PGLDGSRTPLQRPFLQGGLPLGSLPSNSPMDSYPGLCQSPFLDSRERGGFFSPEPGEPDS
PWTGSGGTTPSTPTTPTTEGEGDGLSYNQRSLQRWEKDEELGQLSTISPVLYANINFPSL
KQDYPDWSSRCKQIMKLWRKVPAADKAPYLQKAKDNRAAHRINKVQKQAESQINKQTKVG
DIARKTDRPALHLRIPPQPGALGSPPPAAAPTIFIGSPTTPAGLSTSADGFLKPPAGTAP
GPDSPGELFVKLPPQVPAQVPSQDPFGLAPAYALEPRFPTAPPAYPPYPSPTGAPVQPPT
MGASSRSGTGQPGEFHTTPPGTPRHQPSTPDPFLKPRCPSLDNLAVPESPGVGGGKASEP
LLSPPPFGESRKALEVKKEELGAASPSYGPPNLSFVDSPSSGPHLGGPELKAPDVFKAPL
TPRASQVEPQSPGLGLRPQEPPPAQALAPSPPSHPDIFRPGPYPDPYAQPPLTPRPQPPP
PESCCALPPRSLPSDPFSRVPASPQSQSSSQSPLTPRPLSAEAFCPSPVTPRFQSPDPYS
RPPSRPQSRDPFAPLHKPPRPQPPEAAFKAGPLAHTPLGAGGFPAALPSGPTGELHAKVP
SGQPSNFARSPGTGAFVGTSSPMRFTFPQAVGEPSLKPPAPQPGPPPPHGINSHFGPGAT
LGKPQSTNYAVATGNFHTSGSPLGPSSGSTGEGYGLSPLRPASVLPPPAPDGSLPYLSHG
ASQRAGITSPVEKREDPGAGMGSSLAAPELPGTQDPGMPSLSQTELEKQRQRQRLRELLI
RQQIQRNTLRQEKETAAAAAGAVGPPGSWGAEPSSPAFEQLSRGQTPFNGTQDKSSLVGL
PQTKLGGPVLGPGAFPSDDRLSRPPPPATPSSMDVNSRQLVGGSQAFYQRAPYPGPLPLQ
QQQQLWQQQQQQQQATAATSMRLAMSTRFPSTPGPELGRQALGSPLTGIPTRLPGPGEPV
PGPPGPAQFIELRHNVQKGLGPGGPPFPVQGPPQRPRFYAVSEDPHRLAPEGLRGLAVSG
LPPQKPSAPPATELNNSLHPTPHTKGPNLSTGLELVSRPPSSTELGRPPPLALEAGKLPC
EDSELDDDFDAHKALEDDEELAHLGLGVDVAKGDDELSTLENLETNDPHLDDLLNGDEFD
LLAYTDPELDTGDKKDIFNEHLRLVESANEKAEREALLQGVEPGPLGPEERPAPAPAPDP
DASEPRLAPVLPEVKTKVEEGGPHPSPCQFTITTPKVESAPATNSLGLGPKPGQSVIGSR
DTRIVTGPFSSSGHTGEKGPFGATGGPPAHLLAPSPLSGSGGSSLLEKFELDSGTLALPG
GHAASGDELDKMESSLVASELPLLIEDLLEHEKKELQKKQQLSAQLQPAQQQQQQHSLLS
TPGSAQAMPLPHEGSSSLAGPQQQLALGLGGARQPGLAQPLMPTQPPAHALQQRLAPSMA
MVSNQGHMLGGQHGGQAGLVPQQSPQPVLTQKPVGTMPPSMCMKPQQLAMQQQLANSFFP
DTDLDKFAAEDIIDPIAKAKMVALKGIKKVMAQGSIGVAPGMNRQQVSLLAQRLSGGPGN
DLQNHVAAGSGQERSANDPSQPRPNPPTFAQGVINEADQRQYEEWLFHTQQLLQMQLKVL
EEQIGVHRKSRKALCAKQRTAKKAGREFPEADAEKLKLVTEQQSKIQKQLDQVRKQQKEH
TNLMAEYRNKQQQQQQQQQQQQQHSAVLALSPSQSPRLLTKLPGQLLPGHGLQPPQGPLG
GQAGGLRLPPGGMALPGQPGGPFLNTALAQQQQQQHSGGAGALTGPSGGFFPGSLALRGL
GPDSRLLQERQLQLQQQRMQLAQKLQQQQQQQQQQQQQQQQQQHLLGQLQQQQQQQQLQQ
QQQQLQQQQQQQQQFQQQQQQMGLLNQSRTLMSPQQQQQQQQQQQQQQQQQQQQQQVTLG
PGMPAKPLQHFSSPGALGPTLLLTGKEQGIVETALPPEVTEGPTTHQGGPLAIGTTPESM
AAEPGEVKPSVSGDSQLLLVQPQAQPQPNSLQLQPSLRLPGQQQQQVNLLHTAGGGSHGQ
LGSGPSSEASSMPHLLSQPSVSLGEQPGPMTQNLLGPQHPLGLERPMQSNVGPQPPKPGP
VPQSGQGLPGPGVMPTVGQLRAQLQGVLAKTPQLRHLSPQQQQQLQALLMQRQLQQSQAV
RQTPPFQEPGTQPSPLQGLLGRQPQLGSFPGSQTGPLQELGAGPRSQGPPRLSAPQGALS
TGSVLGPVHPTPPPSSPQEPKRPSPQLPSPSSQVPSEVQLTPTQPGTPKPQGPPLELPSG
RVSPAAAQLVDTFFGKGLGPWDPPDNLAEAQKPDQSSLVPGHLEQVNGQVVPEPPHLSIK
QEPREEPCALGAQAVKREANGEPLGAPGTSNHLLLAGPRSEAGHLLLQKLLRAKNVQLST
GRGPEGLRSEINGHIDSKLAGLEQKLQGTPSNKEDAAARKPLTPKPKRVQKASDRLVSSR
KKLRKEDGVRASEALLKQLKQELSLLPLTEPTITTNFSLFAPFGSGCSISGQSQLRGAFG
SGALPTGPDYYSQLLTKNNLSNPPTPPSSLPPTPPPSVQQKMVNGVTPSEELGEHPKDAA
SAQETEGALRSASEVKSLDLLAALPTPPHNQTEDVRMESDEDSDSPDSIVPASSPESILG
EEAPRFPQLGSCRWEQDDRALSPVIPIIPRASIPVFPDTKPYGALDLEVPGKLPATTWEK
GKGSEVSVMLTVSAAAAKNLNGVMVAVAELLSMKIPNSYEVLFAESPARAGTEPKKGEAE
GPGGKEKGLGIKSPEAGPDWLKQFDAVLPGYTLKSQLDILSLLKQESPAPEPPTQHSYTY
NVSNLDVRQLSAPPPEEPSPPPSPLAPSPASPPAERLVELPAEPTADPSVPSPLPLASSP
ESARPKSRARPPEEGEDSRPPRLKKWKGVRWKRLRLLLTIQKSSGRQEDEREVAEFMEQL
GTALRPDKVPRDMRRCCFCHEEGDGATDGPARLLNLDLDLWVHLNCALWSTEVYETQGGA
LMNVEVALHRGLLTKCSLCQRTGATSSCNRMRCPNVYHFACAIRAKCMFFKDKTMLCPMH
KIKGPCEQELSSFAVFRRVYIERDEVKQIASIIQRGERLHMFRVGGLVFHAIGQLLPHQM
ADFHSATALYPVGYEATRIYWSLRTNNRRCCYRCSIGENNGRPEFIIKVTEQGLEDLVFS
DASPQAVWNRIIEPVAAMRKEADMLRLFPEYLKGEELFGLTVHAVLRIAESLPGVESCQN
YLFRYGRHPLMELPLMINPTGCARSEPKILTHYKRPHTLNSTSMSKAYQSTFTGETNTPY
SKQFVHSKSSQYRRLRTEWKNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVA
NRREKIYEEQNRGIYMFRINNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIII
ISSRRIPKGEELTYDYQFDFEDDQHKIPCHCGAWNCRKWMN
NT seq 15966 nt
atggacagccagaagccgtctggtgaggataaagattcagaaccagcagctgatggacct
gcagcctctgaggagtcaggcaccgctgagccagaccttcccaacccacacgtgggggag
gtctctgtctccagttctggggatcccaggcttcaggagccttcccaggactgcagtggg
agtccagtgcggcgttgtgctctctgtaactgcggggagcccagtgtgcatgggcagcgg
gagttacggtgctttgagttgccatttgactggccgcggcgtccactggtatcccctggg
ggggacccagggcccagtgaaccagcgcagcccagtgaagacctatcacagattggtttc
cctgagggcctgacccccgctcatctaggagaacctggagggccctgctgggctcaccat
tggtgtgctgcatggtcggcaggcgtctgggggcaagagggcccagaactatgtggtgtg
gacaaggccatcttctcaggaatctcacagcgctgctcccactgcaccaggctcggtgcc
tccatcccttgccgctcgcccggatgtccacggctttaccacttcccctgcgcgactgcc
agcggttccttcttatccatgaagacactgcagctgctatgcccagagcacagtgagggg
gccgcacatctggaggaggctcgttgtgcagtatgtgaggggccgggggagttgtatgac
ctgttcttctgtaccagctgtgggcatcactatcatggggcctgcctggacactgctctg
actgcccgcaagcgtgctggctggcagtgccccgaatgcaaagtgtgccaagcctgcagg
aaacctgggaatgactctgagatgttggtctgtgagacgtgtgacaaaggataccatacc
ttctgcctaaaaccacccatggaggaactgccagctcactcttggaagtgtaaggcatgc
cgggtatgccgggcctgtggggcaggctcagcagagctgaatcccaactcggaatggttt
gagaactactcactctgtcaccgctgccacaaagcccagggaggtcagcccatcggtcct
gttgctgagcagcgtccctctatctgtagcagattctcacccccagagcctggcgatacc
cccactgttgagcccgatgctctgtatgttgcatgccaagggcagccaaagggtgggcac
gtgacctctatgcaatccaaggaaccggggcccctgcaatgtgaagccaaaccactaggg
agaacaggggcccaacttgagccccagttggaggcccccataaatgaggagatgccactg
ctgcccccacctgaggagtcgcccctatccccaccgcccgaggagtcacccacatcccca
ccgcctgaggcatcgcgcctgtccccaccgcctgaggaatcacccctctctccaccgcct
gaggagtctcccctgtctcccccacctgaatcatcacccttttctccacttgaggagtca
cccttctctccaccagaggagtctcctccatccccgccagttgagacacccctgtcccca
ccatctgaagcatcacccctgtccccaccatttgaggagtctcctctgtcccctccaccc
gaggaattgcccagttccccgcctcctgaagcatctcgcctgtctccaccaccagaggag
tcacccatgtctcctccacctgaagagtcacctatgtctccgccacctgaggcgtctcgt
ctggacccaccatttgaagagtctcccctgtcccctccacctgaggagtctccactgtcc
ccaccacctgaagcatcacgcctgttcccaccgcctgaggactcacccatgtccccacca
cctgaagactcacctatgtcccccccacctgaagtgtcacgcctatctccaccgcctgag
gaatctcccctgtccccaccacctgaggagtctcccacatctcctccacctgaggcttcg
cgcctgtccccaccacctgaggactcatctccgtccctgccacctgaagacttacctgct
tcctcaccaccagagaacttgctcacatgcctaccgccggaagaatcacccctgtcgcca
ctgcctgaggagccgtgtctctgcccccgacctgaggagtcggagtcacaattgtgccct
cagcctgaggaggtgcaattctgttaccggcctggggagccacacctgtccccccggccc
gaggagccgcgcctgtccccccggcccgaggagccgcgcctgtccccccggcccgaggag
ccgcgcctgtccccccggcccgaggagccgcgcctgtcccccgggcccgaggagccgcgc
ctgtccgtgcagccggaggagctgcgccttttccctgtatctgaggagccgtgcctgtcc
cctgtacctgaagagccacacttgtccccccagcctgaggagccccccgaggagccaggc
ttgtgccctgtgcctgaggagttacccttgttacccccacatggggagccacccatgtcc
cctttgcttagagagccggccctgtctgagcctggggaaccacctctatctcctctgcct
gaggatctgtccatgtccccatctggggagccatccttgtcacctcaactgatgccacca
gatcctcttcctcctccactctcacccattatcacagctgtggccccaccagccctgtct
cctttgggggagttagagtacccctttggtgccaaaggggacagtgaccctgagtcaccg
ttggctgcacccatcctagagacaccgatcagccctccaccggaagcaaattgcactgac
cctgagcctgtaccccctatgatccttcccccatctccaggctccccaatgggacctgct
tctcccatcctgatggagcccttaccctctcagtgttctccattccttcagccttccctg
cctcccgagagcttccctcattcccagtgctcccctcctgctctgcctctgtccatccgc
tccccgctgagtcccatggagaagactgcggaggtctcagatgaggctgagcaacatgaa
atggagactgaaaaagtcccagaacctgagtgcccagccttagagcccagccctaccagt
cctcttccctcccccgtgggagacctttcctgccctgcacccagccctgccccagcactg
gatgacttctctggccttggggaagacacagcccctcttgatgggactgacactcctggt
tcacagccagaagctggacagacccctggcagttctacagcttgtgaacttaagggttcc
cctgtgctcctggaccccgaggagctggcccccgtgacccctatggaggtctatggtcca
gaatgcaaacaggcagggcagggctcaccatgtgagatgcaggaggagccacatgcacca
gtggcccccaccccacccactctcatcaaatccgacatcgttaatgagatctcgaacctg
agccagggggatgccagtgctagttttcctggttcagagcccctgctgggctctcctgac
cccgaggggggtggctccctgtccatggagctgggggtatctacagacgtcagcccagcc
cgagatgagggctccttgcggctttgtactgactccttgccagagactgatgactcgcta
ttgtgtgaagctgggacaactgtcagcggaggcaaagccgatggggacaaggggaggcgg
cgcagttcccctgcccgttcccgcattaagcagggacgcagcagtagtttcccaggaaga
cgccggccacggggaggagcacatggaggacgtggaagaggacgggcccggctaaaatca
actacttcttccattgagactctggtagttgctgatatcgatagctctcccagcaaggag
gaggaagaagaggatgacgacaccatgcaaaataccgtggttctcttctccaacacagac
aaatttgtcctaatgcaggacatgtgtgtggtatgtggcagctttggccggggagcagaa
ggccatctccttgcctgttcccagtgctctcagtgctatcacccttactgtgttaacagc
aagatcaccaaggtgatgctgctgaagggctggcggtgcgtggagtgtattgtgtgtgag
gtgtgtggccaggcctccgacccctcacgcctgctgctctgtgatgactgtgacattagc
taccatacatactgcctggaccccccactgctcaccgtgcccaaaggtggctggaagtgc
aagtggtgtgtctcctgtatgcagtgtggggccgcttcccctggcttccactgtgaatgg
cagaatagttacacacactgtgggccctgtgctagcctggtaacctgccctatctgtcat
gccccatatgtggaagaggaccttctcattcagtgccgccactgtgaacggtggatgcat
gctggctgcgagagtctcttcacagaggaagaagtggaacaggctgcagatgagggcttt
gactgtgtctcctgccagccctatgttataaagcctgcggcgcctgttgcacctccagag
ttggtgcctatcaaagtgaaagagccagagccccagtactttcgcttcgagggtgtgtgg
ctgacagaaactggcatggccgtgctgcgtaacctgaccatgtcgcctctgcacaagcgg
cgccagcggcgggggcggctcggcctcccaggcgaggcagggctggaaggttccgagccc
tcagatgcccttggccctgatgacaagaaagatggggacctggacactgatgagcttatc
aagggtgaaggtggtgtggagcacatggagtgtgaaattaaactagagggccccgtcagc
cctgacatggagcctggcaaggaggagaccgaggaaagcaagaaacggaagcgcaagcca
tatcgacctggcattggtggcttcatggtgcgacagcggaaatcccacacacgcgtgaaa
aagggtcctgcggcacagacagaggtgttgagtggtgatgggcagcccgacgaggtgatg
cctgccgacctacctgcagagagctctgtggagcagagcttagctgatggggatgagaag
aagaagcagcagcggcgagggcgcaagaagagcaaattagaggacatgtttcctgcttac
ttgcaggaagccttctttggaaaggagctgctggacctgagccgtaaggccctttttgca
gttggggtgggccggccaagctttggactaggaaccccaaaactcaagggggatggaggc
ccagaaaggaaggatccccccaccttgcagaaaggagatgatggtccagatgttgcagat
gaagaatcacgtggcctcgagggcaaggctgatacaccaggacctgaggatgggggcgta
aaggcatccccagtgcccagtgaccccgagaagccaggcaccccgggtgaaggaatgctt
agctctgacttagacaggattcccacagaagaattgcccaagatggaatccaaggacctg
cagcagctcttcaaggatgttctgggttccgaacgagagcagcatctgggttgtggaacc
cctggcctagatggcagccgtacgccactgcagaggccctttctccaaggtggactccct
ttgggcagcctcccatccaacagcccaatggactcctaccctggcctctgccagtcccca
ttcctggatagtagggagcgcgggggctttttcagcccggaacccggtgagccagacagc
ccgtggacaggctcagggggcaccacgccctctacccccaccacccccaccaccgaggga
gagggcgatgggctctcctataaccagcggagtcttcagcgctgggagaaggatgaggag
ttgggccagctctccaccatttcacctgtgctctatgccaacattaactttcccagtctc
aagcaagattacccagactggtctagtcgatgcaaacaaatcatgaagctctggagaaag
gttccagctgctgacaaagccccctacctgcaaaaggccaaagataaccgggcggctcac
cgcatcaataaggtgcaaaagcaggctgagagccagatcaacaagcagaccaaggtgggc
gacatagcccgtaagactgaccgaccggccctacatctccgcattcccccccagccaggg
gcactgggcagtccgccccctgctgctgcccccaccattttcattggcagccccactacc
cctgccggcttgtctacctctgcggacgggttcctgaagccgccggcaggcacggcgccc
ggccccgactcacctggtgagctcttcgtcaagctcccgccccaggtgcctgcccaagtg
ccttcgcaggacccctttggactggcccctgcctacgccctggagccccgcttccccaca
gcaccacccgcctatcccccttatcctagcccaactggggcccccgtacagcctccaaca
atgggcgcctcatctcgttctgggactggtcagccaggggaatttcacacgaccccacct
ggcactccccgacaccagccttccacacctgaccccttcctcaaaccccgctgcccctca
ctggacaatctggctgtgcctgagagcccaggggtagggggaggcaaggcttctgagccc
ctgctctcacccccgccttttggggagtcccggaaggcactagaggtgaagaaggaagag
cttggggcagcctctcctagctatgggcccccaaacctgagttttgttgactcaccctcc
tcaggcccccacctaggtggcccagaattaaaggcacctgatgtcttcaaggcccctctg
acccctcgggcatctcaggtagagccccagagcccgggcttgggcctacggccccaggag
ccacctcctgcccaggctttggccccttctcccccgagccaccctgacatctttcgccct
ggcccttaccctgacccctatgcccagcccccactgactcctcggccccagccaccaccc
cctgagagctgctgtgccctgccccctcgctcgctgccctctgaccctttttcccgagtt
cctgccagtcctcagtcccagtccagctcccagtccccactgacaccccgtcctctgtct
gctgaggctttctgcccatccccggttacccctcgcttccagtcccctgacccttattct
cgcccaccctcacgcccgcagtcccgtgacccatttgccccattgcataagccaccccga
ccccagcctcctgaagctgccttcaaggctgggcctctagcccacactccgctgggagct
gggggcttcccagcagccctgccctcagggccgacaggtgagctccatgccaaggtccca
agtgggcagccctccaattttgcccggtcccctgggaccggtgcatttgtgggtacctcc
tctcccatgcgtttcactttccctcaggcggtaggggagccttccttaaagcctcctgcc
cctcaacctggtccccccccaccccatgggatcaatagccattttgggcccggcgccacc
ttgggcaagccccaaagcacaaactacgcagtagccacggggaacttccacacatcaggc
agccccctggggcccagcagtgggtccacaggagagggctatgggctgtccccactacgc
cctgcatcagtcctgccaccacctgcacccgatggatccctcccctacctgtcccatgga
gcctcacagcgggcagggatcacctctccagttgaaaagcgagaagacccaggggctgga
atgggcagctccctagcagcacctgaactcccaggtacccaggatccaggcatgcccagc
ctcagtcagacggagctggagaagcaacgacagcgccaacgactgcgggagctgctgatt
aggcagcagattcagcgcaacaccctgcggcaggagaaggagacagctgcagcagctgca
ggagcagtggggcccccgggcagctggggtgcggagcccagcagccctgcctttgagcag
ctgagtcgaggccagactcccttcaatgggacccaggacaagagcagccttgtgggactg
ccccaaaccaagctgggtggccctgtcctggggccaggggcttttcccagtgacgaccga
ctctcccggccacctccaccagccaccccttcctctatggatgtgaacagccgacaactg
gtaggaggctcccaagccttctatcaacgagcaccctatcctgggcccctgcccttacaa
caacaacagcaactgtggcagcaacaacagcagcagcagcaggcaacagcagcaacctcc
atgcgacttgcaatgtctactcgctttccgtcaactcctgggcctgaacttggccgccaa
gccctaggttcccccttgacaggaattcccacccgtttacctggtcctggtgagccagtg
cctgggccacctggtcctgcccagttcattgagttgcggcacaatgtgcagaaaggacta
ggacctggggggcctccattccctgttcaggggccccctcagagaccccgtttctatgct
gtaagtgaagatccccaccgactagcccctgaaggacttcggggcctggcggtatcaggg
cttcctccacagaaaccctcagccccaccagctactgagttgaacaacagcctccatcca
acaccccacaccaagggtcccaacctgtccactggcttggagctggtcagccgaccccct
tcgagcaccgaacttggccgcccccctcctctggctctggaagctgggaagctgccctgt
gaggattctgagctggacgatgactttgatgcccacaaggccctagaggatgacgaggag
cttgctcacctgggcttgggtgtggatgtggccaagggtgatgatgagctgagcactctg
gaaaacttggagactaatgacccccacctagatgacctgctcaatggagatgagtttgac
cttctggcttatactgaccctgaactggacactggggacaagaaggacattttcaatgag
catctgaggctggtggaatcagccaatgagaaagctgagcgagaggccctgctgcagggg
gtggagccaggacctttgggacctgaggagcgtcctgcccctgcccctgcccctgatcct
gatgcctctgagcctcgcctggcaccggtactccctgaagtgaaaaccaaggtggaggag
ggtgggccccacccttccccttgccagttcaccattaccacccccaaggtggagtcagca
cctgccaccaattctttgggcctggggccgaagccaggacagagtgtgattggcagccgg
gacacgcgtattgtcacaggacccttttccagcagtgggcacacaggtgaaaagggcccc
tttggggccacaggaggaccaccagctcacctgttggcccccagcccactgagtggctca
ggagggtcttccttgctggaaaagtttgagctagatagtgggaccctggccttgcctggt
ggacatgcagcatctggggatgaactagacaagatggagagctcactcgtggccagtgag
ttgcccctactcattgaagatcttttggagcatgagaagaaggagctgcagaagaagcag
cagctttcggcacagctgcagcctgcccagcagcaacagcagcagcattccctgctgtct
acaccaggctctgcccaggccatgcctttgccacatgagggctcttccagtttggctggg
ccccagcagcagcttgccctgggtcttggaggtgcccgacagccaggcttggcccaacca
ctgatgcccacccagccaccagctcatgccctccaacagcgcctggccccatccatggcc
atggtgtctaaccaagggcatatgctaggtggacagcatggagggcaagcaggattggta
cctcagcagagcccacagccagtgctgacacagaagcccgtgggtactatgccaccttca
atgtgcatgaagccccagcagctggcgatgcagcagcagcttgctaacagcttctttccg
gatacagacctagacaaatttgctgcagaagatattattgatcccattgcaaaggccaag
atggtggctttgaaaggcatcaagaaagtgatggctcagggcagcattggggtggcacct
ggtatgaacaggcagcaagtgtcactgctagcccagcggctctcaggggggcctggcaat
gacctgcagaaccatgtggcagctgggagtggccaggagcggagtgccaacgacccctcc
cagcctcgccccaacccacccacttttgctcagggggtaatcaatgaggccgaccagcgg
caatatgaagagtggctgttccacacccaacagctcctacagatgcagctgaaggtgtta
gaggagcagattggtgtgcaccgtaagtcccggaaggctctgtgtgccaagcagcgcact
gccaaaaaggctggccgtgagttcccagaggccgatgctgagaagctcaagctggtcacg
gagcagcagagcaagatccagaaacaactggatcaggtccgaaaacaacagaaggaacac
actaatctcatggcagaatatcggaataaacagcaacaacaacagcagcagcaacaacag
cagcaacagcactcagctgtgctggcactcagcccttctcagagtccccgactactcacc
aagctccctggtcagctcctcccaggccatgggctgcagccacctcagggacctctgggt
gggcaagctggaggtcttcgcctaccccctgggggcatggcactacctggacaaccaggt
ggccctttcctcaacacagccctggcccaacagcagcaacagcaacattctgggggagct
ggggccctgactggtccctctggcggcttcttccctggcagccttgctcttcggggcctg
ggacctgattcaaggctcttacaggaaaggcagctgcaactgcagcagcaacgcatgcag
cttgctcagaaactgcagcagcaacagcagcagcagcagcagcagcagcagcagcagcag
cagcagcaacaccttctaggacagcttcaacagcagcaacagcagcagcagcttcaacag
cagcagcagcagcttcaacagcagcagcagcagcagcagcaatttcaacagcagcagcag
cagatgggcctcttgaaccagagtcgaactttaatgtctcctcagcaacagcagcagcag
cagcagcagcagcagcagcagcagcagcagcagcagcagcagcagcaggtgacacttggc
cctggcatgccagccaagcctcttcaacacttttctagtcctggagccctgggcccaacc
ctcctcctgacgggcaaggaacaaggcattgtagagacagctctccctccagaggtcact
gagggacccacaacacatcagggaggcccactagcaatagggactacacccgaatcgatg
gctgctgaaccaggggaggtaaagccttcagtttctggggactcacaactcctgcttgtc
caaccccaggcccaacctcagcccaactccctgcagctgcagccatctctgaggctccca
ggacaacagcagcagcaagtgaacttgctccacacagcaggtgggggaagccatgggcag
ctaggcagtggaccatcttctgaggcctcatctatgccccacctactgtcacaaccctct
gtttccttaggggagcagcctggacccatgacccagaaccttctgggtccccaacatcct
cttgggctggagcggcccatgcaaagtaatgtagggccacagcctcccaaaccaggacct
gtcccccagtctgggcagggcctgcctgggcctggagtcatgcctacagtgggtcagctt
cgagcacagctccaaggcgtcttggccaaaaccccacagctgcgccacttgagtcctcag
cagcagcagcagctacaggcgcttctcatgcagcggcagctgcagcagagtcaggcagta
cgccagacccctcccttccaggagcctgggacccagccttcacccctccagggccttcta
ggccgccagccccaacttgggagcttccctggatcccaaacaggccccctccaggagcta
ggggcagggcctcgatctcagggcccaccccggctctccgccccacaaggagccttatcg
acaggatcagtccttggccctgtccatcccacacctccaccatccagcccccaagaacca
aagagaccttccccgcaattaccttcccccagctcccaggtcccctctgaggtccagctc
actcccacccagccagggactccaaagccccaggggccacccttggagctgccttctggg
agggtctcccctgctgctgcccagcttgtggacaccttcttcggcaaggggctgggacct
tgggaccccccagacaacctcgcagaagcccagaagccagatcagagcagcctggtacct
gggcatctggagcaggtgaatgggcaggtggtacctgagccaccccatctcagcatcaag
caggagcctcgggaagagccatgcgccttgggggcccaggcggtgaagagggaggccaat
ggggagccactaggggcaccaggtaccagcaaccacctcctgctggcaggcccccgctca
gaggctggacatctgctcttgcagaagcttctacgggcaaagaatgtacagctcagcact
gggcgggggcccgaggggctgcgatctgagatcaacgggcacattgacagcaagctggct
gggctggagcagaaactacagggtactcccagcaacaaggaggatgcagcagcaaggaag
ccattgaccccgaagcccaagcgggtacagaaggcaagcgacaggttggtgagctcccga
aagaagctgcggaaggaggacggggtcagggccagcgaggccttgctgaaacagctgaaa
caggagctgtccctgctgcccctaacggagcctaccatcaccaccaattttagcctcttt
gccccctttggcagtggctgctcaatcagtgggcagagccagctgaggggggcctttgga
agtggggcactgcccactggccctgactactattcccagctgcttacaaagaataacctg
agtaacccgccgacaccaccctcgtcgctgccccccaccccacccccatcggtgcagcag
aagatggtgaatggcgtcaccccatctgaagagctgggggagcaccccaaggatgccgcc
tctgcccaagagactgaaggggcattgaggagtgcttcagaggtgaagagtctagacctt
ctcgccgccttgcctacaccccctcacaaccagactgaggatgtcaggatggagagtgat
gaagacagcgattctcctgacagcattgtgccagcttcatcccccgagagcatcctaggg
gaggaggcgcctcgtttccctcagctaggctcatgccggtgggagcaggatgaccgggcc
ctgtccccagtcatccccatcattcctcgggccagcatcccagtcttcccagataccaaa
ccttacggggccttggacctggaggtccctggaaagctgcctgccacaacttgggaaaag
ggcaaaggaagtgaggtctcagtcatgctcacggtctctgctgctgcagccaagaacctg
aatggtgtgatggtggcagtggcagagctgctgagcatgaagatccccaactcctacgaa
gtgctcttcgcagagagccctgcccgggcaggcactgagcccaagaagggggaagctgag
ggccctggtgggaaggaaaagggtctgggaatcaagagcccagaagctggccctgattgg
ctgaaacagtttgatgcagtgttgcctggctataccctcaagagccagctagacatcttg
agcctcctgaaacaggagagcccagccccagagccacccacccagcacagctacacctac
aacgtctccaatctggatgtgcgacagctctcggccccgcctcctgaagaaccctcccca
cctccctcccccttggcaccctctcctgccagcccccctgctgaacgcctggttgaactc
ccagccgaacccacagctgatccatcagtcccttcacctctgcctctggcctcgtcccct
gaatcagcccggcccaagtcccgagcccggccccctgaggaaggggaggattcccgccct
cctcgcctcaaaaagtggaagggggtgcgctggaagcggctgcgactgctactgactatc
cagaagagcagcgggcggcaggaggatgagcgggaagtggcagagtttatggagcagctt
ggcacagcactgcgacctgacaaggtgcctcgagacatgcggcgctgctgcttttgccat
gaggagggtgatggggccactgatgggcctgcccgcctgctgaacctggacctggacctg
tgggtacacctcaactgtgccctgtggtccacagaggtgtatgagacccagggtggggcg
ctgatgaatgtggaggttgctttgcaccggggactgctaactaagtgctccctgtgccag
cgcaccggtgccaccagcagctgcaatcgcatgcgttgccccaacgtctaccattttgcc
tgtgccatccgcgccaagtgcatgttcttcaaggacaagaccatgctatgcccaatgcat
aagatcaaggggccctgcgagcaggagctgagttcttttgctgtcttccggcgggtctac
attgagcgggacgaggtgaagcaaatcgccagcatcatccagcggggagagcggctgcac
atgttccgtgtggggggccttgtgttccatgccatcggacagctgctccctcaccagatg
gctgacttccatagtgccactgccctctatcccgtgggctacgaggccacgcgcatctac
tggagcctccggaccaacaaccgccgctgctgctaccgctgctccatcggtgagaacaat
ggacggccggagttcataatcaaagtcacggaacagggcctggaggacctggtcttcagt
gacgcctccccccaggccgtgtggaatcgcatcatcgagccggtggctgccatgagaaaa
gaggctgacatgctgcggctcttccctgagtacctgaaaggcgaggagctctttgggctg
acggtgcacgctgtgcttcgcatagctgaatcactgcctggggtggagagctgtcaaaac
tatttattccgctatggccgtcaccccctgatggagctgccactcatgatcaaccccact
ggctgtgcccgatcggaacctaaaatcctcacacactacaaacggccccataccctgaac
agcaccagcatgtccaaggcatatcagagcaccttcacaggcgagaccaacaccccatac
agcaagcagtttgtgcactccaagtcatctcagtaccggcggctgcgcactgagtggaag
aacaacgtatatctggctcgctcccgtatccagggcctggggctctatgcagccaaggac
ctagagaagcacacaatggtcatcgagtatattggcaccatcattcgtaacgaggtggcc
aaccggcgggagaaaatctatgaagagcagaatcgaggtatctatatgttccggataaac
aatgaacatgtgattgatgctacattgaccggaggccctgccaggtacattaaccattcc
tgtgcccctaactgtgtggcagaagttgtgacatttgacaaggaggacaaaatcatcatc
atctccagtcggcgaatccccaaaggagaggagctgacgtatgactatcagtttgacttt
gaggacgatcagcacaagatcccctgccactgtggagcctggaattgtcggaaatggatg
aactaa

KEGG   Loxodonta africana (African savanna elephant): 100671380
Entry
100671380         CDS       T04351                                 

Gene name
WDR5B
Definition
(RefSeq) WD repeat-containing protein 5B
  KO
K14963  COMPASS component SWD3
Organism
lav  Loxodonta africana (African savanna elephant)
Pathway
lav04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:lav00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    100671380 (WDR5B)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:lav03036]
    100671380 (WDR5B)
Chromosome and associated proteins [BR:lav03036]
 Eukaryotic type
  Histone modification proteins
   HAT complexes
    ATAC complex
     100671380 (WDR5B)
    NSL complex
     100671380 (WDR5B)
   HMT complexes
    COMPASS/SET1 complex
     100671380 (WDR5B)
    COMPASS/SET1 complex (yeast)
     100671380 (WDR5B)
    MLL-HCF complex
     100671380 (WDR5B)
    MLL3/MLL4 complex
     100671380 (WDR5B)
SSDB
Motif
Pfam: WD40 ANAPC4_WD40 eIF2A Nup160 Ge1_WD40 WD40_like Frtz CDtoxinA Ricin_B_lectin
Other DBs
NCBI-GeneID: 100671380
NCBI-ProteinID: XP_003413011
UniProt: G3TG94
LinkDB
Position
Unknown
AA seq 330 aa
MATEEAGDSKGESALSSSANRSNQVPEKPNYALRFTLLGHTEAVSSVKFSPDGEWLASSA
ADKLIKIWSVRDGKYEKTLCGHSLEISDVAWSSDSSRLVSASDDKTLKIWEVRSGKCLKT
LKGHSNYVFCCNFNPLSNLIVSGSFDESVKIWEVETGKCLKTLSAHSDPVSAVHFNCSGS
LIVSGSYDGLCRIWDAASGQCLKTLVDDDNPPVSFVQFSPNGKYILTATLDSTLKLWDYS
RGRCLKTYTGHKNEKYCIFANFSVTGGKWIVSGSEDNMVYIWNLQTKEIVQKLQGHTDVV
ISATCHPTENIIASAALENDKTIKLWMSDH
NT seq 993 nt
atggccacagaggaggcaggagatagcaaaggggagtcagccctctcctcgtcagccaat
cggagcaatcaagtgcctgaaaaaccaaactatgccctgagatttactctcctgggacat
acggaagcagtgtcatcagttaagtttagtcctgatggagaatggctagcaagttctgct
gctgataaactaattaaaatctggagtgtccgtgacggaaaatacgagaaaacactgtgt
ggtcacagtctggaaatatcagatgttgcctggtcatcagattcgagccgtctagtttct
gcctcagatgataaaaccctaaagatatgggaagtgagatctggaaaatgtttgaaaaca
cttaaggggcacagtaattatgtcttttgctgtaatttcaacccgctatccaacctcatc
gtttcaggatcttttgatgagagtgtgaaaatatgggaggtggagacaggaaagtgcctc
aagacgttgtccgctcattctgacccagtgtctgcagttcactttaattgtagtgggtcc
ttgatagtatcaggtagctacgatggtctctgtagaatctgggacgctgcatcaggtcag
tgtttaaaaaccctggttgatgatgataaccctcctgtctcttttgtacagttttctcca
aatggtaaatacattctcactgcgactttggacagtactctgaaactatgggattatagc
agaggcagatgcctgaaaacgtacactggacacaagaatgaaaaatactgcatatttgcc
aatttttcagttactggtggaaagtggatcgtgtctggttcagaggataacatggtttac
atttggaaccttcagactaaagagattgtacagaaattacaaggtcacacagatgttgtg
atctcagcaacttgccatcctacagaaaacatcattgcgtcagcagcgttagaaaatgac
aaaacaattaaactatggatgagcgaccactaa

KEGG   Loxodonta africana (African savanna elephant): 100673925
Entry
100673925         CDS       T04351                                 

Gene name
WDR5
Definition
(RefSeq) WD repeat-containing protein 5
  KO
K14963  COMPASS component SWD3
Organism
lav  Loxodonta africana (African savanna elephant)
Pathway
lav04934  Cushing syndrome
Brite
KEGG Orthology (KO) [BR:lav00001]
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04934 Cushing syndrome
    100673925 (WDR5)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:lav03036]
    100673925 (WDR5)
Chromosome and associated proteins [BR:lav03036]
 Eukaryotic type
  Histone modification proteins
   HAT complexes
    ATAC complex
     100673925 (WDR5)
    NSL complex
     100673925 (WDR5)
   HMT complexes
    COMPASS/SET1 complex
     100673925 (WDR5)
    COMPASS/SET1 complex (yeast)
     100673925 (WDR5)
    MLL-HCF complex
     100673925 (WDR5)
    MLL3/MLL4 complex
     100673925 (WDR5)
SSDB
Motif
Pfam: WD40 ANAPC4_WD40 Nup160 eIF2A WD40_like Ge1_WD40 Frtz VID27 CDtoxinA Nucleoporin_N
Other DBs
NCBI-GeneID: 100673925
NCBI-ProteinID: XP_003423046
LinkDB
Position
Unknown
AA seq 334 aa
MATEEKKPETEAARAQPTPSSSATQSKPTPVKPNYALKFTLAGHTKAVSSVKFSPNGEWL
ASSSADKLIKIWGAYDGKFEKTISGHKLGISDVAWSSDSNLLVSASDDKTLKIWDVSSGK
CLKTLKGHSNYVFCCNFNPQSNLIVSGSFDESVRIWDVKTGKCLKTLPAHSDPVSAVHFN
RDGSLIVSSSYDGLCRIWDTASGQCLKTLIDDDNPPVSFVKFSPNGKYILAATLDNTLKL
WDYSKGKCLKTYTGHKNEKYCIFANFSVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGH
TDVVISTACHPTENIIASAALENDKTIKLWKSDC
NT seq 1005 nt
atggcaacggaggagaagaagccggagaccgaagctgccagggcacagccgaccccttcg
tcgtcggccactcagagcaagcctacacctgtcaaaccaaactacgctctgaagttcacc
ttggctggccacacaaaagccgtgtcctccgtgaagttcagtcccaatggagaatggctg
gccagttcctctgctgataaactcattaaaatctggggtgcttatgatggaaaatttgag
aaaaccatatctggtcacaaactgggtatatcagatgtggcctggtcgtcagactccaac
ctgctcgtctctgcctcagacgacaagaccttaaagatctgggacgtgagctcggggaag
tgtctgaaaaccctgaagggacacagtaattacgtcttttgctgcaacttcaacccccag
tccaacctcatcgtctccggatcttttgatgaaagtgtgaggatatgggatgtgaagaca
gggaagtgcctcaagacgttgcctgctcactcggacccagtttcagccgttcacttcaac
cgagatggatccttgatagtttcaagtagctatgatggcctctgtcggatctgggacaca
gcctcaggccagtgcttgaagacgctgatagatgatgacaaccccccggtgtcctttgtg
aaattctctccgaatggcaagtacatcctggctgctacgctggacaacacgctgaaactc
tgggactacagcaaggggaagtgcctgaagacatacaccggtcacaagaatgagaaatac
tgcatatttgcgaacttttcagttactggtggcaagtggatcgtgtccggttctgaagat
aacctggtttacatctggaatctccagacgaaagagattgtacagaagttacaaggccac
acagatgttgtgatctcaacggcgtgtcacccaacagagaacatcatcgcatcagctgcg
ttagaaaacgacaaaacgattaaactgtggaagagtgactgttaa
