KEGG   Homo sapiens (human): 8314
Entry
8314              CDS       T01001                                 
Symbol
BAP1, HUCEP-13, KURIS, TPDS1, UBM2, UCHL2, UVM2, hucep-6
Name
(RefSeq) BRCA1 associated deubiquitinase 1
  KO
K08588  ubiquitin carboxyl-terminal hydrolase BAP1 [EC:3.4.19.12]
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Disease
H02623  Kury-Isidor syndrome
H02624  Tumor predisposition syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    8314 (BAP1)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    8314 (BAP1)
  09182 Protein families: genetic information processing
   04121 Ubiquitin system [BR:hsa04121]
    8314 (BAP1)
   03036 Chromosome and associated proteins [BR:hsa03036]
    8314 (BAP1)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.19  Omega peptidases
    3.4.19.12  ubiquitinyl hydrolase 1
     8314 (BAP1)
Peptidases and inhibitors [BR:hsa01002]
 Cysteine peptidases
  Family C12: ubiquitin C-terminal hydrolase family
   8314 (BAP1)
Ubiquitin system [BR:hsa04121]
 Deubiquitinating enzyme (DUB)
  Ubiquitin-specific proteases (UBPs)
   UCH
    8314 (BAP1)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     8314 (BAP1)
SSDB
Motif
Pfam: Peptidase_C12 UCH_C
Other DBs
NCBI-GeneID: 8314
NCBI-ProteinID: NP_004647
OMIM: 603089
HGNC: 950
Ensembl: ENSG00000163930
UniProt: Q92560
Structure
LinkDB
Position
3:complement(52401008..52410008)
AA seq 729 aa
MNKGWLELESDPGLFTLLVEDFGVKGVQVEEIYDLQSKCQGPVYGFIFLFKWIEERRSRR
KVSTLVDDTSVIDDDIVNNMFFAHQLIPNSCATHALLSVLLNCSSVDLGPTLSRMKDFTK
GFSPESKGYAIGNAPELAKAHNSHARPEPRHLPEKQNGLSAVRTMEAFHFVSYVPITGRL
FELDGLKVYPIDHGPWGEDEEWTDKARRVIMERIGLATAGEPYHDIRFNLMAVVPDRRIK
YEARLHVLKVNRQTVLEALQQLIRVTQPELIQTHKSQESQLPEESKSASNKSPLVLEANR
APAASEGNHTDGAEEAAGSCAQAPSHSPPNKPKLVVKPPGSSLNGVHPNPTPIVQRLPAF
LDNHNYAKSPMQEEEDLAAGVGRSRVPVRPPQQYSDDEDDYEDDEEDDVQNTNSALRYKG
KGTGKPGALSGSADGQLSVLQPNTINVLAEKLKESQKDLSIPLSIKTSSGAGSPAVAVPT
HSQPSPTPSNESTDTASEIGSAFNSPLRSPIRSANPTRPSSPVTSHISKVLFGEDDSLLR
VDCIRYNRAVRDLGPVISTGLLHLAEDGVLSPLALTEGGKGSSPSIRPIQGSQGSSSPVE
KEVVEATDSREKTGMVRPGEPLSGEKYSPKELLALLKCVEAEIANYEACLKEEVEKRKKF
KIDDQRRTHNYDEFICTFISMLAQEGMLANLVEQNISVRRRQGVSIGRLHKQRKPDRRKR
SRPYKAKRQ
NT seq 2190 nt   +upstreamnt  +downstreamnt
atgaataagggctggctggagctggagagcgacccaggcctcttcaccctgctcgtggaa
gatttcggtgtcaagggggtgcaagtggaggagatctacgaccttcagagcaaatgtcag
ggccctgtatatggatttatcttcctgttcaaatggatcgaagagcgccggtcccggcga
aaggtctctaccttggtggatgatacgtccgtgattgatgatgatattgtgaataacatg
ttctttgcccaccagctgatacccaactcttgtgcaactcatgccttgctgagcgtgctc
ctgaactgcagcagcgtggacctgggacccaccctgagtcgcatgaaggacttcaccaag
ggtttcagccctgagagcaaaggatatgcgattggcaatgccccggagttggccaaggcc
cataatagccatgccaggcccgagccacgccacctccctgagaagcagaatggccttagt
gcagtgcggaccatggaggcgttccactttgtcagctatgtgcctatcacaggccggctc
tttgagctggatgggctgaaggtctaccccattgaccatgggccctggggggaggacgag
gagtggacagacaaggcccggcgggtcatcatggagcgtatcggcctcgccactgcaggg
gagccctaccacgacatccgcttcaacctgatggcagtggtgcccgaccgcaggatcaag
tatgaggccaggctgcatgtgctgaaggtgaaccgtcagacagtactagaggctctgcag
cagctgataagagtaacacagccagagctgattcagacccacaagtctcaagagtcacag
ctgcctgaggagtccaagtcagccagcaacaagtccccgctggtgctggaagcaaacagg
gcccctgcagcctctgagggcaaccacacagatggtgcagaggaggcggctggttcatgc
gcacaagccccatcccacagccctcccaacaaacccaagctagtggtgaagcctccaggc
agcagcctcaatggggttcaccccaaccccactcccattgtccagcggctgccggccttt
ctagacaatcacaattatgccaagtcccccatgcaggaggaagaagacctggcggcaggt
gtgggccgcagccgagttccagtccgcccaccccagcagtactcagatgatgaggatgac
tatgaggatgacgaggaggatgacgtgcagaacaccaactctgcccttaggtataagggg
aagggaacagggaagccaggggcattgagcggttctgctgatgggcaactgtcagtgctg
cagcccaacaccatcaacgtcttggctgagaagctcaaagagtcccagaaggacctctca
attcctctgtccatcaagactagcagcggggctgggagtccggctgtggcagtgcccaca
cactcgcagccctcacccacccccagcaatgagagtacagacacggcctctgagatcggc
agtgctttcaactcgccactgcgctcgcctatccgctcagccaacccgacgcggccctcc
agccctgtcacctcccacatctccaaggtgctttttggagaggatgacagcctgctgcgt
gttgactgcatacgctacaaccgtgctgtccgtgatctgggtcctgtcatcagcacaggc
ctgctgcacctggctgaggatggggtgctgagtcccctggcgctgacagagggtgggaag
ggttcctcgccctccatcagaccaatccaaggcagccaggggtccagcagcccagtggag
aaggaggtcgtggaagccacggacagcagagagaagacggggatggtgaggcctggcgag
cccttgagtggggagaaatactcacccaaggagctgctggcactgctgaagtgtgtggag
gctgagattgcaaactatgaggcgtgcctcaaggaggaggtagagaagaggaagaagttc
aagattgatgaccagagaaggacccacaactacgatgagttcatctgcacctttatctcc
atgctggctcaggaaggcatgctggccaacctagtggagcagaacatctccgtgcggcgg
cgccaaggggtcagcatcggccggctccacaagcagcggaagcctgaccggcggaaacgc
tctcgcccctacaaggccaagcgccagtga

KEGG   Homo sapiens (human): 171023
Entry
171023            CDS       T01001                                 
Symbol
ASXL1, BOPS, MDS
Name
(RefSeq) ASXL transcriptional regulator 1
  KO
K11471  additional sex combs-like protein
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Disease
H01481  Myelodysplastic syndrome
H02047  Bohring-Opitz syndrome
H02410  Myelodysplastic/myeloproliferative neoplasms
H02411  Chronic myelomonocytic leukemia
H02412  Atypical chronic myeloid leukemia
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    171023 (ASXL1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    171023 (ASXL1)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     171023 (ASXL1)
SSDB
Motif
Pfam: ASXH PHD_3 HARE-HTH
Other DBs
NCBI-GeneID: 171023
NCBI-ProteinID: NP_056153
OMIM: 612990
HGNC: 18318
Ensembl: ENSG00000171456
UniProt: Q8IXJ9
Structure
LinkDB
Position
20:32358331..32439319
AA seq 1541 aa
MKDKQKKKKERTWAEAARLVLENYSDAPMTPKQILQVIEAEGLKEMRSGTSPLACLNAML
HSNSRGGEGLFYKLPGRISLFTLKKDALQWSRHPATVEGEEPEDTADVESCGSNEASTVS
GENDVSLDETSSNASCSTESQSRPLSNPRDSYRASSQANKQKKKTGVMLPRVVLTPLKVN
GAHVESASGFSGCHADGESGSPSSSSSGSLALGSAAIRGQAEVTQDPAPLLRGFRKPATG
QMKRNRGEEIDFETPGSILVNTNLRALINSRTFHALPSHFQQQLLFLLPEVDRQVGTDGL
LRLSSSALNNEFFTHAAQSWRERLADGEFTHEMQVRIRQEMEKEKKVEQWKEKFFEDYYG
QKLGLTKEESLQQNVGQEEAEIKSGLCVPGESVRIQRGPATRQRDGHFKKRSRPDLRTRA
RRNLYKKQESEQAGVAKDAKSVASDVPLYKDGEAKTDPAGLSSPHLPGTSSAAPDLEGPE
FPVESVASRIQAEPDNLARASASPDRIPSLPQETVDQEPKDQKRKSFEQAASASFPEKKP
RLEDRQSFRNTIESVHTEKPQPTKEEPKVPPIRIQLSRIKPPWVVKGQPTYQICPRIIPT
TESSCRGWTGARTLADIKARALQVRGARGHHCHREAATTAIGGGGGPGGGGGGATDEGGG
RGSSSGDGGEACGHPEPRGGPSTPGKCTSDLQRTQLLPPYPLNGEHTQAGTAMSRARRED
LPSLRKEESCLLQRATVGLTDGLGDASQLPVAPTGDQPCQALPLLSSQTSVAERLVEQPQ
LHPDVRTECESGTTSWESDDEEQGPTVPADNGPIPSLVGDDTLEKGTGQALDSHPTMKDP
VNVTPSSTPESSPTDCLQNRAFDDELGLGGSCPPMRESDTRQENLKTKALVSNSSLHWIP
IPSNDEVVKQPKPESREHIPSVEPQVGEEWEKAAPTPPALPGDLTAEEGLDPLDSLTSLW
TVPSRGGSDSNGSYCQQVDIEKLKINGDSEALSPHGESTDTASDFEGHLTEDSSEADTRE
AAVTKGSSVDKDEKPNWNQSAPLSKVNGDMRLVTRTDGMVAPQSWVSRVCAVRQKIPDSL
LLASTEYQPRAVCLSMPGSSVEATNPLVMQLLQGSLPLEKVLPPAHDDSMSESPQVPLTK
DQSHGSLRMGSLHGLGKNSGMVDGSSPSSLRALKEPLLPDSCETGTGLARIEATQAPGAP
QKNCKAVPSFDSLHPVTNPITSSRKLEEMDSKEQFSSFSCEDQKEVRAMSQDSNSNAAPG
KSPGDLTTSRTPRFSSPNVISFGPEQTGRALGDQSNVTGQGKKLFGSGNVAATLQRPRPA
DPMPLPAEIPPVFPSGKLGPSTNSMSGGVQTPREDWAPKPHAFVGSVKNEKTFVGGPLKA
NAENRKATGHSPLELVGHLEGMPFVMDLPFWKLPREPGKGLSEPLEPSSLPSQLSIKQAF
YGKLSKLQLSSTSFNYSSSSPTFPKGLAGSVVQLSHKANFGASHSASLSLQMFTDSSTVE
SISLQCACSLKAMIMCQGCGAFCHDDCIGPSKLCVLCLVVR
NT seq 4626 nt   +upstreamnt  +downstreamnt
atgaaggacaaacagaagaagaagaaggagcgcacgtgggccgaggccgcgcgcctggta
ttagaaaactactcggatgctccaatgacaccaaaacagattctgcaggtcatagaggca
gaaggactaaaggaaatgagaagtgggacttcccctctcgcatgcctcaatgctatgcta
cattccaattcaagaggaggagaggggttgttttataaactgcctggccgaatcagcctt
ttcacgctcaagaaggatgccctgcagtggtctcgccatccagctacagtggagggagag
gagccagaggacacggctgatgtggagagctgtgggtctaatgaagccagcactgtgagt
ggtgaaaacgatgtatctcttgatgaaacatcttcgaacgcatcctgttctacagaatct
cagagtcgacctctttccaatcccagggacagctacagagcttcctcacaggcgaacaaa
caaaagaaaaagactggggtgatgctgcctcgagttgtcctgactcctctgaaggtaaac
ggggcccacgtggaatctgcatcagggttctcgggctgccacgccgatggcgagagcggc
agcccgtccagcagcagcagcggctctctggccctgggcagcgctgctattcgtggccag
gccgaggtcacccaggaccctgccccgctcctgagaggcttccggaagccagccacaggt
caaatgaagcgcaacagaggggaagaaatagattttgagacacctgggtccattcttgtc
aacaccaacctccgtgccctgatcaactctcggaccttccatgccttaccatcacacttc
cagcagcagctcctcttcctcctgcctgaagtagacagacaggtggggacggatggcctg
ttgcgtctcagcagcagtgcactaaataacgagttttttacccatgcggctcagagctgg
cgggagcgcctggctgatggtgaatttactcatgagatgcaagtcaggatacgacaggaa
atggagaaggaaaagaaggtggaacaatggaaagaaaagttctttgaagactactatgga
cagaagctgggtttgaccaaagaagagtcattgcagcagaacgtgggccaggaggaggct
gaaatcaaaagtggcttgtgtgtcccaggagaatcagtgcgtatacagcgtggtccagcc
acccgacagcgagatgggcattttaagaaacgctctcggccagatctccgaaccagagcc
agaaggaatctgtacaaaaaacaggagtcagaacaagcaggggttgctaaggatgcaaaa
tctgtggcctcagatgttcccctctacaaggatggggaggctaagactgacccagcaggg
ctgagcagtccccatctgccaggcacatcctctgcagcacccgacctggagggtcccgaa
ttcccagttgagtctgtggcttctcggatccaggctgagccagacaacttggcacgtgcc
tctgcatctccagacagaattcctagcctgcctcaggaaactgtggatcaggaacccaag
gatcagaagaggaaatcctttgagcaggcggcctctgcatcctttcccgaaaagaagccc
cggcttgaagatcgtcagtcctttcgtaacacaattgaaagtgttcacaccgaaaagcca
cagcccactaaagaggagcccaaagtcccgcccatccggattcaactttcacgtatcaaa
ccaccctgggtggttaaaggtcagcccacttaccagatatgcccccggatcatccccacc
acggagtcctcctgccggggttggactggcgccaggaccctcgcagacattaaagcccgt
gctctgcaggtccgaggggcgagaggtcaccactgccatagagaggcggccaccactgcc
atcggaggggggggtggcccgggtggaggtggcggcggggccaccgatgagggaggtggc
agaggcagcagcagtggtgatggtggtgaggcctgtggccaccctgagcccaggggaggc
ccgagcacccctggaaagtgtacgtcagatctacagcgaacacaactactgccgccttat
cctctaaatggggagcatacccaggccggaactgccatgtccagagctaggagagaggac
ctgccttctctgagaaaggaggaaagctgcctactacagagggctacagttggactcaca
gatgggctaggagatgcctcccaactccccgttgctcccactggggaccagccatgccag
gccttgcccctactgtcctcccaaacctcagtagctgagagattagtggagcagcctcag
ttgcatccggatgttagaactgaatgtgagtctggcaccacttcctgggaaagtgatgat
gaggagcaaggacccaccgttcctgcagacaatggtcccattccgtctctagtgggagat
gatacattagagaaaggaactggccaagctcttgacagtcatcccactatgaaggatcct
gtaaatgtgacccccagttccacacctgaatcctcaccgactgattgcctgcagaacaga
gcatttgatgacgaattagggcttggtggctcatgccctcctatgagggaaagtgatact
agacaagaaaacttgaaaaccaaggctctcgtttctaacagttctttgcattggataccc
atcccatcgaatgatgaggtagtgaaacagcccaaaccagaatccagagaacacatacca
tctgttgagccccaggttggagaggagtgggagaaagctgctcccacccctcctgcattg
cctggggatttgacagctgaggagggtctagatcctcttgacagccttacttcactctgg
actgtgccatctcgaggaggcagtgacagcaatggcagttactgtcaacaggtggacatt
gaaaagctgaaaatcaacggagactctgaagcactgagtcctcacggtgagtccacggat
acagcctctgactttgaaggtcacctcacggaggacagcagtgaggctgacactagagaa
gctgcagtgacaaagggatcttcggtggacaaggatgagaaacccaattggaaccaatct
gccccactgtccaaggtgaatggtgacatgcgtctggttacaaggacagatgggatggtt
gctcctcagagctgggtgtctcgagtatgtgcggtccgccaaaagatcccagattcccta
ctgctggccagtactgagtaccagccaagagccgtgtgcctgtccatgcctgggtcctca
gtggaggccactaacccacttgtgatgcagttgctgcagggtagcttgcccctagagaag
gttcttccaccagcccacgatgacagcatgtcagaatccccacaagtaccacttacaaaa
gaccagagccatggctcgctacgcatgggatctttacatggtcttggaaaaaacagtggc
atggttgatggaagcagccccagttctttaagggctttgaaggagcctcttctgccagat
agctgtgaaacaggcactggtcttgccaggattgaggccacccaggctcctggagcaccc
caaaagaattgcaaggcagtcccaagttttgactccctccatccagtgacaaatcccatt
acatcctctaggaaactggaagaaatggattccaaagagcagttctcttcctttagttgt
gaagatcagaaggaagtccgtgctatgtcacaggacagtaattcaaatgctgctccagga
aagagcccaggagatcttactacctcgagaacacctcgtttctcatctccaaatgtgatc
tcctttggtccagagcagacaggtcgggccctgggtgatcagagcaatgttacaggccaa
gggaagaagctttttggctctgggaatgtggctgcaacccttcagcgccccaggcctgcg
gacccgatgcctcttcctgctgagatccctccagtttttcccagtgggaagttgggacca
agcacaaactccatgtctggtggggtacagactccaagggaagactgggctccaaagcca
catgcctttgttggcagcgtcaagaatgagaagacttttgtggggggtcctcttaaggca
aatgccgagaacaggaaagctactgggcatagtcccctggaactggtgggtcacttggaa
gggatgccctttgtcatggacttgcccttctggaaattaccccgagagccagggaagggg
ctcagtgagcctctggagccttcttctctcccctcccaactcagcatcaagcaggcattt
tatgggaagctttctaaactccaactgagttccaccagctttaattattcctctagctct
cccacctttcccaaaggccttgctggaagtgtggtgcagctgagccacaaagcaaacttt
ggtgcgagccacagtgcatcactttccttgcaaatgttcactgacagcagcacggtggaa
agcatctcgctccagtgtgcgtgcagcctgaaagccatgatcatgtgccaaggctgcggt
gcgttctgtcacgatgactgtattggaccctcaaagctctgtgtattgtgccttgtggtg
agataa

KEGG   Homo sapiens (human): 55252
Entry
55252             CDS       T01001                                 
Symbol
ASXL2, ASXH2, SHAPNS
Name
(RefSeq) ASXL transcriptional regulator 2
  KO
K11471  additional sex combs-like protein
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Disease
H02803  Neurodevelopmental disorder with histone modification defect
H02855  Shashi-Pena syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    55252 (ASXL2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    55252 (ASXL2)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     55252 (ASXL2)
SSDB
Motif
Pfam: ASXH PHD_3 HARE-HTH
Other DBs
NCBI-GeneID: 55252
NCBI-ProteinID: NP_060733
OMIM: 612991
HGNC: 23805
Ensembl: ENSG00000143970
UniProt: Q76L83
Structure
LinkDB
Position
2:complement(25733753..25878487)
AA seq 1435 aa
MREKGRRKKGRTWAEAAKTVLEKYPNTPMSHKEILQVIQREGLKEIRSGTSPLACLNAML
HTNSRGEEGIFYKVPGRMGVYTLKKDVPDGVKELSEGSEESSDGQSDSQSSENSSSSSDG
GSNKEGKKSRWKRKVSSSSPQSGCPSPTIPAGKVISPSQKHSKKALKQALKQQQQKKQQQ
QCRPSISISSNQHLSLKTVKAASDSVPAKPATWEGKQSDGQTGSPQNSNSSFSSSVKVEN
TLLGLGKKSFQRSERLHTRQMKRTKCADIDVETPDSILVNTNLRALINKHTFSVLPGDCQ
QRLLLLLPEVDRQVGPDGLMKLNGSALNNEFFTSAAQGWKERLSEGEFTPEMQVRIRQEI
EKEKKVEPWKEQFFESYYGQSSGLSLEDSKKLTASPSDPKVKKTPAEQPKSMPVSEASLI
RIVPVVSQSECKEEALQMSSPGRKEECESQGEVQPNFSTSSEPLLSSALNTHELSSILPI
KCPKDEDLLEQKPVTSAEQESEKNHLTTASNYNKSESQESLVTSPSKPKSPGVEKPIVKP
TAGAGPQETNMKEPLATLVDQSPESLKRKSSLTQEEAPVSWEKRPRVTENRQHQQPFQVS
PQPFLNRGDRIQVRKVPPLKIPVSRISPMPFHPSQVSPRARFPVSITSPNRTGARTLADI
KAKAQLVKAQRAAAAAAAAAAAAASVGGTIPGPGPGGGQGPGEGGEGQTARGGSPGSDRV
SETGKGPTLELAGTGSRGGTRELLPCGPETQPQSETKTTPSQAQPHSVSGAQLQQTPPVP
PTPAVSGACTSVPSPAHIEKLDNEKLNPTRATATVASVSHPQGPSSCRQEKAPSPTGPAL
ISGASPVHCAADGTVELKAGPSKNIPNPSASSKTDASVPVAVTPSPLTSLLTTATLEKLP
VPQVSATTAPAGSAPPSSTLPAASSLKTPGTSLNMNGPTLRPTSSIPANNPLVTQLLQGK
DVPMEQILPKPLTKVEMKTVPLTAKEERGMGALIATNTTENSTREEVNERQSHPATQQQL
GKTLQSKQLPQVPRPLQLFSAKELRDSSIDTHQYHEGLSKATQDQILQTLIQRVRRQNLL
SVVPPSQFNFAHSGFQLEDISTSQRFMLGFAGRRTSKPAMAGHYLLNISTYGRGSESFRR
THSVNPEDRFCLSSPTEALKMGYTDCKNATGESSSSKEDDTDEESTGDEQESVTVKEEPQ
VSQSAGKGDTSSGPHSRETLSTSDCLASKNVKAEIPLNEQTTLSKENYLFTRGQTFDEKT
LARDLIQAAQKQMAHAVRGKAIRSSPELFSSTVLPLPADSPTHQPLLLPPLQTPKLYGSP
TQIGPSYRGMINVSTSSDMDHNSAVPGSQVSSNVGDVMSFSVTVTTIPASQAMNPSSHGQ
TIPVQAFSEENSIEGTPSKCYCRLKAMIMCKGCGAFCHDDCIGPSKLCVSCLVVR
NT seq 4308 nt   +upstreamnt  +downstreamnt
atgagggaaaagggacgtaggaagaagggcaggacctgggcggaggccgccaagacggtc
ttagaaaaataccccaatacacccatgagtcataaagaaattcttcaagttatccagaga
gaaggactaaaagaaatcagaagtgggacttctcctcttgcatgcctgaatgcaatgctt
cacacaaactccagaggtgaagagggcatcttctataaggttccaggtagaatgggagta
tatactttgaagaaagatgtgccggatggggtgaaagagctgtcagaaggttcagaagaa
agcagtgatggtcagtcagattcccagagttctgagaacagcagcagcagcagtgatggt
ggcagcaacaaggagggaaaaaagagcaggtggaaaaggaaagtatcgtcgtcctccccg
cagtcaggctgcccatcacccaccattccagcaggtaaagtcatttctccatcacagaag
cacagcaagaaggcactaaagcaggcgctaaagcagcaacagcagaagaagcagcagcag
caatgcaggccaagcatatccatctcctccaaccagcatctctcactaaagactgtcaaa
gcagccagtgactctgtacctgccaaacctgcaacatgggaaggaaagcaatctgatgga
cagacaggcagccctcaaaactcaaactccagcttttcttcctcagttaaagtggaaaat
actttactaggcttggggaagaagtcattccagagatctgagagactccataccagacaa
atgaaaagaactaaatgtgctgacattgacgttgagacaccggactccattctggttaat
acaaatctgcgagcactgatcaacaagcacacattttcagtccttcctggagattgccag
caacgactgcttttactactcccagaggtagatcgacaggttggtccagatggtttaatg
aagttaaatggctcagcccttaacaatgaattcttcacttcagcagcccaaggctggaag
gaaagactctcagaaggtgagtttacacctgagatgcaggtgagaattcgacaagagatt
gagaaggagaaaaaagtggagccatggaaagaacaattctttgaaagctactatgggcag
agttctggcctgagccttgaagattctaagaaattgacagcttctcccagtgatcccaaa
gtaaagaaaaccccagctgaacaaccaaaatccatgcctgtgtcagaggcctctcttatc
agaatagttccagtagtctcccagtcagagtgtaaagaagaagcattgcaaatgtcatca
ccaggcagaaaagaagagtgtgaaagccaaggtgaagtgcagccgaacttctccacatct
tcagagcccctgctttcctcagctctcaatacacatgagcttagcagcattcttcccatc
aagtgcccaaaggatgaggatctcttggagcagaagccagtcacctctgctgaacaggaa
tctgagaagaaccatctcaccacagcttctaattataacaaaagtgaaagccaagaatct
ttagttacatcgccaagcaaacccaagagtcctggggttgaaaaaccaatagtgaagccc
acagcaggagcgggtccacaggagactaatatgaaagaacctctagcaactcttgttgat
cagagcccagaaagcctcaagaggaagtcttccctcacccaagaagaggcccctgtgagc
tgggagaagaggccacgtgtcactgagaatcgccagcaccagcagccatttcaggtctca
ccacagccctttctcaatagaggggacagaatccaggtgcgaaaagtaccacctctcaag
atcccggtctccagaatctcccccatgccgtttcatccatcgcaggtctctcccagggct
cgttttccagtctccatcactagtcctaacagaacaggagccagaactcttgcagacatc
aaagcaaaagcccaactggtcaaagcacagagggcagcagctgccgctgccgccgcagct
gctgcagccgcctcagttggagggaccattccaggacctggcccagggggtggacaaggt
ccaggagagggtggtgaagggcagactgctagaggaggcagtccaggctcagacagagtc
agtgaaactggaaagggccccacactggaactggcaggaactggaagcaggggaggtacg
agagagcttttaccctgtggtccagagactcagccccagtctgagaccaagaccacccca
agccaggcacagcctcatagtgtctctggagcacaactacagcaaacccccccagtgcct
ccaacacctgccgtcagtggagcatgcacaagtgtcccatcaccagcccacatagagaaa
ttggataatgaaaaactgaaccccaccagagcaacagccacagtggcctctgtcagccat
ccacaagggcccagtagttgcagacaggagaaagcaccttctccaacaggtcctgctcta
atctcaggtgcctcacctgttcattgtgcagctgatggcacagttgagctcaaagcaggt
cctagtaagaatatacctaacccttcagcctcatcaaagacagatgctagtgtgccagtg
gctgtaactccctcccctttaacatctttattgaccacagccactttagaaaagcttcct
gtaccccaggtcagtgcaactacagcacctgctggatcagctccaccctcgagcactttg
ccagcagcttctagccttaaaaccccaggaacttctttaaacatgaatggacccacttta
agaccaacctctagtatccctgctaataatcctttagtgactcagctgcttcaaggcaaa
gatgttcccatggagcaaattctgcctaaacctctcaccaaagttgaaatgaaaacggtt
ccactgactgcaaaagaggaaagggggatgggagcgctcatagctaccaacacaacagaa
aatagcaccagagaggaagttaatgagagacagtcccatccagctacgcagcagcagctg
ggcaaaaccttgcaaagtaagcagctcccccaggttccaaggccccttcagctcttttca
gctaaggagctgagggactccagcattgacacacaccaataccacgaaggactaagtaaa
gcaacccaagatcagatccttcagactctcattcagagggttcggaggcagaatcttctc
tcagttgtgccgccctcacagttcaacttcgctcactcaggtttccagctggaagacatc
tccacaagccagaggttcatgctgggttttgctggcagaaggacatccaaacctgcaatg
gcagggcactacttactgaatatttctacctacggccggggctcagagagctttaggagg
acccattctgtaaaccctgaagatcgtttttgtctaagcagccccactgaagccttgaaa
atgggatatacagactgtaaaaatgcaacaggagagagtagcagcagcaaagaagatgac
actgatgaggaaagtactggtgatgagcaggaatctgtcacagtgaaagaggagccccag
gtttcccagagtgctggcaagggtgacacaagttcaggacctcacagcagggaaactcta
tctaccagtgattgcttagctagcaagaatgtgaaggctgagataccattgaatgagcaa
accactttaagtaaggagaattacctgttcactagaggccaaacatttgatgaaaagacc
ctagccagagatttaattcaggcagcacagaagcagatggctcatgcagtgagaggtaag
gcaatccgtagcagccccgagcttttcagttctactgttcttcctctgcctgcagacagc
cccacccaccagcccctactccttccacccctgcaaaccccgaagttgtatggaagcccc
acccagatagggccaagctatagaggcatgatcaatgtctccacctcatctgacatggac
cataactctgctgtaccaggtagccaggtatctagcaatgtaggtgatgtcatgtcattt
tcagtgactgtcactaccatccctgctagccaagctatgaatcccagcagccatggccag
accattcctgttcaggcgttctccgaagagaacagcatagagggcacgccttcgaaatgt
tactgccgcttgaaagccatgatcatgtgcaaaggctgtggcgctttctgccatgatgat
tgcatcggcccctccaaactgtgcgtctcctgccttgtcgttcggtaa

KEGG   Homo sapiens (human): 80816
Entry
80816             CDS       T01001                                 
Symbol
ASXL3, BRPS, KIAA1713
Name
(RefSeq) ASXL transcriptional regulator 3
  KO
K11471  additional sex combs-like protein
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Disease
H02382  Bainbridge-Ropers syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    80816 (ASXL3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    80816 (ASXL3)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     80816 (ASXL3)
SSDB
Motif
Pfam: ASXH HARE-HTH PHD_3
Other DBs
NCBI-GeneID: 80816
NCBI-ProteinID: NP_085135
OMIM: 615115
HGNC: 29357
Ensembl: ENSG00000141431
UniProt: Q9C0F0
LinkDB
Position
18:33578219..33751195
AA seq 2248 aa
MKDKRKKKDRTWAEAARLALEKHPNSPMTAKQILEVIQKEGLKETSGTSPLACLNAMLHT
NTRIGDGTFFKIPGKSGLYALKKEESSCPADGTLDLVCESELDGTDMAEANAHGEENGVC
SKQVTDEASSTRDSSLTNTAVQSKLVSSFQQHTKKALKQALRQQQKRRNGVSMMVNKTVP
RVVLTPLKVSDEQSDSPSGSESKNGEADSSDKEMKHGQKSPTGKQTSQHLKRLKKSGLGH
LKWTKAEDIDIETPGSILVNTNLRALINKHTFASLPQHFQQYLLLLLPEVDRQMGSDGIL
RLSTSALNNEFFAYAAQGWKQRLAEGEFTPEMQLRIRQEIEKEKKTEPWKEKFFERFYGE
KLGMSREESVKLTTGPNNAGAQSSSSCGTSGLPVSAQTALAEQQPKSMKSPASPEPGFCA
TLCPMVEIPPKDIMAELESEDILIPEESVIQEEIAEEVETSICECQDENHKTIPEFSEEA
ESLTNSHEEPQIAPPEDNLESCVMMNDVLETLPHIEVKIEGKSESPQEEMTVVIDQLEVC
DSLIPSTSSMTHVSDTEHKESETAVETSTPKIKTGSSSLEGQFPNEGIAIDMELQSDPEE
QLSENACISETSFSSESPEGACTSLPSPGGETQSTSEESCTPASLETTFCSEVSSTENTD
KYNQRNSTDENFHASLMSEISPISTSPEISEASLMSNLPLTSEASPVSNLPLTSETSPMS
DLPLTSETSSVSSMLLTSETTFVSSLPLPSETSPISNSSINERMAHQQRKSPSVSEEPLS
PQKDESSATAKPLGENLTSQQKNLSNTPEPIIMSSSSIAPEAFPSEDLHNKTLSQQTCKS
HVDTEKPYPASIPELASTEMIKVKNHSVLQRTEKKVLPSPLELSVFSEGTDNKGNELPSA
KLQDKQYISSVDKAPFSEGSRNKTHKQGSTQSRLETSHTSKSSEPSKSPDGIRNESRDSE
ISKRKTAEQHSFGICKEKRARIEDDQSTRNISSSSPPEKEQPPREEPRVPPLKIQLSKIG
PPFIIKSQPVSKPESRASTSTSVSGGRNTGARTLADIKARAQQARAQREAAAAAAVAAAA
SIVSGAMGSPGEGGKTRTLAHIKEQTKAKLFAKHQARAHLFQTSKETRLPPPLSSKEGPP
NLEVSSTPETKMEGSTGVIIVNPNCRSPSNKSAHLRETTTVLQQSLNPSKLPETATDLSV
HSSDENIPVSHLSEKIVSSTSSENSSVPMLFNKNSVPVSVCSTAISGAIKEHPFVSSVDK
SSVLMSVDSANTTISACNISMLKTIQGTDTPCIAIIPKCIESTPISATTEGSSISSSMDD
KQLLISSSSASNLVSTQYTSVPTPSIGNNLPNLSTSSVLIPPMGINNRFPSEKIAIPGSE
EQATVSMGTTVRAALSCSDSVAVTDSLVAHPTVAMFTGNMLTINSYDSPPKLSAESLDKN
SGPRNRADNSGKPQQPPGGFAPAAINRSIPCKVIVDHSTTLTSSLSLTVSVESSEASLDL
QGRPVRTEASVQPVACPQVSVISRPEPVANEGIDHSSTFIAASAAKQDSKTLPATCTSLR
ELPLVPDKLNEPTAPSHNFAEQARGPAPFKSEADTTCSNQYNPSNRICWNDDGMRSTGQP
LVTHSGSSKQKEYLEQSCPKAIKTEHANYLNVSELHPRNLVTNVALPVKSELHEADKGFR
MDTEDFPGPELPPPAAEGASSVQQTQNMKASTSSPMEEAISLATDALKRVPGAGSSGCRL
SSVEANNPLVTQLLQGNLPLEKVLPQPRLGAKLEINRLPLPLQTTSVGKTAPERNVEIPP
SSPNPDGKGYLAGTLAPLQMRKRENHPKKRVARTVGEHTQVKCEPGKLLVEPDVKGVPCV
ISSGISQLGHSQPFKQEWLNKHSMQNRIVHSPEVKQQKRLLPSCSFQQNLFHVDKNGGFH
TDAGTSHRQQFYQMPVAARGPIPTAALLQASSKTPVGCNAFAFNRHLEQKGLGEVSLSSA
PHQLRLANMLSPNMPMKEGDEVGGTAHTMPNKALVHPPPPPPPPPPPPLALPPPPPPPPP
LPPPLPNAEVPSDQKQPPVTMETTKRLSWPQSTGICSNIKSEPLSFEEGLSSSCELGMKQ
VSYDQNEMKEQLKAFALKSADFSSYLLSEPQKPFTQLAAQKMQVQQQQQLCGNYPTIHFG
STSFKRAASAIEKSIGILGSGSNPATGLSGQNAQMPVQNFADSSNADELELKCSCRLKAM
IVCKGCGAFCHDDCIGPSKLCVACLVVR
NT seq 6747 nt   +upstreamnt  +downstreamnt
atgaaagacaagaggaagaagaaggaccgcacctgggccgaggctgcccgcctggcacta
gaaaaacaccccaactcaccaatgacagcaaagcagatattggaagtcattcagaaagaa
gggttaaaagaaacaagtggaacctctccattagcctgtctgaatgcaatgcttcacact
aacactcgaataggggatggaacattcttcaaaatccctggaaagtcaggcctctatgct
ctcaaaaaagaggagtcgtcatgcccagcagatggcacgttggatttagtctgtgaatct
gaattggatggtacagatatggccgaggcaaatgcccatggagaagaaaatggagtttgt
tcgaagcaggtaactgatgaagcatcttccactcgagattcaagccttactaacacagca
gtgcaaagcaagttagtgtcttccttccagcagcacaccaaaaaggctcttaaacaggct
ttgaggcagcagcagaaaagaagaaatggagtctcaatgatggtaaacaagactgttcct
cgtgttgttttgacaccattaaaggtgtctgatgagcagtcggattcgccttcaggatct
gaatctaaaaatggtgaagcagacagttcagataaagaaatgaaacatgggcaaaaatct
cccactggaaaacaaacaagtcagcacttaaaacgattaaaaaagtctggtttagggcac
ttgaaatggaccaaagctgaggacattgacatagaaaccccaggatctattcttgtcaac
actaacttgagggcattaataaataaacatacgtttgcttccttacctcagcattttcaa
caatacctcctgcttttgctcccagaagtggataggcagatgggaagtgatggaatttta
cgcctcagtacttcagctctaaataatgaattctttgcatatgcagcacaagggtggaaa
cagcgactggcagaaggagagtttaccccagaaatgcagttgcggataaggcaagaaatt
gagaaggaaaagaaaacagaaccttggaaagaaaaattctttgagaggttttatggagaa
aagctgggcatgtcaagagaggaatctgtgaagctcactactggaccaaacaacgctgga
gctcaaagtagttcttcatgtgggacttctggccttccagtttctgcacagacagccttg
gcagaacaacagccaaaaagcatgaaaagcccagcttctccagagcctggtttctgtgct
actctttgccctatggtagaaattccacctaaagatataatggcagaattggagtcagag
gatatcttgatccctgaagaatctgtaattcaggaggaaattgcagaagaggtagagact
agtatctgtgaatgccaggatgaaaatcataagacaatacctgaattttctgaggaggct
gaaagtctaaccaattctcatgaagaaccccaaatagcacctcctgaagataacttggaa
tcctgtgttatgatgaatgatgttttagaaactttgcctcatattgaagttaagatagaa
gggaagtcagaatcaccccaggaagaaatgacagttgttatcgatcagttagaagtctgt
gactctcttattccttccacttcatctatgactcatgtcagtgacacagaacataaggag
tcagaaactgcagtagagaccagtacccccaaaataaaaacagggtcatcttctctagaa
ggccagtttccaaatgaaggaattgctatagatatggagctacagagtgaccctgaagaa
cagctttcagaaaatgcctgcatctctgaaacgtccttttcttctgagagcccagaggga
gcctgtaccagcctgccttctccaggaggggaaacacagtccacatcagaagaatcatgt
actccagcctcccttgagacaacattttgttctgaggtatctagcactgaaaatacagac
aaatacaaccagagaaattccactgatgaaaactttcatgcatctttgatgtcagaaata
tctccaatatccacttcacctgaaatatcagaagcatctcttatgtccaacttaccatta
acatctgaagcatcaccagtatccaacttacctttaacatcagaaacctcaccgatgtct
gacttacctttaacatcagaaacttcttcagtgtcttccatgcttctcacctctgagacc
acttttgtatccagtttgccacttccttcagaaacatctccaatttccaactcttccata
aatgagagaatggcacatcagcaaagaaagtcaccttctgtatctgaagagccactctcc
ccgcagaaagatgagtcttccgccactgccaaacctctgggagagaaccttacctcccag
cagaagaatctgtctaatactcccgaacccatcataatgagttcttcttccattgctcct
gaagcatttccgtctgaagatttgcacaataagaccctgagtcagcaaacctgtaaatca
catgttgacactgagaagccctaccctgcttcaattccagaacttgcttctactgaaatg
ataaaagttaaaaatcatagcgtcctgcaaagaacagaaaaaaaagtgttaccttcacca
ttggaattatctgtcttttctgaagggacagataataagggaaatgagcttccatctgct
aaattacaggacaagcaatatatctcatcagtggataaggctccattttcagaaggctct
agaaataaaacacataagcaagggagtacacagagtcggttagaaacctcacatacttcc
aagtcatcagagccctccaagtcacctgatgggataagaaatgaaagtagagattcagag
atatcaaagagaaaaactgcagagcaacacagctttggaatctgtaaggaaaagagagct
aggatagaagatgatcagtcaacccggaacatatcatctagcagcccacctgagaaagaa
cagcctcccagagaggaaccaagggttccccctctcaagattcagctttccaaaattggg
ccaccttttataatcaagagccaaccagtctccaaacctgagtctcgagcatccactagc
acatctgtcagtggcgggaggaacacaggagccaggaccctcgcagatatcaaggcccgg
gcccaacaagctcgggcccagcgagaggctgctgcagctgctgctgtggctgctgcagcg
agcattgtctctggagccatgggaagtccaggagagggtggaaagacgagaactctggca
cacatcaaagagcagacaaaggctaagctctttgcaaagcatcaagctcgagcccatctc
ttccagacctctaaagagacccggttgcctcctccgctcagctcaaaggaagggcctcca
aacttagaagtctcttctacccctgaaacaaaaatggaaggttcgactggtgtcattatt
gtcaatccaaactgtagatctcctagcaacaagtctgcccacctccgggagaccaccact
gtactacagcagtctcttaacccaagtaaacttccagaaactgccactgacttatctgtg
catagttctgatgaaaacatacctgtgtcacatttatctgagaaaattgtttcatctacc
tcttctgaaaatagcagtgtgcccatgctttttaataaaaattctgtccctgtatctgtt
tgcagcactgctatatcgggagcaattaaagaacatccctttgtgagttctgttgataaa
tcctctgtcctaatgtctgttgacagtgcaaacactacaatttctgcttgtaatataagc
atgttaaaaaccatccagggaactgacactccatgcatagccattataccaaaatgtatt
gaaagcactcccatttcagccactacagagggctccagcatatcaagctccatggatgat
aagcagttactaatatcaagcagcagtgctagtaacttagtctccactcagtacacctct
gtgccaactccctccatcggaaacaatttgccaaacctctccactagctctgtcttgatt
cccccaatgggaattaacaacagatttccttctgagaagatagccatacctgggagtgaa
gaacaggccactgtatccatgggtaccactgtgagagcagccctcagctgcagtgattct
gtagcggtcacagactctctggttgcacacccgaccgtcgcaatgtttactggaaacatg
ctgacaataaactcttatgatagtcctcccaagttaagtgctgaaagcttggacaaaaat
tcagggcctcgaaacagggcagataattctggaaaacctcagcaaccaccagggggcttt
gcaccagcagccataaaccgatcaattccgtgtaaagtcatcgttgaccacagcaccacg
ctgacctccagtttgtctctgactgtctccgttgaaagctcagaagccagcttggacctg
cagggcagaccagtgaggacagaggcatccgtacagcccgtggcgtgtcctcaggtgtct
gtgattagcaggcctgagccagttgccaacgaaggtatagatcacagttccactttcatt
gctgcttcggcagcaaaacaagacagtaaaacattgccggccacctgcacaagtctccga
gaattaccccttgttccagataaattaaatgagccgactgctcccagtcataactttgct
gagcaggcacgtggcccagctcctttcaaaagtgaagcagacacaacctgtagcaatcag
tataacccaagtaaccggatttgctggaatgatgatgggatgaggagcacaggacagcct
ctggttactcactcgggttcaagtaaacaaaaagaatatctagagcaaagctgtccaaag
gctatcaaaactgaacatgccaactacttgaacgtgtcagaacttcatcccaggaatctt
gtaacaaatgttgctcttcctgtgaaatctgaacttcacgaagcagacaagggctttaga
atggacactgaagacttccctggccctgagctgcctcctccggctgcagagggagcctct
agtgtacaacaaacacagaacatgaaagcttccacctcaagtcccatggaagaggctatt
tccttggctaccgatgccctgaagagagtccctggtgcagggagctcaggctgtcgtctg
tcctctgtggaggctaacaatccgctggtgacgcagttactacagggcaacctgcctttg
gaaaaagtgttgccacagcccagattgggagccaagcttgaaatcaacaggcttccattg
cctcttcaaactacctcagtgggtaaaacagcaccagagagaaacgttgaaattccgccc
agctctccaaatccagatggtaagggctacttggcagggactctggcaccactccaaatg
agaaagcgagaaaaccaccccaaaaagagagtagctaggactgtaggagaacacactcaa
gttaaatgtgaaccaggaaaattgttggtggagccagatgttaaaggggtgccttgtgtc
atcagttccggcatcagtcagctaggacacagccagccatttaagcaagaatggctaaac
aagcactccatgcagaacagaattgttcacagccctgaggtcaaacagcaaaagcggctg
ctcccctcgtgtagcttccagcagaacctatttcatgttgacaagaatggcggcttccac
actgacgctggtacctcacacagacagcagttttaccaaatgcctgtggctgccaggggc
cccattcctactgcagctctgttacaggcctcttccaagaccccagtggggtgtaatgca
tttgccttcaacaggcatcttgaacagaagggattgggagaggttagtctttcctcagca
cctcaccagctaaggttagccaacatgttatcccccaatatgcccatgaaagaaggtgat
gaggtgggaggcactgcacacacaatgccaaacaaagcactagtacatccgccgccgcca
ccgcctccccctccccctccacccttggctttgcccccgcctccccccccaccacctccg
ctacctccacctctccctaatgcagaagtcccatctgatcaaaaacaacctccagttacc
atggaaaccactaagagacttagttggccacagtccacgggcatatgtagcaatataaaa
tcggaacctctttcttttgaggaaggtttaagcagcagctgtgaactgggcatgaaacaa
gtttcctatgaccagaatgaaatgaaagaacagttaaaagcattcgcgctaaaaagtgca
gatttctcttcctatttgctttctgagccacaaaagccttttacccaattagctgctcag
aaaatgcaggtgcagcaacaacagcagctctgtggaaattatccaacaatacactttggt
agcacgagtttcaaaagggcagcatctgcaattgaaaagtccattgggattttgggaagt
ggctccaatcctgccacaggcttgtctggtcagaacgctcagatgcccgttcagaacttt
gccgacagcagcaatgcagatgaattggaactgaaatgctcttgccggctgaaagccatg
attgtgtgcaaaggctgtggggccttctgccatgacgactgcataggtccttcaaaactt
tgtgtagcatgcctggttgtacgataa

KEGG   Homo sapiens (human): 8473
Entry
8473              CDS       T01001                                 
Symbol
OGT, HINCUT-1, HRNT1, MRX106, O-GLCNAC, OGT1, XLID106
Name
(RefSeq) O-linked N-acetylglucosamine (GlcNAc) transferase
  KO
K09667  protein O-GlcNAc transferase [EC:2.4.1.255]
Organism
hsa  Homo sapiens (human)
Pathway
hsa00514  Other types of O-glycan biosynthesis
hsa03083  Polycomb repressive complex
hsa04931  Insulin resistance
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Disease
H00480  X-linked intellectual developmental disorder
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09100 Metabolism
  09107 Glycan biosynthesis and metabolism
   00514 Other types of O-glycan biosynthesis
    8473 (OGT)
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    8473 (OGT)
 09160 Human Diseases
  09167 Endocrine and metabolic disease
   04931 Insulin resistance
    8473 (OGT)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01003 Glycosyltransferases [BR:hsa01003]
    8473 (OGT)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    8473 (OGT)
Enzymes [BR:hsa01000]
 2. Transferases
  2.4  Glycosyltransferases
   2.4.1  Hexosyltransferases
    2.4.1.255  protein O-GlcNAc transferase
     8473 (OGT)
Glycosyltransferases [BR:hsa01003]
 O-Glycan biosynthesis
  O-linked GlcNAc type
   8473 (OGT)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HAT complexes
    NSL complex
     8473 (OGT)
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     8473 (OGT)
SSDB
Motif
Pfam: Glyco_transf_41 TPR_1 TPR_11 TPR_2 TPR_17 TPR_8 TPR_12 TPR_19 TPR_16 TPR_14 TPR_10 TPR_7 TPR_Slam TPR_CcmH_CycH TPR_6 TPR_9 TPR-S NatA_aux_su ARM_TT21_5th TPR_15 TPR_MalT TPR_NPHP3 ARM_TT21_C FAT HAT_PRP39_C TPR_21 SHPRH_helical-1st ARM_TT21_2nd Wzy_C_2 MIT EST1_DNA_bind HAT_PRP39_N ANAPC3 SNAP HAT_Syf1_CNRKL1_N BTAD ARM_TT21 TPR_EMC2 Suf HAT_Syf1_CNRKL1_C Tcf25 TPR_20
Other DBs
NCBI-GeneID: 8473
NCBI-ProteinID: NP_858058
OMIM: 300255
HGNC: 8127
Ensembl: ENSG00000147162
UniProt: O15294
Structure
LinkDB
Position
X:71533104..71575892
AA seq 1046 aa
MASSVGNVADSTEPTKRMLSFQGLAELAHREYQAGDFEAAERHCMQLWRQEPDNTGVLLL
LSSIHFQCRRLDRSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKP
DFIDGYINLAAALVAAGDMEGAVQAYVSALQYNPDLYCVRSDLGNLLKALGRLEEAKACY
LKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAYINLGNVLKEARI
FDRAVAAYLRALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAIELQPHFPDAYCNLA
NALKEKGSVAEAEDCYNTALRLCPTHADSLNNLANIKREQGNIEEAVRLYRKALEVFPEF
AAAHSNLASVLQQQGKLQEALMHYKEAIRISPTFADAYSNMGNTLKEMQDVQGALQCYTR
AIQINPAFADAHSNLASIHKDSGNIPEAIASYRTALKLKPDFPDAYCNLAHCLQIVCDWT
DYDERMKKLVSIVADQLEKNRLPSVHPHHSMLYPLSHGFRKAIAERHGNLCLDKINVLHK
PPYEHPKDLKLSDGRLRVGYVSSDFGNHPTSHLMQSIPGMHNPDKFEVFCYALSPDDGTN
FRVKVMAEANHFIDLSQIPCNGKAADRIHQDGIHILVNMNGYTKGARNELFALRPAPIQA
MWLGYPGTSGALFMDYIITDQETSPAEVAEQYSEKLAYMPHTFFIGDHANMFPHLKKKAV
IDFKSNGHIYDNRIVLNGIDLKAFLDSLPDVKIVKMKCPDGGDNADSSNTALNMPVIPMN
TIAEAVIEMINRGQIQITINGFSISNGLATTQINNKAATGEEVPRTIIVTTRSQYGLPED
AIVYCNFNQLYKIDPSTLQMWANILKRVPNSVLWLLRFPAVGEPNIQQYAQNMGLPQNRI
IFSPVAPKEEHVRRGQLADVCLDTPLCNGHTTGMDVLWAGTPMVTMPGETLASRVAASQL
TCLGCLELIAKNRQEYEDIAVKLGTDLEYLKKVRGKVWKQRISSPLFNTKQYTMELERLY
LQMWEHYAAGNKPDHMIKPVEVTESA
NT seq 3141 nt   +upstreamnt  +downstreamnt
atggcgtcttccgtgggcaacgtggccgacagcacagaaccaacgaaacgtatgctttcc
ttccaagggttagctgagttggcacatcgagaatatcaggcaggagattttgaggcagct
gagagacactgcatgcagctctggagacaagagccagacaatactggtgtgcttttatta
ctttcatctatacacttccagtgtcgaaggctggacagatctgctcactttagcactctg
gcaattaaacagaacccccttctggcagaagcttattcgaatttggggaatgtgtacaag
gaaagagggcagttgcaggaggcaattgagcattatcgacatgcattgcgtctcaaacct
gatttcatcgatggttatattaacctggcagccgccttggtagcagcgggtgacatggaa
ggggcagtacaagcttacgtctctgctcttcagtacaatcctgatttgtactgtgttcgc
agtgacctggggaacctgctcaaagccctgggtcgcttggaagaagccaaggcatgttat
ttgaaagcaattgagacgcaaccgaactttgcagtagcttggagtaatcttggctgtgtt
ttcaatgcacaaggggaaatttggcttgcaattcatcactttgaaaaggctgtcaccctt
gacccaaactttctggatgcttatatcaatttaggaaatgtcttgaaagaggcacgcatt
tttgacagagctgtggcagcttatcttcgtgccctaagtttgagtccaaatcacgcagtg
gtgcacggcaacctggcttgtgtatactatgagcaaggcctgatagatctggcaatagac
acctacaggcgggctatcgaactacaaccacatttccctgatgcttactgcaacctagcc
aatgctctcaaagagaagggcagtgttgctgaagcagaagattgttataatacagctctc
cgtctgtgtcccacccatgcagactctctgaataacctagccaatatcaaacgagaacag
ggaaacattgaagaggcagttcgcttgtatcgtaaagcattagaagtcttcccagagttt
gctgctgcccattcaaatttagcaagtgtactgcagcagcagggaaaactgcaggaagct
ctgatgcattataaggaggctattcgaatcagtcctacctttgctgatgcctactctaat
atgggaaacactctaaaggagatgcaggatgttcagggagccttgcagtgttatacgcgt
gccatccaaattaatcctgcatttgcagatgcacatagcaatctggcttccattcataag
gattcagggaatattccagaagccatagcttcttaccgcacggctctgaaacttaagcct
gattttcctgatgcttattgtaacttggctcattgcctgcagattgtctgtgattggaca
gactatgatgagcgaatgaagaagttggtcagtattgtggctgaccagttagagaagaat
aggttgccttctgtgcatcctcatcatagtatgctatatcctctttctcatggcttcagg
aaggctattgctgagaggcacggcaacctgtgcttagataagattaatgttcttcataaa
ccaccatatgaacatccaaaagacttgaagctcagtgatggtcggctgcgtgtaggatat
gtgagttccgactttgggaatcatcctacttctcaccttatgcagtctattccaggcatg
cacaatcctgataaatttgaggtgttctgttatgccctgagcccagacgatggcacaaac
ttccgagtgaaggtgatggcagaagccaatcatttcattgatctttctcagattccatgc
aatggaaaagcagctgatcgcatccatcaggatggaattcatatccttgtaaatatgaat
ggctatactaagggcgctcgaaatgagctttttgctctcaggccagctcctattcaggca
atgtggctgggataccctgggacgagtggtgcgcttttcatggattatattatcactgat
caggaaacttcgccagctgaagttgctgagcagtattccgagaaattggcttatatgccc
cacactttttttattggtgatcatgctaatatgttccctcacctgaagaaaaaagcagtc
atcgattttaagtccaatgggcacatttatgacaatcggatagttctgaatggcatcgac
ctcaaagcatttcttgatagtctaccagatgtgaaaattgtcaagatgaagtgtcctgat
ggaggagacaatgcagatagcagtaacacagctcttaatatgcctgttattcctatgaat
actattgcagaagcagttattgaaatgattaaccgaggacagattcaaataacaattaat
ggattcagtattagcaatggactggcaactactcagatcaacaataaggctgcaactgga
gaggaggttccccgtaccattattgtaaccacccgttctcagtacgggttaccagaagat
gccatcgtatactgtaactttaatcagttgtataaaattgacccttctactttgcagatg
tgggcaaacattctgaagcgtgttcccaatagtgtactctggctgttgcgttttccagca
gtaggagaacctaatattcaacagtatgcacaaaacatgggcctgccccagaaccgtatc
attttttcacctgttgctcctaaagaggaacacgtcaggagaggccagctggctgatgtc
tgcttggacactccactctgtaatgggcacaccacagggatggatgtcctctgggcaggg
acccccatggtgactatgccaggagagactcttgcttctcgagttgcagcatcccagctc
acttgcttaggttgtcttgagcttattgctaaaaacagacaagaatatgaagacatagct
gtgaagctgggaactgatctagaatacctgaagaaagttcgtggcaaagtctggaagcaa
agaatatctagccctctgttcaacaccaaacaatacacaatggaactagagcggctctat
ctacagatgtgggagcattatgcagctggcaacaaacctgaccacatgattaagcctgtt
gaagtcactgagtcagcataa

KEGG   Homo sapiens (human): 3054
Entry
3054              CDS       T01001                                 
Symbol
HCFC1, CFF, HCF, HCF-1, HCF1, HFC1, MAHCX, MRX3, PPP1R89, VCAF, XLID3
Name
(RefSeq) host cell factor C1
  KO
K14966  host cell factor 1
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
hsa04980  Cobalamin transport and metabolism
hsa05168  Herpes simplex virus 1 infection
Network
nt06168  Herpes simplex virus 1 (HSV-1)
nt06523  Epigenetic regulation by Polycomb complexes
nt06538  Cobalamin transport and metabolism
  Element
N00588  HSV VP16 to Oct-1-mediated transcription
N01585  Deubiquitination of H2AK119
N01810  Regulation of MMACHC expression
Disease
H00480  X-linked intellectual developmental disorder
H02222  Methylmalonic acidemia and hyperhomocysteinemia, cblX type
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    3054 (HCFC1)
 09150 Organismal Systems
  09154 Digestive system
   04980 Cobalamin transport and metabolism
    3054 (HCFC1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05168 Herpes simplex virus 1 infection
    3054 (HCFC1)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:hsa01009]
    3054 (HCFC1)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    3054 (HCFC1)
   03029 Mitochondrial biogenesis [BR:hsa03029]
    3054 (HCFC1)
Protein phosphatases and associated proteins [BR:hsa01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     3054 (HCFC1)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HAT complexes
    NSL complex
     3054 (HCFC1)
   HMT complexes
    COMPASS/SET1 complex
     3054 (HCFC1)
    MLL-HCF complex
     3054 (HCFC1)
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     3054 (HCFC1)
Mitochondrial biogenesis [BR:hsa03029]
 Mitochondrial quality control factors
  Regulator of mitochondrial biogenesis
   Other regulator of mitochondrial biogenesis
    3054 (HCFC1)
SSDB
Motif
Pfam: Kelch_KLHDC2_KLHL20_DRC7 Beta-prop_ATRN-LZTR1 Kelch_5 Kelch_3 Kelch_4 Kelch_1 Kelch_6 Kelch_2 fn3
Other DBs
NCBI-GeneID: 3054
NCBI-ProteinID: NP_005325
OMIM: 300019
HGNC: 4839
Ensembl: ENSG00000172534
UniProt: P51610
Structure
LinkDB
Position
X:complement(153947557..153971818)
AA seq 2035 aa
MASAVSPANLPAVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGGNEGIVDELH
VYNTATNQWFIPAVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKYSNDLYELQASRWEW
KRLKAKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNIPRYLNDLYILELRP
GSGVVAWDIPITYGVLPPPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTL
TWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTNTLACLN
LDTMAWETILMDTLEDNIPRARAGHCAVAINTRLYIWSGRDGYRKAWNNQVCCKDLWYLE
TEKPPPPARVQLVRANTNSLEVSWGAVATADSYLLQLQKYDIPATAATATSPTPNPVPSV
PANPPKSPAPAAAAPAVQPLTQVGITLLPQAAPAPPTTTTIQVLPTVPGSSISVPTAART
QGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSS
PQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTMAVTPGTTTLPATVKVASSPV
MVSNPATRMLKTAAAQVGTSVSSATNTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKT
ITLVKSPISVPGGSALISNLGKVMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGT
ILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTKPGTTTIIKTIPMSAIITQ
AGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPG
QPGTILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGH
STSASLATPITTLGTIATLSSQVINPTAITVSAAQTTLTAAGGLTTPTITMQPVSQPTQV
TLITAPSGVEAQPVHDLPVSILASPTTEQPTATVTIADSGQGDVQPGTVTLVCSNPPCET
HETGTTNTATTTVVANLGGHPQPTQVQFVCDRQEAAASLVTSTVGQQNGSVVRVCSNPPC
ETHETGTTNTATTATSNMAGQHGCSNPPCETHETGTTNTATTAMSSVGANHQRDARRACA
AGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLGPSMARE
PGGRSPAFVQLAPLSSKVRLSSPSIKDLPAGRHSHAVSTAAMTRSSVGAGEPRMAPVCES
LQGGSPSTTVTVTALEALLCPSATVTQVCSNPPCETHETGTTNTATTSNAGSAQRVCSNP
PCETHETGTTHTATTATSNGGTGQPEGGQQPPAGRPCETHQTTSTGTTMSVSVGALLPDA
TSSHRTVESGLEVAAAPSVTPQAGTALLAPFPTQRVCSNPPCETHETGTTHTATTVTSNM
SSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPP
PEELQVSPGPRQQLPPRQLLQSASTALMGESAEVLSASQTPELPAAVDLSSTGEPSSGQE
SAGSAVVATVVVQPPPPTQSEVDQLSLPQELMAEAQAGTTTLMVTGLTPEELAVTAAAEA
AAQAAATEEAQALAIQAVLQAAQQAVMGTGEPMDTSEAAATVTQAELGHLSAEGQEGQAT
TIPIVLTQQELAALVQQQQLQEAQAQQQHHHLPTEALAPADSLNDPAIESNCLNELAGTV
PSTVALLPSTATESLAPSNTFVAPQPVVVASPAKLQAAATLTEVANGIESLGVKPDLPPP
PSKAPMKKENQWFDVGVIKGTNVMVTHYFLPPDDAVPSDDDLGTVPDYNQLKKQELQPGT
AYKFRVAGINACGRGPFSEISAFKTCLPGFPGAPCAIKISKSPDGAHLTWEPPSVTSGKI
IEYSVYLAIQSSQAGGELKSSTPAQLAFMRVYCGPSPSCLVQSSSLSNAHIDYTTKPAII
FRIAARNEKGYGPATQVRWLQETSKDSSGTKPANKRPMSSPEMKSAPKKSKADGQ
NT seq 6108 nt   +upstreamnt  +downstreamnt
atggcttcggccgtgtcgcccgccaacttgccagcggtgcttctgcagccccgctggaag
cgagtggtgggctggtcgggtccggtgccacggccccgccacggccaccgcgccgtggcc
atcaaggagctcatcgtggtgtttggcggcggcaacgagggaatagtggacgaactgcac
gtgtacaacacggcaaccaaccagtggttcatcccagccgtgaggggggacattccccct
gggtgtgcagcctatggcttcgtgtgtgacgggactcgcctcctggtgtttggtgggatg
gtggagtatgggaaatacagcaatgacctctacgaactccaggcgagccggtgggagtgg
aagagactcaaagcaaagacgcccaaaaacgggccccctccgtgtcctcgactcgggcac
agcttctcccttgtgggcaacaaatgctacctgtttgggggtctggccaatgatagcgag
gacccaaagaacaacattccaaggtacctgaatgacttatatatcctggaattacggcca
ggctctggagtggtagcctgggacattcccatcacttacggggtcctaccaccaccccgg
gagtcacatactgccgtggtctacaccgaaaaagacaataagaagtccaagctggtgatc
tacggcgggatgagtggctgcaggctgggggacctgtggaccctagatattgacaccctg
acgtggaataagcccagtctcagcggggtggcgcctcttcctcgcagtctccactcggca
accaccatcggaaataaaatgtacgtgtttggtggctgggtgcctctcgtcatggatgac
gtcaaagtggccacacacgagaaggagtggaagtgtaccaacacgctggcttgtctcaac
ctggataccatggcctgggagaccatcctgatggatacactggaggacaacatcccccgt
gctcgggctggccactgcgcagtcgccatcaacacccgcctgtacatttggagtgggcgt
gacggctaccgcaaggcctggaacaaccaggtctgctgcaaggacctctggtacctagag
acagaaaagccaccacccccagcccgagtacaactggtacgcgccaacaccaactccctg
gaggtgagctggggggcagtggcaacagccgacagctaccttctccagctccagaaatat
gacattcctgccacggctgctactgccacctcccctacacccaatccggtcccatctgtg
cctgccaaccctcccaagagccctgccccagcagcagccgcacctgctgtgcagccgctg
acccaagtaggcatcacgctcctgccccaggctgcccccgcacccccgaccaccaccacc
atccaggtcttgccaacggtgcctggcagctccatttctgtgcccaccgcagccaggact
caaggtgtccctgctgttctcaaagtgaccggtcctcaggctacaacaggaactccattg
gtcaccatgcgacctgccagccaggctgggaaagcccctgtcaccgtgacctcccttccc
gccggagtgcggatggttgtgccaacacagagtgcccagggaacggtgattggcagtagc
ccacagatgagtgggatggccgcactggccgctgcggccgctgccacccagaagatcccc
ccttcctcggcacccacggtgctgagtgtcccagcgggtaccaccatcgtgaagaccatg
gctgtgacacctggcactaccaccctcccagccactgtgaaggtggcctcctcgccagtc
atggtgagcaaccctgccactcgcatgctgaagactgcagccgcccaggtggggacatcg
gtttcctccgccaccaacacgtctacccgccctatcatcacagtgcacaagtcaggcact
gtgacagtggcccagcaagcccaggtggtgaccacagttgtgggcggggtcaccaagacc
atcaccctggtgaagagccccatctctgtcccaggaggcagtgctctgatttccaatctg
ggcaaagtgatgtcggtggtccagaccaaaccagttcagacttcagcagtcacaggccag
gcgtccacgggtcctgtgactcagatcatccagaccaaagggcccctgccagcgggaaca
atcctgaagctggtgacctcagcagatggcaagcccaccaccatcatcactaccacgcag
gccagtggggcggggaccaagcccaccatcctgggcatcagcagcgtctcccccagtacc
accaagcccggcacgaccaccatcatcaaaaccatccccatgtcggccatcatcacccag
gcgggcgccacgggtgtgaccagcagtcctggcatcaagtcccccatcaccatcatcacc
accaaggtgatgacttcaggaactggagcacctgcgaaaatcatcactgctgtccccaaa
attgccactggccacgggcagcagggagtgacccaggtggtgcttaagggggccccggga
cagccaggcaccatcctccgcactgtgcccatggggggtgttcgcctggtcacacccgtc
accgtctccgccgtcaagccagccgtcaccacgttggttgtgaaaggcaccacaggtgtc
acgaccctaggcacagtgacaggcaccgtctccaccagccttgccggggcggggggccac
agcactagtgcttccctggccacgcccatcaccaccttgggcaccattgccaccctctca
agccaggtgatcaaccccactgccatcactgtgtcggccgcacagaccacgctgacagcg
gcaggcgggctcacaaccccaaccatcaccatgcagcccgtgtcccagcccacccaggta
actctgatcacggcacctagtggggtggaggcccagcctgtgcatgacctccctgtgtcc
attctggcctccccgactacagaacagcccaccgccacagttaccatcgccgactcaggc
cagggtgatgtgcagcctggcactgtcaccttggtgtgctccaacccaccctgtgagacc
cacgagactggcaccaccaacacggccaccactactgttgtggctaaccttgggggacac
ccccagcccacccaagtgcagttcgtctgtgacagacaggaggcagctgcttctcttgtg
acctcgactgtgggccagcagaatggtagcgtggtccgagtctgttcgaacccgccctgc
gagacccacgagacgggcaccaccaacaccgccaccaccgccacctccaacatggccggg
cagcatggctgctcaaacccaccctgcgagacccacgagacgggcaccaccaacactgcc
actacagccatgtcgagcgtcggcgccaaccaccagcgagatgcccgtcgggcctgtgca
gctggcacccctgccgtgatccggatcagtgtggccactggggcgctggaggcagcccag
ggctctaagtcccagtgccaaacccgccagaccagcgcgaccagcaccaccatgactgtg
atggccaccggggccccgtgctcggccggcccactccttgggccgagcatggcacgggag
cccgggggccgcagccctgcttttgtgcagttggcccctctgagcagcaaagtcaggctg
agcagcccaagcattaaggaccttcctgcggggcgccacagccatgcggtcagcaccgct
gccatgacccgttccagcgtgggtgctggggagccccgcatggcacctgtgtgcgagagc
ctccagggtggctcgcccagcaccacagtgactgtgacagccctggaggcactgctgtgc
ccctcggccaccgtgacccaagtctgctccaacccaccatgtgagacccacgagacaggc
accaccaacaccgccactacctcgaatgcaggcagcgcccagagggtgtgctccaacccg
ccatgcgagacccacgagacgggcaccacccacacggccaccaccgctacttcaaacggg
ggcacgggccagcccgagggtgggcagcagccccctgctggtcgcccctgtgagacacac
cagaccacttccactggcaccaccatgtcggtcagcgtgggtgccctgcttcccgacgcc
acttcttcccacaggaccgtggagtctggcctagaggtggcggcggcacccagcgtcacc
ccccaggctggcaccgcgctgctggctcctttcccaacacagagggtgtgctccaacccc
ccctgtgagacccacgagacgggcaccactcacacggccaccactgtcacttccaacatg
agttcaaaccaagaccccccacctgctgccagcgatcagggagaggtggagagcacccag
ggcgacagcgtgaacatcaccagctccagtgccatcacgacaaccgtgtcctccacactg
acgcgggctgtgaccaccgtgacgcagtccacaccggtcccgggcccctctgtgccgccc
ccagaggaactccaggtgtcgccaggtcctcgccagcagctgccgccacggcagcttctg
cagtcggcttccacagccctgatgggggagtccgccgaggtcctgtcagcctcccagacc
cctgagctcccggccgccgtggatctgagcagcacaggggagccatcttcgggccaggag
tctgccggctctgcggtggtggccactgtggtggtccagccacccccacccacacagtcc
gaagtagaccagttatcacttccccaagagctaatggccgaggcccaagctggcaccacc
accctcatggtaacggggctcacccccgaggagctggcagtgacggctgctgcagaagca
gctgcccaggccgcagccacggaggaagcccaggccctggccatccaggcggtgctccag
gccgcgcagcaggccgtcatgggcaccggcgagcccatggacacctccgaggcagcagca
accgtgactcaggcggagctggggcacctgtcggccgagggtcaggagggccaggccacc
accatacccattgtgctgacacagcaggagctggctgccctggtgcagcagcagcagctg
caggaggcccaggcccagcagcagcatcaccacctccccactgaggccctggcccctgcc
gacagtctcaacgacccagccattgagagcaattgcctcaatgagctggccggcacggtc
cccagcactgtggcgctgctgccctcaacggccactgagagcctggctccatccaacaca
tttgtggccccccagccggttgtggtggccagcccagccaagctgcaggctgcagctacc
ctgaccgaagtggccaatggcatcgagtccctgggtgtgaagccagacctgccgccccca
cccagcaaagcccccatgaagaaggaaaaccagtggtttgatgtgggagtcattaagggc
accaatgtaatggtgacacactatttcctgccaccagatgatgctgtcccatcagacgat
gatttgggcaccgtccctgactataaccagctgaagaagcaggagctgcagccaggcaca
gcctataagtttcgtgttgccggaatcaatgcctgtggccgggggcccttcagcgaaatc
tcagcctttaagacgtgcctgcctggtttcccaggggccccttgtgccattaaaatcagc
aaaagtccggatggtgctcacctcacctgggagccaccctctgtgacctccggcaagatt
atcgagtactccgtgtacctggccatccagagctcacaggctgggggcgagctcaagagc
tccaccccggcccagctggccttcatgcgggtgtactgcgggcccagcccctcctgcctg
gtgcagtcctccagcctttccaacgcccacatcgactacaccaccaagcccgccatcatc
ttccgcatcgccgcccgcaatgagaagggctatggcccggccacacaagtgaggtggctg
caggaaaccagtaaagacagctctggcaccaagccagccaacaagcggcccatgtcctct
ccagaaatgaaatctgctccaaagaaatctaaggccgatggtcagtga

KEGG   Homo sapiens (human): 221937
Entry
221937            CDS       T01001                                 
Symbol
FOXK1, FOXK1L
Name
(RefSeq) forkhead box K1
  KO
K09404  forkhead box protein K
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    221937 (FOXK1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:hsa03000]
    221937 (FOXK1)
   03036 Chromosome and associated proteins [BR:hsa03036]
    221937 (FOXK1)
Transcription factors [BR:hsa03000]
 Eukaryotic type
  Helix-turn-helix
   Fork head/winged helix other regulators
    221937 (FOXK1)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     221937 (FOXK1)
SSDB
Motif
Pfam: Forkhead FHA
Other DBs
NCBI-GeneID: 221937
NCBI-ProteinID: NP_001032242
OMIM: 616302
HGNC: 23480
Ensembl: ENSG00000164916
UniProt: P85037
Structure
LinkDB
Position
7:4682295..4771442
AA seq 733 aa
MAEVGEDSGARALLALRSAPCSPVLCAAAAAAAFPAAAPPPAPAQPQPPPGPPPPPPPPL
PPGAIAGAGSSGGSSGVSGDSAVAGAAPALVAAAAASVRQSPGPALARLEGREFEFLMRQ
PSVTIGRNSSQGSVDLSMGLSSFISRRHLQLSFQEPHFYLRCLGKNGVFVDGAFQRRGAP
ALQLPKQCTFRFPSTAIKIQFTSLYHKEEAPASPLRPLYPQISPLKIHIPEPDLRSMVSP
VPSPTGTISVPNSCPASPRGAGSSSYRFVQNVTSDLQLAAEFAAKAASEQQADTSGGDSP
KDESKPPFSYAQLIVQAISSAQDRQLTLSGIYAHITKHYPYYRTADKGWQNSIRHNLSLN
RYFIKVPRSQEEPGKGSFWRIDPASEAKLVEQAFRKRRQRGVSCFRTPFGPLSSRSAPAS
PTHPGLMSPRSGGLQTPECLSREGSPIPHDPEFGSKLASVPEYRYSQSAPGSPVSAQPVI
MAVPPRPSSLVAKPVAYMPASIVTSQQPAGHAIHVVQQAPTVTMVRVVTTSANSANGYIL
TSQGAAGGSHDAAGAAVLDLGSEARGLEEKPTIAFATIPAAGGVIQTVASQMAPGVPGHT
VTILQPATPVTLGQHHLPVRAVTQNGKHAVPTNSLAGNAYALTSPLQLLATQASSSAPVV
VTRVCEVGPKEPAAAVAATATTTPATATTASASASSTGEPEVKRSRVEEPSGAVTTPAGV
IAAAGPQGPGTGE
NT seq 2202 nt   +upstreamnt  +downstreamnt
atggccgaagtcggcgaggacagcggcgcccgcgccctgctcgcgctgcgctcggcgccc
tgcagcccagtgctgtgcgccgcagccgccgccgccgccttccccgcggccgcacccccg
ccggcccccgcgcagccccagcctccgcccgggccgccgccgccgccgccaccgccgctg
cctccgggcgcgatcgcgggcgcgggctcctccgggggctcctccggggtatccggggac
tccgcggtcgcgggcgcggcgccggccctggtggccgcggcggccgcctcggtacggcag
agcccggggccggcgctggcgcggctggagggccgcgagttcgagttcctcatgcgccag
cccagcgtcaccatcggccgcaactcgtcgcagggctcggtggacttgagcatgggcctg
tccagcttcatctcgcggcgccacctgcagctcagcttccaggagccgcacttctacctg
cgctgcctcggcaagaacggcgtcttcgtggacggggccttccagagacgcggcgcgccc
gccctgcagctgcccaagcagtgtaccttccggtttcccagcacggccatcaagatccag
ttcacgtcgctctatcacaaagaagaggccccagcctccccgctgcggccactgtacccc
cagatctcccctctgaagatccacatcccggagccggacctccggagcatggtcagcccc
gtcccctccccgacgggcaccatcagtgtccccaactcctgcccagccagtccacgcggt
gccggctcctccagttaccgctttgtgcagaacgtgacctcggacctgcagctggcagca
gagtttgcagcaaaggccgcgtcggagcagcaggcagacacgtctggaggagacagcccc
aaggatgagtcaaagccgccgttctcctacgcgcagctgatcgtgcaggccatctcctcc
gcccaggaccggcagctgaccctgagcgggatctacgcccacatcaccaagcattacccc
tactaccggacggccgacaaaggctggcagaattctatccggcacaacctctctttgaac
cgttactttatcaaagtcccacgttcccaggaggagcctgggaaggggtccttttggcga
atagaccctgcctctgaagccaagctcgtggaacaggcattccggaaacggaggcagagg
ggtgtctcctgcttccgcacccccttcgggcctctgtcctcaaggagcgctccagcttcg
cccacacaccccgggctgatgtcccctcgctccggcggcctgcagaccccagagtgcctg
tctcgggagggctcccccattccacacgaccctgagtttgggtccaagttagcttctgtc
ccagagtaccggtattcccaaagcgcacccggctcccccgtcagcgcccagccagtgatc
atggccgtgcctccccgaccgtccagcctcgtggccaagcccgtggcctacatgcccgcc
tccatcgtaacctcacagcagcccgcgggccacgccatccacgtcgtgcagcaggccccc
accgtcaccatggtcagggtggtcaccacatctgccaactcggccaacggatacatcctc
accagccagggcgcggcggggggctcccatgatgcggcgggcgcagccgtgctggacctg
ggcagcgaggccagaggcctggaggagaaacccaccattgcgtttgccacaatccccgcg
gctggtggagtcatccagacggtggccagccagatggcccccggggtccccggacacacg
gtcaccatcctgcagcccgccacacccgtgaccctcgggcagcaccaccttccagtccgg
gccgtgacccagaacggaaagcatgcggttcccacgaacagtttagccggcaacgcttac
gccctcaccagccctttgcagctccttgcgacccaagcgagttcatccgcgccggtggtg
gtcacccgggtgtgcgaggtggggcccaaggagccagcagcagccgtcgcggccacggcc
accaccaccccagccactgccaccaccgcctctgcctccgcctcttccactggagagccc
gaggtcaaaaggtcccgggtggaggagcccagtggtgctgtaaccacaccggctggagtg
atcgcagctgccggcccccaggggccaggcaccggggagtga

KEGG   Homo sapiens (human): 3607
Entry
3607              CDS       T01001                                 
Symbol
FOXK2, ILF, ILF-1, ILF1, nGTBP
Name
(RefSeq) forkhead box K2
  KO
K09404  forkhead box protein K
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    3607 (FOXK2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:hsa03000]
    3607 (FOXK2)
   03036 Chromosome and associated proteins [BR:hsa03036]
    3607 (FOXK2)
Transcription factors [BR:hsa03000]
 Eukaryotic type
  Helix-turn-helix
   Fork head/winged helix other regulators
    3607 (FOXK2)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     3607 (FOXK2)
SSDB
Motif
Pfam: Forkhead FHA
Other DBs
NCBI-GeneID: 3607
NCBI-ProteinID: NP_004505
OMIM: 147685
HGNC: 6036
Ensembl: ENSG00000141568
UniProt: Q01167
Structure
LinkDB
Position
17:82519732..82604602
AA seq 660 aa
MAAAAAALSGAGTPPAGGGAGGGGAGGGGSPPGGWAVARLEGREFEYLMKKRSVTIGRNS
SQGSVDVSMGHSSFISRRHLEIFTPPGGGGHGGAAPELPPAQPRPDAGGDFYLRCLGKNG
VFVDGVFQRRGAPPLQLPRVCTFRFPSTNIKITFTALSSEKREKQEASESPVKAVQPHIS
PLTINIPDTMAHLISPLPSPTGTISAANSCPSSPRGAGSSGYKVGRVMPSDLNLMADNSQ
PENEKEASGGDSPKDDSKPPYSYAQLIVQAITMAPDKQLTLNGIYTHITKNYPYYRTADK
GWQNSIRHNLSLNRYFIKVPRSQEEPGKGSFWRIDPASESKLIEQAFRKRRPRGVPCFRT
PLGPLSSRSAPASPNHAGVLSAHSSGAQTPESLSREGSPAPLEPEPGAAQPKLAVIQEAR
FAQSAPGSPLSSQPVLITVQRQLPQAIKPVTYTVATPVTTSTSQPPVVQTVHVVHQIPAV
SVTSVAGLAPANTYTVSGQAVVTPAAVLAPPKAEAQENGDHREVKVKVEPIPAIGHATLG
TASRIIQTAQTTPVQTVTIVQQAPLGQHQLPIKTVTQNGTHVASVPTAVHGQVNNAAASP
LHMLATHASASASLPTKRHNGDQPEQPELKRIKTEDGEGIVIALSVDTPPAAVREKGVQN
NT seq 1983 nt   +upstreamnt  +downstreamnt
atggcggcggccgcggcggcgctctcgggcgcgggcacgccacccgcgggcggcggggcc
gggggcggcggggccgggggcggcgggtccccgccgggcggctgggccgtggcgcgcctg
gagggccgcgagttcgagtatctgatgaagaagcgctcggtgaccatcggccgcaactcg
tcgcagggctcggtggacgtgagcatgggccactcgagcttcatctcccggcgccacctc
gagatcttcacgcccccgggcggcggcggccatggcggggccgctccggagctgccgccc
gcgcagcccaggcccgacgccggcggcgacttctacctgcgctgcttgggcaagaacggg
gtattcgtggacggcgtgttccagaggcgcggggcgccgccgctgcagctgccgcgcgtg
tgcacattcaggttcccgagcacaaacatcaagataacgttcactgccctgtccagcgag
aagagagagaagcaggaggcgtctgagtctccagtgaaggccgtacagccacacatctcg
cccctgaccatcaacattccagacaccatggcccacctcatcagccctctgccctccccc
acgggaaccatcagcgctgcaaactcctgcccctccagcccccggggagcggggtcttca
gggtacaaggtgggccgagtgatgccatctgacctcaatttaatggctgacaactcacag
cctgaaaatgaaaaggaagcttcaggtggagacagcccgaaggatgattcaaagccgcct
tactcctacgcgcagctgatagttcaggcgattacgatggctcccgacaaacagctcacc
ctgaacgggatttatacacacatcactaaaaattatccctactacaggactgcggacaag
ggctggcagaattcaattcgccacaatctctctctgaatcgttatttcatcaaagtgccg
cgttcccaggaagaaccaggcaaaggctcgttctggaggatagacccagcctctgaaagc
aaattaatagaacaggcttttaggaaacgacggcctaggggcgtgccctgctttagaacc
cctctgggaccgctctcttctaggagtgccccagcctctcccaatcacgcgggagtgctg
tctgctcactctagtggcgcccagacccctgagagcctgtcgagggaaggttcgccggcc
cccctggagcctgagcctggcgctgcacagcccaaactcgctgtcatccaggaagcccgg
tttgcccagagcgccccagggtcacctctgtccagtcagccagtcttaatcaccgtccag
cggcagctaccacaggccatcaagcctgtcacctacactgtggccaccccagtgaccacc
tcgacctcccagccacccgtcgtgcagacggttcacgtcgtccaccagatcccagcggtg
tcggtcaccagtgtggccggactggccccagcgaacacgtacactgtctctggacaagct
gtggtcaccccggcagccgtgctggcccctcctaaggcagaggcccaggagaatggagac
cacagggaagtcaaagtgaaagtagagcctattcccgccattggccacgccacgctcggc
actgccagccggatcattcagacggcacagaccaccccggtccagacggtgaccatagta
caacaggcacctctaggtcaacaccagctaccaataaaaactgtaacacaaaacggcact
cacgtggcatcagtccccactgcggtccacggccaggtgaacaatgccgcggcgagtcct
ttgcacatgttggcaacacacgcatccgcatcggcctccctgcccacaaagcgccacaac
ggtgaccagccggagcagccggagctgaagcggatcaagacagaagacggcgagggcatc
gtcattgccctgagcgtggacacgccaccggcagccgtaagggaaaagggtgtccagaac
tag

KEGG   Homo sapiens (human): 7528
Entry
7528              CDS       T01001                                 
Symbol
YY1, DELTA, GADEVS, INO80S, NF-E1, UCRBP, YIN-YANG-1
Name
(RefSeq) YY1 transcription factor
  KO
K09201  transcription factor YY
Organism
hsa  Homo sapiens (human)
Pathway
hsa03082  ATP-dependent chromatin remodeling
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Disease
H02490  Gabriele-de Vries syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03082 ATP-dependent chromatin remodeling
    7528 (YY1)
   03083 Polycomb repressive complex
    7528 (YY1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:hsa03000]
    7528 (YY1)
   03036 Chromosome and associated proteins [BR:hsa03036]
    7528 (YY1)
   03029 Mitochondrial biogenesis [BR:hsa03029]
    7528 (YY1)
Transcription factors [BR:hsa03000]
 Eukaryotic type
  Zinc finger
   Cys2His2 others
    7528 (YY1)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     7528 (YY1)
Mitochondrial biogenesis [BR:hsa03029]
 Mitochondrial quality control factors
  Regulator of mitochondrial biogenesis
   Ubiquitas transcription factors
    7528 (YY1)
SSDB
Motif
Pfam: zf-H2C2_2 zf-C2H2 zf-C2H2_4 TFIIIA_zf-C2H2 Raffinose_syn GRP IBR
Other DBs
NCBI-GeneID: 7528
NCBI-ProteinID: NP_003394
OMIM: 600013
HGNC: 12856
Ensembl: ENSG00000100811
UniProt: P25490
Structure
LinkDB
Position
14:100239144..100282788
AA seq 414 aa
MASGDTLYIATDGSEMPAEIVELHEIEVETIPVETIETTVVGEEEEEDDDDEDGGGGDHG
GGGGHGHAGHHHHHHHHHHHPPMIALQPLVTDDPTQVHHHQEVILVQTREEVVGGDDSDG
LRAEDGFEDQILIPVPAPAGGDDDYIEQTLVTVAAAGKSGGGGSSSSGGGRVKKGGGKKS
GKKSYLSGGAGAAGGGGADPGNKKWEQKQVQIKTLEGEFSVTMWSSDEKKDIDHETVVEE
QIIGENSPPDYSEYMTGKKLPPGGIPGIDLSDPKQLAEFARMKPRKIKEDDAPRTIACPH
KGCTKMFRDNSAMRKHLHTHGPRVHVCAECGKAFVESSKLKRHQLVHTGEKPFQCTFEGC
GKRFSLDFNLRTHVRIHTGDRPYVCPFDGCNKKFAQSTNLKSHILTHAKAKNNQ
NT seq 1245 nt   +upstreamnt  +downstreamnt
atggcctcgggcgacaccctctacatcgccacggacggctcggagatgccggccgagatc
gtggagctgcacgagatcgaggtggagaccatcccggtggagaccatcgagaccacagtg
gtgggcgaggaggaggaggaggacgacgacgacgaggacggcggcggtggcgaccacggc
ggcgggggcggccacgggcacgccggccaccaccaccaccaccatcaccaccaccaccac
ccgcccatgatcgctctgcagccgctggtcaccgacgacccgacccaggtgcaccaccac
caggaggtgatcctggtgcagacgcgcgaggaggtggtgggcggcgacgactcggacggg
ctgcgcgccgaggacggcttcgaggatcagattctcatcccggtgcccgcgccggccggc
ggcgacgacgactacattgaacaaacgctggtcaccgtggcggcggccggcaagagcggc
ggcggcggctcgtcgtcgtcgggaggcggccgcgtcaagaagggcggcggcaagaagagc
ggcaagaagagttacctcagcggcggggccggcgcggcgggcggcggcggcgccgacccg
ggcaacaagaagtgggagcagaagcaggtgcagatcaagaccctggagggcgagttctcg
gtcaccatgtggtcctcagatgaaaaaaaagatattgaccatgagacagtggttgaagaa
cagatcattggagagaactcacctcctgattattcagaatatatgacaggaaagaaactt
cctcctggaggaatacctggcattgacctctcagatcccaaacaactggcagaatttgct
agaatgaagccaagaaaaattaaagaagatgatgctccaagaacaatagcttgccctcat
aaaggctgcacaaagatgttcagggataactcggccatgagaaaacatctgcacacccac
ggtcccagagtccacgtctgtgcagaatgtggcaaagcttttgttgagagttcaaaacta
aaacgacaccaactggttcatactggagagaagccctttcagtgcacgttcgaaggctgt
gggaaacgcttttcactggacttcaatttgcgcacacatgtgcgaatccataccggagac
aggccctatgtgtgccccttcgatggttgtaataagaagtttgctcagtcaactaacctg
aaatctcacatcttaacacatgctaaggccaaaaacaaccagtga

KEGG   Homo sapiens (human): 55777
Entry
55777             CDS       T01001                                 
Symbol
MBD5, C2DELq23.1, DEL2Q23.1, MRD1
Name
(RefSeq) methyl-CpG binding domain protein 5
  KO
K23219  methyl-CpG-binding domain protein 5
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
Network
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N01585  Deubiquitination of H2AK119
Disease
H00773  Autosomal dominant intellectual developmental disorder
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    55777 (MBD5)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    55777 (MBD5)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     55777 (MBD5)
  Heterochromatin formation proteins
   MBPs (metylated DNA binding proteins)
    55777 (MBD5)
SSDB
Motif
Pfam: PWWP MBD
Other DBs
NCBI-GeneID: 55777
NCBI-ProteinID: NP_060798
OMIM: 611472
HGNC: 20444
Ensembl: ENSG00000204406
UniProt: Q9P267
LinkDB
Position
2:148020927..148516971
AA seq 1494 aa
MNGGKECDGGDKEGGLPAIQVPVGWQRRVDQNGVLYVSPSGSLLSCLEQVKTYLLTDGTC
KCGLECPLILPKVFNFDPGAAVKQRTAEDVKADEDVTKLCIHKRKIIAVATLHKSMEAPH
PSLVLTSPGGGTNATPVVPSRAATPRSVRNKSHEGITNSVMPECKNPFKLMIGSSNAMGR
LYVQELPGSQQQELHPVYPRQRLGSSEHGQKSPFRGSHGGLPSPASSGSQIYGDGSISPR
TDPLGSPDVFTRSNPGFHGAPNSSPIHLNRTPLSPPSVMLHGSPVQSSCAMAGRTNIPLS
PTLTTKSPVMKKPMCNFSTNMEIPRAMFHHKPPQGPPPPPPPSCALQKKPLTSEKDPLGI
LDPIPSKPVNQNPVIINPTSFHSNVHSQVPMMNVSMPPAVVPLPSNLPLPTVKPGHMNHG
SHVQRVQHSASTSLSPSPVTSPVHMMGTGIGRIEASPQRSRSSSTSSDHGNFMMPPVGPQ
ATSSGIKVPPRSPRSTIGSPRPSMPSSPSTKSDGHHQYKDIPNPLIAGISNVLNTPSSAA
FPTASAGSSSVKSQPGLLGMPLNQILNQHNAASFPASSLLSAAAKAQLANQNKLAGNNSS
SSSNSGAVAGSGNTEGHSTLNTMFPPTANMLLPTGEGQSGRAALRDKLMSQQKDALRKRK
QPPTTVLSLLRQSQMDSSAVPKPGPDLLRKQGQGSFPISSMSQLLQSMSCQSSHLSSNST
PGCGASNTALPCSANQLHFTDPSMNSSVLQNIPLRGEAVHCHNANTNFVHSNSPVPNHHL
AGLINQIQASGNCGMLSQSGMALGNSLHPNPPQSRISTSSTPVIPNSIVSSYNQTSSEAG
GSGPSSSIAIAGTNHPAITKTTSVLQDGVIVTTAAGNPLQSQLPIGSDFPFVGQEHALHF
PSNSTSNNHLPHPLNPSLLSSLPISLPVNQQHLLNQNLLNILQPSAGEGDMSSINNTLSN
HQLTHLQSLLNNNQMFPPNQQQQQLLQGYQNLQAFQGQSTIPCPANNNPMACLFQNFQVR
MQEDAALLNKRISTQPGLTALPENPNTTLPPFQDTPCELQPRIDPSLGQQVKDGLVVGGP
GDASVDAIYKAVVDAASKGMQVVITTAVNSTTQISPIPALSAMSAFTASIGDPLNLSSAV
SAVIHGRNMGGVDHDGRLRNSRGARLPKNLDHGKNVNEGDGFEYFKSASCHTSKKQWDGE
QSPRGERNRWKYEEFLDHPGHIHSSPCHERPNNVSTLPFLPGEQHPILLPPRNCPGDKIL
EENFRYNNYKRTMMSFKERLENTVERCAHINGNRPRQSRGFGELLSTAKQDLVLEEQSPS
SSNSLENSLVKDYIHYNGDFNAKSVNGCVPSPSDAKSISSEDDLRNPDSPSSNELIHYRP
RTFNVGDLVWGQIKGLTSWPGKLVREDDVHNSCQQSPEEGKVEPEKLKTLTEGLEAYSRV
RKRNRKSGKLNNHLEAAIHEAMSELDKMSGTVHQIPQGDRQMRPPKPKRRKISR
NT seq 4485 nt   +upstreamnt  +downstreamnt
atgaatggaggcaaagagtgtgacggaggggacaaggaaggaggtcttccagctatacaa
gttcctgtgggttggcagcgtcgtgtggatcaaaatggagtgctttatgtcagtcccagt
gggtctttgttatcttgcttggagcaggttaaaacatacctgcttactgatggaacatgc
aagtgtggcttggaatgtcctcttattcttcccaaggtatttaattttgatcctggagct
gctgtgaaacagagaaccgcagaagatgttaaggcagatgaagatgtcacaaagctatgc
atacataaaagaaaaattattgcagtggccacacttcataaaagcatggaagccccacat
ccttctctggtgctcaccagtcccggaggaggaacaaatgcaactccagtagtaccttct
cgggcagcaactccaagatcagtaagaaataagtctcatgaaggaattacaaattctgta
atgcctgaatgtaagaatcctttcaagttaatgattggatcatcaaatgccatgggaagg
ctatatgtacaagaactgcctggaagccaacaacaagaactccaccctgtctacccccga
cagagattgggcagcagtgaacatggacagaaatctccattccgtggcagccatggaggc
ctgcccagcccagcgtcatcaggttcccagatatatggagatggttcaatctctccaagg
actgacccacttggaagtcctgatgttttcacaagaagtaatcctggttttcatggagct
cccaattctagtcctattcacctgaataggactcctctttctccaccttcagtaatgcta
catggttctcctgtacagtcatcctgtgcaatggctggaaggactaatatacctctttcc
ccaaccttgactacaaagagtccagtaatgaaaaaaccaatgtgtaatttttcaactaat
atggaaataccacgagcaatgttccaccacaaaccaccccaaggcccacctccccctcct
ccaccttcttgtgctcttcagaaaaagccattaacatctgagaaagatccacttggcatt
cttgaccctattcctagtaaaccagtgaatcagaaccctgttatcattaatccaaccagt
ttccattcaaatgtccactctcaggtacctatgatgaatgtaagcatgcctcctgctgtt
gttcctttgccaagtaatctcccattgccaactgtaaaacctggtcacatgaatcatggg
agtcatgtacaaagagttcagcattcagcttcaacctccctgtccccttctccagtgaca
tcccccgtgcacatgatggggactggaattggaaggattgaggcatcgccccaaagatca
cgctcatcttccacatcatcagatcatggaaatttcatgatgccacctgtaggaccccag
gccacttctagtggtattaaggttccacccaggtcaccaaggtcaacaatagggtcccca
aggccatcaatgccatcaagcccttctaccaagtccgatggacatcatcagtacaaggat
atccctaacccattaattgctggaataagtaatgtactaaataccccaagcagtgcagct
tttcctactgcatctgccggaagtagttctgtaaagagtcagcctggtttgctgggaatg
cctttaaatcagatcttgaaccagcacaatgctgcctcctttccagcaagtagtttactc
tcagcagcagccaaagcacagctagcaaatcaaaacaaacttgctggtaacaacagtagc
agcagtagcaattctggagctgttgccggcagtggcaacactgaaggacatagcacttta
aacaccatgttccctcctactgccaacatgcttctcccaacaggtgaagggcaaagtggt
cgagcagcactaagagataagctgatgtctcagcaaaaagacgcattgcggaaaagaaaa
caaccacctacgacagtgttgagtttgctcagacagtctcaaatggatagttctgcagtt
cctaaacctggacctgacttgctaaggaagcagggtcagggttcatttcccatcagttca
atgtctcagttactacagtctatgagttgtcaaagctctcacttgagtagcaatagtacc
ccgggttgtggggcctcaaatactgctttgccttgctctgctaaccagctgcattttaca
gatcccagtatgaactctagtgttcttcagaacatacctttaagaggggaagccgtgcac
tgccacaatgcaaacactaactttgttcacagtaacagtccagtccccaaccaccatctt
gcaggtttaataaatcagattcaggctagcgggaactgtgggatgctcagtcagtcgggc
atggctttaggaaattccttacatcccaatccacctcagtcaagaatttcaacgtcctcc
actccagtgataccaaacagcattgttagcagctataatcaaacaagttctgaagcaggc
ggttcaggaccatcatcctccatagccatagcgggcaccaaccaccctgccatcacaaag
acaacatctgttcttcaagatggcgtcatagtcaccactgcagctggaaacccactgcag
agtcagctacccattgggagtgattttccttttgttggccaggagcacgcacttcatttt
ccatccaacagcacttcaaacaaccatcttccacaccccttgaaccccagcctcctcagt
tctctacctatctctttgccagtgaatcaacagcatctcctaaaccagaatctattaaat
atcctccagccttcagcaggagaaggtgatatgtcatcaataaacaatactttgagtaac
catcaactgactcatctacagtcgctgttaaacaacaatcagatgtttcctccaaatcag
caacagcagcaacttctccaggggtaccagaatctccaggcgttccaaggacagtccaca
attccttgcccagctaacaataaccccatggcttgtctgtttcagaactttcaggtgaga
atgcaggaagatgcagctctcctaaacaaaagaataagcactcagcctgggctcacagca
cttcctgagaatccaaacactacacttccaccttttcaagatacaccttgtgagttgcaa
ccgaggattgacccatctcttggtcaacaggtgaaggatggcctcgttgtgggtggccca
ggtgatgcttccgtagatgccatttacaaagcagttgtcgatgcagccagcaaaggaatg
caggttgtcatcaccactgcagtcaacagtacaactcagatcagccccattccagctctg
agtgccatgagtgccttcactgcctcaattggtgacccattaaatctctccagtgctgtc
agtgcggtcattcatggacggaacatgggaggtgttgatcatgatggtaggctgaggaat
tcaagaggggctcggctgcccaagaatctagaccatgggaaaaatgtgaacgaaggagat
gggtttgaatatttcaagtcagcaagttgccacacatccaaaaaacagtgggacggggag
caaagccccagaggggagcgaaacaggtggaagtacgaggaatttttagatcatccaggc
catatccacagtagtccttgtcatgaaaggcccaacaatgtctctacactgccatttctg
cctggggaacagcacccaatactgttaccaccaagaaactgtccaggggataaaattcta
gaggaaaatttcaggtataataactacaaaagaactatgatgagttttaaggagagacta
gagaacactgtggaaagatgtgcacacataaatgggaatagacctcgacagagtcgggga
tttggagagctgctaagcactgcaaagcaagacctggtcctagaggagcagtctccaagt
tcctcaaatagtttggaaaattctctggtcaaagactacatccattacaatggagacttt
aatgccaaaagcgttaatgggtgtgtgcctagcccttcagatgctaaaagcattagtagt
gaagatgacctaaggaacccagactccccctcttcaaatgaattgatacattatagacca
aggacgttcaatgttggcgacttggtctggggccaaatcaaaggactgacttcctggcct
ggaaaattagtaagagaagacgacgttcacaattcatgtcaacaaagccccgaggaaggg
aaggtggagcccgagaagttgaagacactaacagaaggtttggaagcctacagccgtgtc
cggaaaaggaacagaaaaagtggaaagctaaataaccatttagaagctgctattcatgag
gccatgagtgaactggacaaaatgtctgggactgtacaccaaatcccacagggtgacaga
caaatgagaccccccaaacccaagaggaggaagatctccagataa

DBGET integrated database retrieval system