KEGG   Homo sapiens (human): 57492
Entry
57492             CDS       T01001                                 
Symbol
ARID1B, 6A3-5, BAF250B, BRIGHT, CSS1, DAN15, ELD/OSA1, MRD12, OSA2, P250R, SMARCF2
Name
(RefSeq) AT-rich interaction domain 1B
  KO
K11653  AT-rich interactive domain-containing protein 1
Organism
hsa  Homo sapiens (human)
Pathway
hsa04714  Thermogenesis
hsa05225  Hepatocellular carcinoma
Disease
H00773  Autosomal dominant intellectual developmental disorder
H01403  Coffin-Siris syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09159 Environmental adaptation
   04714 Thermogenesis
    57492 (ARID1B)
 09160 Human Diseases
  09162 Cancer: specific types
   05225 Hepatocellular carcinoma
    57492 (ARID1B)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:hsa03000]
    57492 (ARID1B)
   03021 Transcription machinery [BR:hsa03021]
    57492 (ARID1B)
   03036 Chromosome and associated proteins [BR:hsa03036]
    57492 (ARID1B)
Transcription factors [BR:hsa03000]
 Eukaryotic type
  Helix-turn-helix
   ARID
    57492 (ARID1B)
Transcription machinery [BR:hsa03021]
 Eukaryotic type
  RNA polymerase II system
   Coactivators
    BAF/PBAF complex
     57492 (ARID1B)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Chromatin remodeling factors
   BAF complex
    57492 (ARID1B)
Motif
Pfam: BAF250_C ARID
Other DBs
NCBI-GeneID: 57492
NCBI-ProteinID: NP_059989
OMIM: 614556
HGNC: 18040
Ensembl: ENSG00000049618
Pharos: Q8NFD5(Tbio)
UniProt: Q8NFD5
Position
6:156776026..157210779
AA seq 2319 aa
MAARAAAAAAAAAARARARAGSGERRAPPGPRPAPGARDLEAGARGAAAAAAAPGPMLGG
GGDGGGGLNSVHHHPLLPRHELNMAHNAGAAAAAGTHSAKSGGSEAALKEGGSAAALSSS
SSSSAAAAAASSSSSSGPGSAMETGLLPNHKLKTVGEAPAAPPHQQHHHHHHAHHHHHHA
HHLHHHHALQQQLNQFQQQQQQQQQQQQQQQQQQHPISNNNSLGGAGGGAPQPGPDMEQP
QHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAGGRYEHPGLGALGTQQPPVAV
PGGGGGPAAVPEFNNYYGSAAPASGGPGGRAGPCFDQHGGQQSPGMGMMHSASAAAAGAP
GSMDPLQNSHEGYPNSQCNHYPGYSRPGAGGGGGGGGGGGGGSGGGGGGGGAGAGGAGAG
AVAAAAAAAAAAAGGGGGGGYGGSSAGYGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGS
AAGGFQRFAGQNQHPSGATPTLNQLLTSPSPMMRSYGGSYPEYSSPSAPPPPPSQPQSQA
AAAGAAAGGQQAAAGMGLGKDMGAQYAAASPAWAAAQQRSHPAMSPGTPGPTMGRSQGSP
MDPMVMKRPQLYGMGSNPHSQPQQSSPYPGGSYGPPGPQRYPIGIQGRTPGAMAGMQYPQ
QQMPPQYGQQGVSGYCQQGQQPYYSQQPQPPHLPPQAQYLPSQSQQRYQPQQDMSQEGYG
TRSQPPLAPGKPNHEDLNLIQQERPSSLPDLSGSIDDLPTGTEATLSSAVSASGSTSSQG
DQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPG
SQSESSSHPALSQSPMPQERGFMAGTQRNPQMAQYGPQQTGPSMSPHPSPGGQMHAGISS
FQQSNSSGTYGPQMSQYGPQGNYSRPPAYSGVPSASYSGPGPGMGISANNQMHGQGPSQP
CGAVPLGRMPSAGMQNRPFPGNMSSMTPSSPGMSQQGGPGMGPPMPTVNRKAQEAAAAVM
QAAANSAQSRQGSFPGMNQSGLMASSSPYSQPMNNSSSLMNTQAPPYSMAPAMVNSSAAS
VGLADMMSPGESKLPLPLKADGKEEGTPQPESKSKKSSSSTTTGEKITKVYELGNEPERK
LWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLN
VGTSSSAASSLKKQYIQYLFAFECKIERGEEPPPEVFSTGDTKKQPKLQPPSPANSGSLQ
GPQTPQSTGSNSMAEVPGDLKPPTPASTPHGQMTPMQGGRSSTISVHDPFSDVSDSSFPK
RNSMTPNAPYQQGMSMPDVMGRMPYEPNKDPFGGMRKVPGSSEPFMTQGQMPNSSMQDMY
NQSPSGAMSNLGMGQRQQFPYGASYDRRHEPYGQQYPGQGPPSGQPPYGGHQPGLYPQQP
NYKRHMDGMYGPPAKRHEGDMYNMQYSSQQQEMYNQYGGSYSGPDRRPIQGQYPYPYSRE
RMQGPGQIQTHGIPPQMMGGPLQSSSSEGPQQNMWAARNDMPYPYQNRQGPGGPTQAPPY
PGMNRTDDMMVPDQRINHESQWPSHVSQRQPYMSSSASMQPITRPPQPSYQTPPSLPNHI
SRAPSPASFQRSLENRMSPSKSPFLPSMKMQKVMPTVPTSQVTGPPPQPPPIRREITFPP
GSVEASQPVLKQRRKITSKDIVTPEAWRVMMSLKSGLLAESTWALDTINILLYDDSTVAT
FNLSQLSGFLELLVEYFRKCLIDIFGILMEYEVGDPSQKALDHNAARKDDSQSLADDSGK
EEEDAECIDDDEEDEEDEEEDSEKTESDEKSSIALTAPDAAADPKEKPKQASKFDKLPIK
IVKKNNLFVVDRSDKLGRVQEFNSGLLHWQLGGGDTTEHIQTHFESKMEIPPRRRPPPPL
SSAGRKKEQEGKGDSEEQQEKSIIATIDDVLSARPGALPEDANPGPQTESSKFPFGIQQA
KSHRNIKLLEDEPRSRDETPLCTIAHWQDSLAKRCICVSNIVRSLSFVPGNDAEMSKHPG
LVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEVLRDNTLVTLAN
ISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRLVLETLCKLS
IQDNNVDLILATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQGDALAARAIA
VQKGSIGNLISFLEDGVTMAQYQQSQHNLMHMQPPPLEPPSVDMMCRAAKALLAMARVDE
NRSEFLLHEGRLLDISISAVLNSLVASVICDVLFQIGQL
NT seq 6960 nt
atggccgcgcgggcagcagcggcggcggcggcggcggcggcgcgggcgcgggcgcgggca
ggcagcggcgaacggcgggcgccccccgggccgcggccggcgcccggagcccgggacctg
gaggcgggggcgcgcggcgcggcggcggcggcggcggcaccgggacccatgctggggggc
ggcggcgacggcggcggcggcctgaacagtgtgcaccaccaccccctgctcccccgtcac
gaactcaacatggcccataacgcgggcgccgcggccgccgccggcacccacagcgccaag
agcggcggctccgaggcggctctcaaggagggtggaagcgccgccgcgctgtcctcctcc
tcctcctcctccgcggcggcagcggcggcatcctcttcctcctcgtcgggcccgggctcg
gccatggagacggggctgctccccaaccacaaactgaaaaccgttggcgaagcccccgcc
gcgccgccccaccagcagcaccaccaccaccaccatgcccaccaccaccaccaccatgcc
caccacctccaccaccaccacgcactacagcagcagctaaaccagttccagcagcagcag
cagcagcagcaacagcagcagcagcagcagcagcaacagcaacatcccatttccaacaac
aacagcttgggcggcgcgggcggcggcgcgcctcagcccggccccgacatggagcagccg
caacatggaggcgccaaggacagtgctgcgggcggccaggccgaccccccgggcccgccg
ctgctgagcaagccgggcgacgaggacgacgcgccgcccaagatgggggagccggcgggc
ggccgctacgagcacccgggcttgggcgccctgggcacgcagcagccgccggtcgccgtg
cccgggggcggcggcggcccggcggccgtcccggagtttaataattactatggcagcgct
gcccctgcgagcggcggccccggcggccgcgctgggccttgctttgatcaacatggcgga
caacaaagccccgggatggggatgatgcactccgcctccgccgccgccgccggggccccc
ggcagcatggaccccctgcagaactcccacgaagggtaccccaacagccagtgcaaccat
tatccgggctacagccggcccggcgcgggcggcggcggcggcggcggcggcggaggagga
ggaggcagcggaggaggaggaggaggaggaggagcaggagcaggaggagcaggagcggga
gctgtggcggcggcggccgcggcggcggcggcagcagcaggaggcggcggcggcggcggc
tatgggggctcgtccgcggggtacggggtgctgagctccccccggcagcagggcggcggc
atgatgatgggccccgggggcggcggggccgcgagcctcagcaaggcggccgccggctcg
gcggcggggggcttccagcgcttcgccggccagaaccagcacccgtcgggggccaccccg
accctcaatcagctgctcacctcgcccagccccatgatgcggagctacggcggcagctac
cccgagtacagcagccccagcgcgccgccgccgccgccgtcgcagccccagtcccaggcg
gcggcggcgggggcggcggcgggcggccagcaggcggccgcgggcatgggcttgggcaag
gacatgggcgcccagtacgccgctgccagcccggcctgggcggccgcgcaacaaaggagt
cacccggcgatgagccccggcacccccggaccgaccatgggcagatcccagggcagccca
atggatccaatggtgatgaagagacctcagttgtatggcatgggcagtaaccctcattct
cagcctcagcagagcagtccgtacccaggaggttcctatggccctccaggcccacagcgg
tatccaattggcatccagggtcggactcccggggccatggccggaatgcagtaccctcag
cagcagatgccacctcagtatggacagcaaggtgtgagtggttactgccagcagggccaa
cagccatattacagccagcagccgcagcccccgcacctcccaccccaggcgcagtatctg
ccgtcccagtcccagcagaggtaccagccgcagcaggacatgtctcaggaaggctatgga
actagatctcaacctcctctggcccccggaaaacctaaccatgaagacttgaacttaata
cagcaagaaagaccatcaagtttaccagatctgtctggctccattgatgacctccccacg
ggaacggaagcaactttgagctcagcagtcagtgcatccgggtccacgagcagccaaggg
gatcagagcaacccggcgcagtcgcctttctccccacatgcgtcccctcatctctccagc
atcccggggggcccatctccctctcctgttggctctcctgtaggaagcaaccagtctcga
tctggcccaatctctcctgcaagtatcccaggtagtcagatgcctccgcagccacccggg
agccagtcagaatccagttcccatcccgccttgagccagtcaccaatgccacaggaaaga
ggttttatggcaggcacacaaagaaaccctcagatggctcagtatggacctcaacagaca
ggaccatccatgtcgcctcatccttctcctgggggccagatgcatgctggaatcagtagc
tttcagcagagtaactcaagtgggacttacggtccacagatgagccagtatggaccacaa
ggtaactactccagacccccagcgtatagtggggtgcccagtgcaagctacagcggccca
gggcccggtatgggtatcagtgccaacaaccagatgcatggacaagggccaagccagcca
tgtggtgctgtgcccctgggacgaatgccatcagctgggatgcagaacagaccatttcct
ggaaatatgagcagcatgacccccagttctcctggcatgtctcagcagggagggccagga
atggggccgccaatgccaactgtgaaccgtaaggcacaggaggcagccgcagcagtgatg
caggctgctgcgaactcagcacaaagcaggcaaggcagtttccccggcatgaaccagagt
ggacttatggcttccagctctccctacagccagcccatgaacaacagctctagcctgatg
aacacgcaggcgccgccctacagcatggcgcccgccatggtgaacagctcggcagcatct
gtgggtcttgcagatatgatgtctcctggtgaatccaaactgcccctgcctctcaaagca
gacggcaaagaagaaggcactccacagcccgagagcaagtcaaagaagtccagctcctcc
accactactggggagaagatcacgaaggtgtacgagctggggaatgagccagagagaaag
ctctgggtcgaccgatacctcaccttcatggaagagagaggctctcctgtctcaagtctg
cctgccgtgggcaagaagcccctggacctgttccgactctacgtctgcgtcaaagagatc
gggggtttggcccaggttaataaaaacaagaagtggcgtgagctggcaaccaacctaaac
gttggcacctcaagcagtgcagcgagctccctgaaaaagcagtatattcagtacctgttt
gcctttgagtgcaagatcgaacgtggggaggagcccccgccggaagtcttcagcaccggg
gacaccaaaaagcagcccaagctccagccgccatctcctgctaactcgggatccttgcaa
ggcccacagaccccccagtcaactggcagcaattccatggcagaggttccaggtgacctg
aagccacctaccccagcctccacccctcacggccagatgactccaatgcaaggtggaaga
agcagtacaatcagtgtgcacgacccattctcagatgtgagtgattcatccttcccgaaa
cggaactccatgactccaaacgccccctaccagcagggcatgagcatgcccgatgtgatg
ggcaggatgccctatgagcccaacaaggacccctttgggggaatgagaaaagtgcctgga
agcagcgagccctttatgacgcaaggacagatgcccaacagcagcatgcaggacatgtac
aaccaaagtccctccggagcaatgtctaacctgggcatggggcagcgccagcagtttccc
tatggagccagttacgaccgaaggcatgaaccttatgggcagcagtatccaggccaaggc
cctccctcgggacagccgccgtatggagggcaccagcccggcctgtacccacagcagccg
aattacaaacgccatatggacggcatgtacgggcccccagccaagcgccacgagggcgac
atgtacaacatgcagtacagcagccagcagcaggagatgtacaaccagtatggaggctcc
tactcgggcccggaccgcaggcccatccagggccagtacccgtatccctacagcagggag
aggatgcagggcccggggcagatccagacacacggaatcccgcctcagatgatgggcggc
ccgctgcagtcgtcctccagtgaggggcctcagcagaatatgtgggcagcacgcaatgat
atgccttatccctaccagaacaggcagggccctggcggccctacacaggcgcccccttac
ccaggcatgaaccgcacagacgatatgatggtacccgatcagaggataaatcatgagagc
cagtggccttctcacgtcagccagcgtcagccttatatgtcgtcctcagcctccatgcag
cccatcacacgcccaccacagccgtcctaccagacgccaccgtcactgccaaatcacatc
tccagggcgcccagcccagcgtccttccagcgctccctggagaaccgcatgtctccaagc
aagtctccttttctgccgtctatgaagatgcagaaggtcatgcccacggtccccacatcc
caggtcaccgggccaccaccccaaccacccccaatcagaagggagatcacctttcctcct
ggctcagtagaagcatcacaaccagtcttgaaacaaaggcgaaagattacctccaaagat
atcgttactcctgaggcgtggcgtgtgatgatgtcccttaaatcaggtcttttggctgag
agtacgtgggctttggacactattaatattcttctgtatgatgacagcactgttgctact
ttcaatctctcccagttgtctggatttctcgaacttttagtcgagtactttagaaaatgc
ctgattgacatttttggaattcttatggaatatgaagtgggagaccccagccaaaaagca
cttgatcacaacgcagcaaggaaggatgacagccagtccttggcagacgattctgggaaa
gaggaggaagatgctgaatgtattgatgacgacgaggaagacgaggaggatgaggaggaa
gacagcgagaagacagaaagcgatgaaaagagcagcatcgctctgactgccccggacgcc
gctgcagacccaaaggagaagcccaagcaagccagtaagttcgacaagctgccaataaag
atagtcaaaaagaacaacctgtttgttgttgaccgatctgacaagttggggcgtgtgcag
gagttcaatagtggccttctgcactggcagctcggcgggggtgacaccaccgagcacatt
cagactcactttgagagcaagatggaaattcctcctcgcaggcgcccacctcccccctta
agctccgcaggtagaaagaaagagcaagaaggcaaaggcgactctgaagagcagcaagag
aaaagcatcatagcaaccatcgatgacgtcctctctgctcggccaggggcattgcctgaa
gacgcaaaccctgggccccagaccgaaagcagtaagtttccctttggtatccagcaagcc
aaaagtcaccggaacatcaagctgctggaggacgagcccaggagccgagacgagactcct
ctgtgtaccatcgcgcactggcaggactcgctggctaagcgatgcatctgtgtgtccaat
attgtccgtagcttgtcattcgtgcctggcaatgatgccgaaatgtccaaacatccaggc
ctggtgctgatcctggggaagctgattcttcttcaccacgagcatccagagagaaagcga
gcaccgcagacctatgagaaagaggaggatgaggacaagggggtggcctgcagcaaagat
gagtggtggtgggactgcctcgaggtcttgagggataacacgttggtcacgttggccaac
atttccgggcagctagacttgtctgcttacacggaaagcatctgcttgccaattttggat
ggcttgctgcactggatggtgtgcccgtctgcagaggcacaagatccctttccaactgtg
ggacccaactcggtcctgtcgcctcagagacttgtgctggagaccctctgtaaactcagt
atccaggacaataatgtggacctgatcttggccactcctccatttagtcgtcaggagaaa
ttctatgctacattagttaggtacgttggggatcgcaaaaacccagtctgtcgagaaatg
tccatggcgcttttatcgaaccttgcccaaggggacgcactagcagcaagggccatagct
gtgcagaaaggaagcattggaaacttgataagcttcctagaggatggggtcacgatggcc
cagtaccagcagagccagcacaacctcatgcacatgcagcccccgcccctggaaccacct
agcgtagacatgatgtgcagggcggccaaggctttgctagccatggccagagtggacgaa
aaccgctcggaattccttttgcacgagggccggttgctggatatctcgatatcagctgtc
ctgaactctctggttgcatctgtcatctgtgatgtactgtttcagattgggcagttatga

KEGG   Homo sapiens (human): 8289
Entry
8289              CDS       T01001                                 
Symbol
ARID1A, B120, BAF250, BAF250a, BM029, C1orf4, CSS2, ELD, MRD14, OSA1, P270, SMARCF1, hELD, hOSA1
Name
(RefSeq) AT-rich interaction domain 1A
  KO
K11653  AT-rich interactive domain-containing protein 1
Organism
hsa  Homo sapiens (human)
Pathway
hsa04714  Thermogenesis
hsa05225  Hepatocellular carcinoma
Disease
H00048  Hepatocellular carcinoma
H00773  Autosomal dominant intellectual developmental disorder
H01403  Coffin-Siris syndrome
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09159 Environmental adaptation
   04714 Thermogenesis
    8289 (ARID1A)
 09160 Human Diseases
  09162 Cancer: specific types
   05225 Hepatocellular carcinoma
    8289 (ARID1A)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:hsa03000]
    8289 (ARID1A)
   03021 Transcription machinery [BR:hsa03021]
    8289 (ARID1A)
   03036 Chromosome and associated proteins [BR:hsa03036]
    8289 (ARID1A)
Transcription factors [BR:hsa03000]
 Eukaryotic type
  Helix-turn-helix
   ARID
    8289 (ARID1A)
Transcription machinery [BR:hsa03021]
 Eukaryotic type
  RNA polymerase II system
   Coactivators
    BAF/PBAF complex
     8289 (ARID1A)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Chromatin remodeling factors
   BAF complex
    8289 (ARID1A)
Motif
Pfam: BAF250_C ARID
Other DBs
NCBI-GeneID: 8289
NCBI-ProteinID: NP_006006
OMIM: 603024
HGNC: 11110
Ensembl: ENSG00000117713
Pharos: O14497(Tbio)
UniProt: O14497
Position
1:26696015..26782104
AA seq 2285 aa
MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAAGQESEG
PAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEP
PGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQHGGQQSPG
LAALQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPG
SGAAAAAGSKPPPSSSASASSSSSSFAQQRFGAMGGGGPSAAGGGTPQPTATPTLNQLLT
SPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGAQQRSHHA
PMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYGGTNPYSQQQGPPSGPQQGHGYPGQ
PYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQ
QQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQST
TQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPP
QELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSPG
VSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPAAVP
GNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALSPRQPSGGQ
IHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGINPMGAGGQ
MHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQETAVA
MHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYSMGGTMANNSAG
MAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEKITKLYELGGEPER
KMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNL
NVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADSKKSQPKIQPPSPAGSGS
MQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQ
KRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSDPFMSSGQGPNGGMGDPY
SRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYS
PSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFPSQQTTMYQQQQQNYKRPMDGTYGPP
AKRHEGEMYSVPYSTGQGQPQQQQLPPAQPQPASQQQAAQPSPQQDVYNQYGNAYPATAT
AATERRPAGGPQNQFPFQFGRDRVSAPPGTNAQQNMPPQMMGGPIQASAEVAQQGTMWQG
RNDMTYNYANRQSTGSAPQGPAYHGVNRTDEMLHTDQRANHEGSWPSHGTRQPPYGPSAP
VPPMTRPPPSNYQPPPSMQNHIPQVSSPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPV
PASHIAPAPVQPPMIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGL
LAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPG
QRTLLDPGRFSKVSSPAPMEGGEEEEELLGPKLEEEEEEEVVENDEEIAFSGKDKPASEN
SEEKLISKFDKLPVKIVQKNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFE
SKTELLPSRPHAPCPPAPRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSS
TLTEDGAKSSEAIKESSKFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKR
CVCVSNTIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQG
VSCNKVEWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQ
DPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKN
PVCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQN
PPFEPTSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLF
LIGQS
NT seq 6858 nt
atggccgcgcaggtcgcccccgccgccgccagcagcctgggcaacccgccgccgccgccg
ccctcggagctgaagaaagccgagcagcagcagcgggaggaggcggggggcgaggcggcg
gcggcggcagcggccgagcgcggggaaatgaaggcagccgccgggcaggaaagcgagggc
cccgccgtggggccgccgcagccgctgggaaaggagctgcaggacggggccgagagcaat
gggggtggcggcggcggcggagccggcagcggcggcgggcccggcgcggagccggacctg
aagaactcgaacgggaacgcgggccctaggcccgccctgaacaataacctcacggagccg
cccggcggcggcggtggcggcagcagcgatggggtgggggcgcctcctcactcagccgcg
gccgccttgccgcccccagcctacggcttcgggcaaccctacggccggagcccgtctgcc
gtcgccgccgccgcggccgccgtcttccaccaacaacatggcggacaacaaagccctggc
ctggcagcgctgcagagcggcggcggcgggggcctggagccctacgcggggccccagcag
aactctcacgaccacggcttccccaaccaccagtacaactcctactaccccaaccgcagc
gcctaccccccgcccgccccggcctacgcgctgagctccccgagaggtggcactccgggc
tccggcgcggcggcggctgccggctccaagccgcctccctcctccagcgcctccgcctcc
tcgtcgtcttcgtccttcgctcagcagcgcttcggggccatggggggaggcggcccctcc
gcggccggcgggggaactccccagcccaccgccacccccaccctcaaccaactgctcacg
tcgcccagctcggcccggggctaccagggctaccccgggggcgactacagtggcgggccc
caggacgggggcgccggcaagggcccggcggacatggcctcgcagtgttggggggctgcg
gcggcggcagctgcggcggcggccgcctcgggaggggcccaacaaaggagccaccacgcg
cccatgagccccgggagcagcggcggcggggggcagccgctcgcccggacccctcagcca
tccagtccaatggatcagatgggcaagatgagacctcagccatatggcgggactaaccca
tactcgcagcaacagggacctccgtcaggaccgcagcaaggacatgggtacccagggcag
ccatacgggtcccagaccccgcagcggtacccgatgaccatgcagggccgggcgcagagt
gccatgggcggcctctcttatacacagcagattcctccttatggacaacaaggccccagc
gggtatggtcaacagggccagactccatattacaaccagcaaagtcctcaccctcagcag
cagcagccaccctactcccagcaaccaccgtcccagacccctcatgcccaaccttcgtat
cagcagcagccacagtctcaaccaccacagctccagtcctctcagcctccatactcccag
cagccatcccagcctccacatcagcagtccccggctccatacccctcccagcagtcgacg
acacagcagcacccccagagccagcccccctactcacagccacaggctcagtctccttac
cagcagcagcaacctcagcagccagcaccctcgacgctctcccagcaggctgcgtatcct
cagccccagtctcagcagtcccagcaaactgcctattcccagcagcgcttccctccaccg
caggagctatctcaagattcatttgggtctcaggcatcctcagccccctcaatgacctcc
agtaagggagggcaagaagatatgaacctgagccttcagtcaagaccctccagcttgcct
gatctatctggttcaatagatgacctccccatggggacagaaggagctctgagtcctgga
gtgagcacatcagggatttccagcagccaaggagagcagagtaatccagctcagtctcct
ttctctcctcatacctcccctcacctgcctggcatccgaggcccttccccgtcccctgtt
ggctctcccgccagtgttgctcagtctcgctcaggaccactctcgcctgctgcagtgcca
ggcaaccagatgccacctcggccacccagtggccagtcggacagcatcatgcatccttcc
atgaaccaatcaagcattgcccaagatcgaggttatatgcagaggaacccccagatgccc
cagtacagttccccccagcccggctcagccttatctccgcgtcagccttccggaggacag
atacacacaggcatgggctcctaccagcagaactccatggggagctatggtccccagggg
ggtcagtatggcccacaaggtggctaccccaggcagccaaactataatgccttgcccaat
gccaactaccccagtgcaggcatggctggaggcataaaccccatgggtgccggaggtcaa
atgcatggacagcctggcatcccaccttatggcacactccctccagggaggatgagtcac
gcctccatgggcaaccggccttatggccctaacatggccaatatgccacctcaggttggg
tcagggatgtgtcccccaccagggggcatgaaccggaaaacccaagaaactgctgtcgcc
atgcatgttgctgccaactctatccaaaacaggccgccaggctaccccaatatgaatcaa
gggggcatgatgggaactggacctccttatggacaagggattaatagtatggctggcatg
atcaaccctcagggacccccatattccatgggtggaaccatggccaacaattctgcaggg
atggcagccagcccagagatgatgggccttggggatgtaaagttaactccagccaccaaa
atgaacaacaaggcagatgggacacccaagacagaatccaaatccaagaaatccagttct
tctactacaaccaatgagaagatcaccaagttgtatgagctgggtggtgagcctgagagg
aagatgtgggtggaccgttatctggccttcactgaggagaaggccatgggcatgacaaat
ctgcctgctgtgggtaggaaacctctggacctctatcgcctctatgtgtctgtgaaggag
attggtggattgactcaggtcaacaagaacaaaaaatggcgggaacttgcaaccaacctc
aatgtgggcacatcaagcagtgctgccagctccttgaaaaagcagtatatccagtgtctc
tatgcctttgaatgcaagattgaacggggagaagaccctcccccagacatctttgcagct
gctgattccaagaagtcccagcccaagatccagcctccctctcctgcgggatcaggatct
atgcaggggccccagactccccagtcaaccagcagttccatggcagaaggaggagactta
aagccaccaactccagcatccacaccacacagtcagatccccccattgccaggcatgagc
aggagcaattcagttgggatccaggatgcctttaatgatggaagtgactccacattccag
aagcggaattccatgactccaaaccctgggtatcagcccagtatgaatacctctgacatg
atggggcgcatgtcctatgagccaaataaggatccttatggcagcatgaggaaagctcca
gggagtgatcccttcatgtcctcagggcagggccccaacggcgggatgggtgacccctac
agtcgtgctgccggccctgggctaggaaatgtggcgatgggaccacgacagcactatccc
tatggaggtccttatgacagagtgaggacggagcctggaatagggcctgagggaaacatg
agcactggggccccacagccgaatctcatgccttccaacccagactcggggatgtattct
cctagccgctaccccccgcagcagcagcagcagcagcagcaacgacatgattcctatggc
aatcagttctccacccaaggcaccccttctggcagccccttccccagccagcagactaca
atgtatcaacagcaacagcagaattacaagcggccaatggatggcacatatggccctcct
gccaagcggcacgaaggggagatgtacagcgtgccatacagcactgggcaggggcagcct
cagcagcagcagttgcccccagcccagccccagcctgccagccagcaacaagctgcccag
ccttcccctcagcaagatgtatacaaccagtatggcaatgcctatcctgccactgccaca
gctgctactgagcgccgaccagcaggcggcccccagaaccaatttccattccagtttggc
cgagaccgtgtctctgcaccccctggcaccaatgcccagcaaaacatgccaccacaaatg
atgggcggccccatacaggcatcagctgaggttgctcagcaaggcaccatgtggcagggg
cgtaatgacatgacctataattatgccaacaggcagagcacgggctctgccccccagggc
cccgcctatcatggcgtgaaccgaacagatgaaatgctgcacacagatcagagggccaac
cacgaaggctcgtggccttcccatggcacacgccagcccccatatggtccctctgcccct
gtgccccccatgacaaggccccctccatctaactaccagcccccaccaagcatgcagaat
cacattcctcaggtatccagccctgctcccctgccccggccaatggagaaccgcacctct
cctagcaagtctccattcctgcactctgggatgaaaatgcagaaggcaggtcccccagta
cctgcctcgcacatagcacctgcccctgtgcagccccccatgattcggcgggatatcacc
ttcccacctggctctgttgaagccacacagcctgtgttgaagcagaggaggcggctcaca
atgaaagacattggaaccccggaggcatggcgggtaatgatgtccctcaagtctggtctc
ctggcagagagcacatgggcattagataccatcaacatcctgctgtatgatgacaacagc
atcatgaccttcaacctcagtcagctcccagggttgctagagctccttgtagaatatttc
cgacgatgcctgattgagatctttggcattttaaaggagtatgaggtgggtgacccagga
cagagaacgctactggatcctgggaggttcagcaaggtgtctagtccagctcccatggag
ggtggggaagaagaagaagaacttctaggtcctaaactagaagaggaagaagaagaggaa
gtagttgaaaatgatgaggagatagccttttcaggcaaggacaagccagcttcagagaat
agtgaggagaagctgatcagtaagtttgacaagcttccagtaaagatcgtacagaagaat
gatccatttgtggtggactgctcagataagcttgggcgtgtgcaggagtttgacagtggc
ctgctgcactggcggattggtgggggggacaccactgagcatatccagacccacttcgag
agcaagacagagctgctgccttcccggcctcacgcaccctgcccaccagcccctcggaag
catgtgacaacagcagagggtacaccagggacaacagaccaggaggggcccccacctgat
ggacctccagaaaaacggatcacagccactatggatgacatgttgtctactcggtctagc
accttgaccgaggatggagctaagagttcagaggccatcaaggagagcagcaagtttcca
tttggcattagcccagcacagagccaccggaacatcaagatcctagaggacgaaccccac
agtaaggatgagaccccactgtgtacccttctggactggcaggattctcttgccaagcgc
tgcgtctgtgtgtccaataccattcgaagcctgtcatttgtgccaggcaatgactttgag
atgtccaaacacccagggctgctgctcatcctgggcaagctgatcctgctgcaccacaag
cacccagaacggaagcaggcaccactaacttatgaaaaggaggaggaacaggaccaaggg
gtgagctgcaacaaagtggagtggtggtgggactgcttggagatgctccgggaaaacacc
ttggttacactcgccaacatctcggggcagttggacctatctccataccccgagagcatt
tgcctgcctgtcctggacggactcctacactgggcagtttgcccttcagctgaagcccag
gaccccttttccaccctgggccccaatgccgtcctttccccgcagagactggtcttggaa
accctcagcaaactcagcatccaggacaacaatgtggacctgattctggccacacccccc
ttcagccgcctggagaagttgtatagcactatggtgcgcttcctcagtgaccgaaagaac
ccggtgtgccgggagatggctgtggtactgctggccaacctggctcagggggacagcctg
gcagctcgtgccattgcagtgcagaagggcagtatcggcaacctcctgggcttcctagag
gacagccttgccgccacacagttccagcagagccaggccagcctcctccacatgcagaac
ccaccctttgagccaactagtgtggacatgatgcggcgggctgcccgcgcgctgcttgcc
ttggccaaggtggacgagaaccactcagagtttactctgtacgaatcacggctgttggac
atctcggtatcaccgttgatgaactcattggtttcacaagtcatttgtgatgtactgttt
ttgattggccagtcatga
