Falco peregrinus (peregrine falcon): 101914395
Entry
101914395 CDS
T02856
Gene name
GLI1
Definition
(RefSeq) GLI family zinc finger 1
KO
K16797
zinc finger protein GLI1
Organism
fpg
Falco peregrinus (peregrine falcon)
Pathway
fpg04340
Hedgehog signaling pathway
Brite
KEGG Orthology (KO) [BR:fpg00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04340 Hedgehog signaling pathway
    101914395 (GLI1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:fpg03000]
    101914395 (GLI1)
Transcription factors [BR:fpg03000]
 Eukaryotic type
  Zinc finger
   Cys2His2 GLI-like
    101914395 (GLI1)
Motif
Pfam: zf-C2H2 zf-H2C2_2 FOXP-CC zf-C2H2_4
Other DBs
NCBI-GeneID: 101914395
NCBI-ProteinID: XP_005236853
Position
Un
AA seq
1304 aa
MFNPVSPPATGYAEHCCLRPPHGPAPGAPGPQGLDFPLCHQSNLMSGHRGYGLVPGTEHP
GSGDGSRFSTPRGAGKLGKKRALSISPLSDSSIDLQTVIRTSPNSLVAFINSRCASASGS
YGHLSISTISPSLGFQSPSGQQKGQGHLYSHTPPPPPPCSSHEHLSTRPGLRHHAPACGT
LKHCQQLKLEWSLSSPLTVKYPEKRSEGDISSPASTGTQDPLLGMLDVREDLEKEDGKPE
SETVYETNCYWDGCAKEFDTQEQLVHHINNEHIHGEKKEFVCHWAACSREQRPFKAQYML
VVHMRRHTGEKPHKCTFEGCNKAYSRLENLKTHLRSHTGEKPYVCEHEGCNKAFSNASDR
AKHQNRTHSNEKPYVCKIPGCTKRYTDPSSLRKHVKTVHGPDAHVTKKHRGDVVPGRALP
TPSGPLDMKQEKDTNGPMDARKDDGKLLVPDLALKPQPSPGGQSSCSSDHSPLGSTTNND
SGVEMAGNAGGSYEDLSTLEDVVPGEPMGTSGLMALHKLENLRIDKLKQMRKPSATKGLN
LPAIPGAGLPGEVPGIPLPPPAASHRRIAELSAAETGMPLNERRSSATSTISSAYTVSRR
SSLVSPYLAGPCLGGEAGAVPGAASLADGYDPISPDESRRSSDASHSGGLPGVGSLTPAQ
RYCLKAKYAAATGGPPPTPLPSVERVGMGGHGSLPGDYLGPVKPRFLANGLLRRHSSNDY
TGYAGSIPPHLVPRNGVRRASDPARTAANPHAVPKVHRFKSMGNVNVPGAGRTALQPLGG
SDANLQHHVFSSRPPSISENVFLESVSIEGPGAGVESGLLEMEQYLNYPEESFPCQGTGV
ELQCEGLYSSAHQTTAGMQLNTGGHGDMEEGLLQSEFSLPQCQMNQHFTSMHAGNGTMPA
PWDEPPQSNLEMSSGQPSVSASAAAMSGPHCHHQSTDYPLPSSCGHQPKLVSSCQDGGFS
GGHWLNRLQIKSEQHYPVPAPALAPCQNAKLAGPVQCPASFGQAMNVGSGGYQSEEQAPV
SYMGMLSPGTRRAQTPTMQTKEVMVRSYVQAQQALMWGDQLTSKGGEAGMGLGSEPGQCQ
AVQAPLYLSPKYSGYQTKPDHPQGLAETQHLLNAPCFNPEMVPHPPGGPKPPGHQNSLNY
VGNLAQPSHSYEGVEASSRRVLRLPPARPTPEGPSNALLCYPGQSMHLQVGKGGHKLLGQ
MAASCGGPGHYGGSLEGLKGSSYCYLDSGEQVANSLDSLDLENTHLDFAAIVEDPETPAL
LPGPASPAGGLLLPASGGANMAVGDMSSMLSTLAGESHFLNSLS
NT seq
3915 nt
atgttcaaccccgtcagccccccggccaccggctatgctgagcactgctgcctgcgcccc
ccccacggaccagccccgggtgccccagggccacaaggacttgactttcccttatgccat
cagtcaaacctcatgagcggtcaccgtggttatgggttggtgccaggaactgagcaccca
ggcagtggtgacggctcacggttctcaacgccccgtggtgcaggcaaactgggcaaaaag
cgggcactgtccatctcacccctgtcagactccagcattgacctgcagacggtaatccgc
acctcacccaactccctcgttgccttcatcaactcccgctgtgcctctgccagtggctcc
tatggccacctctccatcagcaccatcagtccatcactgggattccagagtccatcaggt
cagcagaagggccaaggccacctgtacagccacacaccccccccaccaccaccatgtagc
tcccatgagcacctgtccacccgcccagggctccgccaccatgccccagcttgtgggacc
ctcaaacactgccagcagctgaagttggagtggagcctgagcagccctctgactgtcaaa
tacccagagaagaggtccgagggcgatatctccagcccagcttccacaggcacccaggac
cccttgctggggatgctcgatgttcgggaagatctggagaaggaggacgggaaacctgag
tctgagaccgtgtatgaaacgaattgctactgggacggctgtgctaaggagtttgatacg
caggagcagctggtgcatcacatcaacaatgagcatatccatggggagaagaaggagttt
gtgtgccactgggcagcatgttcccgagagcagaggcccttcaaggctcagtacatgttg
gtggtgcacatgcgacgtcacacaggcgagaaaccccacaaatgcacgttcgaaggctgt
aacaaagcctactcacgcctggagaacctcaagacgcatctgcgctcacacacgggtgaa
aaaccctacgtgtgtgaacacgagggttgcaacaaggccttctccaatgcttctgaccgg
gccaagcatcaaaaccgcactcactccaatgagaagccctatgtgtgcaagattccaggc
tgcaccaagcgctacacagaccccagttccctgcgcaaacacgtgaagacggtgcacggc
cctgatgcccatgtcaccaagaagcaccggggggatgtggtgccaggccgtgcactgccc
acccccagtggccccctagacatgaagcaggagaaagacacaaatggccccatggatgcc
cggaaggatgacggcaagctcttggtgcccgacttggccttgaagccgcagcccagccca
ggtgggcagtcgtcctgcagcagtgaccactccccacttggcagcaccaccaacaatgac
agtggagtggagatggcgggtaacgcaggtgggagctacgaggatctgtccacactggag
gatgtggtgcccggtgagcctatgggcacctcgggtctcatggccctccacaagctagaa
aatcttcgcattgataaactgaagcagatgaggaagccatcggctaccaagggcctgaat
ttgccagccatccctggagccggcctgcctggggaggtgcctgggatccccctgccaccg
ccagctgcctcgcaccggcgcattgcggagctgtcagcagcagagacgggcatgccactg
aacgagcgccgaagcagtgccaccagtactataagctcagcctacactgtgagccggcgc
tcatccctggtgtccccgtacctggctgggccatgcctgggtggtgaggcaggggcagtg
cctggcgcggcaagcctggcagacggctatgaccccatctctccggacgagtcgcgacgc
tccagtgatgccagccactccggagggctgccaggtgtgggcagccttaccccagcccag
cgctactgtctcaaggccaaatacgctgcagccacaggtggcccacccccaaccccactg
cccagcgtggagcgggtgggcatgggtggccacggcagcttacctggggactaccttggg
ccagtcaaaccccgcttccttgcaaatggtttgctgcgaaggcacagttccaatgactac
acgggctatgcaggcagcattccccctcacctcgtgccccggaatggtgtgcggcgggcc
agcgatcctgcccggactgcagccaaccctcatgctgtgcccaaggtgcatcgttttaag
agcatgggtaacgtgaatgtgccaggagcaggcagaactgctctgcagcccttgggtggt
tctgatgctaaccttcagcaccatgtcttctcctcgcgcccacccagcatcagtgaaaat
gtcttcctggagagcgtgagcatagaaggccctggtgcaggcgtggagtctggcttgcta
gagatggaacagtacttgaactaccctgaggaaagcttcccttgccaggggaccggtgtg
gagctgcaatgtgaaggcctctatagcagtgctcaccagaccacagcgggtatgcagctg
aacacaggagggcatggggacatggaggaggggttgcttcagtccgagttctccctccct
cagtgccagatgaaccagcacttcacaagtatgcatgctggcaacgggactatgccagct
ccttgggatgaacctccccaaagcaatctggagatgagttctgggcagccaagtgtcagt
gcttctgctgctgccatgtcggggccacactgtcaccatcaaagtactgactacccacta
cccagttcttgtgggcaccaacccaaacttgtgagctcgtgccaggacggtgggttttct
ggggggcactggcttaaccgactacagatcaaatctgagcagcattacccagtacctgcc
ccagcacttgctccctgccagaatgccaagcttgctgggcccgtgcagtgtcctgcatca
ttcggccaagccatgaatgtagggtctggtggctaccagtctgaggaacaggcacctgtt
agttacatgggcatgttgagcccgggcactagaagagcccagacccccaccatgcagacc
aaagaagtgatggtccgtagctatgtgcaggcccagcaggctttgatgtggggagaccaa
ctaacctccaagggaggggaggctggcatggggctgggcagtgaacctgggcagtgccaa
gctgtgcaggcaccactgtatctcagccccaaatactctggttaccaaaccaaaccagat
cacccgcagggtctggcagagacccagcacctcctaaacgccccatgcttcaacccagag
atggtgcctcacccacctggtggccctaaaccacctggtcaccaaaacagtctgaactat
gtgggcaacctggctcagccaagtcattcctacgagggagtggaagccagctcccggcgc
gtgcttcgcttgccccctgcccgtcccacacctgaggggcccagtaatgccctgctgtgt
tacccaggccagagcatgcatttgcaggtgggcaaaggtgggcataagctgctgggccaa
atggcggcaagctgtgggggtcccgggcattatggtgggagcttggaaggactcaaaggc
agctcatattgctacctggattcaggggagcaggtggccaacagcctggactccttggac
ttggagaatacgcaccttgactttgctgccattgtggaagatccagagacacctgcgctg
ctgcctgggcccgcaagccctgctggtggcctcctgcttcctgcatctggtggtgccaac
atggctgtgggtgacatgagctccatgttgagcactctggcaggggagagccatttcctc
aactccctgtcataa
Falco peregrinus (peregrine falcon): 101914564
Entry
101914564 CDS
T02856
Gene name
GLI3
Definition
(RefSeq) GLI family zinc finger 3
KO
K06230
zinc finger protein GLI3
Organism
fpg
Falco peregrinus (peregrine falcon)
Pathway
fpg04340
Hedgehog signaling pathway
Brite
KEGG Orthology (KO) [BR:fpg00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04340 Hedgehog signaling pathway
    101914564 (GLI3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:fpg03000]
    101914564 (GLI3)
Transcription factors [BR:fpg03000]
 Eukaryotic type
  Zinc finger
   Cys2His2 GLI-like
    101914564 (GLI3)
Motif
Pfam: zf-C2H2 zf-H2C2_2 zf-C2H2_4 FOXP-CC
Other DBs
NCBI-GeneID: 101914564
NCBI-ProteinID: XP_005233009
Position
Un
AA seq
1573 aa
MEAQSHSSTTTEKKKVENSIVKCSSRTDVSEKAVASSTTSNEDESPGQTYHRERRNAITM
QPQGGQGLGKISEEPSTSSEERASLIKKEIHGSISHLPEPSVPYRGTLFTMDPRNGYMDP
HYHPPHLFPAFHPPVPIDARHHEGRYHYEPSPIPPLHVPSALSSSPTYSELPFLRISPHR
NPAATSESPFSTPHPYINPYMDYIRSLHSSPSLSMISAARGLSPTDAPHAGVSPAEYYHQ
MALLAGQRSPYADIIPSAATAGAGALHMEYLHAMDSARFPSPRLSARPSRKRTLSISPLS
DHSFDLQTMIRTSPNSLVTILNNSRSSSSASGSYGHLSASAISPALSFTYPPTPVSLQQM
HQQIISRQQTLGSAFGHSPPLIHPAPTFPTQRPIPGIPSVLNPVQVSSGPSESTQQNKPT
SESAVSSTGDPMHNKRSKIKPDEDLPSPGAGSVQEQPEGVTLVKEEGDKDESKQEPEVVY
ETNCHWEGCSREFDTQEQLVHHINNDHIHGEKKEFVCRWLDCSREQKPFKAQYMLVVHMR
RHTGEKPHKCTFEGCTKAYSRLENLKTHLRSHTGEKPYVCEHEGCNKAFSNASDRAKHQN
RTHSNEKPYVCKIPGCTKRYTDPSSLRKHVKTVHGPEAHVTKKQRGDIHPRPPPPRDPGS
HSQTRSPGHQTQGAIGEQKDLSNTTSKREECLQVKAVKSEKPMTSQPSPGGQSTCSSEQS
PISNYSNNGIELTLTSGGSVGDLSVIDETPIMDSTISTATTALGLQARRNMTGTKWMEQV
KLERLKQVNGMLPRLNPVPPSKAPTLPPLIGNGAQSNSSCSVGGSMTILPNRSELSSTDI
TVLNMLNRRDSNTSTISSAYLSSRRSSGISPCFSSRRSSDASQAEGRPQNVSVADSYDPI
STDASRRSSEASQCDDLPSLLSLTPAQQYRLKAKYAAATGGPPPTPLPNMERMSLKTRMA
LLGDCRESGVSPLPPVNAPRRCSDGGANGYSRRHFLSHDALGNGMRRASDPVRMACDNLS
VPRVQRFNSLNSFNPPALPPSMEKRNLVLQNYTRSEGGVFRGFSSPCPPSISENVALEAA
TMEAGGSLNDEDLLPDDVVQYLNSQNQGIYDHLLNNVLDSNKMHHGSVLGNDNPSNFDQA
PPPSSQQAGSETNKSDLPIQWNEVSSGSSDLSPSKLKCGQRSTVQQTRAFRLYNNMMVQQ
KNLERSNVDQQNGYLVENNNSYGLQQNAVLGSGAANSFSVQPNKPYSESISRQAMMSGAM
DNSCGMAVQGQKLRSSNVPVSGNQQNFGHPMASSDQAASMANGMQNSSMMEQEYLQNQPV
GDDVHYQGVNQSGQMMLGQVSPTSQSSLYQGPQSCPPVSHTVGSQPSGLLVAKSYQPCTN
YSSNRRQNMLRNNLAQQQGHVSDGNQTYRVNTIKMEIQGQSQQFCSNAQNYSGQLYDQTM
GFSYQAMKTGSFFGSEANCLLQGTATANSSELLSPGANQVSSTVDSIDSNSLEGVQIDFD
AIIDDGDHVSLISGALSPSIIQNLSRNSSRLTTPRASLTFPAMPVSTTNMAIGDMSSLLT
SLAEESKFLAVMQ
NT seq
4722 nt
atggaggcgcagtcccatagctctaccacgacagagaagaaaaaagtggagaattccatc
gtgaaatgctccagtcgaacagatgtcagtgagaaagctgttgcctccagcacaacttcc
aatgaggatgaaagtcctgggcagacctatcacagagagagaaggaacgcgatcacaatg
cagccgcagggtgggcaaggcctcggcaagatcagtgaagagccttccacgtcgagcgag
gaaagggcttcattaatcaagaaggagatccatggatctatatcgcaccttcccgaacct
tctgtaccttaccgcgggacgctctttaccatggacccccgaaacggttacatggaccct
cattaccatccgcctcacctctttccagcattccaccctcctgtaccaattgatgcgaga
catcatgagggacgctaccattatgaaccatctcccattcctcctctgcatgtgccttct
gccttatctagtagcccgacatactcagagcttccgttccttagaatttccccgcaccga
aatcctgctgcaacatcagagtctcccttcagcacccctcacccatacattaacccttac
atggactacatcaggtccctgcacagcagcccgtccctttccatgatctcggcagcccgt
gggctcagcccaacagacgctccacatgctggcgtcagtccagctgaatattaccatcag
atggctctgttggcaggccagcgcagcccgtatgcagacatcattccttcagctgccact
gcaggagctggtgctcttcatatggaatatcttcacgctatggatagcgcgaggtttccg
agtccgagattgtcagctagaccaagccgaaagcgtacgttgtccatatcccccctatct
gatcacagctttgaccttcagaccatgatacggacatctccaaattccttggtcacaatt
ctcaataattctcgtagcagttcctcagccagtggttcttatggccacttatctgcaagt
gcaataagccctgctctgagtttcacataccctcctacaccagtatccctccagcaaatg
catcagcaaattataagtcgtcagcaaaccctaggttcagcctttggacacagccctcca
ctcatccatcctgctccaacttttcctacccagagacctattcccggcatccccagtgtt
ctgaaccctgtccaggtcagctctggaccttctgagtccacacagcagaataaacccaca
agtgaatctgccgtgagcagcactggagatcccatgcacaacaagcgctccaagataaag
cctgacgaggacctgcccagcccgggagcaggaagcgtgcaggaacagccagaaggagtg
accctggtaaaagaggaaggggacaaagatgaaagcaaacaggagcctgaagtggtctat
gagacaaactgccactgggaaggctgttcccgggagtttgacacgcaggaacagctagtg
catcacataaacaatgaccatatacacggtgagaagaaagagttcgtgtgccgatggctg
gactgttcccgggagcagaaaccattcaaagcccagtatatgctggtggtccacatgagg
agacatacaggggaaaagccacacaaatgcacttttgaaggttgtacaaaggcctactcc
agactagaaaacttgaaaacacacttgagatctcacactggagaaaaaccatatgtgtgt
gaacacgaaggctgcaacaaagctttctccaatgcgtccgacagggccaagcaccaaaac
aggactcattccaatgagaaaccgtatgtttgcaagataccgggctgcacaaagcgctac
acagatcccagttctctccggaaacatgtgaagaccgtgcatggcccggaggcgcatgtc
accaagaagcaacggggagatatccacccaaggcctccgccaccgagagacccaggcagc
cactctcagacccggtcaccaggccatcagactcagggtgcgattggtgagcagaaggac
ctcagcaacactacctcaaagcgtgaagaatgcctccaagtgaaagcagtcaagtcggaa
aaaccaatgacatctcagccaagccctggtggtcagtctacatgcagcagcgaacagtcc
cccatcagcaactattccaacaatgggatcgagcttactctgaccagtggtggtagtgta
ggagacctcagtgtcatcgatgaaaccccaatcatggactctaccatttccacagccacc
acagcacttggcttacaggccaggaggaatatgacagggaccaaatggatggagcaagtg
aaattagaaaggttgaaacaagttaacggaatgcttccaagactgaaccccgttccacct
tccaaagccccaaccttgccgcctctcataggaaatggtgcccagtcaaacagcagctgc
agtgtaggaggatccatgactattctgccaaacaggagtgaactttcaagtacagacatc
actgtgttgaacatgctgaacaggcgggacagcaatactagcaccatcagttcagcctac
ttgagcagccgcagatcctctggaatttcaccttgcttctctagccggaggtccagcgat
gcctcccaggcagagggaagaccacagaacgtgagcgttgcagactcctacgaccccatc
tcaacggatgcctcccggcgctccagtgaagcgagccagtgtgatgacctgccaagtctt
ctcagcctcaccccggcccagcaatataggctgaaagctaaatatgcagcagctactggt
ggacccccaccaactccactgcctaacatggagaggatgagcctcaaaacaaggatggca
ctcttgggtgactgcagggagtccggagtatctccactgcctccagtgaatgcccctcga
agatgtagtgatggtggggcaaatggttacagcaggaggcattttctgtctcacgatgct
ctaggaaacgggatgaggagagccagtgatccagtaagaatggcctgtgacaacctctct
gtccctagagtccagcgtttcaacagtcttaatagctttaaccctcctgctttgcctcca
tccatggaaaagcgcaaccttgttcttcagaactatacccgttctgagggtggcgtcttc
cgtggctttagctccccctgtcctccaagcatcagcgagaacgttgccctggaggctgct
acgatggaagcgggtggcagtctgaatgatgaggatctcctgccagatgacgtggttcag
tatctgaattcccagaaccagggcatatacgaccacttgttgaacaatgtcctagacagc
aacaaaatgcatcatggctcagtgttaggaaacgacaaccccagcaactttgaccaagcc
cctccaccaagcagtcagcaggcaggttctgagacaaacaaaagcgacttgcccattcag
tggaatgaagtaagctcaggaagctctgacttatctccttcgaaactgaaatgtggccag
cgctccacagtgcagcagactcgggccttcagactgtacaacaatatgatggttcagcag
aagaacctggagagaagcaacgtggaccagcagaatggctatctggtggagaacaacaat
tcctacggtttacagcaaaacgcagttcttggcagcggagccgctaattctttcagcgtg
cagcccaataagccttacagtgaaagcatcagcaggcaagcaatgatgtctggagcaatg
gacaattcctgtggcatggcagttcaggggcagaagctgagaagcagcaacgtgccagtg
agtgggaaccagcaaaattttggccatcccatggcatccagcgatcaagctgccagtatg
gcaaatgggatgcagaacagcagtatgatggaacaggagtatctgcaaaaccaaccggtc
ggagatgacgttcattaccagggagtcaatcaatccggtcaaatgatgctggggcaggtt
agtcctacctcacaaagcagcctgtatcaggggccgcagagttgtccgccagtgtctcac
accgttggtagccagccttcaggtttgttggtggccaaaagttaccagccatgcaccaat
tacagcagcaacagacggcaaaacatgttgagaaacaacctggcacaacagcaaggacac
gtaagtgatggcaaccagacgtacagggtaaacaccattaagatggagatccaaggtcaa
tcacagcagttctgctctaatgcacagaattactctggtcagttatatgaccaaaccatg
ggctttagctaccaagctatgaaaacaggttcgttctttggttcggaagctaactgcctg
ctgcaggggactgcgactgcaaactcatcggaacttctttccccgggggctaaccaagtg
tcgagcacagttgacagcattgacagcaacagcctagagggtgtgcagattgatttcgat
gctatcatagatgatggggaccatgtcagcttaatttcgggagccctgagcccgagtatc
attcagaatctctcccgcaattcctcacgcctcaccactccccgagcgtctcttacattc
ccagctatgcccgtaagcacaaccaacatggctattggtgacatgagctctttgttgacc
tcacttgcagaagaaagcaagtttcttgctgttatgcaatag
Falco peregrinus (peregrine falcon): 101920502
Entry
101920502 CDS
T02856
Gene name
GLI2
Definition
(RefSeq) GLI family zinc finger 2
KO
K16798
zinc finger protein GLI2
Organism
fpg
Falco peregrinus (peregrine falcon)
Pathway
fpg04340
Hedgehog signaling pathway
Brite
KEGG Orthology (KO) [BR:fpg00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04340 Hedgehog signaling pathway
    101920502 (GLI2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:fpg03000]
    101920502 (GLI2)
Transcription factors [BR:fpg03000]
 Eukaryotic type
  Zinc finger
   Cys2His2 GLI-like
    101920502 (GLI2)
Motif
Pfam: zf-C2H2 zf-H2C2_2 FOXP-CC zf-C2H2_4
Other DBs
NCBI-GeneID: 101920502
NCBI-ProteinID: XP_005244303
Position
Un
AA seq
1528 aa
METSASTAAGKKEGKGAVLEGNGFAETGKKPTPLAAAGAAVAQGVPQHIFPAFHAPLPID
MRHQEGRYHYEPHSIHAIHGPPPLSGSPVISDISLIRLSPHPTGPGESPFSPPHPYVAPH
MEHYLRSVHGSPTLSVISAARGLSPADVAHEHLKERGLFGLPPPPPGANPADYYHQMTLM
AGHPNPYGDLLMQSGGAASTAHLHDYLSPVDVSRFSSPRVTPRLSRKRALSISPLSDASI
DLQTMIRTSPNSLVAYINNSRSSSAASGSYGHLSAGTISPAFSFPHPINPVTYQQILTQQ
RGLSSAFGHTPPLIQPSPTFPPRQHMAVISVNPPPAQISSNSNCISDSSQSKQSSESAVS
STVNPVINKRSKVKTEVEGLPPASPTTQEHLTDLKEDLDKDECKQEPEVIYETNCHWEGC
TKEYDTQEQLVHHINNDHIHGEKKEFVCRWQDCTREQKPFKAQYMLVVHMRRHTGEKPHK
CTFEGCSKAYSRLENLKTHLRSHTGEKPYVCEHEGCNKAFSNASDRAKHQNRTHSNEKPY
VCKIPGCTKRYTDPSSLRKHVKTVHGPDAHVTKKQRNDVHPRPPPLKENGDNEASAKQSS
KISEESPEANSTTRSMEDCLQVKTIKTESSVMCQSSPGGQSSCSSEPSPLGSTNNNDSGV
EMNMHSGGSLGDLTALDDNAPVVDSTVSSGNSAVSLQLRKHMTTMQRLEQLKKEKLKTVK
DSCSWVSPAPQARNTKLPPISGNGSILESSGGSSATLPNPRIMELSVNEVTMLNQLNERR
DSTTSTISSAYTVSRRSSGISPYFSSRRSSEASQLGHRPNNTSSADSYDPISTDASRRSS
EASQCSGMPGLLNLTPAQHYRLKAKYAAATGGPPPTPLPNMERMTLKNRISLMDGPDPTL
PSIRLPPGPRRCSDGNTYGYPSAAAFPHEVPGNCTRRASDPVRRPAGDPQALPRVHRFNS
TNSMNPFHPPHPTDRRNFGLQNYGRSDGSLPRHTYSPRPVSISENIAMEAMSGEVEAPVG
DDDIVLPDDVVQYIKSQSNGTAAESTSMGYSNEMQSFQGSGKLQSPALPSQRRMAAAETS
MSHLGPMMAECPMSFDASSDLNKNNMPVQWNEVSSGTVDIMSNQSKQQFSQGNLAVVQQK
QNFGQYQNYNQQQIQLPENSMNVTQQSFMQRNVGMDGQRLNCMQLRQQPMSLGPSMNPDL
GLHTGYNQPHQMLSPSAISGNPNQISPSCSSMAAKSRPHLHPQQVDMATNPSLMVGSNSE
RTMLGQVMHEPSQQNYSSQPTHLNFPMAQETFHQPIMTSNQPSFEPQQSMMGSATQAYPS
GMVQPHPPPEPSPASRPRGLRTVQQLGYMRTPHPTNTISPGQEAAEAMHKRTSNMLPAPA
QQCADGMRENNLMYYYGQIHMYEQNSSFDNHADCRVRQQQCALNSKPAALPSPGANQVSS
TVDSQGLEPPQIDFDAIMDDGDHSSLMSGTLSPSILQNLSQNSSRLTTPRNSLTLPTIPA
GISNMAIGDMSSMLTTLAEESKFLNMMS
NT seq
4587 nt
atggaaacttctgcttcaacggctgctgggaaaaaagaagggaaaggtgcagttctggag
gggaatggctttgcggagacggggaagaaacccactcctttagcagctgcaggagcagca
gtggcacaaggagtgccgcagcacatttttccggccttccacgctcctttgccaattgac
atgcgccaccaggaaggacgataccattacgaacctcactctatccacgctatccatggg
cctcctcctctgagtggaagccccgtcatctctgacatctccctcatccgcctgtctcca
caccccaccgggcctggcgagtcgcccttcagcccaccgcacccctacgtggcaccccac
atggagcactacctccgctctgtccacggcagccccacgctctccgtcatctctgctgcc
aggggcctcagccctgctgacgtggctcacgagcacctgaaggagcgaggtctcttcggg
cttcccccaccgccaccaggagccaaccctgccgactactaccaccaaatgacgctgatg
gccggccacccaaacccctacggggacctcctgatgcagagcgggggagcagccagcaca
gcccacctccatgattacctcagccctgtcgatgtgtcccggttttcgagcccacgggtg
actccgagattaagccgaaaacgagccctgtccatctccccgctgtccgacgccagcatt
gatctccagacaatgatcaggacctccccaaactctcttgtggcgtatatcaacaactcc
agaagcagttcagcagcaagtggctcctacggccatttatctgctgggacaatcagtcca
gcgttcagcttcccccaccccatcaaccctgtgacataccagcagatcctgacgcagcag
agaggcctgagctcagcctttggacacactcctcccctgatccagccatccccaaccttc
cccccacggcagcacatggcagtcatctccgtcaacccaccgccggcacagatcagcagc
aacagcaactgcatctctgactccagccagagcaagcagagcagcgagtcggccgtgagc
agcacagtcaatccagtaattaacaagcgcagcaaagtcaagactgaagttgagggtttg
cctccagcgtccccaaccacacaggagcatctgacagacctgaaagaagatctagataag
gatgaatgtaaacaggagcctgaggttatttatgagacgaactgtcactgggaagggtgc
acaaaggaatatgacactcaagagcagcttgtccatcacatcaacaacgatcacatccat
ggggagaagaaggagtttgtctgccgctggcaggactgtacacgggagcagaagcccttc
aaggctcagtacatgttagtggtgcacatgcgaaggcacaccggagagaagccacacaag
tgcacgtttgagggttgctccaaagcctattcccgcctggagaacttaaagacacacctg
aggtcccacactggagaaaaaccctatgtctgtgaacacgaaggctgcaataaagccttt
tccaatgcctcggacagagccaagcaccagaaccggacgcattccaatgagaaaccctac
gtctgtaaaatccctggctgcacgaagcggtacacagaccctagttccctcaggaaacat
gtcaagactgtgcacgggcctgatgcccatgtcacgaagaaacagcgcaacgatgttcac
ccgaggccacctccactcaaggaaaatggggacaatgaggcaagcgccaagcagagcagc
aagatttcagaggagagccccgaggccaacagcaccacaaggagcatggaggattgctta
caagtcaaaactataaaaacggagagttctgtgatgtgtcagtccagtcctggtggccag
tcgtcgtgcagcagtgaaccgtcaccccttggcagcaccaacaacaatgacagtggagta
gaaatgaacatgcacagtgggggaagtctgggagatctgacggcgttggatgacaatgct
cctgttgtggactcaacggtctcatctggtaattcggcagtcagcctgcagctaaggaaa
cacatgacgacgatgcaacgacttgaacagctcaagaaagagaaactcaagacagttaag
gattcctgctcatgggtgagcccagctccacaagccagaaacaccaagctgcctcccatc
tcaggaaacggctctattctagaaagtagtggtggttcttctgctacgctgcctaatccc
agaataatggagctgtctgtcaatgaggttacaatgctgaaccaactcaacgagcgccgt
gatagtacaacaagcaccatcagctctgcttacactgtcagccgcagatcatcagggatc
tccccatacttctccagccgccgttccagcgaggcttcacagcttgggcaccgtcccaat
aacacaagctctgctgactcctatgacccaatttcgacggatgcctcccgtcggtcaagc
gaggcgagccagtgcagtgggatgccgggtctgctaaacctcacaccagcccagcactac
aggctgaaagccaagtatgctgctgccacagggggccctcctccaactcccctgccgaat
atggagaggatgactctgaagaacagaatctcacttatggatggaccagatcccaccttg
ccctccatccgcctcccaccaggccccaggcgttgcagcgacggtaacacctatggctat
ccatcagctgcagcgtttccccatgaggtgccaggcaactgcacaagacgggctagcgac
ccggtgaggagacctgcaggagacccccaagccctcccacgagttcaccgcttcaacagc
accaacagcatgaaccccttccatcctccacaccccacggacaggaggaattttggcctc
cagaactacgggcgctcagatgggagcctcccccggcatacctactcaccccggccggtg
agcatcagtgaaaacatcgccatggaagccatgtctggggaggtggaggcacctgtcgga
gatgatgacattgtgctgccagatgatgtggtgcagtatatcaaatcccagagcaatggt
acagcagccgagagcacttccatggggtacagcaatgagatgcagagcttccagggaagt
gggaagctgcagtctccagccttgcccagccagcgcaggatggcagcggccgagacgagc
atgagccacttgggacccatgatggcagagtgtcccatgagctttgatgcttcctcagac
ctgaataaaaataacatgcccgtccagtggaacgaagtcagctcaggtacggttgatatc
atgtccaatcagtcgaagcagcagttttcacaagggaacttagcagtggtccagcagaag
cagaactttggtcagtaccagaactataaccagcagcagatacagctgccagagaacagt
atgaatgtaacacagcagagctttatgcaaagaaatgtgggcatggatgggcagaggcta
aactgtatgcagctgcggcagcagcccatgagcctaggccccagcatgaaccctgatctg
ggcttgcacacagggtataaccaacctcatcagatgctgagccccagtgcaatcagtggc
aacccaaatcagatctctccttcttgtagcagcatggcagcaaagtccagaccccacctt
caccctcagcaagtggacatggcgaccaacccttctttaatggttggcagcaacagcgag
cgcacaatgctgggccaggtcatgcatgaaccgagtcagcagaactactcatctcagccg
acccacctgaacttccccatggctcaagagacctttcatcagcccatcatgacctccaac
cagcccagctttgagcctcagcaaagcatgatgggctcggccacgcaggcatatccgtcc
ggcatggtccagcctcatcctccgcccgagccaagcccggccagcagaccccgaggcctc
cgcaccgtccagcaattgggctacatgcgaactccccaccctacaaacaccatcagcccg
ggccaggaggcggcggaggccatgcataaaagaacaagcaacatgttgcctgcaccagcc
cagcagtgcgcggacggcatgagggaaaacaatttgatgtactattacggtcaaatccac
atgtatgaacaaaacagcagcttcgataatcacgcggactgccgggtcagacagcagcag
tgtgcgctgaacagcaagccggccgctctgccttccccgggcgcgaaccaggtatccagc
acggtggactcgcaaggccttgagccaccccagatagattttgatgccatcatggacgat
ggggaccattcgagcctgatgtcaggaaccctgagccccagcatcctgcagaacctctcc
cagaactcctctcgcctgacgactccacgaaactctctgacactgcccaccatacccgcg
ggaataagcaatatggcaataggcgatatgagctccatgctaaccacgctggcagaagag
agtaaatttctaaacatgatgtcataa