KEGG   Rousettus aegyptiacus (Egyptian rousette): 107501496
Entry
107501496         CDS       T06036                                 

Gene name
GLI1
Definition
(RefSeq) LOW QUALITY PROTEIN: zinc finger protein GLI1
  KO
K16797  zinc finger protein GLI1
Organism
ray  Rousettus aegyptiacus (Egyptian rousette)
Pathway
ray04024  cAMP signaling pathway
ray04340  Hedgehog signaling pathway
ray05200  Pathways in cancer
ray05217  Basal cell carcinoma
Brite
KEGG Orthology (KO) [BR:ray00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04340 Hedgehog signaling pathway
    107501496 (GLI1)
   04024 cAMP signaling pathway
    107501496 (GLI1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    107501496 (GLI1)
  09162 Cancer: specific types
   05217 Basal cell carcinoma
    107501496 (GLI1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:ray03000]
    107501496 (GLI1)
Transcription factors [BR:ray03000]
 Eukaryotic type
  Zinc finger
   Cys2His2 GLI-like
    107501496 (GLI1)
SSDB
Motif
Pfam: zf-C2H2 zf-H2C2_2 FOXP-CC zf-C2H2_4 IL17
Other DBs
NCBI-GeneID: 107501496
NCBI-ProteinID: XP_015983183
LinkDB
Position
Unknown
AA seq 1095 aa
MFNSMTPPPVSSYGEPCCLRPLPSQGAPSMGTEGLSGLPFCHQAGLMSGPHSYGPARETN
SCTEAPFFPPPRSAVKLTKKRALSISPLSDANLDLQTVIRTSPSSLVAFINSRCASPGGS
YGHLSISTMSPSLGFPPQMTHQKGTSSSFGVQPCGPHDPTQGGMMPHPQSRGPLPTCQLK
SELDMLVSKCPEEPLEGDMSSPNSTGTQDVLLGMLDGREDLEREEKPEPESVYETDCRWD
GCSQEFDSQEQLVHHINSEHIHGERKEFVCHWGGCSRELRPFKAQYMLVVHMRRHTGEKP
HKCTFEGCRKSYSRLENLKTHLRSHTGEKPYMCEHEGCSKAFSNASDRAKHQNRTHSNEK
PYVCKLPGCTKRYTDPSSLRKHVKTVHGPDAHVTKRHRGDGPLPRAPSLSTVEPKRERDG
GPVREESRLTVPEGTMKPQPSPGAQSSCSSDHSPAGSAANTDSGVEMTGNTGGSTEDLSN
LDEGPCIAGTGLSTLRRLENLRLDQLHQLRPIGPRGLKLPSLTHTGTPVSRRLGPPVTLD
RRSSSSSSVSSAYTVSRRSSLTSPFPPSSPPENGASSLPGLTPAQHYLLRARYASTRGGG
TPPTAAPSWRSRAEYPGYNPNVGVTRRASDPARAADRPAPARVQRFKSLGCVHSPPTVAG
GGRSFDPPLPTSVYSPQPPSITENVTMDTRGLREEPEVGTSMMGSGLNPYMDFPPADTLG
YGGPEGAATEPYGARGPGSLPLGPGPPTNYSHNACSQQVSYPDPTPETWGELPSHSGLYP
GPKSPAGTYSQCPRLEHYGHIQVKPEQGCPMGSDSTGLGPCLNAHPNEGPPRPQTLFSHY
PQPSPPQYSQSGPYTQPTPDYLPSESRPGPDFDSPTHSTGQLKTQLVCNYVQSQQELLWE
GGGRGDPPVQEPFYQSPKFLGGSQVNPSSAKVPVATYGSGFAPNLPSHKSGSYPTPSTCH
DNFAMGTTKASHRAAAPPRLLPPLPTCYGPLKAGGTNPSCGHPEVGRLGGGPNLYPPPEG
QVCNPLDSLDLDNTQLDFVAILDEAQGLSPPPSHDQGDSSEHTPPPSGPPNMAVGNMSVL
LGSLPGETQFLNSSA
NT seq 3288 nt   +upstreamnt  +downstreamnt
atgttcaactcgatgaccccaccgccagtcagtagctatggcgaaccctgctgtctccgg
cccctccccagtcagggggcccccagcatggggacagaaggactatctggtctgcccttc
tgccaccaggccggcctcatgtctggcccccacagttatgggccagccagggagaccaac
agctgcactgaggccccattctttcctcctcctcggagtgcagtcaagttgacgaagaag
cgggcactctccatctcacctctgtcagatgccaacctggacctgcaaacagttatccgc
acctcacccagctccctcgtggccttcatcaactcacgctgtgcttctccagggggctcc
tatggtcacctctccatcagcaccatgagcccatctctgggattcccaccccagatgact
caccaaaaagggacctcgtcttcctttggggtccagccctgcggtccccatgaccccacc
caggggggaatgatgccacatcctcagtcccggggacccctcccaacttgccagctgaag
tctgagctggacatgctggttagcaagtgcccggaggagcctttggaaggtgacatgtcc
agtcccaactccacaggcacacaggatgtcctgctggggatgctggatgggcgggaagac
ctagagagagaagagaagccagagcctgaatctgtgtatgagactgactgccgttgggat
ggctgcagccaggaatttgactcccaggagcagctggtgcaccacatcaacagcgagcac
attcatggggagcggaaggagttcgtgtgccattgggggggctgctccagggagctgagg
cccttcaaagcccagtacatgctggtggtccacatgcgcagacacacaggcgagaagcca
cacaagtgcacgtttgagggatgccggaagtcatactcacgcctcgaaaatctgaagacg
cacctacggtcgcacactggtgagaagccatacatgtgtgagcacgagggctgcagtaaa
gccttcagcaatgccagcgaccgagccaagcaccagaatcggacccactccaatgagaag
ccttatgtgtgtaagctccctggctgcaccaaacgctacaccgatcccagctcgctccgg
aaacatgtcaagacagtgcatggtcccgacgcccacgtgaccaagcggcaccgaggggac
ggccctctgccccgggcaccatccctttccactgtggagcccaagagagagcgggatgga
ggccccgtcagggaggagagcagactgactgtaccagaggggaccatgaagccacagcca
agccctggggcccagtcgtcttgcagcagtgaccactccccagcaggcagtgcagctaat
acagacagtggtgtggaaatgactggaaatacagggggcagcactgaggacctatccaac
ttggacgagggaccttgcattgctggcactggtctgtccactcttcgccgccttgagaac
ctcaggctggaccaactccatcaactccggccaatagggccccggggcctcaaactgccc
agcttgacccacaccggcacccctgtgtcccgtcgtttgggccctccagttactcttgat
cgtcgcagcagcagctccagcagcgtcagctcagcctatactgtcagtcgccgctcctct
ctgacctcccctttcccccctagttctccaccagagaatggggcatcctcgctgcctggc
ctcacacctgcccagcactacctgctccgggcaagatatgcttcaaccaggggaggtggt
accccacccactgcagcaccttcttggagaagccgagctgagtatccaggatacaacccc
aatgtaggggtcacccggagagccagtgacccagcccgggctgctgaccgcccagcccca
gccagagtccaacggttcaagagcttgggttgtgtccacagcccccctacggtggcagga
ggaggacggagctttgatcccccactcccaacctctgtctactcaccacagccccctagt
atcactgagaatgtcaccatggataccagagggctacgggaggagccagaggttgggacc
tccatgatgggcagtggtctgaacccctatatggacttcccacctgctgatactctgggt
tatgggggacctgaaggggcagcaactgagccttatggagctaggggtccaggctccctg
cctcttggacctggtccacccaccaactacagccacaacgcttgttctcagcaggtctcc
tatcctgatcccactccagaaacatggggtgaactcccttcccactctgggctataccca
ggccctaagtctccagctggaacctacagccagtgtcctcgtctagaacattatggacac
atacaggtcaagccggaacagggatgtccgatgggttccgactccacaggactgggaccc
tgcctcaacgctcaccccaatgagggacctccacgtccacagactctgttctctcactac
ccccagccttcccctccccagtattcccagtcaggcccctatactcagccaacccctgat
tatcttccttcagaatctaggcctggcccagattttgattctcctactcattccacagga
cagctcaagactcagcttgtgtgtaattatgttcagtctcaacaggagctgctctgggag
ggtggaggtaggggagatcccccagtccaggaaccattctaccagagtcccaagtttctg
gggggctcccaggttaacccaagctctgccaaggtcccagtggccacctatggatctggc
tttgcacctaacttgcccagtcataagtcaggctcctatcccactccttcgacatgccat
gataattttgcaatggggacaaccaaggcttcccatagggcagcagcgccacctcgactt
ctgcctccactgcccacttgctatgggcccctcaaggcagggggcaccaaccccagctgt
ggccaccctgaggtgggcaggttgggagggggtcctaacttgtaccctcctcctgaaggg
caggtatgtaaccccttggactctcttgatcttgacaacactcagctggactttgtggct
attctggatgaggcccaggggctgagtccacccccttcccatgatcagggggacagctct
gaacataccccacctccctctggacctcccaacatggctgtgggcaacatgagtgtctta
ttgggatccctacctggggagacacaattcctcaactctagtgcctaa

KEGG   Rousettus aegyptiacus (Egyptian rousette): 107501918
Entry
107501918         CDS       T06036                                 

Gene name
GLI2
Definition
(RefSeq) LOW QUALITY PROTEIN: zinc finger protein GLI2
  KO
K16798  zinc finger protein GLI2
Organism
ray  Rousettus aegyptiacus (Egyptian rousette)
Pathway
ray04340  Hedgehog signaling pathway
ray04390  Hippo signaling pathway
ray05200  Pathways in cancer
ray05217  Basal cell carcinoma
Brite
KEGG Orthology (KO) [BR:ray00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04340 Hedgehog signaling pathway
    107501918 (GLI2)
   04390 Hippo signaling pathway
    107501918 (GLI2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    107501918 (GLI2)
  09162 Cancer: specific types
   05217 Basal cell carcinoma
    107501918 (GLI2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:ray03000]
    107501918 (GLI2)
Transcription factors [BR:ray03000]
 Eukaryotic type
  Zinc finger
   Cys2His2 GLI-like
    107501918 (GLI2)
SSDB
Motif
Pfam: zf-C2H2 zf-H2C2_2 FOXP-CC zf-C2H2_4
Other DBs
NCBI-GeneID: 107501918
NCBI-ProteinID: XP_015983987
LinkDB
Position
Unknown
AA seq 1578 aa
METSTSATAAEKKEAESAILEGAAFPDPGRKASALAVATVAAAAAAAAQGVPQHLLPPFH
APLPIDMRHQEGRYHYEPHSVHGMHGPPALSGSPVISDISLIRLSPHPAGPGESLFNAPH
PYVSPHMEHYLRAMHSSPTLPTISASRGLSPADVAHEHLKERGLFGLPPTGTNPSDYYHQ
MTFMAGHPTPYGDLLMQPGGATGAPHLHDYLNPVDVSRFSSPRVTPRLSRKRALSISPLS
DASLDLQRMIRTSPNSLVAYINNSRSSSAASGSYGHLSAGTLSPAFTFPHPINPVAYQQI
LSQQRGLGSAFGHTPPLIQPSPTFLAQQPIALTSINATPTQISSGSSNCLSDTNQNKQNS
ESAVSSTVNPIVIHKRSKVKTEAEGLRPASPLALTQGQVAGQSSHGLCALLLPQEQLADL
KEDLDRDDCKQEAEVVIYETNCHWEDCTKEYDTQEQLVHHINNEHIHGEKKEFVCRWQAC
TREQKPFKAQYMLVVHMRRHTGEKPHKCTFEGCSKAYSRLENLKTHLRSHTGEKPYVCEH
EGCNKAFSNASDRAKHQNRTHSNEKPYICKIPGCTKRYTDPSSLRKHVKTVHGPDAHVTK
KQRNDVHLRAPLLKENGDNEASTEPSGRGSEESAEASSTSQAVEDCLHVKAIKTESSGLC
QSSPGAQSSCSSEPSPLGSAPNNDSGVDMPGSLGDLTALDDTPPGADASALANPSTGGLQ
LRQHLTAMHRLEQLKKEKLKSLKDSCSWAGPAPHTRNTKLPPLPGSGSILENFSSGGGGR
PVGLLPNPRLSELSASEVTMLSHLQERRDSSTSTVSSAYTVSRRSSGISPYFSSRRSSEA
SPLGAGPPHHASSADSYDPISTDASRRSSEASQCSGSGAGPLSLTPAQQYSLRAKYAAAT
GGPPPTPLPGLERGSLRARLALLDMPERALGPRRGSGGPAYGHAGPAPAFPHEAPGGGAR
RASDPVRRPDALAPPRVHRFHSAHDVNPAPLPPCASERRGLRLQGLTSTDGSLTRSVYSP
RPPSISENAVMEAMVAGTDSAGPEGNLMLPEDDLVLPDDVVQYIKAHASGALDESAPQVY
PPESTCFSEHSKLPSPGLHGQRRMAAADSHMGPSARVLGDCQLGYGASSSLNKNNMPVQW
NEVSSGTMDVLATQVKPPAFPQGNLAVVPQKPTTFGQYPSYSPQGLQPSPGVLESTQPHL
QSCSGAPSTSRVNYMQQVRQPGTGGQCPNMTTTVSPHTTYGQAHPQLSSSTMGGALNQFS
PSCSNLAAKQGHLGLPQQMEVAPDPTMVGSSHREFGVPNSTLAGMAPSHPAQSYPQQSHH
LATSMNQEGYRQGPSHLPSHQPGFMELQQGTVGLAGSSFGLVQPRPPPEPSPAGRHRAVR
AGQQLAYARATGHAMAAVSANQEMAESVPKGAMATMVSLPPQLPPQDPGGAQDHNMLYYY
GQIHMYEQNGSLENHVGCQVMRPQPPQPQACSESIQSQPLPSPGVNQVSSTVDSQLLEAA
QIDFDAIMDDGDHASLLSGTLSPSLLHSLSQNSSRLTTPRNSLTLPSIPTGISNMAVGDM
SSMLTSLAEESKFLNMMT
NT seq 4737 nt   +upstreamnt  +downstreamnt
atggagacgtccacgtcagccactgccgcggagaagaaggaagccgaaagcgcgatcctg
gagggcgccgctttccccgacccgggcaggaaggcctctgccctggcggtggccacggtg
gcggcagctgcggcggcagctgcccaaggagtgccgcagcaccttttgccaccattccat
gctcccttaccgattgacatgagacaccaggaaggacggtaccattacgagcctcactct
gtccatggcatgcacgggcctcctgccctgagcggcagccctgtcatctctgatatctca
ttgatccggctttctccacaccccgctggccctggggagtccctcttcaatgccccccac
ccatacgtgagcccccacatggagcattaccttcgcgccatgcacagcagccccacgctc
cccacgatctctgcatccaggggcctcagccctgctgatgtggcccatgagcaccttaag
gagaggggactgtttggcctcccgcctacggggaccaacccctcagactattaccaccag
atgaccttcatggcgggccatcccacaccatatggagacctgctgatgcagcctgggggt
gccactggtgctccccatctccatgactacctcaatccagtggatgtgtcccgtttctcc
agcccacgggtaacaccccgcctgagccgaaagcgcgcactgtccatctccccgctctcg
gacgccagcctcgacctacaaaggatgatccggacctcgcccaactcgctggtggcctac
atcaacaactcccgcagcagctcagcagccagtggctcctatgggcacctgtcggcgggc
accctcagcccagccttcacctttccccatcccatcaaccccgtggcctaccagcagatc
ctgagccagcagaggggtctgggctcggccttcggacacacgccacccctgatccagccc
tcgcccaccttcctggcccagcagcctatagccctcacctccatcaatgccacgcccact
cagatcagcagcggcagcagcaactgtctgagtgacaccaaccagaacaagcagaacagc
gagtcggctgtgagcagcaccgtcaaccctatcgtcattcataagcgcagcaaggtcaag
actgaggctgagggcctgcggccagcctcccctttggccctgacacagggccaggtggct
ggacaaagctcacatgggttgtgtgccctgctcctcccccaggagcagctggctgacctc
aaggaagacctagacagggacgactgtaagcaggaggccgaggtggtcatctatgagacc
aactgccactgggaagactgcaccaaggagtatgacacccaggagcagctggtgcatcac
atcaacaatgagcacatccacggggagaagaaggagtttgtgtgccgctggcaggcctgc
acgcgagagcagaagcccttcaaggcgcagtacatgctggtggtgcacatgcgccggcac
acaggcgagaagccccacaagtgcacgtttgagggctgctcgaaggcctactctcgcctg
gagaacctgaagacgcatctgcggtcccacaccggggagaagccatacgtgtgtgagcac
gagggctgcaacaaggccttctccaacgcctccgaccgcgccaagcaccagaaccgcacc
cactccaatgagaaaccctatatctgcaagatcccaggctgtaccaagcgatacacagac
ccaagttctctccggaagcatgtgaaaacagtgcacggcccagatgctcatgtcaccaag
aagcagcgcaatgatgtgcacctccgggccccgctgctcaaggagaatggggacaatgag
gccagcaccgagcccagcggccggggctctgaggagagcgctgaggccagcagcaccagc
caggccgtggaggactgcctgcatgtcaaagccatcaagactgagagttccgggctgtgt
cagtccagccccggggcccagtcgtcctgcagcagcgagccctctcccttgggcagtgcc
cccaacaatgacagcggtgtggacatgcctgggagcctgggagacctgacagctctagat
gacacgcccccaggggccgatgcttcagccctggccaacccctccactggtggcctgcag
ctgcgccaacacctgaccgccatgcaccggttagagcagctcaagaaggagaagctcaag
tcactcaaggattcctgctcatgggccgggccggctccacacacccggaacaccaagctg
cctcccctcccgggaagtggctccatcctggaaaacttcagcagcggcgggggcggcagg
ccggtgggactgctgccaaacccaaggctgtcagagctttctgcaagcgaggtgaccatg
ctgagccacctacaagagcgccgcgacagctccaccagcacggtcagctcagcctacacc
gtgagtcgccgctcctccggcatctcgccgtacttctccagccgccgctccagcgaggcc
tcacccctgggcgcaggtcccccgcatcatgccagctctgccgactcctacgaccccatc
tccaccgatgcatcacggcgctcaagtgaggccagccagtgcagtggcagcggcgcaggg
ccgctcagcctgactccggcgcagcagtacagcctgcgggccaagtacgcagcggccacc
ggcgggccgccccccacgccgctgcctggcctggagcgcgggagtctacgggcccggctg
gcactgctggatatgcccgagcgtgccttgggaccccggagaggcagcggcgggccggcc
tatggccatgcggggcctgcgcccgccttccctcacgaggcaccgggaggaggggcgcgg
cgggccagcgacccggtgaggcggcccgacgccctggctcccccgcgggtgcaccgcttc
cacagtgcccacgatgtgaaccccgcaccgctgccaccctgtgcctccgagaggcgcggc
ctccgcctgcagggcctcactagcaccgacggcagcctgacccgcagtgtctactccccc
cggccgcccagcatcagcgagaacgcggtgatggaggccatggtggcagggacagacagt
gcggggcctgaaggcaacctcatgctgcccgaggacgaccttgtgctgcccgatgacgtg
gtgcagtacatcaaggcgcatgccagtggcgccctggatgagagtgccccgcaggtatat
cctccggaaagcacttgcttctctgagcactccaaattgcctagcccgggactgcacggc
cagcgtaggatggcggctgccgactcccacatgggcccctcggcccgggtactgggagac
tgccagctgggatatggggcctcctctagcctaaacaaaaacaacatgcctgtgcagtgg
aacgaggtgagctctggcaccatggatgtcctggccacccaggtgaagcctccagctttt
ccacagggcaacctggctgtggtgccgcagaagccaacaacctttggccagtacccaagc
tatagtccacaaggcctgcagccgagccctggggtcctggagagcacacagccacacctt
cagtcttgcagtggagccccctccacatccagagtaaattatatgcagcaggtgcggcag
ccaggaacaggtggccagtgtcccaatatgaccaccactgtgagcccccataccacctac
ggccaagcccacccccagctgagctccagtaccatgggtggggccctcaaccagttctcc
ccatcctgcagcaacctggcagccaagcaaggccacctggggctcccccagcaaatggaa
gttgctcccgaccctaccatggtgggtagcagccacagggaatttggggttcccaattcc
accctggctgggatggcgccatctcacccagcccagagctacccacagcagagccatcac
ctggcaacctccatgaaccaagagggctaccgccagggccccagccatctgccttcccac
cagcctggtttcatggagctccagcaaggcacagttgggctggccggatcaagctttggc
ctagtgcaaccccggccaccccctgagcccagccctgccggccgccaccgtgcagtacgt
gctgggcagcagcttgcctatgccagagccactggccatgccatggctgctgtgtcagcc
aaccaggagatggcagagtcggtgcccaagggagcaatggccaccatggtgtctctgcct
cctcagctgcccccacaggatccaggtggggcccaggaccacaacatgctctactactat
ggccagatccacatgtatgaacagaatggaagcctagagaaccacgtaggctgccaggtc
atgaggccccagccaccgcagccacaggcctgctcggagagcatccagtcccagcccttg
ccctcaccaggggtcaaccaggtatccagcactgtggactcccagctcctggaggctgcc
cagattgattttgacgccatcatggatgacggcgatcacgccagcttgctctcgggcacc
ctgagccccagcctcctccacagcctctcccagaactcctcccgcctcaccaccccccgg
aactctctgactctgccctccatccccacgggcatcagcaatatggctgtcggggacatg
agctctatgcttaccagcctggccgaggagagcaagttcctgaacatgatgacctaa

KEGG   Rousettus aegyptiacus (Egyptian rousette): 107502401
Entry
107502401         CDS       T06036                                 

Gene name
GLI3
Definition
(RefSeq) transcriptional activator GLI3
  KO
K06230  zinc finger protein GLI3
Organism
ray  Rousettus aegyptiacus (Egyptian rousette)
Pathway
ray04024  cAMP signaling pathway
ray04340  Hedgehog signaling pathway
ray05200  Pathways in cancer
ray05217  Basal cell carcinoma
Brite
KEGG Orthology (KO) [BR:ray00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04340 Hedgehog signaling pathway
    107502401 (GLI3)
   04024 cAMP signaling pathway
    107502401 (GLI3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    107502401 (GLI3)
  09162 Cancer: specific types
   05217 Basal cell carcinoma
    107502401 (GLI3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:ray03000]
    107502401 (GLI3)
Transcription factors [BR:ray03000]
 Eukaryotic type
  Zinc finger
   Cys2His2 GLI-like
    107502401 (GLI3)
SSDB
Motif
Pfam: zf-C2H2 zf-H2C2_2 zf-C2H2_4 FOXP-CC
Other DBs
NCBI-GeneID: 107502401
NCBI-ProteinID: XP_015985055
LinkDB
Position
Unknown
AA seq 1485 aa
TGILFQAFLLYSLAFLSFSDPPHLFPAFHPPVPIDARHHEGRYHYDPSPIPPLHVPSALS
SSPTYSDLPFIRISPHRNPAAASESPFSPPHPYINPYMDYIRSLHSSPSLSMISAARGLS
PTDAPHTGVSPAEYYHQMALLAGQRSPYADIIPSAPTAGTGAIHMEYLHAMDSTRFPSPR
LSARPSRKRTLSISPLSDHSFDLQTMIRTSPNSLVTILNNSRSSSSASGSYGHLSASAIS
PALSFTYPSAPMSLHMHQQILSRQQSLGSAFGHSPPLIHPAPTFPTQRPIPGIPTVLNPV
QVSSGPSESSQQNKPTSESAVSSTGDLMHNKRSKIKPDEDLPSPGPRGQQEQPEGTTLVK
EEGDKDESKQEPEVIYETNCHWEGCTKEFDTQEQLVHHINNDHIHGEKKEFVCRWLDCSR
EQKPFKAQYMLVVHMRRHTGEKPHKCTFEGCTKAYSRLENLKTHLRSHTGEKPYVCEHEG
CNKAFSNASDRAKHQNRTHSNEKPYVCKIPGCTKRYTDPSSLRKHVKTVHGPEAHVTKKQ
RGDIHPRPPPPRDSGNHSQSRSPGRQTQGALGEQKDLSNTTSKQEECLQVKTVKAEKPMT
SQPSPGGQSSCSSQQSPISNYSNSGLELPLTDGDGMGDLSAIDETPIMDSTISTATTALT
LQARRNPPGTKWMEHVKLERLKQVNGMLPRLNPILSPKAPAVSPLIGNGTQSHNSCNLGG
PMTLLPNRSDLSGVDVTMLNMLNRRDSSASTISSAYLSSRRSSGISPCFSSRRSSEASQA
EGRPHNVSVADSYDPISTDASRRSSEASQNDGLPSLLTLTPAQQYRLKAKYAAATGGPPP
TPLPNMERMSLKTRMALFGDSKESTGALPPVHPPRRCSDGGTHIYGRRPLLPQDTLSHSV
RRASDPVRTVSENFSLPRVQRFNSLSSFNSLVLPPSLEKRNLVLQNYTRPEGGQSRHFHS
SPCPPSITENVTLESLTMDADVNLNDEDFLPDDVVQYLNSQNQAGYEQHFQSDISDDCKL
PPGPGLGSGHSNFDSPGLPDSHASQQYRTLEQPCPEHSKADLPIQWNEVSSGSADLSSSR
LKCGQRSTAQPARGFGLYSNMGVHPQNPLRSGGGQVGGYQTLGEHGSAYDGPEHFVSHSG
GTGTNGNAFHEQPYKVQQYVNYLTRQPGAHSSLDSACSAGSQAAKLRSTTMQGNGSQPDF
NLSMVPSESAGGTVNGMPTRDLMGQGYLAHQLLSESIHHQGAGRAGQQMLEQVSATSYIH
IYQGPESSLPGPPSIGGQPSSLAAVRGYQPCVNYGGSRRQAVPRGSLTLQPGQHSVTSQT
CRVNGIKMEMQGQPHQLCSNVQSYSGQFYDQTIGFSQQDVKTGSFSISEANCLLQGASAE
NSELLSPGANQVTSTVDSLDNHDLEGVQIDFDAIIDDGDHASLMSGALSPSIIQNLSHNS
SRLTTPRTSLPLPALSVSTTNMAIGDMSSLLTSLAEESKFLAVMQ
NT seq 4458 nt   +upstreamnt  +downstreamnt
actggaattctgtttcaagctttccttttatacagtttagcctttctttctttttcagac
cctcctcacctttttcctgcattccatcctcctgtgccaatcgatgcaagacatcacgag
gggcgttaccattatgatccatctccgattcctccattgcatgtgccttctgccttatct
agtagcccaacatattcagacctgcccttcattaggatctccccgcaccggaaccccgct
gcagcttctgagtcccccttcagccctccacatccctacatcaatccttacatggactac
atccggtctctgcatagcagcccgtccctctccatgatctctgcagcccgggggctgagc
cccacagatgcgccccacactggagtcagtccagcagaatactatcatcagatggctctg
ttagctggccagcgcagcccctatgcagacatcattccttcagctcccactgctggcact
ggagccattcacatggaatatcttcatgccatggatagcaccagattccccagccccagg
ctgtcagccaggccaagccgaaaacgtacattgtccatatcaccactgtccgatcatagc
tttgaccttcagacaatgataaggacatctcccaactccttggtcacaattctcaataat
tctcgtagcagctcctcagcaagtggctcctatggtcacttatctgcgagtgcaatcagc
cctgccttgagtttcacctacccttctgcacccatgtctcttcatatgcatcaacaaatc
ttaagccgacaacagagcttaggctcggcctttggacatagccctccactcatccatcct
gccccaacttttccaacacagaggcctattcctgggatccctacggttctgaaccccgtc
caggtcagctctggcccttccgaatcctcacagcagaacaagcccacaagtgagtctgcg
gtgagcagtactggcgacctgatgcacaacaagcggtccaagatcaaacctgatgaggac
ctccccagccctgggccacggggtcagcaggaacagccagaaggaaccaccctggtgaaa
gaggaaggggacaaagatgaaagcaagcaagagcctgaagtcatctatgagacaaattgc
cactgggaaggctgcaccaaggagtttgacacccaggagcagctggtgcatcatataaat
aatgaccatattcatggagagaagaaggaatttgtttgtcggtggttagattgctcaaga
gaacagaaacccttcaaagcccagtatatgttggtagtgcacatgagaagacacacagga
gagaagcctcacaaatgcacttttgaaggttgcacaaaggcctactcaagactagaaaac
ttgaaaacacacttaagatctcacactggagagaaaccatatgtctgtgagcatgaaggt
tgtaacaaggctttctcaaatgcctccgatcgcgccaagcaccaaaacagaacacattcc
aatgagaaaccatacgtgtgcaaaatcccaggctgcactaagcgctacacagacccaagc
tcccttcggaaacacgtgaagacagtgcatggcccagaggctcatgtcaccaagaagcag
cgtggggacatacatcctcggcccccgccaccacgagattctggcaaccattcacagtcc
cggtcacctggccggcagactcagggagcccttggtgagcagaaggacctcagcaacact
acctcaaaacaggaggagtgtctgcaagtgaaaacggtcaaggcggagaagcccatgaca
tctcagccaagccctggtggtcagtcttcatgcagcagccaacaatcccccatcagcaac
tattccaacagtgggctcgagcttcctctgacagatggagatggtatgggagacctcagt
gccattgacgaaacccccatcatggactcgaccatttccactgccaccacggccctcacc
ctgcaagccaggagaaacccaccagggaccaaatggatggagcacgtaaaactggaaagg
ctcaaacaagtgaatggaatgctaccgcgcctgaaccccatcctatcccccaaagcccct
gcggtctctcctctcataggaaatggtacccagtcccataacagctgcaatttgggtggg
cccatgaccctcctccctaacagaagtgacctctctggggttgatgtcaccatgctgaac
atgctcaataggagggacagcagcgccagtaccatcagctctgcctacctgagcagccgc
cgttcctcgggcatctctccctgcttctctagccggaggtccagcgaggcatcccaggcc
gagggccggcctcacaatgtaagtgtcgctgactcctatgaccccatctccactgacgcc
tcacgcaggtccagtgaagccagccagaacgatggcctgcctagcctactcaccctcacg
ccagcccagcagtaccgcctgaaagccaagtatgccgccgccacaggtggaccgccaccc
acgccactgcccaatatggagaggatgagcctgaagaccagaatggccctattcggggac
agtaaggagtccacaggagccctgccccctgtacaccctcctcgaagatgtagcgatggg
ggcacccacatctatggccggcgccctctgctgccccaggatacgctgagccacagtgta
agaagagccagcgacccagtaaggacagtttcagagaacttctccctgcccagggtgcag
cgtttcaacagcctgagcagcttcaactctctggttttgcctccatccctggaaaaacgt
aacttagtgcttcagaattacacacgacctgagggtggccagtcccgccatttccactcc
tctccttgtcctccaagcattactgagaatgtcaccctggagtccttgaccatggacgct
gatgtgaacctgaatgatgaggatttcttacccgatgatgtggtacagtatttaaattcc
cagaaccaagctgggtatgagcagcacttccagagtgacatctctgacgactgcaaactg
ccccccgggcctggcttggggagtgggcacagcaactttgactctcctgggctaccagac
agccacgccagccagcagtaccgcacgctggagcagccttgccctgaacacagcaaagct
gacctgcccattcagtggaacgaggttagctctggcagtgctgacctgtcctcctctagg
ctaaagtgtggccagaggtccacagcacagccagcccgaggctttgggttatatagcaac
atgggcgtccacccacagaatccgctgaggagtgggggtggccaggtagggggctatcag
acccttggggagcatggcagtgcctacgacggcccagaacactttgtgagccacagtggc
gggactggcaccaatggaaatgccttccatgaacagccttataaggtccagcagtatgtg
aactatctcaccaggcagcctggggcccacagctccctcgacagtgcctgcagtgctggg
agtcaagctgcaaagctgagaagtaccaccatgcaagggaacgggagccagccagatttc
aacctgtcgatggtgcccagtgagtcagctggtggcacagtgaatggcatgccaactcga
gacctgatggggcagggctacctggctcatcagctactcagtgaaagtatacaccaccag
ggggcaggccgagcggggcagcagatgctagagcaggttagtgctacctcatacatccat
atctatcaagggccagagagcagcctgccagggcctcccagcattgggggccagccgtcc
agcttggcagctgtcaggggctaccagccatgtgtcaactatggaggcagcaggcgccag
gctgtgccgagaggcagccttactctacagccaggacagcacagtgtcacaagtcagacc
tgcagggtgaatggcatcaagatggagatgcaagggcagcctcatcaactctgctctaat
gttcagagttactctggtcagttctatgaccaaaccattggcttcagtcagcaagatgtg
aaaactggttcattctctatttcagaagccaactgcctgctacagggggccagcgccgaa
aactctgaattactttccccaggtgctaaccaggtgacaagcacagttgacagccttgac
aaccatgatctggaaggggtgcagattgattttgacgcaatcatagatgacggagaccac
gccagcctgatgtcaggagccctgagcccaagtattattcagaacctttcccataactcc
tcccgcctcacaacaccacgaacatccctcccattgccagcactgtctgtgagcaccacc
aacatggccattggggacatgagttctttgttgacctcccttgcggaagaaagcaaattc
cttgcagttatgcaatag

DBGET integrated database retrieval system