Homo sapiens (human): 3713
Help
Entry
3713 CDS
T01001
Symbol
IVL
Name
(RefSeq) involucrin
KO
K28330
involucrin
Organism
hsa
Homo sapiens (human)
Pathway
hsa04382
Cornified envelope formation
Network
nt06545
Cornified envelope formation
Element
N01923
Closslinking of envoplakin, periplakin and involucrin
N01924
Loricrin oligomerization with small proline-rich proteins and further stabilization
N01929
Peroxidation and deglycosylation of acylglucosylceramide
Brite
KEGG Orthology (KO) [BR:
hsa00001
]
09150 Organismal Systems
09158 Development and regeneration
04382 Cornified envelope formation
3713 (IVL)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
Involucrin
Involucrin_N
Motif
Other DBs
NCBI-GeneID:
3713
NCBI-ProteinID:
NP_005538
OMIM:
147360
HGNC:
6187
Ensembl:
ENSG00000163207
UniProt:
P07476
LinkDB
All DBs
Position
1:152908546..152911886
Genome browser
AA seq
585 aa
AA seq
DB search
MSQQHTLPVTLSPALSQELLKTVPPPVNTHQEQMKQPTPLPPPCQKVPVELPVEVPSKQE
EKHMTAVKGLPEQECEQQQKEPQEQELQQQHWEQHEEYQKAENPEQQLKQEKTQRDQQLN
KQLEEEKKLLDQQLDQELVKRDEQLGMKKEQLLELPEQQEGHLKHLEQQEGQLKHPEQQE
GQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPQQQE
GQLELSEQQEGQLELSEQQEGQLKHLEHQEGQLEVPEEQMGQLKYLEQQEGQLKHLDQQE
KQPELPEQQMGQLKHLEQQEGQPKHLEQQEGQLEQLEEQEGQLKHLEQQEGQLEHLEHQE
GQLGLPEQQVLQLKQLEKQQGQPKHLEEEEGQLKHLVQQEGQLKHLVQQEGQLEQQERQV
EHLEQQVGQLKHLEEQEGQLKHLEQQQGQLEVPEQQVGQPKNLEQEEKQLELPEQQEGQV
KHLEKQEAQLELPEQQVGQPKHLEQQEKHLEHPEQQDGQLKHLEQQEGQLKDLEQQKGQL
EQPVFAPAPGQVQDIQPALPTKGEVLLPVEHQQQKQEVQWPPKHK
NT seq
1758 nt
NT seq
+upstream
nt +downstream
nt
atgtcccagcaacacacactgccagtgaccctctcccctgccctcagtcaggagctcctc
aagactgttcctcctccagtcaatacccatcaggagcaaatgaaacagccaactccactg
cctcccccatgccagaaggtgcctgtcgagctcccagtggaggtcccatcaaagcaagag
gaaaagcacatgactgctgtaaagggactgcctgagcaagaatgtgagcaacagcagaag
gagccacaggagcaggagctgcagcaacagcactgggaacagcatgaggaatatcagaaa
gcagaaaacccagagcagcagcttaagcaggagaaaacacaaagggatcagcagctaaac
aaacagctggaagaagagaagaagctcttagaccagcaactggatcaagagctagtcaag
agagatgagcaactgggaatgaagaaagagcaactgttggagctcccagagcagcaggag
gggcacctgaagcacctagagcagcaggagggacagctgaagcacccggagcagcaggag
gggcagctggagctcccagagcagcaggaggggcagctggagctcccagagcagcaggag
gggcagctggagctcccagagcagcaggaggggcagctggagctcccagagcagcaggag
gggcagctggagctcccagagcagcaggaggggcagctggagctcccacagcagcaggag
gggcagctggagctctctgagcagcaggaggggcagctggagctctctgagcagcaggag
ggacagctgaagcacctggagcaccaggaggggcagctggaggtcccagaggagcagatg
gggcagctgaagtacctggaacagcaggaggggcagctgaagcacctggatcagcaggag
aagcagccagagctcccagagcagcagatggggcagctgaagcacctggagcagcaggag
gggcagcctaagcatctggagcagcaggaggggcaactggagcagctggaggagcaggag
gggcagctgaagcacctggagcagcaggaggggcagctggagcacctggagcaccaggaa
gggcagctggggctcccagagcagcaggtgctgcagctgaagcagctagagaagcagcag
gggcagccaaagcacctggaggaggaggaggggcagctgaagcacctggtgcagcaggag
gggcagctgaagcatctggtgcagcaggaggggcagctggagcagcaggagaggcaggtg
gagcacctggagcagcaggtggggcagctgaagcacctagaggagcaggagggacaactg
aagcatctggagcagcagcaggggcagttggaggtcccagagcagcaggtggggcagcca
aagaacctggagcaggaggagaagcaactggagctcccagagcagcaagagggccaggtg
aagcacctggagaagcaggaggcacagctggagctcccagagcagcaggtaggacagcca
aagcacctggaacagcaggaaaagcacctagagcacccagagcagcaggacggacaacta
aaacatctggagcagcaggaggggcagctgaaggacctggagcagcagaaggggcagctg
gagcagcctgtgtttgccccagctccaggccaggtccaagacattcaaccagccctgccc
acaaagggagaagtattgcttcctgtagagcaccagcagcagaagcaggaggtgcagtgg
ccacccaaacataaataa
Homo sapiens (human): 2125
Help
Entry
2125 CDS
T01001
Symbol
EVPL, EVPK
Name
(RefSeq) envoplakin
KO
K10383
envoplakin
Organism
hsa
Homo sapiens (human)
Pathway
hsa04382
Cornified envelope formation
Network
nt06545
Cornified envelope formation
Element
N01922
Localization of envoplakin and periplakin to the plasma membrane (PM)
N01923
Closslinking of envoplakin, periplakin and involucrin
Brite
KEGG Orthology (KO) [BR:
hsa00001
]
09150 Organismal Systems
09158 Development and regeneration
04382 Cornified envelope formation
2125 (EVPL)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04812 Cytoskeleton proteins [BR:
hsa04812
]
2125 (EVPL)
Cytoskeleton proteins [BR:
hsa04812
]
Eukaryotic cytoskeleton proteins
Intermediate filaments
Intermediate filament-binding proteins
2125 (EVPL)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
Plectin
Spectrin_1st_PEPL
SH3_10
Spectrin_4
SR_plectin_7
Spectrin_3
Motif
Other DBs
NCBI-GeneID:
2125
NCBI-ProteinID:
NP_001979
OMIM:
601590
HGNC:
3503
Ensembl:
ENSG00000167880
UniProt:
Q92817
Structure
PDB
PDBj
LinkDB
All DBs
Position
17:complement(76006845..76027306)
Genome browser
AA seq
2033 aa
AA seq
DB search
MFKGLSKGSQGKGSPKGSPAKGSPKGSPSRHSRAATQELALLISRMQANADQVERDILET
QKRLQQDRLNSEQSQALQHQQETGRSLKEAEVLLKDLFLDVDKARRLKHPQAEEIEKDIK
QLHERVTQECAEYRALYEKMVLPPDVGPRVDWARVLEQKQKQVCAGQYGPGMAELEQQIA
EHNILQKEIDAYGQQLRSLVGPDAATIRSQYRDLLKAASWRGQSLGSLYTHLQGCTRQLS
ALAEQQRRILQQDWSDLMADPAGVRREYEHFKQHELLSQEQSVNQLEDDGERMVELRHPA
VGPIQAHQEALKMEWQNFLNLCICQETQLQHVEDYRRFQEEADSVSQTLAKLNSNLDAKY
SPAPGGPPGAPTELLQQLEAEEKRLAVTERATGDLQRRSRDVAPLPQRRNPPQQPLHVDS
ICDWDSGEVQLLQGERYKLVDNTDPHAWVVQGPGGETKRAPAACFCIPAPDPDAVARASR
LASELQALKQKLATVQSRLKASAVESLRPSQQAPSGSDLANPQAQKLLTQMTRLDGDLGQ
IERQVLAWARAPLSRPTPLEDLEGRIHSHEGTAQRLQSLGTEKETAQKECEAFLSTRPVG
PAALQLPVALNSVKNKFSDVQVLCSLYGEKAKAALDLERQIQDADRVIRGFEATLVQEAP
IPAEPGALQERVSELQRQRRELLEQQTCVLRLHRALKASEHACAALQNNFQEFCQDLPRQ
QRQVRALTDRYHAVGDQLDLREKVVQDAALTYQQFKNCKDNLSSWLEHLPRSQVRPSDGP
SQIAYKLQAQKRLTQEIQSRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSA
PKRPRVAPLQESIQAQEKNLAKAYTEVAAAQQQLLQQLEFARKMLEKKELSEDIRRTHDA
KQGSESPAQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQRPLERLEEKEVV
EFYRDPQLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPHLLTKEVT
QVERDPGLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEVV
KVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVK
KVEQDPGLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQERVHEIF
QVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKPTVEYKEVTQEVVRH
ERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRH
EKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQK
DPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELR
QLTLRIQELEKRPPTVQEKIIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRECK
NLQVQIDVLQKAKSQEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRL
RERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLL
SQKTESERQKAAQRGQELSRLEAAILREKDQIYEKERTLRDLHAKVSREELSQETQTRET
NLSTKISILEPETGKDMSPYEAYKRGIIDRGQYLQLQELECDWEEVTTSGPCGEESVLLD
RKSGKQYSIEAALRCRRISKEEYHLYKDGHLPISEFALLVAGETKPSSSLSIGSIISKSP
LASPAPQSTSFFSPSFSLGLGDDSFPIAGIYDTTTDNKCSIKTAVAKNMLDPITGQKLLE
AQAATGGIVDLLSRERYSVHKAMERGLIENTSTQRLLNAQKAFTGIEDPVTKKRLSVGEA
VQKGWMPRESVLPHLQVQHLTGGLIDPKRTGRIPIQQALLSGMISEELAQLLQDESSYEK
DLTDPISKERLSYKEAMGRCRKDPLSGLLLLPAALEGYRCYRSASPTVPRSLR
NT seq
6102 nt
NT seq
+upstream
nt +downstream
nt
atgttcaaggggctgagcaaaggctcccaggggaaggggtcccccaagggctcccccgcc
aaggggtcccccaaaggctcccccagcaggcacagccgggctgccacccaggagctggcc
cttctcatctcccgcatgcaagccaacgccgaccaggtggagcgggacatcctggagacg
cagaagaggctgcagcaggaccggctgaacagtgagcagagccaggccctgcagcaccag
caggagacgggccgcagcctgaaggaggctgaggtgctgctcaaggacctcttcctggac
gtggacaaggcccggcggctcaagcacccgcaggctgaggagattgagaaggacatcaag
cagctgcacgagcgggtgacccaggagtgtgcggagtaccgtgccctgtacgagaagatg
gtgctgccccccgacgtgggacccagggtcgactgggcacgcgtgctggagcagaaacag
aagcaggtctgcgcaggccagtacgggccgggcatggcggagctggagcaacagatcgcc
gagcacaacatcctgcagaaggagatcgacgcctatgggcagcagctgcggagcctcgtg
gggccggatgcagccaccatccggagccaataccgagacctactgaaggcggcgtcgtgg
cgcgggcagagcctgggcagcctgtacacgcacctccagggctgcacgcggcagctgagc
gccctggctgagcagcagcgccgcatcctgcagcaggactggagcgacctcatggccgac
cctgcgggcgtgcggcgggagtacgagcacttcaagcagcacgagctgctgagccaggag
cagagcgtgaaccagctggaggacgacggcgagcgcatggtggagctgcggcaccccgcg
gtggggcccatccaggcccaccaggaggccctgaagatggagtggcagaacttcctgaac
ctgtgtatctgccaggagacccagctgcagcacgtggaggactaccgccggttccaggaa
gaggccgactcagtcagccagaccctggcgaagctcaactccaacttggatgccaagtac
agccctgcacctgggggcccccctggcgcccccacagagctgctgcaacagctggaggca
gaggaaaaacggctggccgtcaccgagagggccactggggacctgcagcggcgaagccgg
gatgtggcccctctgccacagcgaagaaacccccctcagcagcccctgcacgtggacagc
atctgcgactgggactcaggagaagtgcagctgctgcagggtgagcggtataagctggta
gataacactgacccgcacgcctgggtcgtgcagggccctggcggggagaccaagcgtgct
cccgccgcctgcttctgcatcccagcaccagaccctgatgctgtggccagggcctcccgg
ctggcctcagagctgcaggccctgaagcagaaattggccacagtccagagccgcctgaag
gccagtgctgtggagtctcttcggcccagccagcaggctccatctggctcagacctggcc
aacccacaggcccagaagctcctgacacagatgacccggctggatggagacctgggacag
atagagaggcaggtgctggcctgggcgcgggccccgctgagccgccccacacccttggag
gacttggagggccgcatccacagccatgagggcacagcccagcgcctgcagagcctggga
acggagaaggagacagcccagaaggagtgcgaggcgtttctgtccacgcggcccgtgggc
cccgctgccctgcagctgcccgtagccctcaacagcgtgaagaacaagttcagtgacgtg
caggttctgtgcagcctctacggggagaaagccaaggctgccctggatctggagcggcag
atccaggatgcggacagggtcatccgaggcttcgaggccaccctggtgcaggaggccccc
atccctgctgaaccgggggctctgcaggagagggtcagcgagctgcagcgccagcggagg
gagctgctggaacagcagacctgcgtgctgcggctacaccgcgcgctgaaggcctcggag
cacgcatgcgctgccctgcagaacaacttccaggagttctgccaagacctgcctcgccag
cagcgccaggtgcgagccctcaccgaccgctaccacgccgtaggggaccagctggacctg
cgggagaaggtggtgcaggatgccgccctcacctaccagcagttcaagaactgcaaggat
aacctgagctcctggctggagcacctgccccgcagccaggtgcggcccagcgacggcccc
agccagatcgcctacaagctgcaggcgcagaagaggctgacgcaggagatccagagccga
gagcgggacagggccacagcatcccacctctcccaggccctgcaggcagcgctccaggac
tatgagctccaggcagacacctaccgctgctctttggagcccaccctggcagtgtcagcc
cccaagagaccccgagtggctcccctgcaagagagcatccaagcccaggagaagaacctt
gcaaaggcctatactgaggttgcagcagcacagcagcagctgctccagcagctggagttt
gctagaaaaatgctggagaagaaggagctcagtgaggacatccgaaggacccatgatgca
aagcagggctccgagagccctgcccaagcagggagagagtcagaggccctgaaggcccag
ctggaagaggagaggaagcgggtggcccgggtgcagcatgagctggaggcgcagaggagc
caactgctgcagctgaggacccagcggcccttggagaggctggaggagaaggaagtggta
gagttctaccgggacccccagctggagggcagcctgtccagggtgaaggcccaggtggag
gaggagggcaagcggcgggctggcctgcaggcagacctggaagtggcagcccagaaggtc
gtgcagctggaaagcaagaggaagaccatgcagcctcatctgctgaccaaggaggtcacc
caggtggagagggaccccggcctggacagccaggcggcccagctcaggatccagatccag
cagctccgcggggaggatgccgtcatctcggcccggctggaagggctgaagaaggagcta
ctggcccttgagaagagggaggtggacgtgaaggagaaggtcgtggtgaaagaggtagtc
aaggtggagaagaatctggaaatggtcaaggcagcccaggctctgaggctgcagatggag
gaggatgctgcgcggaggaagcaggcggaggaggctgtggccaagctacaggctcgcatc
gaagacctggagcgggctatcagctcggtggagcccaaggtcatcgtgaaggaggtgaag
aaggtggagcaggacccagggctcctccaggagtcctccaggctgaggagcctcctcgag
gaggagaggaccaagaacgcgacgctggccagggagctgagcgacctgcacagcaagtac
agcgtggtggagaagcagaggcccaaagtgcagctccaggagcgcgtccacgagatcttc
caggtggatccggagacagagcaggagatcactcggctcaaggccaagctgcaggagatg
gcgggcaagaggagcggtgtggagaaggaggtggagaagctgctgcccgacctggaggtc
ctgcgggcccagaagcccacggtggagtacaaggaggtgacccaggaggtggtgaggcat
gagaggagccccgaggtgctgcgtgagatcgaccgcctgaaggctcagctcaacgagctc
gtcaacagccacgggcgctcccaggagcagctcatccgcctgcagggtgagcgcgacgag
tggaggcgcgagcgggccaaggtggagaccaagacggtgagcaaggaggtggtgcgccac
gagaaggacccggtgctggagaaagaagcagagcggctccgccaggaggtgcgggaggcg
gcccagaagaggcgggccgcggaggacgcggtgtacgagctgcagagcaagcgcctgctg
ctggagaggaggaagcccgaggagaaggtggtggtgcaggaggtggtggtcacccagaag
gacccgaagctgcgcgaggagcacagccggctgagcgggagcctggatgaggaggtgggc
cggcggcgccagctagagcttgaggtgcagcagctgcgggccggcgtggaggagcaggag
ggcctgctcagcttccaggaggaccgcagcaagaagctggccgtggagagggagctgcgg
cagctgaccttgaggatccaggagctcgagaagcggcctcccacggtgcaggagaagatc
atcatggaggaagtggtcaagctggagaaggacccggacctggagaagtccacggaagcc
ctgcggtgggacctggaccaggagaagacccaggtaaccgagctgaatcgggagtgcaag
aacctgcaggtccagattgacgtcctccagaaagccaaatcgcaggagaagaccatctac
aaggaagtgatccgggtgcagaaggaccgcgtcctggaagatgagcgggcccgcgtgtgg
gagatgctcaacagggagcgcacggcccggcaggcccgggaggaggaggcacggcgcctg
cgggagcgcattgaccgggccgagacgctggggagaacctggtcccgggaggagtccgag
ctgcagagggcccgggaccaggccgaccaggagtgtgggcggctgcagcaggagctgcgg
gctctggagaggcagaagcagcagcagacactgcagctgcaggaggagtcgaagctgctc
agccagaagacggagagcgagcgacagaaggcggcccagcggggccaggagctctcgcgg
ctggaggcggccatcctccgcgagaaggaccagatctacgagaaggagcggacgctccgg
gacctccacgccaaggtgagccgggaggagctcagccaggagacccagacgcgagagacc
aacctttccaccaagatctccatcctggaacccgagacggggaaggacatgtccccatac
gaggcctacaagaggggcatcatcgacaggggccagtacttgcagctgcaggagctcgag
tgtgactgggaggaggtcaccacctcggggccctgtggggaggagtctgtgctcctggac
cgcaagagcgggaagcagtactccatcgaggccgccctccgctgccggcgcatctctaag
gaggagtaccatctgtacaaggacggccacctgcccatctccgagtttgcgctgcttgta
gctggggagaccaagccaagctcctcactctccatcggctctatcatctccaagtccccg
ctcgcctccccggccccccagagcaccagtttcttctctcccagcttctctctcgggctc
ggtgatgacagcttccctatcgccgggatctatgacacaaccacagacaacaagtgcagc
atcaagacggccgtggccaagaacatgctggaccccatcactgggcagaagctactggag
gcccaggcggccacagggggcatcgtggacctgctcagccgtgagcgctactctgtgcac
aaggcgatggagaggggcctgatcgagaacacctccacacagaggctgcttaacgcccag
aaggccttcaccggcatcgaggaccccgtcaccaagaagaggctctcggtgggcgaggcc
gtccagaagggctggatgccccgggagagcgtgctcccacacctgcaggtgcagcacctg
accggggggctcatcgaccccaagaggacaggccgcatccccatccagcaggccctcctc
tccgggatgatcagtgaagagctggcccagctcctgcaggacgagtccagctacgagaag
gatttgacagaccccatctccaaggaacggctgagctacaaggaggccatgggccgctgc
cgcaaagaccccctgagcggcctgctgctcctgccagcggcactggaggggtaccgctgc
taccgctccgcctcccccaccgtcccgcgctcccttcgctga
Homo sapiens (human): 5493
Help
Entry
5493 CDS
T01001
Symbol
PPL
Name
(RefSeq) periplakin
KO
K10386
periplakin
Organism
hsa
Homo sapiens (human)
Pathway
hsa04382
Cornified envelope formation
Network
nt06545
Cornified envelope formation
Element
N01922
Localization of envoplakin and periplakin to the plasma membrane (PM)
N01923
Closslinking of envoplakin, periplakin and involucrin
Brite
KEGG Orthology (KO) [BR:
hsa00001
]
09150 Organismal Systems
09158 Development and regeneration
04382 Cornified envelope formation
5493 (PPL)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04812 Cytoskeleton proteins [BR:
hsa04812
]
5493 (PPL)
Cytoskeleton proteins [BR:
hsa04812
]
Eukaryotic cytoskeleton proteins
Intermediate filaments
Intermediate filament-binding proteins
5493 (PPL)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
Spectrin_1st_PEPL
SH3_10
Plectin
Spectrin
ATG16
Motif
Other DBs
NCBI-GeneID:
5493
NCBI-ProteinID:
NP_002696
OMIM:
602871
HGNC:
9273
Ensembl:
ENSG00000118898
UniProt:
O60437
Structure
PDB
PDBj
LinkDB
All DBs
Position
16:complement(4882507..4937148)
Genome browser
AA seq
1756 aa
AA seq
DB search
MNSLFRKRNKGKYSPTVQTRSISNKELSELIEQLQKNADQVEKNIVDTEAKMQSDLARLQ
EGRQPEHRDVTLQKVLDSEKLLYVLEADAAIAKHMKHPQGDMIAEDIRQLKERVTNLRGK
HKQIYRLAVKEVDPQVNWAALVEEKLDKLNNQSFGTDLPLVDHQVEEHNIFHNEVKAIGP
HLAKDGDKEQNSELRAKYQKLLAASQARQQHLSSLQDYMQRCTNELYWLDQQAKGRMQYD
WSDRNLDYPSRRRQYENFINRNLEAKEERINKLHSEGDQLLAAEHPGRNSIEAHMEAVHA
DWKEYLNLLICEESHLKYMEDYHQFHEDVKDAQELLRKVDSDLNQKYGPDFKDRYQIELL
LRELDDQEKVLDKYEDVVQGLQKRGQQVVPLKYRRETPLKPIPVEALCDFEGEQGLISRG
YSYTLQKNNGESWELMDSAGNKLIAPAVCFVIPPTDPEALALADSLGSQYRSVRQKAAGS
KRTLQQRYEVLKTENPGDASDLQGRQLLAGLDKVASDLDRQEKAITGILRPPLEQGRAVQ
DSAERAKDLKNITNELLRIEPEKTRSTAEGEAFIQALPGSGTTPLLRTRVEDTNRKYEHL
LQLLDLAQEKVDVANRLEKSLQQSWELLATHENHLNQDDTVPESSRVLDSKGQELAAMAC
ELQAQKSLLGEVEQNLQAAKQCSSTLASRFQEHCPDLERQEAEVHKLGQRFNNLRQQVER
RAQSLQSAKAAYEHFHRGHDHVLQFLVSIPSYEPQETDSLSQMETKLKNQKNLLDEIASR
EQEVQKICANSQQYQQAVKDYELEAEKLRSLLDLENGRRSHVSKRARLQSPATKVKEEEA
ALAAKFTEVYAINRQRLQNLEFALNLLRQQPEVEVTHETLQRNRPDSGVEEAWKIRKELD
EETERRRQLENEVKSTQEEIWTLRNQGPQESVVRKEVLKKVPDPVLEESFQQLQRTLAEE
QHKNQLLQEELEALQLQLRALEQETRDGGQEYVVKEVLRIEPDRAQADEVLQLREELEAL
RRQKGAREAEVLLLQQRVAALAEEKSRAQEKVTEKEVVKLQNDPQLEAEYQQLQEDHQRQ
DQLREKQEEELSFLQDKLKRLEKERAMAEGKITVKEVLKVEKDAATEREVSDLTRQYEDE
AAKARASQREKTELLRKIWALEEENAKVVVQEKVREIVRPDPKAESEVANLRLELVEQER
KYRGAEEQLRSYQSELEALRRRGPQVEVKEVTKEVIKYKTDPEMEKELQRLREEIVDKTR
LIERCDLEIYQLKKEIQALKDTKPQVQTKEVVQEILQFQEDPQTKEEVASLRAKLSEEQK
KQVDLERERASQEEQIARKEEELSRVKERVVQQEVVRYEEEPGLRAEASAFAESIDVELR
QIDKLRAELRRLQRRRTELERQLEELERERQARREAEREVQRLQQRLAALEQEEAEAREK
VTHTQKVVLQQDPQQAREHALLRLQLEEEQHRRQLLEGELETLRRKLAALEKAEVKEKVV
LSESVQVEKGDTEQEIQRLKSSLEEESRSKRELDVEVSRLEARLSELEFHNSKSSKELDF
LREENHKLQLERQNLQLETRRLQSEINMAATETRDLRNMTVADSGTNHDSRLWSLERELD
DLKRLSKDKDLEIDELQKRLGSVAVKREQRENHLRRSIVVIHPDTGRELSPEEAHRAGLI
DWNMFVKLRSQECDWEEISVKGPNGESSVIHDRKSGKKFSIEEALQSGRLTPAQYDRYVN
KDMSIQELAVLVSGQK
NT seq
5271 nt
NT seq
+upstream
nt +downstream
nt
atgaactcgctcttcaggaagagaaacaaaggcaaatacagccccactgtgcagacccgg
agcatctctaacaaggagctctcggagctgatcgagcagctgcagaagaatgccgaccag
gtggagaagaacatcgtggacacagaggccaagatgcagagtgacctggctcggctgcag
gagggtcggcagcctgagcaccgggacgtgaccctgcagaaggtgttggactctgagaag
ctgctctatgtgctagaggcggatgcggccattgccaagcacatgaagcacccacagggg
gacatgatcgccgaggatatccgccagctgaaggagcgtgtgaccaacctgcgcgggaaa
cacaagcagatctacaggctggcggtgaaggaagtggatccacaggtcaactgggcggca
ctggtggaggagaagctggacaagctgaacaaccagagctttgggactgacctgccgctg
gtggaccaccaagtggaggagcataacatcttccacaatgaggtcaaggccatcgggccc
cacctggccaaggacggggacaaggagcagaacagcgaactccgggccaagtaccagaaa
ctgctggcagcatcacaggcccggcagcagcacctgagttcgctgcaggactacatgcag
cgctgcaccaatgagctgtactggctggaccagcaggccaagggccgcatgcagtacgac
tggagtgaccgcaacctcgactaccccagccgccggcgccagtatgagaatttcatcaac
cggaacctggaggccaaagaggagagaatcaacaaactgcacagcgagggcgaccagctg
ctggcggccgagcaccccgggaggaactccattgaggcgcacatggaggctgtgcacgca
gactggaaggagtacctgaacctgctcatctgcgaggagagccacctcaagtacatggag
gactaccaccagtttcacgaagacgtgaaggacgctcaggagctgctgcgcaaggtggac
tcggacctgaaccagaagtatggccctgacttcaaggaccggtaccagattgagctgctg
ctgcgggagctggatgaccaggagaaggtgctggacaagtatgaggacgtggtgcagggg
ctgcagaagcgaggccagcaggtggtgcccctcaagtaccgccgggagactccgctcaag
cccatccccgtggaggcactctgtgactttgagggggagcagggcctgatctcgcggggc
tacagctacaccctgcagaagaacaacggggagagctgggagctcatggacagcgctggg
aacaagctgattgctccggccgtgtgttttgtgatcccccccacagaccctgaggccctg
gctctggctgacagcctgggcagccagtaccggagcgtgcggcagaaggcagctgggagc
aaacgcacgctgcagcagcggtatgaggtgctgaagaccgagaatcccggagatgcctct
gacctacaggggcggcagctgctggctggcttggacaaggtggccagcgacctggaccgg
caggagaaggccatcacagggatcctgcggccaccactggagcaaggccgggctgtgcag
gacagtgccgagcgggccaaggacctcaagaacatcaccaacgagctactgcggattgaa
cctgagaagacgcggagcacggctgagggcgaagccttcatccaggccctcccaggcagt
ggcaccacacccctgctgaggacccgggtggaggacaccaaccggaaatacgagcacctc
ctgcagctgctggacttggcccaggagaaggttgatgtggccaaccgcctggagaagagc
ctgcagcagagctgggagttgctggccacacacgagaaccatctgaatcaggatgacaca
gtgcctgagagcagccgtgtcctggacagcaaggggcaggagctggcggccatggcctgt
gagttacaggcccagaagtccctcctgggtgaggtggagcagaacttgcaggcggccaag
cagtgctcgagcacactggccagccgcttccaggagcactgtccggacctggagcgccag
gaggccgaggtgcacaagctgggccagcgtttcaacaacctgcgccagcaggtggaacgc
agggcgcagagcctacagagcgccaaggcagcctacgagcacttccaccgcggccatgac
cacgtgctgcagttcctagtcagcatccccagttacgagccccaggagacagacagcctc
agccagatggagaccaagctgaagaaccagaagaacctgctagatgagatagcaagtagg
gagcaggaagtacagaagatctgtgccaattcccagcagtaccagcaagctgtaaaggac
tatgagttagaagcagaaaaactaaggtctcttctcgacttggagaatggaaggagaagc
cacgtgagcaagagagccaggctccaatctcctgccaccaaagtgaaggaagaggaagca
gcacttgccgccaagttcactgaagtttatgccatcaacagacagaggctgcagaatctg
gagtttgctctgaatctcctcagacagcagccggaagtagaagtgacccatgagaccctg
caaaggaataggccggactctggagtggaggaggcgtggaagatcaggaaggaactggat
gaggagactgagcggaggcggcagctggagaacgaggtcaagagcacccaggaagaaatc
tggaccttgaggaatcaggggcctcaggaatcggtggtgaggaaggaggtgctcaagaag
gtgccggatcccgtgctggaggagagcttccagcagctgcagcggacgctggcagaggag
cagcacaagaaccagctgctgcaggaggagctggaggcactgcagctgcagctgcgtgcc
ctggagcaggagaccagagacggggggcaggagtacgtggtcaaggaggtcctgcgcatc
gagcctgacagggcccaggcggatgaggtcttgcagctgcgggaggagctggaggcactg
aggcggcagaagggcgcccgggaggcagaggtgctcctcctgcagcagcgtgtggccgcc
ctggctgaagagaagagccgggcgcaggagaaggtcacagagaaagaggtggtgaaactg
cagaatgacccccagctggaggcagagtaccagcagctgcaggaggaccaccagcgccag
gaccagctcagggagaagcaggaggaggagctgagcttcctccaggacaagctcaagagg
ctagagaaggagcgggccatggccgagggcaagatcaccgtcaaggaggtgctcaaggtg
gagaaggacgcggccaccgagagggaggtcagcgatctcacccgccaatatgaggacgag
gctgccaaggctcgcgctagccagagggagaagacggagctgctccgaaagatatgggcc
ttggaggaggagaacgccaaagtggtggtgcaggagaaggtgcgggagatcgtgcggcca
gaccccaaggcggaaagtgaagtggcgaacctccgcctggagcttgtggagcaggagcga
aagtaccggggtgccgaggagcagctccggagctaccagagtgagctggaggccctcagg
aggcgaggcccccaggtggaagtcaaagaggtgactaaggaagtcattaagtacaagact
gaccctgagatggagaaggagcttcagcggctcagggaggagatcgtggacaagaccaga
ctgatcgaaaggtgtgatttagagatctaccagctgaaaaaggaaatccaggccctgaaa
gacaccaaaccccaggtccagaccaaagaggtggtccaggagatcctccaattccaagaa
gaccctcaaaccaaggaggaggtggcgtctctgagggcaaagctctcagaggagcagaag
aaacaagtggatctggagagggaaagagcttcccaggaagagcagatcgcccggaaagag
gaggagctctcgcgggtgaaggaaagggtggtgcagcaggaggtggtcaggtatgaggag
gagccaggcctgcgggccgaggcgagcgcctttgccgagagcatcgatgtggagctgcgg
cagattgacaagctgcgggcagagctgcggcggctgcagcgccggcgcaccgagcttgag
cggcagctggaggagctagagcgcgagcggcaggcccgcagggaggccgagcgcgaggta
cagcggttgcagcagcggctggcagcgctggagcaggaagaagctgaggcccgtgagaag
gtaacccatacgcagaaggtggtgctgcagcaggacccgcagcaggcgcgagagcatgcc
ctgctccgactccagctggaagaagagcagcaccggcggcagctcctggagggggagctc
gagaccctccggaggaaactggctgcactggagaaggcggaggtcaaggagaaggtggtg
ctctccgagagtgtccaggtggagaagggcgacaccgagcaagagatccagaggctcaag
agcagcctggaggaggagagccgcagcaagcgcgagctggacgtcgaggtgagccggctg
gaagccaggctttcggagctggaattccataactccaagtcatccaaggaactagacttt
ctgagggaagagaaccacaaattacagctggagaggcaaaacctgcagctggagacccga
aggctccaatcggaaatcaacatggcagcgacggaaacacgagacctgcggaacatgacc
gtggcggactctgggaccaaccatgactccagactgtggtccctggagagggaactggat
gacctcaagaggctctccaaggacaaagacctcgagatcgacgagctgcagaagcgcctg
ggctccgtggccgtcaagcgggagcagcgggagaaccacctgcggcgctccatcgtagtc
atccaccctgacacaggccgcgagctgtccccggaggaagcccaccgtgccgggctcatt
gactggaacatgttcgtgaaactcagaagccaggagtgcgactgggaggagatctcagtg
aagggtcccaatggggagtcctcagtgatacacgacaggaagtctggcaagaagttctcc
atcgaagaggccctgcagagtggcaggctgacccctgctcagtatgaccgctatgtcaac
aaggatatgtccatccaggagctggcggtcttggtatctgggcagaagtag
DBGET
integrated database retrieval system