KEGG   Homo sapiens (human): 3713
Entry
3713              CDS       T01001                                 
Symbol
IVL
Name
(RefSeq) involucrin
  KO
K28330  involucrin
Organism
hsa  Homo sapiens (human)
Pathway
hsa04382  Cornified envelope formation
Network
nt06545  Cornified envelope formation
  Element
N01923  Crosslinking of envoplakin, periplakin and involucrin
N01924  Loricrin oligomerization with small proline-rich proteins and further stabilization
N01929  Peroxidation and deglycosylation of acylglucosylceramide
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09158 Development and regeneration
   04382 Cornified envelope formation
    3713 (IVL)
Motif
Pfam: Involucrin Involucrin_N
Other DBs
NCBI-GeneID: 3713
NCBI-ProteinID: NP_005538
OMIM: 147360
HGNC: 6187
Ensembl: ENSG00000163207
UniProt: P07476
Position
1:152908546..152911886
AA seq 585 aa
MSQQHTLPVTLSPALSQELLKTVPPPVNTHQEQMKQPTPLPPPCQKVPVELPVEVPSKQE
EKHMTAVKGLPEQECEQQQKEPQEQELQQQHWEQHEEYQKAENPEQQLKQEKTQRDQQLN
KQLEEEKKLLDQQLDQELVKRDEQLGMKKEQLLELPEQQEGHLKHLEQQEGQLKHPEQQE
GQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPEQQEGQLELPQQQE
GQLELSEQQEGQLELSEQQEGQLKHLEHQEGQLEVPEEQMGQLKYLEQQEGQLKHLDQQE
KQPELPEQQMGQLKHLEQQEGQPKHLEQQEGQLEQLEEQEGQLKHLEQQEGQLEHLEHQE
GQLGLPEQQVLQLKQLEKQQGQPKHLEEEEGQLKHLVQQEGQLKHLVQQEGQLEQQERQV
EHLEQQVGQLKHLEEQEGQLKHLEQQQGQLEVPEQQVGQPKNLEQEEKQLELPEQQEGQV
KHLEKQEAQLELPEQQVGQPKHLEQQEKHLEHPEQQDGQLKHLEQQEGQLKDLEQQKGQL
EQPVFAPAPGQVQDIQPALPTKGEVLLPVEHQQQKQEVQWPPKHK
NT seq 1758 nt
atgtcccagcaacacacactgccagtgaccctctcccctgccctcagtcaggagctcctc
aagactgttcctcctccagtcaatacccatcaggagcaaatgaaacagccaactccactg
cctcccccatgccagaaggtgcctgtcgagctcccagtggaggtcccatcaaagcaagag
gaaaagcacatgactgctgtaaagggactgcctgagcaagaatgtgagcaacagcagaag
gagccacaggagcaggagctgcagcaacagcactgggaacagcatgaggaatatcagaaa
gcagaaaacccagagcagcagcttaagcaggagaaaacacaaagggatcagcagctaaac
aaacagctggaagaagagaagaagctcttagaccagcaactggatcaagagctagtcaag
agagatgagcaactgggaatgaagaaagagcaactgttggagctcccagagcagcaggag
gggcacctgaagcacctagagcagcaggagggacagctgaagcacccggagcagcaggag
gggcagctggagctcccagagcagcaggaggggcagctggagctcccagagcagcaggag
gggcagctggagctcccagagcagcaggaggggcagctggagctcccagagcagcaggag
gggcagctggagctcccagagcagcaggaggggcagctggagctcccacagcagcaggag
gggcagctggagctctctgagcagcaggaggggcagctggagctctctgagcagcaggag
ggacagctgaagcacctggagcaccaggaggggcagctggaggtcccagaggagcagatg
gggcagctgaagtacctggaacagcaggaggggcagctgaagcacctggatcagcaggag
aagcagccagagctcccagagcagcagatggggcagctgaagcacctggagcagcaggag
gggcagcctaagcatctggagcagcaggaggggcaactggagcagctggaggagcaggag
gggcagctgaagcacctggagcagcaggaggggcagctggagcacctggagcaccaggaa
gggcagctggggctcccagagcagcaggtgctgcagctgaagcagctagagaagcagcag
gggcagccaaagcacctggaggaggaggaggggcagctgaagcacctggtgcagcaggag
gggcagctgaagcatctggtgcagcaggaggggcagctggagcagcaggagaggcaggtg
gagcacctggagcagcaggtggggcagctgaagcacctagaggagcaggagggacaactg
aagcatctggagcagcagcaggggcagttggaggtcccagagcagcaggtggggcagcca
aagaacctggagcaggaggagaagcaactggagctcccagagcagcaagagggccaggtg
aagcacctggagaagcaggaggcacagctggagctcccagagcagcaggtaggacagcca
aagcacctggaacagcaggaaaagcacctagagcacccagagcagcaggacggacaacta
aaacatctggagcagcaggaggggcagctgaaggacctggagcagcagaaggggcagctg
gagcagcctgtgtttgccccagctccaggccaggtccaagacattcaaccagccctgccc
acaaagggagaagtattgcttcctgtagagcaccagcagcagaagcaggaggtgcagtgg
ccacccaaacataaataa

KEGG   Homo sapiens (human): 2125
Entry
2125              CDS       T01001                                 
Symbol
EVPL, EVPK
Name
(RefSeq) envoplakin
  KO
K10383  envoplakin
Organism
hsa  Homo sapiens (human)
Pathway
hsa04382  Cornified envelope formation
Network
nt06545  Cornified envelope formation
  Element
N01922  Localization of envoplakin and periplakin to the plasma membrane (PM)
N01923  Crosslinking of envoplakin, periplakin and involucrin
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09158 Development and regeneration
   04382 Cornified envelope formation
    2125 (EVPL)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:hsa04812]
    2125 (EVPL)
Cytoskeleton proteins [BR:hsa04812]
 Eukaryotic cytoskeleton proteins
  Intermediate filaments
   Intermediate filament-binding proteins
    2125 (EVPL)
Motif
Pfam: Plectin Spectrin_1st_PEPL SH3_10 Spectrin_4 SR_plectin_7 Spectrin_3
Other DBs
NCBI-GeneID: 2125
NCBI-ProteinID: NP_001979
OMIM: 601590
HGNC: 3503
Ensembl: ENSG00000167880
UniProt: Q92817
Position
17:complement(76006845..76027306)
AA seq 2033 aa
MFKGLSKGSQGKGSPKGSPAKGSPKGSPSRHSRAATQELALLISRMQANADQVERDILET
QKRLQQDRLNSEQSQALQHQQETGRSLKEAEVLLKDLFLDVDKARRLKHPQAEEIEKDIK
QLHERVTQECAEYRALYEKMVLPPDVGPRVDWARVLEQKQKQVCAGQYGPGMAELEQQIA
EHNILQKEIDAYGQQLRSLVGPDAATIRSQYRDLLKAASWRGQSLGSLYTHLQGCTRQLS
ALAEQQRRILQQDWSDLMADPAGVRREYEHFKQHELLSQEQSVNQLEDDGERMVELRHPA
VGPIQAHQEALKMEWQNFLNLCICQETQLQHVEDYRRFQEEADSVSQTLAKLNSNLDAKY
SPAPGGPPGAPTELLQQLEAEEKRLAVTERATGDLQRRSRDVAPLPQRRNPPQQPLHVDS
ICDWDSGEVQLLQGERYKLVDNTDPHAWVVQGPGGETKRAPAACFCIPAPDPDAVARASR
LASELQALKQKLATVQSRLKASAVESLRPSQQAPSGSDLANPQAQKLLTQMTRLDGDLGQ
IERQVLAWARAPLSRPTPLEDLEGRIHSHEGTAQRLQSLGTEKETAQKECEAFLSTRPVG
PAALQLPVALNSVKNKFSDVQVLCSLYGEKAKAALDLERQIQDADRVIRGFEATLVQEAP
IPAEPGALQERVSELQRQRRELLEQQTCVLRLHRALKASEHACAALQNNFQEFCQDLPRQ
QRQVRALTDRYHAVGDQLDLREKVVQDAALTYQQFKNCKDNLSSWLEHLPRSQVRPSDGP
SQIAYKLQAQKRLTQEIQSRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSA
PKRPRVAPLQESIQAQEKNLAKAYTEVAAAQQQLLQQLEFARKMLEKKELSEDIRRTHDA
KQGSESPAQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQRPLERLEEKEVV
EFYRDPQLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPHLLTKEVT
QVERDPGLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEVV
KVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVK
KVEQDPGLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQERVHEIF
QVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKPTVEYKEVTQEVVRH
ERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRH
EKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQK
DPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELR
QLTLRIQELEKRPPTVQEKIIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRECK
NLQVQIDVLQKAKSQEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRL
RERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLL
SQKTESERQKAAQRGQELSRLEAAILREKDQIYEKERTLRDLHAKVSREELSQETQTRET
NLSTKISILEPETGKDMSPYEAYKRGIIDRGQYLQLQELECDWEEVTTSGPCGEESVLLD
RKSGKQYSIEAALRCRRISKEEYHLYKDGHLPISEFALLVAGETKPSSSLSIGSIISKSP
LASPAPQSTSFFSPSFSLGLGDDSFPIAGIYDTTTDNKCSIKTAVAKNMLDPITGQKLLE
AQAATGGIVDLLSRERYSVHKAMERGLIENTSTQRLLNAQKAFTGIEDPVTKKRLSVGEA
VQKGWMPRESVLPHLQVQHLTGGLIDPKRTGRIPIQQALLSGMISEELAQLLQDESSYEK
DLTDPISKERLSYKEAMGRCRKDPLSGLLLLPAALEGYRCYRSASPTVPRSLR
NT seq 6102 nt
atgttcaaggggctgagcaaaggctcccaggggaaggggtcccccaagggctcccccgcc
aaggggtcccccaaaggctcccccagcaggcacagccgggctgccacccaggagctggcc
cttctcatctcccgcatgcaagccaacgccgaccaggtggagcgggacatcctggagacg
cagaagaggctgcagcaggaccggctgaacagtgagcagagccaggccctgcagcaccag
caggagacgggccgcagcctgaaggaggctgaggtgctgctcaaggacctcttcctggac
gtggacaaggcccggcggctcaagcacccgcaggctgaggagattgagaaggacatcaag
cagctgcacgagcgggtgacccaggagtgtgcggagtaccgtgccctgtacgagaagatg
gtgctgccccccgacgtgggacccagggtcgactgggcacgcgtgctggagcagaaacag
aagcaggtctgcgcaggccagtacgggccgggcatggcggagctggagcaacagatcgcc
gagcacaacatcctgcagaaggagatcgacgcctatgggcagcagctgcggagcctcgtg
gggccggatgcagccaccatccggagccaataccgagacctactgaaggcggcgtcgtgg
cgcgggcagagcctgggcagcctgtacacgcacctccagggctgcacgcggcagctgagc
gccctggctgagcagcagcgccgcatcctgcagcaggactggagcgacctcatggccgac
cctgcgggcgtgcggcgggagtacgagcacttcaagcagcacgagctgctgagccaggag
cagagcgtgaaccagctggaggacgacggcgagcgcatggtggagctgcggcaccccgcg
gtggggcccatccaggcccaccaggaggccctgaagatggagtggcagaacttcctgaac
ctgtgtatctgccaggagacccagctgcagcacgtggaggactaccgccggttccaggaa
gaggccgactcagtcagccagaccctggcgaagctcaactccaacttggatgccaagtac
agccctgcacctgggggcccccctggcgcccccacagagctgctgcaacagctggaggca
gaggaaaaacggctggccgtcaccgagagggccactggggacctgcagcggcgaagccgg
gatgtggcccctctgccacagcgaagaaacccccctcagcagcccctgcacgtggacagc
atctgcgactgggactcaggagaagtgcagctgctgcagggtgagcggtataagctggta
gataacactgacccgcacgcctgggtcgtgcagggccctggcggggagaccaagcgtgct
cccgccgcctgcttctgcatcccagcaccagaccctgatgctgtggccagggcctcccgg
ctggcctcagagctgcaggccctgaagcagaaattggccacagtccagagccgcctgaag
gccagtgctgtggagtctcttcggcccagccagcaggctccatctggctcagacctggcc
aacccacaggcccagaagctcctgacacagatgacccggctggatggagacctgggacag
atagagaggcaggtgctggcctgggcgcgggccccgctgagccgccccacacccttggag
gacttggagggccgcatccacagccatgagggcacagcccagcgcctgcagagcctggga
acggagaaggagacagcccagaaggagtgcgaggcgtttctgtccacgcggcccgtgggc
cccgctgccctgcagctgcccgtagccctcaacagcgtgaagaacaagttcagtgacgtg
caggttctgtgcagcctctacggggagaaagccaaggctgccctggatctggagcggcag
atccaggatgcggacagggtcatccgaggcttcgaggccaccctggtgcaggaggccccc
atccctgctgaaccgggggctctgcaggagagggtcagcgagctgcagcgccagcggagg
gagctgctggaacagcagacctgcgtgctgcggctacaccgcgcgctgaaggcctcggag
cacgcatgcgctgccctgcagaacaacttccaggagttctgccaagacctgcctcgccag
cagcgccaggtgcgagccctcaccgaccgctaccacgccgtaggggaccagctggacctg
cgggagaaggtggtgcaggatgccgccctcacctaccagcagttcaagaactgcaaggat
aacctgagctcctggctggagcacctgccccgcagccaggtgcggcccagcgacggcccc
agccagatcgcctacaagctgcaggcgcagaagaggctgacgcaggagatccagagccga
gagcgggacagggccacagcatcccacctctcccaggccctgcaggcagcgctccaggac
tatgagctccaggcagacacctaccgctgctctttggagcccaccctggcagtgtcagcc
cccaagagaccccgagtggctcccctgcaagagagcatccaagcccaggagaagaacctt
gcaaaggcctatactgaggttgcagcagcacagcagcagctgctccagcagctggagttt
gctagaaaaatgctggagaagaaggagctcagtgaggacatccgaaggacccatgatgca
aagcagggctccgagagccctgcccaagcagggagagagtcagaggccctgaaggcccag
ctggaagaggagaggaagcgggtggcccgggtgcagcatgagctggaggcgcagaggagc
caactgctgcagctgaggacccagcggcccttggagaggctggaggagaaggaagtggta
gagttctaccgggacccccagctggagggcagcctgtccagggtgaaggcccaggtggag
gaggagggcaagcggcgggctggcctgcaggcagacctggaagtggcagcccagaaggtc
gtgcagctggaaagcaagaggaagaccatgcagcctcatctgctgaccaaggaggtcacc
caggtggagagggaccccggcctggacagccaggcggcccagctcaggatccagatccag
cagctccgcggggaggatgccgtcatctcggcccggctggaagggctgaagaaggagcta
ctggcccttgagaagagggaggtggacgtgaaggagaaggtcgtggtgaaagaggtagtc
aaggtggagaagaatctggaaatggtcaaggcagcccaggctctgaggctgcagatggag
gaggatgctgcgcggaggaagcaggcggaggaggctgtggccaagctacaggctcgcatc
gaagacctggagcgggctatcagctcggtggagcccaaggtcatcgtgaaggaggtgaag
aaggtggagcaggacccagggctcctccaggagtcctccaggctgaggagcctcctcgag
gaggagaggaccaagaacgcgacgctggccagggagctgagcgacctgcacagcaagtac
agcgtggtggagaagcagaggcccaaagtgcagctccaggagcgcgtccacgagatcttc
caggtggatccggagacagagcaggagatcactcggctcaaggccaagctgcaggagatg
gcgggcaagaggagcggtgtggagaaggaggtggagaagctgctgcccgacctggaggtc
ctgcgggcccagaagcccacggtggagtacaaggaggtgacccaggaggtggtgaggcat
gagaggagccccgaggtgctgcgtgagatcgaccgcctgaaggctcagctcaacgagctc
gtcaacagccacgggcgctcccaggagcagctcatccgcctgcagggtgagcgcgacgag
tggaggcgcgagcgggccaaggtggagaccaagacggtgagcaaggaggtggtgcgccac
gagaaggacccggtgctggagaaagaagcagagcggctccgccaggaggtgcgggaggcg
gcccagaagaggcgggccgcggaggacgcggtgtacgagctgcagagcaagcgcctgctg
ctggagaggaggaagcccgaggagaaggtggtggtgcaggaggtggtggtcacccagaag
gacccgaagctgcgcgaggagcacagccggctgagcgggagcctggatgaggaggtgggc
cggcggcgccagctagagcttgaggtgcagcagctgcgggccggcgtggaggagcaggag
ggcctgctcagcttccaggaggaccgcagcaagaagctggccgtggagagggagctgcgg
cagctgaccttgaggatccaggagctcgagaagcggcctcccacggtgcaggagaagatc
atcatggaggaagtggtcaagctggagaaggacccggacctggagaagtccacggaagcc
ctgcggtgggacctggaccaggagaagacccaggtaaccgagctgaatcgggagtgcaag
aacctgcaggtccagattgacgtcctccagaaagccaaatcgcaggagaagaccatctac
aaggaagtgatccgggtgcagaaggaccgcgtcctggaagatgagcgggcccgcgtgtgg
gagatgctcaacagggagcgcacggcccggcaggcccgggaggaggaggcacggcgcctg
cgggagcgcattgaccgggccgagacgctggggagaacctggtcccgggaggagtccgag
ctgcagagggcccgggaccaggccgaccaggagtgtgggcggctgcagcaggagctgcgg
gctctggagaggcagaagcagcagcagacactgcagctgcaggaggagtcgaagctgctc
agccagaagacggagagcgagcgacagaaggcggcccagcggggccaggagctctcgcgg
ctggaggcggccatcctccgcgagaaggaccagatctacgagaaggagcggacgctccgg
gacctccacgccaaggtgagccgggaggagctcagccaggagacccagacgcgagagacc
aacctttccaccaagatctccatcctggaacccgagacggggaaggacatgtccccatac
gaggcctacaagaggggcatcatcgacaggggccagtacttgcagctgcaggagctcgag
tgtgactgggaggaggtcaccacctcggggccctgtggggaggagtctgtgctcctggac
cgcaagagcgggaagcagtactccatcgaggccgccctccgctgccggcgcatctctaag
gaggagtaccatctgtacaaggacggccacctgcccatctccgagtttgcgctgcttgta
gctggggagaccaagccaagctcctcactctccatcggctctatcatctccaagtccccg
ctcgcctccccggccccccagagcaccagtttcttctctcccagcttctctctcgggctc
ggtgatgacagcttccctatcgccgggatctatgacacaaccacagacaacaagtgcagc
atcaagacggccgtggccaagaacatgctggaccccatcactgggcagaagctactggag
gcccaggcggccacagggggcatcgtggacctgctcagccgtgagcgctactctgtgcac
aaggcgatggagaggggcctgatcgagaacacctccacacagaggctgcttaacgcccag
aaggccttcaccggcatcgaggaccccgtcaccaagaagaggctctcggtgggcgaggcc
gtccagaagggctggatgccccgggagagcgtgctcccacacctgcaggtgcagcacctg
accggggggctcatcgaccccaagaggacaggccgcatccccatccagcaggccctcctc
tccgggatgatcagtgaagagctggcccagctcctgcaggacgagtccagctacgagaag
gatttgacagaccccatctccaaggaacggctgagctacaaggaggccatgggccgctgc
cgcaaagaccccctgagcggcctgctgctcctgccagcggcactggaggggtaccgctgc
taccgctccgcctcccccaccgtcccgcgctcccttcgctga

KEGG   Homo sapiens (human): 5493
Entry
5493              CDS       T01001                                 
Symbol
PPL
Name
(RefSeq) periplakin
  KO
K10386  periplakin
Organism
hsa  Homo sapiens (human)
Pathway
hsa04382  Cornified envelope formation
Network
nt06545  Cornified envelope formation
  Element
N01922  Localization of envoplakin and periplakin to the plasma membrane (PM)
N01923  Crosslinking of envoplakin, periplakin and involucrin
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09158 Development and regeneration
   04382 Cornified envelope formation
    5493 (PPL)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04812 Cytoskeleton proteins [BR:hsa04812]
    5493 (PPL)
Cytoskeleton proteins [BR:hsa04812]
 Eukaryotic cytoskeleton proteins
  Intermediate filaments
   Intermediate filament-binding proteins
    5493 (PPL)
Motif
Pfam: Spectrin_1st_PEPL SH3_10 Plectin Spectrin ATG16
Other DBs
NCBI-GeneID: 5493
NCBI-ProteinID: NP_002696
OMIM: 602871
HGNC: 9273
Ensembl: ENSG00000118898
UniProt: O60437
Position
16:complement(4882507..4937148)
AA seq 1756 aa
MNSLFRKRNKGKYSPTVQTRSISNKELSELIEQLQKNADQVEKNIVDTEAKMQSDLARLQ
EGRQPEHRDVTLQKVLDSEKLLYVLEADAAIAKHMKHPQGDMIAEDIRQLKERVTNLRGK
HKQIYRLAVKEVDPQVNWAALVEEKLDKLNNQSFGTDLPLVDHQVEEHNIFHNEVKAIGP
HLAKDGDKEQNSELRAKYQKLLAASQARQQHLSSLQDYMQRCTNELYWLDQQAKGRMQYD
WSDRNLDYPSRRRQYENFINRNLEAKEERINKLHSEGDQLLAAEHPGRNSIEAHMEAVHA
DWKEYLNLLICEESHLKYMEDYHQFHEDVKDAQELLRKVDSDLNQKYGPDFKDRYQIELL
LRELDDQEKVLDKYEDVVQGLQKRGQQVVPLKYRRETPLKPIPVEALCDFEGEQGLISRG
YSYTLQKNNGESWELMDSAGNKLIAPAVCFVIPPTDPEALALADSLGSQYRSVRQKAAGS
KRTLQQRYEVLKTENPGDASDLQGRQLLAGLDKVASDLDRQEKAITGILRPPLEQGRAVQ
DSAERAKDLKNITNELLRIEPEKTRSTAEGEAFIQALPGSGTTPLLRTRVEDTNRKYEHL
LQLLDLAQEKVDVANRLEKSLQQSWELLATHENHLNQDDTVPESSRVLDSKGQELAAMAC
ELQAQKSLLGEVEQNLQAAKQCSSTLASRFQEHCPDLERQEAEVHKLGQRFNNLRQQVER
RAQSLQSAKAAYEHFHRGHDHVLQFLVSIPSYEPQETDSLSQMETKLKNQKNLLDEIASR
EQEVQKICANSQQYQQAVKDYELEAEKLRSLLDLENGRRSHVSKRARLQSPATKVKEEEA
ALAAKFTEVYAINRQRLQNLEFALNLLRQQPEVEVTHETLQRNRPDSGVEEAWKIRKELD
EETERRRQLENEVKSTQEEIWTLRNQGPQESVVRKEVLKKVPDPVLEESFQQLQRTLAEE
QHKNQLLQEELEALQLQLRALEQETRDGGQEYVVKEVLRIEPDRAQADEVLQLREELEAL
RRQKGAREAEVLLLQQRVAALAEEKSRAQEKVTEKEVVKLQNDPQLEAEYQQLQEDHQRQ
DQLREKQEEELSFLQDKLKRLEKERAMAEGKITVKEVLKVEKDAATEREVSDLTRQYEDE
AAKARASQREKTELLRKIWALEEENAKVVVQEKVREIVRPDPKAESEVANLRLELVEQER
KYRGAEEQLRSYQSELEALRRRGPQVEVKEVTKEVIKYKTDPEMEKELQRLREEIVDKTR
LIERCDLEIYQLKKEIQALKDTKPQVQTKEVVQEILQFQEDPQTKEEVASLRAKLSEEQK
KQVDLERERASQEEQIARKEEELSRVKERVVQQEVVRYEEEPGLRAEASAFAESIDVELR
QIDKLRAELRRLQRRRTELERQLEELERERQARREAEREVQRLQQRLAALEQEEAEAREK
VTHTQKVVLQQDPQQAREHALLRLQLEEEQHRRQLLEGELETLRRKLAALEKAEVKEKVV
LSESVQVEKGDTEQEIQRLKSSLEEESRSKRELDVEVSRLEARLSELEFHNSKSSKELDF
LREENHKLQLERQNLQLETRRLQSEINMAATETRDLRNMTVADSGTNHDSRLWSLERELD
DLKRLSKDKDLEIDELQKRLGSVAVKREQRENHLRRSIVVIHPDTGRELSPEEAHRAGLI
DWNMFVKLRSQECDWEEISVKGPNGESSVIHDRKSGKKFSIEEALQSGRLTPAQYDRYVN
KDMSIQELAVLVSGQK
NT seq 5271 nt
atgaactcgctcttcaggaagagaaacaaaggcaaatacagccccactgtgcagacccgg
agcatctctaacaaggagctctcggagctgatcgagcagctgcagaagaatgccgaccag
gtggagaagaacatcgtggacacagaggccaagatgcagagtgacctggctcggctgcag
gagggtcggcagcctgagcaccgggacgtgaccctgcagaaggtgttggactctgagaag
ctgctctatgtgctagaggcggatgcggccattgccaagcacatgaagcacccacagggg
gacatgatcgccgaggatatccgccagctgaaggagcgtgtgaccaacctgcgcgggaaa
cacaagcagatctacaggctggcggtgaaggaagtggatccacaggtcaactgggcggca
ctggtggaggagaagctggacaagctgaacaaccagagctttgggactgacctgccgctg
gtggaccaccaagtggaggagcataacatcttccacaatgaggtcaaggccatcgggccc
cacctggccaaggacggggacaaggagcagaacagcgaactccgggccaagtaccagaaa
ctgctggcagcatcacaggcccggcagcagcacctgagttcgctgcaggactacatgcag
cgctgcaccaatgagctgtactggctggaccagcaggccaagggccgcatgcagtacgac
tggagtgaccgcaacctcgactaccccagccgccggcgccagtatgagaatttcatcaac
cggaacctggaggccaaagaggagagaatcaacaaactgcacagcgagggcgaccagctg
ctggcggccgagcaccccgggaggaactccattgaggcgcacatggaggctgtgcacgca
gactggaaggagtacctgaacctgctcatctgcgaggagagccacctcaagtacatggag
gactaccaccagtttcacgaagacgtgaaggacgctcaggagctgctgcgcaaggtggac
tcggacctgaaccagaagtatggccctgacttcaaggaccggtaccagattgagctgctg
ctgcgggagctggatgaccaggagaaggtgctggacaagtatgaggacgtggtgcagggg
ctgcagaagcgaggccagcaggtggtgcccctcaagtaccgccgggagactccgctcaag
cccatccccgtggaggcactctgtgactttgagggggagcagggcctgatctcgcggggc
tacagctacaccctgcagaagaacaacggggagagctgggagctcatggacagcgctggg
aacaagctgattgctccggccgtgtgttttgtgatcccccccacagaccctgaggccctg
gctctggctgacagcctgggcagccagtaccggagcgtgcggcagaaggcagctgggagc
aaacgcacgctgcagcagcggtatgaggtgctgaagaccgagaatcccggagatgcctct
gacctacaggggcggcagctgctggctggcttggacaaggtggccagcgacctggaccgg
caggagaaggccatcacagggatcctgcggccaccactggagcaaggccgggctgtgcag
gacagtgccgagcgggccaaggacctcaagaacatcaccaacgagctactgcggattgaa
cctgagaagacgcggagcacggctgagggcgaagccttcatccaggccctcccaggcagt
ggcaccacacccctgctgaggacccgggtggaggacaccaaccggaaatacgagcacctc
ctgcagctgctggacttggcccaggagaaggttgatgtggccaaccgcctggagaagagc
ctgcagcagagctgggagttgctggccacacacgagaaccatctgaatcaggatgacaca
gtgcctgagagcagccgtgtcctggacagcaaggggcaggagctggcggccatggcctgt
gagttacaggcccagaagtccctcctgggtgaggtggagcagaacttgcaggcggccaag
cagtgctcgagcacactggccagccgcttccaggagcactgtccggacctggagcgccag
gaggccgaggtgcacaagctgggccagcgtttcaacaacctgcgccagcaggtggaacgc
agggcgcagagcctacagagcgccaaggcagcctacgagcacttccaccgcggccatgac
cacgtgctgcagttcctagtcagcatccccagttacgagccccaggagacagacagcctc
agccagatggagaccaagctgaagaaccagaagaacctgctagatgagatagcaagtagg
gagcaggaagtacagaagatctgtgccaattcccagcagtaccagcaagctgtaaaggac
tatgagttagaagcagaaaaactaaggtctcttctcgacttggagaatggaaggagaagc
cacgtgagcaagagagccaggctccaatctcctgccaccaaagtgaaggaagaggaagca
gcacttgccgccaagttcactgaagtttatgccatcaacagacagaggctgcagaatctg
gagtttgctctgaatctcctcagacagcagccggaagtagaagtgacccatgagaccctg
caaaggaataggccggactctggagtggaggaggcgtggaagatcaggaaggaactggat
gaggagactgagcggaggcggcagctggagaacgaggtcaagagcacccaggaagaaatc
tggaccttgaggaatcaggggcctcaggaatcggtggtgaggaaggaggtgctcaagaag
gtgccggatcccgtgctggaggagagcttccagcagctgcagcggacgctggcagaggag
cagcacaagaaccagctgctgcaggaggagctggaggcactgcagctgcagctgcgtgcc
ctggagcaggagaccagagacggggggcaggagtacgtggtcaaggaggtcctgcgcatc
gagcctgacagggcccaggcggatgaggtcttgcagctgcgggaggagctggaggcactg
aggcggcagaagggcgcccgggaggcagaggtgctcctcctgcagcagcgtgtggccgcc
ctggctgaagagaagagccgggcgcaggagaaggtcacagagaaagaggtggtgaaactg
cagaatgacccccagctggaggcagagtaccagcagctgcaggaggaccaccagcgccag
gaccagctcagggagaagcaggaggaggagctgagcttcctccaggacaagctcaagagg
ctagagaaggagcgggccatggccgagggcaagatcaccgtcaaggaggtgctcaaggtg
gagaaggacgcggccaccgagagggaggtcagcgatctcacccgccaatatgaggacgag
gctgccaaggctcgcgctagccagagggagaagacggagctgctccgaaagatatgggcc
ttggaggaggagaacgccaaagtggtggtgcaggagaaggtgcgggagatcgtgcggcca
gaccccaaggcggaaagtgaagtggcgaacctccgcctggagcttgtggagcaggagcga
aagtaccggggtgccgaggagcagctccggagctaccagagtgagctggaggccctcagg
aggcgaggcccccaggtggaagtcaaagaggtgactaaggaagtcattaagtacaagact
gaccctgagatggagaaggagcttcagcggctcagggaggagatcgtggacaagaccaga
ctgatcgaaaggtgtgatttagagatctaccagctgaaaaaggaaatccaggccctgaaa
gacaccaaaccccaggtccagaccaaagaggtggtccaggagatcctccaattccaagaa
gaccctcaaaccaaggaggaggtggcgtctctgagggcaaagctctcagaggagcagaag
aaacaagtggatctggagagggaaagagcttcccaggaagagcagatcgcccggaaagag
gaggagctctcgcgggtgaaggaaagggtggtgcagcaggaggtggtcaggtatgaggag
gagccaggcctgcgggccgaggcgagcgcctttgccgagagcatcgatgtggagctgcgg
cagattgacaagctgcgggcagagctgcggcggctgcagcgccggcgcaccgagcttgag
cggcagctggaggagctagagcgcgagcggcaggcccgcagggaggccgagcgcgaggta
cagcggttgcagcagcggctggcagcgctggagcaggaagaagctgaggcccgtgagaag
gtaacccatacgcagaaggtggtgctgcagcaggacccgcagcaggcgcgagagcatgcc
ctgctccgactccagctggaagaagagcagcaccggcggcagctcctggagggggagctc
gagaccctccggaggaaactggctgcactggagaaggcggaggtcaaggagaaggtggtg
ctctccgagagtgtccaggtggagaagggcgacaccgagcaagagatccagaggctcaag
agcagcctggaggaggagagccgcagcaagcgcgagctggacgtcgaggtgagccggctg
gaagccaggctttcggagctggaattccataactccaagtcatccaaggaactagacttt
ctgagggaagagaaccacaaattacagctggagaggcaaaacctgcagctggagacccga
aggctccaatcggaaatcaacatggcagcgacggaaacacgagacctgcggaacatgacc
gtggcggactctgggaccaaccatgactccagactgtggtccctggagagggaactggat
gacctcaagaggctctccaaggacaaagacctcgagatcgacgagctgcagaagcgcctg
ggctccgtggccgtcaagcgggagcagcgggagaaccacctgcggcgctccatcgtagtc
atccaccctgacacaggccgcgagctgtccccggaggaagcccaccgtgccgggctcatt
gactggaacatgttcgtgaaactcagaagccaggagtgcgactgggaggagatctcagtg
aagggtcccaatggggagtcctcagtgatacacgacaggaagtctggcaagaagttctcc
atcgaagaggccctgcagagtggcaggctgacccctgctcagtatgaccgctatgtcaac
aaggatatgtccatccaggagctggcggtcttggtatctgggcagaagtag
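
The sequence lengths stated in the three entries above can be cross-checked against each other: each nucleotide sequence ends in a stop codon (taa, tga, tag), so a CDS should be three nucleotides per residue plus three for the stop. The following is a minimal Python sketch of that check, not part of the KEGG records. The counts are copied from the "AA seq" and "NT seq" header lines; the names `cds_length`, `ENTRIES`, and `check_entries` are hypothetical and introduced only for illustration.

```python
def cds_length(aa_count: int) -> int:
    """Expected CDS length in nt: one codon per residue plus one stop codon."""
    return 3 * (aa_count + 1)


# (protein length in aa, CDS length in nt) as stated in the entry headers.
ENTRIES = {
    "IVL": (585, 1758),    # hsa:3713
    "EVPL": (2033, 6102),  # hsa:2125
    "PPL": (1756, 5271),   # hsa:5493
}


def check_entries(entries=ENTRIES):
    """Return, per gene symbol, whether the stated nt length matches the aa length."""
    return {symbol: cds_length(aa) == nt for symbol, (aa, nt) in entries.items()}


if __name__ == "__main__":
    for symbol, ok in check_entries().items():
        print(symbol, "consistent" if ok else "mismatch")
```

The check only compares the stated lengths; it does not translate the sequences or verify them residue by residue.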
