KEGG   Homo sapiens (human): 100293534
Entry
100293534         CDS       T01001                                 
Symbol
C4B_2
Name
(RefSeq) complement C4-B-like preproprotein
  KO
K03989  complement component 4
Organism
hsa  Homo sapiens (human)
Pathway
hsa04610  Complement and coagulation cascades
hsa04936  Alcoholic liver disease
hsa05133  Pertussis
hsa05150  Staphylococcus aureus infection
hsa05171  Coronavirus disease
hsa05322  Systemic lupus erythematosus
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    100293534 (C4B_2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease
    100293534 (C4B_2)
  09171 Infectious disease: bacterial
   05133 Pertussis
    100293534 (C4B_2)
   05150 Staphylococcus aureus infection
    100293534 (C4B_2)
  09163 Immune disease
   05322 Systemic lupus erythematosus
    100293534 (C4B_2)
  09167 Endocrine and metabolic disease
   04936 Alcoholic liver disease
    100293534 (C4B_2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    100293534 (C4B_2)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    100293534 (C4B_2)
Peptidases and inhibitors [BR:hsa01002]
 Peptidase inhibitors
  Family I39: alpha2M family
   100293534 (C4B_2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of hepatic cells
   100293534 (C4B_2)
  Exosomal proteins of other cancer cells
   100293534 (C4B_2)
SSDB
Motif
Pfam: TED_complement C4_MG1 CO4A-B_CUB_C A2M_recep A2M A2M_BRD NTR MG2 MG3 ANATO MG4 ParB_C DUF7363 Big_1
Other DBs
NCBI-GeneID: 100293534
NCBI-ProteinID: NP_001229752
HGNC: 42398
Ensembl: ENSP00000412786.2
UniProt: P0C0L5
Structure
LinkDB
Position
6:3283246..3303870
AA seq 1744 aa
MRLLWGLIWASSFFTLSLQKPRLLLFSPSVVHLGVPLSVGVQLQDVPRGQVVKGSVFLRN
PSRNNVPCSPKVDFTLSSERDFALLSLQVPLKDAKSCGLHQLLRGPEVQLVAHSPWLKDS
LSRTTNIQGINLLFSSRRGHLFLQTDQPIYNPGQRVRYRVFALDQKMRPSTDTITVMVEN
SHGLRVRKKEVYMPSSIFQDDFVIPDISEPGTWKISARFSDGLESNSSTQFEVKKYVLPN
FEVKITPGKPYILTVPGHLDEMQLDIQARYIYGKPVQGVAYVRFGLLDEDGKKTFFRGLE
SQTKLVNGQSHISLSKAEFQDALEKLNMGITDLQGLRLYVAAAIIEYPGGEMEEAELTSW
YFVSSPFSLDLSKTKRHLVPGAPFLLQALVREMSGSPASGIPVKVSATVSSPGSVPEVQD
IQQNTDGSGQVSIPIIIPQTISELQLSVSAGSPHPAIARLTVAAPPSGGPGFLSIERPDS
RPPRVGDTLNLNLRAVGSGATFSHYYYMILSRGQIVFMNREPKRTLTSVSVFVDHHLAPS
FYFVAFYYHGDHPVANSLRVDVQAGACEGKLELSVDGAKQYRNGESVKLHLETDSLALVA
LGALDTALYAAGSKSHKPLNMGKVFEAMNSYDLGCGPGGGDSALQVFQAAGLAFSDGDQW
TLSRKRLSCPKEKTTRKKRNVNFQKAINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRA
ARVQQPDCREPFLSCCQFAESLRKKSRDKGQAGLQRALEILQEEDLIDEDDIPVRSFFPE
NWLWRVETVDRFQILTLWLPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREFHLHLRLP
MSVRRFEQLELRPVLYNYLDKNLTVSVHVSPVEGLCLAGGGGLAQQVLVPAGSARPVAFS
VVPTAAAAVSLKVVARGSFEFPVGDAVSKVLQIEKEGAIHREELVYELNPLDHRGRTLEI
PGNSDPNMIPDGDFNSYVRVTASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAP
TLAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRGSSTWLTA
FVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGGLVGNDETVA
LTAFVTIALHHGLAVFQDEGAEPLKQRVEASISKASSFLGEKASAGLLGAHAAAITAYAL
TLTKAPADLRGVAHNNLMAMAQETGDNLYWGSVTGSQSNAVSPTPAPRNPSDPMPQAPAL
WIETTAYALLHLLLHEGKAEMADQAAAWLTRQGSFQGGFRSTQDTVIALDALSAYWIASH
TTEERGLNVTLSSTGRNGFKSHALQLNNRQIRGLEEELQFSLGSKINVKVGGNSKGTLKV
LRTYNVLDMKNTTCQDLQIEVTVKGHVEYTMEANEDYEDYEYDELPAKDDPDAPLQPVTP
LQLFEGRRNRRRREAPKVVEEQESRVHYTVCIWRNGKVGLSGMAIADVTLLSGFHALRAD
LEKLTSLSDRYVSHFETEGPHVLLYFDSVPTSRECVGFEAVQEVPVGLVQPASATLYDYY
NPERRCSVFYGAPSKSRLLATLCSAEVCQCAEGKCPRQRRALERGLQDEDGYRMKFACYY
PRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNFLVRASCRLRLEPG
KEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQ
GCQV
NT seq 5235 nt   +upstreamnt  +downstreamnt
atgaggctgctctgggggctgatctgggcatccagcttcttcaccttatctctgcagaag
cccaggttgctcttgttctctccttctgtggttcatctgggggtccccctatcggtgggg
gtgcagctccaggatgtgccccgaggacaggtagtgaaaggatcagtgttcctgagaaac
ccatctcgtaataatgtcccctgctccccaaaggtggacttcacccttagctcagaaaga
gacttcgcactcctcagtctccaggtgcccttgaaagatgcgaagagctgtggcctccat
caactcctcagaggccctgaggtccagctggtggcccattcgccatggctaaaggactct
ctgtccagaacgacaaacatccagggtatcaacctgctcttctcctctcgccgggggcac
ctctttttgcagacggaccagcccatttacaaccctggccagcgggttcggtaccgggtc
tttgctctggatcagaagatgcgcccgagcactgacaccatcacagtcatggtggagaac
tctcacggcctccgcgtgcggaagaaggaggtgtacatgccctcgtccatcttccaggat
gactttgtgatcccagacatctcagagccagggacctggaagatctcagcccgattctca
gatggcctggaatccaacagcagcacccagtttgaggtgaagaaatatgtccttcccaac
tttgaggtgaagatcacccctggaaagccctacatcctgacggtgccaggccatcttgat
gaaatgcagttagacatccaggccaggtacatctatgggaagccagtgcagggggtggca
tatgtgcgctttgggctcctagatgaggatggtaagaagactttctttcgggggctggag
agtcagaccaagctggtgaatggacagagccacatttccctctcaaaggcagagttccag
gacgccctggagaagctgaatatgggcattactgacctccaggggctgcgcctctacgtt
gctgcagccatcattgagtatccaggtggggagatggaggaggcagagctcacatcctgg
tattttgtgtcatctcccttctccttggatcttagcaagaccaagcgacaccttgtgcct
ggggcccccttcctgctgcaggccttggtccgtgagatgtcaggctccccagcttctggc
attcctgtcaaagtttctgccacggtgtcttctcctgggtctgttcctgaagtccaggac
attcagcaaaacacagacgggagcggccaagtcagcattccaataattatccctcagacc
atctcagagctgcagctctcagtatctgcaggctccccacatccagcgatagccaggctc
actgtggcagccccaccttcaggaggccccgggtttctgtctattgagcggccggattct
cgacctcctcgtgttggggacactctgaacctgaacttgcgagccgtgggcagtggggcc
accttttctcattactactacatgatcctatcccgagggcagatcgtgttcatgaatcga
gagcccaagaggaccctgacctcggtctcggtgtttgtggaccatcacctggcaccctcc
ttctactttgtggccttctactaccatggagaccacccagtggccaactccctgcgagtg
gatgtccaggctggggcctgcgagggcaagctggagctcagcgtggacggtgccaagcag
taccggaacggggagtccgtgaagctccacttagaaaccgactccctagccctggtggcg
ctgggagccttggacacagctctgtatgctgcaggcagcaagtcccacaagcccctcaac
atgggcaaggtctttgaagctatgaacagctatgacctcggctgtggtcctgggggtggg
gacagtgcccttcaggtgttccaggcagcgggcctggccttttctgatggagaccagtgg
accttatccagaaagagactaagctgtcccaaggagaagacaacccggaaaaagagaaac
gtgaacttccaaaaggcgattaatgagaaattgggtcagtatgcttccccgacagccaag
cgctgctgccaggatggggtgacacgtctgcccatgatgcgttcctgcgagcagcgggca
gcccgcgtgcagcagccggactgccgggagcccttcctgtcctgctgccaatttgctgag
agtctgcgcaagaagagcagggacaagggccaggcgggcctccaacgagccctggagatc
ctgcaggaggaggacctgattgatgaggatgacattcccgtgcgcagcttcttcccagag
aactggctctggagagtggaaacagtggaccgctttcaaatattgacactgtggctcccc
gactctctgaccacgtgggagatccatggcctgagcctgtccaaaaccaaaggcctatgt
gtggccaccccagtccagctccgggtgttccgcgagttccacctgcacctccgcctgccc
atgtctgtccgccgctttgagcagctggagctgcggcctgtcctctataactacctggat
aaaaacctgactgtgagcgtccacgtgtccccagtggaggggctgtgcctggctgggggc
ggagggctggcccagcaggtgctggtgcctgcgggctctgcccggcctgttgccttctct
gtggtgcccacggcagccgccgctgtgtctctgaaggtggtggctcgagggtccttcgaa
ttccctgtgggagatgcggtgtccaaggttctgcagattgagaaggaaggggccatccat
agagaggagctggtctatgaactcaaccccttggaccaccgaggccggaccttggaaata
cctggcaactctgatcccaatatgatccctgatggggactttaacagctacgtcagggtt
acagcctcagatccattggacactttaggctctgagggggccttgtcaccaggaggcgtg
gcctccctcttgaggcttcctcgaggctgtggggagcaaaccatgatctacttggctccg
acactggctgcttcccgctacctggacaagacagagcagtggagcacactgcctcccgag
accaaggaccacgccgtggatctgatccagaaaggctacatgcggatccagcagtttcgg
aaggcggatggttcctatgcggcttggttgtcacggggcagcagcacctggctcacagcc
tttgtgttgaaggtcctgagtttggcccaggagcaggtaggaggctcgcctgagaaactg
caggagacatctaactggcttctgtcccagcagcaggctgacggctcgttccaggacctc
tctccagtgatacataggagcatgcaggggggtttggtgggcaatgatgagactgtggca
ctcacagcctttgtgaccatcgcccttcatcatgggctggccgtcttccaggatgagggt
gcagagccattgaagcagagagtggaagcctccatctcaaaggcaagctcatttttgggg
gagaaagcaagtgctgggctcctgggtgcccacgcagctgccatcacggcctatgccctg
acactgaccaaggcccctgcggacctgcggggtgttgcccacaacaacctcatggcaatg
gcccaggagactggagataacctgtactggggctcagtcactggttctcagagcaatgcc
gtgtcgcccaccccagctcctcgcaacccatccgaccccatgccccaggccccagccctg
tggattgaaaccacagcctacgccctgctgcacctcctgcttcacgagggcaaagcagag
atggcagaccaggctgcggcctggctcacccgtcagggcagcttccaagggggattccgc
agtacccaagacacggtgattgccctggatgccctgtctgcctactggattgcctcccac
accactgaggagaggggtctcaatgtgactctcagctccacaggccggaatgggttcaag
tcccacgcgctgcagctgaacaaccgccagattcgcggcctggaggaggagctgcagttt
tccttgggcagcaagatcaatgtgaaggtgggaggaaacagcaaaggaaccctgaaggtc
cttcgtacctacaatgtcctggacatgaagaacacgacctgccaggacctacagatagaa
gtgacagtcaaaggccacgtcgagtacacgatggaagcaaacgaggactatgaggactat
gagtacgatgagcttccagccaaggatgacccagatgcccctctgcagcccgtgacaccc
ctgcagctgtttgagggtcggaggaaccgccgcaggagggaggcgcccaaggtggtggag
gagcaggagtccagggtgcactacaccgtgtgcatctggcggaacggcaaggtggggctg
tctggcatggccatcgcggacgtcaccctcctgagtggattccacgccctgcgtgctgac
ctggagaagctgacctccctctctgaccgttacgtgagtcactttgagaccgaggggccc
cacgtcctgctgtattttgactcggtccccacctcccgggagtgcgtgggctttgaggct
gtgcaggaagtgccggtggggctggtgcagccggccagcgcaaccctgtacgactactac
aaccccgagcgcagatgttctgtgttttacggggcaccaagtaagagcagactcttggcc
accttgtgttctgctgaagtctgccagtgtgctgaggggaagtgccctcgccagcgtcgc
gccctggagcggggtctgcaggacgaggatggctacaggatgaagtttgcctgctactac
ccccgtgtggagtacggcttccaggttaaggttctccgagaagacagcagagctgctttc
cgcctctttgagaccaagatcacccaagtcctgcacttcaccaaggatgtcaaggccgct
gctaatcagatgcgcaacttcctggttcgagcctcctgccgccttcgcttggaacctggg
aaagaatatttgatcatgggtctagatggggccacctatgacctcgagggacacccccag
tacctgctggactcgaatagctggatcgaggagatgccctctgaacgcctgtgccggagc
acccgccagcgggcagcctgtgcccagctcaacgacttcctccaggagtatggcactcag
gggtgccaggtgtga

KEGG   Homo sapiens (human): 110384692
Entry
110384692         CDS       T01001                                 
Symbol
C4A_2
Name
(RefSeq) complement C4A (Rodgers blood group)-like preproprotein
  KO
K03989  complement component 4
Organism
hsa  Homo sapiens (human)
Pathway
hsa04610  Complement and coagulation cascades
hsa04936  Alcoholic liver disease
hsa05133  Pertussis
hsa05150  Staphylococcus aureus infection
hsa05171  Coronavirus disease
hsa05322  Systemic lupus erythematosus
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    110384692 (C4A_2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease
    110384692 (C4A_2)
  09171 Infectious disease: bacterial
   05133 Pertussis
    110384692 (C4A_2)
   05150 Staphylococcus aureus infection
    110384692 (C4A_2)
  09163 Immune disease
   05322 Systemic lupus erythematosus
    110384692 (C4A_2)
  09167 Endocrine and metabolic disease
   04936 Alcoholic liver disease
    110384692 (C4A_2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    110384692 (C4A_2)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    110384692 (C4A_2)
Peptidases and inhibitors [BR:hsa01002]
 Peptidase inhibitors
  Family I39: alpha2M family
   110384692 (C4A_2)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of hepatic cells
   110384692 (C4A_2)
  Exosomal proteins of other cancer cells
   110384692 (C4A_2)
SSDB
Motif
Pfam: TED_complement C4_MG1 CO4A-B_CUB_C A2M_recep A2M A2M_BRD NTR MG2 MG3 ANATO MG4 ParB_C DUF7363 Big_1 SQHop_cyclase_C
Other DBs
NCBI-GeneID: 110384692
NCBI-ProteinID: NP_001338929
Ensembl: ENSP00000388662.2
LinkDB
Position
6:3356831..3377455
AA seq 1744 aa
MRLLWGLIWASSFFTLSLQKPRLLLFSPSVVHLGVPLSVGVQLQDVPRGQVVKGSVFLRN
PSRNNVPCSPKVDFTLSSERDFALLSLQVPLKDAKSCGLHQLLRGPEVQLVAHSPWLKDS
LSRTTNIQGINLLFSSRRGHLFLQTDQPIYNPGQRVRYRVFALDQKMRPSTDTITVMVEN
SHGLRVRKKEVYMPSSIFQDDFVIPDISEPGTWKISARFSDGLESNSSTQFEVKKYVLPN
FEVKITPGKPYILTVPGHLDEMQLDIQARYIYGKPVQGVAYVRFGLLDEDGKKTFFRGLE
SQTKLVNGQSHISLSKAEFQDALEKLNMGITDLQGLRLYVAAAIIEYPGGEMEEAELTSW
YFVSSPFSLDLSKTKRHLVPGAPFLLQALVREMSGSPASGIPVKVSATVSSPGSVPEVQD
IQQNTDGSGQVSIPIIIPQTISELQLSVSAGSPHPAIARLTVAAPPSGGPGFLSIERPDS
RPPRVGDTLNLNLRAVGSGATFSHYYYMILSRGQIVFMNREPKRTLTSVSVFVDHHLAPS
FYFVAFYYHGDHPVANSLRVDVQAGACEGKLELSVDGAKQYRNGESVKLHLETDSLALVA
LGALDTALYAAGSKSHKPLNMGKVFEAMNSYDLGCGPGGGDSALQVFQAAGLAFSDGDQW
TLSRKRLSCPKEKTTRKKRNVNFQKAINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRA
ARVQQPDCREPFLSCCQFAESLRKKSRDKGQAGLQRALEILQEEDLIDEDDIPVRSFFPE
NWLWRVETVDRFQILTLWLPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREFHLHLRLP
MSVRRFEQLELRPVLYNYLDKNLTVSVHVSPVEGLCLAGGGGLAQQVLVPAGSARPVAFS
VVPTAAAAVSLKVVARGSFEFPVGDAVSKVLQIEKEGAIHREELVYELNPLDHRGRTLEI
PGNSDPNMIPDGDFNSYVRVTASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAP
TLAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRDSSTWLTA
FVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDPCPVLDRSMQGGLVGNDETVA
LTAFVTIALHHGLAVFQDEGAEPLKQRVEASISKANSFLGEKASAGLLGAHAAAITAYAL
TLTKAPVDLLGVAHNNLMAMAQETGDNLYWGSVTGSQSNAVSPTPAPRNPSDPMPQAPAL
WIETTAYALLHLLLHEGKAEMADQAAAWLTRQGSFQGGFRSTQDTVIALDALSAYWIASH
TTEERGLNVTLSSTGRNGFKSHALQLNNRQIRGLEEELQFSLGSKINVKVGGNSKGTLKV
LRTYNVLDMKNTTCQDLQIEVTVKGHVEYTMEANEDYEDYEYDELPAKDDPDAPLQPVTP
LQLFEGRRNRRRREAPKVVEEQESRVHYTVCIWRNGKVGLSGMAIADVTLLSGFHALRAD
LEKLTSLSDRYVSHFETEGPHVLLYFDSVPTSRECVGFEAVQEVPVGLVQPASATLYDYY
NPERRCSVFYGAPSKSRLLATLCSAEVCQCAEGKCPRQRRALERGLQDEDGYRMKFACYY
PRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNFLVRASCRLRLEPG
KEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQ
GCQV
NT seq 5235 nt   +upstreamnt  +downstreamnt
atgaggctgctctgggggctgatctgggcatccagcttcttcaccttatctctgcagaag
cccaggttgctcttgttctctccttctgtggttcatctgggggtccccctatcggtgggg
gtgcagctccaggatgtgccccgaggacaggtagtgaaaggatcagtgttcctgagaaac
ccatctcgtaataatgtcccctgctccccaaaggtggacttcacccttagctcagaaaga
gacttcgcactcctcagtctccaggtgcccttgaaagatgcgaagagctgtggcctccat
caactcctcagaggccctgaggtccagctggtggcccattcgccatggctaaaggactct
ctgtccagaacgacaaacatccagggtatcaacctgctcttctcctctcgccgggggcac
ctctttttgcagacggaccagcccatttacaaccctggccagcgggttcggtaccgggtc
tttgctctggatcagaagatgcgcccgagcactgacaccatcacagtcatggtggagaac
tctcacggcctccgcgtgcggaagaaggaggtgtacatgccctcgtccatcttccaggat
gactttgtgatcccagacatctcagagccagggacctggaagatctcagcccgattctca
gatggcctggaatccaacagcagcacccagtttgaggtgaagaaatatgtccttcccaac
tttgaggtgaagatcacccctggaaagccctacatcctgacggtgccaggccatcttgat
gaaatgcagttagacatccaggccaggtacatctatgggaagccagtgcagggggtggca
tatgtgcgctttgggctcctagatgaggatggtaagaagactttctttcgggggctggag
agtcagaccaagctggtgaatggacagagccacatttccctctcaaaggcagagttccag
gacgccctggagaagctgaatatgggcattactgacctccaggggctgcgcctctacgtt
gctgcagccatcattgagtatccaggtggggagatggaggaggcagagctcacatcctgg
tattttgtgtcatctcccttctccttggatcttagcaagaccaagcgacaccttgtgcct
ggggcccccttcctgctgcaggccttggtccgtgagatgtcaggctccccagcttctggc
attcctgtcaaagtttctgccacggtgtcttctcctgggtctgttcctgaagtccaggac
attcagcaaaacacagacgggagcggccaagtcagcattccaataattatccctcagacc
atctcagagctgcagctctcagtatctgcaggctccccacatccagcgatagccaggctc
actgtggcagccccaccttcaggaggccccgggtttctgtctattgagcggccggattct
cgacctcctcgtgttggggacactctgaacctgaacttgcgagccgtgggcagtggggcc
accttttctcattactactacatgatcctatcccgagggcagatcgtgttcatgaatcga
gagcccaagaggaccctgacctcggtctcggtgtttgtggaccatcacctggcaccctcc
ttctactttgtggccttctactaccatggagaccacccagtggccaactccctgcgagtg
gatgtccaggctggggcctgcgagggcaagctggagctcagcgtggacggtgccaagcag
taccggaacggggagtccgtgaagctccacttagaaaccgactccctagccctggtggcg
ctgggagccttggacacagctctgtatgctgcaggcagcaagtcccacaagcccctcaac
atgggcaaggtctttgaagctatgaacagctatgacctcggctgtggtcctgggggtggg
gacagtgcccttcaggtgttccaggcagcgggcctggccttttctgatggagaccagtgg
accttatccagaaagagactaagctgtcccaaggagaagacaacccggaaaaagagaaac
gtgaacttccaaaaggcgattaatgagaaattgggtcagtatgcttccccgacagccaag
cgctgctgccaggatggggtgacacgtctgcccatgatgcgttcctgcgagcagcgggca
gcccgcgtgcagcagccggactgccgggagcccttcctgtcctgctgccaatttgctgag
agtctgcgcaagaagagcagggacaagggccaggcgggcctccaacgagccctggagatc
ctgcaggaggaggacctgattgatgaggatgacattcccgtgcgcagcttcttcccagag
aactggctctggagagtggaaacagtggaccgctttcaaatattgacactgtggctcccc
gactctctgaccacgtgggagatccatggcctgagcctgtccaaaaccaaaggcctatgt
gtggccaccccagtccagctccgggtgttccgcgagttccacctgcacctccgcctgccc
atgtctgtccgccgctttgagcagctggagctgcggcctgtcctctataactacctggat
aaaaacctgactgtgagcgtccacgtgtccccagtggaggggctgtgcctggctgggggc
ggagggctggcccagcaggtgctggtgcctgcgggctctgcccggcctgttgccttctct
gtggtgcccacggcagccgccgctgtgtctctgaaggtggtggctcgagggtccttcgaa
ttccctgtgggagatgcggtgtccaaggttctgcagattgagaaggaaggggccatccat
agagaggagctggtctatgaactcaaccccttggaccaccgaggccggaccttggaaata
cctggcaactctgatcccaatatgatccctgatggggactttaacagctacgtcagggtt
acagcctcagatccattggacactttaggctctgagggggccttgtcaccaggaggcgtg
gcctccctcttgaggcttcctcgaggctgtggggagcaaaccatgatctacttggctccg
acactggctgcttcccgctacctggacaagacagagcagtggagcacactgcctcccgag
accaaggaccacgccgtggatctgatccagaaaggctacatgcggatccagcagtttcgg
aaggcggatggttcctatgcggcttggttgtcacgggacagcagcacctggctcacagcc
tttgtgttgaaggtcctgagtttggcccaggagcaggtaggaggctcgcctgagaaactg
caggagacatctaactggcttctgtcccagcagcaggctgacggctcgttccaggacccc
tgtccagtgttagacaggagcatgcaggggggtttggtgggcaatgatgagactgtggca
ctcacagcctttgtgaccatcgcccttcatcatgggctggccgtcttccaggatgagggt
gcagagccattgaagcagagagtggaagcctccatctcaaaggcaaactcatttttgggg
gagaaagcaagtgctgggctcctgggtgcccacgcagctgccatcacggcctatgccctg
acactgaccaaggcgcctgtggacctgctcggtgttgcccacaacaacctcatggcaatg
gcccaggagactggagataacctgtactggggctcagtcactggttctcagagcaatgcc
gtgtcgcccaccccggctcctcgcaacccatccgaccccatgccccaggccccagccctg
tggattgaaaccacagcctacgccctgctgcacctcctgcttcacgagggcaaagcagag
atggcagaccaggctgcggcctggctcacccgtcagggcagcttccaagggggattccgc
agtacccaagacacggtgattgccctggatgccctgtctgcctactggattgcctcccac
accactgaggagaggggtctcaatgtgactctcagctccacaggccggaatgggttcaag
tcccacgcgctgcagctgaacaaccgccagattcgcggcctggaggaggagctgcagttt
tccttgggcagcaagatcaatgtgaaggtgggaggaaacagcaaaggaaccctgaaggtc
cttcgtacctacaatgtcctggacatgaagaacacgacctgccaggacctacagatagaa
gtgacagtcaaaggccacgtcgagtacacgatggaagcaaacgaggactatgaggactat
gagtacgatgagcttccagccaaggatgacccagatgcccctctgcagcccgtgacaccc
ctgcagctgtttgagggtcggaggaaccgccgcaggagggaggcgcccaaggtggtggag
gagcaggagtccagggtgcactacaccgtgtgcatctggcggaacggcaaggtggggctg
tctggcatggccatcgcggacgtcaccctcctgagtggattccacgccctgcgtgctgac
ctggagaagctgacctccctctctgaccgttacgtgagtcactttgagaccgaggggccc
cacgtcctgctgtattttgactcggtccccacctcccgggagtgcgtgggctttgaggct
gtgcaggaagtgccggtggggctggtgcagccggccagcgcaaccctgtacgactactac
aaccccgagcgcagatgttctgtgttttacggggcaccaagtaagagcagactcttggcc
accttgtgttctgctgaagtctgccagtgtgctgaggggaagtgccctcgccagcgtcgc
gccctggagcggggtctgcaggacgaggatggctacaggatgaagtttgcctgctactac
ccccgtgtggagtacggcttccaggttaaggttctccgagaagacagcagagctgctttc
cgcctctttgagaccaagatcacccaagtcctgcacttcaccaaggatgtcaaggccgct
gctaatcagatgcgcaacttcctggttcgagcctcctgccgccttcgcttggaacctggg
aaagaatatttgatcatgggtctagatggggccacctatgacctcgagggacacccccag
tacctgctggactcgaatagctggatcgaggagatgccctctgaacgcctgtgccggagc
acccgccagcgggcagcctgtgcccagctcaacgacttcctccaggagtatggcactcag
gggtgccaggtgtga

KEGG   Homo sapiens (human): 720
Entry
720               CDS       T01001                                 
Symbol
C4A, C4, C4A2, C4A3, C4A4, C4A6, C4AD, C4S, CO4, CPAMD2, RG
Name
(RefSeq) complement C4-A isoform 1 preproprotein
  KO
K03989  complement component 4
Organism
hsa  Homo sapiens (human)
Pathway
hsa04610  Complement and coagulation cascades
hsa04936  Alcoholic liver disease
hsa05133  Pertussis
hsa05150  Staphylococcus aureus infection
hsa05171  Coronavirus disease
hsa05322  Systemic lupus erythematosus
Network
nt06164  Kaposi sarcoma-associated herpesvirus (KSHV)
nt06171  SARS coronavirus 2 (SARS-CoV-2)
nt06513  Complement cascade
  Element
N01487  Classical pathway of complement cascade, C4/C2 to C3 convertase formation
N01491  Lectin pathway of complement cascade, C4/C2 to C3 convertase formation
Disease
H00080  Systemic lupus erythematosus
H00102  Classic complement pathway component defects
H01649  Schizophrenia
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    720 (C4A)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease
    720 (C4A)
  09171 Infectious disease: bacterial
   05133 Pertussis
    720 (C4A)
   05150 Staphylococcus aureus infection
    720 (C4A)
  09163 Immune disease
   05322 Systemic lupus erythematosus
    720 (C4A)
  09167 Endocrine and metabolic disease
   04936 Alcoholic liver disease
    720 (C4A)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    720 (C4A)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    720 (C4A)
Peptidases and inhibitors [BR:hsa01002]
 Peptidase inhibitors
  Family I39: alpha2M family
   720 (C4A)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of hepatic cells
   720 (C4A)
  Exosomal proteins of other cancer cells
   720 (C4A)
SSDB
Motif
Pfam: TED_complement C4_MG1 CO4A-B_CUB_C A2M_recep A2M A2M_BRD NTR MG2 MG3 ANATO MG4 ParB_C DUF7363 Big_1 SQHop_cyclase_C
Other DBs
NCBI-GeneID: 720
NCBI-ProteinID: NP_009224
OMIM: 120810
HGNC: 1323
Ensembl: ENSP00000396688.2
CPD: C22541 C22542 C22543 C22544
UniProt: P0C0L4
Structure
LinkDB
Position
6:31982057..32002681
AA seq 1744 aa
MRLLWGLIWASSFFTLSLQKPRLLLFSPSVVHLGVPLSVGVQLQDVPRGQVVKGSVFLRN
PSRNNVPCSPKVDFTLSSERDFALLSLQVPLKDAKSCGLHQLLRGPEVQLVAHSPWLKDS
LSRTTNIQGINLLFSSRRGHLFLQTDQPIYNPGQRVRYRVFALDQKMRPSTDTITVMVEN
SHGLRVRKKEVYMPSSIFQDDFVIPDISEPGTWKISARFSDGLESNSSTQFEVKKYVLPN
FEVKITPGKPYILTVPGHLDEMQLDIQARYIYGKPVQGVAYVRFGLLDEDGKKTFFRGLE
SQTKLVNGQSHISLSKAEFQDALEKLNMGITDLQGLRLYVAAAIIESPGGEMEEAELTSW
YFVSSPFSLDLSKTKRHLVPGAPFLLQALVREMSGSPASGIPVKVSATVSSPGSVPEVQD
IQQNTDGSGQVSIPIIIPQTISELQLSVSAGSPHPAIARLTVAAPPSGGPGFLSIERPDS
RPPRVGDTLNLNLRAVGSGATFSHYYYMILSRGQIVFMNREPKRTLTSVSVFVDHHLAPS
FYFVAFYYHGDHPVANSLRVDVQAGACEGKLELSVDGAKQYRNGESVKLHLETDSLALVA
LGALDTALYAAGSKSHKPLNMGKVFEAMNSYDLGCGPGGGDSALQVFQAAGLAFSDGDQW
TLSRKRLSCPKEKTTRKKRNVNFQKAINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRA
ARVQQPDCREPFLSCCQFAESLRKKSRDKGQAGLQRALEILQEEDLIDEDDIPVRSFFPE
NWLWRVETVDRFQILTLWLPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREFHLHLRLP
MSVRRFEQLELRPVLYNYLDKNLTVSVHVSPVEGLCLAGGGGLAQQVLVPAGSARPVAFS
VVPTAAAAVSLKVVARGSFEFPVGDAVSKVLQIEKEGAIHREELVYELNPLDHRGRTLEI
PGNSDPNMIPDGDFNSYVRVTASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAP
TLAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRDSSTWLTA
FVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDPCPVLDRSMQGGLVGNDETVA
LTAFVTIALHHGLAVFQDEGAEPLKQRVEASISKANSFLGEKASAGLLGAHAAAITAYAL
TLTKAPVDLLGVAHNNLMAMAQETGDNLYWGSVTGSQSNAVSPTPAPRNPSDPMPQAPAL
WIETTAYALLHLLLHEGKAEMADQASAWLTRQGSFQGGFRSTQDTVIALDALSAYWIASH
TTEERGLNVTLSSTGRNGFKSHALQLNNRQIRGLEEELQFSLGSKINVKVGGNSKGTLKV
LRTYNVLDMKNTTCQDLQIEVTVKGHVEYTMEANEDYEDYEYDELPAKDDPDAPLQPVTP
LQLFEGRRNRRRREAPKVVEEQESRVHYTVCIWRNGKVGLSGMAIADVTLLSGFHALRAD
LEKLTSLSDRYVSHFETEGPHVLLYFDSVPTSRECVGFEAVQEVPVGLVQPASATLYDYY
NPERRCSVFYGAPSKSRLLATLCSAEVCQCAEGKCPRQRRALERGLQDEDGYRMKFACYY
PRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNFLVRASCRLRLEPG
KEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQ
GCQV
NT seq 5235 nt   +upstreamnt  +downstreamnt
atgaggctgctctgggggctgatctgggcatccagcttcttcaccttatctctgcagaag
cccaggttgctcttgttctctccttctgtggttcatctgggggtccccctatcggtgggg
gtgcagctccaggatgtgccccgaggacaggtagtgaaaggatcagtgttcctgagaaac
ccatctcgtaataatgtcccctgctccccaaaggtggacttcacccttagctcagaaaga
gacttcgcactcctcagtctccaggtgcccttgaaagatgcgaagagctgtggcctccat
caactcctcagaggccctgaggtccagctggtggcccattcgccatggctaaaggactct
ctgtccagaacgacaaacatccagggtatcaacctgctcttctcctctcgccgggggcac
ctctttttgcagacggaccagcccatttacaaccctggccagcgggttcggtaccgggtc
tttgctctggatcagaagatgcgcccgagcactgacaccatcacagtcatggtggagaac
tctcacggcctccgcgtgcggaagaaggaggtgtacatgccctcgtccatcttccaggat
gactttgtgatcccagacatctcagagccagggacctggaagatctcagcccgattctca
gatggcctggaatccaacagcagcacccagtttgaggtgaagaaatatgtccttcccaac
tttgaggtgaagatcacccctggaaagccctacatcctgacggtgccaggccatcttgat
gaaatgcagttagacatccaggccaggtacatctatgggaagccagtgcagggggtggca
tatgtgcgctttgggctcctagatgaggatggtaagaagactttctttcgggggctggag
agtcagaccaagctggtgaatggacagagccacatttccctctcaaaggcagagttccag
gacgccctggagaagctgaatatgggcattactgacctccaggggctgcgcctctacgtt
gctgcagccatcattgagtctccaggtggggagatggaggaggcagagctcacatcctgg
tattttgtgtcatctcccttctccttggatcttagcaagaccaagcgacaccttgtgcct
ggggcccccttcctgctgcaggccttggtccgtgagatgtcaggctccccagcttctggc
attcctgtcaaagtttctgccacggtgtcttctcctgggtctgttcctgaagtccaggac
attcagcaaaacacagacgggagcggccaagtcagcattccaataattatccctcagacc
atctcagagctgcagctctcagtatctgcaggctccccacatccagcgatagccaggctc
actgtggcagccccaccttcaggaggccccgggtttctgtctattgagcggccggattct
cgacctcctcgtgttggggacactctgaacctgaacttgcgagccgtgggcagtggggcc
accttttctcattactactacatgatcctatcccgagggcagatcgtgttcatgaatcga
gagcccaagaggaccctgacctcggtctcggtgtttgtggaccatcacctggcaccctcc
ttctactttgtggccttctactaccatggagaccacccagtggccaactccctgcgagtg
gatgtccaggctggggcctgcgagggcaagctggagctcagcgtggacggtgccaagcag
taccggaacggggagtccgtgaagctccacttagaaaccgactccctagccctggtggcg
ctgggagccttggacacagctctgtatgctgcaggcagcaagtcccacaagcccctcaac
atgggcaaggtctttgaagctatgaacagctatgacctcggctgtggtcctgggggtggg
gacagtgcccttcaggtgttccaggcagcgggcctggccttttctgatggagaccagtgg
accttatccagaaagagactaagctgtcccaaggagaagacaacccggaaaaagagaaac
gtgaacttccaaaaggcgattaatgagaaattgggtcagtatgcttccccgacagccaag
cgctgctgccaggatggggtgacacgtctgcccatgatgcgttcctgcgagcagcgggca
gcccgcgtgcagcagccggactgccgggagcccttcctgtcctgctgccaatttgctgag
agtctgcgcaagaagagcagggacaagggccaggcgggcctccaacgagccctggagatc
ctgcaggaggaggacctgattgatgaggatgacattcccgtgcgcagcttcttcccagag
aactggctctggagagtggaaacagtggaccgctttcaaatattgacactgtggctcccc
gactctctgaccacgtgggagatccatggcctgagcctgtccaaaaccaaaggcctatgt
gtggccaccccagtccagctccgggtgttccgcgagttccacctgcacctccgcctgccc
atgtctgtccgccgctttgagcagctggagctgcggcctgtcctctataactacctggat
aaaaacctgactgtgagcgtccacgtgtccccagtggaggggctgtgcctggctgggggc
ggagggctggcccagcaggtgctggtgcctgcgggctctgcccggcctgttgccttctct
gtggtgcccacggcagccgccgctgtgtctctgaaggtggtggctcgagggtccttcgaa
ttccctgtgggagatgcggtgtccaaggttctgcagattgagaaggaaggggccatccat
agagaggagctggtctatgaactcaaccccttggaccaccgaggccggaccttggaaata
cctggcaactctgatcccaatatgatccctgatggggactttaacagctacgtcagggtt
acagcctcagatccattggacactttaggctctgagggggccttgtcaccaggaggcgtg
gcctccctcttgaggcttcctcgaggctgtggggagcaaaccatgatctacttggctccg
acactggctgcttcccgctacctggacaagacagagcagtggagcacactgcctcccgag
accaaggaccacgccgtggatctgatccagaaaggctacatgcggatccagcagtttcgg
aaggcggatggttcctatgcggcttggttgtcacgggacagcagcacctggctcacagcc
tttgtgttgaaggtcctgagtttggcccaggagcaggtaggaggctcgcctgagaaactg
caggagacatctaactggcttctgtcccagcagcaggctgacggctcgttccaggacccc
tgtccagtgttagacaggagcatgcaggggggtttggtgggcaatgatgagactgtggca
ctcacagcctttgtgaccatcgcccttcatcatgggctggccgtcttccaggatgagggt
gcagagccattgaagcagagagtggaagcctccatctcaaaggcaaactcatttttgggg
gagaaagcaagtgctgggctcctgggtgcccacgcagctgccatcacggcctatgccctg
acactgaccaaggcgcctgtggacctgctcggtgttgcccacaacaacctcatggcaatg
gcccaggagactggagataacctgtactggggctcagtcactggttctcagagcaatgcc
gtgtcgcccaccccggctcctcgcaacccatccgaccccatgccccaggccccagccctg
tggattgaaaccacagcctacgccctgctgcacctcctgcttcacgagggcaaagcagag
atggcagaccaggcttcggcctggctcacccgtcagggcagcttccaagggggattccgc
agtacccaagacacggtgattgccctggatgccctgtctgcctactggattgcctcccac
accactgaggagaggggtctcaatgtgactctcagctccacaggccggaatgggttcaag
tcccacgcgctgcagctgaacaaccgccagattcgcggcctggaggaggagctgcagttt
tccttgggcagcaagatcaatgtgaaggtgggaggaaacagcaaaggaaccctgaaggtc
cttcgtacctacaatgtcctggacatgaagaacacgacctgccaggacctacagatagaa
gtgacagtcaaaggccacgtcgagtacacgatggaagcaaacgaggactatgaggactat
gagtacgatgagcttccagccaaggatgacccagatgcccctctgcagcccgtgacaccc
ctgcagctgtttgagggtcggaggaaccgccgcaggagggaggcgcccaaggtggtggag
gagcaggagtccagggtgcactacaccgtgtgcatctggcggaacggcaaggtggggctg
tctggcatggccatcgcggacgtcaccctcctgagtggattccacgccctgcgtgctgac
ctggagaagctgacctccctctctgaccgttacgtgagtcactttgagaccgaggggccc
cacgtcctgctgtattttgactcggtccccacctcccgggagtgcgtgggctttgaggct
gtgcaggaagtgccggtggggctggtgcagccggccagcgcaaccctgtacgactactac
aaccccgagcgcagatgttctgtgttttacggggcaccaagtaagagcagactcttggcc
accttgtgttctgctgaagtctgccagtgtgctgaggggaagtgccctcgccagcgtcgc
gccctggagcggggtctgcaggacgaggatggctacaggatgaagtttgcctgctactac
ccccgtgtggagtacggcttccaggttaaggttctccgagaagacagcagagctgctttc
cgcctctttgagaccaagatcacccaagtcctgcacttcaccaaggatgtcaaggccgct
gctaatcagatgcgcaacttcctggttcgagcctcctgccgccttcgcttggaacctggg
aaagaatatttgatcatgggtctggatggggccacctatgacctcgagggacacccccag
tacctgctggactcgaatagctggatcgaggagatgccctctgaacgcctgtgccggagc
acccgccagcgggcagcctgtgcccagctcaacgacttcctccaggagtatggcactcag
gggtgccaggtgtga

KEGG   Homo sapiens (human): 721
Entry
721               CDS       T01001                                 
Symbol
C4B, C4B1, C4B12, C4B2, C4B3, C4B5, C4BD, C4B_2, C4F, CH, CO4, CPAMD3
Name
(RefSeq) complement C4-B preproprotein
  KO
K03989  complement component 4
Organism
hsa  Homo sapiens (human)
Pathway
hsa04610  Complement and coagulation cascades
hsa04936  Alcoholic liver disease
hsa05133  Pertussis
hsa05150  Staphylococcus aureus infection
hsa05171  Coronavirus disease
hsa05322  Systemic lupus erythematosus
Network
nt06164  Kaposi sarcoma-associated herpesvirus (KSHV)
nt06171  SARS coronavirus 2 (SARS-CoV-2)
nt06513  Complement cascade
  Element
N01487  Classical pathway of complement cascade, C4/C2 to C3 convertase formation
N01491  Lectin pathway of complement cascade, C4/C2 to C3 convertase formation
Disease
H00102  Classic complement pathway component defects
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    721 (C4B)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease
    721 (C4B)
  09171 Infectious disease: bacterial
   05133 Pertussis
    721 (C4B)
   05150 Staphylococcus aureus infection
    721 (C4B)
  09163 Immune disease
   05322 Systemic lupus erythematosus
    721 (C4B)
  09167 Endocrine and metabolic disease
   04936 Alcoholic liver disease
    721 (C4B)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    721 (C4B)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:hsa04147]
    721 (C4B)
Peptidases and inhibitors [BR:hsa01002]
 Peptidase inhibitors
  Family I39: alpha2M family
   721 (C4B)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of hepatic cells
   721 (C4B)
  Exosomal proteins of other cancer cells
   721 (C4B)
SSDB
Motif
Pfam: TED_complement C4_MG1 CO4A-B_CUB_C A2M_recep A2M A2M_BRD NTR MG2 MG3 ANATO MG4 ParB_C DUF7363 Big_1
Other DBs
NCBI-GeneID: 721
NCBI-ProteinID: NP_001002029
OMIM: 120820
HGNC: 1324
Ensembl: ENSP00000415941.2
CPD: C22541 C22542 C22543 C22544
UniProt: P0C0L4 P0C0L5
Structure
LinkDB
Position
6:32014795..32035418
AA seq 1744 aa
MRLLWGLIWASSFFTLSLQKPRLLLFSPSVVHLGVPLSVGVQLQDVPRGQVVKGSVFLRN
PSRNNVPCSPKVDFTLSSERDFALLSLQVPLKDAKSCGLHQLLRGPEVQLVAHSPWLKDS
LSRTTNIQGINLLFSSRRGHLFLQTDQPIYNPGQRVRYRVFALDQKMRPSTDTITVMVEN
SHGLRVRKKEVYMPSSIFQDDFVIPDISEPGTWKISARFSDGLESNSSTQFEVKKYVLPN
FEVKITPGKPYILTVPGHLDEMQLDIQARYIYGKPVQGVAYVRFGLLDEDGKKTFFRGLE
SQTKLVNGQSHISLSKAEFQDALEKLNMGITDLQGLRLYVAAAIIESPGGEMEEAELTSW
YFVSSPFSLDLSKTKRHLVPGAPFLLQALVREMSGSPASGIPVKVSATVSSPGSVPEVQD
IQQNTDGSGQVSIPIIIPQTISELQLSVSAGSPHPAIARLTVAAPPSGGPGFLSIERPDS
RPPRVGDTLNLNLRAVGSGATFSHYYYMILSRGQIVFMNREPKRTLTSVSVFVDHHLAPS
FYFVAFYYHGDHPVANSLRVDVQAGACEGKLELSVDGAKQYRNGESVKLHLETDSLALVA
LGALDTALYAAGSKSHKPLNMGKVFEAMNSYDLGCGPGGGDSALQVFQAAGLAFSDGDQW
TLSRKRLSCPKEKTTRKKRNVNFQKAINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRA
ARVQQPDCREPFLSCCQFAESLRKKSRDKGQAGLQRALEILQEEDLIDEDDIPVRSFFPE
NWLWRVETVDRFQILTLWLPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREFHLHLRLP
MSVRRFEQLELRPVLYNYLDKNLTVSVHVSPVEGLCLAGGGGLAQQVLVPAGSARPVAFS
VVPTAATAVSLKVVARGSFEFPVGDAVSKVLQIEKEGAIHREELVYELNPLDHRGRTLEI
PGNSDPNMIPDGDFNSYVRVTASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAP
TLAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRGSSTWLTA
FVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGGLVGNDETVA
LTAFVTIALHHGLAVFQDEGAEPLKQRVEASISKASSFLGEKASAGLLGAHAAAITAYAL
TLTKAPADLRGVAHNNLMAMAQETGDNLYWGSVTGSQSNAVSPTPAPRNPSDPMPQAPAL
WIETTAYALLHLLLHEGKAEMADQAAAWLTRQGSFQGGFRSTQDTVIALDALSAYWIASH
TTEERGLNVTLSSTGRNGFKSHALQLNNRQIRGLEEELQFSLGSKINVKVGGNSKGTLKV
LRTYNVLDMKNTTCQDLQIEVTVKGHVEYTMEANEDYEDYEYDELPAKDDPDAPLQPVTP
LQLFEGRRNRRRREAPKVVEEQESRVHYTVCIWRNGKVGLSGMAIADVTLLSGFHALRAD
LEKLTSLSDRYVSHFETEGPHVLLYFDSVPTSRECVGFEAVQEVPVGLVQPASATLYDYY
NPERRCSVFYGAPSKSRLLATLCSAEVCQCAEGKCPRQRRALERGLQDEDGYRMKFACYY
PRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNFLVRASCRLRLEPG
KEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQ
GCQV
NT seq 5235 nt   +upstreamnt  +downstreamnt
atgaggctgctctgggggctgatctgggcatccagcttcttcaccttatctctgcagaag
cccaggttgctcttgttctctccttctgtggttcatctgggggtccccctatcggtgggg
gtgcagctccaggatgtgccccgaggacaggtagtgaaaggatcagtgttcctgagaaac
ccatctcgtaataatgtcccctgctccccaaaggtggacttcacccttagctcagaaaga
gacttcgcactcctcagtctccaggtgcccttgaaagatgcgaagagctgtggcctccat
caactcctcagaggccctgaggtccagctggtggcccattcgccatggctaaaggactct
ctgtccagaacgacaaacatccagggtatcaacctgctcttctcctctcgccgggggcac
ctctttttgcagacggaccagcccatttacaaccctggccagcgggttcggtaccgggtc
tttgctctggatcagaagatgcgcccgagcactgacaccatcacagtcatggtggagaac
tctcacggcctccgcgtgcggaagaaggaggtgtacatgccctcgtccatcttccaggat
gactttgtgatcccagacatctcagagccagggacctggaagatctcagcccgattctca
gatggcctggaatccaacagcagcacccagtttgaggtgaagaaatatgtccttcccaac
tttgaggtgaagatcacccctggaaagccctacatcctgacggtgccaggccatcttgat
gaaatgcagttagacatccaggccaggtacatctatgggaagccagtgcagggggtggca
tatgtgcgctttgggctcctagatgaggatggtaagaagactttctttcgggggctggag
agtcagaccaagctggtgaatggacagagccacatttccctctcaaaggcagagttccag
gacgccctggagaagctgaatatgggcattactgacctccaggggctgcgcctctacgtt
gctgcagccatcattgagtctccaggtggggagatggaggaggcagagctcacatcctgg
tattttgtgtcatctcccttctccttggatcttagcaagaccaagcgacaccttgtgcct
ggggcccccttcctgctgcaggccttggtccgtgagatgtcaggctccccagcttctggc
attcctgtcaaagtttctgccacggtgtcttctcctgggtctgttcctgaagtccaggac
attcagcaaaacacagacgggagcggccaagtcagcattccaataattatccctcagacc
atctcagagctgcagctctcagtatctgcaggctccccacatccagcgatagccaggctc
actgtggcagccccaccttcaggaggccccgggtttctgtctattgagcggccggattct
cgacctcctcgtgttggggacactctgaacctgaacttgcgagccgtgggcagtggggcc
accttttctcattactactacatgatcctatcccgagggcagatcgtgttcatgaatcga
gagcccaagaggaccctgacctcggtctcggtgtttgtggaccatcacctggcaccctcc
ttctactttgtggccttctactaccatggagaccacccagtggccaactccctgcgagtg
gatgtccaggctggggcctgcgagggcaagctggagctcagcgtggacggtgccaagcag
taccggaacggggagtccgtgaagctccacttagaaaccgactccctagccctggtggcg
ctgggagccttggacacagctctgtatgctgcaggcagcaagtcccacaagcccctcaac
atgggcaaggtctttgaagctatgaacagctatgacctcggctgtggtcctgggggtggg
gacagtgcccttcaggtgttccaggcagcgggcctggccttttctgatggagaccagtgg
accttatccagaaagagactaagctgtcccaaggagaagacaacccggaaaaagagaaac
gtgaacttccaaaaggcgattaatgagaaattgggtcagtatgcttccccgacagccaag
cgctgctgccaggatggggtgacacgtctgcccatgatgcgttcctgcgagcagcgggca
gcccgcgtgcagcagccggactgccgggagcccttcctgtcctgctgccaatttgctgag
agtctgcgcaagaagagcagggacaagggccaggcgggcctccaacgagccctggagatc
ctgcaggaggaggacctgattgatgaggatgacattcccgtgcgcagcttcttcccagag
aactggctctggagagtggaaacagtggaccgctttcaaatattgacactgtggctcccc
gactctctgaccacgtgggagatccatggcctgagcctgtccaaaaccaaaggcctatgt
gtggccaccccagtccagctccgggtgttccgcgagttccacctgcacctccgcctgccc
atgtctgtccgccgctttgagcagctggagctgcggcctgtcctctataactacctggat
aaaaacctgactgtgagcgtccacgtgtccccagtggaggggctgtgcctggctgggggc
ggagggctggcccagcaggtgctggtgcctgcgggctctgcccggcctgttgccttctct
gtggtgcccacggcagccaccgctgtgtctctgaaggtggtggctcgagggtccttcgaa
ttccctgtgggagatgcggtgtccaaggttctgcagattgagaaggaaggggccatccat
agagaggagctggtctatgaactcaaccccttggaccaccgaggccggaccttggaaata
cctggcaactctgatcccaatatgatccctgatggggactttaacagctacgtcagggtt
acagcctcagatccattggacactttaggctctgagggggccttgtcaccaggaggcgtg
gcctccctcttgaggcttcctcgaggctgtggggagcaaaccatgatctacttggctccg
acactggctgcttcccgctacctggacaagacagagcagtggagcacactgcctcccgag
accaaggaccacgccgtggatctgatccagaaaggctacatgcggatccagcagtttcgg
aaggcggatggttcctatgcggcttggttgtcacggggcagcagcacctggctcacagcc
tttgtgttgaaggtcctgagtttggcccaggagcaggtaggaggctcgcctgagaaactg
caggagacatctaactggcttctgtcccagcagcaggctgacggctcgttccaggacctc
tctccagtgatacataggagcatgcaggggggtttggtgggcaatgatgagactgtggca
ctcacagcctttgtgaccatcgcccttcatcatgggctggccgtcttccaggatgagggt
gcagagccattgaagcagagagtggaagcctccatctcaaaggcaagctcatttttgggg
gagaaagcaagtgctgggctcctgggtgcccacgcagctgccatcacggcctatgccctg
acactgaccaaggcccctgcggacctgcggggtgttgcccacaacaacctcatggcaatg
gcccaggagactggagataacctgtactggggctcagtcactggttctcagagcaatgcc
gtgtcgcccaccccggctcctcgcaacccatccgaccccatgccccaggccccagccctg
tggattgaaaccacagcctacgccctgctgcacctcctgcttcacgagggcaaagcagag
atggcagaccaggctgcggcctggctcacccgtcagggcagcttccaagggggattccgc
agtacccaagacacggtgattgccctggatgccctgtctgcctactggattgcctcccac
accactgaggagaggggtctcaatgtgactctcagctccacaggccggaatgggttcaag
tcccacgcgctgcagctgaacaaccgccagattcgcggcctggaggaggagctgcagttt
tccttgggcagcaagatcaatgtgaaggtgggaggaaacagcaaaggaaccctgaaggtc
cttcgtacctacaatgtcctggacatgaagaacacgacctgccaggacctacagatagaa
gtgacagtcaaaggccacgtcgagtacacgatggaagcaaacgaggactatgaggactat
gagtacgatgagcttccagccaaggatgacccagatgcccctctgcagcccgtgacaccc
ctgcagctgtttgagggtcggaggaaccgccgcaggagggaggcgcccaaggtggtggag
gagcaggagtccagggtgcactacaccgtgtgcatctggcggaacggcaaggtggggctg
tctggcatggccatcgcggacgtcaccctcctgagtggattccacgccctgcgtgctgac
ctggagaagctgacctccctctctgaccgttacgtgagtcactttgagaccgaggggccc
cacgtcctgctgtattttgactcggtccccacctcccgggagtgcgtgggctttgaggct
gtgcaggaagtgccggtggggctggtgcagccggccagcgcaaccctgtacgactactac
aaccccgagcgcagatgttctgtgttttacggggcaccaagtaagagcagactcttggcc
accttgtgttctgctgaagtctgccagtgtgctgaggggaagtgccctcgccagcgtcgc
gccctggagcggggtctgcaggacgaggatggctacaggatgaagtttgcctgctactac
ccccgtgtggagtacggcttccaggttaaggttctccgagaagacagcagagctgctttc
cgcctctttgagaccaagatcacccaagtcctgcacttcaccaaggatgtcaaggccgct
gctaatcagatgcgcaacttcctggttcgagcctcctgccgccttcgcttggaacctggg
aaagaatatttgatcatgggtctggatggggccacctatgacctcgagggacacccccag
tacctgctggactcgaatagctggatcgaggagatgccctctgaacgcctgtgccggagc
acccgccagcgggcagcctgtgcccagctcaacgacttcctccaggagtatggcactcag
gggtgccaggtgtga

DBGET integrated database retrieval system