KEGG   Bombus pyrosoma: 122572712
Entry
122572712         CDS       T07776                                 
Name
(RefSeq) collagen alpha-1(XVIII) chain isoform X1
  KO
K06823  collagen type XVIII alpha
Organism
bpyo  Bombus pyrosoma
Brite
KEGG Orthology (KO) [BR:bpyo00001]
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bpyo04147]
    122572712
   00535 Proteoglycans [BR:bpyo00535]
    122572712
   00536 Glycosaminoglycan binding proteins [BR:bpyo00536]
    122572712
Exosome [BR:bpyo04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   122572712
Proteoglycans [BR:bpyo00535]
 Extracellular matrix (ECM) proteoglycans
  Basement membrane proteoglycans
   122572712
Glycosaminoglycan binding proteins [BR:bpyo00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   122572712
Motif
Pfam: Endostatin Collagen Collagen_trimer Laminin_G_3 DUF1554 RH_dom
Other DBs
NCBI-GeneID: 122572712
NCBI-ProteinID: XP_043593977
Position
LG11:complement(3986322..4330612)
AA seq 1178 aa
MRPRLQWLIFVLFCVTRVNADFFNEKKELEYDLLQASVALTDDNNLYIDDGLDGFPSFGF
RPGSEVKQPYRLYLPEKLPAEFTLVATFKPTSFRTSYLFAVLNPFETVVQLGIRISDGPG
TNQNVSLVYTNSDLHSHSEEVAKFTVPKLTKKWSKIVIRVSTTDVTLYLNCHEMAKQRVT
RIPLELMFDTASTLYIAQAGPHIQEKYDGLLQSLKLYSGHPADLVRCTADFNFNPDEDLG
SGDINNDLIDGLEDVDTNVPDIGRDDDEDRSEESNPPPFITPPPPNPDYKGPKGEKGDKG
DKGESVRGPPGPPGPPGQDEGPPGKKGEPGTCTCNATALMASFTMPKMIQGPKGEQGVPG
QEGKQGQMGLTGAAGPPGERGLEGPQGPKGDKGDVGIAGPEGPQGQKGEPGRDGIPGEKG
AQGPPGPPGKGEFSGYDPSWKPRGIYRTEGITMRPGLPGQKGEAGLPGSPGPKGETGIAG
AKGYKGEPGHKGAKGDHGKEGPRGIQGFKGEPGAPGAPGLPGAPGENGRPAEKGDKGDTG
PEGKPGPAGAPGPPGLPGLSGSGGINVGEAMLREKGDKGESGARGYKGDKGTKGEKGDKG
DSGPAGIPGVNGIQGPQGNKGEPGKDGVPGAPGVVGAKGEKGERGPPGATAIASSGDYIT
IKGEKGAEGKRGRRGRPGPPGPVGPPGKTGAMGEIGLPGWANSMKGRPGTPGLPGPVGPV
GPKGEKGEPGTPSPYGVSVGIKGDKGDDGFPGIPGQPGRDGQRGPPGPPGPPGPPSQGNY
IPVPGPPGPPGPPGPPGLSLIGQKGEPGIGRSHIFGERDYYGVRQVPKNIKDFKHNMFFG
TLQGPRTSLDELKALRELKQLKELKEHLGAATTATRGPLESTTKIVPGAVTFQNTEAMTK
MSAVSPVGTLAYIIDEQALLVRVNNGWQYIALGSLLPITTPAPPTTSPPPVNPPFEASNL
INQIPVKADGTGWYPRMLRMAALNEPFTGDMHGVRGADYACYRQAKRAGLRGTFRAFLSS
RVQNVDSIVRLGDRDLPIVNIKGDVLFNSWKEMFNGNGAYFSQNPRIYSFNGKNILTDFA
WPEKVAWHGSHKLGDRAMDTYCDAWHSSSSDRYGLGSPLTGGRLLEQVRYSCDNKFALLC
IEVTSETTRRRRSVEISEDEDEMSENDYKEYLDSLMEN
NT seq 3537 nt
atgcggcctagattgcagtggttgatattcgtcctgttctgtgtgacgcgcgtcaacgcg
gactttttcaacgagaaaaaagagctggaatatgatttacttcaagcttccgtggcatta
acagacgacaataatttgtacatcgacgatggtctcgacggatttccaagttttggcttc
cgacctggctccgaagttaaacagccgtatcgattatatttgccggaaaaattgccagca
gagttcactttggtagcaacttttaagccgacgtcatttagaaccagctatctcttcgcc
gtcctaaatcccttcgaaactgttgtgcaattaggcatccggatatccgatggtccagga
acaaaccagaatgtctcactggtctacaccaattctgatttacattcgcattcggaagag
gtggcgaaattcacggtgccgaagttgacaaagaaatggtcgaagatcgtaatcagagta
tcgacgactgatgtcaccttatatctaaactgtcacgaaatggccaagcaacgagtaacg
aggattcctctggagttgatgttcgacacagccagcacgttgtacatcgctcaagctggg
cctcatattcaggaaaaatacgatggtctactgcaatccttgaagctctattccggacat
ccagcggatttggtaagatgcacggccgacttcaacttcaacccagacgaggatctcggc
tcaggcgatatcaataacgacttgattgacggtttggaagacgttgacactaatgtgcca
gacatcggacgagacgatgacgaagacagaagcgaggaaagcaatccaccgccattcatc
acaccccctcctcccaatccggactacaaagggccgaagggtgagaaaggtgataaagga
gacaaaggagagagcgttaggggacctccaggtccacctggaccacctggtcaagacgag
ggtccaccagggaagaaaggagaacccggcacgtgcacctgcaatgcaacagccctgatg
gcatcctttacgatgccaaagatgatccaaggaccgaaaggggaacaaggagtgccgggg
caggaggggaaacaaggccaaatggggctgacgggtgcagctggaccacccggagagaga
ggactggaaggaccacagggtcctaagggtgacaaaggagacgtgggaatagcaggaccg
gaaggtcctcaaggacagaaaggggaacctggtcgtgatggaatacccggagagaagggc
gctcaaggacctccaggaccgccaggaaagggtgaattttctggatacgaccccagttgg
aaacctcgaggcatttataggacggagggcatcaccatgagaccaggactgccaggacag
aaaggcgaagccggacttccaggaagcccaggaccgaaaggagagacaggaatcgccggt
gccaaaggatacaaaggcgaaccagggcacaagggtgcaaaaggcgatcacggaaaagaa
ggtccgcgaggaattcaaggattcaagggtgaacctggtgctccgggagcaccggggctt
cctggcgcaccaggtgagaatggaagaccagcagagaaaggtgacaaaggagacacagga
ccagaagggaaaccaggccctgcaggtgcacctggaccaccaggattgcctggattgagc
ggctctggtggaataaacgttggagaagcgatgttaagggaaaagggcgacaagggcgag
agtggtgcacgaggatacaaaggagataagggcaccaaaggcgagaagggagacaaaggt
gattctggaccagccggaattcctggtgtaaatggtatccaaggaccgcaaggcaacaaa
ggcgagccagggaaagacggagttcccggagcaccgggagtcgtcggtgcaaagggcgaa
aaaggtgaaagaggtccgccaggagctacagctatagcgagctctggagactacatcact
atcaaaggtgaaaaaggagcagaaggcaagagaggtagaagagggcgtcctggaccacca
ggaccggtcggcccacctgggaagacgggagcgatgggagaaattgggttacctggatgg
gcgaactccatgaaaggtcgtcctggaactcccggacttcctggacctgttggaccagtg
gggcccaagggagaaaaaggagaacccggtacaccgagcccttacggagtctctgtcggc
ataaaaggcgataagggagacgatggtttccctggaattcctggacaacctggaagagac
ggtcaaagaggacctccagggccacccggaccacctggaccaccgtctcaaggaaactat
attccagttccgggtcccccggggcctcctgggccaccaggaccacctggactatctttg
attggacaaaaaggagaaccaggaattggtagaagtcacatctttggcgaaagagactat
tacggagttagacaggttccaaagaatataaaagattttaaacacaatatgttctttggc
acgctgcaaggacctagaactagcctagacgaattgaaggctctgcgagaactaaagcaa
ctgaaggaacttaaagaacatttaggtgctgcaaccaccgcaactagaggtcccttagaa
agtaccacgaagatcgtaccaggagctgtgactttccaaaataccgaagctatgaccaaa
atgtctgccgtgagcccagttggaacattagcttatatcattgacgaacaagctctgctc
gttagagtgaacaatggctggcaatacatcgcactgggttcgcttctaccaataactaca
ccagcaccaccaaccacgtcaccgcctccggtaaatccacctttcgaggcttccaacctg
atcaaccagatacccgtaaaagcagacggaacagggtggtatccacggatgttacgaatg
gctgctttgaacgagccatttacgggagacatgcacggtgtacgtggagctgactacgct
tgttatcgacaagcgaagcgagctggtttgaggggtacgttccgagcattcctcagctcc
agagttcaaaatgttgacagcatcgtgagacttggggaccgtgatcttcctatcgttaac
ataaagggcgatgtacttttcaactcgtggaaagagatgttcaacggaaacggggcctac
ttctcgcaaaatccaaggatttacagtttcaatggcaaaaatatcctcaccgactttgca
tggccagaaaaggtggcatggcacggatcgcacaaattaggagaccgggcaatggacacg
tattgcgacgcttggcattcaagtagttcggatcgttacggattaggatcaccgttaact
gggggacgcctcttggagcaagttcgttattcctgcgacaacaagttcgcgctgctctgt
atagaagtaaccagcgagaccacaagaaggaggagaagcgtcgaaatttcggaggatgag
gacgaaatgtcggagaacgattataaggagtacttggattctctgatggaaaactaa
