KEGG   Colletes gigas: 122404627
Entry
122404627         CDS       T07492                                 
Name
(RefSeq) collagen alpha-1(XVIII) chain isoform X1
  KO
K06823  collagen type XVIII alpha
Organism
cgig  Colletes gigas
Brite
KEGG Orthology (KO) [BR:cgig00001]
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:cgig04147]
    122404627
   00535 Proteoglycans [BR:cgig00535]
    122404627
   00536 Glycosaminoglycan binding proteins [BR:cgig00536]
    122404627
Exosome [BR:cgig04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   122404627
Proteoglycans [BR:cgig00535]
 Extracellular matrix (ECM) proteoglycans
  Basement membrane proteoglycans
   122404627
Glycosaminoglycan binding proteins [BR:cgig00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   122404627
SSDB
Motif
Pfam: Endostatin Collagen Collagen_trimer Laminin_G_3 DUF1554
Other DBs
NCBI-GeneID: 122404627
NCBI-ProteinID: XP_043264698
LinkDB
Position
Unknown
AA seq 1133 aa
MRPKLQLVIFAAFCATYANADFFGEKKELVYDLLQASVASTDDNNLYIDDALDGFPAFGF
RPGSEVKQPYRLYLPEKLPAEFTLVATLKPMSFRTSYLFAVLNPFETVVQLGIRISDGPG
TNQNVSLIYTNSDQNSHSEEVAKFTVPKLTKKWSKIVIRVSLTEVTLYLNCHEMAKQRVT
RMPPELVFDTASTLYIAQAGPHIQEKYDGLLQTLKLYSGHPADLVKCTADFNFDAEEDLG
SGDLDPELIDGSGNIDINKIDIGRDDDEDRSEESNPPPFITPPPPNPDYKGPKGDKGDKG
DKGESVRGPPGPPGPPGQDEGPQGEKGEPGTCACNATALMASFTMPKMIQGPKGEQGVPG
QEGKQGQMGLTGAAGPPGERGLEGPHGAKGDKGDVGVQGPEGPQGQKGEPGRDGIPGEKG
AQGPPGPPGKDEYSSYNLSWGRRGAYRMDEITMRPGLPGQKGEAGIPGNPGQKGESGIVG
AKGNRGESGHKGVKGDHGKEGPRGIQGFKGEPGAPGTPGLPGAPGENGRPAEKGDKGDTG
PEGKLGPPGPPGPPGVSGVTGPGAINVGESVLGEKGDRGEMGPRGHKGDKGTKGEKGDKG
NSGPAGIPGVNGIQGPQGNKGEPGKDGDAGAPGVGGSKGDKGERGPPGATAIASSGDYIT
IKGEKGTAGKRGTKGHRGPPGPAGPPGKPGIMGEIGLPGWMGRPGNPGLPGPVGPGGPKG
EKGEPGTPGPYGVSVGALNNVQIKGDKGDEGFPGIPGQAGRDGQRGPPGPPGPPSQGNYI
PVPGPPGPPGPPGPPGMLGGQKEESGSSRSHIFGEKDYYGIRQGSATVATRGPLESTTKI
VPGAVTFQNTEAMTKMSAVSPVGTLAYIIDEQALLVRVNNGWQYIALGSLLSITTPAPPT
TSPPPANPPFEASNLINQIPVKADGTGWYPRMLRMAALNEPFTGDMHGVRGADYACYRQA
KRAGLRGTFRAFLSSRVQNVDSIVRLGDRDLPIVNIKGDVLFNSWKEMFNGNGAYFSQNP
RIYSFNGKNILSDFAWPEKVAWHGSHKLGDRAMDTYCDAWHSSNSDRYGLGSPLTGGRLL
EQVRYSCDNKFALLCIEVTSEITRRRRSVEVTEDAEEMSENDYKEYLDALMDN
NT seq 3402 nt   +upstreamnt  +downstreamnt
atgcggcctaaattgcaattggtgatattcgccgcgttctgtgcaacgtacgccaacgcg
gatttcttcggcgagaagaaagaactggtctatgatttactgcaggcatccgtggcatcg
acagacgacaataatttgtacatcgacgatgctctcgatggatttccggctttcggtttc
cgacctggctccgaagttaaacaaccctatcgattatatctcccggaaaaattgccagcc
gagtttactctggtggcaactttgaagccaatgtccttcagaaccagttatctgtttgca
gttctcaatcccttcgaaactgttgtgcaattaggcattcgaatttcggacgggccggga
accaatcaaaacgtgtcgttgatttacacgaactccgatcagaattcgcattcggaggaa
gtagcgaaattcactgtgccgaagctaacaaagaaatggtcgaagatcgttatcagagtt
tctctcaccgaggtcactttataccttaattgccacgaaatggccaaacaaagagtgacc
agaatgcctccggaattagtgttcgataccgccagcacgctctacatcgctcaagctgga
ccgcatattcaagagaaatacgacggtctactgcaaacgttgaagctctattccggacat
ccggcggatttggtaaagtgcacagccgacttcaatttcgatgcggaggaggaccttggc
tcgggcgatcttgatcccgagctgatcgacggttcggggaatattgacattaataagata
gacatcggacgagacgatgacgaagacagaagcgaggaaagcaatccaccgccattcatc
acacctcctcctcccaatccggactacaaagggccgaaaggcgataaaggtgataaagga
gacaagggggaaagcgtcagaggacccccagggcctccaggtccaccaggacaggacgag
ggtccgcaaggggagaaaggagagcctggcacgtgcgcctgcaacgcaacagccctgatg
gcgtcctttacgatgccaaagatgatccagggaccgaaaggagaacaaggagtaccgggt
caagaggggaaacaagggcaaatggggctgacgggcgcagcaggaccgccgggagagaga
ggactcgaaggacctcatggtgctaaaggcgataaaggagacgtgggcgtgcagggaccg
gaaggtccccaaggacaaaaaggtgaaccgggtcgtgacggaataccgggggaaaaaggc
gcccaaggacctccaggaccgccgggaaaggacgaatactctagctacaatctcagttgg
ggacgccggggagcttacaggatggatgaaatcacgatgaggccagggctgccaggacag
aagggcgaggcaggcattccagggaatccaggacagaaaggggagtcgggaatcgtgggt
gccaaaggaaacaggggcgaatcgggacacaagggcgttaaaggcgatcacgggaaagaa
gggccgcgaggaattcagggattcaaaggtgaacccggggctcctggaacgccggggcta
cctggggctccgggtgagaacggaaggccggctgagaagggcgacaagggcgacacggga
ccggaagggaaactaggtcctccgggaccaccaggaccaccaggcgtgtccggcgtcact
ggacctggggcaataaacgttggggaatcagtattaggagagaaaggtgacaggggcgag
atgggtccgcgcgggcataaaggagataagggcaccaaaggggagaaaggagataaaggt
aactcaggaccagccgggattcccggtgtgaacggcattcaaggaccccaaggcaacaaa
ggagagccaggcaaagatggagacgccggcgcgccaggagtcggtggttcaaagggcgat
aaaggcgagagaggaccgccaggagccacggctatagcgagctccggtgactacattact
atcaagggtgagaagggcacggcgggcaagaggggtaccaaaggacatcgaggaccacct
ggtcccgctggaccacctggaaaaccgggaattatgggagaaattgggttacccggatgg
atgggccgccctggaaatcctggactacccggacctgttggaccaggaggacccaagggg
gaaaaaggtgaacccggaacaccgggtccttacggagtttccgttggtgccttgaacaat
gttcagataaaaggcgataaaggagacgaaggtttccctgggattcctggacaagctgga
agggacggccaaagaggacctccaggaccacctggaccaccgtctcaaggaaactatatt
ccagtcccaggccccccagggccccctggaccaccaggaccgccaggaatgctaggtgga
cagaaagaagagtcaggatcaagtagaagccatatatttggcgagaaggactattacgga
atcagacaaggttccgcgaccgtggcaacgagaggtcctttggagagcacgacgaagatc
gtacctggagcagtgactttccaaaacaccgaagccatgacaaaaatgtccgctgtaagt
ccggttggaacattggcgtacatcatagacgaacaggcgcttctcgttagagtgaacaat
ggctggcaatacattgcattgggatctcttttatcaataactactccagcaccgccgacc
acgtcgccacctccagcaaacccaccttttgaggcttccaacctgatcaaccagatacct
gtaaaagcagatggaacggggtggtatccacgaatgttacgaatggctgctctgaacgaa
ccgtttaccggagacatgcacggcgtacgaggagctgactatgcctgttatcgacaagca
aagcgagctggcttgaggggaacgttccgtgcattcctcagctccagggttcaaaacgtc
gacagcatcgtgagactcggggacagagacctccctatcgttaacataaagggagacgtc
ctcttcaactcttggaaggaaatgttcaacggaaacggagcgtacttctctcagaaccct
agaatttacagtttcaatgggaagaacatcctcagtgattttgcatggccagaaaaggtg
gcttggcacggctcgcacaaattaggagaccgcgcgatggacacgtactgcgacgcctgg
cattcgagcaattctgatcgttacggattaggatcgcccctaacggggggtcgtctcttg
gagcaagttcgttattcgtgcgacaacaagttcgctctgctctgcatagaagtgaccagc
gagattacaagaagaaggcggagcgtcgaagttacggaggacgcggaggaaatgtcggag
aacgattacaaggagtacttggacgctctgatggacaactaa

DBGET integrated database retrieval system