KEGG   Solenopsis invicta (red fire ant): 105196855
Entry
105196855         CDS       T03916                                 
Name
(RefSeq) collagen alpha-1(XXII) chain isoform X1
  KO
K06823  collagen type XVIII alpha
Organism
soc  Solenopsis invicta (red fire ant)
Brite
KEGG Orthology (KO) [BR:soc00001]
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:soc04147]
    105196855
   00535 Proteoglycans [BR:soc00535]
    105196855
   00536 Glycosaminoglycan binding proteins [BR:soc00536]
    105196855
Exosome [BR:soc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   105196855
Proteoglycans [BR:soc00535]
 Extracellular matrix (ECM) proteoglycans
  Basement membrane proteoglycans
   105196855
Glycosaminoglycan binding proteins [BR:soc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   105196855
SSDB
Motif
Pfam: Endostatin Collagen Collagen_trimer Laminin_G_3 DUF1554
Other DBs
NCBI-GeneID: 105196855
NCBI-ProteinID: XP_025990791
LinkDB
Position
6:9125247..9355081
AA seq 1164 aa
MPLRSLLLIFAFALCATCNRADFFGGKEVVYDLMVATVSSMMDENNLYMDDGVDGFPAFG
FRPGSEVKQPYRLYLPEKLPAEFTLVATFKPTSFRTSYLFAVLNPFETVVQLGIRISDGP
GSNQNVSLVYTNSDEHSHSEEVAKFTVPKLTKKWSKIVIKVLTNDVTLYLNCHEMARQRV
TRIPQELVFDTASTLYIAQAGPHIQERYDGLLQSLKLYAGHPPDLVKCSADFDFSADEEV
ASGDYDVSLFDGSGDADLNKISRDVDEDKSEESNPPPFITPPPPNPDYKGPKGEKGDKGD
KGESVRGPPGPPGPPGRDEDWLMKIPQGPPGQKGDPGTCTCNATALMSSFTMPKMIQGPK
GEPGVPGQEGKQGLMGLTGAAGPPGERGLHGPSGSKGDKGDIGIAGPEGPQGQKGEPGRD
GIPGEKGAQGPPGPPGKGEFSGYDPSWKPRNIYRPEGITMRPGLPGQKGEPGISGNPGPK
GESGIPGSKGIKGEPGYKGVKGDHGKDGPRGIQGFKGEPGAPGAPGLPGAPGENGRPAEK
GDKGDTGPEGKLGPPGPPGPPGMGGSGSINVGDLGFGTKGDKGDGGARGYKGDKGTKGEK
GDRGDSGPAGIPGINGIQGPQGDKGEPGKDGVSGLPGTPGTKGERGERGPPGATTVANSG
DYITIKGEKGAEGKRGRRGRPGPPGPVGPPGKPGTTGEIGLPGWMNTMKGRPGTPGIPGS
IGPMGPKGDKGEPGAPSPYGVSVGIKGDKGDDGFPGIPGQPGREGQRGPPGPPGPPGAPS
QGKYMPVPGPPGPPGPPGPPGLSLIGQKGEPGIGRSHVFGERDYYPPRQGARSSLDELKA
LRELKQLKELKEQLGVVTATRGPLESTTKIVPGAVTFQNTEAMTKMSSVSPVGTLAYIID
EQALLVRVNNGWQYIALGSLLPITTPAPPTTSPPPANPPFEASNLINQIPVKADGTGWYP
RMLRMAALNEPFTGDMHGIRGADYACYRQAKRAGLRGTFRAFLSSRVQNVDSIVRLGDRD
LPIVNIKGDVLFNSWKEMFNGNGAYFSQNPRIYSFNGKNIFTDFAWPDKVAWHGSHKLGD
RAMDTYCDAWHSSSSDRYGLGSPLTGGRLLDQVRYSCDNKFALLCIEVTSELVRRRRNAD
NRPDDDIEMSENDYMEYLEELMQY
NT seq 3495 nt   +upstreamnt  +downstreamnt
atgccgcttaggtcgttactgttgatcttcgccttcgcgctatgcgcgacgtgcaaccgc
gctgacttcttcggcgggaaggaagtggtgtacgatttgatggtggcaacggtgtcatcg
atgatggacgagaataacctttacatggacgacggcgtagacggatttccggcgtttggc
tttcgacccggctccgaagtgaagcaaccctacagactgtacctcccggagaaattgccg
gccgagttcaccctagtggccacgttcaagccgacctcctttagaacgagctacctcttc
gccgttcttaatcccttcgaaaccgttgtccaattgggcattagaatttcggatgggccg
ggctctaaccagaatgtctcactggtctacactaattcggacgagcattcgcattcggag
gaggtggcgaaattcaccgtgccaaagcttacgaagaagtggtcgaagatcgtcattaag
gttttgacgaacgacgtcaccctctatctcaattgccacgagatggctaggcagagggtt
accaggatcccccaggaattagtattcgatactgccagtacgctgtacatcgcgcaagcc
ggaccgcacatccaggaacgatacgacggtctactgcaatctttgaaattgtacgccggc
catcctccggaccttgtaaaatgctctgccgattttgacttttcagcagatgaagaggtc
gcttctggtgattacgatgtcagtttgttcgacggttctggcgacgctgatttaaataag
atcagtcgagatgtggacgaagacaaaagcgaggagagtaacccgccaccattcatcacg
cccccgcctccgaatccagattataaaggtccgaaaggggagaagggtgacaaaggggat
aagggcgagagcgtcagaggtcccccaggtcctcctggccctcctggtcgcgacgaagat
tggctaatgaagataccacagggaccacctggtcagaagggagaccccggcacttgcacc
tgcaacgccacggccttaatgtcttcctttacgatgccaaagatgatacaagggccgaag
ggtgagccaggagtacccggacaggaaggcaagcagggcctaatgggcctcacgggtgcg
gcaggaccgccaggggaaagaggactgcatggtccgtccgggagtaaaggtgacaagggt
gatattggaatagcagggccagaaggtcctcaaggtcagaaaggcgaaccaggtcgggat
ggaatacccggtgaaaaaggtgcgcaaggtccgccggggccgcccggaaaaggagaattt
tctggctatgaccccagttggaaacctcgaaatatttacagaccggaaggtatcacaatg
agaccaggacttccaggacaaaagggtgagccaggcatttcagggaatcctggtcctaaa
ggcgagtctggaatacccggatctaaaggtatcaaaggggaacctggctacaaaggtgtc
aagggtgatcatgggaaagatggtcctagaggaattcagggctttaagggtgaacctggc
gcacccggtgcaccagggttgcctggcgcaccaggtgaaaacgggcgaccggccgaaaaa
ggcgacaagggcgatacgggacctgaaggaaaactgggtcctccagggccacccggtcca
cccggtatgggaggttccggtagcataaacgtgggagatctaggctttggcacaaaaggt
gataaaggcgatggtggggcgcgcggatacaagggcgataaaggcacgaaaggcgaaaag
ggcgataggggtgactccggaccagctggtattcccggaataaacggtattcaaggacct
caaggagataaaggcgaaccgggtaaagacggagtatcaggattacccggtacacctggc
actaaaggcgagagaggtgagagaggtcctcctggagccactaccgtcgccaattccgga
gactatatcaccatcaaaggtgagaaaggcgccgaaggaaaacggggtagaagaggacga
cctggaccgcctggtcctgtgggtcctcccgggaaaccaggaactacgggagagatcggt
ctaccaggatggatgaacacgatgaagggtcgtcctgggactcctggaattccgggaagt
atcggtccaatgggacccaagggagacaagggagagcctggcgcaccgagtccttacggt
gtttctgtcggtatcaaaggcgacaagggagacgatggttttcctggaattcctgggcag
ccgggaagagaaggtcaaagaggacctccaggacctcccggaccacctggagcaccgtct
caaggaaaatatatgccagttccaggacctccgggtcctcccggaccgcctggtccgcct
ggcttgtctttaatcggacagaaaggagaacccgggatcggtagaagccatgttttcggg
gaaagagattactatccccccaggcaaggagccagaagtagtctggacgaattgaaagct
ctacgtgaactcaagcaactgaaagaactgaaagagcaattaggtgttgttactgctaca
aggggacctttagaaagtacaacgaaaattgtgccaggagctgttacgttccaaaataca
gaagccatgacaaaaatgtctagcgttagtccagtcgggactctggcttatatcatagac
gaacaagctttactagtcagagttaacaatggatggcaatacattgctctcggatcactt
ttgcctatcactacgccagcaccaccaacaacgtctccaccgccagcaaatcctcctttc
gaagcatccaatttaattaatcagatacccgtgaaagccgacggaacaggatggtatccg
cgaatgttgaggatggctgcattgaacgagccgttcaccggagatatgcatggtatacga
ggagcagactacgcttgctatcgacaagcaaagcgagcaggcttgagaggtaccttccgc
gctttccttagctctcgagttcaaaatgttgatagcattgtcagattaggagatcgagat
cttcctatagttaacataaagggcgacgtgctattcaactcttggaaagaaatgtttaat
ggaaatggagcgtacttctctcaaaatcccagaatctacagcttcaatggaaagaacatt
tttactgacttcgcgtggcctgataaagtcgcgtggcacggttctcataaacttggcgat
cgagcaatggacacttactgcgatgcctggcactcgagcagctcggatcgttatggatta
ggctcgccgttgaccggcggccgattgttagaccaagtacgatattcgtgcgacaataaa
tttgcgctgctctgcatcgaggtgacgagcgagctggtgaggagacgacggaacgccgac
aatcggccagacgacgacatcgagatgtcggagaacgactatatggagtacttggaggaa
ctcatgcaatactga

DBGET integrated database retrieval system