KEGG   Musca domestica (house fly): 101889444
Entry
101889444         CDS       T03448                                 
Name
(RefSeq) collagen alpha-1(XVIII) chain isoform X1
  KO
K06823  collagen type XVIII alpha
Organism
mde  Musca domestica (house fly)
Brite
KEGG Orthology (KO) [BR:mde00001]
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mde04147]
    101889444
   00535 Proteoglycans [BR:mde00535]
    101889444
   00536 Glycosaminoglycan binding proteins [BR:mde00536]
    101889444
Exosome [BR:mde04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   101889444
Proteoglycans [BR:mde00535]
 Extracellular matrix (ECM) proteoglycans
  Basement membrane proteoglycans
   101889444
Glycosaminoglycan binding proteins [BR:mde00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   101889444
SSDB
Motif
Pfam: Endostatin Collagen Collagen_trimer
Other DBs
NCBI-GeneID: 101889444
NCBI-ProteinID: XP_058980471
LinkDB
Position
Unknown
AA seq 1166 aa
MQIMCSWRKMFSLLLIVLTVGRTQTIELTGQAIKDAIAEYDLLAMINENLNGIEFDYAED
GFPAYKILQIADIKSPYRMILPEKLNAFAIQTTFQTTSPKGGYLFSVVDPLDTVVQFGVH
FSPVIKDSWNVSLLYTDSNISPISKRLVTYQLPYEPKKWIKLAFKVMSDKVVFYYNCEER
ETTSIVRDPLELVFASSSTLYLAQAGPKLGGNMEGFLQKLNVYGNPEALSITCIPQPKET
DFLSQDYNDVDFSNLASRKPNRGFKFPSLEEASGDYSDYFSWDKATSIYDGSGMPPQQTQ
YQHERPYRTIKGEKGDRGPKGPPGDSIRGPPGPPGPAGPKGEPGSFPPFIDFASNPEAKY
TGKCTCNASDILEAIKENDLLRETIRGPPGLPGKEGKTGAPGRTVTITEKGATGVPGERG
APGPKGDRGDRGDPGPRGPEGLQGQKGEPGVDGMPGVVGPPGPPGPPGLPENYDESLMGN
SMGSLRSSSPGPKGEPGIKGEIGRPGERGQPGQKGERGDPGMSGEKGDRGHTGPHGPAGA
KGEPGQPGIPGISGSPGAPGLKGDRGLPGEIGPLGPPGPPGQIVYADTPYGLQNTTQCTC
PAGPPGPMGPRGPAGYDGAPGLNGEPGPAGSIGLTGLPGSKGDKGEKGLRGLTGPKGDRG
PEGPPGQAFFAGGFEGMAMNGTKGEKGDKGMRGRRGKPGTAGPIGPPGKPGIMGEMGHAG
RPGVPGPKGDLGPKGAKGEPGGREGPKGEKGDRGTDGRDGKPGPPGLPAASGEGVQYVPM
PGPPGPPGPPGPPGLPGLSISGPKGEPGVDSRSGYYGDASYYGRPGPPGPPGPPGSSAGS
SSSRHHDREEDEETPYFSASSWNMRIVPGAVTFPNIDEMTKRSAMNPPGTLAYITEEEAL
LVRVNKGWQYIALGTLVPIATPPPPTTMAPPVRVDIQSSNLLNNLPPLLNTPTFTTAPEY
ETWYPRMLRIAALNEPYIGDLQGIRGADFACYRQGRRAGLLGTFKAFLSSRVQNLDSIVR
VADRDLPVVNTRGDVLFNSWKGIFNGQGGFFSQAPRIYSFSGKNVLTDPLWPQKHVWHGA
LPNGERSIDTYCDAWHSGARDKIGYASNLLGNKLLDQERLTCDSKLIVLCVEALSQDRRK
KRDLSDSDQEFYTAEEYEEHLRSVLT
NT seq 3501 nt   +upstreamnt  +downstreamnt
atgcaaatcatgtgttcttggcggaaaatgttttcgcttctattaattgtccttaccgtg
ggaaggacgcagacaattgaattaactggccaagctatcaaagatgccattgccgaatac
gatttactggccatgattaatgaaaatttaaatggcattgaatttgattatgccgaggat
ggttttccagcttacaaaattttgcaaattgctgatataaaatcaccctatcgtatgatc
ctacccgaaaaactaaacgcttttgccatacagacgacatttcagacaacatcacctaag
ggtggttatctattcagtgtcgtggatcctctagatacagttgtgcaatttggtgtgcac
ttttctccggtcatcaaagattcatggaacgtatcactactgtacacagattcaaatatc
agtccgattagcaaacgtttggtcacctatcagttgccatatgaaccaaagaaatggata
aaattggcatttaaagttatgagcgacaaggtggtattctattacaattgcgaagagagg
gagacgacatcgattgtaagagatccattggaattggtatttgcctcatcatccactttg
tatttggctcaagctgggccgaaattgggtggcaacatggagggatttttacaaaaactc
aatgtttatggtaacccagaagctttatccataacttgtatacctcagcccaaggaaacg
gactttttatcacaagattataatgatgttgattttagtaacttagcttcgagaaaacca
aatcggggttttaagtttcccagtttagaagaagcatccggtgattattcagattatttt
agttgggacaaagcgacaagcatatacgatggcagtggtatgccgccacagcaaacgcaa
tatcaacatgaaagaccttatcgtactataaaaggtgaaaaaggtgatcgtggacccaag
ggaccacctggtgatagcatacgtggtcccccgggaccaccaggtcctgctggtcctaag
ggtgaaccaggatcatttccgccttttatcgattttgcgagtaatccggaagcgaaatac
actggcaaatgtacatgtaatgcgtctgatatcctagaagctatcaaagaaaacgatttg
ctgcgcgaaactatacgagggccaccaggactaccgggtaaagagggtaaaactggagct
cctggacgaacggttacaataaccgaaaagggtgctactggggtgccaggtgaacgcggt
gcccccggtcccaaaggtgaccgcggcgataggggcgatcctggtcctcgtggtcccgaa
ggtctacagggtcagaagggtgaaccaggtgtcgacggtatgcctggtgttgtgggtcct
cctggaccgccaggaccacctggtttgccggagaattatgatgaatcgttaatgggcaac
tcaatgggatcattacgcagcagttcacctggacccaagggcgaaccaggcattaaaggt
gaaattggtcgaccaggtgaaagaggtcaaccggggcagaaaggtgaacgtggagaccca
ggaatgagtggtgaaaaaggcgataggggtcataccggaccacatggtcctgctggagct
aaaggcgaaccaggacagccgggtatacccggcattagcggctctcccggagctcctggc
ttgaaaggcgatcgtggtctacccggcgagattggcccactggggccacccggaccacca
ggacaaattgtttatgcggatacaccatatggattacaaaatacgacacagtgtacttgc
ccggctggtccgcctggtccaatgggtccacgaggtcctgctggctatgatggtgcaccc
ggtttgaacggtgagcctggtccagctggatcaatcggtttgaccggcctgcctggcagt
aagggcgacaaaggcgaaaaaggtttgcgtggcttgactggaccgaaaggtgatcgagga
ccagaaggtccgcctggacaggcctttttcgctggtggtttcgaaggcatggcaatgaat
ggcaccaagggtgaaaagggtgataaaggtatgcgtggtcgtcgcggcaaaccgggaaca
gctggaccaatcggacctccgggcaaaccaggtattatgggcgaaatgggacatgcggga
cgacccggagtacccggaccgaaaggtgatttggggccgaaaggagccaaaggcgaacca
ggcggacgagaaggtcctaaaggagaaaaaggagatcgtggtaccgatggtcgtgatggc
aagcctggtccaccaggtttacccgcagcatctggagaaggtgtacaatatgttcctatg
ccaggaccacctggtcctcccggccctcctggaccacccggcctgccgggactttcaata
tctggaccaaaaggtgaacccggcgtcgattctagaagtggctattatggtgatgcttca
tattatggtcgaccaggaccaccaggcccacccggaccacctggctcgtcagccggtagc
agttccagccgccatcacgatcgtgaagaggatgaggaaacaccttatttttctgcatct
tcttggaatatgcgtatcgttcctggagctgttacctttcccaacattgatgaaatgaca
aagagatctgccatgaatccaccgggcactttggcttacattaccgaagaggaagctttg
ttggtccgagtcaataaaggctggcagtatatagcgcttggcacattggtacccattgca
acgccacccccaccgaccactatggcacctccggtgcgtgtggatatacaatcttccaat
ttattaaataatttaccacctctactcaatacaccaacgtttactacggctccagaatat
gaaacgtggtatccacgaatgttaagaattgctgccttgaatgagccatacattggtgat
ctacaaggtatacgaggagctgattttgcctgttatcgtcaaggtagaagggcaggtctt
ttgggcacatttaaggctttcctatcctcgagggttcaaaatctagattcaattgtacgt
gtcgcagaccgtgatctgccggtggtgaatacaagaggtgatgtcctctttaattcgtgg
aagggcatatttaatggccagggaggtttcttttcgcaagcgccacgtatctacagtttt
agcggcaaaaacgttctgacagatcccttatggccacaaaaacatgtctggcatggcgct
ttacccaatggcgaacgttcgattgacacatattgtgatgcctggcatagtggagcaaga
gacaaaattggctatgccagcaatttgttgggaaacaaattattggatcaggagcgacta
acgtgcgatagcaaattaatagtactttgtgtagaagcactttcgcaggatcgacggaaa
aaacgagatttgagtgacagcgatcaagaattttacacggctgaggagtacgaagaacat
ctaagaagtgtactgacttga

DBGET integrated database retrieval system