KEGG   Zerene cesonia (dogface butterfly): 119840645
Entry
119840645         CDS       T07293                                 
Name
(RefSeq) collagen alpha-1(XV) chain-like isoform X1
  KO
K06823  collagen type XVIII alpha
Organism
zce  Zerene cesonia (dogface butterfly)
Brite
KEGG Orthology (KO) [BR:zce00001]
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:zce04147]
    119840645
   00535 Proteoglycans [BR:zce00535]
    119840645
   00536 Glycosaminoglycan binding proteins [BR:zce00536]
    119840645
Exosome [BR:zce04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   119840645
Proteoglycans [BR:zce00535]
 Extracellular matrix (ECM) proteoglycans
  Basement membrane proteoglycans
   119840645
Glycosaminoglycan binding proteins [BR:zce00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   119840645
Motif
Pfam: Endostatin Collagen Collagen_trimer Laminin_G_3 3keto-disac_hyd
Other DBs
NCBI-GeneID: 119840645
NCBI-ProteinID: XP_038223259
Position
7:complement(6128331..6244408)
AA seq 1203 aa
MMESARRDISNYTKRVYNGKFNVKELQLQRLSPISRHHIIRWLWLTLLFHVTICSANDGS
FGSKYPNDIPEYDLLHAIGVPFSNPKTQYFDEGLDGFPAYGLKPGSDIKSPYRLFMPEKL
YAEFSITATVRPANKDGGFLFSVVNPLETVVQLGVQLIPSGPGLTNISLLYTDANIYALS
QTIASFVVPSFAKKWTRFAIRVTNDNITLFLNCVEFDSMSVKRNPPELVFDSASTLYVGQ
AGPIIRGAFHGAFQELKLFGSPSQAEIQCVNTFEEIGNGSGGDEIYIDNYLVDQEEGDVE
GSGRYGTMPPFPPPPPGLNGYPPILRGEKGERGPRGPPGESIRGPPGPPGPPGAPGTPGA
TTIAESSGSGDDVTSVKQIFGENYVSLGQCGCNSSTILALLESAPELQGPPGPPGLIGAD
GRTGPPGIPGQMGPAGERGPMGPRGEKGDRGDDGIRGPEGQPGQKGEPGVDGRPGSPGPP
GPPGNPGSSDYNNFESNWKPRQIYKESLLGSYGGAIGRPGAPGPKGDAGQPGPIGLQGER
GFPGPKGERGLVGQTGPKGDRGYSGPKGDRGVKGDRGDPGHDGRPGLPGANGRLGEKGEK
GERGTPGPPGPPALPLGFTTEDSEFLTTGRLSPTSKGEKGEKGEKGSGGNDGIPGRPGKD
GIPGERGDIGPSGMPGTSGPAGPPGLKGERGERGPPGPVTIASAGSDIITVKGDKGDLGP
RGRRGRPGPPGPRGLQGLQGIPGPPGKPGEKGDIGLPGWMNNKGRPGTLGPPGNPGAIGP
KGEKGYPGVSLLDISMLKGEKGDRGNDGLPGPKGAEGPPGPPGTAFKSDVVQYIPGPPGP
PGQPGLPGPPGISIVGPKGEPGLSYFEENPVHGSTKYFGRPGPPLDTRSHADESSKTAPG
AAVFRTTEEMMRLAASSPVGALAYVMEEQALFVKVNSGWQYVLLGSLVTQAPLTTTQAPM
PTPMPAASLVHVPHLSNFVDNSPPIATNGPTLKLAALNEPLSGDMHGVRRADYACYRQAR
RAGLKGTFRAFLTSRIQNLDSIVRYADRHLPVVNTYGEVLFKSFSEIFDGNGGILAGTPR
IFSFSGKNIMMDANWPHKLVWHGSHASGERALDTFCDEWQSGEPAMRGMAASLYSHKLLS
QERYACNNRFAVLCIQATAHGNDRRKRDALRYNSTLDDEDYLYNADEYQELLNDIFAQPF
AEN
NT seq 3612 nt
atgatggagagcgctcggcgtgacatcagcaattacaccaaacgcgtctacaatgggaag
tttaacgtcaaagaactgcaacttcaaagactttctcccatctcgagacatcatatcata
aggtggttgtggctgactcttctctttcacgtgacgatttgctcagcaaatgatggcagt
tttggatcgaaatacccgaacgatatccccgaatacgacttgctgcacgctatcggagtt
ccatttagcaatccgaagacccagtacttcgatgagggcctcgatggttttccggcttac
gggctaaagccgggatctgatattaaatctccgtatcgtctctttatgccggagaaactt
tacgctgaattctcaataacagcgacagtacggccggcgaataaagatggcgggttcctc
ttttcagtggtcaaccctttggaaacagtggttcaattaggtgtccagctaatccccagc
gggcctggcttgaccaatatctctctgttgtacacggatgcgaatatttacgctctgtcg
cagaccattgcatcgttcgtggtgccatcgtttgcgaagaagtggacgcgcttcgctata
agagtgactaacgataacatcacactgtttctaaattgcgtagaatttgatagtatgtca
gttaaaaggaacccgccagaactggtatttgattctgcgtcgacgctttatgttggacaa
gctggtccgatcatacgaggcgcatttcatggtgcttttcaagagctgaagctatttggt
tcaccatcgcaagctgaaatacaatgcgtgaatacgtttgaggaaataggtaatggcagt
ggaggagacgagatttatatagacaattatttggtggaccaagaagagggggacgtggag
gggtcagggcggtacgggaccatgccgccatttccgccgccgcccccaggtctaaatggc
tacccaccgatattaagaggcgaaaaaggagagaggggaccaaggggtcctccaggggaa
tctattcgtggccctccaggacccccaggcccaccaggggctccagggactccaggcgct
acaactattgctgaatcatcgggttctggagacgacgttacaagcgtgaagcaaatcttc
ggcgagaactacgtatcgctcggacaatgtggttgtaattcaagcactatactggctctg
ttggaatcagctcccgaactacaggggccaccagggcccccgggtttgattggagctgat
ggacgaacgggccctccagggatacctggacaaatgggccctgccggcgaacgtggacct
atgggtccgagaggtgaaaagggcgatcgtggcgatgatggtattcgcggtccggaaggc
cagcctggtcagaagggagagccaggtgtcgatggaaggcctggaagtccggggccacca
ggccctcctggcaaccccggttcctctgattacaataactttgaatccaattggaagcct
agacagatttacaaggaatcattattgggttcatacggaggagcaattggcaggcccggt
gctccgggtccaaagggggacgcgggacaacctggccccataggccttcaaggtgaacga
ggttttccagggcccaaaggtgaaagaggacttgtaggacaaactgggccgaaaggtgac
cgtggctattctggtccaaaaggggatagaggagttaagggtgatcgtggcgaccctgga
catgacggacgtcctggtttaccgggagctaatggccgactgggtgaaaagggtgaaaaa
ggagaacgaggtactcccggccccccaggaccaccagctttgcctttgggtttcacaaca
gaagattctgaattcttaacgacaggacgactctcgcctacctctaagggggagaaggga
gaaaagggtgaaaaaggaagcggtggtaatgatggaataccaggtaggcccggtaaagac
gggatccccggagagcgcggtgacattgggccttctgggatgccaggcacatcgggaccc
gcagggcctccaggcttgaagggcgagaggggagaacggggacctccaggccctgtcact
atcgcttcagctggttctgatatcattaccgttaagggtgataaaggcgatttagggcca
agaggcagaagagggcggccaggacccccgggacctcgtggccttcaaggtctgcaaggg
atccctggaccaccggggaaacctggtgaaaagggtgatataggtctcccaggctggatg
aacaataagggtcgtccaggtactctagggcccccaggaaatccgggcgctataggtccg
aaaggagaaaagggttaccctggtgtcagtcttttggatatttccatgttaaaaggtgaa
aagggtgatcgcggtaatgatggattgcccgggccaaaaggagcagagggaccaccaggg
cccccgggtacagccttcaaatccgacgtagtgcaatacatcccgggacctcccgggcct
ccaggccagccggggcttccgggaccccctggcatatccatagttggacctaagggagaa
ccaggtcttagctactttgaagaaaaccccgttcatggcagcacgaagtattttggcaga
cctggacctcctttggatacaagaagccacgctgatgaatccagcaaaactgctcctggg
gctgctgtgttccgtacaacagaagaaatgatgcggcttgctgcatcaagccctgtggga
gcattagcttatgttatggaggaacaagctctctttgtcaaagtgaattcaggctggcaa
tacgttttgttaggatcgctagtgacgcaggctcctctcacgacaacccaagccccaatg
ccaacgcccatgccggctgccagcttagtgcatgtaccacatttatcaaacttcgtggat
aattccccgccaatagctactaatgggccaacgctaaaattagcagctctgaacgaacct
ctgtcaggtgatatgcatggtgtgcgacgagctgactacgcgtgttatcgacaggcgaga
cgcgctggtctgaaaggaacctttagagctttcctgactagcaggatacaaaatttagat
tccatagtacgatatgcggaccgacacttgcccgtcgtgaacacatatggagaggttttg
ttcaagtcattctctgaaatatttgatggcaacggcggcattttagcgggaacgccgaga
atcttcagttttagtgggaagaacatcatgatggacgctaattggcctcacaaactagtc
tggcacgggtcgcacgcgagcggcgagcgcgctctggacacgttctgcgacgagtggcag
agcggcgagccggcgatgcgcggcatggccgcctcgctgtactctcataagctgctgtca
caagagagatatgcctgcaacaatcgctttgcagtgctgtgtatacaagccacagctcat
ggtaatgatagacgaaaacgggatgcattgaggtacaattcgactctagacgacgaagat
tacctttacaatgcggacgagtaccaagaactactgaatgatatatttgctcaaccgttt
gcagagaactag