KEGG   Branchiostoma floridae (Florida lancelet): 118412003
Entry
118412003         CDS       T01074                                 
Name
(RefSeq) collagen alpha-5(IV) chain-like isoform X1
  KO
K06237  collagen type IV alpha
Organism
bfo  Branchiostoma floridae (Florida lancelet)
Pathway
bfo04382  Cornified envelope formation
bfo04512  ECM-receptor interaction
bfo04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bfo00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    118412003
 09140 Cellular Processes
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    118412003
 09150 Organismal Systems
  09158 Development and regeneration
   04382 Cornified envelope formation
    118412003
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bfo04147]
    118412003
   00536 Glycosaminoglycan binding proteins [BR:bfo00536]
    118412003
Exosome [BR:bfo04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   118412003
Glycosaminoglycan binding proteins [BR:bfo00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   118412003
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 118412003
NCBI-ProteinID: XP_035670450
UniProt: A0A9J7KUT7
LinkDB
Position
3:14033144..14099860
AA seq 1231 aa
MGTLWDATRIHLAIFLLLVVHANAQFSRKKSSSCDCPEGPEGPAGPPGPPGLPGMDGMMG
MNGERGPPGPRGGPGSKGPPGPAGGTGEPGMPGFPGKQGPKGIPGNRGQAGPPGPAGPPG
EAGEPGNEGKPGMMGMAGPPGPQGLIGPPGKTGQSGGPGSRGLPGPPGNNGGPGMPGLKG
HRGFPGSPGPPGPPGMVGRDGKEGKEGDAGPPGPAGPSGRPGDQGPAGKSGSPGRPGSIG
PPGPPGNSGERGTPGPPGFQGAPGMAGKAGIQGKTGNDGAPGAIGPPGPPGQRGIGMPGP
PGPPGEGKPGKDGDKGSAGLPGRPGMPGRPGPPGNPGQSKEGPEGPPGPPGPAGLGRRGP
PGPPGPPGAMSQAARQPGPPGLPGDRGRPGPPGEQGSPGPAGLPGRGIKGHQGMPGMPGQ
QGQPGPPGPPGIGEEGPPGKQGKEGEPGIGMPGEPGQRGPPGPSGKLGRPGQPGPPGPPG
IGQPGNDGQAGPPGPPGSPGFPGSPGEGKRGSPGRSGQPGPPGPPGAAGKPGIGKPGRTG
EPGPRGDKGDDGVGTPGPRGPPGLPGLPAPAVDLSALAGALAKQMPSPKGIRGPPGPPGS
PGSPGSGGRGSPGPPGEPGPPGPMSPAGAGGGPPGPSGAPGPPGPPGPSGQGMPGPSGPP
GPLGPPGPGGPPGPPGTGLKGDRGPPGPPGGDGTGPGPNGGPGPKGDPGPGGPPGPGGPP
GPNGPPGPPGPSGIGLPGPQGPPGPGGPPGPQGVGLKGDPGPIGGPGPQGPPGPPGLPGI
GEKGDIGQPGEPGEPGQRGPRGPPGIGQLGPPGPAGPGGPPGPPGVGVKGEAGPPGFPGA
AGPPGPGGPPGVGTQGPPGALGPPGPAGPPGAPGQTDGPGTVGPPGPPGPLGPPGTPGLP
GVGLKGEPGAGGPQGPPGPPGVGLKGEPGVLGPPGPAGPTGPPGQGIPGDAGPPGLPGLP
GARGLPGLDGLPGPVGQGGFDLLTRHSQTDQIPNCPENMTKVWDGYSLLHVEGNERAHDQ
DLGRPDSCLPRFGILPFFSCEPDDVCKYGYRDDKTYWLSTIERFEQQPRVPAEGTDIRQY
ISRCAVCLTTNLVLAFHSQRASVTPDCPEGWRGIWTGYSFLMHTAAGEGGGQPLTSPGSC
LDHFRSAPFIECVSDVCDVSSNQLSFWLIGIPGGEREQFNQPRRVVGSLEVLRQRISRCR
VCVYNGGLYEDIVAKEAPPGVTATGVFNFGG
NT seq 3696 nt   +upstreamnt  +downstreamnt
atggggacgttatgggacgcgacgaggatacacctcgccatcttcctactattagttgta
catgcaaatgctcaattttctagaaagaaaagttcgtcctgcgactgccctgagggtcct
gagggaccggccggaccacctgggcctcccggtttacctggcatggatggaatgatggga
atgaacggagagagaggaccacctggaccaaggggtggaccgggcagcaaaggtccgcca
ggtcccgccggaggcacaggagaaccgggaatgccgggtttcccaggaaagcaaggcccc
aaaggcatccctggcaacagaggtcaagctggtccccccggacctgctgggccaccggga
gaagcaggagagcctggtaacgagggcaagccaggcatgatgggaatggcaggcccacct
ggacctcagggactgatcggaccccctggcaaaacaggtcaatcaggaggacctggaagt
aggggactgccgggtccacctggtaacaatggcgggccaggcatgccgggattgaaaggt
cataggggattcccaggatcaccaggcccacctggtccaccaggaatggtcggaagagac
ggcaaagagggcaaggaaggcgatgctggaccgcctggacccgctggcccatccggccga
cctggagaccaaggacccgctggaaaatctggaagcccgggtcgccccggtagtatcgga
ccaccaggaccccctggcaattcgggtgaacggggaacccctggacctcctggcttccaa
ggagcaccgggcatggctggaaaagcaggcatccaaggaaaaaccggaaatgacggtgcc
cctggagccattggtccaccaggaccaccaggacaacgtggcattggaatgccaggacca
cctggacctcctggagaagggaaacctgggaaggacggtgataaaggatcagctggattg
ccgggcagacccggaatgccgggaagaccgggaccaccaggtaaccctggacagtcgaag
gaaggtccagaaggcccacctggtccacctggcccagccggtttaggaagaagaggaccc
cccggtccacctggacccccaggagcgatgagccaagcagcacggcaacctggtccccct
ggcctgcccggcgacagaggcagacctgggccgcctggcgagcagggatcgcccggacca
gcaggactacctggtcgtggcatcaaggggcaccaaggtatgccggggatgcccggccaa
cagggacaaccaggaccgccaggaccacctggaatcggagaagaaggacctccaggaaaa
caaggaaaggaaggtgaaccgggtattggtatgcccggcgagccaggacaaagaggtcca
cctggtccctcaggaaagctcggacgccccggtcagcccggaccgccaggacctcccggt
atcggacagcctggaaatgatggacaggcaggcccacctggtccacctggatctcctgga
ttcccaggctcacctggtgagggaaagagaggttcacctggaagatcaggccaacctgga
ccacctggaccaccaggagctgcaggaaaaccaggaataggaaaaccaggccgcacagga
gagccaggacctcgaggagacaagggtgatgatggcgtgggcacccctggcccacgtggt
cccccaggtctccctggtcttccagctccagctgtagacctatcagctcttgcaggtgca
cttgccaagcagatgccaagtccgaagggaatacgaggaccacccggtccaccaggttcc
cccggctcaccaggatcaggaggacgcggttcacccggccctccaggagaaccaggacct
cctgggcccatgtcacccgcaggagcaggtgggggacccccaggcccatctggcgcacca
ggcccacctggtccacctggcccgtctggacagggcatgcctggcccatccggcccccct
ggaccacttggcccgcctggacccggaggacctcccggaccacctggaactggactcaag
ggtgaccgtggtccccctggcccccctggaggagacggtactggcccaggccctaacgga
gggccgggacccaaaggagatcccggaccagggggaccccccggaccaggaggacctcca
ggacctaatggaccacctggaccacctggaccatcaggaattggattgccgggaccgcaa
ggtccacctggccctggaggaccaccaggaccgcaaggcgtcggactgaagggtgaccca
ggccccatcggcgggccagggcctcagggaccacctggtccaccaggtctaccagggatt
ggagagaaaggagacattggccagccgggagaaccaggggaacctggccaaaggggcccc
agaggacctccaggaatcggacagttgggaccacccggaccagcaggacctggtggacct
ccaggacctcctggtgttggcgtaaagggtgaggctggaccacctggcttccctggagct
gccggacctcctggacccggcggaccacccggagttggcacacagggtccgccaggtgcc
ctcggcccacctggtcctgctggaccaccaggagcaccaggccagacagatggtcctggg
acagtcggcccccctgggcccccaggacctctaggtccccctggaacgccaggacttccc
ggtgttggactgaagggagaaccaggagctggcggacctcagggaccaccaggtcctcct
ggtgttggactaaagggagagccaggtgtattaggaccccctggacctgcaggacctaca
ggtccacccggacagggcatacctggtgacgcaggacctcccggtctgcccggtctccct
ggcgctaggggattgcccggtttggacggactgcccggacctgtcggacaaggagggttc
gatttgctgacccgccacagccaaacggaccagattcccaactgccccgaaaatatgacc
aaggtctgggatggatacagtctgcttcatgtggaaggcaatgagcgagctcacgatcaa
gatttaggccgccctgactcctgtctaccgaggttcggcatcttgccattcttcagctgc
gagccggacgatgtctgcaagtacggctacagggacgacaagacctactggctctccacc
atcgagaggttcgaacaacagccccgagtgccggctgagggaacggacatccgacagtac
atcagccgctgtgcagtgtgtctgacaacaaacctggtgttagccttccacagtcagcgg
gcctcggtcaccccagactgtccggagggttggaggggaatctggactggatacagcttc
cttatgcacaccgctgcaggagaaggagggggtcaaccactaaccagccccggaagctgc
ttggaccacttccggtctgcaccattcatcgaatgcgttagcgatgtctgtgacgtgtcg
tccaaccagctcagtttctggctcatcgggatccccggtggagagagggaacaattcaac
cagccgcgcagagtcgtgggcagcttagaggttctcaggcaaagaataagtcgatgcagg
gtctgtgtttacaacggtggactgtacgaagatatcgttgctaaggaagctcctcctggt
gtaactgcaactggtgtttttaactttggcggctga

DBGET integrated database retrieval system