KEGG   Buteo buteo (common buzzard): 142027057
Entry
142027057         CDS       T11399                                 
Symbol
COL9A3
Name
(RefSeq) collagen alpha-3(IX) chain isoform X1
  KO
K08131  collagen type IX alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142027057 (COL9A3)
   04518 Integrin signaling
    142027057 (COL9A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142027057 (COL9A3)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142027057 (COL9A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:bbut00535]
    142027057 (COL9A3)
Proteoglycans [BR:bbut00535]
 Extracellular matrix (ECM) proteoglycans
  Others
   142027057 (COL9A3)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 142027057
NCBI-ProteinID: XP_074876757
LinkDB
Position
2:76870184..76909770
AA seq 695 aa
MAGFSALGLLFLCQLLATTPAQRVGPQGPPGPRGPPGPSGKDGIDGEPGPSGLPGPPHWS
ESASAPQSAMGCSKLYQGPKGAPGKPGAAGEAGLPGLPGVDGLTGTDGPPGPNGPPGDRG
ALGPAGPPGPAGKGLPGPPGPPGPSGLPGGNGFRGPPGPFGLPGFPGPPGPPGPPGLPGS
FPDGGGDLQCPALCPPGPPGPPGMPGFKGHTGHKGEPGEIGKEGEKGNPGPPGPPGIPGS
VGLQGPRGLRGLPGPMGPAGDRGDIGFRGPPGIPGPPGKAGDQGNKGPQGFRGPKGDTGR
PGPKGNPGARGLIGEPGMPGKDGRDGAPGLDGEKGDAARMGAPGEKGPNGLPGLPGRAGV
KGSKGEPGSPGEMGEAGPSGEPGIPGEVGIPGERGLPGPRGATGPLGLQGPMGVPGVRGF
QGPKGASGEPGLPGPTGIRGELGDRGPTGVPGPKGNQGIAGADGLPGDKGELGPFGPPGQ
KGEPGKRGELGPKGVQGPNGTAGAPGIPGHPGPMGHQGEQGIPGITGKPGPPGKEASEQH
IRELCGEMINDQIAQLAANLRKPLSPGMTGRPGPAGPPGPPGAMGSVGHPGPRGPPGYRG
PTGELGDPGPRGDTGEKGDKGPVGQGIDGPDGDQGLQGLPGVPGITKNGRDGAQGEPGLP
GDPGTPGAIGAQGTPGICDTSACMGAVGASISKKS
NT seq 2088 nt   +upstreamnt  +downstreamnt
atggctggattctccgctcttgggcttctcttcctatgccagctattagccacaacccca
gctcagagagtaggaccacaaggcccccctggaccacgaggaccccctggaccatctggc
aaggacggcatcgatggagagccgggcccttctggtttgccaggcccaccacattggtct
gagtctgcttcagctcctcagtccgccatggggtgctcaaaattgtaccaggggccgaaa
ggtgcaccagggaagcctggagcagctggtgaagccgggttgcctggccttcctggagtg
gatggtttaacaggaactgatggcccacctggcccaaatgggcctccaggagaccgtggt
gcattaggacctgccgggccacctggaccagctggcaaaggactaccaggacctccaggg
cctccaggacccagtgggctcccaggtggaaatggatttcggggtcctcctggtcccttt
ggtctgccaggattcccagggcctcctggaccacccggacctcccggtcttccaggctcc
tttcctgatggcggtggggaccttcagtgcccagctctgtgcccaccgggtcctccaggt
cccccaggaatgccagggttcaagggacacactggtcacaaaggagaacctggtgaaata
ggaaaagaaggagagaagggtaaccctggccctccaggtcctccaggaattcccggcagt
gttggcctgcagggcccaagaggactaagaggcctccctggtccaatgggtccggctggt
gacagaggtgacattggtttcagagggccacctggaatcccaggacctccagggaaagct
ggagaccaaggaaacaaaggacctcaaggattcagggggcccaaaggcgatactggtaga
cctggaccaaaaggaaacccaggggctcgtggactcataggagaacctggcatgcctggc
aaggatggacgggatggagctcctggactcgatggtgagaagggtgatgcagctcgcatg
ggtgctcctggagagaagggaccaaacggactgcctggactgcctggaagagcaggtgtt
aaaggctcaaagggtgaaccaggcagtcctggagagatgggagaggcaggtccctctgga
gagccaggcatccctggtgaagttggtattcctggtgagagaggtctgccgggacccaga
ggagcaactggaccacttggtcttcaaggcccgatgggagtccctggtgtcagaggtttc
cagggtcccaaaggtgccagtggtgaacctggccttcctggtccaactggaattcgtgga
gagttaggtgacaggggtcctactggtgttcctggtcccaaaggtaaccagggcattgct
ggagcagacggcctcccaggtgacaaaggagaactgggtccctttggtccccctggccaa
aaaggagagccaggaaaaagaggagaacttggtccaaaaggtgtccaaggacctaatggg
acagcgggagcaccagggatcccaggacatccagggcccatgggccatcagggagagcaa
ggcattcctggcatcacgggaaaacctgggccacctggcaaagaggccagtgaacaacat
ataagagaactctgtggggaaatgataaatgatcaaattgcacagttagctgctaatctg
agaaagcccttgtctcctggaatgacgggtcgtcctggcccagctggtcccccaggtcct
cctggagctatgggaagtgttgggcatcctggtcctcgtggtccccctggatacagaggg
ccaacaggagaacttggtgatcctggtcccagaggtgacactggtgaaaagggagataaa
ggacctgttggtcaaggtattgatgggcctgatggtgatcaaggacttcaaggactacct
ggtgtacctggaattactaaaaatggtcgtgatggtgctcagggtgaacctggtcttcca
ggggatcctggtactcctggtgctattggggctcagggaactccaggaatatgtgacacg
tcggcttgcatgggagctgttggagcatcaatttccaaaaaatcataa

KEGG   Buteo buteo (common buzzard): 142031034
Entry
142031034         CDS       T11399                                 
Symbol
COL6A2
Name
(RefSeq) collagen alpha-2(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142031034 (COL6A2)
   04518 Integrin signaling
    142031034 (COL6A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142031034 (COL6A2)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142031034 (COL6A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bbut04147]
    142031034 (COL6A2)
   00536 Glycosaminoglycan binding proteins [BR:bbut00536]
    142031034 (COL6A2)
Exosome [BR:bbut04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   142031034 (COL6A2)
Glycosaminoglycan binding proteins [BR:bbut00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   142031034 (COL6A2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   142031034 (COL6A2)
SSDB
Motif
Pfam: VWA Collagen VWA_2
Other DBs
NCBI-GeneID: 142031034
NCBI-ProteinID: XP_074883875
LinkDB
Position
5:complement(10633423..10665277)
AA seq 1019 aa
MFRQAFFSALLCVALVPLHAQFDGDPGDVSPTSCAEKRNCPINVYFIIDTSESVALQTVP
IQSLVDQIKQFIPMFINKLESELYQSQVYITWQFGGLHYSDVVEIYSPLTSSKDIYLTRL
LAINYLGRGTFTDCAISNMTEQIQTQMASGVNFAIVITDGHVTGSPCGGMKVQAERARDM
GIKLFAVAPSENVYEQGLREIANLPHELYRNNYAITQKDTLEIDENTIDRIIQAMKHEAY
GECYRMSCLEIAGPPGPKGYRGQKGAKGNMGEPGSPGLKGRQGDPGIEGPIGYPGPKGVP
GLKGEKGEIGSDGRRGAAGLAGRNGTDGQKGKLGRIGPPGCKGDPGDKGPDGYPGDAGDQ
GERGDEGIKGDPGRPGRSGPPGPPGEKGSPGLPGNPGPQGPPGSKGRRGEAGPPGPKGEP
GRRGDPGTKGSKGTPGAKGERGDPGPEGPRGLPGEVGSKGARGDQGLPGPRGPSGAVGEP
GKIGSRGDPGDLGPRGDAGPPGPKGDRGRPGFSYPGPRGPQGDKGEKGQPGPKGIRGELG
PKGVQGTKGAKGEPGDPGPGGEPGLRGPTGEAGPEGTPGPPGDPGLTDCDVMTYVRETCG
CCDCEKRCGALDIMFVIDSSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSQTGARVGVV
QYSHEGTFEAIKLDDERIDSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREKA
QVFAVVITDGRYDPRDDDKNLGALCGRDVVVNTIGIGDMFDQPEQSETLVSIACNEPQRV
QKMRLFSDLVAEEFIDKMEDVLCPDPQIVCPELPCQTELAVAQCTQRPIDVVFLLDGSER
IGEQNFHRAHHFVEEVARQLTLARSNNDNMNARIALLQYGSERDQDVVFPLTYNLTEISS
ALAQIKYLDSSSNIGSAIIHAINNIVLSPGDGQRLARRNAELSFVFITDGITGSKNLEEA
INSMKKQDVMPTVVALGSDVDMDVLLKLGLGDRAAIFREKDYDSLSQPSFFDRFIRWIC
NT seq 3060 nt   +upstreamnt  +downstreamnt
atgttccgacaagcctttttttccgctctcctttgtgtggctctagttcctctccatgct
cagtttgatggggatcctggagatgtcagtcctacctcctgtgcagaaaagaggaactgc
cccatcaatgtgtacttcatcattgacacctcagagagtgtcgctctgcagactgtgcct
atccagagcctcgtggatcaaataaagcagttcatccctatgttcatcaataaactggag
agtgagctctaccagagccaagtctacatcacttggcagtttggtggacttcattactcg
gatgtggtggaaatttacagccctttaacgagcagcaaagacatatacctcacccggctg
cttgctattaactaccttggccggggcaccttcacggactgcgctatctccaacatgacg
gagcagatccagacccagatggcctcgggtgtgaactttgcaatagtcatcactgatgga
catgtcactggcagcccctgcggggggatgaaggtgcaggctgagagggcacgagacatg
gggatcaagctctttgcagtggcccccagcgagaacgtctacgagcagggactgcgagag
atcgccaacctgccacacgagctgtaccgcaacaactatgccatcacgcagaaagacacc
ctggaaattgatgagaacaccatcgataggataatccaagctatgaaacacgaagcctac
ggagagtgctacaggatgagctgcctggagattgcaggtccgcctggtcccaagggatac
cgagggcagaagggtgcaaaggggaacatgggtgagccaggctcccctgggctgaaggga
cgacagggtgacccgggtattgaaggtccaatcggataccctggccccaagggtgtccca
ggactgaaaggagagaagggtgagattggatccgatggacggaggggtgccgctggtttg
gcaggcaggaatggcaccgatggacaaaagggcaagctggggagaattggaccaccgggc
tgcaagggagaccctggtgacaagggccctgatggctacccaggagatgcaggagaccaa
ggtgaaagaggagatgaaggcataaagggagatcctggccggcctggtcgcagtggaccc
ccaggacctccaggagaaaaaggaagcccgggacttccaggcaaccctggacctcaaggg
cctcctggaagtaagggaagaagaggtgaagcaggacctcctggacccaaaggagagcct
ggtcgacgtggagatccagggacaaaaggcagcaaaggcactcctggggccaaaggagaa
aggggagatcctggtccagagggtcctcgtggcctgcccggtgaggttggaagcaaagga
gcaaggggagaccaaggactgccaggacctcgaggcccgtcaggtgctgtgggagagcca
ggcaagataggatcacgtggggaccccggtgacttgggcccaagaggtgacgccggacca
ccaggaccaaagggtgacagaggcagacctggctttagctaccctgggcctagaggacca
cagggtgacaagggtgagaaggggcagccaggacccaagggaataagaggtgagcttgga
ccaaaaggagtgcaagggaccaaaggagcgaagggggaaccgggtgaccctggcccggga
ggggagcctggactccgcggtccaactggggaagcaggccctgaggggacccctggccca
ccaggagatcctggcctcactgactgcgacgtgatgacctacgtgcgagagacatgtgga
tgctgtgactgtgagaagcgctgcggtgccctggacatcatgttcgtcatagacagctca
gagagtattggctacaccaacttcaccctggagaaaaattttgttgtcaacgtggtcagc
cggctggggtctattgccaaggaccccaagtcgcagacaggtgcccgtgttggggtggtg
cagtacagtcacgaggggaccttcgaagccatcaagcttgatgatgagcgcattgattca
ctgtcaagcttcaaggaagcagtgaagcgcctggagtggatcgctggaggcacctggaca
ccatctgcccttcagtttgcctacaacaagctcatcaaggaaagcaggcgagagaaagcc
caggtgtttgctgtggtgatcacagatgggcgctacgacccccgggatgatgacaagaac
ctaggggctctttgcggtagagatgtggtcgtcaacaccattggcattggagacatgttt
gatcagccagaacagagcgagaccctggtctccattgcctgcaatgagccccagcgagtc
cagaagatgaggctcttctcagacctagtggcggaggaattcattgacaagatggaagac
gtgctctgcccagatccacagatcgtctgccctgagctgccctgtcagacagagctcgca
gtggcccagtgcacacagcgccccattgacgttgtcttcttgctcgatggctctgaaagg
atcggggagcagaacttccacagggcccaccactttgtggaggaggtggcccgacagctc
acgctagccaggagcaacaacgacaacatgaatgcccggattgcgctgctgcagtacggc
agtgagagggaccaggatgtggtcttcccactgacctacaacctgacggaaatctccagc
gccctggcacagatcaagtacctggactcatcctccaacattgggtcagccatcatacac
gccatcaacaacatcgtgctcagcccgggagatgggcagcgacttgcccggcgcaatgct
gagctctcattcgtcttcatcactgatggcatcacgggaagcaagaacctcgaggaggcc
atcaattccatgaagaagcaagatgtcatgcccacggtagtggctcttgggagtgatgta
gacatggacgtcctgctgaagcttggcctcggggaccgggcagccatcttccgggagaag
gactatgacagcctctcccagcccagcttctttgacaggttcattaggtggatatgttaa

KEGG   Buteo buteo (common buzzard): 142031037
Entry
142031037         CDS       T11399                                 
Symbol
COL6A1
Name
(RefSeq) collagen alpha-1(VI) chain
  KO
K06238  collagen type VI alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142031037 (COL6A1)
   04518 Integrin signaling
    142031037 (COL6A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142031037 (COL6A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142031037 (COL6A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bbut04147]
    142031037 (COL6A1)
   00536 Glycosaminoglycan binding proteins [BR:bbut00536]
    142031037 (COL6A1)
Exosome [BR:bbut04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   142031037 (COL6A1)
Glycosaminoglycan binding proteins [BR:bbut00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   142031037 (COL6A1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   142031037 (COL6A1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 DUF1194 vWA-TerF-like
Other DBs
NCBI-GeneID: 142031037
NCBI-ProteinID: XP_074883903
LinkDB
Position
5:complement(10689908..10715840)
AA seq 1022 aa
MRLRDFLFTLLLPAGFLWGGLWAQRPEITRVANAEDCPVDLFFVLDTSESVALRVKPFGD
LVAQVKDFTNRFIDKLTNRYYRCDRNLVWNAGALHYSDEVVLIKSLTPMPSGRNELKNRV
SDVNYIGKGTYTDCAIKRGIEELLISGSHHKENKYLIVVTDGHPLEGYKEPCGGLDDAAN
EAKHLGIKVFSVAISPNHLDQRLNIIATDHAYRRNFTATSLKPTRELDVEETINTIIDMI
KVNTEQSCCSFECQPPRGPPGPPGDPGNEGERGKPGLPGQKGEAGEPGRPGDMGPVGYQG
MKGDKGSRGEKGSRGAKGAKGEKGRRGIDGIDGMKGEAGYPGLPGCKGSPGFDGAQGPPG
PKGDPGAYGPKGGKGEPGDDGKPGRQGIPGSPGEKGAPGNQGEPGPAGEMGDEGAPGPDG
PAGERGSNGEGGPPGSPGDRGPRGEPGEPGPPGDQGREGPLGPPGDQGEAGPPGPKGYRG
DDGPRGNEGPKGLPGAPGLPGDPGLMGERGEDGPPGNGTIGFPGAPGRPGDRGDPGINGT
KGYVGPKGDEGEAGDPGNDNPTPGPRGIKGAKGHRGPEGRPGPPGPVGPPGPDECEILDI
IMKMCSCCECTCGPVDLLFVLDSSESIGLQNFQIAKDFIIKVIDRLSKDERVKFEAGESR
VGVVQYSHDNTQELVAMGDANIDNIGALKQAVKNLKWIAGGTYTGEALQFTKDNLLRRFT
SDKRVAIVITDGRSDTLRDPTPLNSLCDVTPVVSLGIGDIFRNPPNPDHLNDIACLSRPT
RPGLSIQRDNYAELLDDTFLQNITSYVCQEKKCPDYTCPITFNGLADITLLVDSSTSVGS
KNFETTKKFVKRLAERFLEAGKPADDSVRISVVQYSGRNQQKVEAQFQYNYTVIAKAIDN
MEFINDATDINAALRYVTGLYQRSSRAGAKKRVLVFSDGNSQGITARAIERAVQEAQQAG
IEIYVLAVGSQANEPNIRVLVTGKSADYDVVYGERHLFRVPDYTSLLRGVFYQTVSRKIA
VD
NT seq 3069 nt   +upstreamnt  +downstreamnt
atgaggctgcgagactttctattcactttgctgctcccggccggcttcctgtgggggggc
ttgtgggcccagcgcccagaaatcacccgggtcgcaaatgcagaagactgccccgtggac
ctgttcttcgtcctggacacctcggagagtgtcgccctgagggtgaagcccttcggggac
ctggttgcccaagtgaaagatttcaccaaccgattcattgataagctgacaaacaggtac
taccgctgcgaccgcaacctggtgtggaacgctggggcactgcactacagcgatgaggtt
gtgctcatcaagagcctcaccccgatgcccagcggccggaatgagctgaagaaccgtgtc
tcggacgtgaactacatcgggaagggcacctacactgactgtgccataaagcggggcatt
gaggagctgctcatcagtggctcccatcacaaggagaataaatacctgattgtggtgacc
gatggacaccccttggaagggtacaaggagccctgtggaggcctggacgatgctgccaat
gaagccaaacatttggggatcaaagtgttctctgtcgccatctcccctaaccacctggac
cagcggctcaacatcatcgccacagatcatgcctaccgccgcaacttcactgccaccagc
ctgaagccaacccgggagctcgacgtggaggaaaccatcaacaccatcatcgacatgatt
aaagttaacacggagcaatcgtgctgctccttcgagtgccagcctccccgagggcccccc
ggacctcccggcgacccgggcaatgagggagaaagaggcaagcccggtcttcctggccag
aagggagaggctggagagccaggcaggccaggggacatgggacctgttggctaccaagga
atgaagggtgacaaaggaagtcgaggagaaaagggctcaagaggagccaaaggagccaag
ggtgagaagggcaggcgtggaattgatggcatcgatggcatgaagggtgaagctggatac
cccgggttacctggctgcaaaggctcacctggatttgatggagctcaaggcccacctgga
ccgaagggagatcccggtgcttacgggcccaagggaggaaagggggaacctggggacgac
ggaaaacctggccgacaggggattcctggcagccctggagagaagggcgcacctggaaac
cagggtgagcctggaccggcaggagagatgggtgatgagggtgctccaggcccagacggt
cctgctggagaacggggcagcaatggagaaggaggccccccaggctcgccaggtgaccgc
gggccaagaggagagccgggagagcctggaccacctggtgaccaagggcgggaaggaccc
ctcggaccacctggcgaccagggcgaggctggacccccaggacccaagggctacagggga
gatgacggcccccgcggcaatgagggcccaaaggggttgcctggagcacctgggctgcct
ggagaccctggcctgatgggagaaaggggggaggatggccctcctgggaatggcacaatt
ggatttccaggcgctccggggcgaccaggagacagaggtgaccctggcattaatggaacg
aaaggctacgtcggtcctaaaggtgatgaaggagaagctggagaccctggcaatgataat
ccaacccctggccccaggggcatcaaaggagcgaagggccacaggggacctgaaggccgc
ccaggaccaccaggacctgtgggacctccaggaccagacgaatgtgaaatcttggacata
atcatgaagatgtgctcttgctgtgagtgcacctgtggtcccgtggacctcctcttcgtg
ttggacagctcagagagcattggcctgcagaacttccagattgccaaggacttcatcatc
aaagtcattgaccgcctgagcaaggacgagcgtgtgaagtttgaagctggggaatcccgt
gtgggcgttgtgcagtacagccatgacaacacgcaggagctggtggccatgggggatgcc
aacatagacaacatcggggcactcaaacaggcagtcaagaacctgaaatggatagctgga
ggcacctacactggagaagccctgcagttcaccaaggacaacctgctgcggagattcacc
tccgacaagcgcgttgccatcgtcatcactgatggacgctccgacacactgcgggatcct
accccgctcaactctctgtgcgatgtcaccccggtggtttcccttgggataggtgacatt
ttccggaaccctccaaatccagatcacctgaatgacatcgcttgtttaagcagaccgacc
aggccaggactttcaattcaaagggacaactatgccgagctcctggatgacaccttcttg
cagaacatcacctcgtacgtctgccaagagaagaaatgtccggactacacctgcccaatc
accttcaacgggctggcagacatcacgctgctggtggatagctccaccagcgtggggagc
aagaactttgagaccaccaagaagtttgtaaagcggctggcggagcggttcctggaggct
ggcaaacctgctgacgactctgtacgcatctcagtggtccagtacagtggcaggaaccag
cagaaagtggaagctcagttccagtacaactacacagtcattgccaaagccatcgacaac
atggagttcattaatgacgccacagacatcaacgctgctttgcggtacgtcacagggctc
taccagcggtcctcccgtgctggggcaaagaagagggtgctggtgttttctgatggcaac
tctcaagggatcaccgcaagggccattgagagggctgtgcaggaagctcagcaggccggt
atcgagatctatgtgctggcggtgggcagccaagccaatgagcccaacatccgtgtcctg
gtcacggggaaaagcgcagattacgatgtggtctacggggagcgccacctcttccgtgtg
cctgactatacctctctgctgcgtggtgtcttctaccaaactgtctccaggaagatagct
gttgactga

KEGG   Buteo buteo (common buzzard): 142031054
Entry
142031054         CDS       T11399                                 
Symbol
COL6A3
Name
(RefSeq) collagen alpha-3(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142031054 (COL6A3)
   04518 Integrin signaling
    142031054 (COL6A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142031054 (COL6A3)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142031054 (COL6A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bbut04147]
    142031054 (COL6A3)
   00536 Glycosaminoglycan binding proteins [BR:bbut00536]
    142031054 (COL6A3)
Exosome [BR:bbut04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   142031054 (COL6A3)
Glycosaminoglycan binding proteins [BR:bbut00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   142031054 (COL6A3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   142031054 (COL6A3)
SSDB
Motif
Pfam: VWA VWA_2 Collagen Kunitz_BPTI VWA_3 fn3
Other DBs
NCBI-GeneID: 142031054
NCBI-ProteinID: XP_074883959
LinkDB
Position
5:complement(13107215..13170534)
AA seq 3217 aa
MRKHRHLPFVAMFCLLLSGFCSVHAQQQAAVKTVAVADIIFLVDSSWSIGKEHFQLVREF
LYDVVKALDVGGNDFRFALVQFSGNPHTEFQLNTYPSTRDVLSHIANMPYMGGGTKTGKG
LEYLIENHLTKAAGSRASEGIPQVIIVLTDGRSQDDVALPSSVLKSAHVNMFAVGVQDAV
EGELKEIASEPFDMHLFNLENFTALQGIVGDLVASVRSSMTPEKAGAKGLVKDITAQESA
DLIFLIDGSNNIGSVNFQAIRDFLVNLIESLRVGAQQIHIGVVQYSDEPRTEFALNSYST
KADVLDAVKALSFRGGEEANTGAALEFVVENLFTQAGGSRIEEAVPQILVLISGGESSDD
IREGLLAVKQASIFSFSIGVLNADSAELQQIATDGSFAFTALDIHNLDALRELLLPNIVG
VAQRLILLEAPTIVTEVIEVNKKDIVFLIDGSTALGTGPFNSIRDFVAKIVQRLEVGPDL
IRVAVAQYADTVKPEFYFNTHQSRKDVMANVKRMKLMGGTALNTGSALDFVRNNFFTSAA
GCRMEEGVLPMLVLITGGKSRDAVDQAAAEMKRNRIVILAIGSRNADLAELQEIAHERDF
VFNPNDFRVQFMQAILPEVLSPIRTLSGGLIIQEPPSVQVTKRDIIFLLDGSLNVGNANF
PFVRDFVATLVNYLDVGSDKIRVGLVQFSDTPKTEFFLYSYQTKSDIIQRMGQLRPKGGS
VLNTGSALNFVLSNHFTEAGGSRINEQVPQVLVLVMAGRSADPFLHVSNELARAGVLTFA
VGVRSVDKAELEQIAFNPRMVYFMDDFSALAALPQELNKPITTYVSGGVEEVPLAPTESK
KDILFLIDGSANLLGSFPAVRDFVHKVISDLNVGPDATRVAVAQFSDTIQVEFDFAELPS
KQDMLLKVKRMKIKTGKQLNIGAALDEAIRRLFVKEAGSRIEEGVPQFLVLLVAGRSTDD
VEQPSDALKQAGVVTFAIKAKNADPVELERIVYAPQFILNVDSLPRISELQPNIVNLLKT
IQLQPTVVERGEKKDVVFLIDGSDGVRRGFPLLKTFVQRVVESLDIGRDKVRVAVAQYSN
VIQPEFLLDTHEDKADLISAIQELKVMGGSPLNTGAALDYLIRNVFTVSSGSRIAEGVPQ
FLILLTADRSQDDVRRPSVVLKTSGTVPFGIGIGNADLTELQMISFLPDFAISVPDFSQL
DTVQQVVSNRVIRLTKKEIESLAPDLVFTSPSPAGVKRDVVFMVDGSRYAAQEFYLIRDL
IERIVNNLDVGFDTTRISVVQFSEHPHVEFLLNAHSTKDEVQSAVRQLRPQGGQQVNVGE
ALEFVAKTIFTRPSGSRIEEGIPQFLIILSSRKSDDDLEYPSLQVKQVGVAPMVVAKNVD
PEEMVQISLSPDYVFQVSSFQELPSLEQKLLTPIETLTGDQIRRLLGDVQIREDISGDEK
DIVFLIDSSDSVRPDGLAHIRDFISRIVQQLEVGPNKVRIGVVQFSNNVFPEFYLKTHKS
KNAVLQAIRRLRLRGGSPVNAGKALDYVVKNYFIKSAGSRIEDGVPQHLIVILGDRSQDD
VNRPANVIVSSNIKPLGVGARNVDRDQLQIITNDPERVLVVQDFTGLPTLEKKVQSILEE
LPIPTTETPGYLVPGGKKQADIVFLLDGSINLGRDNFQEVIQFVYSVVDAIYRDGDSIQV
GLAQYNSDVTDEFFLKDYSTKPQILDAINKVIYKGGRVANTGAAIKHLQAKHFVKEAGSR
IDQRVPQIAFIVTGGKSTDDGPSASLEIAQKGVKVFAVGVRNIDLKEVSRLASESATGFR
VSTVQELSELNEQVLVTLEAAMEEKLCLGPTDVTRDCDLDVILGFDVTDVGPGQNIFNSQ
RGLESRVESVLNRITQMQKISCTGNQAPNVRVAIMAQARGGPVEGLDFSEYQPELFEKFQ
GMRTRGPYFLTADTLKSYLNKFRSAPSSSTKVIIHFTDGADDSIDQLEAASSALHTEGVN
ALIFVGLDRVSNFDKVMQLEFGRGFTYNRPLRVNLLDLDFELAEQLDNIAERTCCKVPCK
CSGQRGDRGVPGPIGPKGTTGDLGYGGYPGDEGGPGERGPPGVNGTQGFQGCPGHRGTKG
SRGFPGEKGELGEMGLDGIDGEEGDKGLPGSSGEKGYSGRRGDKGIKGERGERGDRGLRG
DPGDSGADNTQRGSRGQKGEIGPMGEPGPVGLNGQDGGVGRKGMTGRRGPIGVKGTKGSL
GQAGPAGEQGTRGPQGPPGQLGTPGIRGEQGIPGPRAGGGLPGTPGERGRIGPLGRKGEP
GNPGLKGPNGQQGPRGEMGDDGRDGIGGPGPKGRKGERGFLGYPGPKGGPGDRGGAGGPG
PKGNRGRRGNAGDPGTPGQKGEVGYPGPSGLKGDKGESRDQCALVRNIKDKCREYISPKE
CPVYPTELAFAIDTSSGVVRDVFNRMKQTVLRVVNNLTIAESNCPRGARVALVTYNNEVT
TEIRFADARKKSSLLQQIQNFQATLTTKPRSLETAMSFVARNTFKRARSGFLMRKVAVFF
SNGDTRASPQLNDAVLQLYDAGVMPVFLTSRQDPVLTRALEINNTAVGHAIVLPASGSQL
NETIRRVLTCHICLDVCDPDPICGFGSQRPVFRDRRAAPTDVDTDIAFIMDSSESTTPLQ
FNEMKKYISHLVSNLEISSEPGVSQHHARVAVLQQAPYEYETNSSFPPVKTEFSLIDYGS
KEKIINYLHNQMTQLHGTRAIGSAIKHTMAHIFESAPNPRDLKVMVLMMTGKVNKQELEN
LQKVVTDAKCKGYFFVVLGIGRKVNAKNIYSLASEPNDVFFKLVDKPGELHEEPLLRFGR
LLPSFIRSDFAFYLSPEIRKQCEGLQSDQQAKRQSPGHTGNKVVYAAPNATISRTLGAST
TVSASIKPVASTNTRARTTTASTTAQTRATGQTTASTTVQVNATTQSAASTATHTKATGR
TTTSTTAAAAGGRRRQGTKTHDIQITDVTENSARLRWASPEPHNSYAFDITVTLAHDHSL
VQKQNLTGTEHVIRGLRSGQKYLVVITGYQKSQPKVTYTGTFSTKTQAQPKVSLANMMLN
TEPLEGPESDWPDPCLLDFDMGMQCKDYQIVWFFDYKNKICSQGWYGGCGGNANRFEAEA
ECISKCLKPSAAEKAMQQPPLEKRLSSVTDICRLQKDEGTCRNFVLKWYYDPETKSCARF
WYGGCGGNENRFNTQKECEKVCVPGNINPGVVTTIGT
NT seq 9654 nt   +upstreamnt  +downstreamnt
atgaggaaacacagacatttgccctttgtggcaatgttttgcctcctactctcaggcttt
tgttcagttcatgcccagcagcaagcagctgtcaaaaccgttgctgtggctgatataata
tttctagtggattcctcttggagcattgggaaggaacacttccaactcgttcgagagttt
ctgtatgatgttgtaaaagctttagatgtgggaggaaatgattttcgctttgccctggtc
cagttcagcggaaacccccacacagagttccagttaaatacctacccctctacccgagac
gtcctgtcccatattgccaacatgccttacatggggggaggcaccaagactggaaaagga
ttagagtacctaatcgagaaccacctcactaaagctgctggaagcagagccagtgaaggc
atcccccaggttattatagtgttaacggacggacgatcccaggatgatgtggctctgcca
tcatctgtccttaaatcggcccatgtaaacatgtttgcggtcggcgtgcaggatgcagtg
gaaggggagttaaaggaaatagcgagcgaacccttcgatatgcaccttttcaacctagag
aattttaccgctctccaaggcatagttggagacttagtggcgagtgtccgttcatccatg
actccagaaaaggctggggccaaaggacttgttaaagacatcacagctcaagagtccgct
gaccttattttccttattgacggatcaaacaacatcggaagtgtcaattttcaagccata
cgtgatttccttgtaaatttgattgaaagcctcagagttggagctcagcaaatacacatt
ggggtggtgcagtatagtgatgaacccagaaccgagttcgctttgaacagctactccaca
aaagcggatgttctggatgccgtgaaggcacttagtttccgcggtggcgaggaagcgaac
actggtgcagcactggagtttgtggtggagaacctcttcacccaggcaggaggcagcagg
atagaggaagcggtgcctcagattttagtgctgataagtggtggagagtctagtgatgat
atccgagagggcttgttagcggtgaagcaagctagtatattttcgtttagcattggggtc
ctgaatgctgacagtgcagagctgcagcagattgctactgatggaagcttcgcatttact
gccctggatatccataaccttgatgctcttcgagaattattactgcccaacattgttgga
gtggcccaaaggctcattttgttagaggccccaaccattgtgactgaagttattgaagtg
aacaagaaggatatagtcttcctgatagatggctcaactgcattggggactggccccttt
aactcaattcgtgatttcgttgccaaaattgtccaaaggctggaggttggacccgacctg
atccgagttgcagtggcgcagtacgcagacactgtgaagccagagttttattttaacacc
caccagagcagaaaggatgtcatggctaacgtgaagagaatgaagctcatgggtggtacg
gcactcaacactggctcggcactggattttgtgaggaacaacttcttcaccagtgctgct
gggtgcaggatggaggaaggggtgctccctatgctggtactcatcactggtggtaagtcc
agggatgctgttgaccaagccgcagcagagatgaaaaggaaccggatagtgatccttgcc
attgggtcgagaaatgctgatctggcagaacttcaagaaattgcccatgagagagatttt
gtgttcaacccaaatgacttccgtgttcagttcatgcaagccattcttcctgaagtactg
tcacctatccggacactctctggaggactgattatccaagaaccaccatcagttcaagta
accaaaagggatatcatctttcttttggatggatcactcaacgttggaaatgccaacttc
ccctttgtgcgtgactttgttgccaccctagttaactaccttgatgtcggcagtgacaaa
atccgagttggcttagtgcaatttagcgacactcctaaaactgagttctttctatattca
taccaaaccaaatcagacataattcaacgtatggggcagctgaggcccaagggggggtca
gtgctgaacactggctctgcactgaactttgtgctttcaaatcacttcacggaagctggt
gggagcagaataaatgaacaagtgccgcaggtcctagtcctggtgatggcagggaggtct
gccgatcccttcttgcacgtttccaatgaattagctcgggctggagtgctgacttttgct
gttggagttaggagtgtggataaggcagaacttgaacagattgcctttaatccaagaatg
gtatattttatggatgatttcagtgctctggcagctttaccccaggagttaaataagcct
ataacaacatatgttagtggaggtgtggaagaggttccactcgccccaacagaaagcaag
aaagatattttattcctgattgatggctcagccaacctcttgggtagctttcctgccgtc
cgggactttgtacacaaagtcatttctgacctgaacgtgggtcccgatgccacacgagta
gctgtagctcaattcagtgacaccatccaagtcgaatttgactttgctgaactcccatct
aagcaagacatgcttctcaaagtgaaaaggatgaagataaaaaccgggaagcaactgaac
attggagctgcgctagatgaagctataaggaggctgttcgtgaaggaagctggaagcagg
attgaagaaggtgttcctcagtttttggtcctccttgttgctgggagatcaaccgacgat
gtggagcaaccatcagatgctctgaagcaagctggagttgtgacctttgctatcaaagcc
aaaaatgctgacccagtagagctggaaaggatcgtttacgccccacaattcattctgaat
gttgactccctccctcggatttcagaacttcagccaaacatagtcaacctgttgaaaact
attcagcttcagccaacagtagttgagagaggtgagaagaaggatgtggtgtttctaatc
gatggctcggatggtgtcagaagaggtttccctctactgaagacatttgtccaaagagtt
gttgaaagcctggatattggtcgtgacaaggtccgtgttgccgttgcacaatacagcaat
gtcatacaaccagaattcttacttgacacccatgaagacaaagcagatctcatcagtgcc
atccaggagctgaaggttatgggagggtctcctctgaacactggagcagcccttgactat
ctcatcaggaatgtgttcacggtgtcgagtggcagcaggatagcggagggtgtcccgcag
ttcctgattctgctgaccgctgaccggtcccaggatgatgtgaggaggccctcagtggtc
ctgaagacgagtggcacggtgccctttggcatcgggattggaaatgccgacctcacagag
ctgcagatgatttcttttctcccagattttgctatatctgtgcctgacttcagccagctg
gatacggtgcaacaggtcgtctcgaacagagtcatccggctgaccaagaaagagatagag
tcacttgcccctgatctggtttttacatcacccagcccagcgggtgtgaagagggatgtt
gtgttcatggttgatggctcccggtatgccgcccaggagttctatctgatccgtgatctc
attgagaggatagtgaacaacttggatgtgggctttgacaccaccaggatttcagtggtc
cagttcagcgagcatccccacgtggagttcctgctcaatgcccactccaccaaggatgag
gtgcagagtgcagtgaggcagctgcggccgcagggtggccaacaggtgaacgtaggggag
gcccttgagtttgtggcaaagaccatcttcactcggccctccgggagccgcatagaggag
ggcatccctcagtttttgatcatcctctcctcccgcaagtccgacgatgacttagagtac
ccatcactccaagtcaaacaagtgggcgtggcacccatggtagtagcgaagaatgtggat
cctgaggagatggtacagatttcccttagtcctgattatgtattccaagtctccagtttc
caggaacttcccagcctcgagcagaagctgctgactcctattgaaaccctgactggggac
caaatcagacggctgctgggagatgtgcaaatccgcgaagatatttctggtgatgaaaaa
gacatagtcttcctcattgacagctctgacagtgttaggcctgatgggcttgctcacatt
cgggatttcatcagcagaattgttcagcagctggaagttgggccaaacaaagtgcgaatt
ggcgtggtgcagttcagcaataatgtcttccctgagttctacttgaagacccacaagtcc
aaaaatgccgtcctgcaagccatccgccgcttgaggcttagaggggggtcccctgtgaat
gctggcaaagccctcgactacgtggtgaagaactacttcatcaagtctgcagggagcagg
atagaagatggagtgcctcagcacctaatcgtcatccttggagaccggtcccaggatgac
gtgaacaggcctgctaatgtgattgtttcatcgaacattaaacctctgggtgttggagcc
aggaatgtggacagggaccagctgcagatcattaccaatgatcctgagcgcgtgcttgtg
gtacaggacttcacgggactcccaactttagaaaagaaggtgcagagtattcttgaagag
cttccgatacctacaacagaaacccctggatatttagtacctggaggtaaaaagcaggct
gacattgtgtttttgctggatggctccatcaatctggggagagataacttccaggaagtt
atccagtttgtctactctgtggtggacgccatttatagggatggcgactctatccaggta
ggactggcccagtacaactcagatgtcaccgacgagtttttcctcaaggactactccacc
aaacctcagattttagatgccatcaacaaagtcatctacaaaggcggtcgagtggcaaac
actggcgcagctataaagcacctccaggcaaaacattttgtgaaagaggctggcagccga
attgaccagagggtgccccagattgccttcattgtcacaggtgggaagtccacagatgat
gggccgagtgcctccttggaaattgcacagaaaggcgtcaaggtatttgcggttggcgtg
agaaacatcgacctgaaggaagtcagcaggctggccagcgaaagcgctacaggtttcaga
gtgtccacagttcaggagctgtctgagctgaatgagcaggtcctggttacgctggaagct
gctatggaggagaaattgtgcctaggaccaacggatgttacaagagattgcgacttagat
gtgatacttggcttcgatgtcacagatgttggccccggccaaaatatattcaattcccag
agaggtctggagtccagagttgagtctgtgctgaacagaataacgcagatgcagaagatc
agctgcactggcaaccaggcacccaacgtcagggtagccatcatggcccaggctcggggg
ggacccgtggagggactggacttctccgagtaccagcctgagctcttcgaaaagtttcaa
ggcatgcgcacccgtgggccctatttcctcacagcagatacactgaagtcctacctgaac
aaattcagatccgcaccctccagcagcaccaaagtcataattcattttactgatggtgca
gatgactcgatagatcagctggaggctgcttcgtctgctttgcatacagaaggtgtcaat
gctctcatcttcgtcgggctggaccgagtgagcaattttgacaaagtgatgcagctggag
tttggtcgtgggttcacgtacaacagaccgctcagagttaatctgctggacttggatttt
gagctggcagagcagcttgacaacattgcagagaggacatgctgcaaggtcccatgcaag
tgctcagggcagagaggggacagaggtgtgccaggccccataggaccgaagggtaccaca
ggtgatctcggctatggaggctatcctggtgatgaaggaggaccgggtgagcgtggacca
ccaggtgtgaacgggacgcagggtttccagggatgccccgggcacagaggaaccaagggt
tctcggggattcccaggagagaagggtgaactgggagaaatgggtctagatggcattgat
ggagaagagggagacaaaggtttgcccggctcctctggagagaagggatactccggcagg
aggggtgataagggaattaaaggcgaacgaggagaacgaggtgaccgtgggctccgtgga
gacccgggagattcaggagctgataatactcaacggggatccagaggccaaaagggtgaa
attggaccaatgggagaaccaggaccggtggggctgaacggccaagacggtggagtcggg
aggaagggtatgacaggacgaaggggaccaataggagttaagggtaccaagggctcactc
ggtcaagcagggccagctggagaacaaggcacaagagggccacagggtcctcccggccag
cttggcacaccaggcataagaggagagcagggcattcctggacccagggccggtggtgga
ctaccaggaaccccaggagagcggggcaggatcggtcccctgggacgaaagggtgagcct
ggaaaccctggactcaaagggccaaatggacagcagggcccacgaggagagatgggtgat
gatggacgagatggaattggtggcccgggtcctaaaggcagaaagggagaacgtggcttc
ttaggctacccagggcccaagggtggacctggtgaccgcggtggtgctggaggccctggt
cctaaaggaaacagaggccgcagaggaaatgcaggagatcctgggacacccggtcagaaa
ggagaggttggataccctggaccatctggacttaaaggcgataaaggagaatccagagat
caatgtgcgcttgtgcgaaacatcaaagacaaatgccgtgagtacatcagtcccaaggag
tgccctgtctacccaaccgagctggcctttgctattgacacctcctcgggtgttgtgagg
gatgttttcaacaggatgaagcaaactgtgctcagagtggtcaacaacttaaccatcgct
gagagcaactgtcctcggggtgcccgggtggctctggtcacctacaacaatgaggtcacc
acagagatccgctttgctgacgccaggaagaaatcaagcctcctccagcagatccaaaac
ttccaggcgaccctgaccacaaagccacgcagcctcgagaccgccatgtcctttgtggcc
aggaataccttcaagcgtgcccgaagtggcttccttatgaggaaggtggctgtctttttc
agcaacggggacacaagagcctccccgcagctcaatgatgccgtgctgcaactctacgat
gctggggtcatgcccgtgttcctcaccagcaggcaggacccagttctcaccagagctctg
gagatcaacaacacagcggttggacatgcaattgttcttccagccagtggcagtcagctg
aatgaaactatccggagggtcttgacttgtcacatttgtctggatgtgtgtgaccctgat
cccatctgtggttttggtagtcagagacctgtattcagagacagaagggcagccccaaca
gatgtagacactgacatagccttcatcatggatagctcagagtccaccacccctctgcag
ttcaacgagatgaagaagtacatttcccacctcgtaagtaacctggaaatcagctctgag
ccaggagtatctcagcaccatgcaagagtggcagttctccagcaagctccttatgaatac
gaaaccaactcaagtttcccaccagtaaaaactgagttttcactaattgattatggatcc
aaagagaagatcattaattaccttcacaaccaaatgacccagctccatggcacgagggct
ataggaagtgcaattaaacacacaatggctcatatttttgaaagtgcaccaaaccctcgg
gatctaaaagtcatggttctgatgatgaccggaaaagttaataaacaagaacttgagaac
ctgcagaaggttgttacggatgcaaaatgcaaagggtacttctttgttgtcttgggcatt
ggcaggaaagtgaatgccaagaatatctatagcctagctagtgagccaaatgatgtgttc
ttcaaattggtcgataaaccaggtgagcttcatgaagagcccttactacgctttgggaga
ctgctgccgtcttttatcagaagcgactttgctttctacctgtctccagagataaggaaa
cagtgtgaggggctccagagtgatcagcaagccaagagacagagccctgggcacaccggg
aataaagtagtgtatgcagcacccaatgcaaccataagccggacccttggtgccagcacc
acagtcagtgcgagcatcaagcctgttgcgagcaccaacacccgtgccagaaccaccaca
gccagcaccacagcccagaccagagccactggccagaccacagccagcaccactgtccag
gtgaacgccaccacgcagagcgcagcaagcaccgccacccacaccaaagccaccggccgc
accaccacgagcaccacagccgccgcggcaggtggcaggagaagacaaggcaccaagacc
cacgacatccagatcacagatgtcaccgagaacagtgccaggctgcgctgggccagcccc
gagccacacaacagctacgcttttgatatcaccgtcaccttagcacatgaccactctctc
gtgcagaagcaaaacctgaccggcactgagcatgtcatccgaggacttagaagcggacag
aaataccttgttgtcattacaggctaccagaaatcccaacctaaagtaacctacacaggg
acattcagcacaaagactcaggcacagcccaaggtgtccctggccaacatgatgctcaac
accgagccgctggaagggcccgagagtgattggccagacccttgcttgctggactttgac
atgggcatgcagtgcaaggactatcagattgtgtggttctttgactacaaaaacaagatc
tgcagccagggttggtatggtggctgtggtggtaatgccaatagatttgaggccgaagct
gagtgcattagcaaatgcttaaaaccatcagcagctgagaaagccatgcagcagccaccc
cttgagaaaagattgtcatcagtcacagatatctgccggctgcaaaaagatgaggggacc
tgcaggaactttgtgctgaaatggtattacgaccctgagaccaagagctgcgcccggttc
tggtacggcggctgcggaggcaacgagaacagatttaacacacagaaagaatgtgaaaaa
gtttgcgtccctggtaacatcaaccctggcgtggtgacaacaataggaacatag

KEGG   Buteo buteo (common buzzard): 142032766
Entry
142032766         CDS       T11399                                 
Symbol
COL4A3
Name
(RefSeq) collagen alpha-3(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04382  Cornified envelope formation
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142032766 (COL4A3)
   04518 Integrin signaling
    142032766 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142032766 (COL4A3)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142032766 (COL4A3)
 09150 Organismal Systems
  09158 Development and regeneration
   04382 Cornified envelope formation
    142032766 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bbut04147]
    142032766 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:bbut00536]
    142032766 (COL4A3)
Exosome [BR:bbut04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   142032766 (COL4A3)
Glycosaminoglycan binding proteins [BR:bbut00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   142032766 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 142032766
NCBI-ProteinID: XP_074887928
LinkDB
Position
7:complement(19317980..19384166)
AA seq 1763 aa
MPAEDGRTDGGCRRCPRQETRGGEERGGEGWPGRPRPGQPRPLSPPQGAGSAEAGAKQQP
PGQKPRAGGGQSPRALGRAPRDAAQAAPSAGMAGRGCPSRLAPLLVLALLWPGRAAAAAE
SCVCKGKSPCFCDSMKGSKGEKGFPGVPGPPGHRGFPGPEGPPGPQGPKGSPGLTGLVGP
KGIRGLPGIPGFSGPPGLPGEPGTVGPPGRSGVPGCNGTKGDRGFPGPPGRRGAPGILGI
AGIKGEKGCPADGYNQGYGAKGDPGLPGMPGLQGAQGAQGYPGELGPQGPPGASGMPGRA
GPRGPKGLMGLKVIGTKGRKGDTGLTGPPGPPGTVIVTLSGPDNTTDLKGEKGEKGSRGL
PGPMGFTGPVGDSQSEKGDQGEPGAQGKPGKEGAPGPPGLPGQKGEQGRPGLAGRQGAKG
MKGNTGPPGACGRTEYYDSVIGEKGDEGPPGPPGPKGQRGMPGPQGPPGGAGRPGVSRPG
LRGPPGFPGPKGTKGEKGQTGLCIVGPPGLPGITGRPGSVGLPGPPGEPGRVTYRTGLPG
LPGEPGCTGPPGPPGDTGIKGDEAYVCTDCSYIPGIPGLPGSPGPQGQDGMPGREGTRGP
KGSPGSPGAPGFPGAQGPPGFRGDQGHKGSKGEPGYVYPEGPKGERGDPGARGDKGRKGS
SGFLGRPGSKGCKGSKGEEGLAGSKGDRGLPGLPGSLGLPGEMGLPGLPGYGPQGIRGFK
GSKGVPGPPGLPGESGLKGETTRITPLPASPGPQGPEGLPGPPGVPGRVGARGQPGDSGH
LGPTGDTGFPGLGFPGNPGPKGDKGLAGGRGSPGCEGKIGNPGRAGNLGQPGEKGDPGMV
IPGEPGRLGSPGGHGIPGSKGDTGFPGLQGLPGRAGHDGDDGLRGDRGSPGVPGDPGFPG
KPAECLTGLPGLPGGQGATGPPGGRGSRGSKGKLGPPGLSIPGIYGEPGEPGLIGLQGNT
GLPGAKGQSGRPGISGVPGSRGEPGIIGLLGIPGHPGMIGRPGNPGSRGSSGFPGLKGRK
GFLGAKGEPGDKGFPGPSETLVGIGNKGEQGLKGVQGRMGLRGEKGDAGVQGPPGLDGSE
GIPGLPGVKGFMGLPGIPGGQGFPGAAGNKGDMGIPGPKGQKGSPGFPGPSGDPGPQGYG
GFPGDKGDPGYPSAGPPGEPGPKGDPGPPGVLGSKGEKGSQGHPGHKGVPGPHGIRGETG
SAGIPGKLGPPGDTGVKGLQGHPGQQGITGPPGAVGHPGKNGLVGERGNQGQDGMPGRPG
VKGEPGAPGRGIPGLRGIPGRRGIKGDMGLPGFPGPPGMKGHQGDQGPIGPPGLVGLPGF
QGATGMAITGQKGNRGIAGADGRPGAPGFPGPPGLPTPSMKGSKGVRGADGIPGPTGPDG
DIGPPGPKGIEGRSGYPGARGDPGFLGFPGVKGEKGNQGPPGPQGAEGPRGPKGQRGPPG
VPGRIFSVPGSKGPPGLPGIPGTPGDQGIQGIPGLQGPKGVKGLPGRFGHPGQPGLPGPK
GDRGFPGQRGQPGLIGFPGLQGLPGSPGTITAGPTRRGFMFTRHSQSTKIPSCPHGTSQI
YVGYSLLFVQGNERAHGQDLGTAGSCLQRFTTMPFLFCNTNDVCSFASRNDYSYWLSTAS
VMPVDMAPISGRALEPHISRCVVCEGAAMVIAVHSQTTVVPACPEGWISLWKGFSFVMYT
SAGSEASGQALASPGSCLEEFRAIPFIECHGRGTCNYYTNSYSFWLASLNPRRMFRKPLP
QTLKAGELENIISRCQVCMKRPV
NT seq 5292 nt   +upstreamnt  +downstreamnt
atgcccgctgaggacggacggacggacggcggctgccggcggtgcccgaggcaggagacg
cgggggggagaggagaggggaggagaggggtggcccggccggccccgccccggccagccc
cggcccctgtccccaccccagggcgccgggagcgccgaggcgggtgcgaagcagcagccc
ccaggccaaaagccgcgggcgggcggggggcagagtccgagagccctgggccgagctccg
cgggacgccgcgcaggcagcgccgagcgcggggatggccggcaggggctgccccagccgt
ctggctcccctgctggtgctcgccctgctctggccgggccgcgccgccgcggcggctgag
agctgtgtatgcaaaggtaaaagtccctgcttttgtgatagcatgaaaggtagcaagggt
gaaaagggctttcctggtgttccaggtccacctggtcatagggggtttcctggtccagaa
gggcctccagggccacagggaccaaagggttctccagggcttacgggcttagttggaccc
aaaggtatacgaggactcccaggaatacctggattttcaggtcccccaggacttccaggt
gaaccgggaactgttggccctcctgggcgctcgggtgttccaggatgcaatggcacaaag
ggtgatcgaggttttccaggtcctccaggtagaagaggtgctccaggcattctaggtatt
gctggtatcaaaggagagaaggggtgccctgcagatggatataaccaagggtatggtgca
aaaggtgatcctggattgccaggaatgcctggacttcagggtgcccagggtgctcaagga
tatcctggggagcttggaccacagggtcctccaggagcttcaggcatgccagggagagca
ggacctcgtggacctaagggtcttatgggactgaaagtaataggaaccaaaggaagaaag
ggtgacactgggttaactggaccccctggaccaccaggaaccgttattgtgacattaagt
ggaccagacaacacaacggacttgaaaggagagaaaggagaaaaaggatcaagaggactt
ccagggccaatggggttcacagggcctgttggtgattctcaaagtgaaaagggtgaccaa
ggagaacccggagcacagggtaaacctggaaaagaaggtgccccaggtccacctggctta
ccgggacaaaagggtgaacagggcagaccaggactggctggcaggcaaggagctaagggg
atgaaaggaaacactggtccacctggagcttgtggtcgaacggagtattatgatagtgtt
attggtgaaaaaggagatgaaggaccccctggacctccaggaccaaaaggtcaacgtggc
atgcctgggccacagggtcctcccggaggtgctggtagaccaggtgtgtcaagacctggt
ctgagaggtcccccaggatttcctgggccaaaaggaacaaaaggagagaaaggacaaact
ggcttgtgtattgtaggccctccaggactacctggtattactggcagacctggttctgtt
ggattaccaggtcccccaggagaaccaggtagagtaacatatagaacaggcttaccagga
cttcctggagagccagggtgtacaggacctccgggacctccaggagatacgggcattaaa
ggtgatgaagcttacgtgtgtactgattgcagctatattccgggaatacctggtcttcct
ggttctcctgggccacaaggccaggatggcatgcctggtagggaaggaacaagaggtcca
aaaggttctccaggttctccaggagcaccggggtttccaggagctcaaggacctccaggt
tttcgaggagatcaaggacataaaggatcaaaaggtgagccaggatacgtatatccagaa
gggccaaagggtgagcgaggagatccaggggccagaggagataaaggaagaaaaggttca
agtggatttctgggaagacctggctctaaaggatgcaagggctctaaaggtgaagaggga
ttggctggttcaaaaggagacagaggactacctggattgcctggcagtcttggtcttcct
ggagagatggggcttcctggactgccaggatatggaccacaaggaataagaggtttcaaa
ggctctaaaggagtacctggtccacctggattacctggagaatctggtctgaaaggagag
accactaggataactccattacctgcatcgccaggacctcaaggaccagaaggtttacct
ggacccccaggagtgcctggtagggttggagcaagaggtcagccaggtgattcaggacat
ctaggaccaacaggtgacacaggctttccagggcttgggtttcctggaaacccaggtcca
aaaggagataaaggacttgcaggaggcagagggtcaccaggttgtgaaggaaagattgga
aatccaggccgagctggaaacctaggacaaccaggagaaaagggtgacccaggcatggtt
attcctggagagccaggtagactgggttcaccaggaggtcatggcattcctggctcaaaa
ggtgatactggatttccaggacttcaaggcctaccagggcgtgctggacatgatggtgat
gatggcctaagaggagatcgtggctcacctggagttcctggagatccaggttttccaggg
aaacctgctgaatgtcttactggcctgccaggtttgccaggtggccagggagctacaggc
ccaccaggaggaagaggttcacgaggttcaaaaggaaagctaggtcctcctggcttgagt
atcccaggaatctatggagagcctggggaacctggattaataggtcttcagggaaatacg
ggattacctggtgccaagggacagtctggacggccaggaatctctggcgtgccaggctca
aggggagagccaggtataataggcctgttaggaattcctggacaccctggcatgattgga
agaccaggcaacccgggtagcagaggttcttctggcttcccaggactgaagggaagaaaa
gggtttcttggagcaaaaggcgaaccaggtgataaaggttttcctggaccatctgagacc
ttggttggcattgggaataaaggagagcaaggcttaaaaggagttcaaggaagaatgggt
ctaagaggagaaaaaggtgatgctggtgtgcaaggtcccccaggtcttgatgggtctgaa
ggaatacctggtctaccaggtgtcaaaggatttatgggtcttccagggattcctggagga
caaggattcccaggtgcagcaggaaacaaaggggacatgggaattccaggtccaaaagga
cagaaaggatctccgggctttccaggaccatcaggtgaccctggtccacagggctatggt
ggctttccaggtgacaaaggagatcctgggtacccatctgctgggcctccaggagagcct
ggtccaaagggagatccaggccccccaggagttttgggcagcaaaggagaaaaaggatcc
caaggtcatccaggacataaaggagtaccaggaccacatgggatcagaggtgaaactgga
agtgcaggaatcccagggaagcttggacctcctggagatactggtgttaaaggactgcag
ggccacccaggacagcagggcattacaggccctcctggtgctgttggacaccctggaaaa
aatggacttgttggtgaaagaggtaatcaaggtcaggatggtatgcctggtcgcccagga
gtaaagggagaaccaggagcaccagggagaggaatccctggccttcgaggtattcctggt
cgtagaggaatcaaaggggacatgggcctgcctggttttcctgggccaccaggtatgaaa
ggtcatcagggtgaccaaggacccattggacctcctggcttagtggggttacctggattt
caaggcgcaacaggcatggcaattactggccaaaaggggaatagaggtattgctggtgca
gatgggagaccaggtgctccaggttttccagggcctcctggtcttcccactcccagcatg
aaaggaagcaaaggtgttagaggggcagatggcataccagggccaacaggcccagacggt
gacattggtcctcccggtccaaaaggaatagaaggtcgatcaggttatcctggtgctaga
ggggaccctggatttcttggattcccaggtgtaaaaggagaaaagggtaatcaaggaccc
cctggaccacaaggtgcagaagggccaaggggaccaaagggccagcgtggtccacctgga
gtccctgggagaatattctctgtcccaggaagcaagggtcctcctggactgccaggaatc
ccaggaacaccaggtgatcaaggaattcaagggattcctggactacaaggacctaaagga
gtaaaaggtctaccaggacgttttggtcatccaggtcaaccaggactgcctggtccaaaa
ggagacaggggatttccaggacaaagaggacaacctggactgattggctttccaggtcta
caggggttacctggatcaccaggtactatcactgctggtccaacaagaagaggttttatg
tttacccggcatagtcagtcaacaaagattccttcatgcccacatgggacatcacagatt
tatgttggctattcgctgctttttgtacaaggaaatgaacgagcacatggacaagatctt
ggaacagctggtagctgcttgcagagatttaccaccatgccattcttattctgtaacacc
aatgatgtttgtagttttgcttctcgcaatgactattcatactggctgtcaactgcatca
gtaatgccagtagacatggcaccgatttctggcagggcactagagccccatataagcagg
tgtgttgtctgtgaaggagctgcaatggtaatagcagttcacagccaaacaactgtagtt
cctgcatgtccagaaggctggatatctctctggaagggtttttcttttgttatgtataca
agtgccggctcagaagcttctggccaagcattggcatctcctggatcttgtttggaagaa
tttcgagccatcccattcatagagtgccatggcagggggacatgcaactactacacaaat
tcctatagtttctggttggcatcattaaatccaagaagaatgttcaggaaacctctacca
cagactttgaaagcaggagaactggaaaatatcatcagccgttgtcaggtctgcatgaag
agaccagtctaa

KEGG   Buteo buteo (common buzzard): 142032768
Entry
142032768         CDS       T11399                                 
Symbol
COL4A4
Name
(RefSeq) collagen alpha-4(IV) chain
  KO
K06237  collagen type IV alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04382  Cornified envelope formation
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142032768 (COL4A4)
   04518 Integrin signaling
    142032768 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142032768 (COL4A4)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142032768 (COL4A4)
 09150 Organismal Systems
  09158 Development and regeneration
   04382 Cornified envelope formation
    142032768 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bbut04147]
    142032768 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:bbut00536]
    142032768 (COL4A4)
Exosome [BR:bbut04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   142032768 (COL4A4)
Glycosaminoglycan binding proteins [BR:bbut00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   142032768 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 142032768
NCBI-ProteinID: XP_074887939
LinkDB
Position
7:19389420..19461353
AA seq 1681 aa
MALTKDSFERIKWLITAAWWLLIVFSAQEIDGGGYAYIEPCGGQDCSVCRCFPEKGSRGQ
PGELGAQGPIGSLGSTGPAGLPGEKGRRGENGQPGPAGEKGDKGPTGVPGFPGLDGVPGL
PGSEGPRGRRGLDGCNGSRGDPGFPGENGYVGPRGPYGNPGQKGEKGNPVYVSHFGKGPP
GDRGDPGPPGMPGPRGSRGTTGPSGYPGQPGLPGIPGYPGLPGEQGNPGIGVDGQKGEPG
DIGLPGPPGSPLLVGPPGAQLFKGEKGQKGLPGVTGYRGPRGPKGVLGRGEKGEKGIPGF
PGLRGNPGSYGPAGFPGMKGETGFAGFPGQPGYPGIQGDPGEKGPPGPPGAVGTLLLPVK
GPQGDPGFPGPAGDMGSVGPIGPAGLLGRPGDDGTSLPGLPGVSGAPGPQGFQGDPGFPG
TGESIPGRPGFPGPPGLPGQPGRQGLPGLPSIICTDRGIPGEPGAKGQIGLPGRKGEKGE
KGDQGLCSCSAGPPGPRGMQGPPGTQGKKGQMGYPGRHGEKGDTGLSGAVGSPGLPGTPG
SAGQHGEKGEKGDPGRVRIKGIKGERGPTGVPGFPGQRGNDGRDGELGLPGEKGAEGDSG
VPLPGDKGFPGVPGLPGVKGQMGLPGLGFPGPPGVRGSPGEFGDTGNIGLPGPKGQKGET
VCITLSYPGNPGPPGFKGVQGPKGLKGLPGHPGPNGFDGQKGHRGRPGAGIPGPEGFRGE
AGDPGDEGERGSTADGKNGPPGPPGIHGQKGVPGDTTYGPPGIPGNRGLPGPPGAQGARG
DPGIPGLQGPPGTPGFPGAKGFRGPEGDIGAPGCPGFPGPPCDTGLPGPPGLRGVMGLPG
PQGLPGFKGQIGDRGLAGPPGIKGLKGSPGSRGPPGPPGSRGLPGLPGNKGPPGFPGQTG
SKGIQGPQGFPGLPGTQGPMGISGVKGEEGKMGPPGPTGECGDMGIKGERGPPGDSGGVN
IRLEKGQKGEPGFPGENGFRGERGEKGNTGFRGNPGFPGKNGVPGLPGDHGDTGLMGFPG
SRGFPGPKGFKGITGFQGQPGDQGDIGLPGIPGNPGLTGPKGSKGKRGDPTPLLGTHGRK
GPPGDPGLPGLCGFPGEKGSPGIQGQPGRPGFKGDPGYPGIPGFPGATGPQGLPGEPGEK
GKHGILGPPGLQGLPGSQGRKGLPGLPGLDGLDGLKGQKGSAGAPGQSETGPPGYSGELG
PKGDRGEPGWPGISIPGPPGERGFPGFPGRRGPVGPTGPVGRYPDSASPGPPGDQGPPGL
DGIRGQPGNPGPPGETIFVRGDPGDSGIRGAPGNPGQRGQQGARGLPGNQGRGGPNGPMG
IHGPQGPPGAIGQPGDQGFQGKPGPRGPTGFIGDPGEPGKIDDSCPTIPGLPGEAGQRGD
DGSVGLPGPIGHPGPQGRKGEEGSCGLPGQEGLPGAPGPPGDQGNIGEQGYTGPQGPPGQ
TGIPGPPGPHIRSAGGFLLVLHSQSDREPLCPQGMPKLWTGYSLLYLEGQEKAHNQDLGL
AGSCLPVFNTMPFAYCNINQVCYYASRNDKSYWLSSAAPLPMMPLSEEEIQPYISRCAVC
EAPAQAVAVHSQDQSIPPCPMNWRSLWIGYSFLMHTGSGDQGGGQSLMSPGSCLEDFRSA
PFIECQGQRGTCQFFANEYSFWLTTVMPELQFASAPLSGTLKEGQEQRKKISRCQVCLKH
G
NT seq 5046 nt   +upstreamnt  +downstreamnt
atggcattaacaaaggattcatttgaacggattaagtggcttatcacagctgcctggtgg
ctgctgattgttttttcagcacaagaaattgacgggggaggatatgcatatattgagcca
tgtgggggacaggattgttcagtttgccggtgctttccagaaaaaggatcgagaggtcag
cctggtgagctgggagctcaaggtccgattgggtccttgggatctacaggaccagctggt
ctgccaggggaaaaaggacggagaggtgagaatgggcagcctgggccagctggagaaaaa
ggtgataagggcccgactggagtcccaggatttcctggtttagatggtgttcctggatta
ccaggaagtgaaggacccaggggcaggcgtggtttagacggctgcaatggttcaagagga
gatccaggatttccaggcgaaaatggttatgttggaccaagaggtccttatggaaatcca
ggacaaaaaggagaaaaaggaaatccagtgtatgtttcacattttggaaaaggacctcca
ggagacagaggtgatcctggaccccccggcatgcccggacctagagggtcaagaggtaca
acgggcccatctggatacccaggccaaccaggcttgcctggtattccaggttaccctggt
ttgccaggtgaacagggaaatccaggtattggagtggatggacaaaagggggagccgggt
gacattggacttcctgggccaccagggtcaccactgttagttggacctcctggagcccag
ctgttcaaaggagaaaagggccaaaaaggactacctggggtaacaggatacagagggcca
cgtggacccaaaggtgtacttgggagaggagaaaaaggtgaaaaaggcattcctggattc
cctggcttgcggggtaatcctggttcttatggacctgcaggttttcctggaatgaagggg
gaaacaggatttgctggttttcctggacaacctggctacccaggcatacagggggatcca
ggagaaaaaggacccccaggtcctcctggagctgttggaactctgctgcttcctgtaaaa
ggtccacaaggagatcctggatttccaggtcctgctggtgatatggggtcagttggacca
attggacctgcaggcctgctaggaagaccaggagatgatggaacaagtctgcctggcctg
ccaggagtaagtggggcaccaggacctcaaggttttcaaggtgaccctggtttccctgga
acaggtgaatcaatcccaggaagacctggatttcctggaccaccaggcctaccaggacag
ccaggaagacaaggcttacctggactacccagtataatctgtactgacagagggatccct
ggagaacctggagcaaaaggtcagataggccttccaggaagaaaaggtgaaaaaggagaa
aagggagatcaaggcctctgctcatgtagtgctggtccccctggtccccgtggtatgcaa
ggacctccaggtactcaaggaaagaaaggacaaatgggataccctggaagacatggagaa
aagggcgacacaggcttatctggtgcagtgggatcaccaggactccctgggacacctggt
tctgctggacaacatggtgaaaaaggagagaagggtgacccaggaagagttagaataaaa
ggaataaaaggtgaaagaggtcctactggggttccaggatttccaggccaaagaggtaat
gatggcagagatggggaactaggattgcctggggaaaaaggagccgagggtgattctgga
gtaccactgccgggtgataaaggattccctggggtcccaggacttcctggggtaaaaggg
cagatgggattgcctggtttggggtttcccggccctccaggagtcagaggtagccctgga
gagtttggggatactggtaacattgggctacctggtccaaagggccagaaaggtgagact
gtctgcattacgttatcataccctggaaatccaggtcccccaggttttaaaggtgttcaa
ggaccaaagggtttaaaaggacttcccggtcatcctggaccaaatggctttgatgggcaa
aaaggccatcgaggcaggccaggagcaggaatccctgggccagaaggctttcgaggcgaa
gctggtgatcctggtgatgaaggtgaaagaggatctacagctgatggtaaaaatggccct
ccaggtccacctggcatacatggtcagaaaggtgttccaggtgataccacatatggtcct
ccaggaatcccaggcaacaggggactgccaggaccccctggtgcacaaggagcaaggggt
gatccaggcattcctgggcttcaaggaccgcctggtactccaggatttccaggtgccaaa
ggttttagaggcccagaaggtgatataggagcaccaggctgtccaggcttccctggtccg
ccatgtgacacaggcttaccagggccaccaggactcagaggtgtcatgggcctgccagga
cctcaaggcttaccaggctttaaaggtcagatcggagacaggggcctagctgggccacct
ggtatcaaaggtctcaaaggctctcctggctcacgaggtcccccaggtcctccaggctcc
cgaggacttccaggacttccaggcaataaaggaccccctggtttcccaggacaaacagga
agcaaagggattcaaggaccccaaggattcccaggactcccaggaacacaaggcccaatg
ggaatctctggtgtaaaaggtgaagaaggaaaaatgggtccacctgggcctactggagaa
tgtggggatatgggaataaaaggtgaaagaggacctccaggagacagtggcggtgttaat
atccgtcttgagaaaggacaaaagggggaaccagggtttcctggtgagaacggatttaga
ggtgaaagaggtgaaaaaggtaatactggtttcagaggcaacccaggatttcccgggaag
aatggtgtcccaggactgccaggtgaccatggtgacactggactgatggggtttccagga
tcacggggcttcccaggaccaaagggctttaaaggaattacaggttttcaaggacaacca
ggtgaccagggtgacattggcttaccaggaatccctggaaatccaggactgactggccca
aaaggatctaaaggcaagagaggtgacccaactcctctacttggtacacatggtcgaaaa
gggcctcctggagatcccggattaccaggactctgtggattcccaggagagaagggttca
ccaggtatccaagggcaacctggaagaccaggattcaagggtgaccctggatacccagga
atacctggatttccaggagctacaggtcctcagggattgccaggagaacctggagaaaaa
gggaaacatggaattttgggacctccaggattgcaaggattacctggatcccaaggcaga
aaaggtcttcctggactacccggactggatggcttggatggactaaaaggtcaaaaaggt
tctgcaggagctccaggtcaaagtgaaacagggcccccaggatactcaggagaacttgga
ccaaaaggagacagaggtgaacctggatggcctggtatttctattccaggcccccctgga
gaaaggggattcccaggattcccaggtagaagaggacctgttggacccactggacctgtg
ggaagatatcctgatagtgcttctcctggtcctccaggtgaccaaggaccacctggttta
gatggcatcagaggtcagcctgggaatcctggtcctcctggagagactatctttgtgcga
ggagatccaggtgatagtggtattcgaggggcaccaggtaatcctgggcagcgagggcaa
caaggagccagaggtcttccagggaaccaaggaagaggaggcccaaatggtcctatggga
atccatggcccacagggtccacctggtgctataggacaaccaggagaccaaggttttcaa
ggaaaacctggtcccagaggtcccacaggttttataggtgaccctggcgaaccaggaaaa
atagatgattcgtgccctacaatcccagggcttcccggcgaagcagggcaaagaggagat
gatggctctgtaggtttaccagggccaataggccaccctggaccccaaggaagaaaaggt
gaagaagggtcatgtggcctgccaggtcaagaaggcttacctggggcacctggaccacca
ggtgaccaagggaatataggagagcaaggatacactgggccacaaggtccacctggccag
actggtatcccaggaccaccaggcccccatattagatctgcaggtggattcttgctcgtc
cttcacagtcagtcagacagagaaccactctgtccacaggggatgccaaaattatggact
ggctatagtctgctatacctagaaggacaagagaaagctcataatcaagatcttggtctg
gcaggatcttgtcttcctgtgtttaataccatgccctttgcttactgtaatatcaaccaa
gtttgttactatgccagtcgaaatgataagtcttactggttgtcaagcgctgctccttta
ccaatgatgccactttctgaagaagaaatacaaccatatataagcagatgtgctgtatgt
gaggccccagcccaagcggtagcagttcacagccaagatcagtcaattcctccctgccca
atgaactggaggagtctttggataggatattcttttttgatgcacactggatctggtgat
cagggtggtggacagtcccttatgtcacctggaagttgcctagaagacttcagatcagca
ccattcattgaatgccagggtcagcggggaacatgtcaattttttgctaatgaatacagt
ttctggctaacaactgtgatgcctgaattgcagtttgcatcagcccccctgtcaggaact
cttaaagaaggacaagagcagaggaaaaaaatcagtagatgccaggtatgcctgaagcat
ggctaa

KEGG   Buteo buteo (common buzzard): 142034674
Entry
142034674         CDS       T11399                                 
Symbol
COL1A1
Name
(RefSeq) collagen alpha-1(I) chain
  KO
K06236  collagen type I alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142034674 (COL1A1)
   04518 Integrin signaling
    142034674 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142034674 (COL1A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142034674 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:bbut00536]
    142034674 (COL1A1)
Glycosaminoglycan binding proteins [BR:bbut00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   142034674 (COL1A1)
SSDB
Motif
Pfam: Collagen COLFI VWC VWC2L_2nd TILa
Other DBs
NCBI-GeneID: 142034674
NCBI-ProteinID: XP_074892456
LinkDB
Position
9:43400834..43416191
AA seq 1453 aa
MFSFVDSRLLLLIAATVLLTRGQGEEDIQTGSCIQDGLTYNDKDVWKPEPCQICVCDSGN
ILCDEVICEDTSDCPNAEIPFGECCPICPDTDASPVYPESAGVEGPKGDTGPKGDRGLPG
PPGRDGIPGQPGLPGPPGPPGPPGLGGNFAPQMSYGYDEKAGGMAVPGPMGPAGPRGLPG
PPGAPGPQGFQGPPGEPGEPGASGPMGPRGPAGPPGKNGDDGEAGKPGRPGERGPPGPQG
ARGLPGTAGLPGMKGHRGFSGLDGAKGEPGPAGPKGEPGSPGENGAPGQMGPRGLPGERG
RPGPSGPAGARGNDGAPGAAGPPGPTGPAGPPGFPGAAGAKGETGPQGARGSEGPQGARG
EPGPPGPAGAAGPAGNPGADGQPGAKGATGAPGIAGAPGFPGARGPSGPQGPSGAPGPKG
NSGEPGAPGNKGDTGAKGEPGPAGVQGPPGPAGEEGKRGARGEPGPAGLPGPAGERGAPG
SRGFPGADGIAGPKGPPGERGSPGPAGPKGSPGEAGRPGEPGLPGAKGLTGSPGSPGPDG
KTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKPGERGAPGPPGAVGAAG
KDGEAGAQGPPGPTGPAGERGEQGPAGAPGFQGLPGPAGPPGEAGKPGEQGVPGDAGAPG
PAGARGERGFPGERGVQGPPGPQGPRGANGAPGNDGAKGDAGAPGAPGNQGPPGLQGMPG
ERGAAGLPGAKGDRGDPGPKGADGAPGKDGLRGLTGPIGPPGPAGAPGDKGEAGPPGPAG
PTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGETGDAGAKGDAGPPGPAGPTGAPG
PAGAVGAPGPKGARGSAGPPGATGFPGAAGRVGPPGPSGNIGLPGPPGPSGKEGGKGPRG
ETGPAGRPGEPGPAGPPGPPGEKGSPGADGPIGAPGTPGPQGIAGQRGVVGLPGQRGERG
FPGLPGPSGEPGKQGPSGSPGERGPPGPMGPPGLAGPPGEAGREGAPGAEGAPGRDGAAG
PKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPQGPAGPAGPAGARGPAGPQG
PRGDKGETGEQGDRGMKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAAG
KDGLNGLPGPIGPPGPRGRTGDVGPVGPPGPPGPPGPPGPPSGGFDFSFLPQPPQEKAHD
GGRYYRADDANVMRDRDLEVDTTLKSLSQQIENIRSPEGTRKNPARTCRDLKMCHGDWKS
GEYWIDPNQGCNLDAIKVYCNMETGETCVYPTQATIAQKNWYLSKNPKEKKHIWFGETMS
DGFQFEYGGEGSNPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDRDTGNLKKALLLQ
GANEIEIRAEGNSRFTYGVTEDGCTSHTGAWGKTVIEYKTTKTSRLPIIDLAPMDVGAPD
QEFGIDIGPVCFL
NT seq 4362 nt   +upstreamnt  +downstreamnt
atgttcagctttgtggattctcggttactgctgttgatagcagcgactgtactactcacc
cgcgggcaaggagaagaagacatccaaactggaagctgcatacaggatgggctaacgtac
aacgataaggatgtgtggaaacccgaaccctgccagatctgcgtctgcgacagcggcaac
atcctctgcgatgaggtgatctgcgaggacacctccgactgccccaacgccgagatcccc
ttcggagagtgctgccccatctgtcccgacaccgacgcctcccctgtctacccagaaagc
gctggagtagagggtcctaagggagacaccggccccaaaggagacaggggactccccggc
ccccctggcagagatggcatccctggacagcctggcctcccgggacccccaggccctcca
ggtcctccaggcctcggcggaaacttcgctcctcaaatgtcttatggctacgacgagaaa
gccggtggcatggccgtgcccggtcccatgggtccagctggtccccgcggtctccccggt
cctcctggtgctcctggtcctcaaggtttccaaggtccccctggtgaacccggagagcct
ggtgcttctggtcccatgggtccccgtggtccagccggcccccctggcaagaacggagat
gacggtgaagctggaaagcccggccgtcccggagagcgcggtccccccggtccccagggt
gcacgtggtctcccaggaaccgccggtctgccgggcatgaagggtcacagaggcttcagt
ggtctggatggtgccaagggtgagcccggtcctgctggccccaagggtgagcctggcagc
cccggagagaacggtgctcctgggcagatgggtcctcgtgggcttcccggcgagagaggc
cgtcccggtccatctggccctgctggtgctcgtggtaacgacggtgctcctggtgctgct
ggtcctcccggtccgactggccctgctggtccccccggcttccccggtgctgctggtgct
aagggtgaaactggtccccagggagctcgtggcagtgaaggtccccaaggtgcccgcggt
gagcccggtccccccggccctgctggcgctgctggtcctgctggcaaccccggtgctgat
ggtcaacctggtgccaagggcgcaaccggtgctcctggcattgctggcgctcccggcttc
cccggtgcccgcggtccctccggaccccagggtcccagcggtgcccccggtcccaagggt
aacagcggtgaacccggtgctccaggcaacaagggagacactggtgccaaaggcgaaccc
ggtcccgctggtgtccaaggtccccccggcccagctggcgaagaaggcaagagaggagct
cgtggtgagcccggccccgctgggcttcctggccccgccggcgaacgcggtgctcccggc
agccgcggtttccctggtgctgatggcattgctggtcccaagggtccccccggcgagcgt
ggctcccccggccccgctggccccaaaggatctcctggtgaagctggacgccccggggaa
cccggcctccctggtgccaagggtctgactggaagccctgggagtcccggtcccgacggc
aagactggcccccccggtcccgctggtcaagacggccgccccggcccccccggccccccc
ggagctagaggtcaagccggcgtgatgggtttccccggtcccaaaggtgctgcgggtgag
cccggcaaacccggcgagagaggtgctcctggtccccccggcgccgttggtgctgctggc
aaagatggtgaagctggtgcccaaggtcctcccggccctaccggtcccgctggagaaaga
ggtgaacaaggtcccgccggtgctcctggcttccagggtctgccgggccccgctggtccc
cccggtgaggctggcaagcccggcgagcagggtgtccccggagatgctggtgcccccggt
cccgccggtgccaggggtgagagaggtttccccggtgaacgcggtgtccaaggccccccc
ggtccccaaggtcctcgtggtgctaacggtgctcccggtaacgatggtgctaagggtgat
gctggtgctcccggtgcccccgggaaccaaggcccccccggtctgcagggtatgcccgga
gagcgtggtgctgccggcctgccaggcgccaagggtgacagaggcgaccccggtcccaaa
ggtgctgacggcgctcctggcaaagacggtctccgaggtctgactggccccatcggcccc
cccggccccgctggtgctcctggtgacaagggtgaagctggtccccccggtcctgctggt
cccactggtgcccgtggtgctcccggcgaccgcggcgagcccggccctcccggtcctgct
ggatttgctggccccccaggtgccgatggccagcctggtgctaaaggtgaaactggtgat
gctggagccaagggtgatgccggtccccccggccctgctggccccactggtgctcctggc
cctgccggtgctgttggtgctcccggtcccaaaggtgctcgcggtagcgctggaccccct
ggtgctactggtttccctggtgctgctggaagagttggtccccccggcccctctgggaac
atcggtctccccggcccccccggccccagcgggaaggaaggtggcaaaggaccccgcggt
gaaaccggccccgctggccgccccggtgagcctggccctgctggcccccccggccccccc
ggcgagaagggttctcctggcgctgacggccccatcggcgctcctggcacccccggaccc
caaggtatcgctggccagcgcggtgtcgtcggcctccccggacagagaggcgagagaggc
tttcccggtctgcctggcccctctggtgaacccggcaagcaaggtccctccggttcccct
ggcgagcgcggtcctcccggccccatgggcccccccggcttggctggaccccccggtgaa
gctggacgtgagggtgctcccggtgctgaaggtgcccccggtcgtgatggtgctgctggt
cccaagggtgaccgtggtgagactggccctgctggcccccctggtgctcccggtgccccc
ggtgcccccggccccgtcggtcctgctggcaagagtggagatcgcggtgagaccggtccc
caaggtcccgctggccctgctggtcctgctggtgctcgtggtcctgctggtccacaaggt
ccccgtggtgacaaaggtgaaactggtgaacagggtgacagaggcatgaagggtcacaga
ggcttctccggtctccagggcccacctggtcctcctggctctcctggtgaacaaggtcct
tctggtgcttctggtcccgccggtccaagaggtcctcccggctccgctggtgctgccggc
aaagatggtctcaacgggctgcccggccccatcggtccccccggcccccgcggtcgcacc
ggcgacgtcggccccgtcggtccccctggcccacccggcccccctggtcctcccggcccc
cccagcggcggcttcgacttcagcttcctgccccagccaccccaggagaaggcccacgac
ggcggacgctactaccgagccgacgacgccaacgtgatgcgcgaccgggacctggaggtt
gacaccaccctcaagagcctgagccaacagatcgagaacatccgcagccccgagggcacc
cgcaaaaaccctgcccgtacctgccgcgacctgaagatgtgccacggcgactggaagagc
ggcgaatactggatcgaccccaaccaaggctgcaacctggatgccatcaaggtctactgt
aacatggagacgggcgagacgtgcgtctacccaacccaggccaccatcgcccagaagaac
tggtacctcagcaagaaccccaaggagaagaagcacatctggttcggcgagacgatgagc
gacggcttccagttcgagtacggcggtgagggctccaacccagccgacgtcgccatccag
ctgaccttcctccgcctgatgtccaccgaggcctcccagaacatcacctaccactgcaag
aacagcgtcgcctacatggaccgggacaccggcaacctgaagaaggcccttctcctccaa
ggcgccaacgagatcgagatcagggctgaaggcaacagccgcttcacctacggcgtcacc
gaggacggctgcacgagtcacaccggcgcttggggcaagacagtcatcgagtacaagacg
acaaagacctctcgcctgcctatcatcgatttggctcccatggacgttggcgctccagac
caggaattcggcatcgacatcggccccgtctgcttcttgtaa

KEGG   Buteo buteo (common buzzard): 142038260
Entry
142038260         CDS       T11399                                 
Name
(RefSeq) uncharacterized protein LOC142038260 isoform X1
  KO
K06238  collagen type VI alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142038260
   04518 Integrin signaling
    142038260
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142038260
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142038260
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:bbut04147]
    142038260
   00536 Glycosaminoglycan binding proteins [BR:bbut00536]
    142038260
Exosome [BR:bbut04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   142038260
Glycosaminoglycan binding proteins [BR:bbut00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   142038260
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   142038260
SSDB
Motif
Pfam: Collagen Kunitz_BPTI VWA
Other DBs
NCBI-GeneID: 142038260
NCBI-ProteinID: XP_074899622
LinkDB
Position
12:complement(39649943..39659808)
AA seq 1288 aa
MGLRGVCFPAALLVLAAAGVRSRQAGPCTLHVLVALDVTDYQQSNLQPYLERVLRDLTAL
DHLTCSPLNLSLSLQSTQREGETLFQERLREPWMDVLQRLARAHAFQRSYLNQLALQSFL
GTLARQAADAKVLLVFTDGLDDDARKMKEEATSAWLQDQADLLVTVAVNNVTGLGDLQQM
EFGRWLAGGQHLTVDMPEVGGHVAQELLALAERTCCQMCPCTCVGLPGPRGPGGSHGEKG
VTGSKGRAGDEGEHGHSGEQGPRGLHGSRGMQGCPGQHGPKGFMGHPGEQVRPCLVVYPV
APAGTDGGQGHGQLAAAGRPRGRAGKRGTRGGEWQMVMMEGWDGLGCWGKWGKARTWLLH
PMTYNLAPDRVTSLQGPPGEAGYDGVDGEQGEAGTPGHPGEKGSRGRQGRKGSRGARGEK
GPPGPCGEVGPPGRTSPEPGTPGWRGDGGPQGDPGQEGPPGPPGPPGPPAHPTSCQKGQQ
GVQGKKGNRGSPGPEGQKGYGGAQGLPGLRGTNGVTGHPGRRGLQGLPGADGSQGAVGPA
GPKGDKGRAGDKGKKGSVGPRGPKGTLGENGCDRRGAPGRKGDKGARGLPGYPGAQGDGG
ERGSPGDKGARGLPGRRGHPGSRGEAGGRGSAGPPGEMGPKGSPGTPPSTPCELKAFIRR
SCVTTSPSCPLFPTELVLVLETSSTVSPALFSRMKELLALLLRDLQISPSGCPAGARVAM
LAYAATPTYLLRAGEMGSRTALLNRLRRLSPTRSSRRGRLAAAMRFVGHHVLKRVRPAAL
GRKVVLFVTSGRNQNLEGIGEVALQYEALGIVPAVLTFNPLPEVVRAFQVNSLFRVVQLS
AAEPAGDAAVLRDAVLPCILCFDLCHPESCTAAVPSPDPPDVDLALVVDNAAPGMLAERL
EAVGELFHHLLGRLQLTGPDLVQRGTRVALVLTGPSAPGQDLAEVPFGLPGSGEQLRERL
HLALVPRPAAASASGAVVWTLRHIFPQGSGDRLRVLFVVGTGAALLWDGEARQALAPFAP
CEDFGILVLSLGRAGTEQPEAAVPEALPAWRYHSLRLGSVHPPEMGYAERTALGFLRRLR
AESSQHPGTPGCPQKMPSAGTRTGKPPPGTPAQTPEVPTGMPTAPTSPGKGRRAAAVPGP
CTLDKDPGSACARFSVMWYHRRETGSCERFWYGGCGGNANRFGSEQDCIRACVDPGLDEA
GVGESNVTRAACLEARDPGPCHSFSPKWFFEGHQPGRCSLFWYGGCGGSRNRFESREQCE
AACLSPGRANPPAPHPVNASACTSAPGY
NT seq 3867 nt   +upstreamnt  +downstreamnt
atggggctgcgcggggtctgcttccctgcggctcttctcgtcctcgcagccgcaggggtc
cggagccggcaggcgggaccgtgcaccctccacgtgctggtggccctggatgtcaccgac
taccagcaatccaacctgcagccctacctggagcgggtcctgcgggacctgaccgcgctg
gaccacctcacctgcagcccgctcaacctgagcctcagcctgcagagcacgcagcgggaa
ggggaaacccttttccaggagcggctgcgggagccctggatggacgtgctgcaacgcctg
gcacgggcgcacgctttccagcgctcctacctcaaccagctcgccctgcagagcttcctc
ggcaccctggcgcggcaggcggctgatgccaaggtgctgctggtgttcacggatgggctg
gacgacgatgcgaggaagatgaaggaggaggcaacgtcggcttggctgcaagaccaggct
gacctgctggtgacggtggcggtgaacaacgtcacggggctgggggacctgcagcagatg
gaatttgggagatggctggcgggcgggcagcacctcaccgtggacatgccggaggtcggc
gggcacgtggcgcaggagctgctggccctggcagaacggacgtgctgccagatgtgtccc
tgcacctgcgtggggctgcccggtccacgaggccccggtggctcccacggggagaagggg
gtgacaggcagcaaggggcgtgctggagacgagggcgagcatggacacagcggggagcag
ggaccccgaggtcttcacggcagccgagggatgcagggatgcccagggcagcatggaccg
aagggcttcatgggtcaccccggggagcaggtacgtccctgcctcgtggtttatcctgtg
gctcctgcgggcaccgatggtggccaaggccatggacagctggcggcggcagggagacct
cgaggacgggcagggaagagagggacgcggggaggggaatggcagatggtcatgatggag
gggtgggatggcctggggtgctgggggaaatggggcaaagccagaacctggctattgcac
cccatgacatataacctggcaccagaccgtgtcacctccttgcagggacctcctggagaa
gccggttatgacggtgtggacggggagcagggtgaagcggggacccccggccaccccgga
gagaaggggtcccgggggaggcaggggcggaaaggttcccggggtgcccggggggagaag
ggtcccccaggcccttgcggcgaggtggggccacccgggaggacaagccccgagcccggc
accccagggtggagaggagatgggggtccccagggagaccccggccaggagggtcccccg
gggccaccggggcccccgggtccccctgctcacccgacctcctgccagaaagggcagcag
ggagtgcagggcaagaaggggaaccgtggcagccctggccccgagggacagaagggctac
ggaggtgcccaggggctgccggggctgcgggggacgaacggtgtcaccggtcacccaggg
cgcagaggacttcagggtctgcccggagcggatggctctcagggagcagtcggaccagct
ggtcccaaaggggacaaggggcgggcaggagacaagggcaagaaggggagcgtgggaccg
cggggacccaaaggcaccctgggtgagaacggctgtgaccggagaggtgctccagggagg
aagggtgacaagggtgctcggggcctgccgggataccccggggctcagggtgatgggggt
gagagaggatcccctggggacaagggtgcacgagggctgcccgggcgcaggggccatccc
gggagcagaggagaagcaggtggccgcggcagtgcgggacctccgggagagatgggccca
aagggctcgccgggcactcccccctccacgccttgcgagctcaaggctttcatccgccgg
agctgcgtcaccacctcgccctcctgcccgcttttccccaccgagctggttctggtgctg
gaaacctcatccaccgtatcgccagccctcttctcccgcatgaaggagctgctggccctg
ctgctgcgggacctgcagatctccccctcgggctgcccggcgggggccagggtggccatg
ctggcctacgcggccacccctacctacctgctgcgtgccggggagatggggagcaggacc
gcgctgctgaaccgcctccgccgcctctcgcccacccgttcctcccgccgcggccgcctg
gccgccgccatgcgtttcgtggggcaccacgtgctcaaaagggtccggccggccgccctg
ggcaggaaagtggtcctcttcgtcaccagcggccgaaaccagaacctggagggcatcggg
gaggttgctctgcagtacgaagccctgggcatcgtgccggcagtgctcaccttcaacccg
ctgcccgaggtggtgcgggctttccaggttaacagcctcttccgagtggtccagctctcg
gcggcggagccggctggcgatgcggcggtgctgcgggatgcggtgctgccctgcatcctc
tgcttcgatctctgccaccccgagagctgcaccgcagccgtgccgtcccctgacccgccg
gacgtggacttggccttggtggtggacaacgcagccccggggatgctggcggagaggctg
gaggctgtcggcgagctcttccaccatctcctggggcgcctgcagctcacggggccggat
ctggtgcagcgcggcacccgcgtcgccctggtgctgacgggaccctccgctcccgggcag
gatttggccgaggtccccttcgggctgcccggctctggggagcagctccgggagcgtctc
cacctcgccttggtgcccaggccggctgcggcatccgccagcggcgcggtggtgtggacc
ctccggcacatcttcccgcagggctccggagaccggctccgcgtcctcttcgtggtgggg
acgggggccgcgttgttgtgggacggggaggcacggcaggcgctggccccctttgccccg
tgtgaggattttggcatcttggtgctctccctggggcgagccgggacagagcagccggag
gcggccgtgccggaggcactgccggcatggcggtaccattccctgcgcctgggctcggtt
cacccacccgagatgggatacgcggagaggacggcgctgggcttcctccggaggctgcgg
gcggagagcagccagcaccccggcaccccggggtgcccccagaagatgccctccgccggc
accaggaccggcaagccccccccagggacccctgcgcagacccccgaggtccccactggg
atgcccacggcacccaccagccccggtaagggccggagggctgcggccgttcccggaccg
tgcaccctggacaaggaccccggcagcgcctgcgcccgattctccgtgatgtggtaccac
cggcgggagacggggtcctgcgagcgcttctggtacgggggctgcgggggcaacgccaac
cgcttcggcagcgagcaggactgcatccgcgcctgcgtggacccgggtcttgacgaagcc
ggcgtgggcgagagcaacgtgacgcgggctgcctgcctggaggcgagggacccgggaccc
tgccactccttctcgcccaagtggttcttcgagggtcaccaacccggccgctgctcgctt
ttctggtacgggggctgcggcggcagccgcaatcgcttcgagagccgggagcagtgcgaa
gccgcctgcctgtccccgggcagggcgaaccccccggccccccaccccgtcaacgcctct
gcctgcacctctgcgcccggctactag

KEGG   Buteo buteo (common buzzard): 142039679
Entry
142039679         CDS       T11399                                 
Symbol
COL9A1
Name
(RefSeq) collagen alpha-1(IX) chain isoform X1
  KO
K08131  collagen type IX alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142039679 (COL9A1)
   04518 Integrin signaling
    142039679 (COL9A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142039679 (COL9A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142039679 (COL9A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:bbut00535]
    142039679 (COL9A1)
Proteoglycans [BR:bbut00535]
 Extracellular matrix (ECM) proteoglycans
  Others
   142039679 (COL9A1)
SSDB
Motif
Pfam: Collagen Laminin_G_2
Other DBs
NCBI-GeneID: 142039679
NCBI-ProteinID: XP_074902530
LinkDB
Position
15:complement(4395174..4464867)
AA seq 919 aa
MKSNWKITAFFYLCSFLWSFISATIQQQSRLPVILSASQRTDLCPTIRIGQDDLPGFDLI
SQFQIEKAASQGIIQRVVGSTSLQVAYKLGPNVDFRIPTSAIYSNGLPDEYSFLTTFRMT
GATLQKYWTIWQIQDSSGKEQVGVNLNGQMKSVEFSYKGVDGSLQTASFLHLPFLFDSQW
HKLMISVEANSITLFIDCIKIESLNIKPKGKISIDGFAVLGKLKNNPEISVPFEIQWMVI
HCDPLRPQREGCGELPARISQTVIERGLPGPPGPPGPPGPPGVPGIDGIDGERGPNGPPG
PPGPDGDAGKPGSPGLPGEPGADGFTGPDGSRGATGPKGQKGEPGLPGARGFPGKGLLGP
PGPAGAAGLPGEVGRAGPPGDPGKRGPPGPPGPPGPRGTIGLQDGDPLCPNACPPGRPGH
AGLMGMKGQKGSKGESGEPGKQGYKGEEGDQGPSGEVGAQGPPGILGIRGITGIMGPKGT
KGARGLDGEPGPQGLPGAPGDQGQRGPLGEAGPKGERGPQGTRGINGLPGPKGESGLPGV
DGREGIPGMPGAKGEPGKPGAPGDAGLQGLPGLPGSPGVKGIPGPKGNRGPPGVPGLMGN
SGKPGEQGPEGEAGPTGPRGPPGSRGEPGPVGPPGLPGKWGPQGDIGLPGLPGPPGLPGG
KGDRGSVGEPGPKGEQGAPGAEGDGGEKGDLGDMGLPGAKGAVGNPGDPGSRGPEGSRGL
PGMEGPRGSPGPRGLQGEQGAPGLPGSQGPAGKEPTDQHIKQVCMRVMQEQLAQLAASLK
RPEFGAPGLPGRPGPPGAPGPAGENGFPGQLGPRGLPGLKGPPGEIGRKGPKGEPGERGE
RGFPGRGLKGYPGPRGLPGEPGKPSYGREGRDGERGLPGVAGQPGVPGPPGPPGPPGYCE
PSSCRMQAGQRAGKNTKGP
NT seq 2760 nt   +upstreamnt  +downstreamnt
atgaaaagcaactggaaaattacagctttcttttacctgtgtagttttctgtggtctttc
atctcagccaccatccagcaacaatcaagactaccagtcattctgagtgccagtcagaga
actgatctctgcccaacaatcaggattggccaagatgacttaccaggctttgacctgatt
tctcagttccaaatagaaaaagctgcttcccaaggaataatccagagagtagtgggctcc
acttctctccaagtggcttacaaattgggacccaacgtagacttcaggatcccaaccagt
gccatatattccaatggattgcctgatgaatattcgtttctaactacttttcggatgact
ggggccacgcttcagaaatactggactatttggcagattcaggattcttcaggaaaagaa
caagttggagtgaatcttaatggtcaaatgaaaagtgttgagttctcttataaaggagtg
gatggaagtctccagactgcatcatttttgcatttgcctttcttgtttgactcccaatgg
cacaaacttatgataagtgtggaagcaaacagcatcacactttttattgactgtattaaa
atagaatccttaaacataaaaccaaaagggaaaatcagcattgatggctttgctgtgctt
ggaaaacttaaaaataatcctgaaatttcagttccgttcgaaatccagtggatggtgatt
cactgcgaccccctgcgaccccagcgcgaaggctgtggcgagctgccggcccggataagc
cagacggtgatcgagagaggtctccctgggccaccaggccccccaggtccgccagggcca
ccaggagttcccggtattgatggcattgatggggagagaggtcctaacggcccccctggt
ccaccgggtccagatggggatgcaggcaaaccaggatccccaggcctgcctggagagcca
ggagctgatggattcacaggacctgatggctcacgtggagccacaggaccaaaaggtcag
aagggtgagccaggactgccaggtgctcgtggatttccaggcaaggggcttcttggacca
cctggtccagctggtgcagcaggacttcctggggaagtaggtcgtgctggtccacctgga
gatccaggaaaaagagggccaccagggccaccaggaccaccaggtcctcggggaacaatt
ggcctgcaagatggtgacccgttgtgtcccaatgcttgtccacctggccgcccaggacat
gctggcttaatgggaatgaagggacagaaaggttcaaaaggagagtctggtgaacccgga
aaacaaggttataagggtgaagaaggggaccaaggacccagtggtgaagtgggagctcaa
ggccctccaggcattctaggtatcagaggtataacaggtataatgggacctaaaggtacc
aaaggagctcgtgggcttgatggtgagcctgggccccaaggtcttcctggtgcacctggg
gatcaaggacagagaggtccactgggagaggcaggtccgaagggtgaaagaggccctcaa
ggtacaagaggaataaatggtctccctgggcctaaaggagagtctggcttaccaggagtt
gatggccgggaagggatccctggaatgcctggagcaaaaggtgaaccaggaaaacctgga
gctccaggtgatgcagggcttcagggactgccaggtttaccaggctctccaggtgtgaag
ggcattcctggcccaaagggtaatagaggtccccctggggtgccaggtttgatgggaaat
tctggtaaaccgggtgaacagggaccagagggagaagcaggtccaacaggaccccgagga
ccaccaggtagcagaggggagccaggtcctgtgggtcctccaggtctaccaggaaaatgg
ggtccccaaggtgatattggacttcctggactgccaggtcctccaggcctacctggtggt
aagggtgaccggggttcagtgggggaaccaggacctaagggtgaacaaggtgcacctgga
gcagaaggagatggaggagaaaagggtgacttaggtgatatgggattaccaggagcaaag
ggagctgttggtaatcctggagaccctggttcccgtgggcctgagggaagtcggggactg
cctggcatggaagggccacgaggttcacctggaccgcgaggcttgcagggtgaacagggt
gccccaggtctgcctggcagccaaggtccagctggaaaagaaccaactgatcagcacatt
aagcaggtttgcatgagagtcatgcaagaacaactagctcagctggcagccagtcttaaa
aggccggaatttggtgctccaggtcttcctggccgaccggggccaccaggtgctccaggg
cctgctggcgaaaatggcttcccaggacagcttggacctcgtggcttgcctggccttaaa
ggtccccctggtgagattggtcgtaaaggtcctaaaggtgaaccaggagaaagaggagaa
agaggatttccaggcagaggactgaaaggttaccctggaccaagaggtcttccaggtgaa
ccaggcaaacccagctacggcagggaaggccgtgatggtgaacgaggtctccctggggtg
gccggtcagcctggggttcctggtcctcccggccctcccggccctcctgggtactgcgag
ccgtcgtcttgcagaatgcaagctggacagagagctggtaagaacacgaaagggccgtga

KEGG   Buteo buteo (common buzzard): 142040058
Entry
142040058         CDS       T11399                                 
Symbol
COL9A2
Name
(RefSeq) collagen alpha-2(IX) chain
  KO
K08131  collagen type IX alpha
Organism
bbut  Buteo buteo (common buzzard)
Pathway
bbut04510  Focal adhesion
bbut04512  ECM-receptor interaction
bbut04518  Integrin signaling
bbut04820  Cytoskeleton in muscle cells
Brite
KEGG Orthology (KO) [BR:bbut00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    142040058 (COL9A2)
   04518 Integrin signaling
    142040058 (COL9A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    142040058 (COL9A2)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    142040058 (COL9A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:bbut00535]
    142040058 (COL9A2)
Proteoglycans [BR:bbut00535]
 Extracellular matrix (ECM) proteoglycans
  Others
   142040058 (COL9A2)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 142040058
NCBI-ProteinID: XP_074903383
LinkDB
Position
16:complement(2513094..2527452)
AA seq 725 aa
MSPLLLWCPAVQSGLLPYLSAARRGHCRSVPAALAMAGCSPLLQLWFLLQAACLCLAQIR
GPPGEPGPRGPPGPPGVPGVDGIDGDKGSAGAPGSPGAKGEPGAPGPDGPPGKPGIDGLT
GAKGEPGPIGGPGLKGQPGLPGPPGLPGPSLPGPPGLPGQVGLPGEIGVLGPKGDPGPDG
PRGPPGPPGKPGPPGHIQGLEGSADFLCPTSCPPGPKGPQGLQGLKGHRGRPGALGEPGR
QGRQGPKGDVGVSGEQGVPGPPGPQGLRGYPGMAGPKGETGPAGYKGMVGTIGAAGQPGR
EGPKGPPGDPGEKGELGGRGIRGPQGDIGPKGENGLPGIDGKDGTPGIPGVKGSVGQAGR
PGTPGHRGQAGLPGQPGSKGGPGDKGEVGARGQTGVTGAPGQIGEPGPRGEQGPQGVPGM
KGDRGERGLVGPPGEQGKTGPKGEQGPPGIPGPQGLPGVKGDKGSPGKTGPKGGTGDPGV
HGLAGLKGEKGESGEPGPKGQQGIQGELGFPGPSGDAGSPGPRGYPGPPGPRGLVGERGV
PGMPGQRGVAGRDAGDQHIVDVVLKMLQEQLSEVAVSARRAALGGVGAMGPPGPPGPPGP
PGEQGPHGPMGPRGVPGILGAAGQIGNVGPKGKRGEKGERGEAGRGHPGMPGPPGIPGLP
GIPGHALDGKAGERGLPGSPGEAGRPGLPGPAGLPGFCEPAACLGASAYAAARLTEPGAV
KGPIY
NT seq 2178 nt   +upstreamnt  +downstreamnt
atgagccccctcctcctctggtgtccggctgtgcagtcggggctgctgccatacctgagc
gcggcgagacgaggccactgccgctcggtgccggccgccctcgccatggccggctgctcc
cctcttctccagctctggttcttgctccaagccgcttgcctctgcctggcccaaattcga
gggccaccgggagaacccggcccacgaggtccccctggtcccccaggagtgccgggagtg
gatggcattgacggtgacaaaggctctgccggagcccccgggtccccgggtgccaagggt
gagcctggagcccctggtccagacgggcccccagggaagccgggcatcgacggcctgacg
ggagcgaaaggggagccggggcccattggtgggccaggacttaaaggccagcctggactc
ccagggccgccggggctccccggtccttcgctgccaggaccacccgggcttccaggccag
gtcgggcttcctggagagatcggagtgctgggacccaagggcgaccctggacccgacggc
ccacggggtcccccaggccctccagggaaacccggtcccccaggacacatccaaggtctg
gaaggcagcgccgatttcttgtgcccgaccagctgcccgccaggtcccaagggcccccaa
ggactgcagggactgaagggacacagaggccgtcccggtgccctcggggagcccggcagg
cagggcaggcagggacccaagggtgatgttggcgtctctggagaacaaggtgtcccaggc
cctccgggtccgcagggcctgaggggttaccccgggatggcaggacccaagggcgagacg
ggtcccgctggttacaaggggatggtcgggaccatcggagcggccgggcagccgggcagg
gaaggccccaagggaccacccggggaccctggtgagaagggagaactgggtggccgtggc
atccgaggcccccaaggagacatcggccccaagggcgagaatggtctcccgggcatcgat
ggcaaagacggcaccccgggtatcccaggcgtgaagggcagcgtgggacaggctggccgc
ccaggaacgccgggacaccgaggacaagccggcttgccgggccagccgggaagcaaaggt
ggtccaggagacaagggtgaagtgggtgctcggggccagaccggtgtcaccggtgccccg
gggcagatcggtgagcctggacctcggggtgagcagggaccgcagggcgtccccgggatg
aagggtgaccggggcgagcgtggcctcgtgggtccccccggcgaacaggggaaaacgggg
ccaaagggtgagcagggtccaccggggatcccaggaccccaaggcttgccaggagtcaaa
ggagacaagggctccccagggaaaactggccccaaaggcggtactggagaccccggtgtt
cacggcctcgcagggctgaagggagagaagggtgaatcaggagagccggggccgaaggga
cagcaaggcatccagggcgagctcggcttccccggcccctcgggggacgcaggctcaccc
ggcccacgaggatacccgggaccccccggcccacgaggactcgtcggggagcgcggcgtg
ccagggatgcccggccagcggggcgtagcgggccgggatgctggggaccagcacatcgtc
gacgtggtcctgaagatgctgcaagagcagctgtcggaggtggcggtcagtgccaggaga
gctgccctgggcggagtgggtgccatgggaccccccggaccccctggtcccccggggcca
cccggtgagcaaggtcctcatggacccatgggacctcgaggcgtccccggcatcctgggt
gctgctgggcagattggcaacgtaggacctaaagggaagcgaggcgagaagggtgagcgg
ggagaagctggacgtggccacccgggcatgccgggtcccccggggatcccaggtctcccc
ggcatccccggccacgcgctcgacggcaaggctggagaacggggcttaccgggctccccg
ggagaggctggccggccgggtttgcccggtcccgctgggctgccgggattttgcgaaccg
gccgcctgcctgggagcatcggcgtacgccgccgcacgcttgacggagccgggagccgtc
aagggacccatctactga

DBGET integrated database retrieval system