KEGG   Ursus arctos horribilis: 113241173
Entry
113241173         CDS       T05909                                 

Gene name
COL4A5
Definition
(RefSeq) collagen alpha-5(IV) chain
  KO
K06237  collagen, type IV, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113241173 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113241173 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113241173 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113241173 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    113241173 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113241173 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113241173 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113241173 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113241173 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113241173 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113241173 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113241173 (COL4A5)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   113241173 (COL4A5)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113241173 (COL4A5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 113241173
NCBI-ProteinID: XP_026334538
UniProt: A0A3Q7UEE1
LinkDB
Position
Unknown
AA seq 1669 aa
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCTGVKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGLKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGPAGPTGLPGPTGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGVKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDIGPTGPPGLVIPRPGTGVTVGEKGN
IGLPGLPGEKGQRGFPGIQGPPGLPGPPGTAVTGPPGPPGFPGERGQKGDEGPPGISIPG
SPGLDGQPGAPGLPGPPGPPGPHIPPSDEICEAGPPGPPGSPGDRGLQGEQGVKGDKGDT
CFNCIGTGVSGPPGQPGLPGLPGPPGSLGFPGQKGEKGHAGLTGPKGLTGIPGAPGPPGF
PGSKGDPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGIAFKG
ERGPPGNPGLPGLPGNRGPMGPVGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPSGPPGFPGQKGEPGLGLPGPPGPPGLPGFKGTLGPKGDRGFP
GPPGLPGRTGLDGLPGPKGIEIIATFQFAFNFHRSLHGIPGEKGDPGPPGFDVPGPPGER
GSPGIPGAPGPMGPPGTPGLPGKAGASGFPGAKGEMGIMGPPGPPGPLGIPGRSGVPGLK
GDDGLQGHPGPPGPAGEKGGKGEPGLPGPPGPVDPDLLGSKGEKGDPGLPGIPGVSGPKG
YQGLRGDPGQPGLSGQPGLPGPSGPKGNPGLPGKPGLTGPPGLKGSIGDMGFPGPQGVKG
SSGPPGVPGQPGSPGLPGQKGEKGDPGVSGIGLPGLPGPKGEPGLPGYPGNPGIKGSRGD
TGLPGLPGTPGAKGQPGLPGFPGTPGLPGAKGINGPPGNPGLPGEPGPVGGGGRPGPPGP
PGEKGNPGQDGIPGPAGQKGEPGQPGFGIPGPPGLPGLSGQKGDGGLPGIPGNPGLPGPK
GEPGFQGFPGVQGPPGPPGTPGPALEGPKGNPGPQGPPGRPGPPGFQGLPGPEGPRGLPG
NGGIKGERGNPGQPGQPGVPGLKGDQGPPGQQGNPGRPGLNGMKGDPGLPGVPGFPGMKG
PSGIPGSAGPEGDPGLVGPPGPPGLPGPSGQSIIIKGDVGPPGIPGQPGLKGLPGLPGPQ
GLPGPIGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGPPGPDGMQGPPGPPGTS
SIAHGFLITRHSQTTDAPQCPHGTVQIYEGFSLLYVQGNKRAHGQDLGTAGSCLRRFSTM
PFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMEPLKGRSIQPFISRCAVCEAPAVVIAV
HSQTIQIPRCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQVCMKRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcggcttgctatgggtgttctccaggatcaaaatgtgactgt
actggtgtaaaaggagaaaagggggagagaggatttccaggtttggaaggccatccaggt
ttgcctggatttccaggtccagaagggcctccagggcctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaagggatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccatgatggggccccgggacctcaaggt
atccctggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggacctccagggatcccaggtttgaagggagaaccaggcagt
ataattatgtcatcactgccaggaccaaagggtaatccaggatatccaggtcctcctgga
atacaaggcccagctggtcccactggtttaccagggccaactggtcccccaggaccacca
ggtttgatgggccctcctggtccaccaggacttccaggaccaaaggggaatatgggctta
aatttccagggacccaagggtgaaaagggtgagcaaggtcttcagggcccccctgggcca
cctggacagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtcct
ccaggtggtgtgaaaggtgagaagggtgagcaaggagagcctggcaaaagaggtaaaccg
ggcaaagatggagagaatggccaaccaggaattccaggtttgcctggtgatcctggttac
cctggtgaaccaggaagggatggagaaaagggccaaaaaggtgacattggcccaactggg
cctcctggacttgtgattcctaggcctgggaccggtgtaactgtaggagaaaaaggaaac
attgggttacctggcttgcctggagaaaaaggacagcgaggatttcctggaatacaaggt
ccacctggtcttcctggacctccaggaactgcagttacaggtcctcctggcccccctggc
tttcctggagaaaggggccagaaaggtgatgaaggtccacctggaatttctattcctgga
tctcctggacttgatggacagcctggggctcctggacttccagggcctcctggccctcct
ggccctcacatccctcctagtgatgagatatgtgaagcaggccctccaggccctccggga
tctccaggtgatagaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacacc
tgcttcaactgtattggaactggtgtttcagggcctccaggtcaacctggtctgcctggt
cttccaggtcctccaggatctcttggtttccctggacagaagggagaaaaaggacatgct
ggtctaaccggtcccaaaggattaacaggcataccaggagctccaggtcctccaggcttt
cctggatctaaaggtgatcctggtgacatcctcacttttccaggaatgaagggtgacaag
ggagagttgggttccccaggagcccctgggcttcctggtttacccggtacccctggacag
gatggactgccagggcttcctggccccaaaggagaacctggtggaattgcttttaagggt
gaaagaggtccccctgggaatccaggcttgccaggtctcccagggaatagggggcctatg
ggccctgtgggttttggccctccaggcccggtaggtgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggtgatccaggtcagaccataacc
cagccagggaagcctggcctgcctggtaacccaggcagagatggtgaagtaggtcttcca
ggtgaccctggacttcccggccagccaggcttgccaggaatacctggtagtaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggttttcctggaatt
ccaggaccgccaggagcacctgggacacctggaagaattggtctagaagggccttctggg
ccaccaggctttccaggacagaagggagaacctggacttgggctgcctgggccacctgga
cccccaggactcccaggtttcaaaggaacacttggtccaaaaggggatcgtggtttccca
ggacctccaggtcttccaggacgcactggcttggatgggctccctggaccaaaaggtata
gaaattattgctacttttcaatttgcttttaactttcatagaagcctgcatggaatacca
ggagagaagggagatccagggcctcctgggtttgatgttccaggaccccctggagagaga
ggcagtccagggattcctggagcacctggtcctatgggacccccaggaacaccagggctt
ccaggaaaagcaggtgcctctggatttccaggtgccaaaggtgaaatgggtatcatggga
cctccaggcccaccaggacctttgggaattcctggcaggagtggtgttcctggtctcaaa
ggtgatgatggtttgcagggtcacccaggacctcctggccctgcaggagaaaaaggtggt
aaaggagagcctggccttccaggcccacctggaccagtggatccagatctgctgggctca
aaaggagagaaaggggaccctggcttaccaggtattcctggagtttcagggccaaaaggt
taccagggtttgcgtggagacccagggcaacctggactgagtggacaacctggattacca
ggaccatcaggtcccaaaggtaaccctggtctccctgggaagccaggacttacaggacct
cctggacttaaaggaagcataggtgacatgggttttccaggacctcagggtgtgaaaggg
tcttctggacctcctggagttcctggacagcctggctccccaggattacctggacagaaa
ggagaaaaaggtgatcctggtgtttcaggcattggtctcccaggtcttcccggcccaaag
ggtgaacctggtctgcctggatatccaggaaaccctggaatcaaaggttctaggggagat
actggtttgcctggattaccagggacccctggagcaaaaggacaaccaggccttcctgga
ttccctggaaccccaggacttcctggagcaaaaggtattaatggtcctcctgggaaccct
ggccttccaggagaacctggtcctgtaggtggtggaggtcgtcctggaccaccagggcct
ccaggtgaaaaaggcaacccaggtcaagatggtattcctggaccagctggacagaagggt
gaaccaggtcaaccgggctttggaatcccaggaccccccggactcccaggactttctggt
caaaagggtgatggaggattacctggcattccaggaaaccctggccttcctggtccaaag
ggagaaccaggctttcagggtttccctggtgtgcaaggtcccccaggccctcctggtact
ccaggtccagctctggaaggccctaaaggcaaccctgggccccagggtcctcctgggaga
ccaggtcctccaggttttcaaggtctaccaggtccagaaggtccccgaggtctccctgga
aatggaggtattaaaggagagagaggaaacccaggccaacctgggcaacctggtgtgcct
ggtttgaaaggagatcaaggaccaccaggacaacagggtaatcctggtcgaccaggtctc
aatggaatgaaaggagatcctggtctccctggtgttccaggatttccaggcatgaaggga
cccagtggaatacctggttcagctggccctgagggggatccaggacttgttggcccccca
ggtccccctggattacctggtccttcaggacagagtattataatcaaaggagatgttggt
ccaccagggatcccaggccagcctggattaaaaggtctaccaggactaccaggacctcaa
ggcttaccaggtccaattggccctccaggagatccaggacgcaatggactccctggcttt
gatggtgcaggagggcgcaaaggagacccaggtttgccaggccagccaggtacccgtggt
ttggatggtccccctggaccagatggaatgcaaggtcccccaggtcctccaggaacttcc
tctattgcccatggattcctcatcacacgtcacagccagacaacagatgcaccacaatgc
ccacatggaactgtacaaatttatgaaggcttttctctcctgtatgttcaaggaaataaa
agagctcacggtcaagacttggggacggctggcagctgccttcgtcgcttcagtaccatg
cctttcatgttctgcaacatcaataacgtttgcaactttgcttcaagaaatgactattct
tactggctgtccaccccagagcccatgccaatgagcatggagcccctgaagggccggagc
atccagccatttattagtcgatgtgcagtatgtgaagctccagccgtggtgatcgcagtt
cacagtcagaccatccagattccccgttgtcctcagggatgggattctctctggattggc
tattctttcatgatgcacacaagcgcaggagcagagggctcaggccaagccttggcctcc
cctggttcctgcttggaagagtttcgttcggctcccttcattgaatgtcatgggcggggt
acctgtaactactatgccaattcctacagcttttggctagcaactgtagatgtgtcagac
atgttcagcaaacctcagtcagaaacgctgaaagcaggagacttgaggacacgtattagc
cgatgtcaagtgtgcatgaagaggacataa

KEGG   Ursus arctos horribilis: 113241234
Entry
113241234         CDS       T05909                                 

Gene name
COL4A6
Definition
(RefSeq) collagen alpha-6(IV) chain
  KO
K06237  collagen, type IV, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113241234 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113241234 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113241234 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113241234 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    113241234 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113241234 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113241234 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113241234 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113241234 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113241234 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113241234 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113241234 (COL4A6)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   113241234 (COL4A6)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113241234 (COL4A6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 113241234
NCBI-ProteinID: XP_026334595
UniProt: A0A3Q7U2T4
LinkDB
Position
Unknown
AA seq 1706 aa
MHPGLWLLLVTLCLTEELAGAGEKKSYGKPCGGQDCRGGCKCFPEKGARGRPGPIGIQGP
TGPQGFAGPSGLAGLKGERGSPGPLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGP
PGLDGCNGTQGAVGYQGPDGYPGLLGPPGLPGQKGSKGEPVLAQGTFKGMKGEPGLPGLD
GINGPPGAPGSPGAIGPMGPPGLQGPPGPPGFPGPDGNMGLGFQGEKGVKGDVGLPGPAG
PPPSTGELEFMGFPKGKKGSKGEPGLPGFPGISGPPGLPGFGTAGEKGEKGIPGLPGPRG
PMGLEGLQGPPGSQGKKGSSGFPGLNGFPGIKGEKGGIGLPGPDVFIDTDGAVISGYPGD
PGMPGLPGLKGDEGIQGLLGPSGVPGLPALPGAPGALGPQGVPGLKGDQGNPGRTTVGTA
GSPGRDGLPGLPGLPGPPGPAFETGTLQNAEPGFPGLRGERGPKGNPGLKGMKGDSGFCA
CDGGVPNTGPPGEPGLPGPPGLIGLPGLKGTRGDPGSGGAQGPSGAPGLFGPPGRTGPKG
EKGEQTLSSGSGVPGEQGDPGPQGLPGENGAPGRDGIPGLPGLPGLPGDGGQGFPGEKGL
PGLPGEKGHSGPTGPPGIGLPGSPGPRGFPGDQGIDGLPGQQGLPGLPGDCCCRESTGKG
DLVTEGGITLPCIIPGSYGPSGLPGTPGFPGPKGARGLPGTPGQPGLRGNKGEPGSPGSV
HLPELPGFPGPRGEKGLPGFPGLPGKDGLPGNLGSPGLPGSKGAPGDIFGAENGAPGEQG
LQGLPGDRGVPGDSGLPGPKGLLGKSGLLGPKGERGSPGTPGHMGQPGSPGPDGLFGVKG
KPGLPGAPGFPGLSGHPGKKGVRGEAGAPSSAGKRGLPGLKGLPGSPGLVGFLGSSGLPG
NTGLPGLPGPKGEKGSVGLAGFPGMPGLPGIPGASGLKGISGSAGRVGPSGQAGGAGEKG
DRGDPGPAGIPSPRPPMLNLRFKGDKGSRGSAGLDGFPGPRGDKGEAGPPGPPGLPGAPG
FPSTIKGLIGRAGLPGSTGQRGLPGLKGSPGITGFPGIPGESGSQGLNGAPGLPGTSGLP
GSKGDQGQTLGISGSPGPKGQPGESGFKGMKGTDGLVGDVGFPGSKGEDGNVGISGDVGL
PGSPGLPGIAGMRGNPGFPGSPGHPGATGPLGSSGLMGTKGFPGLPGLHGLNGLPGTKGT
HGTPGPSITGVPGPAGLPGPKGERGSLGSGLGAPGKSGMKGQKGGRGFPGLQGPAGLPGA
PGLSLPSVIVGQPGDPGRPGLDGERGRPGSPGPPGPPGPSSDQGDPGDPGFPGVPGPQGP
KGDQGIPGFSGFPGELGLKGVRGEPGFMGIPGKVGPPGDPGLPGMKGKAGPRGFAGPRGA
PGQTPIAEAVQVPPGPMGLPGIDGIPGLTGDPGVQGPVGFQGSKGLLGIPGKDGLNGLPG
PPGALGDPGLPGLQGPPGFEGAPGKKGPFGRAGAPGQSVRVGYTLVKHSQSEQVPVCPIG
MSQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIYCNIDEVCHYAGRNDKSYWL
STTAPIPMMPVGQAQIPQYISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLM
HTAAGAEGGGQSLVSPGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGE
EPASETLKAGQLHTRVSRCQVCMKSL
NT seq 5121 nt   +upstreamnt  +downstreamnt
atgcaccctgggttgtggcttctcctggttacgttgtgcctgacggaggaactggcagga
gccggtgagaagaagtcctatggaaagccatgtgggggccaagactgcaggggaggctgt
aaatgctttcctgagaaaggagcgagagggcgaccgggaccaatcggaattcaaggtcca
acgggtcctcaaggatttgccggccctagtggtttagccgggttgaaaggagaaaggggc
tccccagggcccctgggaccatatggaccaaaaggagataagggtcccatgggagttcct
ggctttctcggcatcaatgggattccgggccaccccggacagccagggcccaggggccca
cctggcctggatggctgtaatggcactcaaggagctgttggatatcagggccctgatggc
tatcctgggcttctcggaccacccgggcttcctggtcagaaaggctctaaaggtgaacct
gtccttgcccaaggtactttcaaaggaatgaagggggagcctggactgcctggactggat
ggaatcaatggtccaccaggagcacctgggtccccaggagcgataggacccatgggacca
ccaggattgcagggtcctccaggtccccctggcttccctggtcctgatgggaatatgggg
ttaggtttccaaggagagaaaggagtcaagggggatgttggcctcccaggccctgcagga
cccccaccctctaccggggagctggaatttatgggatttcccaaagggaagaaaggatcc
aagggtgaaccagggcttccgggtttcccaggaataagtggccctccaggtctcccagga
ttcggaactgctggagaaaagggagaaaagggaatccctggtttgccaggacctaggggt
cccatgggtttagaaggactccaaggccctccagggagtcagggcaagaaggggtcctca
ggtttccctgggcttaatggattcccaggaattaagggtgaaaagggtggcattggcctg
ccgggcccagatgttttcatcgatacagatggtgctgtgatctcaggttatcctggagac
cccggtatgccaggcctcccaggacttaaaggagatgaaggcatccagggcctgcttggc
ccttctggcgtccccggcttgccagctttaccaggtgccccaggtgccctagggccccaa
ggagttccaggtctgaagggggaccaaggaaacccaggccgcaccacagttgggacagct
gggtcccctggcagagatggtttgccaggcctgcccggcctcccggggccccccggtcca
gcatttgagactggaaccctacagaacgcagagccaggcttccctggtctccgaggagaa
cgaggtccaaaaggaaacccaggcctcaaaggaatgaaaggggactcgggcttttgtgct
tgcgacggtggtgtccctaacactggaccacctggggagccaggcctgcctgggccaccc
ggcctcataggccttccaggccttaaaggaaccagaggagatccaggctctgggggtgca
cagggcccatcaggggctccaggtttgtttgggcctcccggtcgcacaggccccaaagga
gagaaaggggaacagactctcagttcaggatcaggggtgccgggggagcagggtgatcct
ggcccccaggggctccctggggagaacggagccccaggcagggatggaatacccggttta
ccaggtctgccaggcctcccgggtgacggtggacagggcttcccaggtgaaaaggggtta
ccaggacttcctggtgaaaagggccacagtggtccaactggccccccaggaattgggctg
ccgggatctcctggacctcgtgggtttcctggagatcaaggaatcgatggattaccaggg
caacaaggcctccctgggcttcctggtgactgctgctgcagggagagcactggtaaagga
gacttagtcacagagggagggatcaccttgccgtgtataattcctggctcctatggtcca
tcaggcttaccaggaactcccggattcccaggccctaaaggggcccgtggcctccctggg
accccaggccagcccggactgcgtgggaataaaggagagcctggaagtccaggatcggtt
caccttcctgaattgccaggatttcctggacctcgtggggagaagggcttgcctgggttt
cctgggctccctggaaaagatggcttgcctgggaacctgggcagtccaggtttacccggt
tccaagggagcccctggtgacatctttggtgcagagaatggtgctccgggggagcaaggc
ctacagggattgccaggggacagaggagttcctggagactctggccttccagggcccaag
ggtttgcttgggaagtcgggcctgctaggccccaaaggtgaacggggcagccctggcaca
ccgggccacatgggacagccagggtccccagggcctgatggtttattcggcgtcaagggc
aaacccgggctcccgggtgcaccaggctttccaggcctttcaggacatcctggaaagaaa
ggtgtaagaggtgaggcaggtgccccttcatcagctggaaagagaggcctgccggggctg
aaaggccttccgggatctccagggctggttggcttcttggggagctcaggcttgccaggg
aacactgggttgccaggcctgccaggtccgaagggtgagaaggggtctgttgggctggca
ggctttccggggatgcctggtctcccaggtattcctggcgcgagtggattaaagggaatt
tctgggtcagccggaagagtgggaccatctggacaggccggtggtgctggtgagaaagga
gacagaggcgaccccgggccagctggaatacctagcccaagacctcccatgctgaacctc
cggttcaaaggggacaaaggatccagaggctcagctggattggacggatttcctgggccc
agaggtgacaaaggagaggctggccccccggggccaccagggttgcctggagctcctggc
ttccccagtaccatcaagggactcattggcagagctggcctccctggctccactggacaa
cggggcttacctggcctgaaggggtcccctggaatcacaggcttcccaggaataccaggg
gaaagcggttcacagggtctcaatggagcacctggactcccaggaacatctggtctccca
ggttcaaagggagatcaaggtcagacacttggaatttctggtagcccaggacccaaggga
caacctggggagtctggttttaaaggcatgaaaggaacagatgggcttgttggtgatgtg
ggtttcccaggaagcaaaggtgaagatgggaatgttggtatttctggagatgttggcctt
cctggctccccaggactccccgggattgcaggcatgagaggaaatccagggtttccgggt
tctccaggccatccaggggcaactgggcccctgggatcatctggcctaatgggaaccaaa
ggtttccctggacttcctggtttacatggactgaatgggcttccaggaaccaaggggacc
catggaactccaggacctagtataaccggcgtgcctgggccagctggcctgcctggtccc
aaaggggaaagaggttctctaggaagtggccttggagccccagggaagtcaggcatgaaa
ggacaaaaaggtggccgaggtttcccaggtctccagggccctgctggtctgcccggtgcc
ccaggcctctccttgccctcagtcatagtgggacagcctggcgaccctgggcgaccaggc
ctagatggagagcgaggccgcccaggctcccccgggcccccagggccccctgggccatcc
tccgatcaaggtgaccctggagaccctggcttccctggagttcctggccctcaagggccc
aagggagaccaaggaattccaggtttctccggcttccctggagagctaggactgaaaggc
gtgagaggtgagcctggcttcatggggattccaggcaaggttgggccacctggagaccca
ggacttcccggaatgaaggggaaggcagggcccagaggctttgccggaccccgaggtgct
cctggacaaacaccaattgcagaagctgtccaggttcctcctggacccatgggtctaccg
ggcatcgatggcatccctggcctcacgggggaccctggggttcaaggccctgtgggcttc
caaggctccaaaggtttactgggcatccctggcaaagatggtctcaacgggctccccggc
ccgcctggggctctcggtgatcctggtctccctggactgcaaggccctccaggatttgaa
ggagctccagggaagaagggtcccttcgggagggctggagcgcccgggcagagcgtgaga
gtggggtacaccttggtgaagcacagccagtcggaacaggtgcccgtgtgccccatcggg
atgagccagctgtgggtgggttacagcttgcttttcgtggaggggcaggagaaagcccac
aaccaggacctgggctttgctggctcctgcctgccccgcttcagcaccatgcccttcatc
tactgcaacatcgacgaagtgtgccactacgctgggcgcaacgataaatcctactggctc
tccaccaccgcccccattcccatgatgcccgtcggccaggcccagatcccccagtacatc
agccgctgttccgtgtgtgaggcgccctcgcaggccattgccgtgcacagccaggacatc
accatcccgcagtgccccctgggctggcgcagcctctggatcgggtactccttcctcatg
cacactgcggcgggcgccgagggcggcggccagtcgctggtctcccccggctcctgcctg
gaggatttccgcgccacgcccttcatcgagtgcagcggggcccggggcacctgccactac
ttcgccaacaagtacagtttctggctgaccacagtggaagagaggcagcagtttggggag
gagccggcatccgaaacgctcaaggccgggcagctccacacccgggtcagccgctgccag
gtgtgtatgaaaagcctgtag

KEGG   Ursus arctos horribilis: 113242050
Entry
113242050         CDS       T05909                                 

Gene name
THBS1
Definition
(RefSeq) thrombospondin-1
  KO
K16857  thrombospondin 1
Organism
uah  Ursus arctos horribilis
Pathway
uah04015  Rap1 signaling pathway
uah04115  p53 signaling pathway
uah04145  Phagosome
uah04151  PI3K-Akt signaling pathway
uah04350  TGF-beta signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05144  Malaria
uah05165  Human papillomavirus infection
uah05205  Proteoglycans in cancer
uah05206  MicroRNAs in cancer
uah05219  Bladder cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04015 Rap1 signaling pathway
    113242050 (THBS1)
   04350 TGF-beta signaling pathway
    113242050 (THBS1)
   04151 PI3K-Akt signaling pathway
    113242050 (THBS1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113242050 (THBS1)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    113242050 (THBS1)
  09143 Cell growth and death
   04115 p53 signaling pathway
    113242050 (THBS1)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113242050 (THBS1)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    113242050 (THBS1)
   05205 Proteoglycans in cancer
    113242050 (THBS1)
  09162 Cancer: specific types
   05219 Bladder cancer
    113242050 (THBS1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113242050 (THBS1)
  09174 Infectious disease: parasitic
   05144 Malaria
    113242050 (THBS1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:uah04131]
    113242050 (THBS1)
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113242050 (THBS1)
Membrane trafficking [BR:uah04131]
 Endocytosis
  Phagocytosis
   Opsonins
    113242050 (THBS1)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113242050 (THBS1)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC cEGF EGF_3 TSP1_ADAMTS EGF_CA TSP1_CCN Laminin_G_3 Laminin_G_2 EGF Laminin_G_1
Other DBs
NCBI-GeneID: 113242050
NCBI-ProteinID: XP_026335835
UniProt: A0A3Q7U6F0
LinkDB
Position
Unknown
AA seq 1170 aa
MGLAWGLSVLFLLHACGSSRIPESGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFR
IEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLAIERKDHSGQVFSVV
SNGKAGTLDLSLTVQGMQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELD
VPIQSIFTRDLASVARLRVAKGGVNDHFQGVLQNVRFVFGTTPEAILRNKGCSSSTNVLL
TLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRK
VTEENKELAIELRRPPLCYHNGVQYRNNEEWTVDSCTECRCQNSVTICKKVSCPIMPCSN
ATVPDGECCPRCWPSDSADDGWSPWSEWTSCSATCGNGIQQRGRSCDSLNNRCEGSSVQT
RTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARE
TKACKKDACPINGGWGPWSLWDLCSVTCGGGVQRRSRLCNNPTPQFGGKDCVGDATEKQI
CNKQDCPIDGCLSNPCFAGVKCTSYADGSWKCGACPPGYSGNGVQCEDVDECKEVPDACF
NHNGEHRCENTDPGYNCLPCPPRFTGPQPFGRGVDYATANKQVCKPRNPCTDGTHDCNKN
AKCSYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNEDLVCVANATYHCKKDNCPN
LPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYN
HNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEH
NPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDN
DGIPDDRDNCRLVPNPDQKDSDGDGRGDACKDDFDHDNVPDIDDICPENIDISETDFRRF
QMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDD
DYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNAL
WHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYD
KTYAGGRLGLFVFSQEMVFFSDLKYECRDS
NT seq 3513 nt   +upstreamnt  +downstreamnt
atggggctggcctggggactcagtgtcctattcctgttgcatgcgtgtggttccagccgc
attccagagtctgggggagacaacagcgtgttcgacatctttgaactcaccggggctgcc
cgtaagggctctgggcgccgactggtgaagggtcctgacccttccagcccagctttccgc
atcgaggatgccaacctgattccccctgtgcctgatgacaagttccaagacctagtggat
gccgtccgggcagagaaaggcttccttctcttggcctccctgaggcagatgaagaagacc
cgaggcactctgctggccatagaacggaaagaccactcgggccaggtcttcagtgtggtc
tccaatggcaaagcgggcaccctggacctgagcctcaccgtgcaggggatgcagcatgtg
gtgtcggtggaggaggcgctcctggccaccggccagtggaagagcatcaccctgttcgtg
caggaggaccgggcgcagctgtacatcgactgtgagaagatggagaacgcagagctggat
gtccccatccagagcatcttcaccagggacctggccagcgtggccagactccgcgttgcc
aaaggaggggtcaatgaccatttccagggggtgctgcagaacgtgaggtttgtcttcgga
accacaccagaagccatcctcaggaacaaaggctgctccagctcgaccaatgtccttctc
accctggacaacaatgtggtaaacggctccagccctgccatccgcactaactacattggc
cacaagacgaaggatctgcaagccatctgcggcatctcttgtgacgagctgtccagcatg
gtcctggagttgagaggcctgcgcaccattgtcaccacactccaggacagcattcgcaaa
gtgactgaagagaacaaagagttggccattgagctgaggaggcctccgctctgctaccac
aacggagttcagtacaggaataatgaggaatggactgttgatagctgcacggagtgccgc
tgtcagaactccgttaccatctgtaagaaagtgtcctgccccatcatgccctgctccaac
gccacagtccccgatggagaatgctgcccacggtgttggcccagcgactctgcggatgat
ggctggtccccgtggtccgagtggacctcctgctctgcaacctgtggcaatggaatccag
cagcgcggtcgctcctgtgacagtctcaacaaccgatgcgagggctcctccgtccagacg
cggacctgccacattcaggaatgcgataagagatttaaacaggatggtggctggagccac
tggtccccgtggtcatcttgttccgtgacctgcggcgatggtgtgatcacaagaattcga
ctctgcaattctccgagcccccagatgaacgggaaaccatgtgaaggcgaagcccgcgag
accaaagcctgcaagaaagacgcctgccccatcaacggaggctggggtccctggtcgctg
tgggacctctgctctgtcacctgcggaggaggggtgcagagacgtagccggctctgcaac
aaccccacaccccagtttggaggcaaggactgtgtcggtgacgcgacagaaaaacagatc
tgcaacaagcaggactgtccaattgacggatgcctgtccaatccctgctttgctggggtc
aagtgtacgagttacgcagatggcagctggaaatgcggtgcctgtcccccgggctacagc
ggaaacggcgtccagtgcgaagacgtggatgagtgcaaagaagtacccgacgcctgcttc
aaccacaacggcgagcacaggtgtgagaacaccgaccccggctacaactgcctgccctgc
ccgcctcgcttcacgggcccgcagcccttcggccggggtgtggactacgccactgccaac
aaacaggtgtgcaagccgcgcaacccctgcacggacgggacgcacgactgcaacaagaac
gccaagtgcagctacctgggccactacagcgaccccatgtaccgctgcgagtgcaagccc
ggctacgcgggcaacggcatcatctgcggggaggacacagacctggatggatggcccaac
gaggacctggtgtgcgtggccaatgccacctaccactgcaaaaaggataactgccccaac
ctgccaaactccgggcaggaggactacgacaaggatggaatcggcgacgcctgtgatgac
gacgatgacaacgataaaatcccagacgacagggacaactgcccattccattacaaccca
gcccagtatgactacgaccgagatgatgtgggagaccgctgtgacaactgcccctacaac
cacaaccccgatcaggctgacacggacaacaacggggaaggagacgcctgcgccgcagac
atcgatggggacggtatcctcaatgaacgagacaactgccaatatgtctacaacgtggac
cagagggacacagatatggatggggttggagatcagtgtgacaactgccccctggaacac
aatccagatcagctcgactctgactcagaccgcattggagacacgtgtgacaacaatcag
gatattgatgaagacggccaccagaacaacctggacaactgtccctatgtgcccaatgcc
aaccaggctgaccatgacaaagatggcaagggagatgcctgtgaccatgatgatgacaat
gacggcattcctgatgacagggacaactgccgactcgtgcccaaccctgaccagaaggat
tctgatggtgatggtcgaggtgatgcctgcaaagacgattttgaccatgacaatgtgcca
gacattgatgacatctgtcctgaaaacattgatatcagcgagactgatttccgccgattc
cagatgattcctctagatcccaaagggacatcccagaatgaccctaactgggtcgtacgc
catcagggtaaagaactcgtccagaccgtcaactgtgatcctggacttgctgtaggttac
gatgagtttaatgccgtggacttcagtggcaccttcttcatcaacacggaaagggatgac
gactatgctggatttgtgttcggctaccagtctagcagccgcttttatgttgtgatgtgg
aagcaagtcacccaatcctactgggacaccaaccccactagggctcagggatactcaggc
ctttctgtgaaagttgtgaactcaaccaccgggccaggcgagcacctgcggaatgccctg
tggcacacgggaaacacccctggccaggttcgtactctgtggcatgaccctcgtcacata
ggctggaaagatttcactgcctacagatggcgtctcagccaccggcccaagacgggtttc
attagagtggtgatgtacgaagggaagaaaatcatggctgactcaggacccatctatgac
aaaacctatgctggtggtaggctagggttgtttgtcttctcacaagaaatggtgttcttc
tctgacctgaaatacgaatgcagagattcctaa

KEGG   Ursus arctos horribilis: 113243732
Entry
113243732         CDS       T05909                                 

Gene name
TNXB
Definition
(RefSeq) LOW QUALITY PROTEIN: tenascin-X
  KO
K06252  tenascin
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05165  Human papillomavirus infection
uah05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113243732 (TNXB)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113243732 (TNXB)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113243732 (TNXB)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    113243732 (TNXB)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113243732 (TNXB)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113243732 (TNXB)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113243732 (TNXB)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_2 EGF_Tenascin Pur_ac_phosph_N DUF2369 hEGF CBX7_C
Other DBs
NCBI-GeneID: 113243732
NCBI-ProteinID: XP_026338311
UniProt: A0A3Q7TQD7
LinkDB
Position
Unknown
AA seq 3057 aa
MPAQWFLISRLALFVLLGTVRAGPFSPRSNVTLPAPRPPPQPGGRTVEPIGGSPSSQLYE
HTVEGGEKQVVFTHRINLPPSAGCGCPPGTEPPVPASEVQALKVRLEILEELVKGLKEQC
TGGCCPAAAQAGTGQTDVRSLCSLHGVFDLSRCACSCEPGWGGPTCSDPTGAGVSPSSPP
SASASCPDDCNDQGRCVRGRCVCFPGYSGPSCGWPSCPGDCNGRGRCVQGVCVCRAGFSG
DDCSQRSCPRGCSQRGRCEDGRCVCDPGYTGDDCGKRSCPRGCSQRGRCENGRCVCDPGY
SGEDCGLRSCPRGCSQRGRCENGRCVCDPGYAGDDCGSRSCAWDCGEGGRCVDGRCVCWP
GYAGEDCSTRTCPRDCRGRGRCEDGECICDSGYSGDDCGVRSCPGDCNQRGRCEDGRCVC
WPGYTGPDCGSRACPRDCRGHGRCENGVCVCNAGYSGEDCGVRSCPGDCRGRGRCENGRC
VCWPGYTGRDCGTRACPGDCRGRGRCVDGRCVCNPGYAGEDCGSRRCPGDCRGRGRCEDG
VCVCNAGYEGEDCGVRSCPGGCHGRGQCLDGRCVCDDGYSGEDCSVRLCPRDCNQHGVCQ
DGVCTCWEGYAGEDCGLRTCPSNCHQRGRCEDGRCVCDSGYTGPSCATRTCPADCRGRGR
CVQGVCVCHVGYSGEDCGQEEPPASACPGGCGPRELCRGGQCVCVEGFRGPDCAIQTCPR
DCLGRGECREGSCVCQDGYAGEDCGEEVPAIEGMRMHLLEETTVRTEWTRAPGTVDAYEI
QFIPTTEGVSPPFTARVPSSASAYDQRGLAPGQEYQVMVRALRGTSWGPPASKTITTMID
GPQDLRVVAVTPTTLELSWLRPQAEVDRFVVSYVSAGNQRVRLEVPSEADGTVLTDLMPG
VEYVVTVTAERGRAVSYPASVRANTGSSPSGLLGATDEPPPSGPSTTQGARAPELQQRPQ
ELGELRVLGRDKTGKLRVTWTAQPDTFAHFQLRLRMPEGPGAHEELLPGDVRQALVPSPP
PGVPYELSLRGVPPEGEPSAPLIYQGIMDREGVKPGKPLAPPHLGELTATDMTSSTLVLR
WTVPEGEFDSFVIQYKDRDGPQVAPVEGSQRSALITNLDLGRKYRFVLYGLLGKKRHGPL
VAEAKILPQSDPSPATPPRLGKLWVTDPTPDSLHLSWTVPEGQFDSFVVQYKDREGQPQV
MPVEGPERSVIVSSLDPNHKYRFTLFGIADKKRHGPLTADGTTASERRVESLHPESPERP
LLGELLVAQANPDSLRLSWTVTQGSFDSFVVQYKDAQGQSQAVPVRGDENEVTIPGLESH
RKYKMNLYGLHGRQRLGPVSVVATTAHQEVVEETPSPTEPSTEAPEPPEEPLLGELTVTG
SSPDSLSLSWTVPQGHFDAFTVQYKDGDGRPQVVRVGGDESEVTIGGLEPGRKYKLHLYG
LHRGQREGPVSAVAVTAPQPEETPATEPPLEPRLGELTVTDVTPNSVGLAWTVPEGQFDS
FMVQYKDRDGQPQVVPVAADQREVTIAGLEPARKHKMNFYGLHDGQRVGPLSVVAVTAPL
PPAPATESPQEPRLGELTVTDLTPDSVTLSWTVPEGEFDSFVVQYKDRDGQPRVVPVPTD
QRGVTIPGLEPSRKYKFLLYGLQDGKRRSPVSVEAKTAAQGDASLGAPPRLGELWVTDPT
LDSLRLSWTVPEGHFDSFVVQFKDRDGPRVVPVEGHERSVTVTPLDTGHKYRFLLYGLLG
KKRHGPLTAEGTTESWSGVHGAITKQPPKPRLGEELQVTSVTPDSVSLAWTVPEGQFDSF
VVQYKDRDGQPQVVPVEGSLREVSVSGLDPARRYKLLLYGLHQGKRVGPISAIALTAPRE
PVKAEAPSPPGPEPRLGEVTVEEATPDTLHLSWTETEGDFDSFEVHYTDQGGQLQVLRID
GDRNDVILSGLESEHRYLVNLYGFHGQQRVGPAHIEALTVPREEEDEPSEPPIKPRLGEL
AVTDTTPDSLHLSWTVPEGQFDHFLVQYKNGXXXXRPQVVRVTGDESEVTIGGLEPGHKY
RMNLYGLHRGQRKGPVSAVAVTASLPTEPPVAPRLGELAVATVTSDTVSLSWTVAQGLFD
SFLVQYKDVQGQAQAVPVDGHLREITISGLDPARKYKFLLFGLRDEKHHGPMSAEAKTLS
ATKGPRLGELTVTDLTPDSVSLSWTVPEGEFDSFVVQYKDRDGQPRVVPVAADQRRVTIP
GLEPRRKYKFLLYGLAGRKRLGPISADGTTAPLEKEQQRPPRLGELTVTDETSDSLRLSW
TVAQGVFDSFVVQYRDPAGQPQAVPVAKDQREVTIEGLEPGRKYKFLLYGIHGGQRLGPI
SVLGATAPEVDTPAPWRPATEAPEPPAGPQLRTLAVSDITPDSLRLSWSVAQGPFDSFVV
QYQDTDGQPQALLVGGNQNKVRVSGLEPSTSYRFFLYGLHEGKRLGPVSADGTTGPAPAG
LTPGEPGPRLSQLSVTDVTTSSLRLNWEAPPEAFDSFLLRFGVPSPSTLEPHPRPPQQRE
LTVPGSRRSAVLRDLHPGTLYSLTLYGLRGPHKADSIQGTARTLSPVLESPRDLQFSEIG
ETSAQVSWMPPPSRVDSFKVSYQLADGGEPQSVQVDGRTWTQRLKGLIPGSHYEVTVVSV
RGFEESEPLTGFLTTVPDGPTQLRALNLTEGSALLHWKPPQAPVDKYNVRVTASGAPPLQ
GSAPGSAVEYPLSGLELHTNYTATVRGLRGPNLTSPASITFTTGLEAPQDLEAKEVTPRT
ALLTWTEPQVPPTNYLLSFNTPGEQTQEILLPGRVTSHRLLGLFPSTPYSVWLRAMWGES
LTPPVSTSFTTGGLPIPFPRDCGEEIQNGASTSRATTVFLNGNRERPLNVFCDMETDGGG
WLVFQRRMDGQTDFWRDWEDYAHGFGNISGEFWLGNEALHSLTAAGDYSLRVDLRAGDEA
VFAQYDSFRVDSAAEYYRLHLAGYHGTAGDSMSYHSGSVFSARDRDPNNLLISCAVSYRG
AWWYKNCHYANLNGLYGSTVDHQGVSWYYWKGFEFSVPFTEMKLRPRSYRPPAAQGG
NT seq 9174 nt   +upstreamnt  +downstreamnt
atgcctgcccagtggttcctaatctccagactggccctctttgtactgctgggcacagtc
agagcaggccctttctctccacggtccaatgtgacactcccagccccacgcccccctccc
cagccggggggccgcacagtggagccaatagggggaagcccctcttctcagctttatgag
cacactgtggaaggaggggagaagcaggtggtttttacccaccgcatcaacctgccccct
tcagctggctgtggctgtcccccgggtacggagccccccgttcctgcttcagaggtgcag
gctttgaaggtccgtttagagatcctggaggagctggtgaaggggctcaaggaacagtgc
actgggggatgttgtcccgctgctgcccaggctggcacaggccagacagatgttcgtagc
ctctgcagtctccacggcgtgtttgacctgagccgctgtgcctgctcctgcgagccaggc
tggggtgggcccacctgctcagaccccacaggcgctggggtgtccccatcctccccaccc
tcggcctccgcgtcctgcccagatgactgcaacgatcagggtcgctgtgtccgcgggcgc
tgtgtgtgcttccctggttacagcggccccagctgtggctggccctcctgccccggggac
tgcaacggccgtgggcgctgcgtgcagggcgtgtgcgtttgccgggctggcttctccggc
gacgactgcagccagcgctcctgcccccggggctgcagccagagggggcgctgcgaggac
gggcgctgcgtgtgcgacccaggctacacgggcgacgactgtgggaagaggagctgccct
agaggttgcagccagagggggcgctgcgagaacggacgctgcgtgtgcgaccccggctac
agtggcgaagactgcgggttgaggagctgcccccggggctgcagccagagggggcgctgc
gagaacgggcgctgcgtgtgcgaccccggctacgctggcgacgactgcggctcgcggagc
tgcgcgtgggactgtggcgagggcgggcgctgcgtggacggccgctgtgtgtgctggccc
gggtacgcgggcgaggactgcagcactcggacgtgcccgcgagactgccggggccgcggg
cgctgcgaggacggcgagtgcatctgcgactcaggctacagcggggacgactgcggagtg
cgcagctgcccaggcgactgcaaccaaaggggccgctgcgaggacggccgctgcgtgtgc
tggccgggatacacggggcccgactgcggctcgcgcgcctgcccccgcgactgtcggggc
cacgggcgctgcgagaacggcgtctgcgtgtgcaacgcgggctacagcggcgaggactgc
ggcgtgcgcagctgtcccggggactgtcgcggccggggccgttgcgagaatggtcgctgc
gtgtgttggcccgggtacacaggccgggactgcggcacccgcgcctgccctggcgactgt
cgtgggcgcgggcgctgcgtggacggccgctgcgtgtgcaacccgggctacgcgggcgaa
gactgcgggagccgtcggtgccctggggactgccgcggacgcggccgctgcgaggatggc
gtgtgcgtgtgcaacgcgggctacgagggagaggactgtggcgtgcgtagctgcccagga
ggttgccacggccgcggccagtgcctggacgggcgctgcgtgtgtgacgatggctactcg
ggcgaggactgcagcgtaaggctgtgcccgcgcgattgcaaccagcatggcgtgtgccag
gacggtgtgtgcacctgttgggagggctacgccggggaggactgcggcctccgtacctgc
ccctccaactgtcaccagcgcggccgctgtgaggacgggcgctgcgtgtgcgactcaggc
tacactggcccctcctgcgccacccgcacctgcccggccgactgccggggccgcgggcgc
tgtgtgcagggagtgtgcgtgtgccacgtgggctatagcggcgaggactgtgggcaggaa
gagcctcccgccagcgcctgccctgggggctgtgggccccgggaactgtgccgcgggggc
cagtgtgtgtgcgtcgagggctttcgaggacccgactgcgccattcagacgtgccctagg
gactgcctcggcaggggagagtgtcgagagggcagctgcgtctgtcaagatggttatgca
ggggaagactgcggggaagaagtgccagccattgagggcatgaggatgcatctcttggag
gagacaacggttcggacagagtggacccgggctcctggcactgtggatgcctatgaaatt
cagttcattcccacgacagagggggtgagccccccattcacagcacgggtacccagctct
gcctcagcctatgaccagagaggactggcgcccggtcaggagtaccaggtcatggtccgt
gcccttcgagggactagctggggccctcctgcctccaagaccatcaccaccatgatcgat
ggcccccaagacctccgagtggtggctgtgacaccaaccacactggagctcagctggctg
cgacctcaggccgaggtggaccgattcgtggtgtcctatgtcagtgctggcaaccagagg
gtgcggctggaagtaccctctgaggcggatgggacggtgctgactgacctgatgccaggc
gtggaatatgtagtgactgtcacggcagagcggggccgggcagtaagctacccggcttct
gtcagggccaacacagggtcctctccctcaggcctcttgggggccactgatgagcctcct
ccctcaggcccttccacgactcaaggggcccgggcccccgaactgcagcagcgtccccag
gagctgggagagttgagggtactgggcagagacaagacagggaagctccgcgtcacttgg
actgctcagcctgacacgtttgcccacttccagctgcgccttcggatgcccgaggggccg
ggggcacatgaggaactactgccaggggatgtccgccaggctctggtgccctcaccccct
cctggagtcccctatgagctgtcacttcgtggggtccccccggagggcgagccctctgcc
cctctcatctaccaaggcattatggacagggaaggggtgaagcctgggaagcccttggcc
ccaccgcacctgggcgaactgacggcgactgacatgacctccagcaccctggtcctgcgc
tggacggtccccgagggcgagttcgactccttcgtgatccagtacaaggacagggatggg
ccccaggtggcacctgtggaaggatcccagcgctcggcactcatcaccaacctggacctt
ggccgcaagtacagatttgtgctctatgggctcctgggcaagaagaggcatggccccctg
gtggctgaagccaagatcttgcctcagagtgatcccagcccagcaactccaccccgccta
ggaaagctgtgggtgacagatcccaccccagactcactgcacctctcctggacggtccct
gagggccagtttgactccttcgtagtccagtacaaggacagggagggacagccccaagtg
atgcccgtggaagggcccgagcgctcggtcatcgtctcctcgctggaccctaaccacaag
tacagattcactctgtttgggattgccgacaagaagcggcacggcccccttacagccgac
ggcaccaccgcctcagagcggagagtggagtccctccacccagagtccccagagcggccc
ctgctgggggagctgttggtggctcaggcaaacccagactcccttcgcctgtcctggacc
gtgacccagggctcctttgactccttcgtggtccagtacaaggatgcacagggccagtcc
caggcagtgccggtcaggggggatgagaacgaggtcaccatccctggcctggagtcccac
cggaagtataagatgaatctctacgggcttcatggcaggcagcgtctggggcctgtgtcc
gtggtggccaccacagcccaccaggaggttgtggaggagacgcccagccccacagaaccc
agcacggaggccccagagccccccgaggagcccctcctgggggagctgacggtcacggga
tcctccccagactcgctgagcctctcctggacggtcccccagggccacttcgacgccttc
actgtgcagtacaaggatggggacgggcggccgcaggtggtgcgtgtcgggggtgacgag
agcgaggtcaccatcgggggcctggagccggggcgcaagtacaagctgcacctgtatggc
ctgcacagggggcagcgcgagggccccgtgtctgccgtggccgtgaccgctccacaacca
gaagagactccagccacagagccccccctggagccacgcctgggagagctgacagtgaca
gatgtgacccccaactctgtgggcctcgcatggacggtccctgagggccagtttgactcc
ttcatggtccagtacaaggacagggatgggcagccccaggtggtgcctgtggccgcggac
cagcgtgaggtcaccatcgccggcctggagcctgcacgcaaacacaagatgaacttctat
gggctacatgatgggcagcgtgtgggccccctctctgtggtagccgtgaccgctcccctc
cccccggccccagctacggagtccccccaggagccacggctgggagagctgacagtgacg
gacctgacccctgactccgtgacgctctcctggacggtccccgagggtgaattcgactcc
ttcgtggtccagtacaaggacagggacgggcagccccgggtggtgcccgtgcccacagac
cagcgtggggtcaccatccctggcctggagcccagcaggaagtacaagttcttgctctac
gggcttcaggatgggaagagacgcagcccagtctctgtggaggcaaagacagctgcccaa
ggtgacgccagccttggggccccaccccgccttggggagctatgggtgacagatcccacc
ctggactcactgcgtctctcctggacagtgcctgagggccactttgactcctttgtggtc
cagttcaaggacagggatgggccccgggtggtgcctgtggagggccacgagcgctcggtc
accgtcacccctctggacactggccacaagtacagattcctcctttatgggctcctgggc
aagaagcgccatggccccctcaccgctgaaggcaccacagagtcctggagtggtgtgcat
ggtgctataacaaagcagcccccaaagccccgtctcggggaggagctacaggtgaccagc
gtgacaccagactctgtgagcctcgcatggacagtccccgagggccagtttgactccttc
gtggtccagtacaaggacagggacgggcagccgcaggtggtccccgtggagggcagcctc
agggaggtcagcgtctcgggcctggacccggcccgcaggtataagctgctgctctatggg
ctgcaccagggcaagcgtgtgggtcccatctctgccatcgccttgactgcccccagagaa
cctgtcaaagctgaggccccaagtcctccagggcctgaaccccgcctaggggaggtgact
gtggaggaagccacaccagacaccttgcatctctcctggactgagactgagggcgatttt
gactccttcgaggtccactacacagaccaaggtgggcaactccaagtactcaggatagat
ggtgaccggaatgacgtcattctctctggcctggaatctgagcacagatacctagtgaac
ctgtatggtttccatggccagcagcgagtgggtcctgcccacatcgaggctctgacagtc
ccaagggaagaggaggacgaaccctcagagcctcccatcaagccccggctaggggagctg
gctgtgactgacaccactcctgactccctgcacctctcctggactgtacccgagggccag
tttgaccacttcctggtccagtacaagaacggggannnnnnnnggcggccgcaggtggtg
cgtgtcacgggtgacgagagcgaggtcaccatcgggggcctggagccggggcacaagtac
aggatgaacctgtacggcctgcacagggggcagcgcaagggccccgtgtccgctgtggcc
gtgaccgcctccctgcccacagagccccctgtggcaccccgcctgggggagctagctgtg
gcaaccgtgacctccgacacagtgagcctctcgtggacggtggcccagggcctcttcgac
tccttcctggtccagtacaaggatgtgcaggggcaggcccaggcagtgcctgtggatggg
cacctccgtgagatcaccatttcgggcctggacccggcccgcaagtacaagttcctactc
tttggcctccgggatgagaaacaccatggcccaatgtctgcagaggctaagactctctca
gccacgaaaggtccccgcctcggggagctaacagtgacggacctgaccccggactctgtg
agcctctcctggacagtccccgagggtgaattcgactcctttgtggtccagtacaaggac
agggacgggcagccccgggtggtgcccgtggccgcagaccagcgcagggtcacgatcccc
ggcctggagccccgcaggaagtacaagttcttgctctatgggctggcaggcaggaagcgc
ctgggccccatctctgccgatggcaccacagcccctctggagaaggagcagcagcgccca
ccccgccttggggagctgacagtgacagatgagacctcagactccctgcgcctctcatgg
acggtggcccagggcgtctttgattcctttgtggtccagtacagggacccagctgggcag
ccccaggcagtgcctgtggccaaagaccagcgggaagtcaccatagagggcctggagcct
ggcaggaaatacaagtttctgctgtatgggatccacggaggacagcgcctgggccccatc
tccgttctgggagcgacagccccggaagtggacactccagccccctggcgcccagccaca
gaggcccccgagccccccgcagggccccagctaaggacactggcagtgagcgacataacc
ccagactccctgcgcctctcgtggagtgtggcccagggcccctttgactccttcgtggtc
cagtatcaggacacagatgggcagccccaggccttgctcgtgggcggcaaccagaacaag
gtgcgagtgtcaggcctggagcccagcacctcctacaggttcttcctctatggcctccat
gaagggaagcgcctggggcctgtctcagccgatggcaccacagggcctgctcctgccggc
ctgaccccaggggagccagggccccgcctgtcccagctgtcggtgactgatgtgaccacc
agttcgctgcggctcaactgggaggcccccccggaggcctttgactccttcctgctccgc
tttggggtcccatccccaagcactctggagccgcacccgcgtcccccgcagcagcgggag
ctgacggtgccggggtcacggcgctctgccgtgctccgggacctgcacccggggaccctg
tacagcctgacgttgtatgggctgcgtgggccccataaggccgacagcatccagggcaca
gcccgcaccctcagcccagttctggaaagtccccgggaccttcagttcagcgaaatcggg
gaaacctcggcccaggtcagctggatgcctccaccatccagagtggacagtttcaaagtt
tcctaccagctggcagatggaggggagccgcagagtgtgcaggtggacggccgcacgtgg
acccagagactcaaggggctgatcccaggctctcattatgaagtgaccgtggtctctgtc
cggggctttgaggagagtgagcctctcacaggcttcctcaccacagttcctgacggcccc
acccagcttcgagcgctgaacttgacggaggggtccgcgctgctgcactggaagcccccc
caggcccctgtggacaagtataacgtccgtgtcacagcctcgggggcccccccgctgcag
ggctcggcccccggcagtgccgtggagtaccctctgagtggcctggagctccacaccaac
tacacagcgaccgtgcgtggtctccggggccccaacctcacctccccagccagcatcacc
ttcaccacagggttggaggccccccaggacttggaggccaaggaagtgaccccacgcacc
gccctgctcacttggactgaaccccaagtcccaccgactaactacctgctcagtttcaac
acccctggtgaacagacccaggagatcctgctcccaggacgggtcacctctcaccggctc
ctgggcctctttccctccaccccctacagcgtgtggctccgggcgatgtggggcgagagc
ctcacaccgcctgtgtccacctccttcaccaccggtggacttccgatccccttccctcgg
gattgtggggaggagatacagaacggagccagcacctcccgggccaccaccgtcttcctc
aatggcaaccgcgagcggcccctgaatgtgttttgtgacatggagaccgatgggggcggc
tggctggtgttccagcgccgcatggacggacaaacagacttctggagggactgggaggac
tacgcccacggctttggcaacatctccggggagttctggctgggcaacgaggccctgcac
agcctgacggcggcgggcgactactccctgcgcgtggacctgcgggctggggacgaggcc
gtgttcgcgcagtacgactccttccgagtcgactcggcggcggagtactaccgccttcat
ctggccggctaccacggcacagcaggcgactccatgagctaccacagcggcagcgtcttt
tccgcccgggaccgagaccccaacaacctgctcatctcctgcgcggtctcgtaccgcggg
gcctggtggtacaagaactgccactacgccaacctcaacgggctctacgggagcacagtg
gaccaccagggggtgagctggtactactggaagggcttcgagttctctgtgcccttcacg
gaaatgaagctgagaccaagaagctaccggcccccagcggcccagggaggctga

KEGG   Ursus arctos horribilis: 113245270
Entry
113245270         CDS       T05909                                 

Gene name
THBS3
Definition
(RefSeq) thrombospondin-3 isoform X1
  KO
K04659  thrombospondin 2/3/4/5
Organism
uah  Ursus arctos horribilis
Pathway
uah04145  Phagosome
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05144  Malaria
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113245270 (THBS3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113245270 (THBS3)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    113245270 (THBS3)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113245270 (THBS3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113245270 (THBS3)
  09174 Infectious disease: parasitic
   05144 Malaria
    113245270 (THBS3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:uah04131]
    113245270 (THBS3)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113245270 (THBS3)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113245270 (THBS3)
Membrane trafficking [BR:uah04131]
 Endocytosis
  Phagocytosis
   Opsonins
    113245270 (THBS3)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113245270 (THBS3)
  Exosomal proteins of colorectal cancer cells
   113245270 (THBS3)
  Exosomal proteins of bladder cancer cells
   113245270 (THBS3)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113245270 (THBS3)
SSDB
Motif
Pfam: TSP_3 TSP_C COMP EGF_CA cEGF EGF_3 EGF_MSP1_1
Other DBs
NCBI-GeneID: 113245270
NCBI-ProteinID: XP_026340999
UniProt: A0A3Q7VDJ4
LinkDB
Position
Unknown
AA seq 961 aa
METQELRGALALLLLCTFASASQDLQVIDLLTVGESRQMTAVAEKIRAALLTAGDIYLLS
TFRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQREDGRVHAVNLQQAGLADG
RTHTALLRLRGPARPSPALQLYVDCKLGDQHAGLPALAPIPPAEVDGLEIRTGQKAYLRM
QGFVESMKMILGGSMARVGALSECPFQGDESIHSAVTNALHSILGEQTKALVTQLTLFNQ
ILAELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRSHCSPNPCFRGVDCMEVYEYPGYRC
GPCPPGLQGNGTHCTDINECAHADPCFPGASCINTVPGFHCEACPRGYKGTRVSGVGIDY
ARASKQVCNDVDECNDGNNGGCDPNSICTNTVGSFKCGPCRLGFLGNQSQGCLPARTCHS
PAHSPCHVHAHCLFERNGAVSCSCNVGWAGNGNVCGTDTDIDGYPDQALPCMDNNKHCKQ
DNCLLTPNSGQEDADNDGVGDQCDDDADGDGIKNVEDNCRLFPNKDQQNSDTDSFGDACD
NCPNVPNNDQKDTDGNGEGDACDNDVDGDGIPNGLDNCPKVPNPLQTDRDEDGVGDACDS
CPEMSNPTQTDADSDLVGDVCDTNEDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECD
GDDDNDGVPDYVPPGPDNCRLVPNPNQKDSDGNGVGDVCEDDFDNDAVLDPLDVCPESAE
VTLTDFRAYQTVVLDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGT
FHVNTVTDDDYAGFLFSYQDSGRFYVVMWKQTEQTYWQATPFRAVAQPGLQLKAVTSVSG
PGEHLRNALWHTGHTPDQTRCRRTLSHSGGSCSREECEEAAARFRILILDPLALGSILET
LGSNLQPLSQTQTLLWHPQEFSPRGVVTPLFRSWEVFKGSFSQALTPGKITAHCHKVPVV
F
NT seq 2886 nt   +upstreamnt  +downstreamnt
atggagacgcaggaacttcggggggccctggctcttctcctcctttgcactttcgcatct
gccagtcaggacctgcaggtgattgacctgctgaccgtgggcgagtctcggcagatgaca
gctgtagcagagaagatccgggcagccctgctcactgcgggagacatctacctcttgtcc
accttccgcctgcccccgaagcagggtggtgtcctctttggcctctactctcgccaggac
aacacgcgatggctggaggcctcagttgtgggcaaaattaacaaagtgctggtgcggtac
cagcgagaagacggccgagtccatgcagtgaacctacaacaagcaggcctggccgacggg
cgcacacatacggctctcctgcggcttcgtgggcctgcccgacccagccctgccctgcag
ctctacgtggactgcaaactgggagaccagcatgctggcctcccggcactggcccccatt
cctccagcggaggtcgacgggctggagattcggactggacagaaggcgtatttgaggatg
cagggcttcgtggaatctatgaaaatgattctgggcgggtccatggcccgggtgggagcc
ctgagtgagtgtccattccagggggacgagtccatccacagcgcagtgaccaacgcgctc
cactccattctcggggagcagaccaaggcgctggtcacgcagctcaccctcttcaaccag
atcctggccgaactgcgcgatgatatccgagaccaggtgaaggaaatgtctctaatccga
aacaccatcatggagtgtcaggtgtgcggcttccacgagcagcgttcccactgcagcccc
aacccctgcttccgaggcgtggactgcatggaagtgtatgagtaccccggctaccgctgc
gggccctgcccccccggcctgcaaggcaacggcacccactgcacagatatcaatgagtgt
gctcacgctgacccttgcttccccggggccagctgcatcaacaccgtgcccggcttccac
tgtgaggcctgtcctcgaggatacaaaggcactcgggtgtccggtgtgggcattgactat
gcccgcgccagcaaacaggtctgcaacgatgtggacgagtgcaatgatgggaacaatggc
ggctgtgacccgaactccatctgcaccaatactgtgggctctttcaagtgtggcccctgt
cgcctgggcttcctaggaaaccagagccagggctgcctcccagcccgcacctgccacagc
ccagctcacagcccctgccatgtccacgcgcactgtctctttgaacgcaacggggcagtg
tcctgctcgtgtaacgtgggctgggccgggaatgggaatgtgtgtgggactgacacagac
atcgatggctacccggaccaggccctgccgtgcatggacaacaacaagcattgcaaacag
gacaattgccttttgacacccaattctgggcaggaagatgctgataacgacggcgtgggg
gaccagtgtgatgatgatgccgatggcgatgggatcaagaatgtcgaggacaactgccgg
ctgttccccaacaaggaccagcaaaactctgacacagattcatttggggatgcctgtgac
aactgccctaacgttcccaacaatgaccagaaagacacagacggcaacggggaaggggac
gcgtgtgacaacgatgtggatggggacggcatccccaatggattggacaattgccctaaa
gtccccaatcccctgcagacagacagggatgaggatggggtgggagatgcctgtgacagc
tgccctgaaatgagcaatcctacccagacagatgcagacagcgacctggtgggggatgtc
tgtgacaccaatgaggacagcgatggggatggacatcaggacaccaaggacaactgccca
cagctgccgaacagctcccagctggactcagacaacgacgggcttggagatgagtgtgat
ggggacgatgacaacgacggggtcccagactacgtgcctcccgggcctgataactgtcgc
ctggtacccaatcccaatcagaaggactcagacggcaatggtgttggtgacgtgtgtgag
gatgactttgacaatgacgcggtgctcgaccccctggacgtgtgccccgagagcgcggag
gtgaccctcacggatttccgggcctatcagactgtcgtcttggaccctgagggcgatgct
cagattgatccaaactgggtcgtgctcaaccagggcatggaaatcgttcagaccatgaac
agtgaccccggcctggcagttgggtatacagccttcaacggcgtggacttcgaaggcacc
ttccacgtgaacacggtgaccgatgacgactacgcaggctttctgttcagctatcaggat
agcggccgtttctacgtggtcatgtggaagcagaccgagcagacctactggcaggccaca
cccttccgagctgttgctcagcccgggctacagctcaaggcagtgacgtcagtgtccggc
ccaggagagcacctccggaacgccctgtggcatactggccacacccctgatcagacacgg
tgccggaggactttgagccattccggaggcagttgctccagggaagagtgtgaagaggca
gccgccagattcagaatcctgattttagaccctttagccttggggtccatcctggagacc
ctggggtctaatctacagccgctcagccaaacacagaccctcctctggcacccacaggag
ttcagccccagaggggtagtgaccccactgttcaggagttgggaagttttcaagggatct
ttttctcaggcactaaccccaggaaagataacagcacattgccataaagttccagtggtt
ttctaa

KEGG   Ursus arctos horribilis: 113247125
Entry
113247125         CDS       T05909                                 

Gene name
THBS2
Definition
(RefSeq) thrombospondin-2
  KO
K04659  thrombospondin 2/3/4/5
Organism
uah  Ursus arctos horribilis
Pathway
uah04145  Phagosome
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05144  Malaria
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113247125 (THBS2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113247125 (THBS2)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    113247125 (THBS2)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113247125 (THBS2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113247125 (THBS2)
  09174 Infectious disease: parasitic
   05144 Malaria
    113247125 (THBS2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:uah04131]
    113247125 (THBS2)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113247125 (THBS2)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113247125 (THBS2)
Membrane trafficking [BR:uah04131]
 Endocytosis
  Phagocytosis
   Opsonins
    113247125 (THBS2)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113247125 (THBS2)
  Exosomal proteins of colorectal cancer cells
   113247125 (THBS2)
  Exosomal proteins of bladder cancer cells
   113247125 (THBS2)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113247125 (THBS2)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC EGF_3 EGF_CA TSP1_ADAMTS EGF Laminin_G_3 cEGF TSP1_CCN Laminin_G_2
Other DBs
NCBI-GeneID: 113247125
NCBI-ProteinID: XP_026343786
UniProt: A0A3Q7U6C5
LinkDB
Position
Unknown
AA seq 1172 aa
MFWTLMLLALWASSGARAGGQDEDTAFDLFSISNINRKTIGAKQFRGPDPSVPAYRFVRF
DYIPPVNADYLSKIAKTVRQKEGFFLTASLKQDPKSRGTLLALEGPGASQRQFEIVSNGP
ADTLDLTYWIDGTQHVISLEDVGLADSQWKNVTVQVTGDTYSLYVGCDLIDSFTLDEPFY
EQLRTEKSRMYVAKGSARESHFRGLLQNVYLVFENSVEDVLSKKGCQQSQGAEANAISET
TETLHLSPQVAPEYAGPGAPRRPEVCERSCEELGTMITELSGLHVMVNQLHENLRTVSND
NQFLWELIGGPPKTRNMSACWQDGRFFAENETWVVDSCTKCTCKKFKTVCHQITCPPATC
ANPSFVEGECCPSCFHSLDGEEGWSPWAEWTQCSVTCGSGTQQRGRSCDVTSNTCLGPSI
QTRVCSLGKCDNRIRQDGGWSHWSPWSSCSVTCGVGNITRIRLCNSPVPQMGGKNCRGSG
RETKGCQGVPCPIDGRWSPWSPWSACTVTCAGGIRERTRVCNSPEPQHGGKDCVGDVQEQ
QMCNKRSCPIDGCLSNPCFPGAQCSSFPDGSWSCGSCPVGFLGNGTHCEDLDECAVVTDV
CFTTSKAHRCVNTNPGFHCLPCPPRYKGTQPFGVGLEVARTEKQVCEPENPCKDKTHACH
KHAECIYLGHFSDPMYKCECQTGYAGDGLICGEDSDLDGWPNKNLVCATNATYHCIQDNC
PLLPNSGQEDFDKDGTGDACDDDDDNDGVDDEKDNCQLLFNPRQFDYDKDEVGDRCDNCP
YVHNPAQIDTDNNGEGDACSVDIDGDDVFNERDNCPYVYNTDQRDTDGDGVGDHCDNCPL
VHNPDQTDVDNDLVGDQCDNNEDIDEDGHQNNQDNCPYISNANQADHDHDGQGDACDSDD
DNDGVPDDRDNCRLVSNPGQEDSDGDGRGDACKDDFDNDSIPDIDDVCPENSAISETDFR
NFQMVHLDPKGTTQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDR
DDDYAGFVFGYQSSSRFYVVMWKQVTQTYWEDQPTRAYGYSGVSLKVVNSTTGTGEHLRN
ALWHTGNTEGQVRTLWHDPKNVGWKDYTAYRWHLTHRPKTGYIRVLVHEGKQVMADSGPI
YDQTYAGGRLGLFVFSQEMVYFSDLKYECRDV
NT seq 3519 nt   +upstreamnt  +downstreamnt
atgttctggacgctaatgctgctggcgctctgggcctcgtccggcgctcgagccggtggc
caggacgaggacacggccttcgaccttttcagcatcagcaacataaaccggaagaccatc
ggggccaagcagttccgggggcctgaccccagcgtgcctgcctatcgcttcgtccgcttt
gactacatcccgccggtgaacgcagattacctcagcaagatcgccaagaccgtgcggcaa
aaggagggcttctttctcacggccagcctgaagcaggaccccaagtcccggggcacgctc
ctggctctggagggccccggagcctcccagaggcaattcgaaattgtctccaacggccca
gcagacacgctggaccttacctactggatagacggcacccagcacgtcatctccctggag
gacgtgggcctggctgactcccagtggaagaacgtgaccgtgcaggtcaccggggacacc
tacagcttgtacgtgggctgtgacctcatcgacagcttcacactggacgagcctttctat
gagcagctgcggacagaaaagagcaggatgtatgtggccaagggctctgcgcgagaaagt
cacttcaggggtttgctgcagaacgtctacttagtgttcgagaactcagtggaagatgtt
ctgagcaagaaaggttgtcagcaaagtcagggagctgaagccaatgccatcagcgagacc
acagagacgctgcacctgagcccgcaggtggcccccgagtacgcgggcccaggtgcgccc
aggaggccggaggtgtgtgagcgctcctgtgaggagctgggcaccatgatcactgagctg
tcggggctgcacgtcatggtcaaccagctgcacgagaacctgcggacagtgtccaacgat
aaccagtttctctgggagctcatcggcggcccgcctaagacgaggaacatgtccgcttgc
tggcaagacggccgcttcttcgcggaaaatgagacgtgggtggtggacagctgcaccaag
tgtacctgcaagaaatttaaaaccgtttgccaccaaatcacctgtcccccggcgacctgc
gccaacccgtcctttgtggaaggagagtgctgcccgtcctgcttccactcactggacgga
gaggaaggctggtccccatgggcggagtggacccagtgctctgtcacctgtggctcaggc
acccagcagagagggcggtcctgtgatgtcaccagcaacacctgcttggggccgtccatc
cagacgcgggtgtgcagcctgggcaagtgtgacaaccgcatccggcaggatgggggctgg
agccactggtcgccttggtcgtcctgctctgtgacctgcggcgtcggcaacatcacgcgc
atccgtctttgcaactcgccggtcccccagatgggcggcaagaactgcagagggagcggc
cgcgagacgaagggctgccagggcgtcccatgtccgatcgacggccggtggagcccgtgg
tccccgtggtcagcctgcaccgtcacctgtgccggcgggatccgggagcggacgcgtgtc
tgcaacagccccgagccccagcacgggggcaaggactgcgtgggggacgtccaggagcag
cagatgtgcaataagagaagctgtcccatcgacggctgcttatccaatccctgcttccct
ggagcccagtgcagcagctttcccgacggctcctggtcctgcggctcctgccccgtgggc
ttcctgggcaacggcacgcactgtgaggacctggacgagtgcgctgtggtcaccgacgtc
tgcttcaccacgagcaaagcccaccgctgcgtcaacaccaaccccggcttccactgcctg
ccgtgcccccctcgctacaaagggacccagccgttcggcgtcggcctagaggtcgccagg
accgagaagcaggtgtgtgagcctgaaaacccgtgcaaggacaagactcatgcctgccac
aagcatgcagaatgcatctacctggggcacttcagcgaccccatgtacaagtgtgagtgc
cagacgggctacgcgggcgacgggctcatctgcggggaggactcggacctggacggctgg
cccaacaagaacctggtctgtgccaccaacgccacgtaccactgcatccaggacaactgt
cccctactgcctaattctgggcaggaagacttcgacaaggacggcaccggcgacgcctgt
gacgatgacgacgacaatgacggcgtggacgatgagaaggacaactgccagcttctcttt
aacccccgccaatttgactacgacaaggatgaggttggggaccgctgtgacaactgcccg
tatgtgcacaaccccgcacagatcgacacggacaacaacggcgagggggacgcgtgctcc
gtggacatcgacggagacgatgtcttcaacgaacgggacaactgtccctacgtctacaac
actgaccagagagacaccgacggggacggcgtgggcgatcactgcgacaactgccccctg
gtgcacaaccccgaccagactgacgtggacaatgacctcgtgggagaccagtgtgacaac
aacgaggacattgatgaggacggtcaccagaacaaccaggacaactgtccctacatctcc
aatgccaaccaggctgaccatgaccatgatggccagggcgatgcctgtgactcagatgat
gacaacgatggggtccccgatgacagggacaactgccgactggtgtccaacccgggccag
gaggactcggacggtgacggacgtggggacgcttgcaaggacgactttgacaatgacagc
atcccagacattgatgacgtgtgtcctgaaaatagcgccatcagtgaaaccgacttcagg
aacttccagatggtccacctggaccccaagggcactactcaaatcgaccccaactgggtc
attcgccatcaaggcaaggagctggtgcagacggccaactcggaccccggcatcgctgtt
ggtttcgatgagttcgggtctgtcgacttcagcggcacattttacgtcaacaccgaccgc
gacgatgactacgccggcttcgtcttcggctaccagtccagcagccgcttctacgtggtc
atgtggaagcaggtgacgcagacctactgggaggaccagcccacccgggcctacggctac
tccggcgtgtccctcaaggtggtgaactccaccacggggaccggcgagcacctgaggaac
gccctgtggcacactgggaacaccgaagggcaggtgcgtacgctgtggcatgaccccaaa
aacgtcggctggaaagactacacggcttaccggtggcatctaactcacaggcccaagacc
ggctacataagagtcttagtgcatgaaggaaagcaggtcatggcggactcaggacccatc
tatgaccaaacctacgctggcgggcggctgggtctgtttgtcttctctcaagaaatggtc
tacttctcggacctcaaatacgaatgcagagacgtctaa

KEGG   Ursus arctos horribilis: 113250298
Entry
113250298         CDS       T05909                                 

Gene name
COL6A3
Definition
(RefSeq) collagen alpha-3(VI) chain isoform X1
  KO
K06238  collagen, type VI, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113250298 (COL6A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113250298 (COL6A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113250298 (COL6A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113250298 (COL6A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113250298 (COL6A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113250298 (COL6A3)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113250298 (COL6A3)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113250298 (COL6A3)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113250298 (COL6A3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   113250298 (COL6A3)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Kunitz_BPTI fn3
Other DBs
NCBI-GeneID: 113250298
NCBI-ProteinID: XP_026347394
UniProt: A0A3Q7VXY5
LinkDB
Position
Unknown
AA seq 3187 aa
MRKHRHLPLVAMFCLFLSGFSLTRAQQQQADVKNGAAADIIFLVDSSWSIGKEHFQLVRE
FLYDVIESLAVGDSDFRFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSNETGK
GLEYVMQNHLTEAAGSRAGDGVPQVIVVLTDGHSDDGLALPSAGLKSADVNVFAIGVEDA
DEGALKEIASEPLTMHVFNLENFTSLHDIVGNLVSCVQSSVAPEGAGGTETLKDITAQDS
ADIIFLIDGSNNTGSVHFAVIRDFLVNLLERLSVGAQQIRVGVVQYSDEPRTVFSLDTYS
TKAQVLDAVKALAFTGGELANVGLALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSD
EIRDGVVALKQASVFSFGLGAQAASRAELQHIATNDNLVFTVPEFRSFGDLQEQLLPYIV
GVAQRHIVLQPPTIVTEVIEVNKRDIVFLVDGSSSLGLANFNAIRDFIAKVIQRLEIGQD
LIQVAVAQYADTVRPEFYFNSYPNKREVITAVRRMKPMEGSVLYTGSALDFVRNNLFTSS
AGHRAAEGVPKLLVLITGGKSLDEISQPAQELKRSSIMSFAVGSKAADQAELEEIAFDSS
LVFLPAEFRAAPLQGVLPGLLAPLRTLSGTSEVHVNKRDIIFLLDGSSNVGKTNFPYVRD
FVTNVVNSLDVGSDNIRVGLVQFSDTPVTEFSLDTYQTKAELLAHLRRLQPQGGSGLNTG
SALSYVHANHFTEAGGSRSREHVPQLLLLLTAGPAEDAYLPAANALARAGVLTLCVGASR
ANKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILF
LFDGSANLVGQFPAVRDFLYKVIDELDVKPDGTRIAVAQYSDDVRVESRFDEHQNKPEIL
SLVKRMKIKTGKALNLGYALDYAQRYIFVKSAGSRIEDGVLQFLVLLVAGRSSDRLDTPA
LNLKQSGVVTFILQAKNADPAELGLMVPSPVFILAAESLPKIGDLQPQIVNLLKSVHNGA
PTPVSGEKDVVFLIDGSEGVRSGFPLLKEFVQRVVESLDVGPDRVRVAVVQYSDRTRPEF
YLNSYMDQQSVVSAVRRLTLLGGPTPNTGAALDFVLRNILISSAGSRIAEGVPQLLIVLT
ADRSGDDVRGPSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV
ISDRVIQLNREELSRLQPVLLPPTSPGVGSKKDVVFLIDGSQSAGPEFQYIRTLIERLVD
YLDVGFDTTRVAVIQFSEDPRVEFLLNAHSSKDEVQNAVRRLRPKGGRQVNIGGALEYVS
RNIFKRPLGSRIEEGVPQFLVLISSGKSDDEVDDSAAELKQSGVAPFTVARNADQEELVK
ISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKILASTRYPSPAIESDAADIVFL
IDSSDSIKPDDVAHIRDFVIKIVRRLNIGPNKVRIGVVQFSNEVFPEFYLKTHKSQAAVL
DALRRLRFRGGSPLNTGRALEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDISRFS
QVISSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPSIEDRVMHAFGPSGVTP
APPGVDIPSPSRPEKKKADVVFLLDGSINFRRDTFQEVLRFVSEIVDTLYEGGDSIQVGL
VQYNSDPTDEFFLKDFSTKQQIIDAINKVVYKGGRHANTKVGIEHLRQNHFVPEAGSRLD
QRVPQIAFVITGGKSVEDAQEASLALTQKGVKVFAVGVKNIDSEEVGKIASNSATAFRVG
NVQELSELSEQVLETLHDAMHETLCPGVTDISKACNLDVILGFDGSRDQNVFVTQKGLES
KVDAVLNRISQMQRISCSGGQMPTVRVSVVANTPTGPVEAFDFAEYQPELFEKFRNMRSQ
HPYVLTADTLKVYQNKFRQSSPDSVKVVIHFTDGVDGSLADLQKASEALRQEGVQALILV
GLERVANLEQLMQLEFGRGFLYNRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQR
GDRGPIGSIGPKGIPGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGIKGSRGFP
GEKGELGEIGLDGLDGEDGDKGLPGSSGEKGNPGRRGDKGPKGDKGERGDIGIRGDPGDS
GRDSQQRGPKGETGDIGPMGLPGRDGVSGSPGETGKDGGFGRRGPAGAKGNKGGPGQPGS
VGEQGTRGAQGPPGPTGPPGLIGEQGISGPRGSGGTAGVPGERGRTGPLGRKGEPGEPGP
KGGIGSRGPRGETGDDGRDGVGGEGRRGKKGERGFPGYPGSKGAPGEPGTGGALGPKGIR
GRRGNSGPPGAVGQKGDPGYPGPSGPKGNRGDSMDQCALVQSIKDKCPCCYGPLECPVFP
TELAFALDTSEGVTQDTFSRMRDVLLKVVGDLTIAESNCPRGARVAVVTYNNEVTTEIRF
ADSKKKSVLLDKIKNLQVALTSKQQSLETAMSFVARNTFKRVRNGFLMRKVAVFFSNKPT
RASPQLREAVLKLSDAGITPLFLTSQEDGQLISALQINNTAVGHALVLPARRDLTDFLKN
VLTCHICLDICNIDPSCGFGTWRPSFRDRRAAGSDADIDMAFVLDSSESTTLFQFNEMRK
YIGYLVRQLDLSPDPKASQHLTRVAVVQHAPYESMGNVSVSPVKVEFSLTDYGSKEKLVD
FLHSRMTQLQGTRDLGRAIEYTIENVFESAPNPRDLKIMVLMLTGEVEKEQLEEAQRVIL
QAKCKGYFFVILGIGRKVNVKEVYGFASEPNDVFFKLLDKSTELNEEPLMRFGRLLPSFV
GSENAFYLSPDIRKQCDWFQGDQLSIKNPVKFGHKQLNIPNNVTSSPTTKLVTPAKPVTT
TEPVTTTTKPVATTTKPVATTTKPVTVVNLPASKPAAAKPAPPKPAAVRPVAPVRPVAAK
PEATKTATVRPAVAAKPVAAKPVAPKPAAVRPPTAARPVAAKPEAPKLQAAKPAVAKPAA
KPSREVQVSDVTENSAKLHWERPEPPGPYFYDLTVTSAHDQSLVLRQNLSVTERAVGGLL
AGHTYHVAVVCYLKSQVRAAYQASFSTKKAQPAAPQARSASSSTINLMVSTEPVAGGETD
ICKLPKEEGTCRKFMLKWYYDVETKSCMRFWYGGCSGNENRFNSQKECETVCAPALVNPG
VIAAMGT
NT seq 9564 nt   +upstreamnt  +downstreamnt
atgaggaaacatcgacatttgcccttggtggccatgttttgtctctttctctcaggcttt
tctcttacccgtgcccaacagcagcaagcagatgtcaaaaatggtgctgctgccgatata
atatttctagtggattcctcttggagcattggaaaggaacatttccaacttgttcgagag
tttctgtatgatgttatagaatctttagctgtgggagacagtgatttccgttttgctctg
gtccagttcaacggcaacccacataccgagttcctgttaaatacgtaccgtactaagcaa
gaagtcctctcccacatttccaacatgtcctatattgggggaagcaatgagactggaaaa
ggattagaatacgtaatgcagaaccacctgactgaggctgccggaagccgggccggtgac
ggagtccctcaggttatcgtagtgctaaccgatggacactcggacgacggccttgctctg
ccctcagcgggacttaagtctgctgatgttaacgtgtttgcaattggagttgaggacgca
gatgaaggagcgttaaaagaaatagcaagcgaaccgctcactatgcatgtgttcaaccta
gagaattttacctcacttcatgacatagtaggaaacttagtgtcctgtgtgcagtcatcc
gtggctccagaaggggctggaggcacagagacccttaaagacatcacagcacaagactct
gctgacattattttccttattgacggatcaaacaacaccggaagtgtccattttgcagtc
attcgcgacttccttgtaaatctccttgagagactctctgtcggagctcagcagatccga
gtgggggtggtccagtatagcgatgagcccagaaccgtgttctccttggacacctactcc
accaaggcccaggttctggatgcagtgaaagcccttgcgttcactggtggggagctggcc
aatgtcggcctcgcccttgatttcgtggtggagaaccacttcacccgggcagggggcagc
cgagtggaggaaggggttccccaggtgctggtcctcataagcgctgggccttctagtgac
gaaatccgcgacggggtggtagcactgaagcaggctagcgtgttctcatttggcctcgga
gcccaggccgcctccagggccgagcttcagcacatagccaccaatgacaacttggtgttt
actgtcccggaattccgtagctttggggacctccaggagcaattactgccgtacattgtt
ggcgtggcccaaaggcacattgtcttacaaccgccaaccattgtcacagaagtcattgaa
gtcaacaagagagacatagttttcctggtggatggctcatcctcactgggactggccaac
ttcaatgcaatccgcgacttcattgccaaagtcatccagaggctggaaatcggacaggat
cttatccaagtggcagtggctcagtacgcagacactgtgaggccagagttctatttcaat
agctaccccaataaaagggaagtcattaccgccgtgcggagaatgaaacccatggaaggc
tcggtcctgtacacgggctccgctctggactttgttcggaacaacctgttcactagttcg
gctggccaccgggccgccgagggggtccctaagctcctggtgctgattacaggcggtaag
tccctagatgaaatcagccagcctgcccaggagctgaagagaagcagcatcatgtccttt
gccgttgggagcaaggctgccgaccaggctgagctggaagagattgcttttgattcctcc
ctggtgttcctccccgccgagttccgagccgcccctctgcagggcgtgctgcccggcttg
ctggcgcctctcaggaccctctccggaacctctgaagttcacgtaaacaaaagggatatc
atctttcttttggatggatcgtccaacgttggaaagaccaatttcccttatgtgcgggac
tttgtcacgaacgtagttaacagccttgatgtcggaagcgacaatattcgtgttggttta
gtgcagtttagcgacactccggtgaccgagttctccctagacacataccagaccaaagca
gagttgcttgcccatctgaggcggctgcagccccagggggggtcgggcctgaacacgggc
tcggccctgagctatgtgcatgccaaccacttcaccgaagctggcggcagcaggagccga
gaacacgtgccgcagctcttgctcctgctcacggccgggccggccgaggacgcctacctg
ccggcggccaacgccctggcgcgcgccggcgtgctgaccctgtgtgtgggggctagccgg
gcgaacaaggccgagcttgagcagatcgcttttaacccgagcctggtgtatctcatggat
gatttcagctccctgccagctttgcctcagcagctcatccagcccctaaccacttatgtt
agtggaggtgtggaggaagtgccactcgcccagccagaaagcaagcgagacattctgttc
ctctttgacggctcagccaatctcgtgggccagttccctgcggtccgcgacttcctctac
aaggttattgacgagcttgacgtgaaaccggatgggacccggattgcggtggctcagtac
agcgatgatgtcagggtggagtcccgttttgatgagcaccagaataagcccgagatcctg
agccttgtgaagagaatgaagatcaagaccggcaaagccctcaatctgggctacgcgctg
gactacgcgcagaggtacatttttgtgaagtccgccgggagccgcatcgaggatggagtg
cttcagttcctggtgctgctggtggcgggaaggtcgtcggaccgtctggacacacctgca
ctcaacctgaaacagagcggggtggtgactttcatactgcaggccaagaacgcagaccct
gctgagctggggctgatggtgccttcccccgtctttatcctggccgccgagtcgcttccc
aagatcggagaccttcagccacagatcgtcaatctcctaaaatcagtgcacaacggggca
ccgacaccagtttcaggtgaaaaggacgtggtgtttctgattgatggctctgagggcgtc
aggagcggcttccctctgttgaaagaatttgtccagagagtcgtggagagcctggatgtg
ggcccggaccgggttcgcgtggctgtggtgcagtacagcgaccggaccagacccgagttc
tacctgaattcctacatggaccagcagagcgtggtcagcgctgtacgcaggctgacccta
ctgggagggccgacccccaacacgggggctgccctggacttcgtcctgaggaatatcctg
atcagctcggccggaagcaggatagcagaaggtgtcccccagctcctgatcgtcctcacg
gcggacaggtctggggatgacgtgaggggcccctcggtggtcgtgaagaggggaggggca
gtgcccatcggcatcggcatcgggaatgccgacatcacggagatgcagaccatctccttc
atccccgacttcgccgtggccattcctaccttccggcagctggggaccgtccagcaggtg
atctctgacagagtcatccagctcaaccgggaagagctaagcaggttgcagccagttttg
ctccccccaacgagcccgggtgttggaagcaagaaggatgtggtctttctcatcgatggg
tcccaaagtgctggtcccgaatttcagtacatccgcaccctcattgagaggctggttgac
tacctggacgtgggcttcgacacgactcgggtggcagtcatccagttcagcgaggatccc
agggtggaattcctgctgaatgcccattccagcaaggatgaagtgcagaacgccgtgagg
cggctgaggcccaaagggggaaggcaggtcaacattgggggtgccctggagtacgtgtca
aggaatatcttcaagaggcccctgggaagccggattgaagagggtgtccctcagttcctg
gtcctcatttcatccgggaagtctgatgatgaggtagacgattctgcagccgagctcaag
cagtctggtgtggcaccgtttaccgtcgcgaggaatgcagaccaggaagaactggttaag
atctccctgagccctgaatatgtgttctcagtgagcacgttccgggagctgcccagcctg
gagcagaaactgctgacccccatcacaaccctaacttcggagcagatacagaagatcctg
gccagcacccgctatccttctccagctattgagagcgatgcggcagacattgtcttccta
attgatagctctgacagcatcaagcctgatgacgttgcacatattagggacttcgtgatc
aagattgtccggagactcaacatcggccccaataaagtgaggattggggttgtacagttc
agcaatgaggtcttcccagaattctacctgaagacccataaatcccaggccgctgtgctc
gatgccttacgtcgcctgaggttcagaggggggtcaccactgaacactggcagagctctg
gaatttgtggcaagaaacctgtttgtcaagtctgctgggagccggatagaagacggggtg
ccccaacacctggttctgtttctgggtggaaaatctcaggatgatatttccaggttttca
caagtgattagctcctcagggattgtgagtttaggagtaggagaccgaaatatcgacaga
acggagctacagaccatcaccaacgaccccagactggtcttcacggtgcgagagttcaga
gagctccccagcatagaagacagggtcatgcatgcctttggaccctctggggtcacacct
gcacctccaggagtggacataccttctccctcacggccagagaaaaagaaagcagacgtt
gtgttcctgttggatggctccatcaatttcaggagggacactttccaggaagtgctccgt
ttcgtgtctgaaatcgtggacacgctgtatgaggggggtgactctatccaagtggggctg
gtccagtacaactctgaccccactgatgagttcttcctgaaggacttctccaccaagcag
cagattattgacgccatcaacaaagtggtctacaaaggggggagacacgcaaacaccaag
gtgggcatcgagcacctgcggcagaatcacttcgtgccggaggcgggcagccgcctggat
cagagggtcccgcagatagcctttgtgatcacgggtggaaagtcggtggaggacgcccag
gaggccagcctggcactcacccagaaaggggtcaaagtgttcgccgtgggcgtgaagaac
atcgactccgaggaggttgggaaaatagcgtccaacagtgccacggcgttccgagtgggg
aacgtccaggagctgtctgaattgagcgagcaagttctggaaactctgcatgatgcaatg
catgaaaccttatgtccgggagtgactgatatttccaaagcctgtaatctggatgtgatt
ctggggtttgacggatcgagggatcagaatgtatttgtgacccagaagggccttgagtcc
aaggtggatgctgtcttgaatagaatcagccagatgcaaagaatcagctgcagcggcggc
cagatgcccaccgtgcgggtgtcggtggtggccaacacgcccacaggccccgtggaggcc
ttcgactttgccgagtaccagccggagctgtttgagaaattccgtaacatgcgtagccag
cacccctatgtcctcactgcggacacgctgaaggtctatcagaacaagttccgacagtcc
tcaccggacagtgtgaaggtggtcattcatttcacggatggagtggatggaagtctggct
gatttacaaaaggcgtctgaggctctccgacaagaaggggtccaggctctgatcctggtg
ggccttgaacgggtagccaacttggagcagctgatgcagctggagttcgggcgaggcttc
ctgtacaacaggccgctgaggctaaacttgctggacctggactacgagctagcggagcag
ctcgacaacattgccgagaaagcttgctgtggcgttccctgcaaatgctccggacaaagg
ggagaccgcgggcccattggcagcatcgggccaaagggcattcccggggaggatggctat
cgaggctatcctggtgacgagggcggacccggtgagcgtgggcctcctggcgtgaacggc
actcaaggtttccagggctgccctgggcagagaggaataaagggctctcgcggattccca
ggagagaagggtgaattaggagaaatcgggctagatggtctcgacggtgaagacggagac
aaaggattgcctggttcttctggagagaaagggaatcccggtagaaggggtgacaaagga
cctaaaggggacaaaggagagagaggggacattggaattcgaggtgacccgggtgactca
ggacgggacagtcagcagagaggacccaaaggagaaactggagacattggccccatgggt
ctccctgggagagacggggtatctggaagccctggagaaaccgggaaggacggtggcttc
ggccgaaggggacctgcgggagctaagggcaacaagggcggcccgggccagccgggctcc
gtgggagagcaggggacccgaggcgcacagggtccacctggtcccacgggtcctccaggg
ctgatcggggaacaaggcatttccgggccccggggaagcggagggaccgcgggcgttcct
ggagaacgtggcaggaccgggcccctgggaagaaagggtgaacctggagagccaggaccg
aagggaggcatcgggagccggggcccccgaggggagacgggcgacgatggcagagacggg
gttggcggtgaaggacgcagaggcaaaaaaggagaaagaggattccctggatacccaggc
tcaaagggtgcccctggtgagccgggcacaggcggagcactgggacccaaaggcatcaga
ggccgaaggggaaattcaggacctccaggggcagttgggcagaagggagaccctggctac
ccaggaccatctggtcccaaaggcaacagaggggactccatggatcaatgtgcccttgtc
cagagcatcaaagataaatgtccgtgctgctatgggcccctggaatgccccgtcttcccg
acggagctggcctttgctttagacacctcggagggggtcacccaggacacgtttagccgg
atgagggatgtgctcctgaaggtcgtgggcgacctgaccattgcggagagcaactgccca
cggggggcgcgcgtggccgtggtcacctacaacaacgaggtgaccacggagatccggttc
gctgactccaagaagaagtcagtcctcctggacaagatcaagaaccttcaggtggctctg
acgtccaagcaacagagtctggaaacggccatgtccttcgtggccagaaacacgttcaag
cgcgtgaggaacggattcctgatgaggaaagtggctgttttcttcagcaataagcccacg
agggcgtccccgcagctcagggaggctgtgctgaagctctcggacgcaggcatcacaccc
ttgttcctcacaagccaggaggacgggcagctcatcagcgcgttgcagatcaataacacg
gcggtggggcatgcgctcgtcctccccgctaggagagacctcacggacttcctgaagaac
gtcctgacgtgtcacatttgcctggacatctgcaacattgacccatcctgtggattcggc
acctggaggccttccttcagggacaggcgagcggcgggcagtgatgcggacatcgacatg
gctttcgtcttagatagctctgagtccaccactctgttccagttcaacgagatgaggaag
tacatagggtacctggtcagacagctggacctgagcccagaccccaaggcctcccagcac
ttgaccagggtggccgtcgtgcaacacgctccctatgagtccatggggaacgtcagcgtg
tcacccgtgaaggtggaattctccttgactgactacggctccaaggagaagctggtggac
tttctccacagcagaatgacacagctgcaggggaccagggacctgggcagagccattgaa
tacaccatagagaacgtctttgaaagcgcccccaacccacgggacttgaaaattatggtt
ctgatgctgacgggtgaggtggagaaggagcagctggaagaggcccagagagtcatcttg
caggccaaatgcaagggttacttcttcgtgattctgggcatcggcaggaaggtgaatgtc
aaggaggtgtatggcttcgccagcgagccgaatgatgtcttcttcaaactactggataag
tcaactgagctcaatgaagagcctctgatgcgctttgggaggctgttaccatccttcgtc
ggcagtgaaaatgctttttacttgtccccagatatcaggaaacagtgtgattggttccaa
ggggaccaactgtcaattaagaatcctgtgaagttcggtcacaaacaattaaatattccg
aataatgttacttcaagtcctacaaccaaactagtgaccccagcaaagcccgtgaccacc
acagaaccggtgaccaccacgaccaaaccagtggccaccacgaccaaaccagtggccacc
acgaccaaaccagtgactgtggtaaatctgccggcctcaaagccagccgcagccaagcca
gccccacccaaaccggccgctgtgagacccgtggccccggtgaggcctgtggccgccaag
cccgaggccaccaagacggccacggtcagaccagcagtggccgcgaagccggtggccgca
aagccggtggccccgaagccagcagctgtgagaccccccactgcagccagaccagtggcc
gccaagcccgaggcccccaagctccaggcagccaaaccggccgtcgctaagcccgcggcg
aagccatcccgagaggtccaggtgtccgacgtcaccgagaacagcgccaaactccactgg
gagaggcccgaaccccctggcccttacttttatgacctcaccgtcacctcggcccacgac
cagtccctggtgctgcggcagaacctgtcggtgacggagcgcgccgtcggggggctgctc
gccgggcacacgtaccatgtggccgtggtctgctacttgaagtcccaggtcagagctgcc
tatcaagcaagtttcagcacaaagaaagctcagcctgcagccccacaagcgaggtcggct
tctagttcaaccatcaatctcatggtgagcacggaaccagtggctggtggcgagacagat
atatgcaagttgcccaaagaagaaggaacttgcaggaaattcatgttaaaatggtactac
gatgtggagaccaaaagctgcatgagattctggtacggaggctgcagtggcaacgaaaac
agatttaattcacagaaagaatgtgaaacggtttgcgctcctgcgctcgtcaaccccgga
gtcatcgcggccatggggacctaa

KEGG   Ursus arctos horribilis: 113250384
Entry
113250384         CDS       T05909                                 

Gene name
COL4A4
Definition
(RefSeq) collagen alpha-4(IV) chain isoform X1
  KO
K06237  collagen, type IV, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113250384 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113250384 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113250384 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113250384 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    113250384 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113250384 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113250384 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113250384 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113250384 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113250384 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113250384 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113250384 (COL4A4)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   113250384 (COL4A4)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113250384 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 113250384
NCBI-ProteinID: XP_026347584
UniProt: A0A3Q7WB67
LinkDB
Position
Unknown
AA seq 1688 aa
MHMASIRCSLRWIKPLAADPWSLILILFSLQHVYGSGKKFVGPCGGRDCSVCQCFPEKGS
RGQPGPLGPQGPIGPLGLPGPVGIPGEKGMRGDSGPPGAAGDKGDKGPTGVPGFPGLDGI
PGHPGPPGSRGKPGMHGYNGSRGDPGFPGERGVPGPGGPPGLPGESGEKGNSVFILGAIK
GIQGDRGNPGPPGLPGSRGARGPAGPMGHPGEPGLAGAPGHPGRPGLKGNPGVGVKGQMG
DPGEVGQQGSPGPTLLVQPPDSCLYKGEKGIKGMPGMIGPPGPPGPKGEPGIGAKGEKGI
PGFSGPRGDPGSYGSPGFPGLKGKPGVFGEPGSFGFLGPKGDPGDRGYPGPPGVLVTPSL
PLKGPPGDPGRPGRYGETGSVGPPGPPGPSGPPGEACAGMMGPPGPRGFPGHPGFPGAAG
IPGRADSSPGKPGNPGPPGLPGAPGLQGPPGSDVIYCSVGHPGPQGIKGKVGPPGRRGSK
GEKGNAGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGEKGHPGPPGAEGSPGPPGK
HGTSGPPGSKGEKGDMVIPRVKGHKGERGPDGLPGFPGQQGQHGRDGLPGRKGDPGPPGD
HEDALPGDEGSPGPPGPPGRAGPRGQPGLGFPGPPGERGPPGAPGRPGERGLEGLKGQKG
DTISCNVTYPGRPGPPGFDGPPGPKGFPGPPGAPGLRCLDGQKGRPGRPGISEIPGPPGF
RGDMGDPGFGGEKGPSLLGPPGLPGSRGANGQKGVMGDTAYGQPGAPGWRGLSGVPGSKG
HRGHPGRPGFAGPVGRPGLPGLKGPRGREGSAGFPGIPGPPGHSCEGGAPGTPGQPGLPG
APGRPGAPGWKGQRGDVGPPGPAGMKGLPGVPGRPGTDGPLGLPGVPGLSGDDGRPGLPG
PKGSQGLPGFPGFPGERGKPGPEGRTGRKGDPGEDGRPGFLGDQGVKGAKGERGPPGDEG
EMAIISQKGKTGEPGPPGDGGSPGEEGDKGDPGMQGRRGEPGRHGAPGFHRGEPGRTGQP
GLPGPPGLPGSPGLRGIIGFPGFPGDQGELGSPGSPGLSGVDGMRGPKGNRGDPASQFGS
PGPKGEPGSPGCPGHLGVPGEQGFPGVQGPTGPPGRPGLPGASGPPGCPGNQGVPGLQGP
PGETGDLGSRGMMGDPGTPGLPGIKGPSGSPGLNGLHGLKGQKGAKGASGLHEVGPPGPV
GIPGLKGETGDPGSPGISPPGLSGERGPPGPPGRPGSPGPAGAAGRAPEGDVPDPGPPGD
QGPPGPDGPRGAPGPQGPPGSVDLLKGEPGDCGLPGPPGPPGPPGPPGRKGFPGCDGKDG
QKGPIGFPGLQGPQGLPGPPGEKGLPGIPGRQGHPGLPGSRGEPGPPADVESCPRIPGLP
GVPGPRGPEGAMGVPGVRGPPGPGCKGESGPDGRRGEDGLPGPPGPPGSKGDAGEAGCPG
APGPPGPTGDPGPEGFGPGHLSGFLLVLHSQTDGEPACPAGMPRLWTGYSLLYLEGQEKA
HNQDLGLAGSCLPMFSTLPFAYCNIHQVCHYGRRNDRSYWLASAAPLPVTPLAEEAIRPY
ISRCAVCEAPAPAVALHSQDQSIPPCPRAWRSLWIGYSFLMHTGAGDQGGGQALMSPGSC
LEDFRAAPFLECQGRQGTCHFFANEYSFWLTTVSPDLQFSSAPSPDTLKESQAQRQRTSR
CRVCMKFS
NT seq 5067 nt   +upstreamnt  +downstreamnt
atgcacatggcgtccataaggtgctcactgaggtggatcaagccattggccgcagatccc
tggtcacttatacttatccttttttctctacaacatgtatatgggagtggaaagaagttt
gtgggtccttgcggaggaagagattgctcagtgtgccagtgttttcctgaaaaagggtct
cggggtcagccaggacctctggggccacagggtcctattggacccctgggactgcctgga
cccgttggcattccaggagagaaagggatgagaggtgacagtggtcctcctggggcagca
ggtgacaaaggtgataagggtccaactggtgttcctggatttccaggtttggatggcata
cctgggcacccagggcctcctggatccagaggcaagcctggcatgcacggctataatggt
tcacgaggtgatccagggtttccaggagaaagaggagttcctggcccaggagggccccca
ggccttcctggggaaagtggagaaaaaggaaactcagtgttcattttaggtgccattaaa
ggtattcagggcgacagagggaacccaggacctcccggcttgccgggatcaaggggggca
agaggaccagcgggccctatgggacatccaggagagcccgggttagctggtgctccaggc
catcctgggagaccgggcttgaagggcaatcctggcgtgggagtaaaggggcaaatggga
gacccgggtgaagttggccagcaaggttcccccggacccaccttactggtacagccgcct
gactcttgtttgtataaaggagaaaagggcataaaaggaatgcctggtatgattggacct
ccaggaccaccaggacccaagggagaacctggaattggggcaaaaggagagaagggtatt
cctgggttctcaggacctcggggtgatcctggttcctatggatctccaggttttccagga
ttaaaggggaaaccaggagtgtttggagaacctggatcatttggatttcttggcccaaag
ggggatcccggagaccgcgggtacccaggaccaccgggggttttggtaactccatctctc
ccactcaaaggccctccaggggatccggggcgccctggccgctatggagaaacggggtcc
gttggaccacctggtccccccggcccctccggtccaccaggggaagcctgtgcaggcatg
atgggaccacctgggccaagagggtttcctggtcatccgggatttccaggggcagctggt
atccctgggagagctgattccagtccaggaaagccagggaacccaggaccgcctgggttg
cctggagcaccagggctgcagggacctccgggatcagatgttatatactgtagtgttggg
caccctggaccacaaggaataaaaggcaaagtgggtcctccaggaagaagaggctcaaaa
ggagaaaaaggaaacgcggggctctgtgcctgtgagcctggtcccatgggcccaccaggc
cctccgggacttcctgggaggcagggtagtaagggagacttggggctccctgggtggctt
ggagagaaaggtcacccaggccctcctggtgctgaaggatctccaggaccaccaggaaaa
catggtacctcaggaccacctggcagcaaaggagaaaagggcgacatggttataccaaga
gtgaaagggcacaaaggagaaagaggtcctgatgggctcccaggatttccagggcaacag
ggacaacatggtcgagatggacttcctggaagaaaaggggatccaggccccccaggggat
catgaagacgcgctcccaggtgatgaagggtctcctggaccgccgggccccccgggcaga
gcaggacctaggggacaaccaggtctgggatttcctggcccaccaggcgagagagggcca
ccaggagctccgggccgccctggtgagaggggcctcgagggcttgaagggtcagaaaggc
gatacaatttcttgtaatgtcacctaccctgggaggccagggccccccgggtttgatggg
cctccaggaccaaagggatttccaggtcctccaggtgctccggggttgaggtgtttggat
gggcagaaaggtcggcctggcagaccaggaatatcagaaatcccgggtccacctggcttt
cgtggtgacatgggcgatccaggttttggaggtgaaaaagggccttcccttcttgggccc
ccaggccttcccggttctcgtggagcaaatggtcagaaaggagtcatgggagacactgcc
tatggccaaccaggtgccccaggatggagaggtctttcaggagtgccagggtcaaaagga
cacagaggtcacccaggacgtccaggctttgcagggccagtgggcaggccgggactccca
ggtctcaaaggccccagaggcagagagggaagtgctgggtttccaggaatcccaggtccg
cctggtcattcctgcgaaggaggcgccccagggacaccagggcaaccaggactccccggg
gctccgggccgtccaggtgccccaggttggaaaggacagcgaggggatgtggggcctcct
ggtcccgctggaatgaagggcctccctggagtcccgggacggccaggaacagatggtccc
ctaggactcccaggcgtcccaggcctctccggggatgatggacggcctggtcttccaggc
ccaaagggatcccaggggctgcctggcttccccggttttccgggggaaagaggaaagcct
ggccctgagggacgcactggcaggaagggggaccctggagaggatggtcggcctggcttc
ctcggagaccagggggtgaaaggtgccaaaggagagagaggacccccaggagatgaagga
gagatggctatcatttcccaaaaggggaaaaccggggaacccggacctccgggagatggt
ggatccccaggagaagaaggtgataaaggcgatcctgggatgcaggggaggagaggagag
ccgggaagacacggagcacctggatttcatagaggggagcctggtagaaccgggcagcca
gggcttcctggacccccaggcctcccaggctcacctgggctgagagggattattggtttt
ccgggatttccaggtgaccagggtgagctgggttctccagggtcccccggactttcagga
gttgatggaatgagaggacctaaaggaaacagaggtgaccctgctagtcaattcggctca
cctggtccaaagggtgaaccaggtagccctggatgtccaggacatctcggagtacccggg
gagcagggctttcccggtgttcaagggcccacaggaccacccggaaggccaggcctacct
ggtgcctctggaccaccagggtgtccaggtaatcagggggtgcctgggctgcagggacct
ccaggagaaacgggggatctcgggtcaagaggcatgatgggagatccagggacaccaggt
cttccaggaataaaaggtccctccgggtcgcccggtctgaacggcttacatggtttaaag
ggtcagaagggagccaaaggcgcttcaggtctgcacgaagtgggcccacctggtccagtg
ggcatacctgggctgaaaggagagacgggagaccctgggagcccgggaatttctccccca
ggcctttctggagaaagaggtccccccggtcccccagggagacctggatcacctggtcct
gcaggtgccgcaggaagagctcctgaaggggacgttcctgacccaggtccacccggagat
cagggacctcctggccccgatggtccaagaggagcacctgggccccaaggccctcctggg
agtgttgaccttctgaaaggggaaccaggagactgtggtctgccggggcctccaggtccc
ccgggcccacccgggcctccaggacgcaaaggcttcccaggatgtgatggaaaagacggc
cagaaaggaccaataggattcccggggctgcaggggccacaaggacttcctggcccccct
ggggagaagggtttacctggcattccaggcagacaggggcaccccggtcttccaggttcc
agaggtgagccagggccgcctgccgacgtggagtcctgtccccgaatccccgggcttccc
ggggtaccaggcccaagaggaccagaaggagccatgggggtgcctggagtgagagggccc
ccaggaccagggtgcaaaggagagtctgggccggatggcaggaggggcgaggatggcctc
ccagggcctcctgggcctcctggaagcaaaggggacgcgggagaagccggctgccctgga
gcaccaggccctcctgggcccactggggaccccgggcccgaagggtttgggcctggccac
ctcagtggcttcctcctggttctccacagtcagacggacggagagcccgcctgccccgcg
ggcatgcccaggctctggacgggctacagtctcttatacctggaaggacaggagaaggca
cacaaccaggacctcggtctggcagggtcttgccttcccatgtttagcacgctgcccttc
gcctactgcaacatccaccaagtgtgccactacggccgcaggaacgaccggtcctactgg
ctggccagcgccgcgccgctgcccgtgacgccgctcgccgaggaggccatccgcccgtac
atcagccgctgtgccgtgtgcgaggccccggcccccgccgtggcgctgcacagccaggac
cagtccatccccccgtgcccacgcgcctggaggagcctctggatcgggtactcgttcctg
atgcacacaggggctggggaccaaggaggagggcaggccctcatgtcccctggcagctgt
ctggaagacttccgagccgcaccattcctcgaatgccaaggccggcagggaacttgccac
ttttttgcgaacgagtatagcttctggctgacgacagtgagccctgacttgcagttttcc
tcggcgccctccccggacaccttgaaagagagccaggcccagcgccagaggaccagcagg
tgccgggtgtgcatgaagttcagctag

KEGG   Ursus arctos horribilis: 113250386
Entry
113250386         CDS       T05909                                 

Gene name
COL4A3
Definition
(RefSeq) collagen alpha-3(IV) chain isoform X1
  KO
K06237  collagen, type IV, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113250386 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113250386 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113250386 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113250386 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    113250386 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113250386 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113250386 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113250386 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113250386 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113250386 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113250386 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113250386 (COL4A3)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   113250386 (COL4A3)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113250386 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 113250386
NCBI-ProteinID: XP_026347592
UniProt: A0A3Q7WB72
LinkDB
Position
Unknown
AA seq 1670 aa
MSPRTAPRPPALLLPLLLVLLAAAPTTGKGCVCKDKGQCFCDGIKGEKGEKGFPGPVGSP
GQKGFPGPEGLPGPQGPKGSPGLPGLTGPKGVRGITGLPGFSGPPGLPGTPGHPGPYGPA
GIPGCNGSKGEQGFPGLPGIPGYPGMPGAVGLKGEKGAPAAEGIEPDGRGDPGLPGAPGF
QGLPGLPGFPGPVGPPGPPGFLGFPGTMGPPGPKGHMGDNVIGQKGERGVKGLTGPPGPP
GTVIVTLTGPDNRTDLKGEKGDQGAMGKSGPPGPSGPPGESYGSQKGAPGEPGPQGKPGK
DGAPGFPGTEGAKGNRGFPGLRGEDGIKGWKGDIGPPGFRGPTEYYDAYQEKGDEGLPGP
PGPKGARGPQGPSGSPGVPGSAGSSRPGLRGAPGPPGVKGSKGEQGPPGKNAVGPPGSPG
CPGSPGPIGLPGYPGPPGGIVFRQGPPGDGGLPGLVGFPGIPGVHGPKGEPGLLCIQCPC
IPGPPGLPGLPGLDGIKGIPGGQGAAGVKGSPGYPGSVGLPGFPGFPGTPGGPGLKGEKG
EAPQPEGEVGAPGEPGLRGHPGRRGLDGIPGTPGIKGSPGPKGEPALSGEKGDQGPPGDL
GTPGSPGPAGPPGPPNYGPQGEPGPKGTQGVPGAPGPPGEAGPKGEFGISTPVPGPPGPP
GPSGHAGPQGPPGVPGSIGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGEQGFPGAKGAPG
CPGEMGKPGSPGEPGLPGAKGEPGLAMPGEPGTPGFPGERGNSGENGEIGLPGPPGLPGI
PGTGGVDGLQGDPGKPGPPGEKGSPGRCMVGPRGAQGLPGLNGLEGQQGRRGETGPKGDP
GIPGLDRSGFPGEAGPPGMPGHRGEMGQPGQKGYPGNPGFLGLSGEKGMMGMMGHPGNTG
PPGPPGNPGTPGQRGSFGIPGAKGEKGPPGAKGEKGDKGPRGPSQISNLLGDKGEPGLKG
FAGKPGEKGNRGIPGLPGFKGHEGPPGPPGPPGLRGDPGSIGNPGEPGPRGGPGSMGNMG
VPGPKGHRGTLGLPGLTGRPGLPGVHGLRGDKGEPGYSAGTWPGPPGPKGDPGLPGDMGR
KGERGLPGTPGHSGPAGTEGAPGIPGSPGHPGKPGPDGDLGLKGIKGLPGSPGVKGPPGP
PGFLGPPGPVGMRGSQGRDGIPGPAGEKGETGLLGAHPGPRGSPGAPGAKGDRGVPGLPG
LPGRKGAVGDAGLRGPTGVMGLPGPPGFPGAIIPGQKGNRGPPGFRGNPGEPGPLGPPGS
HVRGIKGDKGLLGEPGPRGLPGTVGAEGPPGPPGAPGSPGLPGLRGDPGFHGFPGVKGEK
GNPGFLGPVGPPGQIGPKGPPGVRGDPGTIKIISLPGSPGPPGHAGGPGMQGEPGPPGPP
GILGPCGPRGKPGKDGRPGTPGPTGEKGNKGCKGEQGPPGSDGLPGLKGKPGHIGPPAPE
TMMRGFIFTRHSQTTAIPSCPEGTEPLYSGFSLLFVQGNEQAHGQDLGTLGSCLQRFTTM
PFLLCNIDDVCNFASRNDYSYWLSTPAPMPTDMAPITGRALEPYISRCTVCEGPTNAIAV
HSQTTDIPSCPNGWISLWKGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFIECHGRG
TCNYYSNSYSFWLASLNPQTMFRKPIPSTVKAGELEKIISRCQVCMKRRQ
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgagcccgaggacggcgcccaggccgcccgctctgttgctgccgctcctgctggtgctt
ctggcggctgcgcccacaacaggcaagggctgtgtctgtaaagacaaaggccagtgtttc
tgtgatgggatcaaaggggagaagggggagaaaggctttcctggacctgttggttctcct
ggccagaagggatttccaggtcccgaaggcttgcctggaccacagggacccaagggctca
ccaggactcccaggactcactggtcccaaaggtgtaaggggaataactggattgccggga
ttttcaggtcctcctggacttccaggcaccccaggccatcctgggccttatggacctgct
ggcatccctggatgcaatgggtctaagggggagcaagggtttccaggcctcccagggata
ccaggctacccagggatgccgggtgctgttggtttgaaaggagaaaagggtgctcctgct
gcagaaggtatagaacctgatggaagaggtgaccccgggttgccaggagctccgggtttc
cagggtttgccaggccttccaggctttccgggacctgttggcccacctggccctccggga
ttcttaggctttccaggaaccatgggacctccaggacctaagggtcacatgggcgataac
gtgataggacaaaaaggagagcggggtgtgaaaggattgacaggaccccctggaccacca
ggaacggttattgtgacgctaacaggcccagataacagaacggacctcaagggggaaaag
ggagaccagggagccatggggaaatccggacctcctggaccctcaggacctcctggagaa
tcttatggatctcaaaaaggtgctcctggagaacctggcccacagggaaaacctggcaaa
gacggtgcccctggtttccctggcactgagggagccaaaggcaacaggggcttccctgga
ttacggggtgaagacggcattaaggggtggaaaggggacattggccctccaggatttcgt
ggtccaacagaatattacgatgcataccaggaaaagggagatgaaggacttccaggcccc
ccaggccccaaaggagctcgtggcccacagggtcccagtggctctcctggggttcctgga
agtgctgggtcatcgaggcctggcctcagaggagcccccggacctccaggcgtgaaagga
agtaaaggggaacaagggcccccaggaaagaatgcagtggggcctccggggtccccaggt
tgtcccggttcaccaggccctatagggttgccgggatatccaggaccaccaggtggcatc
gtttttcgccaaggtccacccggagatggtggactcccaggccttgtagggtttccagga
atcccaggagtccatgggcccaaaggggaacctggcctcttgtgcatacaatgtccttgc
atcccagggcccccgggtctcccaggactgccagggttggatggcataaaaggaatacca
ggaggacaaggggcagctggcgttaaaggaagcccagggtacccaggaagtgtgggtctt
ccaggatttccaggattcccggggactccggggggtccaggacttaaaggagaaaaaggt
gaagcacctcagcctgagggagaagtgggtgccccaggggagcccggactcagagggcat
cccggaagaaggggcttggatggaattcctggaactccagggatcaaaggatcaccagga
cccaaaggtgaaccggccctgagtggtgagaagggggaccagggtcctccaggggatctt
ggcacccctgggtccccaggacctgcaggaccgcctggaccaccaaactatggaccacag
ggagagcctggtccaaagggcacccaaggagttcctggagcccctggaccacctggagaa
gccggtcctaaaggagaatttggtatttcaacaccagtcccagggcccccaggacctcca
gggccctctggccatgctggcccccaaggtccacctggtgtccctggatccataggaaaa
tgtggtgatccgggtcttcctgggcctgatggtgaaccaggaattccaggaatcggcttc
cctgggccccctggacctaagggagaacaaggttttccaggagcaaaaggagcaccaggt
tgtccaggagagatggggaagcccgggtcacctggagaaccgggtctcccaggagccaag
ggagaaccaggactagccatgcctggagaaccaggaacaccaggttttccaggagaaaga
ggcaattccggggaaaatggagaaattggactccctggacctccaggtctccctggaatt
ccaggaaccggaggggttgatggactgcaaggggatccagggaagcctggaccacctgga
gaaaaaggatctccaggaaggtgcatggtgggtcccaggggagcccagggacttccagga
ttaaatggattggaagggcagcaagggagaagaggtgaaacagggccaaagggagaccca
ggtattccaggcttggatagatcaggctttcctggagaagctggaccaccaggaatgcca
ggtcatcgaggtgaaatgggacaacctggtcaaaaaggatatccaggaaatccaggattt
ttaggattatcaggtgaaaaaggaatgatggggatgatgggccatccgggaaacactggc
cctcccgggcctcccgggaacccaggcaccccaggacagaggggtagctttggaattccg
ggagcaaagggtgagaaagggcccccaggagccaagggggaaaaaggagacaaaggacct
cgggggccttctcaaatatccaatttactgggggacaaaggagaaccaggcctcaaagga
tttgctggaaagcctggtgagaaaggaaacagaggcattccggggttaccaggtttcaaa
ggacacgaagggccacctggaccaccgggtccaccaggcctcaggggagatccgggcagc
attgggaatcctggagagccaggaccacgtggtgggccaggaagcatggggaacatgggg
gtgccaggtcctaaaggacacaggggaactttgggactaccgggtttaactgggagaccg
ggcctcccaggtgttcacggtctccgaggagataagggagagccaggttattcggcaggt
acatggccaggaccaccgggaccaaagggagacccaggattgccaggtgacatgggaagg
aaaggagaaagagggctacctggcacccctggacattcggggcctgctggaactgaggga
gcccctggaattcccggaagtcctggccacccaggaaagccgggccctgatggtgatttg
gggttaaaaggcatcaaaggcttgcctggttctccaggagtcaaaggccctccaggacct
ccaggattcctaggacctcctggaccggtgggtatgagaggcagccagggacgcgatgga
attcctgggccggcaggagaaaagggagaaacaggtttgctgggggcacatccaggcccg
agagggagccctggtgctccaggagccaaaggagacaggggcgtcccgggcttacctggc
ctcccaggcaggaaaggggcagtgggagatgcggggctgcggggacccactggcgtgatg
ggactcccagggccaccaggttttcctggagcaatcatccctggccagaaaggaaatcga
ggcccaccaggcttcagaggaaacccaggtgaacctggtcctctgggacctccagggagc
cacgtaagaggcatcaaaggagacaagggactcctgggcgagcctggccccagaggtctg
cccggaactgtaggagctgaggggccaccgggtccgccgggagcaccaggaagcccaggt
ctcccagggctcagaggtgatcctggattccatggatttccaggtgtgaaaggggagaag
ggcaatccgggatttctgggaccagttggacctccagggcaaattgggccaaaaggacca
cctggtgtacgcggagaccctggtacgattaagatcatctcccttccaggaagcccaggg
ccacctggccatgctggaggaccagggatgcaaggagaacccgggccaccggggccaccg
ggaatcctaggaccctgtgggccaagaggtaaaccgggcaaggacggaagaccaggaact
cctgggccaactggagaaaaaggcaacaaaggctgtaaaggagagcaaggaccacctgga
tcagacggactgccaggcttgaaggggaaacccggacacattgggccacctgcacctgag
acaatgatgagaggctttatcttcacccggcatagtcagaccacagcgattccctcctgt
ccagaagggacagagccgctctatagtgggttttctcttctttttgtacaaggaaatgaa
caagcccatggacaagatctgggaactctcggcagctgcctgcagcggttcaccacaatg
cctttcttgctctgtaacatcgacgatgtgtgtaattttgcctctcggaatgattattca
tactggttgtcaacaccagctccgatgccaacagacatggctccgattactggcagggcc
ctggagccttacatcagcagatgcactgtctgcgaaggtcctacgaatgccatagccgtt
cacagccaaaccactgacattccctcatgtcccaatggctggatttctctctggaaagga
ttttcgtttatcatgttcacaagtgcgggttctgagggcgctggacaggcactggcctcc
cctggctcctgcctggaagaattccgagccagtccatttatagaatgtcatggaagagga
acgtgcaactactattcaaattcctacagtttctggttggcttcattaaacccccaaacg
atgttcagaaaacctattccatcaactgtgaaagctggggagttagaaaagataataagt
cgctgtcaggtgtgcatgaagagaagacaatga

KEGG   Ursus arctos horribilis: 113250491
Entry
113250491         CDS       T05909                                 

Gene name
FN1
Definition
(RefSeq) fibronectin isoform X1
  KO
K05717  fibronectin 1
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04810  Regulation of actin cytoskeleton
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah05100  Bacterial invasion of epithelial cells
uah05135  Yersinia infection
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05205  Proteoglycans in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113250491 (FN1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113250491 (FN1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113250491 (FN1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    113250491 (FN1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113250491 (FN1)
   05205 Proteoglycans in cancer
    113250491 (FN1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113250491 (FN1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113250491 (FN1)
  09171 Infectious disease: bacterial
   05135 Yersinia infection
    113250491 (FN1)
   05100 Bacterial invasion of epithelial cells
    113250491 (FN1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113250491 (FN1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113250491 (FN1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:uah04131]
    113250491 (FN1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113250491 (FN1)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113250491 (FN1)
Membrane trafficking [BR:uah04131]
 Endoplasmic reticulum (ER) - Golgi transport
  Forward pathways
   ER-Golgi intermediate compartment (ERGIC) proteins
    113250491 (FN1)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of bladder cancer cells
   113250491 (FN1)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113250491 (FN1)
SSDB
Motif
Pfam: fn3 fn1 Pur_ac_phosph_N fn2 DUF2369 DUF1410
Other DBs
NCBI-GeneID: 113250491
NCBI-ProteinID: XP_026347788
UniProt: A0A3Q7VFD2
LinkDB
Position
Unknown
AA seq 2385 aa
MLRGPGPRLLLLAVLSLGIAVPSTGASKSKRQAQQIVQPQTPGAVSQSKPGCYDNGKHYQ
INQQWERTYLGNALVCTCYGGSRGFNCESKPEPEETCFDKYTGNTYRVGDTYERPKDSMI
WDCTCIGAGRGRISCTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCK
PIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTCLGEGSGRITCTSRNRCNDQDTRTSY
RIGDTWSKKDNRGNLLQCICTGNGRGEWKCERHASVQTTSTGSGPFTDVRTAIYQTQPHP
QPAAYGHCVTDSGVVYVVGMQWLKTQGNKQMLCTCLGNGVSCQETAVTQTYGGNSNGEPC
VLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTRGGNSNGALC
HFPFLYNNHNYTDCTSEGRRDNMKWCGTTPNYDADQKFGFCPMAAHEEICTTNEGVMYRI
GDQWDKQHDMGHMMRCTCVGNGRGEWTCVAYSQLRDQCIVDDITYNVNDTFHKRHEEGHM
LNCTCFGQGRGRWKCDPIDQCQDSETRTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQ
PLQTYPGTTGPVQVIITETPSQPNSHPIQWNAPEPSHISKYILRWKPKNSPGRWREATIP
GHLNSYTIKGLTPGVVYEGQLISVQHYGHKEVTRFDFTTTSTSPPVTSNTVTGETTPLSP
VVATSESVTEITASSFVVSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTATSVNIPDL
LPGRKYIVNVYQISEEGEQSLILSTSQTTAPDAPPDPAVDRVDDTSIVVRWSRPQAPITG
YRIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVFIQQETTG
VPRSDKVPPPRDLQFVEVTDVKITIMWTPPESAVTGYRVDVIPVNLPGEHGQRLPVNRNS
FAEVTGLSPGVTYLFKVFAVNQGRESKPLTAEQATKLDAPTNLRFTNETDSTVLVTWTPP
RARIAGYRLTVGPTRGGQPKKYNVGPSASQYPLRNLQPASEYTASLVAVKGNQQSPKATG
VFTTLQPLSSIPPYNTEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSESGSIVV
SGLTPGVEYVYTISVLRDGQERDAPIVKKVVTPLSPPTNLHLEANPDTGVLTVSWERSTT
PDITGYRITTTPLNGQQGYSLEEVVHADQSSCTFENLSPGLEYNVSVYTIKDDKESVPIS
DTIIPAVPPPTDLRFTNVGPDTMRVTWAPPPSIELTNLLVRYSPVKNEEDVAELSISPSD
NAVVLTNLLPGTEYLVSVSSVYEQHESIPLRGRQKTGLDSPSGIDFSDITANSFTVHWIA
PRATITGYRIRHHPEHTGGRPREDRVPPSRNSITLTNLSPGTEYVVSIVALNGREESPPL
IGQQSTVSDVPRDLEVIAATPTSVLISWDAPAVTVRYYRITYGETGGNSPVQDFTVPGSK
STATISGLKPGVDYTITVYAVTGRGDSPASSKPVSIDYRTEIDKPSQMQVTDVQDNSISV
RWLPSSSPVTGYRVTTTPKNGAGPSKTKTAGPDQTEMTIEGLQPTVEYVVSVYAQNRNGE
SQPLVQTAVTNIDRPKGLAFTDVDVDSIKIAWESPQGQVSRYRVTYSSPEDGIHELFPAP
DGEEDTAELQGLRPGSEYTVSVVALHDDMESQPLIGTQSTAIPAPADLKFTQVTPTSLTA
QWTAPNVQLTGYRVRVTPKEKTGPMKEINLAPDSSSVVVSGLMVSTKYEVSVYALKDTLT
SRPAQGVVTTLENVSPPRRARVTDATETTITISWRTKTETITGFQVDAIPANGQTPIQRT
IKPDVRSYTITGLQPGTDYKIYLYTLNDNARSSPVVIDASTAIDAPSNLRFLATTPNSLL
VSWQPPRAKITGYIIKYEKPGSPPREVVPRPRPGVTEATITGLEPGTEYTIQVIALKNNQ
KSEPLIGRKKTDELPQLVTLPHPNLHGPEILDVPSTVQKTPFITNPGYDTGNGIQLPGTS
GQQPSVGQQMIFEEHGFRRTTPPTTATPVRHRPRPYPPNVNEEIQVGHVPRGDVDHHLYP
HVMGLNPNASTGQEALSQTTISWTPFQESSEYIISCHPVGIDEEPLQFRVPGTSASATLT
GLTRGATYNIIVEALKDQKRHKVREEVVTVGNSVDQGLNQPTDDSCFDPYTVSHYAIGEE
WERLSESGFKLSCQCLGFGSGHFRCDSSKWCHDNGVNYKIGEKWDRQGENGQMMSCTCLG
NGKGEFKCDPHEATCYDDGKTYHVGEQWQKEYLGAICSCTCFGGQRGWRCDNCRRPGAEP
GHEGSTGHYNQYSQRYHQRTNTNVNCPIECFMPLDVQADREDSRE
NT seq 7158 nt   +upstreamnt  +downstreamnt
atgctcaggggtccggggccccggctgctgctgctggccgtcctgtccctggggatagcg
gtgccctccaccggagcgtcgaagagcaagagacaggcccagcaaatcgtgcagccccag
accccaggggctgtcagccagagcaagcctggttgttacgacaacgggaaacactatcag
ataaatcaacagtgggagcgcacctacctgggaaatgccttggtttgtacctgttacggt
gggagccgaggctttaactgcgagagcaaacctgaacctgaagagacttgctttgacaag
tacacggggaacacctaccgcgtgggtgacacttatgagcgccctaaagactccatgatc
tgggactgtacctgcatcggagccgggcgagggcggataagctgcaccattgcaaaccgt
tgccatgaagggggtcaatcctacaagattggtgacacctggaggagaccgcatgagact
ggtggttacatgttagaatgtgtatgtctcggcaacgggaaaggagaatggacctgcaag
cccatagctgagaaatgttttgatcacgctgctgggacttcctacgttgttggagagacc
tgggaaaagccatatcaaggctggatgatggtggattgtacttgtctgggagaaggcagt
ggacgtatcacctgcacttctagaaacagatgcaacgatcaggacaccaggacatcctac
cgaattggagacacgtggagcaagaaggataatcggggaaacctgctccagtgcatctgc
accggcaacggcagaggggagtggaagtgtgaaaggcacgcgtctgtgcagaccacgtcc
accggatccggccccttcacagatgtccgaacggccatctaccagacccagcctcacccc
cagcctgctgcgtacggtcactgtgtcacagacagtggtgtggtttacgtcgtggggatg
cagtggctgaagacacaaggaaataagcaaatgctttgcacttgcctgggcaatggagtc
agctgccaagagacagctgtcacccagacttacggtggcaattccaacggggagccctgt
gtcctgccgttcacctacaacggcagaactttctactcctgcaccacggaagggcgacag
gacggccacctgtggtgcagcaccacctccaattacgagcaagaccagaaatactccttc
tgcacagaccataccgttttggttcagactcgaggtgggaattccaatggtgccttgtgc
cacttccccttcctgtacaacaaccacaactacacggactgtacttctgagggcaggaga
gacaacatgaagtggtgtggaaccacgccgaactatgacgctgaccagaagtttggcttc
tgccccatggctgcccacgaggaaatctgcacaaccaatgaaggggtcatgtatcgcatt
ggagatcagtgggacaaacagcatgatatgggccacatgatgaggtgcacgtgcgttggg
aatggtcgtggagaatggacttgtgttgcctactcccaactccgagatcagtgcattgtt
gacgacatcacttacaacgtgaacgacacattccacaagcgtcacgaagagggacacatg
ctgaattgtacctgctttggtcagggccggggcagatggaagtgcgatcccattgaccaa
tgccaggactcagaaacccgcacattttatcaaattggagactcctgggagaagtacgtg
catggagtcaggtaccagtgctattgctatggccgtggcattggggagtggcattgccag
cctttgcagacctatccaggcacaactggtcccgtccaagtaatcatcactgagaccccc
agtcaacccaattctcaccccattcagtggaatgcaccagaaccatctcacatttccaag
tacattctcagatggaagcctaaaaattctccaggccgttggagggaggccaccattccc
ggccacttgaattcgtacaccatcaaaggcctgacgccaggtgtggtatatgaggggcag
ctcatcagtgtccagcactacggccacaaagaggtgacacgcttcgacttcaccaccacc
agcaccagccccccggtgaccagcaacaccgtgacaggagagacgacacccctttctccc
gttgtggccacttctgaatctgtgactgaaatcacggctagcagctttgtggtctcatgg
gtgtcggcctcagacactgtgtcaggattccgtgtggaatacgagctgagtgaggagggc
gatgaaccgcagtatctcgatctcccaagcacggccacttccgtgaatatccctgacctg
cttcctggccgaaaatacattgtgaatgtctatcagatatctgaagaaggagagcagagt
ctgatcctgtctacctcgcagacaacagcgcctgatgctcctcccgaccctgctgtagac
cgagttgatgacacctcgattgttgttcgctggagcagaccccaggcgcccatcacaggg
tacagaatagtgtattcgccatcagtagaaggtagcagcacagaactcaaccttcctgaa
accgccaactcggtcaccctcagtgacttgcagcccggcgttcaatataacatcactatc
tatgcggtagaagaaaaccaggaaagtactcctgttttcatccaacaagaaaccactggc
gtcccgcgttcagataaagttccccctccccgggacctgcagtttgtggaagtgacggac
gtgaagatcaccatcatgtggacaccccctgagagtgcagtgactggctaccgcgtggac
gtgatccccgtcaacctgcctggggaacacgggcagaggctgcccgtcaacaggaactcc
tttgcagaagtcaccggcctgtctcccggggtcacctatctcttcaaagtcttcgctgtg
aaccaagggcgggagagcaagcctttgacggcagaacaagcaaccaaattggatgctccc
actaacctccggtttaccaatgaaactgactcgaccgtcttggtgacctggactccacct
cgggcccggatagccgggtaccgactgaccgtgggcccgacccggggaggccagcccaag
aagtacaacgtggggccctcggcctcacagtaccccctgaggaatctgcagcctgcgtcc
gagtacaccgcatccctcgtagctgtcaaaggcaaccagcagagccccaaagccactgga
gtcttcaccactctgcaacctctgagttccattccaccttacaacacggaggtcaccgag
accaccattgtgatcacatggacacctgctccaaggattggttttaagctgggtgtacga
ccaagccagggaggggaagcaccgcgagaagtgacttcagaatcaggaagcatcgttgtg
tccggcctgactccaggcgtggaatacgtgtacaccatctcagtcctgagggatgggcaa
gagagagatgctccgattgtaaagaaagtagtgacaccattgtctccaccaacaaacttg
cacctggaggcaaaccctgacactggagtgcttaccgtctcctgggagaggagcaccaca
ccagacattactggttatagaattaccaccacccctctaaatggacaacagggatactct
ttggaagaagtggtccatgcagatcagagctcttgcacctttgaaaacctgagtcctggc
ctggagtacaatgtcagtgtttacactatcaaagatgacaaggaaagtgtccctatctct
gataccatcatcccagctgtccctcctcccactgacctgcgattcaccaatgttggtcca
gacactatgcgtgtcacctgggccccacctccatccattgaactgaccaacctcctggtg
cgctactcacctgtgaaaaatgaggaagatgttgcagagctgtcaatctctccttcggac
aatgcagtggtcttaacaaatctcctgcctggcacagagtatttggtcagtgtctccagt
gtgtacgaacagcatgagagcatacctcttagaggaagacagaaaacaggtcttgattcc
ccatctggcattgacttctctgatatcactgccaactctttcactgtgcattggatcgct
cctcgagccaccatcactggctaccggatccgccatcatcctgagcacactggtgggaga
cctcgggaagatcgagtgcccccgtctcggaattccatcaccctcaccaatctcagtccg
ggcacagaatatgtggtcagcattgttgctctgaatggcagagaagaaagtcctcccttg
attggccaacagtcaacagtttctgatgttccaagggacctggaagtcattgccgcaacc
cccaccagcgtgctgatcagctgggacgctcctgctgtcactgtgagatattacaggatc
acctatggagaaacaggaggaaacagccctgtccaggacttcactgtgcctgggagcaag
tccacagctaccatcagcggccttaaacctggagtagactacaccatcaccgtgtacgct
gtcactggccgtggagacagccccgcaagcagcaagccggtttccattgactatcgaaca
gaaattgacaaaccatcccagatgcaagtgactgatgttcaggacaacagcattagtgtc
aggtggctgccttcaagttcccctgttactggttacagagtgaccactactcccaaaaac
ggcgcaggaccatcaaaaacgaaaactgcaggtccagatcaaacagaaatgaccattgaa
ggtttgcagcccacagtggagtatgtggttagtgtctatgctcagaatcgaaatggagag
agtcagcctctggttcaaaccgcagtaacgaacattgatcgccctaaaggactggcattc
actgatgtggatgtcgattccatcaaaattgcttgggaaagcccacaggggcaagtttcc
aggtacagggtgacctactcgagccctgaggatggaatccatgagctattccccgcacct
gatggtgaagaagacaccgcagagctgcaaggcctcaggccgggttctgagtacacagtc
agtgtggttgccttgcacgatgatatggagagccagcccctgattggaacccagtccaca
gccattcctgcaccagctgacctgaagttcactcaggttactccaacaagcctgaccgcc
cagtggacggcacccaatgttcagctcacaggatatcgagtgcgggtgacccccaaggag
aagactggaccaatgaaagaaatcaaccttgctcctgacagctcatctgtggttgtgtca
ggactcatggtgtccaccaaatacgaagtgagtgtctatgcccttaaggacactctgaca
agcagaccggctcagggagtcgtcactactctggagaatgtcagccctccaagaagggcc
cgtgtgacagatgctaccgagaccaccatcaccattagctggagaaccaagactgagaca
atcactggcttccaagttgatgccatcccagcaaacggccaaactccaattcagagaacc
atcaagccagatgtccgaagctacaccatcacaggtttacagcccggcacggactacaag
atctacttgtacaccctgaacgacaatgcccggagctcccccgtggtcattgatgcctcc
actgccattgatgcaccatcgaacctgcgtttcctggccaccacacccaactccttgctg
gtatcatggcagccgccccgtgccaagattactggttacatcatcaagtatgagaagcct
gggtcccctcccagagaagtggtccctcgtccccgccctggtgtcacagaggctactatc
accggtctggaaccaggcaccgagtacaccatccaggtcattgccctcaagaacaaccag
aagagtgagcctctgattggaaggaaaaagacagatgagcttccccaactggtaaccctt
ccacacccaaatcttcacggaccagagatcttggatgttccctccacagttcaaaagacc
cctttcatcaccaaccctgggtatgacactggaaacggtattcagcttcctggcacttct
ggtcagcagcccagtgttgggcaacaaatgatctttgaggagcatggttttaggcgcacc
acaccgcccacaacggctacccctgtaaggcataggccaagaccatatccgccgaatgta
aatgaggagatccaagttggtcacgtccccaggggagacgttgaccatcacctctaccct
cacgtcatgggactcaatccaaatgcctctacaggacaagaagctctctctcagacaacc
atctcttggaccccattccaagaaagctctgagtatatcatttcatgtcatccagttggc
attgatgaagaacctttacagttccgagttcctggaacttctgctagtgccacgctgaca
ggcctcaccagaggggccacctacaacatcatagtggaggccctgaaagaccagaagagg
cacaaggttcgggaggaggttgttaccgtgggcaactctgtcgaccaaggcctaaaccaa
cctacagatgactcatgcttcgacccctacacggtttcccattatgccattggagaggag
tgggagcggttgtctgaatctggctttaagctctcgtgccagtgcttaggctttggcagt
ggtcatttcagatgtgactcatctaaatggtgccatgataacggtgtgaactacaagatt
ggagagaagtgggatcgtcagggggaaaatggccagatgatgagctgcacatgtcttgga
aatggaaaaggagaattcaagtgtgatcctcatgaggccacgtgttatgatgatgggaag
acataccacgtgggagaacagtggcagaaggaatatcttggtgccatttgctcctgcaca
tgctttggaggccagcggggctggcgctgtgacaactgccgcagaccaggggctgaaccc
ggtcacgaaggctccactggccactacaaccagtactcgcaaagataccatcagagaact
aacactaatgtcaactgcccaattgagtgcttcatgcctttagatgtacaggctgacaga
gaagattcccgagagtaa

KEGG   Ursus arctos horribilis: 113251538
Entry
113251538         CDS       T05909                                 

Gene name
COL4A2
Definition
(RefSeq) collagen alpha-2(IV) chain
  KO
K06237  collagen, type IV, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113251538 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113251538 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113251538 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113251538 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    113251538 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113251538 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113251538 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113251538 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113251538 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113251538 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113251538 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113251538 (COL4A2)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   113251538 (COL4A2)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113251538 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 113251538
NCBI-ProteinID: XP_026349378
UniProt: A0A3Q7W3Z5
LinkDB
Position
Unknown
AA seq 1712 aa
MDRHQCPASSPALRRWLLLGAVTVGLLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYTGPPGLQGFPGLQGRKGDKGERGSPGITGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGPPGYDGCNGTRGDVGRQGPEGPAGFLGPPGPQGPKGQKGEPYALSSEDRD
RYRGEPGEPGLVGFQGTPGRPGPVGQMGPVGAPGRPGPPGPPGPKGQPGNRGLGFYGEKG
EKGDVGLPGPNGIPSDTHHAIIGPMKEMFHPEQYKGEKGSEGEPGVKGISLRGEEGVMGF
SGPRGAAGFDGEKGSPGQKGIKGLDGYQGPHGPPGPKGEAGERGPPGAPAYFPHPSLIKG
ARGDPGFPGTHGEPGSRGEPGDPGPPGLPGTSVRDEDGKRGLPGEMGPKGFIGDPGIPAQ
HPGPPGTDGRPGLQGLPGPAGPPGPDGFLFGLKGTEGTVGYPGASGFPGARGQKGWKGDS
GDCKCAESDQFITGLPGLPGPKGFPGINGEPGKKGSQGDPGQHGIPGFPGFKGAPGNVGP
PGPKGAKGDSRAVTTKGDRGQPGVPGVPGLKGDDGVPGRDGLDGFPGLPGPPGDGIKGPP
GDSGYPGAPGTKGVPGERGPPGLGLPGPKGQRGFPGDAGLPGPPGFPGPPGLPGTPGLID
CDTGVKRPIGGDGQEVVQPGCVGGPKGSPGLQGPPGPSGAKGLQGIPGLSGADGAPGLKG
LPGDPGREGFPGPPGFVGPRGSKGAAGLPGTDGHPGPTGLPGPVGPPGDRGLPGEVLGAQ
PGPRGDAGLPGHPGVKGPPGERGPPGFRGSQGMPGMPGLKGQPGFPGPSGQPGLPGPPGQ
HGFPGAPGREGPFGLPGAPGYGGLPGDRGDPGDTGVPGPVGMKGLSGDRGDPGLLGERGP
PGSPGVKGMSGMPGVPGEKGGRGSPGMDGFQGMVGLRGRSGLPGNKGEAGFFGIPGLKGL
AGEPGVKGSRGDPGPPGPPPIILPGMKDIKGEKGDEGPMGLKGYLGLKGVPGMPGIPGLS
GVPGLPGKPGHIKGIKGDIGVPGVPGLPGFPGVPGPPGIIGFPGFTGSRGDKGAPGRAGL
YGEVGPTGDFGDIGDTIDLLGSPGLKGERGVTGAPGLKGFFGEKGTVGEVGFPGITGVPG
VQGPPGPKGQTGFPGLTGLQGPQGEPGRVGLPGDKGDYGWPGIPGSPGFPGIRGISGLHG
LPGTKGFPGSPGADIHGDPGFPGPAGDRGDPGEANTRPGPFGAPGQKGERGEPGERGPVG
SPGLQGFPGITPPSNMSGSPGDKGAPGIFGLEGYRGPPGPPGPAALPGTKGDEGNPGAPG
NPGSKGWGGDPGPQGRPGVFGLPGEKGPRGEPGFMGNTGATGSVGDRGPKGPKGDRGLPG
APGSMGSPGIAGIPQKISVQPGPVGPQGRRGPPGAQGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAIPGFRGDQGPMGQQGPVGQEGEPGRPGTPGLPGMPGRSISIGYLLVKHSQTDQ
EPMCPVGMKTLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEEDIKPYISRCSVCEAPAVAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5139 nt   +upstreamnt  +downstreamnt
atggacagacaccagtgcccggcgtccagccctgccctgcggcggtggctgctgctgggg
gctgtgacagtggggctcctggcccagagcgtcctggcgggcgtgaagaagttcgatgtg
ccctgcggagggagagactgcagcgggggctgccagtgttaccccgagaagggaggacgg
ggccagcccgggccagtgggcccccaggggtacaccgggcccccaggcctgcagggattc
ccaggactgcaaggccgcaaaggtgacaagggcgaacgggggtcacccgggatcacagga
ccgaaaggagacgtgggagcgagaggtgtgtctggattccctggtgccgacggaattccc
ggacaccccgggcaaggtgggcccagaggaccacccggctacgatggctgcaacgggacc
agaggagacgtgggccggcagggacccgaaggccctgcggggttcctcggccctcccggg
ccccaaggacccaaaggacagaaaggcgagccatatgcgctatccagtgaggaccgcgac
agatacaggggtgaacccggagagcctggattggttggtttccagggcacccccggccgc
cctgggcctgtaggacagatgggtccggttggagctcccggaagaccaggcccgcctgga
ccccctggaccgaaaggacagccaggcaacagaggacttggcttttatggagaaaagggt
gaaaagggcgacgtgggactgccgggacccaatgggatcccatcagacacccaccatgcc
atcatcggacccatgaaggagatgttccacccagagcagtataagggtgaaaaggggagt
gaaggggagccaggagtaaaaggcatctccttgaggggagaagaaggagtcatgggtttt
tcgggtccacggggtgctgctggctttgatggtgaaaaaggttcaccgggacaaaaaggg
atcaaaggactggatggttatcaaggcccccacggacccccaggacccaagggagaagca
ggcgagcgtggccccccaggtgcacctgcttacttccctcacccctccctaatcaaaggt
gccagaggtgacccaggattcccagggacccacggggagccgggaagccggggtgaacca
ggagacccaggccccccgggcctccctggcacgtccgtcagagacgaagatggcaagaga
ggcctcccgggtgaaatgggccccaaaggcttcataggagacccaggcatccctgcacag
caccccggcccgccgggcactgatggaaggccaggactccaaggactcccagggcctgcc
ggaccaccgggaccagacggtttccttttcggccttaaaggaacagaagggacggtgggc
taccctggggcttctggcttccctggggctcgtggacagaaaggatggaaaggtgactct
ggggactgtaagtgtgctgagagtgatcagttcatcacggggctcccagggctgccagga
cccaagggctttcctggcatcaatggggagccagggaagaaagggagccaaggagacccc
ggccagcacggcatccccgggttcccagggttcaagggtgcccctggcaacgtgggacca
cccgggcccaaaggggcgaagggggattccagagcagtcaccaccaaaggtgacagagga
cagccaggggtcccaggtgtgccggggctgaaaggtgacgacggtgtccccgggcgcgac
gggctggatgggttcccaggcctcccaggccctccgggtgatggcatcaaaggcccccca
ggggactcaggttacccaggagcacctggcactaagggcgttccaggagaaaggggaccc
ccaggactgggcctgcctggccccaagggccagcgtggtttccccggagatgccggatta
cctggaccaccaggctttcccggtccccctggcctcccaggcacccccggactaatagac
tgtgacacaggtgtgaagaggcccatcggaggtgacggacaggaggtcgtgcagccaggt
tgcgtcggagggcccaagggatcaccaggcctgcagggacccccgggcccctcaggtgcc
aagggcctgcaagggataccaggactctcgggcgctgatggagcaccagggctcaagggt
ctccccggagacccaggtcgtgaagggttcccgggacccccagggttcgtggggccccga
ggatccaaaggagcagcgggccttcctggcacggatggacacccgggtcccactggtctg
ccaggccccgtcgggcccccaggggacaggggccttcccggagaagttctgggagcccag
cccgggccccggggagatgctggactgcctggacaccccggggtcaaaggccctccggga
gagagaggcccgcctggattcaggggaagccagggcatgccgggaatgccaggcctgaag
ggccagccgggcttcccaggaccctcgggccagccaggcctgcccgggccgccaggacag
cacggattcccaggagctcccggccgggaggggccctttgggctgccaggtgcccccggt
tacggaggtctgcctggagacagaggggacccaggggacacaggtgtccctggccctgtg
ggcatgaagggtctctccggcgatagaggtgaccctggtttgctgggggagagaggcccc
ccaggaagtcctggggttaaaggaatgagcggaatgcctggtgtccctggggagaaaggc
gggagaggctcacccgggatggatggtttccaaggcatggttgggctcagaggaagatcc
gggcttccgggaaacaaaggagaggccggattttttggaattccaggactgaagggtctg
gctggggagccgggtgtgaaaggcagtcgtggggaccccggacccccaggaccaccgccc
atcatcctgccaggaatgaaagacatcaaaggagagaaaggagatgaagggcctatgggg
ttgaaaggatacctgggcctgaaaggtgttcccggaatgccagggatccccgggctgtcg
ggagtccccgggttgccagggaagccaggacacatcaaaggaatcaaaggagacatcggc
gtgcccggcgtgcctggtttaccaggattccctggtgtgcccggcccccccggaatcata
gggtttcccgggttcacaggaagtaggggtgacaagggagctccggggagagcaggcctg
tacggcgaggttggccccaccggagacttcggtgacattggagacactatagacctgctg
ggaagcccgggcctgaagggggagcggggcgtcactggagcaccaggtctgaagggattc
ttcggggagaaaggaacggtgggcgaagtcggcttccctgggataaccggcgtgcccggc
gtccaaggccctcccggacccaaagggcaaacaggcttcccaggactgacagggctgcag
gggccgcagggagagcccgggcgggtgggactgcccggcgacaaaggagactacggctgg
ccggggattccaggctccccaggttttcccggcatccgaggcatcagcggactgcacggc
ctgccaggcaccaaaggctttccaggatccccaggtgcggacatccacggagaccccggt
ttccccggtcctgctggggacaggggtgacccaggagaggccaacacccgcccaggccct
ttcggagccccaggacaaaagggggaacgaggcgagccaggggaacgaggcccagtcggg
agtccaggacttcagggtttcccaggcatcacccccccttccaacatgtctggctcgcct
ggtgacaaaggggcgccgggcatatttggcctagaaggttatcgagggcctcccgggcca
cccgggcctgctgctcttcctggaaccaaaggagatgaggggaacccaggagctccggga
aaccccgggtccaaaggatggggcggggaccctgggccccaaggccgacccggtgtgttc
ggtctcccgggagagaaagggcccagaggggagccaggattcatgggaaacacaggagca
accgggagtgtgggcgacagaggccccaagggacccaagggagaccgaggccttccaggt
gcccccggttccatgggatcccctgggattgcaggaatcccccagaagatttccgtccag
ccggggccagtgggtccgcagggaaggagaggccccccgggggcacagggagagatgggg
ccccagggccccccaggagaaccaggtttccgcggggctcccgggaaggcagggccccag
ggcagaggtggtgtgtctgctattcccggattccgaggagaccaggggcccatgggacag
caggggccggtgggccaggaaggggagccaggccgcccggggacccccggcctgccaggc
atgccgggccgcagcatcagcatcggctacctgctggtgaagcatagccagacggaccag
gagcccatgtgccccgtgggcatgaagacgctctggagcgggtacagcctgctatacttc
gaaggccaggagaaagcccacaaccaggacctgggtctggcgggctcctgcctggctcgg
ttcagcaccatgcccttcctgtactgcaaccccggtgatgtgtgctactatgccagccga
aatgacaagtcctactggctgtccaccaccgccccgctgcccatgatgcctgtggccgag
gaggacatcaagccctacatcagccgctgctccgtgtgcgaggccccagctgtcgccata
gcggtgcacagccaagatgtctccatcccccactgcccagccggatggcggagtctgtgg
attggatactccttcctcatgcacacggccgccggcgacgagggtggcggccagtcgctg
gtgtcgccgggcagctgcctggaggacttccgcgccacgcccttcatcgagtgcaacggg
ggccgcgggacctgccactactacgccaacaagtacagcttctggctcaccaccatcccc
gagcagagcttccagggctcgccctcggccgacacgctcaaggcggggctcatccgcacg
cacatcagccgctgccaggtgtgcatgaagaacctgtga

KEGG   Ursus arctos horribilis: 113251539
Entry
113251539         CDS       T05909                                 

Gene name
COL4A1
Definition
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen, type IV, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113251539 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113251539 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113251539 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113251539 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    113251539 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113251539 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113251539 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113251539 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113251539 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113251539 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113251539 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113251539 (COL4A1)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   113251539 (COL4A1)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113251539 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 113251539
NCBI-ProteinID: XP_026349380
UniProt: A0A3Q7VB36
LinkDB
Position
Unknown
AA seq 1669 aa
MGPRLGVWLLLLPAALLLHEERSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPHGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGEPGPLGPPGLPGFAGNPGPPGLPGMKGDPGEIIGHVPGTLLKGERGFPGP
PGTPGSPGLPGLQGPVGPPGFAGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGLGEKGEPGKPGPRGKPGKDGEKGEKGS
PGFPGDAGYPGLPGREGFKGDKGEAGPPGPPGIAIGPGPSGEKGERGYPGTPGLRGEPGP
KGFPGLQGQPGPPGFPVPGQAGAPGFPGERGEKGDQGFPGTSLPGPSGRDGLPGPPGPPG
PPGQPGHTNGIVECQPGPPGDQGPPGIPGQPGLTGEVGEKGQKGESCLICDTSGLRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGLEGLPGPQGSPGLMGQPGAKGEPGEIYFDARLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDI
GPPGPPGFGPIGPVGDKGQAGFPGTPGSPGQPGPKGEAGKAVPLPGPPGTQGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGTIGQPGIGFPGPPGPKGVDGLPGDVGPPGSPGRPGFNG
LPGNPGVPGQKGEPGVGLPGLKGLPGLPGVPGTPGEKGNIGGPGVPGEHGAIGPPGLQGI
RGDPGPPGLQGPKGAPGAPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGHPGLPGQQGTPGVPGFPGPKGEMGVMGTPGQPGSPGPAGLPG
LPGEKGDHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGIGGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGVPGLQGEKGAKGEKGQAGLPGIGIPGRPGDKGDQGITGFPGSPGE
KGEKGSAGIPGVPGSPGPKGSPGTVGYPGSPGLPGEKGDKGLPGSDGIPGIKGEAGLPGK
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGEKGSKGDVGFPGQAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGPKGDRGPQGQPGLPGLPGPMGPAGLPG
LDGLKGDKGNPGWPGAPGAPGPKGEPGFQGLPGIGGSPGITGSKGDMGPPGVPGFQGQKG
LPGLQGTKGDQGDQGFPGSKGLPGPPGPPGPYDIIKGEPGLPGPEGPAGLKGLPGPPGPK
GQQGVTGSVGLPGPPGSPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPIAGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcggcgtctggctgctgctgctgcctgccgccctcctgctccacgag
gagcgcagccgggccgctgcaaagggtggttgtgctggctctggctgcgggaagtgtgac
tgccatggagtgaagggacagaagggcgagagaggcctcccagggttgcaaggtgtcatc
gggtttcccgggatgcaaggacctgaggggccccacgggccaccaggacaaaagggtgac
actggagaacccgggctgcctggaacgaaggggacaagaggacccccaggagcatcgggt
taccctggaaacccaggactgcctggtattcctggccaagacggtccccccggtcctcca
ggaatccccggatgcaatggaacaaagggcgagccagggcctctggggcccccgggttta
cctggattcgccggaaatcctggaccgccaggattaccaggaatgaagggggatccaggt
gaaatcattggccatgtgcctgggaccctgttgaaaggtgaaagaggatttcctggaccc
ccaggaacaccaggctcgccaggactgccgggcctgcaaggtccggttggccctccagga
tttgccggaccaccgggtcctccaggccctcctggccctccaggcgaaaaggggcaaatg
ggcttaagttttcaagggccaaaaggcgacaagggtgaccaaggcgtcagtgggcctccc
ggagtgccaggacaagctcaagttcaagagaaaggagactttgccactaaaggagagaag
ggtcaaaaaggtgaacctggatttcagggaatgccggggctcggagagaaaggggagccc
ggaaaaccagggccccgaggaaaacctggaaaagatggcgaaaaaggagagaaagggagt
ccgggctttccaggtgacgcggggtacccgggactgccaggccgcgaaggtttcaaggga
gacaaaggtgaagcaggtcctccaggcccacctggaattgcaatcggcccaggaccctct
ggagaaaaaggagagcgggggtacccgggcaccccagggttgagaggagagccaggcccc
aaaggtttcccaggattacaaggccagccaggccctccaggctttccagtaccagggcag
gctggtgctcctggcttccctggtgaaagaggcgagaaaggtgaccaagggtttccaggc
acatctttgccaggaccaagtggaagggatgggctccctggcccccctgggcctcctggg
ccccctggacagccgggccacacaaatggaattgtggaatgccagcctggaccgccaggg
gatcagggtccccccggaataccggggcagccagggctgacgggagaggttggagaaaaa
ggtcagaaaggcgagagctgcctcatctgtgacacatcgggactgcgtgggcctccaggg
ccgcagggccccccgggagaaataggtttcccaggacagccaggggccaagggcgacaga
ggcttacctggcagagatggtctggaaggattgcctggcccgcaaggctcaccagggctg
atgggccagccaggagccaagggagagcctggtgagatttacttcgacgcacggctcaaa
ggagacaaaggagacccaggctttccaggacagcccgggatgccaggcagagcagggtct
cctgggagagatggccatccgggtctgcccggccccaaaggctccccgggttcagtagga
ttgaaaggagagcgtggaccccccgggggagtcggattccccggcagtcgcggtgacatc
ggccctcctgggcctccagggtttggccctattggccccgttggtgacaaaggacaagcg
ggttttccagggacccctggatccccaggccagccaggtcccaagggtgaagcaggaaaa
gccgtgcccctacctggtccccctggaacacaaggacttccgggatccccaggtttcccg
gggccgcaaggtgaccgaggttttcctggaaccccaggaaggccgggcctgccaggagag
aagggcactattggccagcccggaatcggatttccagggccccctggccccaaaggtgtt
gatggcttacccggagacgtgggacctcctggcagtccaggccgcccgggatttaatggc
ttacctggcaacccaggtgtgcctggccaaaagggagagcccggagttggtctaccagga
ctcaaaggattgccaggtcttcctggcgttcctggcacacctggggagaaggggaacatc
gggggaccaggtgttccgggagagcatggtgctattggccccccaggccttcagggaatc
agaggtgacccaggacctcctggattacaaggtcccaaaggagctccaggagctcctgga
attggcccccccggagcgatgggcccccctggaggacagggaccacccgggtcctcaggc
cctcccggagtgaaaggagagaagggcttccccggatttccaggcctggacatgccaggt
cccaaaggagataaagggtcccaagggctgcctggcctgacgggacagtcggggcaccct
ggtcttcctggacagcagggcacacctggggttcctgggtttccaggtcccaagggagag
atgggcgtcatggggaccccagggcagcctggctcaccgggaccagcgggcctgccagga
ttaccaggagaaaaaggggaccacggcttcccgggctcctcagggcccaggggagacccc
ggcttcaagggggacaaaggagacgtgggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggtcagaagggagaccaaggagagaaaggacaaattgga
ccgactggtgataaaggatcccggggagatcctggaaccccaggagtgcctggaaaggat
ggccaggcgggacatcctgggcagccaggacctaaaggtgacccaggcataggtggaacc
ccgggtgccccaggactccctggacccaaaggatcggttggtggaatgggcctgccaggg
acacctggagaaaaaggtgtgcctggaatccccggcccgcagggcgtccctggcttacag
ggggagaaaggagcaaaaggagagaaagggcaggcgggcctacctggcattggaattccg
ggccgtcccggggacaagggagatcaagggatcacaggctttccggggagtcctggagag
aagggagaaaaaggaagtgctggcatcccaggggtccctggctccccaggccccaaagga
tcaccggggactgttggctatccaggaagccccgggttgcctggagaaaaaggtgacaaa
ggtctcccgggatcggatggcattcccggcatcaaaggagaagcaggtcttcctgggaag
cctggccccacgggcccagccggccagaaaggggagcccggcagtgacggaatcccaggg
tcggcgggagagaagggtgaaccaggtctgcccggaagaggattcccagggtttccaggg
gccaaaggagagaaaggttcaaagggcgacgtgggtttcccgggacaagccggcagtcca
ggcatccccggatccaaaggagagcaaggattcatgggtcccccggggccgcaaggacag
ccgggcttacctggaactccaggccatgctgtggaggggcccaaaggagaccggggccca
cagggtcaacctggcctaccagggcttccgggacctatggggcctgcggggctccctggg
ctcgatggactcaaaggtgacaaaggaaacccaggttggccgggggctcctggagctcca
gggcccaagggagagccaggattccagggcctgcctgggattggtggctcgccagggatc
acgggctccaagggtgatatggggcctccaggagtgccgggatttcaaggtcagaaaggc
ctccctggcctgcagggaacgaagggggatcaaggtgaccagggcttccctggaagtaaa
ggccttcccggtcccccgggtcccccaggaccctacgacatcatcaaaggggagccagga
cttcctggtcctgagggtcccgcaggtctgaaggggctcccaggacctccaggccccaaa
ggacagcaaggtgtgacgggatctgtgggcttacctggaccgccaggtagtcccggtttt
gacggcgccccgggccagaaaggagagacgggacccttcggccctcctggtccacgaggg
tttccgggcccgccaggccctgatgggctgccaggatccatgggtcccccaggcaccccg
tctgttgatcatggcttccttgtgaccaggcacagtcaaacaacggatgacccacaatgt
cctcctgggaccaaaattctttaccacgggtactccttgctctacgtgcaaggcaacgaa
cgagcccacggccaggacttgggcacggccgggagctgcctgcgcaagttcagtacgatg
cccttcctcttctgcaacatcaacaacgtgtgtaacttcgcctcccgaaacgactactcc
tactggctgtccacacccgagcccatgcccatgtccatggcgcccatcgccggggacaat
atcagaccatttattagcaggtgcgcggtgtgtgaggcgccagccatggtgatggccgtg
cacagccagaccattcagatcccgcagtgccccagcggctggtcctccctctggattggc
tattccttcgtgatgcacaccagtgccggtgctgaaggttctggccaagccctcgcgtcc
cccgggtcttgtctggaagagttcaggagcgcgccattcatcgagtgccatggccgtggg
acttgcaactactatgcaaacgcttacagcttttggcttgccaccatcgagagaagcgag
atgttcaagaagcccacgccgtccaccttgaaggccggggagctgcgcacgcacgtcagt
cgctgtcaagtctgtatgagaagaacgtaa

KEGG   Ursus arctos horribilis: 113253229
Entry
113253229         CDS       T05909                                 

Gene name
LAMA1
Definition
(RefSeq) laminin subunit alpha-1 isoform X1
  KO
K05637  laminin, alpha 1/2
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
uah05410  Hypertrophic cardiomyopathy (HCM)
uah05412  Arrhythmogenic right ventricular cardiomyopathy (ARVC)
uah05414  Dilated cardiomyopathy (DCM)
uah05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113253229 (LAMA1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113253229 (LAMA1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113253229 (LAMA1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113253229 (LAMA1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113253229 (LAMA1)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy (HCM)
    113253229 (LAMA1)
   05412 Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    113253229 (LAMA1)
   05414 Dilated cardiomyopathy (DCM)
    113253229 (LAMA1)
   05416 Viral myocarditis
    113253229 (LAMA1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113253229 (LAMA1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113253229 (LAMA1)
   05145 Toxoplasmosis
    113253229 (LAMA1)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_B Laminin_N Laminin_I Laminin_II Laminin_G_3 Hepar_II_III BAR_3
Other DBs
NCBI-GeneID: 113253229
NCBI-ProteinID: XP_026351788
UniProt: A0A3Q7VS38
LinkDB
Position
Unknown
AA seq 3102 aa
MRGGGGAGVALLASLLWVAARCQQRGLFPAILNLASNAHISTNATCGEQGPEMSCKLVEH
VPGRPVRNAQCRICDANSANPKERHPISNAIDGTNNWWQSPSIQNGREYHWVTITLDLRQ
VFQVAYVIIKAANAPRPGNWILERSLDGTTFSPWQYYAVSDTECLTRYNITPRRGPPTYR
ADDEVICTSYYSRLVPLEHGEIHTSLINGRPSADDLSPKLLEFTSARYIRLRLQRIRTLH
ADLMTLSHRDPRDLDPIVTRRYYYSIKDISVGGMCICYGHASSCPWDETTKKLQCQCEHN
TCGESCSRCCPGYHQQPWRPGTVSAGNTCEKCNCHNKAQDCYYDENVANQRRSLNIAGQF
RGGGVCINCLQNTMGINCETCIDGYYRPYKVSPYEDNPCRPCDCDPFGSLSSVCIKDELQ
SGLHKGKRPGQCPCKEGYAGEKCDRCQFGYKAYPACTRCECSRAGSVNEDPCSEPCLCKE
NVEGDNCDLCKPGFYNLKERNPEGCSECFCFGVSDVCDSLSWSISQVKDMSGWLVTDLIS
PSKVPSQQDALGGRHQISINNTLVMQRLTSKYYWSAPEAYLGNKLTAFGGFLKYTVSYAI
PVETVDGDLMSHADVIIKGNGLTLSTQAEGLSLQPYEEYLNVVRLVPENFRDFNNKREVD
RDQLMTVLANVTHLLIRANYNSAKMALYRLDSVSLDTASPNVLDLALATEVEHCECPQGY
AGISCESCLPGYYRVDGILFGGICQPCECHGHAAECDIHGVCFACEHNTTGDHCEQCAPG
FYGLPSRGTPGDCQRCACPLSTASNNFSPTCHLEDGDEVVCDQCAPGYSGAWCERCADGY
YGNPTVPGESCVPCNCSGNVDPFEEGHCDSVTGECLKCIGNTDGAHCERCADGFYGDAVT
AKNCSACECHGKGSLSDVCHLETGLCDCKPHVTGRQCDQCLPGYYGLDAGLGCLPCDCSA
SGSVSDDCTAEGRCHCVPGVAGEKCDRCARGFYAYQDGGCTPCDCAHTQHTCHPESGECI
CPPHTRGAACDECEDGYWGHDLELGCQACNCSGVGSARPGCDALTGHCQCKPGFGGPNCH
QCSLGYRGFPDCVACDCDPRGTSADTCDEEQSLCSCAEETGSCSCKENVFGLHCSKCRAG
TYGLHADDPLGCTPCFCFGLSQACSELEGYVRTPVTLGAGQPLLRVVSQSNLRGTTEGVY
YQAPDVLLDAVTVRRHVHAEPYYWRLPDQFQGDQLLAYGGSLKYSVAFYSSDGIGTFNLE
PQVLLKGGRTRKQVIYVDMPAPENGVRQEQEVGMKENFWKYFNSVSEKPVTRSDFMSVLS
NIEYVLIKASYGQGLQQSRISNISMEVGRKAGELHPGQKAASLLEKCVCPPGTAGFSCQD
CAPGYHRGRLPPGGSRGPRPLLAPCVPCSCNNHSDACDPETGKCLDCRHSTAGDHCDVCA
PGYYGKVTGSPSDCSPCACPRSHPASFSPTCVLEGDADFRCDACVLGYEGQYCERCSSGY
HGNPRAPGGTCQRCDCSPRGSVHGDCDRRSGQCVCRLGATGLRCEECEPRHILLESDCVS
CDDECVGVLLNDLGNIDDTILSVNLTGIIPVPHGILSTLENTTKSLREALLKENPQKELA
KDQLEGVAEQTDHLLRELARVLTSSQNVTRATEGILNKSQDLMTFTEKLQINIQEIIEKA
ATLNQTLDEGFQLPSSTLQNMQRNITSLLEIIQKRHFKQLHQNAIQELKAAEDLLSQIQK
NYRKPQKELEVLKAAASSLLSKHNSELRAAGDLARGAEAEARESDRLLRIAHANLQEFHE
KKLRVQEEQNLTSTLIARGRRLLAAVTARAEATQNVLAQLERHRDELLLWTAKIRHHVDG
LVMQMSQRRALDLVYRAEDHAAELQRLAGALDSSLGGVRHVSLNATSTSHVHSNIWSLIE
ESEKAAKDALETVTTASVFSESLVSNGKVALQRSSRFLKEGNSLSRKYRDITLKLSELKN
AANRFQENADTITRRANESLSILRAIPGVSPTRCLTAPEVRGFTCVRDKGTKVKELATAA
NQSAASTLKDVVGLSQKLLNTSTDLSRVNATLQETNKLLQDSSMTTLLAGRRVKDAEAQA
SLLFDRLKPLKILEENLNRNLSEIKLLISQARKQAASIKVAVSADRDCIRAYQPPISSTN
YNTVTLNVKTSEPDNLLFYLGSSTSSDFLAVEMRRGKVAFLWDMGSGSTRLEFPDFPIDD
NKWHSIYVTRFGNIGSLSVKEMSATQKPPTKTSKSPGTANILDINNSTLMFVGGLGGQVK
KSPAVKVTHFKGCMGEASLNGQSIGLWNYIEREGKCHGCFGSPQNEDSSFHFDGSGYSVV
ERTLRAAGTHIIMLFSTFSPNGLLLYLASNGTKDFLSIELVRGRVRVTVDLGSGPLALIT
DRRYNSGTWYKIAFQRNRKQGILAVIDAYNTSYRETKQGETPGASSDLNRLDKDPIYVGG
LPRSRAVRKGVTSKSYVGCIKNLEISRSTFDLLRNSYGVRKGCILEPIRSVSFLKGGYVE
LSPKSLSPESELLATFATKNSSGIILAALGRQGEKQGHLQAHGQPSQPFFSIMLIEGHIE
VHVNSGDGTSLRRALLRAPKGTYGDGREHSISLIRTGRVITVQMDEMNPVEMKLGPSAES
RAINVSKLYLGGIPDGEGTSVLKMRRSFHGCIKNLIFNMEHLDFTSAAGNEQVDLDTCLL
SERPKLALHGEDSELPPESQPLPSLEQCAVDRAPEYIPHAHQFGLTQSSHFVLPFNQLAV
RKRLSVQLRIRTFASSGLIYYMAHQNQVDYATLQLHGGHLHFLFDLGKGRTKVTHPALIS
DGRWHTVKTEHFKRKGFMTVDGQESPTVTAVGDGTMLDVEGKLYLGGLPSEYRARNIGNI
THSVPACIGEVTVNGKQLDKDNPVSAFAVTRCYAVAQEGTFFDGSGYAALVKEGYKVQSD
VNITLEFRTTSENGVLLGISSAKVDAIGLEMVNGKILFHVNNGAGRITATYKPKATTTLC
DGKWHTLQAQKSKHRLVLTVDGNAVRAESPHTQSTSADTSNPIYVGGYPADVKQNCLSSQ
TSFRGCLRKLTLIKGPQIQSYDFSRAFDLQGVFPHSCPGSEP
NT seq 9309 nt   +upstreamnt  +downstreamnt
atgcgtggcggcggcggcgcgggggtcgcgctcctggcctcgctgctctgggtcgccgcg
cggtgccagcagagagggctgtttcctgccattctcaatcttgccagcaacgctcacatc
agcaccaacgcgacctgtggtgagcaagggcccgagatgtcttgcaagctcgtggagcac
gtgcccggtcgccccgtgcgcaacgcccagtgccggatctgtgatgccaacagcgccaac
cccaaagaacgccatccaatatcaaatgcaatcgatgggaccaataactggtggcaaagt
cccagtattcaaaacgggagagaatatcactgggtcacaatcactctggacctaagacag
gtctttcaagtcgcctacgtcatcatcaaagctgctaacgcccctcgacctggaaactgg
attttggagcgttctctggacggcaccacgttcagcccctggcagtattatgcagttagt
gacacggagtgcttgactcgttacaatatcactccgagacgggggccgcccacctacagg
gctgatgacgaagtgatctgcacctcctactattccagactggtaccgcttgagcatgga
gagattcatacgtcactgatcaacggcagaccgagcgctgatgatctttcccccaagttg
ctggagttcacttccgcgcgatacatccgccttcgattacagcgcattagaaccctccac
gccgatctcatgaccctcagccaccgtgaccctagagaccttgatcctattgttacacga
cgatattactattcaatcaaagacatatctgttggaggaatgtgtatttgttacggtcat
gctagcagctgcccatgggatgaaactacaaagaaactacagtgtcagtgtgagcataac
acctgcggagagagttgcagtaggtgctgtcctgggtaccaccagcagccctggaggcct
ggcactgtttctgctgggaacacgtgcgaaaaatgtaattgtcacaataaagcccaagac
tgttactacgatgaaaatgttgcaaatcagaggaggagtttgaacattgctggacagttc
cgaggagggggggtttgcatcaattgcctgcagaacaccatgggaatcaattgtgaaacc
tgtattgatggatattacagaccttacaaggtgtctccttatgaggacaacccttgtcgt
ccctgtgactgtgacccttttgggtccctcagttcggtctgtattaaagatgagctccaa
tctggcttacacaaagggaagcggccaggtcaatgtccatgcaaggaaggctacgcagga
gaaaaatgtgatcgctgccaatttggttacaaggcttacccagcctgcacccgctgtgag
tgtagtcgggccgggagcgtgaatgaggacccatgctcagagccttgtctctgtaaggaa
aatgttgagggagacaattgcgacctctgcaagccaggattctataacttgaaggagaga
aaccccgagggctgctcagagtgcttctgcttcggcgtttctgatgtctgtgacagcctt
tcttggtccatcagtcaggtgaaagacatgtctgggtggctggtcaccgacctgatcagt
ccgagcaaggtcccgtctcagcaagacgcgctgggtggacgccatcagatcagcatcaac
aacaccttggtcatgcagaggctgacttccaagtattactggtcggccccagaggcctac
ctcggaaataagctgactgcgtttggcggcttcctcaagtacacggtgtcttatgctatt
cccgtggagacagtggatggcgacctcatgtctcacgctgatgtgatcattaagggaaat
ggactcactttaagcacccaggctgagggcctgtcattacaaccctatgaagagtacttg
aacgtggtcagacttgtgccagagaacttccgagattttaataacaaaagggaggtagat
cgtgaccagctgatgactgtccttgccaacgtgacacatctcttgattagagccaactac
aattctgcaaaaatggctctttacaggttggattcggtctctttggatacagccagccct
aacgttctagacctggcgctggccaccgaggtggagcactgcgaatgtccccaaggctac
gcagggatctcctgtgagtcctgcctccctggctattaccgcgtggatggaatactcttc
ggaggaatttgtcaaccctgtgaatgccacggccatgcagctgaatgtgatatccatggc
gtttgctttgcgtgcgagcacaacaccactggggaccactgtgagcagtgcgcgcccggc
ttctacgggctgccctcccgagggactccgggggactgccagcggtgtgcctgccccctc
tccacagcctccaacaatttcagccccacctgccacctagaggatggggacgaagtggtt
tgtgaccagtgcgccccgggatactcgggagcttggtgcgagagatgcgcagacggttac
tatggaaacccaacagtgcccggggaatcctgtgttccctgcaactgcagcggcaatgtg
gaccccttcgaggagggtcactgtgactccgtcaccggagagtgcctgaaatgcattggg
aacacggatggcgcccactgcgagaggtgtgctgacggcttctacggggacgccgtgact
gccaaaaactgcagcgcctgtgagtgccatggaaaaggctccctgtctgatgtctgccat
cttgagactggactctgtgactgcaaaccccacgtgactggacggcagtgtgaccagtgc
ttgcctggctattacgggctggacgcggggctcgggtgcctgccctgtgactgcagcgcg
tcaggctccgtgtcagatgactgcaccgcggaaggccggtgtcactgcgtcccgggcgtc
gctggggagaagtgcgacaggtgtgcacgcggcttctatgcctaccaggacggtggctgt
acaccctgtgactgtgctcacacacagcacacttgtcatccggaatcgggggagtgtatc
tgccctcctcacactcggggcgccgcctgtgacgaatgtgaggacggatactggggccac
gacctcgagctcggatgccaggcctgcaactgcagtggtgttgggtcggccaggcccggg
tgcgatgcgctcactggccattgccaatgtaagcctggcttcggtggaccgaactgtcat
cagtgctccctggggtacagaggctttccagactgtgtggcctgtgactgtgacccgcga
gggacatcggcagatacctgtgatgaggaacagagtctgtgcagctgtgcagaggaaacc
ggcagctgctcttgcaaggaaaatgtgtttgggcttcattgcagcaagtgtcgagctggc
acctacggcctccatgctgatgaccctctgggatgtaccccctgcttctgcttcgggctg
tcacaagcctgctcagagctggagggttatgtgaggacaccagtaacgctgggcgcaggg
cagcccctcctgcgtgtggtttctcagagcaacctccggggcacgaccgagggggtgtat
taccaggcccctgacgtcctcctggacgccgtgactgtcaggcgacatgtccacgcagag
ccgtattactggcggctgccagaccagttccagggagaccagctcctggcttacggcggc
agcctgaagtacagcgtggccttctattcttccgacggcatcggcaccttcaacctcgag
ccccaagtgctcctcaaagggggccggaccagaaagcaagtcatttatgtggacatgccg
gccccggagaacggagtgcgacaagagcaggaagtggggatgaaggagaatttttggaag
tattttaactccgtttctgaaaaacctgtcacgcgctccgattttatgtctgttcttagc
aacattgagtacgtcctcatcaaagcatcttacggtcaaggattacagcagagcagaatc
tcaaatatttcaatggaggttggcaggaaggctggagagttgcacccgggacagaaggcg
gcgtctctcttagagaagtgtgtctgtcctcctggcacagctggattctcgtgtcaggac
tgtgcacctggttaccacagggggaggctcccgccaggtgggagccgaggaccccgcccg
ctgcttgccccttgtgtgccctgcagttgcaacaaccacagtgatgcctgtgaccctgaa
actgggaagtgtctggactgtaggcacagcaccgcgggggaccactgcgacgtgtgcgcc
cctgggtactacgggaaggtgaccggctcccccagcgactgctctccgtgcgcctgtccc
cgcagccaccccgccagtttcagccccacttgcgtcttggaaggtgatgcggatttccgc
tgtgacgcctgcgttctgggctacgaaggacagtactgcgaaaggtgctcctcaggctat
cacgggaaccctcgagcgccgggtggcacctgtcagaggtgtgactgcagcccgcgaggc
tctgtgcacggggactgcgaccgcaggtccgggcagtgcgtctgcaggctgggcgccacg
gggctccgatgcgaggaatgcgaaccgaggcatattctactggaaagcgattgcgtttct
tgtgatgatgagtgtgtaggcgtgctgctgaatgacttagggaatattgacgacaccatc
ctctctgtgaacctcactggcattatccctgtcccacatggaattttgtcaaccctggaa
aatacaacgaaatctctccgggaagcattattaaaagaaaatccacagaaggagctggca
aaagatcagcttgaaggtgttgcagaacaaacagaccatctgctaagggagctcgctaga
gtgttaacaagtagccagaatgtaaccagggcgactgaaggaatcctcaacaagagtcaa
gacctcatgacgtttactgagaagctgcagataaatattcaagaaattattgaaaaagca
gcaactctaaatcagaccttggatgaaggcttccagctacccagctccactcttcagaat
atgcaaaggaatattacatctttgctggaaatcatacagaaaaggcatttcaagcagtta
caccaaaatgccatacaggaactcaaggctgctgaagatttattgtcacaaattcagaag
aattaccggaagccacagaaagagctggaggtcttaaaagcagcagcaagcagcctcctt
tcaaaacacaacagtgagctgcgggcggcgggagacctcgcgaggggggcagaggcggag
gcccgggaaagcgaccgcctgctgcgcattgcccacgccaacctgcaagaattccacgaa
aagaagctacgtgttcaagaagaacaaaacttgacctcgacgctcattgccagagggaga
agactgctagctgccgtcacagcccgtgcagaggcgacacagaatgttctggcacagtta
gagcgccaccgcgatgagctccttctgtggactgccaaaatcaggcaccacgtagatggt
ctggtcatgcagatgtcccaaagaagagcactggaccttgtctacagagcagaggaccac
gcagctgagctccagagactagcaggtgccctggacagtagccttggcggcgttagacac
gtgtccctgaatgccaccagcacaagccacgtccattccaacatttggagcctcattgaa
gaatcagagaaagcggcaaaagatgctctcgagacggtgactacagcgagcgtgttctca
gaatcccttgtttctaatggaaaagtggctctccagcgcagttccagatttttaaaagaa
ggcaacagcctgagcagaaagtatcgagatatcacattgaaactgagtgaattgaaaaat
gcagcaaacagatttcaagagaatgctgatacaattactaggcgggccaatgaatcactc
tcaatactcagagcaattcctggagtaagcccgacaagatgcctcacagctcccgaggtc
cgtggattcacctgtgtcagagacaaaggaaccaaagtcaaagagttggccacagctgca
aatcagagcgcggcaagtactctaaaggacgtcgtgggattgagccagaagctgttgaat
acatccactgacctctccagggttaacgccaccttacaagaaaccaacaaacttctacag
gactcctcaatgaccaccctgctagccggaagaagagtgaaagacgcagaagcacaagcc
agccttttatttgatcggttgaaacctttgaagatattagaagaaaacctgaacagaaac
ctgtcagaaatcaaactgctcatcagccaggcccggaagcaggcagcttctattaaagtt
gccgtgtctgcagacagagattgcatccgggcctaccagcctccaatttcttccactaac
tataacacggtaacgctgaacgtgaagacaagcgagcccgataaccttctcttctacctc
gggagcagcaccagttctgatttcttggcagtggagatgcggcgagggaaggtggccttt
ctctgggatatgggctccgggtccacgcggttggaatttccagacttcccaattgacgac
aacaaatggcacagtatctatgtaaccagatttggaaacatcggttcattgagtgtaaag
gaaatgagcgcaactcagaagccaccaacaaaaacaagtaaatcccctggaacagcgaac
attctggatataaacaactcaacgctcatgtttgttggagggcttggaggacaggtcaag
aagtctcccgctgtgaaggttactcattttaaaggctgcatgggagaggcctccctgaac
ggacagtccatcggcctgtggaactacattgaacgggagggcaagtgtcacggctgcttt
ggaagcccccagaatgaagactcttccttccattttgacgggagtgggtattcggttgtg
gagagaacgctccgggctgcggggactcatataattatgctgtttagtaccttttcaccc
aatgggcttcttctctacctcgcttcaaatggcactaaagactttttgtccatcgagctg
gtgcgtggcagggtcagagtcacagttgacctgggttcggggcctcttgcccttatcaca
gacagacgctataacagcggaacctggtacaaaatcgccttccagagaaaccgaaagcaa
ggaatcctagcagttattgatgcgtataacaccagctacagagaaaccaagcaaggtgaa
actccaggagcatcttctgacctcaatcgtctagataaagatccaatttatgtgggtgga
ttacctaggtcaagagctgtgaggaaaggtgtcaccagcaaaagctatgtgggctgtatc
aaaaacctggaaatatccagatcgacctttgatttactcagaaattcctatggagtgaga
aaaggctgtatactggagcctatccgaagtgttagcttcttgaaaggcggctacgttgaa
ctgtcacccaagtctttgtcaccagaatcagaattgttggcgacgtttgccaccaagaac
agcagtggcatcatcctggctgccctgggcaggcagggggagaagcagggtcatctgcag
gcccatgggcaaccttcccagcccttcttttccatcatgctgattgaaggccacattgaa
gtgcacgttaactctggggatgggacaagcctgagaagagctctcctgcgtgctcccaag
ggcacatatggcgacggacgagagcattccatttccttgataaggaccgggagagttatc
actgtccaaatggatgagatgaatcctgtagaaatgaagttgggcccctcagcagaaagc
agggcaataaacgtatccaagctgtacttagggggcattccagatggggaggggacatcc
gtgctcaagatgagaagatcattccatggctgtatcaaaaacctgatctttaacatggaa
catttggatttcacgagcgcggctggcaacgaacaggtggacttggacacctgcttgctt
tcggaaaggccaaagctggctcttcatggagaggacagtgagctcccgccggagtcccag
cctttaccaagtctcgaacagtgtgctgtggacagagccccagagtatatcccccatgct
caccagtttggcctcacgcaaagcagccatttcgtgttgcctttcaatcagctggctgtc
agaaagaggctctcagttcagctaaggatccgaacatttgcctccagcggtctgatttac
tacatggctcatcagaaccaggttgattacgccacgctccagctgcacgggggccacctt
cacttcctgttcgatctcgggaaaggcagaacaaaggtcacccaccctgcactgatcagt
gacggcaggtggcacacggtcaagacagagcactttaaaagaaagggcttcatgacggtt
gatggccaggaatcccccacggtgaccgctgtgggggatggtaccatgttggatgtggaa
ggaaagctgtacctgggaggccttccctcggagtacagggccaggaacattggaaatatt
acccacagtgtccccgcctgcattggggaggtgacagtgaatggcaaacagctggacaag
gacaacccagtgtctgcatttgcagtaaccaggtgttatgcagtggcccaggaaggaact
ttctttgatggaagtggatatgcagctcttgtcaaggaaggctacaaagtccaatcagat
gtaaacatcaccctggaattccgtaccacctctgagaacggagtcctcctggggatcagc
agcgccaaagtagatgccattggattagagatggtaaatggcaagatcttgtttcatgtt
aacaatggtgccggtaggataacagccacatacaagcccaaagctaccactactctctgc
gatggaaaatggcacacgcttcaagctcagaagagcaaacaccgcttggttctgactgtt
gatgggaatgcagttcgtgctgagagtccacacacccagtccacctcggcagacaccagc
aatcctatttatgtcggtggctatcctgctgacgtaaagcaaaactgcctgagcagccag
acctccttccgggggtgtttgagaaagctcactctgattaagggcccacaaatacagtcc
tatgacttcagcagagcttttgacctacagggagttttccctcattcctgtcctgggtct
gaaccctga

KEGG   Ursus arctos horribilis: 113253294
Entry
113253294         CDS       T05909                                 

Gene name
LAMA3
Definition
(RefSeq) laminin subunit alpha-3 isoform X1
  KO
K06240  laminin, alpha 3/5
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113253294 (LAMA3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113253294 (LAMA3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113253294 (LAMA3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113253294 (LAMA3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113253294 (LAMA3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113253294 (LAMA3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113253294 (LAMA3)
   05145 Toxoplasmosis
    113253294 (LAMA3)
SSDB
Motif
Pfam: Laminin_EGF Laminin_G_2 Laminin_I Laminin_N Laminin_G_1 Laminin_II Laminin_B Laminin_G_3 Baculo_PEP_C
Other DBs
NCBI-GeneID: 113253294
NCBI-ProteinID: XP_026351891
UniProt: A0A3Q7UTJ1
LinkDB
Position
Unknown
AA seq 3350 aa
MAAAARAPGWASRPVLLLLLLLLPPPAVSPAREPRPAVRLSLHPPYFNLAEAARIWATAT
CGEREPGGARPRPELYCKLVGGPTAPGGGHTIQGQFCDYCNSEDPRKAHPVTNAIDGSER
WWQSPPLSSGMRYNRVNVTLDLGQLFHVAYLIIKFANSPRPDLWVLERSVDFGSTYSPWQ
YFAHSKVDCLEQFGQEANMAITQDDDVLCTTEYSRIVPLENGEVVVSLINGRPGAKNFTF
SHTLREFTKATNIRLCFLRTNTLLGHLISKAQRDPTVTRRYYYSIKDISIGGRCVCNGHA
QECNANNPEKSFRCECQHHTCGKTCDRCCAGYNQRRWRPATWEQSNECEACNCHGHALDC
YYDPDVERQQASLNIHGIYAGGGVCINCQHNTAGINCENCAKGYYRPYGVPVDAPHGCIP
CSCNPEHADACEPGSGRCTCKPNFRGDNCEKCAVGYYNFPFCLRIPIFPISTPRPEDPVA
GDIKGTVPAPWKLPVLGKTQAGCDCDLEGVLPEICDAYGRCLCRPGVEGPRCDACRLGFY
SFPICQACQCSVLGSYQTPCNPVTGQCDCFPGITGQRCDRCLSGAYDFPHCQGSSSACDP
AGTLDSSLGYCQCKAHVESPSCSICKPLYWNLAKENPSGCSECQCHVAGTVSGIGECGQR
DGDCHCKSHVSGDSCDTCEDGYFALEKSNYFGCQGCQCDIGGAITPLCSGPSGGCRCREH
VVGKACQRPENNYYFPDLHHMKYEIEDGTTPNGRELRFGFDPLEFPEFSWRGYAQMTSVQ
NEVRILLNVGKSSQSLFRVILKYINPGTEAVSGRVTIYPSWAEAGAAQSKEVIFPPSKEP
AFVTVPGNGLADPFSLAPGTWIACIKSEGVLLDYLVLLPRDYYVASSLQLPVTQPCADSG
PPRENCLLYQHLPVTRFPCALACEARHFLLDGEPRPLAVRQPTPAHPVMADLSGTEVELH
LRLRVPQVGNYVVVVEYATEVDQLSVVDVHMESPGSVLEGQVNIYSCKYSVLCRSVVTDG
RSRLAVYELLADAGVRLKAHSARFLLHQICIIPIEEFSTEYLKPQVKCIASYGGYINQSA
SCVSLVPETPPTALILEVPRGESSPFLPQDPLPSAEVVTGVTLKAPQNQVTLRGLVPRLG
RYVIVIHFYQPAHPTFPTQVFVDGGRLWPGVFRASFCPHTLGCRDQVIAEDQVEFDISKP
EVAVTVKVPEGKSLVLVRVLVVPAENYDYQILHKKSVDKSFEFITNCGGNSFYIDPQTAS
RFCKNSARSLVALYHKGALPCECHPTGAIGRHCSAEGGQCPCRPSVIGRQCTRCRTGYYG
FPHCKPCSCGPRLCEEVTGKCLCPPHTVRPQCDVCDTHSFSFHPLAGCEGCNCSRRGTVG
AVTPECNRDHGQCRCKPRVTGRQCDRCASGFYHFPECLPCHCNRDGTEPGVCDPGTGACL
CKENVEGTECNVCREGSFYLDPANPKGCTSCFCFGINNHCHSSHKRRTKFADMMGWCLET
ADGADIPVSFNPGSSSVVADLQELPSTVHSASWVAPLSYLGDKISSYGGYLTYQIKSFGL
PGDMVLLEKKPDVQLTGQHMSITYEEPSNPRPDRLHHVRVQVVEGNFRHASGGGPVSREE
LMMVLSRLEAVRLRGLYFTETQRLSLSGVGLEEASDTGSGRRAHHVEMCACPPDYMGDSC
QGCSPGYYRDNKGPYTGRCVPCDCNGHSSRCQDGSGICINCQHNTAGDHCERCKEGHYGN
AIQGTCSVCLCPHSNSFATGCVVSGGNMRCFCKPGYTGTQCERCAPGYFGNPQKLGGSCR
PCNCNSNGQLGSCHPLTGDCINQEPKDGGPGEECDDCDSCVMTLLNDLATMGDELHLVKS
QLHGLSASTSSLEQMKHLETQIKDLRNQLLTYRSAISNNGLKMDGLENELGSLNHEFETL
QEKVQVNSRKAQALHNNVDRITQSVKELDTKIKNVIQNVHILLKQISGTNGEGNNLPSGD
FSRERAEAERMMRELRNRNFGKQLTEAEAEKTEARLLLMRIKGWLEEHQGENGALVKSMR
ESLHDYEAKLGDLRAALQEAAAQAKQATGLNRENEKSLESIKTQVQEMNSLRSDFSKYLA
AADSALLQTNVLLQRMENSQQEYEKLAATLNEERHALNGKVRELSQSTGKASLVAEAEAH
AQSLQALAKQLEEIKRNTSGNELVRCAVDAATAYENILNAIKAAEDAANKATSASESALQ
TVIKEDLPRKAKTLSSDSDKLLKQAKITQKQLRQEISPALNNLQQTLKVLTAQKGLIDTN
ITAIRDDLRGIQRDDISGMIRSAKSMIRNADDITNEVLDGLGPIQTDVTRIKDTYRSTQS
EDFNKALTDADNSVKKLTNKLPDLLSKIESINQQLLPLGNISDNVDRIRELIQQARDAAN
KVAVPMRFNGKSGVEVRLPNDLEDLKGYTSLSLFLQRPESREYARTENMFVMYLGNKDAS
GDYVGMAVVDGKLTCVYNLGEQESELQVDQSVTKSETQEAVMDRVKFQRIYQFASLNYTK
KATSTKPEIPQLHEMDGGNSYTLLNLDPENVVFYVGGYPSDFRLPARLRFPPYKGCIELD
DLNENVLSLYNFKKTFNLNTTEVEPCRRRKEESDKNYFEGTGYARVPTQPKAPIPNFAQM
IQTTVDRGLLFFAENKDHFISLNIEDGKLLVRYKLNSEPPKEKEVTQVVNNGKDHSIQIQ
IGKTRKRMRINVDSANSIIDGDIFDFSTYYLGGIPISIRERFNISTPAFRGCMKNLKKTS
GVVRLNDTVGVTKKCSEDWKLVRSASFSRGGQLRFTNLDLPLPNEFQASLGFQTFQPSGI
LFNLQAQTRNLQVTLEDGHIELNTRDSNSPIFRSTQTYVDGSLHYVSVISDNSGFRLLID
DQTLKINQRLQDLSGSQHPLYLGGSHFEGCISNVFIRSQSESPAVLDLASKTFKRDVSLG
GCSLNPPPFLLLLRGSMKFNKSYTFNINQPLQDTPVASPRSMEMWREAQSCLPPPRAQAS
HGALRFGDRPTSYLLFTIPQELVKPRLRFAADMQTASSRGLVFYTGTKNSYMALYVSKGR
LVFTLGADGKKLKLKSKEKYSDGQWHTVVFGQDGEKGHLIVDGLRVREGSLPGNSTINLR
APVYLGSSPSGKPKSLPQNSFVGCLRNFQLDRKPLDTPSVSVGVSPCLGGSLEKGIYFSQ
EGGHVILANSVLLGPEFKLVFSIRPRSLTGVLIHIGSQPGRHLSVYMEAGKVTASVDSEA
GGILTSVTPKRSLCDGRWHSVTVTMKQHILHLELDAKNSYTAGQLPFPPASTHEPLHIGG
TPAGLKMLRLPVWESFFGCLKDIQVNHDPIPVTEAADIQGTVSLNGCPDH
NT seq 10053 nt   +upstreamnt  +downstreamnt
atggcggcggccgcgcgggctccaggctgggcatcgaggccagtcctgctgctgctgctg
ctgctgctgccgccgcccgccgtgagccccgctcgcgaaccccggcccgcggtccggctc
agcctgcacccgccctacttcaacctggccgaggcggcgaggatttgggccaccgccacc
tgcggagagcgggagcccggcggcgcgcggcctcggcccgagctctactgcaagttggtg
ggggggcccaccgccccgggcggcggccacaccatccagggccagttctgtgactactgc
aattctgaggaccccaggaaagcacatccagtcaccaacgcgattgatggatcggagcgt
tggtggcaaagccctcctctctcctcaggcatgcggtacaacagagtcaatgtcaccctg
gatctggggcagcttttccatgtggcctatcttatcatcaaatttgcaaattcccctcgc
cctgacctttgggtcttggaaagatctgtagactttggaagcacctactcaccgtggcaa
tattttgctcactctaaagtagattgtttggaacagtttgggcaggaggcaaatatggct
atcacccaagatgacgatgtactttgtactactgaatattcccggattgtacctttggaa
aatggtgaggttgtggtgtccttgataaatggtcgaccaggtgcaaaaaattttactttc
tctcacaccctgagggagtttaccaaggctacaaacatccgcctgtgttttctccgaact
aataccctccttggacacctcatctccaaagcacagagagatccaaccgtcactcggcgg
tattattacagcataaaggatatcagcattggtgggcggtgtgtttgcaatggccatgcc
caagagtgcaatgcaaacaatcctgaaaaatcgtttcggtgtgaatgccagcaccacacc
tgtgggaagacgtgcgatcgctgttgtgcggggtacaaccagaggcgctggcgacccgcc
acgtgggagcagagcaatgagtgtgaagcatgcaactgccatggccatgcccttgactgt
tactatgatccagatgttgagaggcagcaggccagtttgaatatccacggcatctacgca
ggtggaggggtctgcattaattgtcagcataacacagctggtataaactgtgaaaactgt
gccaagggttattaccgcccttatggtgttcccgttgatgccccccatggctgcatcccc
tgcagctgtaaccccgaacacgcggatgcctgtgagccgggctcaggccgctgcacctgc
aagccgaatttccgcggagacaactgtgagaagtgtgcagttggatactataacttcccc
ttttgcctgagaattcccattttccctatttctactccacgtcccgaagatccagtagct
ggagatataaaaggcacagtccctgctccctggaagctcccagtcctggggaagacacag
gcagggtgtgactgtgacttggaaggcgttctccccgaaatatgtgatgcctacggaagg
tgcctttgccgccctggggtcgagggccctcggtgtgatgcctgccgcctgggtttctat
tcattccccatttgccaagcctgccagtgttcagtccttggctcctaccagacgccctgc
aacccggtgacgggacagtgtgactgctttccggggattacaggacagcggtgtgacagg
tgtctctcaggagcctatgatttccctcactgccaaggctcgagcagtgcctgtgatcca
gccggtaccctggactccagtttgggatattgccagtgcaaggctcatgttgaaagtcct
tcttgtagcatctgcaaaccactatattggaatctggccaaagaaaaccctagtggatgt
tcagagtgccagtgccacgtggcgggaacagtgagcgggatcggagagtgtgggcagcga
gatggggactgtcactgcaagtcccacgtcagtggcgattcctgtgacacgtgtgaagat
ggatattttgctttggaaaagagcaattactttgggtgtcaagggtgtcagtgtgacatt
ggtggagcgatcacccccctgtgcagcgggccctcgggaggatgccgttgccgagagcac
gtggtggggaaggcttgccagcgacctgaaaacaactactatttcccggatttgcatcat
atgaagtatgagattgaagatggcaccacacctaatggaagagaacttcgatttggattt
gatcccctggagttccctgagtttagctggagaggatatgctcagatgacctcggtacag
aatgaagtgaggatcctgctgaatgtggggaagtccagtcagtctttgtttcgtgttatt
ctgaaatacatcaaccctggaactgaagccgtatccggtcgcgtgactatttatccatcc
tgggctgaggcaggcgctgctcaaagcaaagaagtcatcttcccgcccagtaaggagcca
gcctttgtcaccgtccccggaaatggtttggcagacccattttcacttgcaccagggaca
tggattgcttgtatcaaatcagagggagtcctcctggactacctggtgctgcttcccagg
gactactatgtggcgtcctccctgcagctgccggtcacccagccgtgcgccgactcggga
cccccgcgggagaattgcttgctttaccagcatttgccagtgacccgattcccctgtgct
ctggcttgtgaggccagacacttcctgcttgacggggagccaagacccttggcagtgagg
cagcccacccccgcacacccagtcatggcggacctcagtgggacagaggtggagctgcat
ctgcggctgcgggtcccccaggtgggcaactacgtggtcgtggtcgagtatgccacggaa
gtagaccagctgtctgtggttgatgtgcacatggagagccctgggtctgtcctggaaggc
caggtgaacatctacagctgcaagtacagcgtcctgtgccggagtgttgtgactgatggc
cggagtcgcctcgctgtgtacgagctgttagcagatgcaggcgttcggctcaaggcgcac
agcgcccgattccttctgcatcaaatttgtatcatacccatcgaagaattctcaactgaa
tatctgaaacctcaagtcaaatgcattgccagttacgggggttatattaatcaaagtgcc
tcttgtgtctccctggtccctgaaacccctccaacagcattaattttggaagtcccacgt
ggtgagtcttcccctttcctgccccaggatcctttgccttctgccgaagttgttactgga
gtcaccttgaaggcgccacagaaccaagtaaccctgcgaggactcgtaccacgcctaggc
cgatatgtcattgtcatccatttttatcaaccagcacacccaacatttcccacacaggtc
ttcgtggacggagggcggctgtggccaggtgtcttccgtgcctctttttgtccccacacg
cttggctgccgggaccaagtcattgctgaagatcaagttgagtttgacatctcaaagcca
gaggtggctgtgactgtgaaggttccagaaggaaagtccttagtattggtccgtgttcta
gtggtgcctgcggagaattacgactaccaaatacttcacaaaaaatcagtggacaagtca
tttgagtttatcaccaattgtggaggaaacagtttttatattgatccccagacggcctct
agattctgtaagaactctgccaggtccctggtggccctttaccacaagggagcactgccc
tgtgagtgccaccccaccggggccattggccgtcactgcagcgcagagggcgggcagtgc
ccgtgccggcccagtgtcattgggcggcagtgtacccgctgtcgaacgggctactacgga
ttcccacactgcaagccatgcagctgtggcccacgcctttgtgaagaggtgacggggaaa
tgcctctgccctccccacacagtcaggccccagtgtgacgtgtgtgacacccattccttc
agcttccaccccctggctggctgcgaaggctgcaactgttccaggaggggcaccgttggg
gctgtcaccccggagtgcaacagggaccacgggcagtgcaggtgcaagcccagagtcaca
gggcggcagtgtgaccggtgtgcttccgggttttaccacttccccgagtgccttccctgc
cattgcaacagggatggaaccgagccaggagtctgtgacccagggactggagcttgcctc
tgcaaggagaatgtggagggtacagaatgtaatgtgtgtcgagaaggatcgttctacttg
gacccagcgaatcccaagggttgtaccagctgcttctgttttggaataaataatcattgt
cacagttcacataaaagaagaactaagtttgcggatatgatgggctggtgcctggagaca
gcggacggagcagacatccccgtctcattcaacccgggcagcagcagtgtggttgcagac
ctccaggagctgccctccaccgtccacagtgcatcctgggttgcacctctttcctacctg
ggagacaagatttcttcatatggtggctacctcacctaccaaatcaagtcctttggcctg
cctggtgacatggttcttctggaaaagaagccggatgtgcagcttactggtcagcacatg
tccatcacctatgaggaaccaagtaacccacgaccagaccggctgcatcacgtgcgcgtg
caggtggtggagggaaacttcagacacgccagcggtggtggccctgtgtcccgggaagag
ctgatgatggtactttctagactggaagctgtgcgcctccgaggcctctacttcactgag
acacagcggctctctctgagcggggtggggctggaagaggcctctgacacaggaagtgga
cgcagagcacatcacgtggagatgtgtgcctgcccccctgactacatgggtgactcatgc
cagggttgtagccctggatactatagggataacaaaggcccctataccggacgatgtgtt
ccctgcgattgcaatggacattccagtcgatgtcaggatggctcgggaatatgtatcaac
tgccagcacaacaccgctggagaccactgtgagcgctgcaaagagggtcactacgggaat
gccatccagggaacctgtagcgtctgcctgtgtcctcattcaaacagttttgccactggc
tgtgtcgtgagcgggggaaacatgaggtgcttctgcaaacccggatacacaggcacacag
tgcgaaaggtgtgcaccaggatattttgggaatccccagaagcttggaggtagctgccga
ccatgcaattgtaacagcaatggccagttgggcagttgtcaccccctgactggagactgc
ataaaccaagagcccaaagacggcggccctggagaagaatgtgatgattgtgacagctgt
gtgatgacactcctgaatgacctggccaccatgggcgatgagctccacctggtcaagtct
cagctgcatggcctgagtgccagcactagctctctggagcagatgaagcacttggagacc
cagatcaaggacctgaggaatcagttactcacctaccgctctgccatttcaaataacgga
ttaaaaatggatggtctagaaaacgaattgggtagtttgaatcatgaatttgaaactttg
caagaaaaggttcaagtaaattccagaaaagcacaagcattacataacaatgttgatcgg
atcacccaaagcgtgaaagagttggacacaaaaattaaaaatgtcatccagaatgtgcac
attctcttgaagcagatctctgggacaaatggggaaggaaacaacctgccttcaggggat
ttttccagagagagggctgaagccgagcgcatgatgagggagctacggaaccgcaacttt
ggaaagcaactgacagaagcggaagctgaaaaaacagaggctcggctcttgctgatgcgg
ataaagggctggctggaagagcaccaaggggagaacggtgcgctggttaagagcatgcgg
gaatccttacatgactatgaagccaaactcggtgacctccgtgccgcgctccaggaggca
gctgcccaagcaaagcaggccactggcctcaaccgagaaaacgagaagtctttggaatcc
atcaagacacaagttcaggaaatgaattccctgcggagtgatttctccaagtacctagcc
gccgcggactcggccttactacagaccaacgtgctcttgcagcggatggagaacagccag
caggaatatgaaaagttagctgccactttaaatgaagaaagacatgcgctaaatggcaaa
gtgagagaactttcccagtccaccggcaaagcgtccctggtggcggaggcagaagcgcat
gcgcagtctttgcaagcgctggcaaagcagctggaggagatcaagaggaacaccagtggg
aacgagctggtgcgctgtgcggtggatgccgccaccgcctatgaaaacatcctcaatgcc
atcaaagcggctgaggatgccgccaacaaggccaccagcgcgtccgagtctgctctgcag
acagtgataaaggaagatcttccaagaaaagcaaaaaccctgagttcggacagtgataaa
ctcttaaagcaagccaagataacacagaagcagctccggcaagagattagcccggctctc
aacaacctacagcagactctgaaagttctgacagctcagaaggggttgattgacaccaat
atcactgccatccgcgatgaccttcgtgggatacagagagatgacatcagtggtatgatc
cgtagtgcgaagagcatgatcagaaatgccgacgacatcacgaatgaggtgctagatggg
ctcggccccattcagacagatgtgacaagaattaaggacacctataggagcacacagagt
gaagacttcaacaaggctctcactgacgcggataactcagtaaagaaattaaccaacaaa
ctgcctgatcttttgagcaagattgaaagtatcaaccaacagctgttgccactgggcaac
atctctgataatgtggaccggatccgggagctaattcagcaggctagagacgctgcaaat
aaggtcgctgtccccatgaggttcaatggtaaatctggagttgaagtccggctgccaaat
gacctagaagatttgaaaggatacacatctctttctttgtttctccaaagacctgaatca
agagaatatgcgaggactgagaatatgtttgtgatgtacctcggaaataaagatgcctcc
ggggattatgtcggcatggcagttgtagacggcaagctcacgtgtgtctacaacctggga
gaacaggagtctgaactccaagtggaccagagcgtgaccaagagcgaaactcaggaggca
gttatggaccgggtgaaatttcagagaatttatcagtttgcaagtctaaattacaccaaa
aaagccacatccactaaacccgaaataccccaattgcacgaaatggatggtggaaacagc
tacacgctcctcaatctggatcctgaaaatgttgtattttatgttggaggttacccatct
gactttagacttcctgctagactgaggttccctccatacaaaggttgtatcgaattagat
gacctcaatgaaaatgttctgagcttgtacaacttcaaaaaaacgttcaatctcaacaca
actgaagtagagccttgtagaaggagaaaggaagagtcagacaaaaattattttgaaggt
acaggctatgctcgagttccgactcaaccaaaagctcccatcccaaactttgcacagatg
attcagaccactgtggacagaggactgctgttctttgcagaaaacaaggatcacttcata
tctctaaatatagaagatggcaagctcctggtgcgatacaaactgaattcagagccaccg
aaagaaaaagaagttacacaggttgtcaacaacggaaaagaccattcgatacagatccaa
atcggaaaaacccgaaaacgtatgaggataaatgtggattcggctaacagcataattgac
ggtgacatatttgatttcagcacatattatctgggtggaattccaatttcaatcagggaa
aggtttaacatttctacacctgctttccgaggctgcatgaaaaatctgaagaaaaccagc
ggtgttgttagattgaatgacactgtgggagtcacaaaaaaatgctcagaagactggaag
ctcgtgcgctctgcctcattctccaggggtggacaattgaggttcaccaatttggacctc
cctttgcccaacgaattccaggcctccttggggtttcagacctttcaacccagtggcata
ttatttaatctccaggcacagacacggaacctgcaggtcaccctggaagatggtcacatt
gaactgaacaccagggatagcaacagcccgatttttagatctacgcagacgtacgtggat
ggttccctgcattatgtatctgtaataagtgacaactctggattccggctgctcattgac
gaccagactctgaaaattaaccaaaggctacaagacttgtcgggttcccagcatcccctg
tatctgggcgggagtcacttcgagggttgtatcagcaacgttttcatccggagtcaatca
gagagtcctgcagtcctggacttggccagtaaaactttcaagagagacgtgtccctggga
ggctgcagtctaaacccaccacctttcctcctgttgcttagaggttctatgaagtttaac
aaatcctacactttcaatatcaaccagccattgcaggacacaccagtggcctccccacgg
agcatggagatgtggcgagaggcccagtcctgcctaccacctccccgggcccaggccagt
cacggagccctcaggtttggggacaggcccaccagctacctgctattcacgattccccag
gagctggtgaaacccaggttacggtttgctgcggacatgcagacggcttcttccagaggg
cttgtgttctacacgggcacgaagaactcctatatggctctttatgtctcaaagggacgg
ctggtcttcaccctgggggcagatgggaagaagctgaaactcaaaagcaaggagaagtac
agtgatgggcagtggcacacggtggtgtttgggcaagatggagagaagggacacttgatt
gtggatggtctgagggtccgggagggaagtttgcctggaaattctaccatcaacctcaga
gcaccagtttacctgggatcgtctccctcagggaagccaaagagcctcccccaaaacagc
ttcgtgggatgcctgaggaactttcagttggatcggaaacccctggacaccccttctgtg
agcgttggtgtgtccccctgcttgggtggctctttggagaaaggcatttatttctcccag
gaaggaggtcatgtcatcctagctaactctgtgctgttggggcccgaatttaagctggtc
ttcagcattcgcccaagaagcctcactggcgtcctaatacacatcgggagccagccgggg
agacatttaagtgtttacatggaggccggaaaggtcacggcctctgtggacagtgaggcg
ggggggatcttgacatcggtcacaccaaagcggtctctgtgtgatggacggtggcactca
gtgacagtcaccatgaaacaacacatcctgcacctggaactggacgccaagaatagctac
acagctggacagctccccttcccaccggccagcactcacgagccactacacattggaggt
accccagccggtttgaagatgctgaggctccctgtgtgggaatcattttttggctgtctg
aaggacattcaagtcaaccacgatcctatccctgtcactgaagccgcggacatccagggc
actgtcagtctgaatggctgtcctgaccactaa

KEGG   Ursus arctos horribilis: 113253922
Entry
113253922         CDS       T05909                                 

Definition
(RefSeq) collagen alpha-4(VI) chain-like
  KO
K06238  collagen, type VI, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113253922
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113253922
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113253922
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113253922
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113253922
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113253922
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113253922
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113253922
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113253922
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   113253922
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Kunitz_BPTI VWA_CoxE DJ-1_PfpI SRPRB
Other DBs
NCBI-GeneID: 113253922
NCBI-ProteinID: XP_026352775
UniProt: A0A3Q7UVR2
LinkDB
Position
Unknown
AA seq 2349 aa
METWKVFWEFIFLSASFGFIKSQRIVCREASVGDVVFLVDANVNPQHTRSVRNFLHIMVN
SFNVSKDAIRVGLAQYGDTPHSKFLLSTYPRKGDVLEHIQRIQFRPGGRRLGLALQFLLD
EHFQATAGSRAGQGVPQVAVVISSSPAEDRAQDAAEALKRAGVRLYTVGVKDAVLAELKE
IASSPEEKFASFVPNFSDLGSHAQKLRKQLCDTLAKATQPVDNVFPACREAVLADIVFLV
DSSTSIGPQNFQKVKNFLYSVVLGLDISSDQVRVGLAQYNDNIYPAFQLNQYPLKSVVLE
QIQNLPYRTGDTNTGSVLEFIRMHYLTEAAGSRAKNRVPQIVILLTDGESNDEVQEAANK
LKEDGVVVYVVGVNVQDIQELQKIASEPFEKYLFNIENFNTLQDFSGGILQTLCSAVEGK
IKEFTQAYADVVFLADTSQGTSPASFQWMQNFISRVVGLLDVGRNKYQIGLAQYGGQGHT
EFLLNTYQTRDEMIVHIRDHFLLRGGSRRTGKALRYLHQTFFQEAAGSRFLQGVPQYAVV
MTSGKSEDEFWDAAQTLRERGVKVMSVGVQDFDRQELEGMATPPLVYEMQGQDGVRQLMQ
DVGVVIQGTGQPRFGIASEKEPVVVCPMAIPTDLVFLIEEFSWDRQSNFQQVVNFLKTTV
SSLNVHPDGVRIGLVFYSEEPRLEFSLDAFQNPANILEYLDRLTFRRRSGRTKTGAALDF
LRNEVFIEERGSRSKHGVQQMAVVITEGFSQDQLSKSASLLRRAGVTIYAVGTHLASESK
DLENIASYPPWKHVISLESFLQLSVVGNKIKNQLCPETLDRSISISGMDQAPLEDCVHIE
KADIYFLVDGSSSINHDNFLEMKVFMNEAIKRFQIGPDRVRFGVVQYSDGTDIHFILSQY
SSVAGLKAAIDDIQQRKGGTMTGEALSSMVQVFVNTTRRDVPWYLIIITDGKSEDPVAEP
AEALRREGVIIYAIGVKNANVMELKEIAKDRTFFTPEFDSLKAIQRDVVQDICSSETCKN
RQADIIFLIDGSESISPEDFEKMKGFVKRMVNQADISADEIQIGLLQFSSTPQEEFRLDQ
YSSKVDIHKAITNVQQMNDGTRTGKALNFTRHFFDSSRGGRPNVQQYLIVITDGVAQDDV
VMPAKALRDRNIVIFAIGVGEAKNAQLLQITDDPQKVYYEENFESLQNLEKKILLKVCIP
QGCNIDLSVGLDISTPRSPVQQKLQQLLPEVMQQLALLSNISCSAGGGLNLMFRFLVPGS
NGQLIFDSGFKKYSDDIIQTFLIHQTATSNYMDAEFLKSLGDRAIQLSSANVKVLLVFTD
GLDDNFQRLKLTSELLRSKGLSGLLFVGLEGVHKLEELQELEFGRGFIYNQPLSTTLQSL
PSILLKQLDTIVERRCCNIYMKCFGKDGYQGDGGNPGSKGKPGADGLPGHPGEEGGYGER
GPQGPPGPRGEEGCPGVRGPKGARGFSGEKGSPGDEGVDGLVGEQGNHGAPGSSGEKGNR
GNRGLPGPPGPPGEHGEPGLRGDPGDPGADSYIQGPKGEKGRPGHQGRSSFDGPQGEPGN
VGPQGSRGRRGVPGLKGARGESGEQGYQGELGYPGSQGLRGRQGPPGNSGQRGLPGVEGI
AGLPGPIGSKGKAGPTGMKGDVGDVGATGPRGPQGLRGQPGLLGTDGYGLPGRKGKKGEL
GFPGSPGAQGEDGDRGHRGEKGAKGIRGRRGNAGFPGLTGTPGDQGPPGPVGIKGPKGLA
DMMPCEIIAVIQENCPCSTGVSKCPAVPSEVVFALDMSNDVSQADFERMRNILLSLLKRM
DISESNCPTGARVAVVSYSAKTDYLIRFSDHKWKPALLQAVRTIPLQGSSGSRNLGEAMR
FVARHVFKRVRSGLLVRKVAVFFQAGWAGDPDAFSTATLELSALDITSAIVTFTEDHNLP
NALLMDGTNGFHLFVWETERQQDLEHMAHCTLCYDKCRPDPECEQSAPGPLAGDMDVAFV
VDSSHSVSADVYRTALSLVDAALDDLEVAAQPSVSPRGARAALVMHTTPDFRPGAGRSPV
LEGFHLTSYGQKIQMRRFLREASSRPLRGAPALGHALEWTLEKVLLAAPLPRPVRVLFAI
VASETSSWDQEKLRALSLEAKCKGITLFVLALGPGVGTHELAELEGVASLPTAQHLLHLE
GISGQEVAYAQGFTRAFLNLLKSGINQYPSPELIEKCGGPNRGDTLLQSFRPIRRLPKRQ
FGPSGFADELEALEVTDVSLEEKRKATMKSVTQQEALENYEKNGYGAGGNGQERPARRKQ
TGKERNSGTAYGPCSMDPMAGDCQDYTLKWYYDKEKQACRQFWYGSCGGNANRFETKEEC
EARCVPTPL
NT seq 7050 nt   +upstreamnt  +downstreamnt
atggagacttggaaggtgttttgggagttcatctttctgtcagctagttttggcttcatc
aagtcacagagaattgtctgcagggaggcttctgtgggagatgtcgtgtttctagtggac
gccaacgtcaacccccaacacacccgcagcgtgcggaacttcttgcacatcatggtaaac
agcttcaatgtcagcaaagacgccatccgcgtggggctggctcagtacggcgacacgccc
cattccaagttcctactttccacctacccccgcaaaggcgatgtgttggaacacattcaa
aggattcagtttaggcctgggggccgcaggttgggcctggccctgcagttccttctagat
gagcacttccaggcaacagcggggagtcgggcgggccagggcgtgcctcaggtggccgtg
gtgatcagcagcagccccgcggaggaccgcgcgcaggacgccgccgaggccctgaagagg
gcgggcgtccgactctacaccgttggtgtcaaagacgcagttttggcggagctcaaggag
attgcgagcagcccggaggagaagttcgcctcttttgttcccaacttctccgatctgggc
agtcatgctcagaagctgcggaaacagctctgtgacactttggcaaaggcgactcaacct
gttgacaacgtctttccagcttgcagagaggcagtcctggcagacattgtcttcctagta
gacagctcaaccagcattggaccccagaacttccagaaagtgaagaacttcctttactct
gtcgtcttggggcttgacatcagcagtgaccaggtccgagtgggacttgctcagtataat
gacaatatctacccagcctttcagctgaaccagtaccctctgaagagtgtcgtcctggag
cagatccagaatcttccctaccgtacaggagacacaaacacagggagtgtcctggagttt
atcaggatgcactacttgactgaggcagcaggcagccgggccaagaacagggttcctcag
atagttatcctgctgacagatggggagtcaaatgatgaagtccaggaggcagctaacaag
ttaaaagaagatggagtggttgtttatgtggtaggggtcaatgtccaggacatccaggaa
ttgcaaaaaatagccagtgagccatttgagaagtatctcttcaacattgaaaacttcaac
acccttcaggatttctcaggaggcattcttcagactctgtgctcagcagtggagggtaag
ataaaagaattcacccaggcttatgcagatgtggtctttcttgctgacacctcccagggc
acatcaccggccagtttccagtggatgcagaatttcatctccagggtggttggcctgctg
gatgttggcaggaacaagtaccagattgggctggctcagtacggtggtcaaggtcacacg
gaatttttgctcaatacctaccagacccgggatgagatgattgttcacatccgtgaccac
tttttgctccggggtggctccaggagaacgggcaaagctctgcgataccttcatcagacc
ttcttccaggaggcagcgggaagccggtttctccagggtgttccccaatatgcagtggtc
atgacctcaggcaaatccgaggatgagttctgggatgccgcacagacattgagggagcga
ggcgtgaaagtcatgtctgtgggtgtacaggactttgacagacaagaactggaggggatg
gcaactccaccccttgtatatgagatgcaaggacaagatggagtcagacagttgatgcag
gatgtgggtgtggtgatccaagggactgggcagccccggtttgggattgcgtctgagaaa
gaacctgtagtagtatgtccgatggctatcccaactgatttggtctttctcattgaagaa
ttcagctgggataggcaatcaaatttccaacaagttgtcaacttcttaaagaccactgtc
agctctctaaacgtacatccagatggtgtgagaattggcttggtcttttacagtgaggaa
ccacgactggagttttccctggatgcatttcagaacccagccaatatcttggagtatttg
gacagattaaccttccggagaagaagtggaaggaccaagactggagctgctttggatttc
ctaaggaatgaggttttcattgaggagaggggcagccggtccaagcatggtgtgcagcag
atggccgtggtcatcacggaaggcttctcccaagaccagttgtccaagtcagcttccctc
ctccgcagggcaggggttaccatctatgcggtgggcacccaccttgcctcagagagtaag
gacctggagaatatagcatcatatcctccttggaagcatgtcatctcactggaatccttt
ctgcaactctctgttgtgggaaacaagattaagaaccaactctgccctgagaccttggac
agaagtatttccatttctgggatggaccaggctccactagaagactgtgtgcacattgag
aaggccgatatttacttccttgtcgatgggtctagcagcatcaaccatgacaattttctc
gaaatgaaggtgttcatgaatgaggcgataaagaggttccaaattgggcctgacagagta
cggtttggagtcgttcaatactcggatggaactgatattcactttatcctcagccagtat
tccagtgtggcagggctgaaggcagccattgatgacatccagcagaggaaaggtggcacc
atgaccggtgaggccttgagcagcatggttcaggtctttgtgaacaccactcgcagggat
gtgccttggtatctcataatcatcactgacggtaaatctgaggacccggtggctgagcct
gcagaggcactgaggagagaaggagtcatcatttatgctattggagtaaaaaatgctaat
gttatggagctcaaggagatagctaaagacaggacgtttttcacgcctgagtttgattcc
ttgaaggccatccaacgagatgtggtacaggacatctgctcctcagagacctgtaagaat
aggcaagctgatatcatcttcctcatagacggttcagaatccatctccccagaagacttt
gaaaagatgaagggattcgtgaagaggatggtgaaccaagctgatattagtgctgatgag
attcagattggccttttgcagttcagctcaaccccccaggaagaattcagacttgaccaa
tactcctcaaaggtggacatccacaaagccatcacaaatgttcagcagatgaatgatggc
acccgcactgggaaagccctgaatttcactcggcatttttttgacagttcaagaggaggg
agacccaatgttcaacagtatttgattgtgatcactgatggggttgcccaggatgatgta
gtcatgccagccaaggctctcagggataggaacatcgttatttttgccattggggtggga
gaagccaaaaatgctcagcttttgcagattactgatgacccgcagaaagtgtactatgaa
gagaattttgagtccctgcagaacttggagaagaaaattcttcttaaggtctgcattcca
caaggatgcaacatagatttgtctgtaggacttgatatttccactcccagaagtccagtt
cagcagaagcttcaacagttactgccagaggtgatgcaacagttggccttgctttccaac
atcagctgtagtgctggtggtggcctcaacttgatgttccgcttcttggtccctggctca
aatggccagcttatctttgactcgggctttaaaaagtatagtgatgatattattcagaca
ttcttaattcatcagactgccacgagcaactatatggatgcggaatttttgaagtccctg
ggagatcgtgctatccaactgtcttctgctaatgtgaaggtccttttagtgtttacagat
ggactggatgataatttccagagactgaagttaacatctgagcttctccgcagcaaagga
ttatctgggctcctctttgttggcctggaaggtgtgcataaattagaagagctccaggag
ctagaatttggtagaggattcatatataatcaacctctgagcaccacactgcaatccctc
ccaagcatcttactgaagcaacttgacacaattgtggagagaagatgctgcaatatatat
atgaagtgttttggaaaagatgggtaccagggtgatggtgggaaccctgggagcaaggga
aagcctggtgctgatggattacctggtcatcctggtgaagaaggtggatatggagaaaga
ggcccccaaggccctcctggaccccgaggtgaggaaggatgtccaggtgtgagaggacct
aagggagcaagaggattttcaggagaaaagggaagccctggtgatgaaggtgttgatggc
ttggttggggaacagggtaatcatggagccccggggtcatctggagaaaaaggaaatagg
ggaaatcggggcttgccgggaccacctggacctcctggagaacacggagagcctgggtta
aggggagaccctggggatcctggagcagatagttacatccaaggccctaagggagaaaaa
ggaaggcctggacatcagggacgttctagttttgatggacctcagggagaacctggaaat
gttggccctcaggggtcaagaggaagacgaggtgtgccagggctgaagggtgcgcgtgga
gaatccggtgaacagggttaccaaggagagcttggatacccaggctcacagggactgaga
ggaaggcaaggaccaccaggaaattctggacaaagaggcttaccgggtgttgaggggatt
gctgggcttcccggaccaattggttccaaaggaaaagctggaccaacaggaatgaagggg
gatgttggtgatgtaggagccacaggcccgcggggtccacaaggactaagagggcaacct
ggccttttaggtacagatggatatggacttccaggaagaaaaggaaaaaagggtgaactt
ggatttcctggctcccctggtgcacaaggagaagatggtgaccggggccaccgaggagag
aagggggcaaagggaatcagagggcggaggggtaatgctggctttcctggacttactgga
actccaggtgaccaaggccctccaggaccagtgggcatcaagggccccaaaggtttggca
gatatgatgccttgtgaaatcattgctgtcatacaagaaaactgcccttgttcaacaggt
gtttccaaatgcccagcggtccctagcgaagtggtctttgccttggacatgtcaaatgac
gtctcccaggcggatttcgagagaatgagaaacattctattatctctgttgaagaggatg
gacataagtgagagtaactgcccaacaggtgccagagtggctgtcgtttcatacagcgcc
aaaacagactacctgattcgcttctcagaccacaagtggaagcccgcgcttctgcaggcg
gtcaggacaatcccgctgcaagggtcgtctggcagcaggaacctcggggaggccatgagg
tttgtggcaagacatgtattcaaacgtgtgcgctctggcctgctcgtgaggaaagtggct
gtgttcttccaggcgggctgggctggcgatccggatgccttcagcaccgccactctggag
ctcagcgcgctggacatcacctctgcgatcgtcaccttcacagaggaccacaacctcccg
aacgccctgctgatggatggaaccaacggatttcacctgttcgtctgggagaccgagagg
cagcaggatctggagcacatggcccactgcactctctgctacgacaagtgcagaccagac
ccagagtgcgagcagagcgcgcccggacccctggcaggggacatggacgtggcattcgtg
gtggacagctcccacagcgttagtgctgacgtgtaccgcaccgccctgagtctagtggat
gctgcgctggacgacctggaggtggctgcgcagccgagcgtgtccccccgtggggcgcgc
gctgcgctggtgatgcatacaactccagacttccggccaggtgcggggcgctcccctgtg
ctggagggcttccacctgacctcctacggccaaaagatacagatgcggaggttcctccgc
gaggcttccagccgtcctctgcggggagctccggccctgggacacgccctggagtggacg
ctggagaaggtgctcctggcagcccctctgccccggccggtgcgggtcctcttcgccatc
gttgccagtgagaccagcagctgggaccaggagaagctgagggccctgtccctggaggcc
aagtgcaagggcatcactctgtttgtgctggccttgggcccgggtgtgggtacccacgag
ctggccgagctggagggtgtggccagcctccccactgcgcagcacctgctgcatctggag
ggaatctcaggccaggaagtagcctatgcccagggattcacacgggccttcctgaacctt
ctaaaaagtggaataaaccagtacccatcccccgagctcattgagaaatgtgggggtcca
aaccgaggggacactctgctgcaatcatttaggcctatcaggaggttgcccaagcgccag
tttggcccatctggctttgcggatgagctggaagcacttgaagtgacagacgtttcgcta
gaggagaagagaaaagccacaatgaaatctgtaactcagcaagaagcccttgaaaattat
gaaaagaatggatatggtgctggaggaaatggacaagaaaggcctgccagacgaaaacaa
accggaaaagaaagaaattcaggcactgcctatggtccttgttccatggatcccatggca
ggggattgccaggattacaccctgaaatggtactatgacaaggagaagcaggcctgccga
cagttctggtatggcagctgcgggggcaacgcgaaccggtttgaaaccaaggaagagtgt
gaggcgcggtgtgtcccaacacccctgtag

KEGG   Ursus arctos horribilis: 113253923
Entry
113253923         CDS       T05909                                 

Gene name
COL6A5
Definition
(RefSeq) collagen alpha-5(VI) chain isoform X1
  KO
K06238  collagen, type VI, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113253923 (COL6A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113253923 (COL6A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113253923 (COL6A5)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113253923 (COL6A5)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113253923 (COL6A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113253923 (COL6A5)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113253923 (COL6A5)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113253923 (COL6A5)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113253923 (COL6A5)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   113253923 (COL6A5)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 VWA_CoxE Integrin_beta
Other DBs
NCBI-GeneID: 113253923
NCBI-ProteinID: XP_026352778
UniProt: A0A3Q7VV21
LinkDB
Position
Unknown
AA seq 2610 aa
MKSLLIIFILTLWTETLADQSPEPGPEYADVVFLVDSSDHLGTKSFPLVKAFIHKTVSGL
PVDAHKYRVALAQYSDRLHSEFQLATFKSRNPMLNHLKKNFGFLGGSLRTGHALREAHRT
FFSAPAGGRDKKQFPPILVVLASAQSEDDVEEASKALREDGVRIVSVGLQSASEEELKAM
ATSRFHFNLRSARDLDAFSQNMTQIIKEATQYRDGAAPHSVDVPFPVACQKDSLADLIFL
VDESVGTKQNLRNLQNFLRNIAASMDVRYNCTRLGLMSYSDRTNTISLLNSSTTQYEFQE
QIQKLSVQAGKSNAGAAIEKMRLEAFSESSGSRRAQGVPQIAVLVTHRPSTDEVRDVAIQ
LRLQDVTVFGMNIQGADETQLEEIASYPPRQMVSMLKSYADLETYSKNFQKKLRNEIWSQ
ISTRAEQMDLDRTGCIDTKEADIYFLIDGSTSIQGKHFEQIKEFMLAVTGMFSIGPDKVQ
VGAVQYSHKMRVEFYINDNSNNVNLKNAILNIEQLQGNTYTGEALNFTLSIIKEDKKHRT
SQVPCYLIVLTDGRSADDVLEPAERLRAEQVIIHAVGIGEANKVQLQQIAGGEERVSFGQ
NFDSLRSIKNEVVHRICTEKGCEDMKADIMFLVDSSGSIGHENFGKMKTFMKSLLAKIQI
GPDRTHIGVIQFSDKTREEFQLNKYFTQNEISDAIDRMSLIDENTLTGNALISVDQYFTP
ARGARIGVKKFLILITDGEAQDAVRDPAKALRDKGVVIFSVGVYGANRTQLEEISGDGNL
VFQVENFDDLKTIESKLIFRVCALHDCKNIKVLDIVFVLDHSGSINTQEQENMMALTIHL
VKKADVGSDRVRIGALKYSDYPEILFHLGKYSNRSSVIEHLRRRRDTGGNTYTARALDHT
NIMFTEEYGSRIQQNVKQMLIVITDGASHDRTLLNETALKLRNRGIDIYAVGVGPADQLE
LEAMAGNKSKTFHVDNFNKLEDIYLPLQESMCINAHEWCDIQEADVVFFCDGSDMVSNSE
FVTMTTFVSELIDHFNIQNQKIKIGLAQFGSGFQKIIELKNSLTIPELKTQIQSIPKSKG
FPRMDLALKKVKAMFDPSSGGRRNAGVPQTLVVITSGDPYYDVADEVKILRNLGICVLVW
GIGNIHKEQLLPITGHSEKIVTLQDFNKLMNVDVKKRMVREICQSCGKTNCFMDIVVGFD
ISTHLRGQPLFHGHPRLRSYLPGILEDISSIGRVSCGAGSDVQVNVAFKVNNDREFPAKF
QIYQKAIFDSLLQVTVNGPSHLNAQFLQSLWDTFKDQSKSQGQVLLIFSDGLGGESEIML
ENQSDRLREAGLDALLVVSLNTTAHDEFSSFEFGKGFDYMTHLTIGMRELGKMLSQYLGN
IAEKTCCCTLCKCSGTPGPYGSRGLQAAKGSRGLKGSRGHRGEDGDPGRRGDIGPPGDKG
IAGCPGEQGQKGAKGFSGSKGEQGEDGIDGLDGEEGFHGPSGEKGEKGDPGSQGSPGSRG
PPGGYGEKGFPGDPGNPGQNSNIKGQKGSKGEQGRQGRTGQKGTPGNPNSRGNRGREGQR
GPPGASGEPGNRGPEGTQGADGLQGPQGLNGIPGRKGEKGGEGHKGPQGSSGPVGAKGNV
GSPGPSGKKGEPGILGDPGPMGQAGQRGRQGDYGIPGYGPMGRKGIKGPRGFPGDMGQKG
AVGDPGIPGGPGPKGFRGLTRTVGLKGEEGPPGPPGPPGRRGMKGMAGKPVYSQCDLIQF
MRDHSPCWKGTCPVYPTELVFALDQSYGISEQRFNEMREIVTSIVNDLNIKESNCPVGAR
VVVVSYDSGTSYLIRRSDYRNKKQLLQLISQIKYRPPTEARDVGNAMRFVARNVFKRTYA
GANVRKVAVFFSNGQAASRSSIITATMEFSALDISPAVFALNERIFLEAFEFDNTGTFQV
IPVSPNGEYEPLEKFQHCTLCYDKCFPNECIKEIFLPEDAYMDVAFLLDNSQTIANDEFK
AVKTLVSSMIDNFKIASDPGISNFGDRIALLSYSPWDRSRKKKGVVKTEFGFTTFNDGVL
MKRHIQSSLRQLEGETTIGHTLLWVVENLFPQTPNLRKHKVLFVISAGEYHERKEFLKKV
ALRAKCQGYVIVVISLGSTYKEDMEELASYPLDHHLIQLGRIHKPDLDYITKFIRPFVYS
VRRGFNEYPPPVLENDCRLISRGEAYQNSDPLLTPEPHEISSGENSFIGQELSAGKDSSF
LWEDNGSAHLVYVPSRVLTPQELTITYAQDWDSEEIASLTSGHENHGRKEEPGLTNEPGD
TSLQEYYMDVAFLIDASQRIGNDEFKEVKAFLISVLDYFHIAPDPLTSMLGDRVAVLSYS
PPGYMPNSEECPVYLEFDLVTYNSIHQMKHHLQDSFQQLNGDVFIGHALQWTIENVFVGT
PNLRKNKVIFVISAGETNPLDKEVLRNVALRAKCQGYSIFVFSFGPIHNDKELEELASHP
LDHHLVQLGRTHKPDLNYITKFVKPFVHLIRRAINKYPPADLRPKCVNITSPNPENVGSG
NNVFLIPEVYKIETGNSELFDEFGSQEQHSLVLGNNPSNGSETTTDLIQKLYMLFSTGEL
VMNDKEETHSEEMPALADGKLDKKDGEDSR
NT seq 7833 nt   +upstreamnt  +downstreamnt
atgaagagcctgctaattatattcatcttaaccctttggactgaaacactggcagatcag
agtccagagccgggccctgagtatgcggatgtcgtgttcctggtggacagctccgaccac
ctgggaactaagtccttccccctcgtgaaagcgttcattcacaaaacggtcagcggtctg
cccgtagacgcccacaagtaccgcgtggccctggcgcagtacagcgaccggctccacagc
gagttccagctggccacgttcaagagcaggaaccccatgctgaaccacctcaagaagaac
ttcggcttccttggcggctccctgcggaccggccacgctctccgggaggcgcacaggact
ttcttctctgcgccggccggcgggagggacaagaagcagtttcccccgattctggtggtc
ctggcctcggcccagtccgaggatgacgtggaagaggcctccaaggccctgcgggaagac
ggggtgagaatcgtctccgtggggctgcagagtgcttctgaggaggagctgaaggccatg
gccacctctcggtttcatttcaacctccggtcggccagggaccttgacgcgttttcccaa
aacatgacgcagatcatcaaggaggccacccagtacagggacggagcggcccctcacagt
gtagacgttcccttcccagtggcctgtcagaaagattcattagctgacctcatattccta
gtggatgagtcagttggcacaaaacaaaatttaaggaacctgcagaatttcctgaggaac
attgccgcctccatggacgtgaggtacaactgcacacgccttggacttatgagttacagt
gatagaacaaacactatttcccttctaaattcaagcacaacccagtatgaatttcaggag
caaatccagaagctttctgtccaggctgggaaatccaatgctggggctgccattgagaag
atgaggctagaagccttctcagagtcaagtggcagcagaagggcacagggagtgcctcag
atcgcagtcttggtcactcatagaccatcgaccgatgaggtgcgtgatgtcgcaatacaa
cttcggctgcaggatgtgactgtgttcggcatgaacatccaaggggctgacgaaacccag
ttagaagaaatagcatcttaccctccaagacaaatggtttccatgctcaaatcctatgca
gacttggaaacttacagtaaaaacttccagaaaaagctccggaatgaaatttggtcccaa
atttctactcgtgctgagcaaatggaccttgacagaactggctgtatagatacaaaagag
gctgatatctatttcctcattgatggctcaaccagcatacagggaaaacacttcgagcaa
atcaaggaatttatgttggctgtgacaggaatgttcagcattggcccagacaaagttcag
gttggagctgtacagtattcacataagatgagagtggagttttacatcaatgacaattct
aataatgtgaacttaaagaacgcgattttgaacatcgagcagctccaaggcaacacctat
accggggaggccctgaatttcacgctgtcaataataaaagaagataagaagcataggaca
agccaggttccctgttacctcattgtgctgactgatgggaggtccgcagatgacgtcctg
gagcctgctgagagattaagggctgagcaagtcatcatccatgcagttggcattggggag
gctaacaaagtacaactccaacaaattgctgggggagaagaaagggttagctttgggcag
aactttgattctttaagaagcataaagaatgaagtggttcacagaatctgcactgaaaaa
ggatgtgaagacatgaaggctgacatcatgtttctggtggacagttctggcagtatagga
catgaaaattttggaaaaatgaaaaccttcatgaaaagcctattagctaagattcagatt
ggtccagacagaactcatattggtgtcattcagttcagtgataaaactcgggaagaattc
cagcttaataaatatttcacacaaaacgaaatttctgatgcaatagacagaatgtctctc
atcgacgaaaacactttgaccggaaacgcactaatctctgtagatcaatacttcaccccc
gccaggggggcccgtattggggtgaaaaagtttcttatcctcatcacggatggagaagca
caagatgctgtgagagaccctgctaaagctcttcgggacaaaggcgtggtcatcttctct
gtgggggtgtacggagccaataggacgcagctggaggagatcagtggggatggcaacctg
gtcttccaagttgagaactttgatgatctaaagacaatagaaagcaaactcatttttcgt
gtatgtgctctccatgattgtaaaaatattaaagtgttggacattgtgtttgtgctggat
cattcaggcagcataaacacacaagagcaagaaaacatgatggctctaactatccatttg
gtgaagaaagcagatgttggcagcgaccgagttcggattggagctctcaaatactcagac
tatcctgagattcttttccaccttgggaaatactcaaacagatcctcagtcatcgagcac
ctaaggaggcgcagggacaccggagggaatacctatactgccagggctcttgaccacacg
aacataatgttcacagaggaatatggcagccgcatccagcaaaatgtgaagcagatgctg
atcgtcatcactgacggggcatcccatgaccgaactctgctcaacgagactgcattgaaa
ttaagaaacagaggcattgatatctacgcggtgggtgtaggaccggctgaccaacttgaa
cttgaggctatggcagggaataaaagcaagactttccatgtagataatttcaacaaactg
gaagatatttacctgcctctacaagaaagtatgtgtatcaatgcacatgagtggtgtgac
attcaagaagccgatgtggttttcttttgtgacggctccgacatggtctccaactcagag
tttgttaccatgacaactttcgtgtcagaattaattgatcattttaacattcagaaccag
aagataaaaatcgggttggctcaatttgggagtggcttccaaaaaattatcgagttgaaa
aactctctgactataccggagttgaagactcaaattcaaagcattcctaagagcaagggg
tttccgcgaatggacctggcccttaaaaaagtgaaagccatgttcgatccgtcttccggt
gggagaagaaacgctggtgtccctcaaactttggttgttatcacatccggggacccctac
tatgatgtggcagatgaagtgaaaatcctgagaaacctcggaatttgtgtcctggtttgg
ggcataggaaatattcataaggagcaacttctgccgataacaggccattctgaaaaaata
gtcaccctccaagactttaataaattaatgaatgtggatgtgaaaaaaagaatggttcgt
gaaatctgccagagctgcgggaaaaccaattgctttatggacatagtggtcgggtttgac
atttccactcacctgcgggggcagccgctgttccatggccaccctcggctgagatcctac
ctcccaggcatcttagaggacatcagctccatcgggagggtcagctgcggggcaggctca
gacgtgcaggtgaacgtggccttcaaggtgaacaatgaccgagaattccctgccaagttc
caaatctaccagaaagcaatatttgacagcctgctgcaggtcaccgtcaatgggccatct
catctgaatgcacagttcctgcagtcgctgtgggatacatttaaggatcaatctaaatcc
caaggacaggtgctactcatcttttcagatggtcttgggggtgaaagcgagataatgctt
gaaaatcaatcggacaggctcagagaagcaggactcgatgctctgctggtggtgtcccta
aacacgactgcccatgatgagttctccagctttgaatttggaaagggatttgattacatg
actcacctgaccattggcatgagagagctgggcaagatgctgtcacagtacctgggaaac
atcgcagagaagacttgctgctgcacactctgcaagtgttcggggactccaggtccttat
gggtcccgaggactacaagctgccaagggttctcggggtctgaaaggcagcagaggacac
cggggagaggacggagaccccggaagacgaggagacattggacccccaggagataaaggg
attgcaggatgtccaggagagcagggtcaaaagggagccaaaggattttctggaagtaag
ggagaacaaggagaggatgggattgacggactggatggggaagagggctttcatggacct
tctggggaaaagggagaaaaaggtgatccaggatctcagggaagcccaggttccagaggc
cccccagggggttatggggagaagggcttcccaggagatcctggtaatccaggacaaaac
agtaacatcaaaggacaaaagggctccaaaggagaacaaggaagacaaggtagaactgga
cagaaagggacaccaggcaatcctaattccagaggaaataggggaagggaaggccaaagg
ggacccccaggtgcctcgggggagccgggaaatcgtggacctgaaggtacccagggagcc
gacggattacaaggcccacaggggttaaatggaattcccggcaggaaaggagagaaggga
ggcgaagggcataaaggacctcagggctcttctgggccagtgggagctaaagggaacgtt
ggaagtcctgggccttcagggaaaaaaggagaacctggaattcttggagatccagggcca
atgggacaagctggacagagaggaagacagggagattacggcatcccaggctatggtcct
atgggacgaaaaggaataaagggcccaagaggattccctggagatatggggcaaaagggt
gctgttggtgatcctggaattcctgggggtcctggacccaaaggatttaggggattaaca
cgcactgtaggcctgaaaggtgaagagggaccgccaggacccccaggccctcctggacgg
agaggcatgaaaggcatggcggggaaacctgtatattcccaatgtgatctgatccagttc
atgcgggaccatagtccttgttggaaaggaacgtgtccagtgtacccaacagagctggta
tttgccctggaccagtcctacggtatctccgaacagagatttaatgaaatgagggaaatc
gtcacatccattgtcaatgaccttaacatcaaggaaagtaactgcccggtgggagcgcga
gttgtcgtagtttcctatgactcaggcacgagctacctcatccgcaggtcggactaccgt
aacaagaagcagctcctccagcttatttcccaaataaaataccgaccccccacagaagcc
cgagatgttggtaatgcaatgaggtttgtggcccggaacgttttcaagcggacatacgca
ggagccaacgtgaggaaagtcgctgtgttcttcagcaatggacaagcagccagtcggtca
tccatcatcaccgccaccatggagttcagtgccctagacatcagtccggcagtctttgct
cttaacgaaaggattttccttgaggcttttgagtttgacaacactggaacatttcaggtg
atcccagtttctccaaatggagagtatgagccattagaaaaatttcaacactgtacactt
tgctatgataaatgttttccaaatgagtgcatcaaagagatctttttacctgaagatgca
tacatggatgtagccttcctcttagacaattctcagactatagcaaatgatgagtttaag
gctgtgaaaaccttggtgagctctatgatcgacaacttcaaaattgcatcagaccctgga
atctcaaactttggcgataggattgccctgttgagctattctccttgggatagatctagg
aaaaagaagggtgtggtaaaaacagagtttggatttacaactttcaacgacggagtccta
atgaagaggcacatccagtcttccctccgacagttagaaggagaaaccacaattggccat
accctactgtgggttgtggagaacctcttcccacaaacaccgaacttgagaaaacacaaa
gtcctttttgtgatctcagctggagaatatcatgagagaaaggaattcttgaagaaggtg
gctctgagggccaaatgtcaaggctatgtcatagttgtgatttccctgggctctacgtat
aaggaggacatggaggagttagccagctacccacttgatcaccatctgatacagcttggg
agaattcataaaccagatctggattatattacgaagtttataaggccatttgtttactca
gtcagacgaggattcaatgagtacccacccccagtgcttgagaatgactgtagactcatc
tcaagaggagaggcttatcaaaatagtgatcccctgcttactcctgagccacatgagatt
tcttcaggagagaacagcttcattggccaggaattaagtgcagggaaagactcgtccttc
ctgtgggaggacaatggaagtgcccatttggtttacgtaccaagccgcgtacttacacca
caagaattaacgatcacatatgcacaagattgggattctgaagaaattgcaagtttaact
tctggacatgaaaaccatggcagaaaagaagaaccaggtcttactaatgaacctggagat
acctctcttcaagaatattacatggatgtggctttcctcatagatgcctcccaaagaata
ggaaatgatgagtttaaggaagtgaaagcttttctaatctcagtgcttgattattttcac
attgccccagacccactgacctccatgttaggagacagagtggcagtcctgagctattct
cctccaggctatatgcccaacagtgaagaatgccccgtctacctggaatttgatttggtt
acttataacagtatacaccaaatgaaacatcatctccaagactcttttcaacagctcaat
ggagatgtttttattggccatgccctgcagtggacaattgaaaatgtgtttgtaggaacc
cccaacctgaggaaaaacaaagttatctttgttatatctgctggcgaaaccaacccctta
gacaaggaagtcttaagaaatgtagctctgagagccaagtgtcagggctactccatattt
gtgttttcctttggtcctatacacaatgacaaggaattggaagaattagccagccaccca
ctggatcatcacttagtccagcttggccgaacccacaagccagatttgaactatatcacc
aagtttgtcaagccatttgttcacttaatcagacgtgccatcaacaaatatccccctgca
gatctgagacccaagtgtgttaacatcacctctcccaacccagagaacgttggctcagga
aacaatgtattccttattcctgaggtatataaaatagagacaggaaacagtgagctgttt
gatgaatttggttcccaggagcagcattcccttgtattagggaacaatcctagtaatggt
tctgagactactactgatttgatccagaagttatacatgctcttttcaaccggagaactg
gtgatgaacgataaggaagagacacattcagaagaaatgccagctctagcagatggtaaa
ctagataaaaaagatggtgaggactcaagatga

KEGG   Ursus arctos horribilis: 113253925
Entry
113253925         CDS       T05909                                 

Gene name
COL6A6
Definition
(RefSeq) collagen alpha-6(VI) chain
  KO
K06238  collagen, type VI, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113253925 (COL6A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113253925 (COL6A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113253925 (COL6A6)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113253925 (COL6A6)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113253925 (COL6A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113253925 (COL6A6)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113253925 (COL6A6)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113253925 (COL6A6)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113253925 (COL6A6)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   113253925 (COL6A6)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Integrin_beta
Other DBs
NCBI-GeneID: 113253925
NCBI-ProteinID: XP_026352784
UniProt: A0A3Q7WEJ8
LinkDB
Position
Unknown
AA seq 2267 aa
MLLILFLIIICSYVSMNKDPGPEYADVVFLVDSSDQLGTKSFPLVKAFIHKTVSGLPVDA
HKYRVALAQYSDRLHSEFQLATFKSRNPMLNHLKKNFGFLGGSLRTGHALREAHRTFFSA
PAGGRDKKQFPPILVVLASAQSEDDVEEASKALREDGVRIVSVGLQSASEEELKAMATSR
FHFNLRSARDLGAFSQNMTQIIKEATQYRDGAADDILVEVCQGPSVADVVFLLDVSVNGS
QEDFEYLKEFLEESVSALDIKEHCMRVGLVTYSNETKVINSLSRGVNKSEVLQNIQNLSP
RAGKAYTGAAIRKIRKEVFSARNGSRKNQGVPQIAVLVTHRPSEDNVTKAAVNLRREGVT
IFTMGIEGASDSQLEKIASHPAEQHMSKLKTFSDLAAHNQTFLKKLRNQITHTVSVFSER
TETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAPQKVRVGAVQYA
DSWDLEFEINKYTNKHDLGKAVENIRQMGGNTNTGAALNFTLGLLQKAKKQRGNRVPCHL
VVLTNGMSKDSILEPANRLREELVRVYAIGVKEANQTQLREIAGEEKRVYYVHDFDALKD
IRNQVVQEICAEEACKEMKADIMFLVDSSGSIGLENFIKMKTFMKNLVSKSQIGADRVQI
GVVQFSDVNKEEFQLNRYMSQNEISNAIDRMTHIGETTLTGSALTFVSQYFSPAKGARPN
VRRFLILITDGEAQDIVKDPAVALRQEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVENF
DILQHIEDDLVFGICSPREECKRIEVLDVVFVIDSSGSIDHDEYSIMKDFMVDLVKKADV
GKNQVRFGALKYADDPEVLFYLGDLGSKWEVISVLQKDQPMGGNTYTAEALGFSDHMFTE
ARGSRLQKGVPQVLIVITDGESHDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAG
SSDKYFFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFRKMKE
FLASVVQDFDVSVNRVRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQIFGYTHIGA
ALRQVGHYFRPDMGSRINAGTPQVLLVLTDGQSQDEVARAAEDLRHKGIDIYSVGIGDVD
DQQLIQITGTAEKKLTVHNFDELTKVKKRIVRNICTSGGESNCFVDVVVGFDISTHENGQ
TLLEGQSWIETYLQDLLHTISSLNGVSCEVGTETQVSVAFQVTNAMEKYSPKFEIYSENI
LNSLKDITVKGPSLLNANLLSSLWDAFQNKSAARGKVVLLFSDGLDDDIEKLEQKSDELR
KEGLNALVTVALDGAADSSDLADLLYIEFGKGFEYRTQLTIGMRDLGSRLSKLLVNVAER
TCCCLFCKCIGGDGIMGDPGPPGKKGSPGFKGSEGYLGEEGIAGDRGAGGPVGEQGTKGC
DGAKGPKGNRGLNGQEGEVGESGIGGLHGEQGESGLPGEKGEKGDEGSQGSPGKRGVSGD
RGAKGQRGDPGVPGFDNTVEGPKGLKGERGRQGRRGWPGPPGTPGSRRKTAAHGRRGHTG
PQGKTGIPGPDGLEGSWGLKGPQGPRGETGVKGEKGGLGSEGPQGPPGPGGEAGSPGHLG
SQGNKGEPGDLGEKGAIGLPGPRGLPGDDGNPGYGSTGSRGAKGQEGFPGESGPKGETGD
PGGPGETGPKGARGKTISTGPPGEMGSPGEPGPPGRKGVKGARGLASFSMCELIQYVRDH
SPGGHGEPECPVHPTELVFALDQSRDVTKQEFERMKELMSSLVRGVRVRETSCPVGARIA
ILTYTSHARHLVRFSDAYRKPQLLREIEAIPYERSSDSREIGKVMRFISRNVFKRTLPGA
HTRRIATFFSSGQSADAQTIAAATMEFSALDIIPVVIAFSNVPSVKRAFAIDDTGTFQVI
MVPSGTDYAPALERLQRCTFCYDVCKPDAFCDQAKPPPAQSYMDAAFLLDSSRHVGSAEF
EDIRHFLGALLDHFEITPEPETSVTGDRVALLSHAPPNFLPNTQKSPVRTEFNLTTYNSK
RLMKRHLEESVQQLNGDAFIGQALQWTLDNIFLNAPNLRRNKVIFVISAGETSHLDKETL
KKESLRAKCQGYALFVFSLGPAWNDKELEDLASYPLDHHLVQLGRIHKPDHRYSVKIVRA
FINSIRRAINKYPPINFKIKCSRLSSTDLKQPPRQWRSFVPGPHKTALKEDALQKAKFFQ
DKKYLPRVARGSRDDTIRNLTRNTFHAFKNGKRVIKTAPNQHDKGSA
NT seq 6804 nt   +upstreamnt  +downstreamnt
atgctgctgattttgttcctcataataatttgttcctatgtctctatgaacaaagatccc
ggccctgagtatgcggatgtcgtgttcctggtggacagctccgaccagctgggaactaag
tccttccccctcgtgaaagcgttcattcacaaaacggtcagcggtctgcccgtagacgcc
cacaagtaccgcgtggccctggcgcagtacagcgaccggctccacagcgagttccagctg
gccacgttcaagagcaggaaccccatgctgaaccacctcaagaagaacttcggcttcctt
ggcggctccctgcggaccggccacgctctccgggaggcgcacaggaccttcttctctgcg
ccggccggcgggagggacaagaagcagtttcccccgattctggtggtcctggcctcggcc
cagtccgaggatgacgtggaagaggcctccaaggccctgcgggaagacggggtgagaatc
gtctccgtggggctgcagagtgcttctgaggaggagctgaaggccatggccacctctcgg
tttcatttcaacctccggtcggccagggaccttggcgcgttttcccaaaacatgacgcag
atcatcaaggaggccacccagtacagggacggagcggccgatgatattcttgtggaagtc
tgccaaggcccgtctgtggctgacgtggtgttcctgttggatgtgtctgtcaatggcagc
caggaggactttgaatatcttaaggaattcctggaagagagtgtatctgctcttgacata
aaggaacactgcatgagggttggcctcgtgacctacagcaatgagaccaaggtgatcaat
tctctcagcaggggcgtaaataagtcagaggttctccagaatatacagaacctctctccg
cgggctgggaaggcttacacaggagctgccatcagaaagatcaggaaggaagtgttcagt
gcgcgcaacggcagtcggaagaatcagggggtgccccagattgccgtgctggtgacccac
agaccctcagaggataacgtgaccaaggcagccgttaacctccggcgcgagggcgtgacc
atcttcaccatgggcatcgagggggccagcgacagccagctggagaagatcgcctctcac
cccgccgagcagcacatgtccaaactgaaaaccttctccgacctggccgctcacaaccag
acgtttctgaagaagctgcggaaccagataacacacacagtctctgttttttcagaacgc
accgaaactctcaaatccggttgtgtggacactgaagaagcagacatttatctgctcatt
gacggctctggaagcacccaggccacagacttccacgaaatgaagaccttcctgtcagag
gtggtaggaatgttcaacatcgctccccaaaaggtgcgggttggggctgtccagtacgct
gacagctgggacttggaatttgagatcaacaaatacactaacaagcacgacttgggaaaa
gccgtcgagaacatccggcagatgggtgggaacacaaacacgggtgcagccctgaatttc
acactagggctcttgcaaaaagcaaagaaacagcgaggaaaccgagtgccctgtcatctt
gttgtcctgacgaatggcatgtccaaggatagcatcctggagccagcgaacagactgcga
gaagagctcgttcgtgtttatgccattggggtcaaggaggccaaccaaacacagctgcga
gaaattgcaggcgaggagaagagggtatactacgtgcatgacttcgatgctttgaaagac
ataaggaaccaagttgttcaagaaatctgtgctgaagaagcctgcaaagagatgaaagcg
gacatcatgtttctggtggacagttctggcagtataggactggaaaacttcatcaaaatg
aaaacattcatgaaaaacctggtgagcaaatctcagattggggcagatcgggtacaaatc
ggcgtagtccagttcagcgacgtcaataaggaggaatttcagctcaacagatatatgtcc
caaaatgaaatttcaaatgcaatagaccggatgactcacattggagaaaccaccttgacc
ggtagtgccctgacctttgtgtctcagtacttcagccctgccaagggggcccggcccaat
gtcaggaggtttctcatcctcatcactgacggtgaagctcaggacatagtcaaggaccca
gcagtcgcacttcgacaggaaggcataattatctactctgtgggggtgtttggttccaat
gtcacccagcttgaggaaatcagcgggaggccggagatggttttttatgttgagaatttt
gatattctgcagcacattgaagatgatcttgtttttggaatatgcagcccccgtgaagaa
tgcaagcggattgaagttttggatgttgtgtttgtaattgatagctctggaagcattgac
cacgatgaatatagtatcatgaaagactttatggttgacttggtgaaaaaagctgatgtc
ggcaagaatcaggtccggtttggggctctgaagtatgctgatgacccagaggtgctgttt
tatctgggtgaccttggctcaaaatgggaggtgatttcggtgctccagaaggaccagccc
atggggggcaacacttataccgcggaggcattaggcttctcagaccacatgttcactgaa
gcccggggcagccgtctgcagaaaggggtcccccaagtcctcattgtgatcaccgatggg
gagtcccatgatgcagataagctcaatgccacagccaaggccttgagggacaaaggcatt
cttgtcctggccgtggggattgctggtgccaatcctgtggagctgttagccatggcagga
tcaagtgacaagtacttcttcgtggagacatttggaggcctgaaggggatattttcagat
gtgtcagccagtgtatgtaactcttcaaaagtagattgtgaaattgaaaaagtagatctt
gttttcctcatggatggctcaaatagcattcacccagatgacttcaggaagatgaaggaa
ttcttggcatcagttgttcaagacttcgatgtcagcgtcaacagagtgcgcataggtgct
gcccagtttagccacacctatcggccggaatttccactgggaactttcgtaggcaaaaag
gagatctcatttcagattgaaaacatccagcagatctttggatacacacacattggcgct
gccctccggcaggtggggcattacttccggccagacatgggtagccggataaatgcaggt
accccgcaggtgttgctggtcctcacagacggccagtcccaggatgaggtggcccgggcc
gctgaagacctgagacacaaagggattgacatttactcggtgggcattggcgacgtggat
gatcagcagctcattcaaatcaccgggaccgcggaaaaaaaactgacggttcataacttt
gatgaactgacaaaggtcaagaaaaggatagttcgaaacatctgtacctcggggggtgag
agcaattgttttgtggacgttgtggtgggatttgacatctcaactcatgagaatgggcag
actttgcttgaaggtcaatcttggatagaaacctaccttcaagacctcttacataccatc
agctccctcaatggggtaagctgtgaggtgggtacagagacgcaagttagcgtggctttt
caagtgaccaatgccatggaaaaatattcacccaagtttgagatctatagcgaaaacata
ctgaacagcttgaaggatataacagtaaaaggaccatctcttctcaacgcaaacctcttg
agttctctatgggatgcatttcagaataaatcagctgctcgaggaaaggtggttctctta
ttttcagatggattggatgatgacattgaaaagcttgaacaaaaatctgatgaacttaga
aaagaaggcctgaatgccctcgtaactgttgctctggatggagccgctgattcaagtgac
ctggccgatcttctctacattgaattcgggaaagggtttgagtacaggacccagctgact
attggaatgagggaccttgggagccgactgtcaaaactactggttaatgttgctgaaagg
acatgctgttgtttattctgcaagtgcattggaggggatggcataatgggggatcctgga
ccaccagggaaaaagggatccccaggttttaaaggcagtgagggctatctgggagaggaa
ggcattgctggagacagaggagccggtggaccagtgggagagcaaggtactaagggatgc
gatggtgccaaaggtcctaagggaaacaggggactaaatggacaggagggagaagttgga
gaaagcggaattggaggattacatggagaacagggtgagagtggtcttcctggagaaaaa
ggagaaaagggtgatgaaggatcccagggaagcccaggaaagcgaggagtttctggtgac
cgaggtgcaaagggccagcgaggagaccctggagttcctggatttgacaataccgtagaa
ggacctaagggcttgaaaggagaacgtggaagacaaggtagaagaggctggccgggcccc
cctggaacaccaggctccagaagaaagacagcagctcatggccgaaggggacatacaggc
ccacaggggaaaacaggcatcccaggcccagatggacttgaaggctcatggggacttaag
ggccctcagggccccagaggagagactggtgtgaaaggtgaaaaaggaggtctgggaagt
gaaggtcctcaggggcctccgggaccaggaggagaagctgggagtccaggccatttggga
agccaaggaaataaaggagagcctggagatttgggagaaaaaggagcaattggtctccca
ggtcctcgaggcttgccgggtgatgatggcaacccaggttatggtagcactggaagtaga
ggagcaaaggggcaagaaggatttcctggagaaagtggaccgaagggtgagactggggac
cctggtggtccaggagaaactggacccaagggagctagaggcaagacgatatccactggg
cctccaggagagatgggatcccctggggagccaggacctcctggacgcaagggtgtgaaa
ggagccagaggattggcttcattttctatgtgtgagctcattcagtatgtacgggaccac
agccctggcggacatggagaaccagaatgcccggtgcatcccaccgaattggtgtttgcc
ttggaccagtctcgagatgtcaccaagcaggaattcgagcgcatgaaggagctgatgtct
tccctggtgaggggcgtccgggtccgggagaccagctgcccggtgggcgcgcgcatcgcc
atcctcacctatacctctcacgcccggcacctcgtccgcttctcagacgcctaccggaag
ccgcagctcctcagggagatcgaagctattccttacgagcggtcctcggacagcagggag
atcggcaaagtgatgaggttcatctccaggaacgtcttcaagcgaacgcttccaggcgcg
cacacgagaagaatcgccaccttcttcagcagcggtcagtctgccgatgcccagaccatt
gccgcggccactatggaattcagtgcccttgacatcattccggtcgtgatcgcgttcagc
aacgtgccctccgtcaagcgtgcgtttgcgattgatgacactggcacattccaagtcata
atggttccatccgggaccgactacgcgccagcgttagagagactccagcggtgcactttc
tgctatgatgtctgcaagccagacgctttttgcgatcaagccaaaccgccccctgcacag
tcttacatggatgctgcgttccttctggacagctcccggcatgtgggaagtgctgaattt
gaagacataagacactttttgggagcactgttagatcactttgagatcaccccagagcct
gagacctctgtcactggagatcgggtggccctattaagccatgctccccccaacttccta
cccaacactcagaagagtcctgttagaactgagttcaacctcaccacctacaacagtaaa
cgcctcatgaagaggcacctggaagaatcagtgcaacaactaaatggagacgctttcatt
ggtcaggccttacagtggaccctggacaatatctttttaaatgctcccaatctgagaaga
aacaaagtcatatttgtgatctctgctggggaaaccagtcacttggacaaggagacctta
aagaaagaatccttgagagccaaatgtcaaggttatgccctatttgtgttttcccttggc
cctgcttggaatgacaaggaactagaagatctagccagctaccctttggatcaccacttg
gtccagcttggccgaattcataaacctgaccacagatacagtgtgaagattgtgagagcc
tttataaactcaatcaggcgtgcaatcaacaaatacccaccaataaacttcaaaatcaag
tgcagtagactcagctctacagatctgaagcagcccccacggcagtggcgaagctttgtt
cctggaccacataaaactgccctcaaagaagatgcattacagaaggcaaagttctttcaa
gataaaaaatatcttccaagagtagcaagaggcagcagagatgatactattcgaaatctt
accagaaacacattccatgcctttaaaaatggaaaaagggtgataaaaactgctccaaac
caacatgacaaaggaagtgcttga

KEGG   Ursus arctos horribilis: 113255163
Entry
113255163         CDS       T05909                                 

Gene name
THBS4
Definition
(RefSeq) thrombospondin-4
  KO
K04659  thrombospondin 2/3/4/5
Organism
uah  Ursus arctos horribilis
Pathway
uah04145  Phagosome
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05144  Malaria
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113255163 (THBS4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113255163 (THBS4)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    113255163 (THBS4)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113255163 (THBS4)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113255163 (THBS4)
  09174 Infectious disease: parasitic
   05144 Malaria
    113255163 (THBS4)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:uah04131]
    113255163 (THBS4)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113255163 (THBS4)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113255163 (THBS4)
Membrane trafficking [BR:uah04131]
 Endocytosis
  Phagocytosis
   Opsonins
    113255163 (THBS4)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113255163 (THBS4)
  Exosomal proteins of colorectal cancer cells
   113255163 (THBS4)
  Exosomal proteins of bladder cancer cells
   113255163 (THBS4)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113255163 (THBS4)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA EGF_3 cEGF EGF
Other DBs
NCBI-GeneID: 113255163
NCBI-ProteinID: XP_026354505
UniProt: A0A3Q7VQK3
LinkDB
Position
Unknown
AA seq 964 aa
MLAPRGAVFLLLHLALQPWLGAGAQATPQVFDLLPSSSQRPNPAALQPILTDPTLNEVYV
ISTFKLHTKSSATIFGLYSSTDNSKYFEFTVMGRLNKAILRYLKNDGKIHLVVFNNLQLA
DGRRHRVLLRLSNLQRGAGSVELYVDCAQVDSVHNLPRAFSRSSQSPEPVELRTFQRKAQ
DYLEELKLVVRGSLFQVASLQDCFLQQSEPLATTSTGDFNRQFLGQMTQLNQLLGEVKDL
LRQQVKETTFLRNTVAECQACGPLSFQSPTANTLVPPAPPAPATSLTPPVRRCDSNSCFR
GVRCTDTRDGFQCGPCPDGYTGNGITCSDIDECKYHPCYPGVRCVNLAPGFRCEACPVGF
TGPIVQGVGISFAKSNKQVCTDIDECRNGACVLNSICINTLGSYRCGPCKPGYTGDQTRG
CKTERSCRNPELNPCSVNAQCIEERQGDVTCVCGVGWAGDGYICGRDVDIDSYPDEELPC
SARNCKKDNCKYVPNSGQEDADRDGIGDACDEDADGDGIVNEQDNCVLTHNVDQRNSDKD
IFGDACDNCRNVLNNDQKDTDGDGKGDACDDDMDGDGIKNILDNCPKVPNRDQWDKDGDG
VGDVCDSCPDVSNPNQSDVDNDLVGDSCDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKD
GIGDECDDDDDNDGIPDLVPPGPDNCRLVPNPAQEDSNSDGVGDICETDFDQDQVIDRID
VCPENAEVTLTDFRAYQTVVLDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFN
GVDFEGTFHVNTQTDDDYAGFIFGYQDSSSFYVVMWKQTEQTYWQATPFRAVAEPGIQLK
AVKSKTGPGEHLRNSLWHTGDTSDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVR
FYEGSDLVADSGVTIDTTMRGGRLGVFCFSQENIIWSNLKYRCNDTIPEDFQEFQTQNFD
RLDN
NT seq 2895 nt   +upstreamnt  +downstreamnt
atgctggccccgcgcggagccgtcttcctcctgctgcacctggccctgcagccctggctg
ggggccggagcccaagccaccccccaggtctttgaccttctcccatcctccagccagagg
ccgaacccagctgctctgcagccgatcctgacagaccccaccctgaatgaggtctacgtg
atctccaccttcaagctgcacactaaaagttcagccaccatctttggcctttactcttca
acggacaacagcaaatattttgaattcaccgtgatgggacgcttgaacaaagccatcctc
cgttacctgaagaacgacgggaagattcacttggtggttttcaacaacttgcagctggct
gacggcaggcggcacagggtcctcctgagactgagcaacttgcagcgaggggcaggctcc
gtggagctctatgtggactgcgcccaggtggactctgtgcacaatctccccagagccttt
tcccgttcctcccagagtcccgagcccgtcgaattgaggactttccagaggaaggcacag
gactacttggaagagctgaagctggtggtgagaggctcgctgttccaggtggccagcctg
caagactgcttcctgcagcagagtgagcccctggccaccacaagcacaggagactttaat
cggcagttcttggggcaaatgacacaactgaaccagctactgggagaggtgaaggatctt
ctgagacagcaggtcaaggaaacgacatttttgcgaaacactgtagcggagtgccaggct
tgcggtccactcagctttcagtctccaaccgcaaacacactggtgcccccggcaccccca
gcgcccgcgacgtccctgacgccccctgtgcgccgctgtgattccaactcctgcttccgc
ggcgtccggtgtacggacaccagagacggcttccagtgtgggccctgccctgacggctac
accggaaacgggatcacctgttctgatatcgacgagtgcaaataccatccctgctaccca
ggcgtgcgctgtgtgaacttggctcccggcttccggtgcgaggcctgccccgtgggcttc
accggccccatagtacagggtgtcggcatcagttttgccaagtcaaacaagcaggtctgc
accgacattgatgagtgtcgaaacggagcatgtgttctcaattctatctgtattaacact
ttgggatcttaccgctgtgggccctgcaaaccagggtacacgggtgatcagacgagggga
tgcaaaaccgaaagaagctgcagaaaccctgagctgaatccttgcagcgtgaacgcgcag
tgcatcgaagagaggcagggtgacgtgacgtgtgtgtgcggcgtcggctgggctggcgac
ggctatatctgcgggagggacgtggacatcgacagttaccccgacgaagaactgccgtgc
tctgccaggaactgcaagaaggacaactgcaagtacgtgccaaactctggccaggaggat
gcagacagagacggcattggggacgcctgtgacgaggatgccgacggagatgggatcgtc
aatgaacaggataactgtgtcctgactcacaacgtggaccaaaggaacagcgacaaagat
atctttggggacgcctgtgataactgccggaatgtcctgaataatgaccagaaagacacc
gatggggatggaaaaggagatgcctgtgatgacgacatggacggagacggaataaaaaac
attctggacaactgcccaaaagttcccaatcgtgaccaatgggacaaggatggggatggc
gtgggggatgtctgtgacagttgtcctgatgtcagcaaccctaaccagtctgatgtggat
aacgatctggtcggggactcctgtgacaccaatcaggacagtgatggagatgggcaccag
gacagcacagacaactgccccacagtcattaacagtgcccagctggacactgataaggat
gggattggtgacgagtgtgatgacgacgatgacaacgatggcatcccagacctggtgccc
cctggaccggacaactgccggctggtccccaacccagcccaggaggatagcaacagcgac
ggcgtgggagacatctgcgagacagactttgaccaggaccaggtcatcgatcggatcgat
gtctgcccggagaatgcagaggtcaccctgactgacttcagagcctaccagactgtggtc
ctggaccctgaaggagacgcccagatcgatcccaactgggtggtcctgaaccagggcatg
gagatcgtgcagaccatgaacagcgatcccggcctggcagtggggtacacggcgttcaac
ggagtggactttgaagggaccttccacgtgaacacccagacggacgatgactacgccggc
ttcatcttcggctaccaggacagctccagcttctacgtggtcatgtggaagcagacagag
cagacatactggcaagccacgccgttccgggctgtcgccgaacccggcatccagcttaag
gccgttaagtctaagaccggtcccggggagcatctccgcaactccctgtggcacactggg
gacaccagcgaccaggtgcggctgctctggaaggactcaaggaacgtgggctggaaggac
aaggtgtcctaccgctggttcctgcagcacaggccccaggtgggctacatcagagtgcga
ttttatgaaggctccgacttggtggccgactccggggtcaccatagacaccaccatgcgt
ggaggccggctcggcgtgttctgcttctcccaagaaaacatcatttggtccaacctcaag
tatcgctgcaatgacaccatcccagaggacttccaagagtttcaaacccagaacttcgat
cgcctggataattaa

KEGG   Ursus arctos horribilis: 113256730
Entry
113256730         CDS       T05909                                 

Gene name
COMP
Definition
(RefSeq) cartilage oligomeric matrix protein
  KO
K04659  thrombospondin 2/3/4/5
Organism
uah  Ursus arctos horribilis
Pathway
uah04145  Phagosome
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05144  Malaria
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113256730 (COMP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113256730 (COMP)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    113256730 (COMP)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113256730 (COMP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113256730 (COMP)
  09174 Infectious disease: parasitic
   05144 Malaria
    113256730 (COMP)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:uah04131]
    113256730 (COMP)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113256730 (COMP)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113256730 (COMP)
Membrane trafficking [BR:uah04131]
 Endocytosis
  Phagocytosis
   Opsonins
    113256730 (COMP)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113256730 (COMP)
  Exosomal proteins of colorectal cancer cells
   113256730 (COMP)
  Exosomal proteins of bladder cancer cells
   113256730 (COMP)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113256730 (COMP)
SSDB
Motif
Pfam: TSP_C TSP_3 EGF_CA COMP EGF_3 cEGF EGF_MSP1_1 hEGF
Other DBs
NCBI-GeneID: 113256730
NCBI-ProteinID: XP_026356348
UniProt: A0A3Q7W4Z3
LinkDB
Position
Unknown
AA seq 756 aa
MVPAAACVLLLTLAALGASGQGQISLGADLGPQMLRELQETNAALQDVRELLRQQVKEIT
FLKNTVMECDACGMQPVRTSGLSVRPLSQCAPGFCYPGVVCTQTASGARCGPCPAGFTGN
GSHCTDVNECNAHPCFPRVRCINTSPGFRCEACPPGYSGPAHEGVGLAFAKANKQVCTDI
NECETGQHNCVPNSVCVNTRGSFQCGPCQPGFVGDQTSGCKRRPQRSCPDGTPSPCHEKA
DCILERDGSRSCVCAVGWAGNGLLCGRDTDLDGFPDEKLRCSERQCRKDNCVTVPNSGQE
DVDRDGIGDACDPDADGDGVLNEQDNCKLVRNPDQRNADGDKWGDACDNCRNQKNDDQKD
TDQDGKGDACDDDIDGDRIRNSLDNCPRVPNSDQKDSDGDGVGDACDNCPQKSNPDQRDV
DHDFVGDACDSDQDKDGDGHQDSRDNCPTVPNSAQQDSDHDGQGDACDEDDDNDGVPDSR
DNCRLVPNPNQEDLDRDGVGDACQGDFDADKVVDKIDVCPENAEVTLTDFRAFQTVVLDP
EGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFIF
GYQDSSSFYVVMWKQMEQTYWQANPFRAVAEPGIQLKAVKSSTGPGEQMRNALWHTGDTA
SQVRLLWKDPRNVGWKDKTSYRWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGGR
LGVFCFSQENIIWANLRYRCNDTIPEDYETQRLLQA
NT seq 2271 nt   +upstreamnt  +downstreamnt
atggttcctgccgccgcctgcgttctcctgctcaccctggccgccctcggcgcgtcgggt
cagggtcagatatcactaggtgcagatctgggcccgcagatgcttcgggaacttcaggag
accaacgcggcgttgcaggatgtgcgggagctgttgcggcagcaggtcaaggagatcacg
tttctgaaaaacacagtgatggagtgtgatgcgtgcgggatgcagccagtgcgcacctcg
ggtctgagcgtgcggcccctatcccagtgcgctcccggcttctgctacccaggcgtggtc
tgcactcagacggcgagcggcgcgcgctgtggaccctgccccgcgggcttcactggcaat
ggctcgcactgcaccgacgtcaacgagtgcaacgcccatccctgcttcccccgcgtccgc
tgcatcaacaccagcccgggcttccgctgcgaggcttgcccgccggggtacagcggcccc
gctcacgagggcgtggggctggccttcgccaaggccaacaagcaggtttgcacggacatt
aacgagtgtgagacagggcagcataactgcgtccccaactccgtgtgcgtcaatacccgg
ggctccttccagtgcggcccttgccagcccggcttcgtaggcgaccagacatctggctgc
aagcggcgcccacagcgctcctgccccgacggcacgcccagcccgtgccacgagaaggcc
gactgcattctggaacgcgatggctcgcgatcgtgcgtgtgcgccgttggctgggcaggc
aacgggctcctctgcggccgagacacggatttggacggcttccccgacgagaagcttcgc
tgctccgaacgccagtgccgtaaggacaactgtgtgacggtgcccaattcagggcaagag
gacgtggatcgcgacggcatcggagacgcctgcgatccggatgccgatggggacggggtc
ctcaacgagcaggataactgtaagctggtgcggaacccagaccagcgcaatgcggacggt
gacaagtggggtgatgcttgcgacaactgccggaaccagaagaacgacgaccaaaaggac
acagatcaggacggcaaaggcgatgcctgcgacgacgacatcgacggcgaccggatccga
aattcgttggacaactgcccaagggtgcccaactcagatcagaaggacagcgatggtgat
ggtgtaggggatgcctgtgacaactgtccccagaaaagcaacccagaccagagggatgta
gaccacgactttgtgggagacgcttgcgacagcgaccaagacaaggatggggatgggcac
caggactcacgggacaactgccctacagtaccaaatagtgcccagcaggactcagaccac
gatggccagggtgacgcctgcgacgaggatgacgacaacgacggggtccccgacagtcgg
gacaactgtcgcctggtgcccaacccaaaccaggaagacttagaccgtgatggagtgggc
gacgcgtgccagggcgacttcgacgcggacaaggtagtggacaagatcgatgtgtgtccg
gagaacgccgaggtcaccctcaccgacttccgggccttccagactgtcgtactggacccc
gagggcgacgcgcagatagatcccaactgggtggtgctcaaccagggtatggagatcgtg
cagacgatgaacagcgaccctggcctggctgtgggctatacggctttcaatggcgtggac
ttcgaaggcacgttccacgtgaacacggtcacagatgatgactatgcgggcttcatcttt
ggctaccaggacagctccagcttttatgtggttatgtggaagcagatggagcagacatac
tggcaggccaatcccttccgcgcagtagccgaacccggcattcagctcaaggccgtgaag
tcctccacaggccctggggagcagatgcggaacgcgctctggcacacaggggacacagca
tcgcaggtgcggctgctgtggaaggacccccgcaacgtgggttggaaggacaagacatcc
taccgctggttcctgcagcaccggccccaagtgggctacatcagagtgcggttctacgag
ggccctgagctggtggctgacagcaacgtggtcctggacacaaccatgcggggtggccgc
ctgggggtcttttgcttctcccaagagaacatcatctgggctaatctgcgctaccgatgc
aatgacaccatcccagaggactacgagactcagaggctgctgcaggcctag

KEGG   Ursus arctos horribilis: 113256862
Entry
113256862         CDS       T05909                                 

Gene name
LAMB2
Definition
(RefSeq) laminin subunit beta-2
  KO
K06243  laminin, beta 2
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113256862 (LAMB2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113256862 (LAMB2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113256862 (LAMB2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113256862 (LAMB2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113256862 (LAMB2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113256862 (LAMB2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113256862 (LAMB2)
   05145 Toxoplasmosis
    113256862 (LAMB2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N F5_F8_type_C
Other DBs
NCBI-GeneID: 113256862
NCBI-ProteinID: XP_026356605
UniProt: A0A3Q7VVP5
LinkDB
Position
Unknown
AA seq 1801 aa
MERASGDRGRDLRGQPGPWELRLGLLLSVLATALAQALAPDVPGCSRGSCYPATGDLLVG
RADRLTASSTCGLHGPQPYCIVSHLQDEKKCFLCDSRRPFSARDNPNSHRIQNVVTSFAP
QRRAAWWQSENGVPVVTIQLDLEAEFHFTHLIMTFKTFRPAAMLVERSADFGRTWHVYRY
FSYDCGADFPGVPLAPPRHWDDIVCESRYSEIEPSTEGEVIYRVLDPAIPIPDPYSPRIQ
NLLKITNLRVNLTRLHTLGDNLLDPRREIREKYYYALYELVVRGNCFCYGHASQCAPAPG
APAHAEGMVHGACVCKHNTRGLNCEQCQDFYHDLPWRPAEDGHSHACRKCECHGHAHSCH
FDMAAYLASGNMSGGVCDGCQHNTAGRHCELCRPFFYRDPSKDLRDPAVCRSCDCDPMGS
QDGGRCDPHDDPALGLVSGQCRCKEHVVGSRCQQCRDGYFGLSASDPAGCRRCQCDARGT
VPGTTSCDPNSGTCFCKRLVTGRGCNRCLPGHWGLSHDLLGCRPCDCDVGGAVDPQCDEA
TGQCRCRQHMVGRRCEQVQPGYFRPFLDHLTWEAEDTRGQVLDVVERLATPSGTPSWTGR
GFVRLQEGQTLEFLVAAVPRAMDYDLLLRLEPQVPEQWAEMEVTVQRPGPVSAHSPCGHV
LPKDDHIPGTLQPETRYMVFPRPVCLEPGVSYKLHLKLVRTGGRAQTEAPYSRPSLLIDS
LVLLPRVLVLEMFSGGDAAALERRATFERYRCHEEGLVPSKTPPSEACAPLLISLSTLLY
NGALPCQCDPQGSLSSECKPHGGQCLCKPAVAGRRCDLCAPGFYGFGPTGCQACQCSPEG
SLSGLCEGTSGQCPCRTGAFGLRCDRCQRGQWGFPRCQPCVCNGHADECDTHTGACLGCR
DHTGGEHCERCIAGFHGDPRLPYGGQCRPCPCPEGPGSRRHFATSCHRDGYSQQIMCQCR
AGYTGLRCDACAPGYFGDPSRPGGQCQPCECSGNIDPTDPDACDPRTGQCLRCLHHTEGP
HCAHCKPGFHGQAARQSCHRCSCNLLGTDPQQCPSADRCNCDPSSGQCPCLPNVQGPSCD
RCAPKFWNLTSGHGCQPCACHPSRARGPTCNEFTGQCHCRAGFGGRTCSECQELHWGDPG
LQCRACDCDPRGIDTPQCHRSTGHCSCRPGVSGVRCDQCARGFSGVFPACHPCHACFGDW
DRVVQDLAARTRRLEQRVQELQQTGVLGAFESSFWHIQEKLGTVQGIVGARNASAASTAQ
LVEATEELRREIGEATEHLTQLEAELTGVQDENFNANHALSGLERDGLALNLTLRQLDQH
LDLLKHSNFLGAYDSIRHAHSLSAEAEHRANTSALTVPSPVSNSADTRHRTEVLMGAHRE
DFNRKHMANQRALGELSARTHSLSLTAINELVCGPPGDAPCATSPCGGAGCLDEDGQPRC
GGLGCSGAVAMADLALGRARHTQAELQRALAEGGGILSQVAETRRQAGEAQQRAQAALDK
ANASRGQVEKANQELRELIQSVKDFLSQEGADPDSIEMVATRVLELSIPASPEQIQHLAG
EIAERVRSLADVDTILARTVGDVRRAEQLLQDAQRARSRAEGEKQKAETVQAALEEAQRA
QGAAQGAIQGAVVDTQDTERTLHQVQEKMAGAEQALSSAGERAQQLDGLLEALKLKRAGN
SLAASSAEETAGSAQGRAQEAEQLLQGPLGDQYQTVKALAERKAQGVLAAQARAEQLRDE
ARGLLQAAQDKLQRLQELEGTYEENERALEGKAAQLDGLEARMRSVLQAINLQVQIYNTC
Q
NT seq 5406 nt   +upstreamnt  +downstreamnt
atggagcgggcctcaggggaccgagggagggacctgcggggacagcctgggccctgggag
cttcgactgggcctactgctgagtgtgctggccaccgccctggcccaggccctggccccg
gatgtgccaggctgttcgaggggaagctgctaccctgccacaggtgacctgctagtgggc
cgtgctgacagactgactgcctcatccacctgtggcttgcatggcccccagccctactgc
atcgttagtcacctgcaggatgagaagaagtgcttcctgtgtgactcccggcgccccttc
tctgctagagacaacccaaacagccatcgcatccagaatgtagttaccagctttgcacca
cagcgccgggcagcctggtggcagtcagagaatggtgtccccgtggtcaccatccagctg
gacttggaggctgagtttcatttcacgcacctcattatgaccttcaagacatttcgtcct
gctgccatgctggtggagcgctcagctgactttggacgcacctggcatgtgtaccggtat
ttttcctatgactgtggggctgactttccaggagtcccactggcccccccaaggcactgg
gatgacatagtctgtgagtcccgctactcagagattgagccatccaccgaaggcgaggtc
atctatcgtgtgctggatcctgccatccctatcccagatccctatagcccacggatccag
aacctgctaaagatcaccaacctacgggtgaacctgacacggctacatacactgggggac
aacctgctggacccacggcgggagatccgtgagaaatactattacgccctctacgagctg
gttgtgcgtggcaactgcttctgttatggacatgcctcgcagtgtgcacccgccccagga
gctccagcccatgctgagggcatggttcatggggcctgcgtctgcaaacacaacactcgt
ggcctcaactgtgagcagtgtcaggatttctatcatgacttgccctggcgtccagccgag
gatggccacagtcatgcctgcaggaagtgtgagtgccatgggcacgcccacagctgccac
ttcgacatggccgcatacctggcatctggcaacatgagtgggggtgtgtgtgacggatgt
cagcacaatacagctgggcgccactgtgagctctgccgacccttcttctaccgtgaccca
agcaaggacctgcgggacccagctgtgtgccgctcctgtgattgtgaccccatgggttcc
caagacggtggtcgctgcgatccccatgatgatcctgctctggggctggtttcgggccag
tgtcgctgcaaagaacatgtggtgggctctcgctgccagcaatgccgtgatggctacttt
gggctcagtgccagcgaccctgcaggctgccggcggtgtcagtgtgatgcacggggcaca
gtgcctgggaccacctcctgtgaccccaacagcggaacctgtttctgcaagcgtctagtg
actggacgtggctgtaaccgctgcctgcctggtcactggggcctgagccatgacctgctt
ggctgccgtccgtgtgattgcgacgtgggtggtgccgtggatccccagtgcgatgaggcc
acaggtcagtgccgctgtcgccagcacatggtggggcgacgctgtgagcaggtgcagccc
ggctacttccggcccttccttgaccacctaacctgggaggctgaggacacccgaggacag
gtgcttgatgtggtggagcgcttggcgacccctagtgggactccatcctggacaggccgg
ggctttgttaggctgcaggaaggtcagacactggagtttttggtggccgctgtgccgaga
gccatggactacgacctgctgctgcgcttggagccccaggtccctgagcaatgggcagag
atggaagtgaccgtgcagcgcccagggcctgtgtctgcgcatagcccgtgtgggcatgtg
ctgcccaaggatgaccacattccagggactctgcagccagaaaccaggtatatggtgttt
cccagacctgtctgccttgagcctggcgtctcctacaaactgcatctgaagctggtgcga
acagggggacgtgcccagacagaggctccctactccagacccagcctgctcattgactcg
ctggtgctgctgccccgtgtcctggtgctggagatgtttagtgggggggatgctgctgcc
ctggagcgccgtgccacctttgagcgctaccgctgccatgaggaggggctggtgcccagc
aagacccctccctctgaggcctgcgcccccctcctcatcagcctgtccacgctactctac
aacggcgccttgccctgtcagtgcgacccccagggctcactaagttccgagtgcaaaccc
cacggcggccagtgcctgtgtaaacctgcggtggcggggcgccgctgtgacctctgtgcc
cccggcttctacggctttggccccacgggctgtcaagcctgccagtgcagccccgagggg
tcactcagtggcctgtgtgaagggaccagtgggcaatgcccctgccgaaccggggccttc
gggcttcgctgcgatcgctgccagcgtggccagtggggcttccctagatgccagccgtgt
gtctgcaacgggcatgcagatgagtgcgacacccacacgggcgcttgcctaggctgccgt
gatcacacagggggtgagcactgtgaaaggtgcattgctggcttccatggggacccacgg
ctaccatatgggggccagtgccggccctgtccctgccctgaaggccctggaagccggcgg
cactttgctacttcttgccaccgggatggctactcgcagcagatcatgtgccagtgtagg
gcaggctacacagggctgcggtgcgatgcttgtgcccctgggtactttggggacccgtca
aggccaggtggccagtgccaaccgtgtgagtgcagtggaaacattgaccccacggaccct
gatgcctgtgacccccgcacggggcaatgcctgcgctgcttacaccacacggaggggccg
cactgtgcccactgcaagcctggcttccatgggcaggctgcccgacagagctgtcaccgc
tgcagctgcaacttgctgggtacagatccccagcagtgcccatccgctgatcgctgcaac
tgtgacccaagcagtgggcagtgcccatgcctccccaatgtccagggccctagctgtgac
cgctgtgcccccaaattctggaaccttaccagtggccatggctgccagccctgtgcctgc
cacccaagccgagccagaggccccacctgcaatgagttcacagggcagtgccactgccgt
gctggctttggtgggcgaacctgttcagagtgccaggagctccactggggagaccctggg
ttgcagtgccgtgcctgtgattgtgaccctcgtggaatagacacacctcaatgtcaccgc
tccacagggcactgcagctgccgcccaggcgtgtcaggcgtgcgctgtgaccagtgtgcc
cgtggcttctcaggggtctttcctgcttgccacccatgccacgcatgcttcggggactgg
gaccgcgtggtacaggacttggctgcccgtacacggcgcctggagcagcgggtgcaggag
ctgcagcagacgggtgtgctgggtgcctttgagagcagcttctggcacatacaggagaag
ctggggactgtgcagggcattgtgggtgcccgcaacgcctcagctgcctccactgcacag
ctcgtggaggccacagaggagctgcggcgtgaaattggagaggccactgagcacctgacc
cagctggaggcagaactgacaggtgtgcaggacgagaacttcaatgccaaccatgcactg
agcggtctggagcgagacgggcttgcccttaatctcacactgcggcagctggatcagcat
ctggacctgctcaagcattcaaacttcctgggtgcctatgacagcatccgccacgcccac
agtctgtctgcagaggcagaacatcgtgccaacacatcggccttgacagtgcccagccct
gtgagcaactcagcagatacccggcatcggacagaggtgctgatgggtgcccacagggag
gacttcaaccgcaaacacatggccaaccagcgggcactaggcgagctctctgcccgtacc
cattccctgagcctgacagccataaatgaactggtgtgtggacccccaggggatgccccc
tgtgctacaagcccttgtgggggtgctggctgtctggacgaggatgggcagccccgctgt
gggggcctcggctgcagcggggcagtagccatggcggacctggcactgggtcgggcccgg
cacacacaggcagagttgcagcgggcactggcagaaggtggtggcatcctcagccaggtg
gctgagacccgtcggcaggcaggcgaggcacagcagcgggcccaggcagccctggacaag
gctaatgcttccaggggacaggtggagaaggccaaccaagaactccgggaacttatccag
agtgtgaaggacttcctcagccaggagggggctgatcctgatagcattgagatggtggcc
acacgagtgctagagctctccatcccagcatcacctgagcagattcagcacctggcaggc
gagattgcagagagggtccggagcctggcggatgtggacacaatcctggcgcgtactgtg
ggagatgtgcgtcgggcagagcagctgctacaagatgcacagcgggcaaggagccgggcc
gagggggagaaacagaaggcagaaacagtacaggcagcactggaggaagcccagagggca
cagggtgctgctcagggtgccatccagggggcagtagttgacacacaggacacggaaagg
accctgcaccaggtgcaggagaagatggcaggtgcggagcaggcactgagctctgcaggc
gagcgggctcagcaattggatggtctcctggaggctctgaaattgaaacgagcagggaat
agcctggcagcctctagtgctgaagaaacagcaggcagtgcccagggtcgtgctcaggaa
gctgagcagctgctgcagggcccactaggcgaccagtaccagacagtgaaggccctggct
gagcgcaaggcccagggcgtgctggctgcacaggcgcgggcagagcaacttcgggatgag
gctcgaggcttgttgcaggctgctcaggacaagctgcaaaggctgcaagagctggagggg
acctatgaggagaacgagcgggcactggaaggcaaagcagcccagctggacgggttggag
gccaggatgcgtagtgtgcttcaagccatcaacttgcaggtccagatctacaacacctgc
cagtga

KEGG   Ursus arctos horribilis: 113257484
Entry
113257484         CDS       T05909                                 

Gene name
COL2A1
Definition
(RefSeq) LOW QUALITY PROTEIN: collagen alpha-1(II) chain
  KO
K19719  collagen, type II, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113257484 (COL2A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113257484 (COL2A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113257484 (COL2A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113257484 (COL2A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113257484 (COL2A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113257484 (COL2A1)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113257484 (COL2A1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 113257484
NCBI-ProteinID: XP_026357419
UniProt: A0A3Q7WTA3
LinkDB
Position
Unknown
AA seq 1727 aa
MIRFGAPQTLVLLTLLVAAVLRCHGQDVQKAGSCVQDGQRYNDKDVWKPEPCRICVCDTG
TVLCDDIICEDMKDCLSPETPFGECCPVCSTDLATASGQPGPKGQKGEPGDIKDIVGPKG
PPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFA
AQMAGGFDEKAGGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMG
PRGPPGPPGKPGDDGEAGKPGKSGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKG
EAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVG
PAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKG
SAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQG
APGPAGEEGKRGARGEPGGAGPVGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAG
PKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPG
VMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPG
PSGFQVGGCTGPTLQFLPCPGRCGQQRALLPRGTGQGASLQAWLLSPCWGSPEGWGSKLC
EGGKPGDQGVPGEAGAPGLVGPRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKG
PALTPIPYPLPPPPPHLREKLDLPVLQELLVLAAPRVNAERLDPPGPLDSQVLPPILEQP
SGHGGHLLAGLLTVPPTHTPRAAGPFGKTWSIWLASHSPASSFLEQGADGQPGAKGEQGE
AGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGN
PGPPGPPGPSGKDGPKGARGDSGPPGRAGDPGLQGPAGPPGEKGEPGDDGPSVSPSQARP
RESLELTEPRVEGDAGEGLLPGGKRAGSCGWQPGWVWMAEPLQVRAHVLTPASPLXGPDG
PPGPQGLAGQRGIVGLPGQRGERGFPGLPGPAVAWPPQKQRCPLRLPPTRHPPNSPSVSP
FFRKAQQVLGRGSLSAHSSSLSLSLQGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPG
REGTPGADGPPGRDGAAGVKGDRGEAGAVGAPGAPGPPGSPGPAGPIGKQGDRGEAVSIL
NSVKAEVEEGGQVSWELTHCLPDPQALEAPLGPGGLWAVGAATSVPSPGQMQRGPSLSWL
CSDTFPLSPSQGAQGPMGPAGPAGARGIPGPQGPRGDKGEAGEAGERGLKGHRGFTGLQG
LPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAG
PPGNPGPPGPPGPPGPGIDMSAFAGLGQREKGPDPLQYMRADQAAGDLRQHDAEVDATLK
SLNNQIESLRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETG
ETCVYPNPANVPKKNWWSSKNKDKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRL
LSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTVLKDGCTK
HTGKWGKTLIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL
NT seq 5184 nt   +upstreamnt  +downstreamnt
atgatccgcttcggggctccccagacgctggtgctgctgacgctgctcgtcgccgctgtc
cttcgatgtcacggccaggatgtccagaaggctggcagctgtgtgcaggacgggcagagg
tataatgataaggatgtgtggaagcccgagccctgccggatctgtgtctgtgacactggg
actgtcctctgcgacgacataatctgtgaagacatgaaagactgcctcagccccgagacc
cccttcggagagtgctgccccgtctgctcaactgacctcgccactgccagtgggcaacca
ggaccaaagggacagaaaggagaacctggagacatcaaggatatcgtaggacctaaagga
cctcctgggcctcagggacctgccggtgaacaaggacccagaggtgaccgtggtgacaaa
ggtgaaaagggtgcccctggacctcgtggcagagatggagagcctgggacccctggaaat
cctggcccccctggtcctcctggcccccctggcccccctggccttggtggaaactttgct
gcccagatggctggaggatttgatgagaaggctggtggcgcccagatgggagtgatgcaa
ggaccaatgggccccatgggacctcgaggacctccaggccctgcgggtgctcctggacct
caaggatttcaaggcaaccctggtgaacctggggaacccggcgtctctggtcccatgggt
ccccgtggtcctcctggcccccctggaaaacctggtgatgatggcgaagctggaaagcct
ggaaaatctggtgaaaggggcccccctggccctcagggtgctcgcggcttcccgggaacc
ccaggccttcctggtgtcaagggtcacagaggctacccaggtctagatggtgctaagggc
gaagccggtgctccaggtgtgaagggagagagtgggtcacccggcgagaatggttcccca
ggccccatgggtccccgcggcctgcccggtgagcgaggacggaccggccctgctggtgcc
gctggtgcccggggcaacgacggccaaccaggccccgcaggccctccgggtcccgtgggt
cctgccggcggtcctggcttccctggtgctcccggggccaagggtgaagctggccccact
ggtgctcgtggtcccgaaggcgctcaaggtcctcgtggcgaacctggtactcctgggtcc
cccgggccggctggtgcctccggtaaccccggaactgatggaattcccggagccaaagga
tctgctggtgctcctggcattgctggtgcccccggcttccctgggccccgtggtccacct
ggccctcaaggtgcaactgggcctctgggcccgaaaggtcagacgggcgagcctggcatt
gctggcttcaaaggtgaacaaggccccaagggagaacctggccctgctggtccccaagga
gcccctggtcctgctggtgaagaaggcaagagaggtgcccgtggagagcctggtggcgct
gggcccgttggtccccccggagagagaggtgctcctggcaaccgtggtttcccaggtcag
gatggtctggcaggtccaaagggagcccctggagagcgagggcccagcggccttgctggt
cccaaaggagccaacggtgaccctggccgtcctggagagcctggccttcctggagcccgg
ggtctcactggtcgccctggtgatgctggtcctcaaggcaaagttggtccttctggagct
cctggtgaagacggtcgccctggtcctccaggtcctcagggcgctcgtgggcagcctggt
gtcatgggtttccctggccccaaaggtgccaacggcgagcctggcaaggctggtgagaag
ggactgcccggcgctcctggtctgagaggtcttcctggcaaagacggtgagacaggtgct
gcaggaccccccggacccgccggacctgctggtgaacgaggcgagcagggtgctcctggg
ccatctgggttccaggtaggtggctgcactgggcccactcttcagttccttccatgccca
ggccgctgtgggcagcagcgggcactcctcccgagggggacggggcagggggcttcgcta
caggcctggctcctgtccccgtgctggggctccccggagggctggggcagtaaactctgc
gaaggtggaaaaccaggtgaccagggcgttcccggtgaagctggagcccccggcctcgtg
ggtcccaggggtgacgttggtgagaaaggccccgagggagcccctggaaaggacggtgga
cgaggcttgactggtcccattggcccccctggtcccgctggtgccaatggtgagaagggc
cccgccctgacccccatcccttatccgttgcccccccccccaccccacctcagggagaag
ttggacctcccggtcctgcaggagctgctggtgctcgcggcgccccgggtgaacgcggag
agactggaccccccgggcccgctggattcgcaggtcctccctcccatcttagagcagcct
tcaggccatggtggacacctgctcgctggcttgctcactgtcccacccactcacacaccc
agggctgctggcccttttggaaaaacctggagtatttggcttgcctctcattctcccgct
tcctccttcttggaacagggtgctgacggccaaccaggtgccaagggcgagcaaggagag
gctggccagaaaggtgatgctggtgccccgggtcctcagggcccctctggagctcccggg
cctcagggtcctactggtgtgactggtcctaaaggagcccgaggtgctcaaggccccccg
ggagccaccggattccctggagctgctggccgcgtcggacccccaggctccaatggaaac
cctgggccccctggtccccctggtccttctggaaaagatggtcccaaaggtgctcgagga
gacagcggcccgcctggccgtgccggtgaccctggcctccaaggtcctgctggaccccct
ggcgagaagggagaacctggagatgatggtccctctgtaagtccctcacaggcccggccc
agggagtctttggagctcacagagcccagagtggaaggggacgccggggaggggctgctg
ccgggcgggaagagggctggttcctgcggatggcagccaggctgggtgtggatggctgag
cccctccaggtcagagctcacgttctcactcctgcctcccctctctagggtcccgatggt
cctccaggtccccaaggtctggctggtcagaggggcatcgtcggtctgcccgggcagcgt
ggtgagagaggattccctggcttgcccggcccagcggtggcctggcctccgcagaagcag
cgctgccctctgaggcttccacccactcggcaccctccgaactcaccctctgtcagccct
ttctttagaaaggctcagcaggtcctgggccggggttctttgagtgcacactcatcgtcc
ctgtcgctgtccctccagggtgagcctggcaagcagggagctcctggtgcatctggagac
cgaggtccccccggccccgtgggtcctcctggcctgaccggtcctgctggcgaacctgga
cgagagggaacccctggtgctgatggcccccctggcagagatggtgccgctggagtcaag
ggtgatcgtggtgaggctggtgctgtgggtgcccccggagcccccgggccccctggctct
cctggccccgctggcccaattggaaagcagggagaccgaggagaagctgtgagtatcctt
aattcagtaaaagctgaagtggaagagggtggccaagtgagctgggagctcacccactgt
ctgcctgatccccaggccctggaggcacccctgggcccgggaggtctctgggccgtggga
gcagcaacctcagtgcccagcccaggacagatgcagagggggccctccctgtcctggctc
tgctctgacacgttcccactctctccctcacagggcgcacaaggccccatgggtcccgca
ggaccggctggagcccggggaatcccaggccctcaaggtccccggggtgacaaaggagaa
gctggagaggctggcgagagaggactgaagggacaccgtggcttcactggtctgcagggt
ctgcccggccctcctggtccttctggagaccaaggtgcttctggtcctgctggtccttct
ggccctagaggtcctcctggtcccgtcggtccctccggcaaagatggtgctaatggaatc
cctggtcccatcggacctcctggtccccgtggacgttcaggcgaaactggccccgcgggt
cctcctggaaaccccggacctcctggccctccaggaccccctggccctggcattgacatg
tctgcctttgctggccttggccagagagagaagggccctgaccccctgcagtacatgcgg
gctgaccaggcagccggcgacctgagacagcatgatgccgaggtggacgccacgctcaag
tccctcaacaaccagattgagagcctccgcagccccgagggctcccgcaagaaccccgct
cgcacctgccgggacctgaaactctgccaccctgaatggaagagcggagactactggatt
gaccccaaccagggctgcaccttggatgccatgaaggttttctgcaacatggagactggc
gagacctgcgtctaccccaacccagcgaacgttcccaagaagaactggtggagcagcaag
aacaaggacaagaaacacatctggtttggagaaaccatcaatggtggcttccacttcagc
tatggtgatgacaacctggctcccaacactgccaacgtccagatgaccttcctgcgcctg
ctgtccaccgagggctcccagaacatcacctaccactgcaagaacagcatcgcctacctg
gacgaagccgccggcaacctcaagaaggccctgctcatccagggctccaatgacgtggag
atccgggccgagggcaacagcagattcacgtacaccgtcctgaaggatggctgcacgaaa
cacaccggtaagtggggcaagactctgatcgagtaccggtcacagaagacctcgcgcctc
cccatcattgacattgcgcctatggacataggagggcccgagcaggaatttggtgtggac
atagggcctgtctgcttcttgtaa

KEGG   Ursus arctos horribilis: 113257990
Entry
113257990         CDS       T05909                                 

Gene name
VWF
Definition
(RefSeq) von Willebrand factor
  KO
K03900  von Willebrand factor
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04610  Complement and coagulation cascades
uah04611  Platelet activation
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113257990 (VWF)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113257990 (VWF)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113257990 (VWF)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    113257990 (VWF)
   04611 Platelet activation
    113257990 (VWF)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113257990 (VWF)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03110 Chaperones and folding catalysts [BR:uah03110]
    113257990 (VWF)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113257990 (VWF)
Chaperones and folding catalysts [BR:uah03110]
 Intramolecular chaperones
  Others
   113257990 (VWF)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113257990 (VWF)
SSDB
Motif
Pfam: VWD VWA C8 VWA_N2 TIL VWA_2 VWC VWA_3 Pacifastin_I MCR_beta_N
Other DBs
NCBI-GeneID: 113257990
NCBI-ProteinID: XP_026358415
UniProt: A0A3Q7W078
LinkDB
Position
Unknown
AA seq 2813 aa
MIPARLATVLLALALTLSGTLCTKGTVGRSPMARCSLFGDDFINTFDESMYSFAGDCSYL
LAGDCQKHSFSLIGGFQNGRRVSLSVYLGEFFDIHLFVNGTALQGTQSIAMPYASNGLYL
ETEAGYYKLSSEAYGFVAKIDSSGNFQVLLSDRYFNKTCGLCGNCNIFAEDDFRTQEGTL
TSDPYDFANSWALSSGEKRCKRVYPPSSPCNVSSEEMQQGPWEQCQLLKSASVFARCHPL
VDPEPFVALCERTLCTCAPGQRCPCAVLLEYARACAQQGMVLYGWTDHSTCRPACPAGME
YKECVSPCTRTCQSLQVQAVCQEQCVDGCSCPEGQLLDEGRCVGSADCSCVHAGQRYPPG
ASLSRDCNTCICRNSLWICSNEECPGECLVTGQSHFKSFDNRYFTFSGVCQYLLAQDCQD
HTFSVVIETVQCADDPDAVCTRSVTVRLPGLHNSLVKLKHGGGVSMDGQDVQIPLLQGDL
RIQHTVMASVRLSYGEDLQMDWDGRGRLLVMLSPVYAGKTCGLCGNYNGNRGDDFVTPAG
LAEPLVEDFGNAWKLHGDCENLQKQPSDPCSLNPRLTRFADEACALLTSSKFEACHRAVG
PLPYVQNCRYDVCSCSDGRDCLCSAVANYAAACARRGVHIGWREPSFCALSCPQGQVYLQ
CGTPCNMTCRSLSYPDEDCEEVCSEGCFCPPGLYLDERGECVPKAQCPCYYDGEIFQPED
IFSDHHTMCYCEDGFMHCTTSGAPGSLLPDTMLGSPRSHRSKRSLSCRPPMVKLVCPADN
PRAEGLECAKTCQNYDLQCMSTGCVSGCLCPPGMVRHENRCVALERCPCFHQGREYAPGE
TVTIDCNTCVCRDRKWNCTDHVCDATCSAIGMAHYLTFDGLKYLFPGECQYVLVQDYCDS
NPGTFRILVGNEGCSYPSVKCKKRVTILAEGGEIELFDGEVNVKKPMKDETHFEVVESGQ
YVILLLGKALSVVWDHRLSISVSLKRTYQERVCGLCGNFDGVQNNDFTSSSLQLEEDPVD
FGNSWKVNPQCADTSKVPLDSSPAVCHNNLMKQTMVDSSCRILTSDLFQDCNRLVAPEPY
LDICIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVAWRTATLCPQNCEERNLHENGY
ECEWRFNSCAPACPITCQHPEPLACPVQCVEGCHAHCPPGKILDELLQTCIDPEDCPVCE
VAGRRLAPGKKITLDPDDPEHCQICHCEGVNLTCEACREPGGLVVPPTEGPVASTTVYVE
DTPEPPLHDFYCSKLLDLVFLLDGSSKLSEDEFQVLKVFVVGMMERLHISQKRIRVAVVE
YHDGSHAYLELKDRKRPSELRRIASQVKYVGSEVASTSEVLKYTLFQIFGKIDRPEASRI
ALLLMASQEPPRLARNLIRYVQGLKKKKVIVIPVGIGPHANLKQIRNIEKQAPENKAFVL
SGVDELEQRRDEIINYLCDLAPEAPAPTQHPPMVPVTVGPELLGVPSSGPKRNSMVLDVV
FVLEGSDKIGEANFNKSREFMEEVIQRMDVGQDSIHVTVLQYSYTVAVEYTFSEAQSKGE
VLQHVREIRYRGGNRTNTGLALQYLSEHSFSVSQGDREQVPNLVYMVTGNPASDEIKRMP
GDIQVVPIGVGPHTNVQELERISWPNAPILIQDFETLPREAPDLVLQRCCSGEGPQAPTL
APTPDCSQPLDVALLLDGSSSFPASYFEEMKSFAKAFISRANIGPQLMQVSVLQYGSTTT
AAVPWNVAYEKAHLLSHVDLMQREGGLSHIGDALDYAVRYVTSEVHGARPGASKVVIILV
TDVSVDSVDAAANAATSNRVTVFPIGIGDRYDEAQLRRLAGPNAGSNVLRLQRIEDLSTV
ATLGNSFFHKLCSGFVRVCVDEDGNEKRPGDVWTLPDQCHTVTCLPDGQTLLKSHRVNCD
RGPRPSCPNGQPPLRVEETCGCRWTCPCVCMGSSTRHIVTFDGQNFKLTGSCSYVLFQNK
EQDLEVILHNGVCSPGVRQACMKSIEVKHDGLSVELHSDMQLTVNGRLVPVPYVGGDMEV
NVYGTIMYEIRFNHLGHIFMFTPRNNEFQLQLSPKTFASKTYGLCGICDENAANDFVLRD
GTVTTDWRALVQEWTIQEPGKTCQPISWEQCPVSSSSHCQVLLSELFAECHKVLAPATFY
AMCQQDSCHQEQVCEAIALYAHLCRTKGVCVDWRRTNFCDMSCPPSLVYNHCERGCPRLC
EGNTSSCGDHPSEGCFCPPNRVMLQGSCVPEEACTQCVSEDGVRRQFLETWVPAHQPCQI
CMCLSGRKVNCTSQPCPTARAPTCGPCEVARLRQNAEQCCPEYECVCDLLSCNLPPAPPC
EGGLQMTLTNPGECRPNFTCACRKDECRRESPPSCPPHRTPTLRKTQCCDEYECACNCVN
STVSCPLGYLTSAITNDCGCTTTTCFPDKVCVHRGTVYPVGQFWEEGCDVCTCTDLEDSV
MGLRVAQCSQKPCEDNCRSGFTYVIHEGECCGRCLPSACEVVIGSPRGDSQSHWKNVGSH
WASPDNPCLINECVRVKEEVFVQQRNVSCPQLDVPTCPTGFQLSCKTSECCPTCRCEPLE
ACMLNGTIIGPGKSLMIDVCTTCHCTVQVGVISGFKLECRKTTCEACPPGYKEEKNQGEC
CGKCLPTACTIQLRGGQIMTLKRDEMLQDGCDSHFCKVNERGEYIWEKRVTGCPPFDEHK
CLAEGGRVMKIPGTCCDTCEELECKDITARLRYVKVGDCKSEEEVNIHYCEGKCASKALY
SIDTEDVQDQCSCCSPTRTEPMQVPLRCANGSLVYHEILNAMQCRCSPRKCSK
NT seq 8442 nt   +upstreamnt  +downstreamnt
atgattcctgccagacttgcgacggtgctgctggctctggccctcaccttgtcagggacc
ctttgtacaaaagggactgttggcaggtcaccgatggcccggtgcagcctctttggagat
gacttcatcaacacctttgacgagagcatgtacagctttgcgggagattgcagttatctc
ctggctggagactgccagaagcactccttctcgcttatcgggggcttccaaaatggcagg
agagtgagcctctccgtgtatctgggagaattttttgacatccatttgtttgtcaatggt
actgcactgcaggggacccaaagcatcgccatgccctatgcctccaacgggctctacctg
gagaccgaggctggctactacaagctgtccagtgaggcctacggctttgtggccaaaatt
gacagcagtggcaactttcaagtcctgctgtcagacagatacttcaacaagacctgtggg
ctctgtggcaactgtaacatctttgccgaggatgacttcaggacgcaagaagggaccttg
acctcggacccctatgactttgccaactcctgggccctgagcagcggggaaaaacggtgc
aaacgggtgtaccctcccagcagcccatgcaatgtctcctctgaggaaatgcagcagggc
ccgtgggagcagtgccagcttctgaagagcgcctcggtgtttgcccgctgccaccctctg
gtggaccctgagcctttcgtggccctgtgtgaaaggacgctgtgcacgtgtgccccgggg
cagaggtgcccttgtgcggtcctcctggagtatgcccgggcctgtgcccagcagggcatg
gtcttgtacggctggaccgaccacagcacctgccgaccggcatgcccggctggcatggag
tacaaggagtgcgtgtccccttgcaccagaacttgccagagcctgcaggtccaagcagtg
tgtcaggagcaatgcgtggatggctgcagctgccctgaaggacagctcctggatgaaggc
cgctgcgtggggagcgctgactgttcctgtgtgcacgccgggcagcggtaccctccgggc
gcctccctctcgcgggactgcaatacctgcatttgccggaatagcctgtggatctgcagc
aatgaagaatgcccaggggagtgtctcgtcacaggacaatcccacttcaagagctttgac
aacaggtacttcaccttcagtggggtctgccagtacctgctggcccaggactgccaggac
cacaccttctctgttgtcatcgagaccgtgcagtgcgccgatgaccctgatgctgtctgc
acccgctcagtcactgtccgcctgccgggattgcacaacagcctcgtgaagctgaagcac
gggggaggcgtgtccatggatgggcaggatgtccagatccctctcctgcaaggtgacctc
cgcatccagcacactgtgatggcctccgtgcgcctcagctatggggaggacctgcagatg
gactgggatggccgtgggaggctgctggtgatgctgtccccagtctacgccgggaagacg
tgcggcctgtgcgggaattacaacggcaacaggggcgacgacttcgtcacgcccgcaggc
ctggcggagcccctggtggaggacttcgggaacgcctggaaactgcacggggactgtgaa
aacctgcagaagcagcccagcgatccgtgcagcctcaacccacgcctgaccaggtttgca
gacgaggcatgcgccctgctgacgtcctccaagttcgaggcctgccaccgcgccgtgggc
cctctgccctacgtgcagaactgccgctacgacgtctgctcctgctccgacggcagagac
tgcctttgcagcgccgtggccaactacgccgcagcctgtgcccggagaggcgtgcacatc
gggtggcgggagcccagcttctgtgcgctcagctgccctcagggccaggtgtatcttcag
tgtgggaccccctgcaacatgacctgccgctccctttcttacccggatgaggactgcgaa
gaggtctgctcggaaggctgcttctgccccccagggctgtacctggacgagaggggagag
tgcgtgcccaaggcccagtgcccctgctactatgatggcgagatctttcagcccgaagac
atcttctcagaccatcacaccatgtgctactgtgaggacggcttcatgcactgtaccacg
agtggcgccccaggaagcctgctgcccgacaccatgctcggcagtccccggtctcaccgc
agcaaaaggagcctgtcctgccggccccccatggtcaagttggtgtgccccgctgataac
ccgagggctgaagggctcgagtgtgccaaaacctgccagaactacgacctgcagtgcatg
agtacgggctgtgtctccggttgcctctgccccccgggcatggtccggcatgagaacagg
tgtgtggcactggaaaggtgtccctgcttccaccaaggcagagagtacgccccaggagaa
accgtgacaatcgactgcaacacttgtgtctgtcgggaccggaagtggaactgcacggac
cacgtgtgtgatgccacgtgctctgccatcggcatggctcactacctcaccttcgacggc
ctcaaatacctgttccctggggagtgccagtacgttctggtgcaggattactgtgacagt
aaccctgggaccttccggatcctggtggggaacgaggggtgcagctacccttcagtgaaa
tgcaagaagcgggtcaccatcctggcggaaggaggagagattgaactgtttgatggggag
gtgaatgtgaagaagcccatgaaggatgagacccacttcgaggtggtggagtccggtcag
tacgtcatcctgctgctgggcaaggcgctctccgtggtctgggaccaccgcctgagcatc
tccgtgagcctgaagcggacctaccaggagcgggtgtgtggcctgtgcgggaattttgat
ggtgtccagaacaatgacttcaccagcagcagcctccagttggaagaggaccctgtggac
tttgggaattcttggaaagtgaacccgcagtgtgccgacaccagcaaagtgcccctggac
tcgtctcctgccgtctgccacaacaacctcatgaagcagacgatggtggattcctcctgc
aggatcctcaccagtgatcttttccaggactgcaacaggctggtggcccctgagccatac
ctggacatttgcatctacgacacttgctcctgtgagtccattggggactgcgcctgcttc
tgtgacaccattgctgcctatgcccatgtgtgtgcccagcacggcaaggtggtggcctgg
aggacagctacattgtgtccccagaactgcgaggaaaggaatctccacgagaatgggtat
gagtgcgagtggcgctttaacagctgtgcccctgcctgtcccatcacgtgccagcacccc
gagccactggcgtgccctgtgcagtgtgttgaaggctgccacgcacactgccctccaggg
aaaatcctggatgagcttttgcagacctgcatcgaccctgaagactgtcctgtgtgtgag
gtggctggtcgtcgcttggccccgggaaagaagatcaccttggaccccgatgacccggag
cactgccagatttgtcattgtgagggcgtcaacctcacctgtgaggcctgcagggaaccg
ggaggcctcgtggtgccccccacggaaggccctgtcgcctccaccaccgtgtatgtggag
gacaccccagagccgcccctccatgacttctactgcagcaagcttctggacctggttttc
ctgctagatggctcctccaagctgtccgaggacgagttccaagtgctgaaggtctttgtg
gtcggcatgatggagcgtctgcatatctcccagaagcggatccgcgtggccgtggtggag
taccacgatggctcccacgcctaccttgagctcaaggaccggaagcgaccgtcggagctg
cggcgcatcgccagccaggtgaagtacgtgggcagcgaggtggcctccaccagcgaggtc
ttgaagtacacgctgttccagatctttggcaagatcgaccgccccgaagcgtcccgcatc
gccctgctcctgatggccagccaggagcccccgaggctggcccggaatttgatccgctac
gtgcaaggcctgaagaagaagaaagtcattgtgatccccgtgggcattgggccccacgcc
aacctcaagcagatccgcaacatcgagaagcaggcacccgagaacaaggccttcgtgctc
agtggtgtggacgagctggagcagcgaagggacgagattatcaactacctctgtgacctt
gcccctgaagcacctgcccctacccagcacccccctatggtcccggtcactgtgggtccg
gagctcttgggggttccatcatcaggacccaaaaggaactccatggttctggatgtggtg
tttgtcctggaagggtcggacaaaattggcgaggccaactttaacaagagcagggagttc
atggaggaggtgattcagcggatggacgtgggccaggacagcatccacgtcacggtgctg
cagtactcgtacacggtggccgtggagtacaccttcagcgaggcccagtccaagggggag
gtcctgcagcacgtgcgggagatccgataccggggcggcaacaggaccaacaccgggctg
gccctgcagtacctgtctgaacacagcttctctgtcagccagggggaccgggaacaggtc
cctaacctggtctacatggtcacaggaaaccctgcctctgatgagatcaagcggatgcct
ggagacatccaggttgtgcccattggggtgggtcctcacaccaatgtgcaggagctggag
cgcattagctggcccaatgcccccatcctcatccaggactttgagacgctgccccgagag
gcccctgatctggtgctgcagaggtgctgctctggagaggggccacaggcccccaccctc
gcccccaccccagactgcagccagcccctggacgtggccctcctcctggacggctcctcc
agcttccccgcttcttactttgaggagatgaagagtttcgccaaggctttcatttcaaga
gctaatatagggccccagctcatgcaggtctccgtgctgcagtacgggagtaccaccacc
gccgccgtgccatggaacgtggcctatgagaaagcccatctcctgagccacgtggacctc
atgcagcgggagggaggcctcagtcacattggggatgctttggactatgctgtgcgctat
gtcacctcagaagtgcacggtgccaggcctggagcctcaaaggtggtgatcatcctagtc
acagatgtctccgtggattcggtggatgctgcagccaatgccgccacgtccaaccgagtg
acagtgttccccattggaattggggatcggtatgacgaggcccagctgaggaggttggca
ggtccaaatgccggctccaacgtgctaagacttcagcgaatcgaagacctctccaccgtg
gccaccctgggaaattccttcttccacaagctgtgctctgggttcgttagagtctgcgtg
gatgaggatgggaacgagaagaggcctggggacgtctggactttgccagaccagtgccac
acagtgacttgcttgccagatggccagaccttgctgaagagtcatcgggtcaactgtgac
cgggggccaaggccttcatgccccaatggccagccccctctcagggtggaggagacctgt
ggctgccgctggacctgcccctgtgtgtgcatgggcagctctacccggcacatcgtgacc
ttcgatgggcagaatttcaagctgactggcagctgctcgtatgtcttatttcaaaacaag
gagcaggacctggaggtgattctccataatggtgtctgcagccctggggtgaggcaggcc
tgcatgaaatccatcgaggtgaagcatgacggcctctcagttgagctccacagtgacatg
cagctgacagtgaatgggagactggtccctgtcccatacgtgggtggggacatggaggtc
aatgtctatggcaccatcatgtatgagatcagattcaaccatcttggccacatcttcatg
ttcactccacggaataatgagttccagctgcagctcagccccaagacctttgcttcgaag
acgtatggtctctgtgggatctgtgacgagaacgcagccaatgacttcgtgctaagggat
ggcacggtcaccacagactggagagcacttgtccaggaatggaccattcaggagccaggg
aagacatgccagccgatctcctgggagcagtgtcctgtctccagcagctcccactgccag
gtcctcctctcagaattgtttgctgagtgccacaaggttctcgctccggccaccttttac
gccatgtgccagcaagacagttgccaccaggagcaggtgtgtgaggcgatcgccttgtat
gcccacctctgtcggaccaaaggggtctgtgttgactggaggaggaccaatttttgtgat
atgtcatgcccaccatccctggtgtacaaccactgtgagcgtggctgcccccggctctgt
gaaggcaatacgagctcctgtggggaccatccctctgaaggctgcttctgccccccaaac
cgagtcatgctgcaaggtagctgtgtccccgaggaggcctgtacccagtgcgtcagtgag
gatggagtccggcgccagttcctggaaacctgggtcccagctcaccagccctgccagatc
tgcatgtgcctcagtgggcggaaggtcaattgtacgtcgcagccctgccccacggccaga
gctcccacgtgtggcccgtgtgaagtggcccgtctccgccagaatgcagagcagtgctgc
ccggagtatgagtgtgtgtgtgacctgctgagctgtaacctgcccccagctcctccctgt
gaaggtggcctccagatgaccctgaccaatcccggcgagtgcagacctaacttcacctgt
gcctgcaggaaggatgagtgcagaagggagtccccgccctcttgccccccacatcggaca
ccgacccttcggaagacccagtgctgtgatgagtacgagtgtgcctgcaactgtgtcaac
tccacggtgagctgcccgctggggtacctgacttcagccatcaccaacgactgtggctgc
accacgacaacctgcttccctgacaaggtgtgtgtccaccgaggcaccgtctaccccgtg
ggccagttctgggaggagggctgtgacgtgtgcacctgcaccgacttggaggactctgtg
atgggcctgcgcgtggcccagtgctcccagaagccctgtgaggacaactgccggtcgggc
ttcacttacgtcatccatgaaggcgagtgctgtggaaggtgtctgccatctgcctgtgag
gtggtcattggttcaccgcggggggactcccagtctcactggaagaatgttggctctcac
tgggcctcccctgacaacccctgcctcatcaatgagtgtgtccgcgtgaaggaagaggtc
tttgtgcaacagaggaatgtctcctgcccccagctggacgtccctacctgcccgacgggt
tttcagctgagttgtaagacctcagagtgttgtcccacctgtcgctgcgagcccctggag
gcctgcatgctcaatggcactatcattgggcccgggaaaagtctgatgatcgatgtgtgc
acgacatgccactgtaccgtccaggtgggggtcatctctggattcaagctggagtgcagg
aagaccacctgtgaggcctgtcccccgggttacaaggaagagaagaaccaaggtgaatgc
tgtgggaaatgtctgcctacggcttgcaccattcagctaagaggaggacagatcatgaca
ctgaagcgtgatgagatgctgcaggatggctgtgatagtcacttctgcaaggtcaatgag
agaggagagtacatctgggagaagagggtcacaggctgcccgcctttcgatgaacacaag
tgtttggccgagggagggagagtcatgaaaattccaggcacctgctgtgacacatgtgag
gagctggaatgcaaggatatcactgcccggctgcggtatgtcaaggtgggagactgtaag
tctgaagaggaagtgaacattcattactgtgagggtaaatgtgccagcaaagccttgtac
tccatcgacacggaggatgtgcaggaccagtgttcctgttgctcgcccactcggacggag
cccatgcaggtgcccctgcgctgcgccaatggctcccttgtctaccacgagatcctcaac
gccatgcagtgcaggtgctcccccaggaagtgcagcaagtga

KEGG   Ursus arctos horribilis: 113258151
Entry
113258151         CDS       T05909                                 

Gene name
COL9A3
Definition
(RefSeq) LOW QUALITY PROTEIN: collagen alpha-3(IX) chain
  KO
K08131  collagen, type IX, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113258151 (COL9A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113258151 (COL9A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113258151 (COL9A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113258151 (COL9A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113258151 (COL9A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:uah00535]
    113258151 (COL9A3)
Proteoglycans [BR:uah00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   113258151 (COL9A3)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 113258151
NCBI-ProteinID: XP_026358612
UniProt: A0A3Q7X7R0
LinkDB
Position
Unknown
AA seq 796 aa
MAGAPTLVLLLLGHLLAAAMSEAQVSAGWSGVLGSHRRTAAARVGVRGSEMGRRTARLGD
DHPPTPASHPGPASEPGTRGRLETRRSPEARGPSRRGAESGRRTEPLQMPRHLVDEPRTL
PCPRLAGSAAAWAAATREKVGPQGPPGPQGPPGKPGKDGIDGEVGPPGLPGSPGPKGAPG
KPGKPGEAGLPGLPGVDGLTGQDGPPGPKGAPGERGSLGPPGPPGLGGKGLPGPPGEAGE
SGVPGGMGLRGPPGPSGLPGLPGPPGPPGPPGHPGVLPEGATDLQCPAICPPGPPGPPGM
PGFKGPTGYKGEQGEVGKDGEKGDPGPPGPAGIPGTVGLQGPRGLRGLPGPAGPPGDRGP
IGFRGPPGIPGAPGKVGDRGERGPEGFRGPKGDLGRPGPKGVPGLAGPGGEPGMPGKDGR
DGVPGLDGEKGEAGRNGAPGEKGPNGLPGLPGRAGSKGEKGELGQAGELGEAGPSGEPGI
PGDVGVPGERGEAGHRGSAGALGPQGPPGAPGIRGFQGQKGSMGDPGLPGPQGLRGGTGD
RGSGXAAGLKGDQGAAGSDGIPGDKGELGPGGPVGPKGESGSRGELGPKGIQGPNGTSGV
EGLPGPPGPMGFPGVQGVPGITGKPGVPGREASEQHIRELCGGMLSEQIAQLAAHLRKPL
APGSAGRPGPAGPPGPPGPPGSIGHPGARGPPGYRGPTGELGDPGPRGAPGDRGDKGSAG
AGLDGPDGDQGLQGPQGVPGVSKDGRDGAHGEPGPPGDPGLPGAVGAQGTPGICDTSACQ
GAVMGGGGEKSGSRSS
NT seq 2391 nt   +upstreamnt  +downstreamnt
atggccggagcccccacgctggttctgctcctgctcgggcatctcctggccgcggccatg
agcgaggcacaggtgagcgcgggctggagcggcgtgctgggctcgcaccggagaacagcg
gcggccagggtgggagtccggggctctgagatggggcgtcgaaccgcgcggctgggggac
gaccacccccccacccccgccagccatcctgggcccgcgagtgagcctggaactcgagga
cgcttggagacccggcgcagccctgaagcccgtggcccatctcgccgtggggctgagtct
gggcgccgcaccgagcccctgcagatgccgagacacctcgtggatgagcccaggacgttg
ccctgtcctcgcctggccgggtcggccgctgcctgggccgcggccaccagagagaaagtg
ggacctcaaggcccccctggcccccaagggccacctgggaagccgggcaaggatggcatt
gacggagaagttgggcctcccggtctgcccgggtccccgggaccaaaaggggccccaggg
aaaccagggaaaccaggagaggccgggctgccgggactgcctggcgtggacggtctgaca
gggcaggatggaccccccggccccaagggcgcgcctggagagcggggaagtctgggaccc
ccggggcctcccgggctggggggcaaaggcctccccggaccccccggagaggcaggagag
agtggcgtcccaggtggaatgggcctccggggccccccgggaccctccggactaccaggc
ctccctggccccccgggacctcctggaccccccggtcaccccggggtcctccctgaaggc
gccactgaccttcagtgcccggccatctgccccccaggtcctcccggccccccaggaatg
ccagggttcaaggggcccaccggctacaaaggggagcaaggagaggtcggaaaggacggc
gagaagggtgatcctggcccccctgggcccgcgggcatcccaggcactgtggggctgcag
gggcctcggggacttcgaggcctgccagggccggctggacccccaggggatcggggtccc
attggattccgaggaccaccagggatcccaggagcccccgggaaagtgggtgacagaggc
gagaggggcccagagggtttccgcggccccaagggtgaccttggcagacctggtcccaaa
ggagtccctgggctagcggggccgggcggagagccgggtatgccgggcaaggacggccgg
gatggtgttccgggactcgacggtgagaagggagaggctggtcgcaacggcgccccagga
gagaagggtcccaacgggctgccgggcctcccgggacgagcagggtccaagggcgagaag
ggagaactgggccaagctggagagctgggggaggctggcccctcaggagagcctggcatc
ccaggagatgttggcgtgcctggggagcgtggagaggctggccaccggggctcggcgggg
gctctgggcccacaaggccctcctggagcccctggcatccgaggcttccagggccagaag
ggcagcatgggtgaccctggcctgccaggcccccagggcctccgaggtggcacaggtgac
cgggggagtggctgagccgccggccttaagggagaccagggagctgcaggttccgatggc
attcctggggacaaaggagagctgggtcccggtggtccagtcggacccaaaggagagtct
ggcagtcgaggggagctgggcccgaagggcatccagggtcccaacggcaccagcggcgtc
gagggcctcccaggcccgcccggccccatgggcttcccgggagtacaaggcgtgcctggc
atcaccgggaaacccggagtcccgggccgagaagccagtgagcagcacatcagggagctg
tgcggagggatgctcagtgaacaaattgcacagttagctgctcacctgaggaagccttta
gcgcccggatccgcaggtcggcctggtccagctgggcccccaggccccccgggccccccg
ggctccatcggccaccccggtgcccgagggccccctggataccgtggtcccactggagag
ctgggagacccagggcccagaggggctccgggagacagaggagacaaaggctcggcgggc
gcgggtctggacggaccggacggagaccaggggctgcaaggaccgcaaggcgtgccaggc
gttagcaaagacggccgcgatggggcccacggcgagcccggacctccaggtgatcctggc
ctccctggtgctgtcggtgctcaggggacccccggcatctgtgacacctcggcctgccaa
ggagccgtgatgggaggcggcggggaaaaatcaggttctagaagctcctaa

KEGG   Ursus arctos horribilis: 113258464
Entry
113258464         CDS       T05909                                 

Gene name
LAMB4
Definition
(RefSeq) laminin subunit beta-4
  KO
K06245  laminin, beta 4
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113258464 (LAMB4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113258464 (LAMB4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113258464 (LAMB4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113258464 (LAMB4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113258464 (LAMB4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113258464 (LAMB4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113258464 (LAMB4)
   05145 Toxoplasmosis
    113258464 (LAMB4)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N LAGLIDADG_1
Other DBs
NCBI-GeneID: 113258464
NCBI-ProteinID: XP_026359034
UniProt: A0A3Q7WXP7
LinkDB
Position
Unknown
AA seq 1745 aa
MRFPLAFFLHLGLLSYAKAQDECKRGSCHPMTGDLLVGRSAQLTASSTCGLDGAQKYCIL
SYLEGEQKCFICDSRFPYDPYTQSRSHTIENVITSFEPEREKKWWQSENGLDHVSIRLDL
EALFQFSHLILTFKSFRPATMLVERSADYGHTWKVLKYFAKDCAASFPNITSGQAQGVGD
LVCDSKYSDIEPSTGGEVVLKVLDPSFEIENPYSPYIQDLVTLTNLRINFTKLHTLGDTL
LGRRENGSLDKYYYAVYKMVVRGSCFCNGHARECGPAHKVRGDSFSPPGMVHGRCVCQHD
TDGPNCERCRDFFHDAPWRPGAGLEDSTCRACHCNSHSDRCHFNMTVYVRSGGRSGGVCE
DCRHNTEGQHCDRCKPLFYRDPLKAMSDPYACIPCECDPDGTLSGGMCVSHSDPALGTVA
GQCLCKENVEGAKCDQCKPNHHGLSAADPVGCQPCNCNPFGSLPLSTCDVDTGQCLCQPF
ATGPHCEECIAGYWGLEDHLHGCSPCDCDIGGAYSNMCSPKDGQCECRPHITGRRCTEPA
PGYFFAPLNFYLYEAEEATPLQGLAPLGSVTFGQVPALHVVFREPVPGTPVAWTGPGFAR
VLPGAGLRFVVNNISVPMDFTIAVRYETQPAADWTVQVLVNPLGGSERCPRESPRLQPPS
FPLPPASRIVLLPTPICLEPDVQYSIDVYFSQPLEGGSHAHSHILVDSLGLIPQIDSLGN
FCSKQDLDEYKQHSCAEIAAKTGHQVLPGVCERLVVSMSARLHNGAVACKCHPQGSVGPS
CSRLGGQCQCKLHVTGRCCDRCSPGSYGLGHHGCHPCHCHPQGSKSPVCDQMTGQCSCQG
EVAGRRCDQCLEGYFGFPNCRPCLCNGFAELCDPETGSCFNCGGFTTGRNCERCIDGYYG
NPPSGRSCRPCPCPDVPSSSRYFAHSCYQHPWSSDVICNCLQGYAGKQCGECSAGFYGNP
KVSGAPCQPCACNNNTDATDPESCSQLTGECLKCLHNTRGPHCQFCKPGHFGSAVNQTCR
RCSCHPSGVNPAECPPGQEACLCDPDNGTCPCLPNVTGQACDRCADGYWNLVPSKGCQPC
DCDPRTSHSGHCDQLTGQCPCKLGYDGKHCSGCKENYYSDPLGRCIPCDCNREGAQKPTC
DQDTGMCRCREGVSGPRCDRCARGHGREFPACLRCHGCFDPWDHTVSALSKAVQGLMRLA
ANMRDKREPVLVCEADFKGLRENMSEIERILKHPVFSSEFLTVKDYQDSAREEVMQLSVQ
LKAVHEFPDLKERIRRTRNEANLLLEDLQREINLHSRARNASIMDSSENIKKHYQRSSSA
EKKSNETASIIKHSEKTRNDLLPIFDTLASKQNLSLKKLKQITVPDIQILNEKVCGEPGD
RPCVLSSCDGPSCRGSLPPSSDALQRAQKAESGIHNLSDQVQGWKHRLKNVSKLAEVSKN
NALQLSERLRNMKNHSESEEKMSLLIKKLKKFLSEESVPPEDIEKVANRVLDIHLPVTAR
NLTRELDTLRKLMPLCEDYGTDEDRLRKAGEEARKVLVKAKDAEKAANVLLNLDKTLNKL
QQVQVTQGRVNSTITQLTAEIMKIKNNVLQAENEAKDAKNELDLAKQRSVLEDGLSQLQT
KLQRNREQATRVTAQAASAQRQAAGLEQEFAELKNQYVALQQKTSATGLTKVTLEKVEQL
KDAAEKLATDTEDKIRRIADLEKKIQDLHLSRQEKADQLKRLEDQVVAIKNEIAEQENKY
ATCYS
NT seq 5238 nt   +upstreamnt  +downstreamnt
atgcgatttccactggccttttttttgcaccttggattgctcagttacgcgaaagcccaa
gatgaatgcaaaaggggctcctgtcaccctatgactggcgatctcctggtgggtcgcagc
gcacagctcacagcttcttctacctgtgggctggacggagcccagaaatactgcatcctc
agctacctcgagggggaacaaaaatgcttcatctgcgactcgagatttccgtatgaccca
tacacccaatcccgcagccacaccattgagaacgtaattacaagttttgaaccagagaga
gaaaagaaatggtggcagtcggaaaacggtcttgatcatgtcagcatccgattggaccta
gaggcattattccagttcagccacctcattctgaccttcaagagttttcggcctgccacg
atgttagttgaacggtctgcagactatgggcacacctggaaagtactcaaatattttgca
aaagactgcgctgcttcctttcccaatatcacatcgggccaggcccagggagtgggagac
cttgtttgtgactccaaatattcggatattgaaccctccactggtggcgaggttgtttta
aaagttctggatcccagctttgaaatagaaaacccttacagcccctacatccaggacctg
gtgacattgacaaacctgaggataaactttaccaaactccataccctgggggatacgttg
ctgggaaggagggaaaacgggtcccttgataagtactactatgctgtgtataagatggtc
gtgcggggcagctgtttttgcaatggccacgcccgcgagtgcggcccagcgcacaaggtg
cggggagactcgttcagcccgcccggaatggttcacggtcgctgcgtctgtcagcacgat
acagatggcccgaactgtgagaggtgcagggacttcttccacgacgctccctggaggcca
ggagcaggcctcgaggacagcacatgcagagcttgtcactgcaacagccactctgaccgc
tgtcacttcaacatgacggtgtacgtgaggagcggcggccgcagcgggggcgtgtgtgaa
gactgtcgccacaacaccgaggggcagcactgtgaccgctgcaaacccctcttctacagg
gacccgctcaaggccatgtccgatccatacgcctgcatcccttgtgaatgtgacccggat
gggaccttatcgggtggcatgtgtgtgagccactctgatcctgctttggggactgtggcc
ggccagtgcttgtgcaaggagaacgtggaaggagccaagtgcgaccagtgcaagcccaac
caccacgggctgagcgctgcggatcccgtgggttgtcagccctgcaactgtaaccccttc
gggagtctgcccctctcaacctgcgatgtggacacggggcaatgcttgtgccagccgttt
gccacgggaccacactgtgaagaatgcatcgctggatactggggactggaagaccacctc
catggctgttctccctgcgactgtgatatcggaggtgcttattccaacatgtgctcaccc
aaggatgggcagtgtgaatgccgcccacacatcactggccgccgctgtactgaaccggcc
cccggctacttctttgctcctttgaatttctatctttacgaggcagaagaagccacccca
ctgcaaggacttgcgcctttgggctcagtgacttttggccaggtgcctgcccttcacgtt
gttttccgagaaccggttcctggaacccctgttgcgtggactggacccggatttgccagg
gttctccctggggctggcttgagatttgtcgtcaacaacatttccgtgcccatggacttc
accattgccgttcgctacgaaactcagcccgcagccgactggactgtccaggtcttggtt
aacccccttggaggaagcgagcgctgtccacgtgagagtccacggctacagcctccgtct
ttccctttaccgccggcttcgagaatcgtgctgcttcccacacccatctgtttagaacca
gatgtacagtattccatagatgtctatttttcccagcctttggaaggagggtcccacgct
cattcacacatcctggttgattcacttggtctgattccccaaatcgattcactggggaat
ttctgcagcaagcaggatttagatgaatataagcagcacagctgtgctgaaattgccgcg
aagacgggacatcaggtgctccccggtgtgtgtgaaaggctggtagtcagcatgtccgcc
aggctacacaacggggcagtagcctgcaagtgtcacccccagggttcggtggggcccagc
tgcagccgacttgggggccagtgccagtgcaagcttcacgtgaccgggcgctgctgtgac
aggtgctcaccaggaagctatggtttgggacatcatggctgtcacccatgtcactgtcac
cctcaaggctctaagagccctgtatgtgaccaaatgacgggacagtgctcctgccagggg
gaggtggcaggccgccgttgcgatcagtgcctggaaggctacttcggattccccaactgc
cgcccttgcctttgtaatgggttcgctgaactttgtgatccagagacaggctcatgcttc
aattgcgggggttttacaactggaagaaactgtgaaagatgcatcgatggttactatggg
aatccgccttcaggacggtcctgccgtccttgcccgtgtccagacgttccctcaagtagc
aggtattttgcccattcctgttatcagcatccttggagctcagatgtgatctgcaattgt
cttcaaggttatgcaggtaagcagtgtggagaatgctctgcgggtttctacggaaatccc
aaagtttcgggagcgccttgccagccgtgtgcctgcaacaataacaccgatgcgactgac
cccgagtcctgcagccagctgacaggggagtgcctcaagtgtttgcacaacacgcggggg
ccccactgccagttctgcaaaccgggtcactttggatcagccgtcaatcagacctgcagg
aggtgctcctgccatccttccggggtgaatcccgctgagtgtccccctggtcaggaagct
tgcctctgtgaccctgacaatggcacatgcccttgtctgcccaacgtcacaggccaggcc
tgtgaccgttgtgcagatggatactggaatctggtccccagcaaaggatgccagccttgc
gactgtgacccccggacctctcacagcggtcactgcgaccagcttacaggccagtgtccg
tgtaaattaggctatgatgggaaacactgcagtgggtgcaaggaaaattattacagtgat
ccactggggcgatgcattccatgtgactgcaacagggaaggagcccagaagcccacctgt
gaccaggacacgggcatgtgccgctgccgggagggggttagcggcccgcgatgcgatcgc
tgtgcccggggtcatggccgggaattccctgcctgtctgcggtgtcacggctgctttgat
ccctgggaccacacggtttctgccctctccaaagcggtgcaagggttaatgaggctggct
gctaacatgcgagataaaagagagcctgtgctggtctgcgaggccgacttcaaaggcctc
agagagaacatgtctgaaatagaaaggattctaaaacatcctgttttctcatcggaattc
ttaaccgtcaaggattatcaggactctgctcgggaagaagtcatgcagctaagcgtgcaa
ctgaaagcagtgcatgagtttccagatctgaaagaaaggataagaagaacgaggaatgaa
gccaacctcttacttgaagatcttcagcgagaaatcaatttgcactcccgtgcccgtaac
gcaagcatcatggattcctcagagaacatcaagaagcattatcagagatcgtcatctgct
gaaaagaaaagtaatgaaaccgcttctatcataaaacactctgaaaaaaccaggaacgac
ttacttcccatctttgatacgctagcctcaaaacaaaacttgtcattgaaaaaattaaag
cagattacggtaccagatatccaaatattgaacgaaaaggtgtgtggagaaccgggggac
aggccgtgcgtgctctcgtcctgtgatggccccagctgtcgtggctccctgcccccctcg
agcgatgccctccagcgagctcagaaagcagagtctgggattcataatttgagcgaccag
gttcagggttggaagcaccggctcaaaaatgtaagtaaactggcagaagtctccaaaaac
aacgccttacagctaagtgagagactgaggaatatgaaaaaccacagcgaatctgaagag
aaaatgagtcttttaatcaaaaagctgaaaaagtttttgtcagaggaaagtgtgcctccg
gaggatatagagaaggttgcaaatcgcgtccttgacatccatctaccagtgacagcgcgg
aatctaacccgtgaacttgacacactacggaagctgatgccgctctgtgaggactacggg
accgatgaggacaggctacgcaaggcaggagaggaagcccgaaaggttttggtgaaggca
aaagacgctgaaaaagcagcaaatgttctattaaatcttgacaaaacgctgaataaattg
caacaagttcaagtcacccaaggacgcgtaaattctaccatcacacagctgacggcagag
ataatgaaaattaaaaacaacgttctgcaggctgaaaatgaagccaaggacgctaagaat
gagctggacttagcaaagcagcgatcggtgctggaggacgggctttcccagctgcagacc
aagttgcaaaggaaccgagagcaagccacccgtgtgacggctcaggccgcatcggcccaa
cgccaggctgcgggccttgagcaggagtttgcagagctgaaaaaccaatacgttgctctc
cagcagaagaccagcgctacaggactgaccaaggtaacgttagaaaaagtggagcagctc
aaagatgcagcagaaaaactggccaccgatacagaggacaagataagacgaatagcagat
ttagaaaagaagatccaagatctgcatctgagtagacaagaaaaagctgaccaactgaaa
cgattggaagatcaagttgttgccatcaaaaacgagattgctgagcaagagaataaatat
gctacttgctatagttag

KEGG   Ursus arctos horribilis: 113258474
Entry
113258474         CDS       T05909                                 

Gene name
LAMB1
Definition
(RefSeq) laminin subunit beta-1
  KO
K05636  laminin, beta 1
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113258474 (LAMB1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113258474 (LAMB1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113258474 (LAMB1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113258474 (LAMB1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113258474 (LAMB1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113258474 (LAMB1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113258474 (LAMB1)
   05145 Toxoplasmosis
    113258474 (LAMB1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N CorA F5_F8_type_C DUF348
Other DBs
NCBI-GeneID: 113258474
NCBI-ProteinID: XP_026359046
UniProt: A0A3Q7WXR5
LinkDB
Position
Unknown
AA seq 1786 aa
MGLLQVFAFSFLALCGALVRAQEPEFSYGCAEGSCYPATGDLLIGRAQKLSVTSTCGLHK
PEPYCIVSHLQEDKKCFICNSQDPYHETLNPDSHLIENVVTTFAPNRLKIWWQSENGVEN
VTIQLDLEAEFHFTHLIMTFKTFRPAAMLIERSSDFGKTWGVYRYFAYDCESSFPGISTG
PMKKVDDIICDSRYSDIEPSTEGEVIFRALDPAFRIEDPYSPRIQNLLKITNLRIKFVKL
HTLGDNLLDSRMEIREKYYYAVYDMVVRGNCFCYGHASECAPVDGVNEEVEGMVHGHCMC
RHNTKGLNCEQCMDFYHDLPWRPAEGRNSNACKKCHCNEHSSSCHFDMAVYLATGNVTGG
VCDDCQHNTMGRSCEQCKLFYFQHPGRDVRDPNVCERCTCDPAGSQNGGICDSYTDFSTG
LIAGQCRCKLHVEGEHCEICKEGFYGLSAEDPFGCKSCACNPLGTIPGGSPCDSETGYCY
CKRLVTGQHCDQCLPEHWGLSNDLDGCRPCDCDLGGALNNSCSAESGQCGCRPHVIGRQC
NEVESGYYFTTLDHYIYEAEDANFGPGVSIAERQYIQDRIPSWTGAGFVRVPEGAYLEFF
IDNIPYSMEYDILIRYEPQLPDRWEKAVITVQRPGKIPTSSRCGNTIPDDDNQVVSLSPG
SRYVVLPRPVCFEKGVNYTVRLELPQYTSSDSDVESPYTLIDSLVLMPFCKSLDIFTVGG
SGDGLVTNSAWETFQRYRCLENSRSVVKTPMTDVCRNIIFSISALLHQTGLACECDPQGS
LSSVCDPNGGQCQCRPNVVGRTCDRCAPGTFGFGPSGCRPCECHLQGSVNAFCHPVTGQC
QCFQGVYARQCDRCLPGYWGFPSCQPCQCNGHAEDCNSGTGACLGCQDYTTGHNCERCLA
GYYGDPIIGSGDHCRPCPCPDGPDSGRQFASSCYQDPVTLQLACVCHPGYIGSRCDDCAP
GFFGNPSDVGGSCQPCRCHHNIDTTDPEACDKETGRCLRCLYHTEGEHCQLCRAGYYGDA
LRQDCRKCVCNYLGTVPEHCNGSDCQCDKTTGQCSCLPNVVGQNCDRCAPDTWQLASGTG
CDPCSCDPAHSFGPSCNEFTGQCQCMPGFGGRTCSECQELFWGDPNVECRACDCDPRGIE
TPQCDQSTGQCVCVEGVEGPRCDKCTRGYSGVFPDCAPCHQCFALWDVIIAELTNRTQKF
LEKAKALKISGVIGPYRDTVDSVEKKVHEIRDILAQSPAAEPLKNIGDLFEEAEKLTKDV
SEKMAQVEVKLSDTASQSNSTARELDSLQTEAENLDNTVKELAEQLEFIKNSDIRGALDS
ITKYFQMSLEAEERVNASTTDPNSTVEQSALTRDRVEDLMMDREAQFREKQEEQARLLDE
LAGKLQSLDLSAAAEMTCGTPPGASCSETECGGPGCRTDEGEKKCGGPGCGGLVTVAHGA
WQKAMDFDRDVLSALAEVEQLSKMVSEAKLRADEAKLNAQNVLLKTNATKEKVDKSNEDL
RNLIKQIRNFLTQDSADLDSIEAVANEVLKMEMPSTPQQLQNLTEDIRERVESLSQVELI
LQQSAADVARAEMLLQEAKRASKNATDVKVTADTVKEALEEAEKAQIAAEKAIKLAVEDI
QGTQNLLTSIESEATASEETLLNASQRISELERNMEELKRKAAQNSGEAEYIEKVVYTVK
QSADDVKEDLDSEVDEKYKKVEKLIAQKTEESADARRKAEMLQNEAKALLAQANSKLQLL
KDLERKYEDNQKYLEDKAQELVRLEGEVRSLLKDISQKVAVYSTCL
NT seq 5361 nt   +upstreamnt  +downstreamnt
atggggctgctccaggtgttcgctttcagtttcctagccctgtgcggagccctggtgcgc
gctcaagaacccgaattcagctatggctgcgcggagggcagctgctaccccgccacgggc
gaccttctcatcggccgagcacagaagctctcggtgacctcgacgtgcgggctgcacaag
cctgagccgtactgtattgtcagccacctccaggaggacaaaaaatgcttcatctgcaat
tcacaagatccttatcacgagaccctcaatcctgacagtcatctcattgaaaatgtggtc
actacatttgctccaaaccgccttaagatttggtggcaatctgaaaatggcgtggaaaat
gtaacgatccaactggatttggaagcagaattccattttactcatctcataatgactttc
aagacattccgtcctgccgctatgctgatagaacgatcgtctgacttcgggaaaacctgg
ggcgtgtacagatactttgcctatgactgcgagagctcatttccaggcatttccaccggc
cccatgaagaaagtggacgatataatctgtgattcccgctattccgacatcgaaccctca
acggaaggagaggtgatatttcgtgctttagatcctgctttcagaatagaagatccttac
agtcccaggatacagaatctcttaaaaatcaccaacttgcgaatcaagtttgtgaagcta
cacactctgggtgataaccttttggattccagaatggaaatcagagagaagtattactat
gcggtgtatgacatggtggttcgagggaattgcttctgctacggccatgccagcgaatgc
gcccccgtggatggagtcaatgaagaagtagaaggaatggttcacgggcactgcatgtgc
aggcacaacaccaagggcctgaactgcgaacagtgcatggatttctaccacgatttacct
tggagacccgctgaaggtcgaaacagcaatgcttgtaaaaagtgtcactgcaacgaacat
tcgagctcctgccactttgacatggcggtgtacttggccactggaaacgtcaccggaggg
gtgtgtgatgactgtcagcacaacacgatgggacgcagctgtgagcagtgcaagctgttt
tacttccagcacccggggagagacgtccgggacccgaacgtctgcgaacgctgcacctgt
gacccagctgggtctcagaacggggggatctgtgacagttacacggacttttccaccggc
ctcattgctggtcagtgccggtgtaagttacacgtggaaggggagcactgtgaaatttgc
aaagaaggcttctacggtttaagtgctgaagatccgttcggttgcaaatcttgtgcttgc
aatcctctgggaacgattcctggcgggagtccttgtgattctgagaccggttactgctac
tgtaagcgtctggtgacaggacagcattgtgaccagtgtctgccagagcactggggctta
agcaatgatttggatggatgtcgaccttgtgattgtgaccttggaggggccttgaacaac
agctgctcggcggagtcaggccagtgcgggtgccgcccccacgtgatcgggcgtcagtgc
aacgaggtggaatccgggtactactttaccaccttggatcactacatctacgaagcagaa
gatgccaactttgggcctggggtcagcatagcggagcggcagtacatccaggaccggatt
ccttcctggaccggagcgggcttcgtccgagtgcccgaaggggcttacttggagtttttc
attgacaacataccgtattccatggagtatgacatcttgattcgctacgaaccacagctc
cctgaccgctgggaaaaagctgtcatcacagtgcagcgacctggaaagatcccaaccagc
agccgatgtggtaacactattcccgatgatgacaaccaggtggtgtccttatcgcctggc
tcgaggtatgtcgtccttccacgccccgtgtgctttgagaagggtgtgaactacacagtg
aggctggagctgccccagtacacgtcctccgatagcgacgtggagagtccctacacgctc
atcgattctcttgttctcatgccgttctgtaaatcactggacatcttcaccgtgggaggg
tcaggggatggattggtcaccaacagcgcctgggaaacctttcagagatacagatgtctg
gagaacagtaggagcgttgtgaaaacaccaatgaccgacgtatgcagaaacataatcttc
agtatttctgccttgttacaccagacaggcctggcttgtgaatgcgacccccagggctca
ctaagttccgtctgcgaccccaacggaggccagtgccagtgccggccgaacgtggttgga
agaacctgcgacagatgtgcccctggcaccttcggctttggccccagcggatgcagacct
tgtgagtgccacctgcaaggatccgtcaatgccttctgccaccccgtcaccggccagtgc
cagtgtttccagggcgtgtacgctcggcagtgtgaccggtgcttacctgggtactggggc
ttcccaagctgccagccctgccagtgtaacggccacgccgaggactgcaactcagggacg
ggggcgtgcctgggctgccaggactacaccacggggcacaactgtgaaaggtgcctggcc
ggttactatggcgatcccatcatcgggtcaggagatcactgccgcccttgtccttgccca
gacggtcccgacagcggccgccagttcgccagtagctgctatcaggaccccgtgacttta
cagctggcttgtgtctgtcacccgggatacattggctccaggtgtgacgactgcgccccc
ggcttctttggcaatccctcggatgtggggggctcctgtcagccttgccgctgtcaccac
aacatcgacacgacagacccggaagcctgcgacaaggagacggggcggtgcctcaggtgc
ctctaccacacggagggggagcactgccagctgtgccgggccgggtactacggggacgcc
ctgcggcaggactgtcgaaagtgtgtctgcaattacctgggcacggtgccagagcactgc
aacggctccgactgccagtgcgacaaaaccacgggccagtgctcgtgtcttcccaacgtg
gtcgggcagaactgtgaccgctgcgcgcccgacacctggcagctggccagcgggaccggc
tgtgacccgtgcagctgcgatcctgctcattccttcgggccgtcctgcaacgagttcacg
gggcagtgtcagtgcatgcccggtttcgggggccgcacctgcagcgagtgccaggagctc
ttctggggagaccccaacgtggagtgccgagcctgtgactgtgaccccaggggcattgag
acaccgcagtgtgaccagtccactggccagtgcgtctgcgtcgagggggtcgagggtcca
cgctgcgacaagtgcacgcgcgggtactcaggggtcttccccgactgcgctccctgccac
cagtgcttcgctctctgggatgtgatcatcgccgagttgaccaatcggacccagaagttc
ctggagaaagccaaggccttgaagatcagtggtgtgatcgggccgtaccgggacacagtg
gactctgtggagaaaaaagtccacgagatcagagacatccttgcccagagccccgccgcg
gagccgctgaaaaacattggggatctctttgaggaagcagagaaattaaccaaagatgtt
tcagaaaagatggctcaagtagaagtgaaattgtccgacacagcttcacaaagcaacagc
acagccagagaactagattctctgcagacagaagcagaaaacctggacaacacagtgaaa
gagcttgctgagcaattggaattcatcaaaaactcagatattcggggtgccttggatagc
attaccaagtacttccagatgtctctagaggcagaggagagggtgaacgcctccaccaca
gatcccaacagcaccgtggagcagtcggcgctcacccgggacagagtagaagacttgatg
atggatcgagaagcccagttcagggagaaacaagaggaacaggccaggcttctggatgaa
ctggcaggcaaactacaaagtctagacctttcagcggctgccgaaatgacctgcgggacg
cccccaggggcctcgtgttccgagacggagtgcggcggccctggctgcagaacggatgaa
ggcgagaagaagtgtgggggacctggctgcgggggcctggtcacggtggcacacggagcc
tggcagaaagccatggacttcgaccgagacgtcctgagcgcgctggccgaggtggagcag
ctctccaagatggtctctgaagcaaaactgagggcagatgaggcaaaattgaacgctcag
aacgtcctgctgaaaaccaacgctaccaaagaaaaagtggacaagagcaacgaggacctg
aggaatctgatcaagcaaatcagaaactttctgacccaggacagtgcggatctggacagc
attgaagcagtggctaacgaagtgctgaagatggaaatgcccagcaccccgcagcagtta
cagaacctgaccgaagatattcgggaacgagttgaaagcctgtctcaggtggagcttatt
ctacagcaaagcgctgcggacgtggccagagcggagatgctgctgcaagaagctaaaaga
gccagcaaaaacgcaaccgatgtcaaagtcacagctgacacggtgaaggaggcgctggag
gaggcagagaaggcccagatcgccgccgagaaagccattaagctagcagttgaagacatc
caaggaacccagaacctgctaacttcgattgagtctgaagccactgcttctgaggagacc
ttgctcaacgcctcccagcgcatcagcgagctcgagaggaacatggaagaactgaagcgg
aaggcggcccagaactctggggaggcagagtatatcgagaaagtggtgtacactgtgaag
cagagcgcagatgatgttaaagaggatctggacagtgaagttgatgaaaagtataagaag
gtagagaaattaattgctcaaaaaaccgaagaatctgcagatgccagaaggaaagctgaa
atgctacaaaatgaagcgaaagcacttctggctcaagccaacagcaagctgcagctcctg
aaagatttagaaagaaaatatgaggacaatcaaaaatatttagaagataaagctcaagaa
ttagtaagactggaaggagaagtccgttcgctcctaaaggatataagccagaaagttgct
gtttatagcacctgcttgtaa

KEGG   Ursus arctos horribilis: 113258790
Entry
113258790         CDS       T05909                                 

Gene name
LAMA5
Definition
(RefSeq) laminin subunit alpha-5
  KO
K06240  laminin, alpha 3/5
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113258790 (LAMA5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113258790 (LAMA5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113258790 (LAMA5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113258790 (LAMA5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113258790 (LAMA5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113258790 (LAMA5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113258790 (LAMA5)
   05145 Toxoplasmosis
    113258790 (LAMA5)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_I Laminin_G_2 Laminin_II Laminin_G_1 Laminin_B Laminin_G_3 Glyco_hydro_16
Other DBs
NCBI-GeneID: 113258790
NCBI-ProteinID: XP_026359644
UniProt: A0A3Q7WZA8
LinkDB
Position
Unknown
AA seq 3707 aa
MAKLGVRLRAGSARRAAGLRGPGPLLLVGLALLGAARALESADGGFSLHPPYFNLAEGAR
IAASATCGEEAPARGAPRPTEDLYCKLVGGPVAGGDPNQTIQGQYCDICTAANSNRAHPV
SNAIDGTERWWQSPPLSRGLEYNEVNVTLDLGQVFHVAYVLVKFANSPRPDLWVLERSTD
FGQTYQPWHFFASSKRDCLERFGPQTLERITRDDQVVCSTEYSRIVPLENGEIVVSLVNG
RPGAMNFSYSPLLRDFTRATNIRLRFLRTNTLLGHLMGKALRDPTVTRRYFYSIKDISIG
GRCVCHGHADVCDAPDPTDPFRLQCACQHNTCGGSCDRCCPGFNQRPWKPATTDSANECQ
SCNCHGHAHDCFYDPEVDRRNASRNQDNVYQGGGVCIDCQHHTTGINCERCLPGFYRAPD
HPLDSPHACRRCNCESDFTDGTCADLTGRCFCRPNFTGEHCDACAEGFSGFPRCYPVPSF
SPNGTGEQVLPAGQIVNCDCSAAGTQGNACRKDPRLGRCVCKPGFQGTHCELCAPGFYGP
GCQPCQCSSPGVADGDCDRDSGQCTCREGFEGATCDRCAPGYFHFPLCQLCGCSPAGTLP
EGCDEAGRCPCRPEFDGPHCDRCRPGHHGYPNCRACTCDPRGTLDQLCGVGGTCHCRPGY
AGAACQECSSGFHGFPDCAPCHCSTEGSLHASCDPRSGQCSCRPRVTGPRCDMCVPGAYN
FPSCEAGSCHPAGLAPADHSLPEAQAPCTCRAHVEGPSCDRCKPGFWGLSPSNPEGCIRC
SCDPRGTLGGLTECQPGDGQCSCKPHVCGQTCAACRDGFFGLDRADYFGCRSCRCDVGGA
LGQGCDPRTGACRCRPNTQGLTCSEPARDHYVPDLHGLRLELEEAATPDGHTVRFGFNPL
EFESFSWRGYAQMTPIQPRIVAKLNVTSPDLFRLVFRYVNRGPTSVSGRVSVEEEGRFAT
CTNCTEQSQLVTFPPSIEPAFVTVPQRGFGEPFVLNPGTWALLVEAEGVLLDYAALLPGV
YYEAALLQLRVTEACTFRPTGQRSGQTCLLYTHLPLDGFPSAAGPEALCRHDNSLPRPCP
MEQLSPSHPPLAACLGSDVDVQLQVTVPQPGRYALVVEYANEDARQEVGVAVHTPQQAPQ
QGAVMLHPCPYSTLCRGTALDAQHHLATFHLDTEASVRLTAEQARFFLHSVTLVPVETFS
LEFVEPRVHCVSSHGAFSPSSATCLLSRFPKPPQPIILRDCQVLPLPPSLPLAHSQDLTP
GAPPSGPQPRPPTAMDPDAEPTLLRHPQGTVVFTTHVPALGRYAFLLHGYQPEHPTFAVE
VLISGGRVWQGHANASFCPHAYGCRTLVVCEGQAVLDVTDSELTVTMRVPEGRWLWLEYV
LVVPEEAYSSSYLREEPLDKSYDFISQCASQGYHVSPTSSSPFCRDAASSLSLFYNNGAR
PCGCHEMGATGPTCEAFGGQCLCRAHVIGRDCSRCATGYWGFPSCRPCECRGRLCDELTG
RCVCPPRTVPPDCVVCQPQTFGCHPLVGCEECDCSGPGVQELADPTCDVDSGQCKCRPNV
TGRRCDACAPGFHGFPSCRPCDCHEAGSVPSLCDPLTGQCHCKENVQGPRCDQCRLGTFS
LDAANPKGCTRCFCFGATERCRGSAHARREHVDMEGWTLLSGDRQVVPHELRAEAELLYA
DLRRGFEAFPELYWQAPPSYLGDRVSSYGGTLRYELHSETQRGDVFIPTESRPDVVLQGN
QMSITFLEPVYPAPGHVHRGRLRLVEGNFRHTETHNAVSREELMMVLAGLEQLHIRALFS
QTSSAVSLRGVALEVASEVGGGPPASNVELCMCPANYRGDSCQECAPGYYRDIKGLFLGR
CVPCQCHGHSDRCLPGSGVCVGCQHNTEGDHCEHCRAGFVRSGSEDPMAPCVSCPCPLAV
PSNNFAVGCVLRGGHTQCLCKPGYAGASCERCAPGFFGNPLVLGSSCQPCDCSGNGDPNL
LFSDCDPLTGTCRGCLRHTAGPRCESCAPGFYGNALVPGNCTRCDCSTCGTETCDPHSGH
CVCKAGVTGPRCDRCREGYFGFQDCGGCRPCACGPAAEGSECHPQSGQCHCRPGAGGAQC
RECAPGHWGLPEQGCRRCQCQGGHCDVHTGRCTCPPGLSGERCDTCSQQHQVPVPGGPGG
WGARCEVCDHCVVLLLDDLERAGALLPAIREQLHSINASSAAWARLHRLDAAIADLQRQL
RSPLGLRHETTQQLETLEEQSSSLGQDTQRLDGQATRSLSEARRLLASTEASLGRAQRLL
VAIRAVDRILSELESQTDHLFPANASAPSGEHVRRTLAEVERLLGEMRARDLGAPRAAAE
AELGVARRLLARVQEQLTSRWEGNQGLATRARDRLAQHEAGLMDLRGALNRAVGTTREAE
ELNSRNQERLEDALHRKQELSRDNATLRATLQAASDTLAQLSGLLSGMDQAREEYERLAA
NLDGARTPLLEKMQAFSPASSKVELVEAAEAHAWQLEQLAVNLSSIIRGVNQDHFIQRAI
EAANAYSSILQAVQAAEGAAGQARQQAHDTWAMVVRRGLAPRAWELVTNTSALLEVVLRE
QQRLGHVRVTLQGTGTQLRDAQARKEQQAARIREVQAMLAMDTDETSKKIARAKAVAAEA
QDTAARVQSRIQDMQKHLERWQGQYGGLRSQDLGHAILDAGRSVSTLEKTLPQLLAKLSL
LENRSSHNASLALSASISRVRELIAQARGAASKVKVSMKFNGRSGVQLRAPRDLSDLAAY
TALKFYLQSPQPEPGRVPEDRFVLYMGSRQAVGDYMGVALRNQKVHWVYRLGEAGPAALS
VDEDIGEQFAAVSIDRTLQFGHMSITVEKQMLHETKGDTVAPGAEGLLNLRPDDFVFYVG
GYPSSFTPPEPLRFPGYLGCIEMDTLNEEVLSLYNFEETFQLDTAVDRPCARSKSTGDPW
LTDGSHLDGSGFARISVESQMGTTKRFEQELRLVSSDGIIFFMQYQDQFLCLAVQKGSLL
LFYDFGAGLMKAEPPHKEMEKLLAMTTASKAIQVFLLGGSRGRVSRVLVRLDRNNVFSVD
HSSTLELADAYYLGGVPPDQLPPSLRQLFPSGGSVRGCIKGIKAQGKYVDLKRLNTTGIS
SGCTADLLVGRAMTFHGDGYLTLKLPDVPPATGHIYSGFGFRSTQDVGLLYHKAFLSGPY
QVSLEQGRVALRLLRTEVKTRGSFADGAPHYVAFYSNDTGVWLYVDDQLQQMKPHQGTLP
RPQPSTQEPQQLYLGGLSNTSDSNFRGCISNVFVLRPVGPQRVFNLLQDWEKVNVSSGCA
PTPPPQTPAQAPRGLGAPAAQKAGRRSRQPPPDPACTPPWPLRTIRDAYQFGGPLSSYLE
FAHVPAPPGNWSHLSMLVRPRTRRGLLLLAAPLRASSPSLVLLLRHGHFVAQTEGPGPQL
RVQSRQRARAGRWHTVSVRWEKTRIQLVIDGVWAQDREGPSQQHQRAGSPRPHSLFVGGF
PASGFSPRLPVATGSSRFSGCVRRLRLDGRHLGAPTRVMGVTPCFSGALEKGLFFADSGG
IVTLDTVGATLPHVALELEVRPQTATSLIFHLGRVQKPPYLQLQVLAKQVLLRADDGAGE
FSTWVTCPAALCDGQWHRLAVTRSGNTLQLEVDTRSNRTLGPTLVAFMEDGHLPLHLGGL
PEPTNTRAGPLAYRGCMRNLALNRSPVTWPRSVGIQGAVGASGCPAP
NT seq 11124 nt   +upstreamnt  +downstreamnt
atggcgaagctgggcgtgcggctccgcgcggggagcgcacggcgcgcggcggggctccgg
ggccccgggccgctgctcctggtcggcctggcgctgctgggcgcagcgcgggcgctggag
tcggcggacggcggcttcagcctgcacccgccctactttaacctggccgagggcgcccgc
atcgccgcctcggccacctgcggcgaggaggccccggcgcgcggcgccccgcgccccacc
gaggacctctactgcaagctggtgggcggccccgtggcgggcggggaccccaaccagacc
atccagggccagtactgtgacatctgcacggccgccaacagcaacagggcgcaccccgtg
agcaacgccatcgacggcacagagcgctggtggcagagcccaccgctgtcccggggacta
gagtacaacgaggtcaacgtcaccctggacctgggccaggtcttccacgtggcctacgtg
cttgtcaagttcgccaactcgccgcggccggacctctgggtgctggagcggtccacggac
ttcggccagacctaccagccatggcacttctttgcctcctccaagagggactgcttggag
cggtttgggccgcagacgctggagcgcatcacacgggacgaccaggtcgtgtgctccacg
gagtactctcggatcgtgcccctggagaacggcgagatcgtggtatccctggtgaacggg
cgccccggggccatgaacttctcctactcgcccctgctgcgggacttcaccagagccacc
aacatccgcctgcgcttccttcgtaccaacacgctgctcggccacctcatgggcaaggcg
ctgcgggaccccaccgtcacccgccggtacttttacagcatcaaggacatcagcatcggc
ggccgctgtgtctgccatggccacgcggacgtctgtgacgccccagaccccacagacccc
ttcaggcttcagtgcgcctgtcagcacaacacgtgtgggggctcctgtgaccgctgctgc
cccggcttcaaccagcggccgtggaagccagccaccaccgacagtgccaacgagtgccag
tcctgcaactgccacggccacgcccacgactgcttctacgaccccgaggtggaccggcgc
aacgccagccggaaccaggacaacgtctaccagggcggtggcgtctgcatcgactgccag
catcacaccaccggcatcaactgtgagcgctgcctgcctggcttctaccgggccccggac
caccctctcgactctccccacgcttgccgccgctgcaactgcgagtcggacttcacagac
gggacatgtgcggacctgacgggccgctgcttctgccgccccaacttcacgggggagcac
tgcgacgcgtgcgccgagggcttctctggcttcccgcgctgctacccggtgccctccttc
tcccccaacggcaccggggagcaggtgctgccggccggacagattgtgaactgtgactgc
agcgccgccgggacccagggcaatgcctgccgcaaagacccgcggctgggacgctgcgtg
tgcaaacccggcttccagggcacccactgcgagctctgcgccccaggcttctatggccct
ggctgccagccgtgccagtgctccagccctggagtggcggatggggactgtgaccgagac
tcgggtcagtgcacctgccgggagggcttcgagggggccacgtgcgaccgctgcgccccg
ggctacttccacttccctctttgccagctgtgtggctgcagccctgcggggaccctgcct
gagggctgcgacgaggctggccgctgcccctgccggccggagtttgacggccctcactgt
gaccgctgccgcccgggccaccacggctaccccaactgccgcgcctgcacctgtgacccc
cggggtaccctggaccagctctgcggggtgggcggaacgtgccactgccgtcctggctat
gcgggcgctgcctgccaggagtgcagctccggcttccacggcttcccagactgtgccccc
tgccactgctccaccgagggctccctgcatgcgtcctgcgacccccgcagcggacagtgc
agctgccggccccgtgtgacagggccgcggtgtgacatgtgtgtgcctggcgcctacaac
ttcccctcctgtgaagctggctcctgccaccctgccggcctggccccggccgatcacagc
cttcctgaggcacaggccccctgtacgtgccgagcccacgtggaggggcccagctgtgat
cgctgtaaacctgggttctgggggctgagtcccagcaaccctgagggctgcatccgctgc
agctgcgatccccggggcacgctgggcggacttaccgagtgccagccgggcgacggccag
tgctcttgcaagcctcacgtgtgtggccagacctgcgcggcatgccgggacggcttcttt
gggctggaccgggccgactactttggctgccgcagctgccggtgtgacgtcggcggtgcg
ctgggacagggctgtgacccgaggacgggcgcctgccggtgccgccccaacacccagggc
ctcacctgcagcgagccagcgcgagaccactacgtccccgacctgcacggcctgcgcctg
gagctggaggaggcagccacgccagacggccacacggtgcgctttggcttcaaccccctc
gagttcgagagctttagctggaggggctacgcgcagatgacgcccatccagcccaggatc
gtggcaaagctgaacgtgacctcccccgacctcttccggctcgtcttccgctacgtcaac
cgtgggcccaccagtgtgagcgggcgggtctctgtggaagaggagggcaggtttgccact
tgcaccaactgcacagagcagagccagctcgtcaccttcccgcccagcatagagcctgcc
tttgtcaccgtgccccagaggggcttcggggagccctttgtgctgaaccctggcacctgg
gccctgctcgtggaggccgaaggggtgctcctggactacgcggccctgctgcccggggtc
tactacgaggcagcgctcctgcagctgcgggtgaccgaggcctgcacgttccggcccacc
ggccagcgctccgggcagacctgcctgctctacacccacctgcccctggatggcttcccc
tcagcggctggacccgaggccctatgtcgccatgacaacagcctgccccggccttgcccc
atggagcagctcagcccctcacacccgcccctggccgcctgcctgggcagtgatgtggac
gttcagcttcaggtcacagtgccgcagccaggccgctacgccctggtggtggagtacgcc
aatgaggacgcccgccaggaggtgggcgtggccgtgcacaccccccagcaggccccccag
cagggggctgtcatgcttcacccctgcccttatagcaccctatgccggggcactgccctg
gacgcccagcaccacctggccaccttccacctggacacggaggccagcgtccggctcacg
gccgagcaggcacgcttcttcctgcacagtgtcacactggtgcccgtggagacattcagc
ttggagttcgtggagccccgggtccactgtgtcagcagtcacggtgccttcagccccagc
agtgccacctgcctgctctctcgcttcccgaagccgccccagcccatcatcctcagggac
tgccaggtgctgccactgccccctagcctcccactggcccactcacaggatctcacgccc
ggcgcacccccgtcggggccccagcctcggccccccactgccatggaccccgacgcggag
cccacgctgctgcgccacccccagggcactgtggtcttcaccacccacgtgcccgccctg
ggccgctatgccttcctgctgcacggctaccagccagagcaccccaccttcgctgtggag
gtcctcatcagcgggggccgcgtctggcagggccatgccaacgccagcttctgcccacac
gcctatggctgccgcaccctggttgtgtgtgaggggcaggccgtcctggatgtgaccgac
agtgagctcaccgtgaccatgcgcgtgcctgagggccggtggctctggttggagtatgtg
ctggtggtccccgaggaggcctacagctccagctacctccgagaggagcctctggacaaa
tcttatgacttcatcagccagtgtgccagccagggctaccacgtcagccccaccagctcg
tccccgttctgtcgtgatgctgcctcttctctctctctcttctataacaacggggctcgg
ccttgcggctgccacgaaatgggtgccacgggccctacgtgtgaggcctttgggggccag
tgtctctgccgggcccacgtcattggccgtgactgctcccgctgtgccactggctactgg
ggcttccccagctgcaggccctgcgagtgccgaggccgtctgtgtgacgagctcacaggc
cgatgtgtctgcccgccacgcaccgtcccgcctgactgcgtcgtctgccagccgcagacc
ttcggctgccaccccctggtgggctgtgaggagtgtgactgctcagggcccggggtccag
gagctcgccgaccccacctgtgatgtggacagtggccagtgcaagtgcagacccaatgtg
accgggcgccgctgtgacgcctgtgcccctggcttccacggcttccccagctgccgtccc
tgcgactgccatgaagcgggctccgtgcccagcctgtgtgaccccctcacaggccagtgc
cactgcaaggagaacgtgcagggcccacggtgtgaccagtgccgcctcgggaccttctcc
ctcgatgccgccaaccccaaaggctgcacccgctgcttctgcttcggggccactgagcgc
tgcaggggctcggcccacgcccgccgtgagcacgtggacatggagggctggacgctgctg
agcggtgaccggcaggtggttcctcacgagctgcgagcagaggcagagctgctctacgct
gacctgcggcgtgggttcgaggccttccctgagctgtactggcaggccccaccctcctac
ctgggggacagggtgtcgtcctatggtgggaccctccgctatgaacttcactcggagacc
cagcgcggagacgtgttcatccccacggaaagcaggccggacgtggtgctacagggcaat
cagatgagcatcacgttcctggagcccgtgtacccggcgcccggccacgttcaccgcgga
cggctgcggctggtggaggggaacttccggcacacggagacgcacaacgccgtgtcccgc
gaggagctcatgatggtgctggcgggcctggagcagctgcacatccgtgccctcttctcc
cagacctcctcggctgtctccctgcgcggcgtggcgctggaggtggccagcgaggtgggc
ggggggcctccggccagcaacgtggagctgtgtatgtgcccggccaactaccgtggagat
tcgtgccaggaatgtgcccctggctactaccgggacatcaaaggtctcttcttgggtcgc
tgtgtcccctgtcagtgccatggccactcagaccgctgcctccctggctcgggcgtctgc
gtgggctgccagcacaacacggaaggtgaccactgtgagcactgccgggcgggcttcgtg
cgcagtgggtccgaggaccccatggccccctgtgtcagctgcccgtgccccctcgcagtg
ccttccaacaactttgccgtgggctgtgtccttcgaggagggcacacgcagtgtctctgc
aaacccggctacgcgggcgcctcctgcgagcggtgcgcgcccggcttctttgggaacccg
ctggtgctgggcagctcctgccagccgtgcgactgcagcggcaatggtgaccccaacctg
ctcttcagcgattgcgaccccctgaccggcacgtgccgcggctgcctgcgtcacaccgcc
gggccccgctgcgagagctgcgccccgggcttctacggcaacgcactggtgcccggcaac
tgcacccggtgtgactgctccacgtgcgggacagagacctgcgacccccacagtgggcac
tgcgtgtgcaaggcgggagtgacggggccacgctgtgaccgctgtcgggaaggatacttc
ggcttccaggactgcgggggctgccgcccgtgcgcctgtggaccggctgcggagggctcc
gagtgccacccccagagtgggcagtgccactgcaggccaggggctggaggagcccagtgc
cgtgagtgcgcccccggccactgggggctgcctgagcagggctgcaggcgctgccagtgc
caggggggccactgtgatgtgcacacgggccgctgcacctgccctcctgggctcagcggg
gagcgctgtgacacctgcagtcagcagcaccaggtgccggtgccaggcgggcccgggggc
tggggcgcccgctgcgaagtgtgtgaccactgtgtggtcctgctcctggacgacctggaa
cgagccggcgccctcctgcccgccatccgggagcagctgcacagcatcaacgccagctct
gcagcctgggcccggctgcacaggctggacgccgccatcgctgacctgcagaggcagctc
cggagccccctgggcctccgccacgagaccacgcagcagctggagaccttggaagagcag
agctcaagcctcggacaggacacacagaggctggacggccaggccacaagatccctctcc
gaggcccgccggctgctggccagcactgaggcctcactgggccgggcacagaggctgctg
gtggccatcagggctgtggaccgcatcctgagtgagctcgagtcccagacggaccacctg
ttcccggccaacgcttccgccccgtcgggcgagcacgtgcgccggacgctggccgaggtg
gagcggctgctgggggagatgcgggcccgggacctgggtgccccgagagcagcggctgag
gccgagctgggtgtggcccggagattgctggcccgtgtgcaggagcagctgaccagccgc
tgggaggggaaccagggactggccacacgcgcccgggatcggctggcccagcacgaggct
ggcctcatggaccttcggggggccctgaaccgggcagtgggcacgactcgggaggctgag
gaactcaacagccgaaaccaggagcgcctggaggacgccctgcaccggaagcaggagctg
tccagggacaatgccactctgagggccactctgcaggccgccagtgacaccctggcccag
ctctctgggctcttgtctggtatggaccaggccagggaggagtatgagcgccttgctgcc
aacctggatggggcccggacgcccctgctggagaagatgcaggccttctcgccggcaagc
agcaaggtggagctggtggaggccgctgaggcccacgcgtggcagctggagcagctggcc
gtcaacctgtccagcatcatccgtggagtcaaccaggaccacttcatccagcgggccatc
gaggccgctaacgcctacagcagcatcctccaagccgtgcaggccgccgagggggctgct
ggccaggcgcgacagcaggcacatgacacgtgggcgatggtggtgcggcggggcctggcg
ccccgggcctgggagctggtgaccaacaccagtgccctgctggaggtcgtcctcagggag
cagcagaggctgggccacgtgcgggttacccttcagggcaccgggacccagctccgagat
gcccaggccaggaaggagcagcaggcggcccgaatccgggaggtacaggccatgctggct
atggacactgatgagacaagcaagaagattgctcgtgccaaagccgtggctgctgaggcc
caggacacggccgcccgcgtgcagtcgcggattcaggacatgcagaaacacctggagcgg
tggcagggccagtacggaggcctgcggagccaggacctgggccatgccatactcgacgcg
ggccggtcagtgtccaccctggagaagacgctgccgcagctgctagccaagctgagcctc
ctggagaaccgcagctcgcacaacgccagcctggccttgtccgccagcatcagccgtgtg
cgggagctcatcgcccaggcccgtggagcggccagcaaggtcaaggtgtccatgaagttc
aacgggcgctcaggggtgcagctgcgtgccccccgggacctctccgacctcgccgcctac
accgctctcaagttctacctccagagcccacagccggagcccggccgggtccccgaggac
cgcttcgtgctgtacatgggcagccgtcaggctgtcggggactacatgggcgtggctctg
cggaaccagaaggtacactgggtgtaccgcctgggggaggcgggccccgcggccctcagc
gtcgacgaagacatcggggagcagtttgcagcagtcagcattgacaggaccctccagttt
ggccatatgtccatcacggtggagaagcagatgctccatgagaccaagggtgacacggtg
gcccctggggccgaggggctgctcaacttgcggcctgacgacttcgtcttctacgtggga
ggctatcccagcagcttcacgccccccgagcccctccgcttccccggctacctgggctgc
attgagatggatacgctcaacgaggaggtgctcagtctctacaacttcgaggagaccttc
cagctggacacggccgtggacaggccttgtgctcgctccaagtcaactggggacccatgg
ctcacagacggctcccacttggacggctccggcttcgcacgcatcagcgtggagagtcag
atgggcacgaccaaacgcttcgagcaggagctgcggctcgtgtcttccgacgggatcatc
ttcttcatgcagtaccaggaccagttcctgtgcctggctgtacagaaaggcagccttctc
ctgttctatgactttggcgcaggcctgatgaaggctgagcccccacacaaagagatggag
aaactgctagccatgaccacggccagcaaggcaatccaggtgtttctgctggggggaagt
cgtggccgtgtcagccgcgtgctggtgcgcttggacaggaacaacgtgttcagtgtggac
cacagcagcacgctggagctggccgacgcctactacctggggggggtgccccccgaccag
ctacccccaagcctgcggcagctcttcccctccggaggctcagtccgcggctgcatcaag
ggcatcaaggctcagggcaagtacgtggacctcaagaggctgaacacgacgggcatcagc
tcgggctgcactgccgacctgctggtgggacgggccatgactttccacggcgatggctac
ctgaccctgaagctccctgatgtccctcccgccacgggccacatctactccggcttcggc
ttccgcagcacccaggacgtcggtctgctctaccacaaagcattcctgagcgggccgtac
caggtgtccctggaacagggccgtgtggcactccggctgctgaggacagaggtgaagact
cgagggagctttgccgatggtgccccccattacgtggctttctacagtaacgacacgggg
gtctggctctatgtggacgatcagcttcagcagatgaagccccaccaggggacactcccc
aggccccagcccagcactcaggagccccagcagctctacttgggaggcttgtccaacacc
agcgactccaacttccgtggctgcatcagcaacgtcttcgtgctgcgacctgtggggccg
cagcgcgtgttcaacctgctgcaagactgggagaaagtcaacgtgagctcaggctgtgcc
cccaccccacccccccagaccccggcacaggctcctcgaggacttggggcccccgcagca
cagaaggctggccgacgcagccgccagccccccccggaccctgcctgcacaccaccctgg
cctctcaggaccatccgagacgcctaccagtttgggggccccctgtccagttacctggag
tttgcccacgtcccggcaccccctgggaactggtcccacctctcgatgctggtccgccct
cgcacccggcgaggactcctgctgcttgctgcccccctccgggccagcagcccttccctg
gttctcctcctgaggcacggacactttgtcgctcagacagagggcccagggccccagctc
cgtgtccagagccgccagcgtgcacgggcaggccggtggcacacggtgtctgtgcgatgg
gaaaagactcggatccagctggtgatagatggggtctgggcacaggaccgggaggggccc
agccagcaacaccagagggcagggagcccccggccccactctctctttgtggggggcttc
ccggccagtggcttcagcccaaggctcccagtggccactggcagttcccgcttcagtggc
tgtgtgaggagactgaggctggacgggcggcacctaggggcccccacacgggtgatgggg
gtcacgccatgcttctcaggcgccctggagaagggcctgttctttgcagacagcgggggt
attgtcaccctagatactgtgggggccacgctgcctcacgtggccctagagctggaggtg
cggcctcagacggccaccagcctcatcttccacctgggccgggtccagaagccaccctac
ctgcagctgcaggtgctggccaagcaggttctgctgcgcgcagacgatggtgcaggggag
ttctccacgtgggtgacatgccctgcagccttgtgtgacgggcagtggcaccgactggca
gtgaccagaagcgggaacacgctccagttagaggtggacacacgcagcaaccggaccttg
ggccccacgctggtggccttcatggaggacggccacctgcctctgcacctcgggggcctg
cctgagcccacgaacacacgggctgggcctctagcctaccgtggctgcatgaggaacctg
gcgctgaaccggtcccccgtcacctggcctcgctctgtgggcatccagggggcagtgggg
gccagcggctgcccagcaccctag

KEGG   Ursus arctos horribilis: 113258887
Entry
113258887         CDS       T05909                                 

Gene name
RELN
Definition
(RefSeq) reelin
  KO
K06249  reelin [EC:3.4.21.-]
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05017  Spinocerebellar ataxia
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113258887 (RELN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113258887 (RELN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113258887 (RELN)
 09160 Human Diseases
  09164 Neurodegenerative disease
   05017 Spinocerebellar ataxia
    113258887 (RELN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113258887 (RELN)
Enzymes [BR:uah01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.21  Serine endopeptidases
    3.4.21.-  
     113258887 (RELN)
SSDB
Motif
Pfam: BNR EGF_2 Reeler EGF_Tenascin hEGF EB
Other DBs
NCBI-GeneID: 113258887
NCBI-ProteinID: XP_026359773
UniProt: A0A3Q7WDS1
LinkDB
Position
Unknown
AA seq 3458 aa
MERSCWAPRTFLLALLLGTTLRARAAVGYYPRFSPFFFLCTHHGELEGDGEQGEVLISLH
IAGNPTYYVPGQEYHVTISTSTFFDGLLVTGLYTSTSAQASQSIGGSNAFGFGIMSDHQF
GNQFMCSVVASHVSHLPTTNLSFVWIAPPAGTGCVNFMATATHRGQIIFKDALAQQLCEQ
GAPTEATTHPHLAEVHSDSIILRDDFDSYHQLELNPNIWVECNNCETGEQCGAIMHGNAV
TFCEPYGPRELITTSLNTTTASVLQFSIGSGSCRFSYSDPSIIVSYAKNNTADWIQLERI
RAPSNVSAIIHILYLPQDAKGENVQFQWKQESLHVGEVYEACWALDNILIINSAHRQVIL
EDSLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSTEDIQEQW
SEEFESQPTGWDILGAVIGTECGTVESGLSMVFLKDGERKLCTPYMDTTGYGNLRFYFVM
GGVCDPGDSHENDITLYAKIEGRKEHITLDSLSYSSYKVPSLVSVVINPELQTPATRFCL
GQKNHQGHNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVGLEFSTNHGRS
WSLLHTECLPEICAGPHLPHSTIYSSENYSGWNRITIPLPNAALTRDTRIRWRQMGPILG
NMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSSRL
SSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGSKS
VLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYLNYHEPRIISVELPDDARQFGIQFR
WWQPYHSSQGEDVWAIDEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPYCGHDWTL
CFTGDSKLASSMRYVETQSMQIGASYMIQFSLVMGCGQKYTPHMDNQVKLEYSTNHGLTW
HLVQEECLPSMPSCQEFTSASIYHASEFTQWRRVIVLLPQKTWSSATRFRWSQSYYTAQD
EWALDSIYIGQQCPNMCSGHGSCDHGVCRCDQGYQGTECHPEAALPSTIMSDFENQNGWE
SDWQEVIGGEVVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWVDFVQFYIQIGGES
AACNKPDSREEGVLLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPCTRFRWW
QPVFSGEGYDQWAVDDIIILSEKQKQVIPIVNPTLPQNFYEKPAFDYPINQMSVWLMLAN
EGMAKNETFCSATPSAMVFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCANQFSSAAPVL
LQYSHDAGMSWFLVKEGCYPASAGKGCEGNSRELSEPTMYHTGDFEEWTRITIVIPRSLA
SSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCVSGVCFCDLGYTAAQGTC
VSNVPNHSEMFDRFEGKLSPLWYKITGGQVGTGCGTLNDGKSLYFNGPGKREARTVPLDT
RNIRLVQFYIQIGSKTSGITCIKPRARNEGLVVQYSNDNGILWHLLRELDFMSFLEPQII
SIDLPREAKTPATAFRWWQAQHGKHSAQWALDDVLIGMNDSSQTGFQDKFDGSIDLQASW
YRIQGGQVGTDCLSMDTALIFTESIGKPRYAETWDFHVSASTFLQFEMSMGCSKPFSDTH
GVQLQYSLNNGRDWHLVTEECVPPTIGCLHYTESSVYTSERFQNWKRITVYLPLSTISPR
TRFRWIQSNYTAGADAWAIDNVVLASGCPWLCSGRGICDAGRCVCDRGFGGAYCVPIVPL
PSILKDDFNGNLHPDLWPEVYGAERGNLNGETIKSGTSLIFKGERLRMLISRDLDCTNTM
YVQFSLRFIAKGTPERSHSILLQFSINGGITWHLMDEFYFPQTTNILFINVPLPYTAQTN
ATRFRLWQPYNNGKKEEIWIVDDFIIDGNNLNNPLMLLDTFDFGPREDNWFFYPGGNIGL
YCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLDVNENTIIQFEINVGCSTDSSSTDPVR
LEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHHPSSTYYAGTTQGWRREVVHFGKLHLC
GSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCNGHGSCINGTKCICDPGYSGPTCK
ISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLMTRDLDL
SHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIALEI
PLKARSASTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFTTLDSRKWLLHPG
GTKMPVCGSSGDALVFIEKASTRYVVTTDIAVNEDSFLQIDFAASCSVTDSCYAIELEYS
VDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPPYTRSQATRFRWHQ
PAPFDKQQTWAIDNVYIGDGCIDMCGGHGRCIQGSCICDEQWGGLYCDEPEISLPTQLKD
NFNRAPSSQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEFIQFYFM
YGCLITPNNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIATRFRW
WQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGRIAFDMF
MEDKTAVNEHWLFHDDCTVERFCDSPDGVMICGSHDGREVYAVTHDLTPTEGWIMQFKIS
VGCKVSEKVAQNQVHVQYSTDFGVSWNYLVPQCLPADPKCSGSVSQPSVFFPTKGWKRIT
YPLPESLVGNPVRLRFYQKYSDMQWAIDNFYLGPGCLDNCRGHGDCLKEQCICDPGYSGP
NCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTVRQAITQ
DLDLRGAKFLQYWGRIGSENNMTSCHRPICRKEGVLLDYSTDGGITWTLLHEMDYQKYIS
VRHDYILLPEDALTNTTRLRWWQPFVTSNGLVVSGVERAQWALDNILIGGAEINPSQLVD
TFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTHNALSSRELIIQPGYMM
QFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQFHEATIY
NAVNSSSWKRITIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPKLCSGHGY
CTTGAVCICDESFQGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIGSGCGQLAP
YAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTSQTDSCNSDVSGPHAVDKAVL
LQYSVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGTGHDQWAL
DHVEVVLTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP
NT seq 10377 nt   +upstreamnt  +downstreamnt
atggagcgcagttgctgggccccgcggactttcctcctggcgctgttgctggggacgacg
ctgagggcgcgcgcggcggtgggctattacccccgcttctcgcccttctttttcctgtgc
acccaccacggggagctggaaggggatggggagcagggcgaggtgctcatttccctgcac
attgcgggcaaccccacctactacgtaccgggacaagaataccatgtgacaatttcaaca
agcaccttctttgacggcttgctggtgacaggactgtatacgtccacaagtgcgcaggct
tcacagagcattggaggttccaatgcttttggatttgggatcatgtctgaccaccagttt
ggtaaccagtttatgtgcagcgtggtggcctctcatgtgagtcatctgcccacaaccaac
ctcagtttcgtctggatcgctccacctgctggcaccggctgtgtgaatttcatggctacg
gcgacacacaggggccagattattttcaaagatgctttagcccagcagctgtgtgaacaa
ggagctccaacagaagccacgacgcacccacatctagctgaagtacatagcgacagcatt
atcctacgagatgactttgactcctaccaccaactagagttgaatccaaatatatgggtt
gaatgtaacaactgtgagactggagaacagtgtggtgcaattatgcatggcaatgctgtc
accttctgtgagccatatggtccgagagaattgattaccacaagccttaatacaacaaca
gcctctgtcctccaattttccattggctcaggttcctgtcgcttcagttactcagacccc
agcatcatcgtgtcctacgccaagaacaacaccgcagactggattcagcttgagagaatt
agagccccttccaacgtcagcgccatcatccatatcctctaccttcctcaagacgccaaa
ggcgagaacgtccagtttcagtggaagcaggaaagtctccatgtaggcgaagtgtatgaa
gcctgctgggccttggataacatcctgatcatcaactcagctcacagacaagtcatttta
gaagacagtcttgatccagtagacacgggcaactggctttttttcccaggagctacagtt
aagcatagctgtcagtcagatgggaactccatttatttccatggaaatgaaggcagtgag
ttcaatttcgccaccacccgggatgtagatctttctacagaggatattcaagagcagtgg
tcagaagaatttgaaagccagcctaccgggtgggacatcttgggagctgtcattggtaca
gaatgtggaacagtagaatcaggtttatcaatggtcttcctcaaagatggagagaggaaa
ttatgcaccccatatatggataccaccggttatgggaacctaagattttactttgttatg
ggaggagtttgtgaccctggagattctcatgaaaatgatatcaccctgtatgcaaagatt
gaaggaagaaaagagcacattacactggacagtctttcctattcctcctataaggttcca
tctttagtttctgtcgtcatcaatcctgaacttcagacgcctgctaccagattttgtctc
gggcaaaagaaccatcaagggcataacaggaatgtctgggctgtcgatttttttcacgtc
ttacctgttctcccttctacgatgtctcacatgatacagttttccattaatctgggctgc
ggaacacatcaacctggtaacagcgtcggcttggagttttccaccaaccatgggcgctcc
tggtccctcctccacactgagtgtttgcctgagatctgtgctggaccccacctcccccac
agcacgatctactcctccgaaaactatagtgggtggaaccgcataacgattccccttcct
aacgcagcactaaccagggacaccagaattcgctggagacaaatgggaccaatccttgga
aacatgtgggcaattgataatgtttatattggtccgtcgtgtctcaaattctgttctggc
cgaggacaatgcactcgacatggttgcaagtgtgaccctggattttctggcccagcttgt
gagatggcatcgcagacattcccaatgtttatttctgaaagctttggcagttctaggctc
tcctcttaccataacttttactctatccgtggtgctgaagtcagctttggttgtggtgtc
ttggccagtggtaaggccctggttttcaacaaggatgggaggcgtcagctaattacatct
ttccttgacagctcacagtccaggtttctccagttcacactgaggctggggagcaagtct
gttctcagcacatgcagagcccccgaccagcctggtgaaggagttttattgcattattcg
tatgataatgggataacttggaaacttttggagcactattcatatctcaactatcatgag
cccagaataatctctgtagagttaccagatgatgcaagacagttcggaattcagttcaga
tggtggcaaccatatcattcttcccagggagaagacgtatgggccattgatgagattatc
atgacatctgtccttttcaacagcattagccttgactttaccaatcttgtggaggtcact
caatctctgggattctaccttggaaatgttcagccatactgtggccatgactggacactt
tgtttcacgggagattctaaacttgcgtcgagtatgcgctatgtggaaacacaatcaatg
cagataggagcatcctatatgattcagttcagtttggtgatgggctgtggccagaaatac
actccgcacatggacaaccaggtgaagctggagtactcaaccaatcacggcctcacctgg
cacctggtccaagaggaatgtcttccaagtatgccgagttgtcaggaattcacgtcagca
agtatttaccatgccagtgagttcacgcagtggagaagagtcatagtgcttcttccccag
aaaacttggtccagtgccacccgcttccgctggagtcagagctattacacagcccaagac
gagtgggctttggacagcatttacattggacagcagtgcccaaacatgtgcagtggacac
ggctcatgtgaccacggcgtgtgcaggtgtgaccagggataccaaggcaccgaatgccac
ccggaagctgctcttccttccacgattatgtcggattttgagaaccagaatggttgggag
tctgactggcaagaagtcattgggggagaagttgtaaagcctgaacaagggtgtggagtc
atctcatctggctcatctctgtatttcagcaaggctgggaaaagacagctggtgagctgg
gatctggatacctcttgggtggactttgtccagttctacatccagattggcggagagagt
gccgcgtgcaacaagccggacagcagggaggagggcgtcctcctccagtacagcaacaac
gggggcatccagtggcacctgctggcagagatgtacttctccgacttcagcaaacccaga
tttgtctacctggagctcccagctgctgctaagaccccttgcaccagatttcgctggtgg
cagcctgtgttctcgggggagggctacgaccagtgggcggtggatgacatcatcatcctg
tcagagaagcaaaagcaggtcatcccaattgtgaacccaactttacctcagaacttttat
gagaagccagcttttgattacccgattaatcaaatgagtgtgtggttgatgttggctaat
gaaggaatggctaaaaacgaaactttctgctctgccacaccatcagccatggtatttgga
aaatcagatggggatcgatttgcagtaactcgagatttgactttgaaacctggatatgtg
ctacagttcaagctgaacatagggtgtgccaatcagttcagcagtgctgccccagttctt
cttcagtactctcatgatgcgggtatgtcctggtttctggtgaaagaaggctgttaccca
gcttctgcaggcaaaggatgtgagggcaactccagggaactcagtgagcccaccatgtat
cacacgggggactttgaggaatggacgagaatcactattgttattccaaggtctcttgca
tccagtaaaaccagattccgatggattcaggagagcagctcacagaagaacgtgcctccc
tttggtctggatggagtgtacatatctgagccttgtcccagttactgcagtggccatggg
gactgtgtctcaggggtgtgtttctgtgatctggggtacaccgccgcgcaaggaacctgt
gtgtctaatgtccctaatcacagtgagatgttcgataggtttgaggggaagctcagccct
ctgtggtacaagataactgggggccaggttggaaccggctgtggaacgcttaatgatggc
aaatctctctacttcaatggccctgggaaaagggaagcaaggactgtccctctggacacc
aggaatatcagactcgttcagttttatatacaaattggaagcaaaacttcagggattacc
tgcatcaaaccaagagctagaaatgaagggcttgttgttcagtattcaaatgacaacggg
atactctggcatttgctccgagagttggacttcatgtcatttctggagccacagatcatt
tccattgacctgccacgggaggcgaagacccccgccacagcttttcgatggtggcaagcc
caacacgggaaacattcagcccagtgggccttggatgatgtccttataggaatgaatgac
agctctcaaactgggtttcaagacaaatttgatggctccatagatttgcaagccagctgg
taccgaatacaaggaggtcaagtcggaactgactgtctctctatggatactgctctgata
ttcactgaaagcataggaaaacctcgttatgcagagacctgggactttcacgtgtcggca
tcgaccttcttgcagtttgaaatgagcatgggctgcagcaagcccttcagtgacacccat
ggcgtccagctccagtattctctaaacaacggcagggactggcatctggtcaccgaggag
tgtgtgcctcccaccattggctgcctgcactacacagaaagttctgtttacacctcagaa
agattccagaattggaaacggatcactgtctaccttccactctccaccatatctcccagg
acccggttcagatggattcagtccaactacaccgcgggggctgacgcatgggctattgat
aatgttgtgctggcctcggggtgcccctggctgtgctcgggaagagggatttgcgatgcc
ggacgctgtgtgtgtgaccggggctttggtggagcctactgtgtgcccatcgttcctctg
ccctcaattcttaaggatgatttcaacgggaacctacatcccgacctttggcctgaagtg
tatggtgccgagagggggaatctgaatggggaaaccatcaaatctggaacatcgcttatt
tttaaaggggaacgactgaggatgcttatttcaagagatctagactgtacgaatactatg
tatgtccagttttcacttagatttatagcaaaaggtaccccggagaggtctcactctatt
ctactacagttctccatcaacgggggaatcacttggcacctgatggacgaattttacttc
cctcaaacgaccaacatacttttcattaatgtgcccttgccgtatactgcccagaccaac
gctacaaggtttagactctggcaaccttataataatggtaagaaagaagaaatctggatt
gttgatgacttcattattgatgggaataatctaaacaaccctctgatgcttctggataca
tttgactttggacccagagaagacaactggtttttctatcctggtggtaacattggcctt
tattgcccgtattcctcaaaaggagctcctgaggaagattcagcaatggtatttgtttca
aatgaagttggtgagcattccattaccactcgtgacctggatgtgaacgagaacaccatc
atacaatttgagatcaacgtgggttgctccacggatagctcatccaccgatccagtcaga
ctggagttttcaagggacttcggggcgacctggcacctgctgctccccctgtgctaccac
agcagcagccacgtcagctccttgtgctccaccgagcaccacccgagcagcacctactac
gcggggaccacccagggctggaggagggaggtcgtgcactttgggaagctgcacctttgc
ggatctgtgcgtttcagatggtaccaaggattttaccctgcaggctctcagccagtgacg
tgggccatcgataatgtctatattggtccccaatgtgaagagatgtgtaacggacatggg
agttgtatcaatggaaccaagtgtatatgtgatcccggctactcaggtcccacctgtaaa
ataagcaccaaaaatcctgattttctcaaggatgattttgaaggtcagctagaatccgat
cgattcttattaatgagtggcgggaagccatctcggaagtgtggaatcctttccagtgga
aacaacctctttttcaacgaagatggcttgcgcatgttgatgacacgagatctggattta
tcacatgctagatttgtgcagttcttcatgagactgggatgtggtaaaggtgttcctgac
cccaggagccaacctgtgcttcttcagtattcgctcaatggcggcctgtcatggagtctc
cttcaagagttccttttcagcaactccagcaacgtgggcaggtacatcgccctggagata
cccttgaaagcccgttctgcttctactcgcctccgctggtggcaaccgtctgaaaatggg
cacttctacagtccctgggttatcgaccagattcttatcggaggaaatatttctggtaat
acggtcttggaggatgatttcacaactctggatagtaggaaatggctgctccacccagga
ggcaccaagatgcctgtgtgcggctcttctggtgacgccctggtcttcattgaaaaggcc
agcacccgctatgtggtcaccacagacattgctgtgaatgaggattcattcctacagatc
gattttgctgcctcctgctcggtcacagactcctgttatgctattgaattggaatactcg
gtagatcttggattatcgtggcatccattggtaagggactgtctacctaccaatgttgaa
tgcagtcgttaccacctacagcggatcctggtgtcagacactttcaacaaatggaccaga
atcactctgcctctccctccttataccaggtcccaagccactcgtttccgttggcatcaa
ccagctcctttcgacaagcagcagacatgggccatagataatgtctacatcggggacggc
tgtatagatatgtgtggtggccacggaaggtgcatccagggaagctgtatctgtgacgag
cagtggggtggcctatactgtgatgagcccgagatctcccttccaacccaactcaaagac
aacttcaatagagctccatccagccagaactggttgactgtgaacggagggaaactaagt
actgtgtgtggtgctgtggcttcgggaatggctctccatttcagcgggggttgcagtcga
ctgctagtcaccgtggatctgaacctcaccaatgccgaatttatccaattttacttcatg
tacggatgccttattacaccgaacaaccgtaaccaaggtgttctcctggaatattctgtc
aacggaggcattacctggaacctgctaatggaaattttctatgatcaatacagtaaacct
ggatttgtaaatatccttctccctcctgatgctaaagagattgctactcgcttccgctgg
tggcagccaagacacgacggcctggatcagaatgattgggccatcgacaatgtccttatc
tcaggctctgctgaccagaggactgtcatgctggacactttcagcagcgcccctgtgccc
cagcacgagcgctcccctgcagacgccggtcctgtcgggagaattgccttcgacatgttt
atggaggataaaactgcagtgaatgagcattggctattccacgacgactgcacggtagaa
agattctgtgactcccctgatggtgtcatgatttgcggcagccatgatggaagagaagta
tacgcagtgacccatgacctgacccccacggaaggctggatcatgcagttcaagatctct
gttggatgtaaagtatctgaaaaagttgcccagaatcaagttcatgtgcagtattctacc
gactttggtgtgagctggaattatctggtccctcagtgcttacctgcagaccccaaatgt
tctggaagtgtttctcagccatctgtgttctttccaaccaaaggctggaaaaggatcacc
tacccactccctgaaagcttagtggggaatccagtaagattgaggttctaccagaagtac
tcagatatgcagtgggcaatcgataatttctacctgggccctggatgcttggacaactgt
agaggccatggagattgcttaaaggagcagtgcatctgtgatccgggatactctgggccc
aactgctacttgactcacactctgaagactttcctgaaggaacgctttgacagtgaagaa
atcaagcctgatttatggatgtccttagaaggtggaagtacttgtaccgagtgtgggatt
cttgccgaggacactgcactctattttgggggatccaccgtgagacaagctattactcaa
gatttggatctcaggggggccaaattcctgcaatactgggggcgcatcggtagtgagaac
aacatgacctcctgccatcggcccatctgccggaaggaaggcgtgctgttggactactct
accgatggaggaattacttggactttgctccatgagatggattaccagaaatacatctct
gttcgacacgactacatactccttcctgaggacgccctcaccaacacgactcgacttcgc
tggtggcagccttttgtgaccagcaatggactcgtggtctctggagtggagcgtgcgcag
tgggcactagacaacatactgattgggggagcagaaatcaatcctagtcaactggtcgac
acttttgacgatgaaggtacttcccatgaagaaaactggagtttttaccctaatgcagta
aggacggcaggattctgtggcaatccatccttccacctctactggccaaataaaaagaag
gacaagactcacaatgcgctctcctccagagaactcattatacagccaggatatatgatg
cagtttaaaattgtggtgggttgcgaggccacttcgtgtggtgaccttcattccgtcatg
ttggagtatactaaggatgcaaggtctgattcctggcagctcgttcagacccagtgcctt
ccttcctcttcaaacagcatcggctgctcccccttccagttccatgaagccaccatctac
aatgctgtcaacagctcgagctggaagaggattaccattcagctgcctgaccatgtctcc
tccagtgcgacacaattccgctggatccagaagggggaagaaacggagaagcaaagctgg
gcgattgaccacgtgtacatcggagaggcttgccccaagctctgcagtgggcacggatac
tgcaccactggcgccgtctgcatctgtgatgaaagtttccaaggtgacgattgctctgtt
ttcagtcatgatcttcccagttatatcaaagataatttcgagtcagcaagagtcactgaa
gcaaactgggagaccattcaaggtggggtgataggaagtggctgtgggcagctcgcaccc
tacgcccatggagactcactctatttcaatggctgtcagataaggcaggcagccaccaag
cctctggatctcactcgagcaagcaaaatcatgtttgttttgcaaatcgggagcacgtcg
cagacggacagctgcaacagcgacgtgagcggcccgcacgccgtggacaaggcggtgctg
ctgcagtacagtgttaacaacggcatcacgtggcacgtcatcgcgcagcaccagcccaag
gacttcacgcaggcccagagggtgtcctacaatgtccccctggaggcacggatgaaagga
gttttactgcgctggtggcaacctcgccacaacggaacaggtcatgatcaatgggctttg
gaccatgtggaggtcgtcctcactcgcaaacaaaattacatgatgaatttttcacgacaa
catgggctcaggcacttctacaacagaagacgaaggtcacttagacgatacccatga

KEGG   Ursus arctos horribilis: 113259434
Entry
113259434         CDS       T05909                                 

Gene name
LAMA2
Definition
(RefSeq) laminin subunit alpha-2 isoform X1
  KO
K05637  laminin, alpha 1/2
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
uah05410  Hypertrophic cardiomyopathy (HCM)
uah05412  Arrhythmogenic right ventricular cardiomyopathy (ARVC)
uah05414  Dilated cardiomyopathy (DCM)
uah05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113259434 (LAMA2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113259434 (LAMA2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113259434 (LAMA2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113259434 (LAMA2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113259434 (LAMA2)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy (HCM)
    113259434 (LAMA2)
   05412 Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    113259434 (LAMA2)
   05414 Dilated cardiomyopathy (DCM)
    113259434 (LAMA2)
   05416 Viral myocarditis
    113259434 (LAMA2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113259434 (LAMA2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113259434 (LAMA2)
   05145 Toxoplasmosis
    113259434 (LAMA2)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_N Laminin_B Laminin_II Laminin_G_3
Other DBs
NCBI-GeneID: 113259434
NCBI-ProteinID: XP_026360566
UniProt: A0A3Q7VHN1
LinkDB
Position
Unknown
AA seq 3125 aa
MPGAAPVLLVLLLSGSLGGGRAQRPQQQQRQRQPQTHQQRGLFPAVLNLASNALITTNAT
CGERGPEMYCKLVEHVPGQPVRNPQCRICNQNSSNPFQRHPITNAIDGKNTWWQSPSIKN
GIEYHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDNVEYKPWQYHAVTDTECL
TLYNIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTS
ARYIRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACP
LDLVTNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDE
NVARRNLSLNIHGKYIGGGVCINCTQNTAGINCETCIDGFFRPKGVSPNYPRPCQPCHCD
PIGSLNEVCVKDEKRARRGLAPGSCHCKPGFRGVSCDRCARGYTGYPDCKPCNCSGAGST
NEDPCFGPCNCKENVEGGDCSRCKAGFFNLQEDNHKGCDECFCSGVSDRCQSSYWTYGHI
QDMRGWYLTDLSGRVQVTPGQDDLDPPQQISISVEEAHRALPQGYYWRAPAPYLGNKLTA
AGGQLTFTISYNFEEEEEDTERVFQIMVILEGNDLRISTAQDEVYLQPYEEHIHVLSLKE
ELFTIHGTNFPVSRRELMTVLANLKRVLIQITHSLGMNAIFRLGSVGLESAVRYPTDRSI
AAAVEVCQCPPGYTGSSCESCWPRHRRINGTLFGGICEPCHCFGHAESCDDITGECLNCK
NHTGGPYCNKCLPGFYGDPTKGTSEDCQPCACPLNIPSNNFSPTCRLDRSLGLVCDACPV
GYTGPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSVPGSCDSLSGSCLICKPGTTGRF
CELCADGYFGDAVDAKNCQPCRCNVNSSFSESCHARTGQCECKPNVQGRRCDECKPETFG
LQSGRGCVPCNCNSFGSKSFDCEESGQCWCQPGVSGKKCDRCAHGYFNFQEGGCTACDCS
HLGNNCDPKTGQCICPPNTIGEKCSKCAPNTWGHSITTGCKACNCSTVGSLDFQCNINTG
QCSCHPKFSGTKCTECNRGHWNYPHCSPCECFLPGTEDRTCDSETKKCSCIDQTGQCTCK
ANVEGVHCDRCRPGTFGLDAKNPLGCSNCYCFGATTQCSEAKGLIRTWVTLKPEQTILPL
VDEALQHTTTKGIVFQHPEIVAHMDLVRQELRLEPFYWKLPEQFEGKKLMAYGGKLKYAI
YFEAREETGFSTYNPQVIIRGGTPTHARIIVRHMAAPLIGQLTRHEIEMIEKEWKYYGDN
PRISRSVTREDFLDILYDIHYILIKATYGNIMRQSRISEISMEVAEQGRLTALTPPARLI
ERCDCPPGYAGLSCETCMPGFYRLRSELGSHTPRPTLGTCVPCQCNGHSSLCDPETSKCQ
NCQHHTAGDFCERCALGYYGIVRGLPNDCQQCACPLISSSNNFSPSCVTEGLDDYRCTAC
PRGYAGQYCERCAPGYTGSPSSPGGSCQECECDPHGSLPVPCDSITGLCTCRPGATGQKC
DGCEHWHAREGMECVFCGDECTGLLLGDLARLEQMAMSINLTGPLPAPYKMLYGLENTTQ
ELKHLLSPQRAPERLIQLAEGNLNTLVSEMNELLTRATKVTADGEQTGQDAERTNRRADS
LGEFIKELAQDAEAVNEKAVRLNETLGTPDKAFERNLQGLQKEIDQMMTELRRKNLDKQK
EVAEDELVAAEGLLKKVKKLFGESRGKNEEMEKGLQEILAGYKNKVDDAWDLLREAMEKI
REANRLSAANQKNMTALEKKKEAIERGKQQTENTLKEGSDILDEANRLAGEINSVIDYVE
DIQTKLPPMSEELKDKIDDLAQEIKDRKLAEKVSQAESHAAQLNDSSAVLDGILDEAKNI
SFNATAAFNAYSHIKDSIDEAEKIAQEAKGLAHEATKLATGPQGLLREDAKESLQKSFGI
LNEAKKLANAVKENDDYLNGLITRLENADVRNEDLLRALNDTLGKLSAIPNDTAAKLQAV
KDKARQANDTAKDVLAQIKDLHQNLDGLKKNYNQLADSVAKTNAVVKDPSKNKIIADADA
TVKNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRT
YKPEIKKGSYNNIIVNVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRV
EYPDLTIDDSYWYRIEASRTGRNGTISVRALDGPKASIVPSTYHAASPPGYTILDVDANA
MLFVGGLTGKLKKADAVRVITFTGCMGETYFDSKPIGLWNFREKEGDCKGCTVSPQVEDS
EGTIQFDGEGYALVSRPIRWYPNISTVMFKFRTFSSSALLMYLATRDLKDFMSVELTDGH
VKVSYDLGSGMASVVSNQNHNDGKWKSFTLSRIQKQANISIVDIDTNQEESIATSSSGNN
FGLDLKADDKIYFGGLPTLRNLSMKARPEVNLKKYSGCLKDIEISRTPYNILSSPDYVGV
TKGCSLENVYTVSFPKPGFVELPPVPIDIGTEINLSFSTKNESGIILLGSGGTPAQPRRK
RRQTGQAYYAIFLNKGRLEVHLSMGTRTMRKIVVKPEPSLFHDGREHSVHVERTKGVFTV
QVDEDRRHMQNLTIEQAIEVKKLFVGGAPPEFQPTPLRNIPPFEGCVWNLVINSVPMDFA
QPVSFKNADIGRCAHQKPREDDDGAVPAEIVTQPEPVPTPAFATPTPVVAHGPCAAESAP
ALLIGSKQFGLSRNSHIAIAFDDTKVKNRLTIEFEVRTEADSGLLFYMARINHADFATVQ
LRNGLPYFSYDLGSGDTITMIPTKINDGQWHKIKIMRIKQEGIIYVDDASNRTISPKKAD
ILDVVGMLYVGGLPINYTTRRIGPVTYSIDGCIRNFQMAEAPADLEQPTSSFHVGTCFAN
AQKGTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRTTGVLMGISSQKMDGMGIEMIDEKL
MFHVDNGAGRFTAVYGAGTPGQLCDGQWHKVTANKTRQRIELTVDGHQVEAQSPNQASTS
ADTNDPVFVGGFPDGLNQFGLTTNIRFRGCIRSLRLTKGTGKPLEVNFAKALELWGVQPV
SCPAN
NT seq 9378 nt   +upstreamnt  +downstreamnt
atgccaggagcggcccccgtcctcctggtcctgctgctcagcgggagtctcgggggcggc
cgggcgcagcggccacagcagcaacagcggcagcggcagccccagacacatcagcagaga
ggtttattccctgctgtcctgaatcttgcttctaatgctcttatcaccacaaatgcaaca
tgtggagaaagaggacctgaaatgtattgcaaattggtagagcatgtccctgggcagccc
gtgaggaacccccaatgtcgaatctgcaatcaaaacagcagcaatccattccaaagacac
ccgattacaaatgctattgatggaaagaacacctggtggcagagtcccagtattaagaat
ggaattgaataccattatgtgacaattacactggatttacagcaggtgttccagattgca
tatgtgattgtgaaagcagcaaactcccctcggcctggaaactggattttggaacgctca
cttgataatgttgaatacaagccctggcagtatcatgctgtgacagacaccgagtgtcta
actctctacaatatttatccccgtactggaccgccatcctatgccaaagatgatgaagtc
atctgcacttccttttattccaaaatacaccccttggaaaatggagagattcacatctct
ttgatcaatgggagaccaagtgccgatgacccttctcctgaactgctagaatttacctcc
gctcgctatattcgcctgagatttcagcggatccgtaccttgaatgctgatttgatgatg
tttgctcacaaagacccaagagaaattgacccgattgtcacccgaagatactactactcg
gtcaaggatatttcggtcggagggatgtgcatctgttatggtcatgccagggcttgtcca
cttgatctggtgacaaataaatcccgctgtgagtgtgagcataatacgtgtggtgacagc
tgtgatcagtgctgtccgggattccatcagaaaccttggcgagctgggacgtttctgact
aaaacagaatgtgaagcatgcaattgtcatgggaaagctgaagaatgctactatgatgaa
aatgttgccagaagaaatctgagtttaaatatacatggaaaatacattggagggggtgtg
tgcattaactgcacccagaacactgctggtataaactgtgagacatgcattgatggtttc
ttcagacccaaaggggtatctccaaattatccaagaccatgccagccatgccattgtgat
ccaattggttccttaaatgaagtctgtgtgaaggatgaaaaacgtgctcgacgaggcctg
gcacctgggtcttgtcattgcaaacctggcttcagaggcgtgagctgtgaccgttgtgcc
cggggctacactggctacccagactgcaagccctgtaactgcagtggtgcagggagcaca
aatgaggacccttgctttggaccctgtaactgcaaggagaatgttgaaggtggggactgc
agtcgctgtaaagccggcttcttcaatttgcaagaggataatcataagggttgtgacgag
tgtttctgttccggggtttcagacagatgtcagagctcctactggacctatggccatata
caagacatgcgtggctggtacctgaccgacctctctggccgcgttcaagtgactcccggg
caggatgacttagacccacctcagcagatcagcatcagcgttgaggaagcccaccgggcc
ctgccgcagggctactactggagggcaccagcgccgtatctgggaaacaagcttacagca
gctggaggacaactgacatttaccatatcatataatttcgaagaagaagaagaagataca
gaacgtgtattccagattatggttatcttagagggaaatgacttgagaattagcacggcc
caagatgaggtgtatctgcagccgtatgaagaacatattcacgtgctgtcacttaaagaa
gaattgtttaccatacatggcacaaatttccctgtcagtagaagagagttgatgacagtg
ctcgcgaatttgaagagagtcctcatacagatcacacacagccttgggatgaatgccatc
ttcaggttaggctctgttggccttgaatccgcggtccgctatcctacagacagaagcatt
gcagccgcagtagaggtttgtcagtgccctcctgggtacaccggctcctcctgcgaatct
tgttggcctcggcacagacgaattaacggcactctttttggtggcatctgtgaaccatgt
cattgctttggtcatgcagaatcttgtgatgacatcactggggaatgcctgaactgtaag
aatcacacaggtggcccatactgcaataaatgtcttcctggtttctatggtgatcctact
aaaggaacctctgaagattgtcagccgtgtgcctgtccactcaatatcccatccaataac
tttagcccgacgtgccgtttagaccgaagtctcggattggtctgtgatgcatgccctgtc
gggtacaccggaccacgctgtgagaggtgtgcggaaggctattttggacaaccttctgta
cctggaggctcatgtcagccatgccaatgcaatgacaaccttgacttctccgtccctggc
agctgtgacagcttgtctggctcctgtctgatatgtaagccaggtaccacaggccgcttc
tgtgagctctgtgctgatggatattttggagacgcagttgatgcaaagaactgtcagcct
tgtcgctgtaatgtcaacagctccttctctgagagttgccatgcccggacaggacagtgt
gaatgcaagcctaatgtgcagggacggcggtgtgatgagtgtaagcctgaaacctttggc
ctacaatcaggaaggggctgtgttccctgcaattgcaattcctttgggtctaaatcattc
gactgtgaagagagcggtcagtgttggtgccagcctggagtgtcaggaaagaaatgtgac
cgctgtgcccatggctatttcaacttccaagaaggaggctgcacagcttgtgactgttcc
catctgggaaataattgtgatccaaagactggtcagtgcatttgccctcccaacaccatt
ggagagaaatgttccaaatgtgcacctaatacctggggccacagcatcaccactggttgt
aaggcttgtaactgcagcacagtgggatccttggatttccaatgtaatataaacacgggc
caatgcagctgtcaccccaaattctctggtacaaaatgtacagagtgcaaccgaggtcac
tggaactaccctcactgcagtccctgcgagtgcttccttcccgggactgaggacaggacc
tgtgactcagagactaaaaagtgctcctgtattgatcagactgggcagtgcacctgtaag
gcgaatgtagaaggtgtccactgtgataggtgccggccgggcacatttggacttgacgcc
aagaacccacttggctgcagcaactgctattgcttcggggctactacccagtgctcggaa
gcaaagggactgatccgtacgtgggtgaccctgaaacctgagcagaccatcctgcctctg
gtggacgaggcactgcagcacaccactaccaaaggcatcgtgtttcaacacccagaaatt
gttgcacacatggacctggtaagacaggaactccgtttggaacctttttattggaaactt
ccagaacagtttgaaggaaagaagttgatggcctatggcggcaaactcaagtacgcaatc
tattttgaggctcgggaagagacaggtttctctacctacaatcctcaagttatcattcga
ggtggtacccctacgcatgctagaattatcgtcaggcatatggctgctcctctaattggc
cagttgacacggcatgaaatcgaaatgatagagaaagaatggaaatattatggtgataat
cctcgaatcagtagatctgtgacccgtgaagacttcttggatattctatatgatattcat
tatattcttatcaaggcaacttatggaaatatcatgagacaaagcaggatttctgagatc
tcgatggaggtagccgaacaaggacgcctaacagcactgactcctccagctcgcttgata
gaaagatgtgattgtcccccaggctatgctggtttgtcctgtgagacatgcatgccagga
ttttatcgactgcggtctgagctgggtagccacactcctagaccaactctgggcacctgc
gttccatgtcaatgtaatggacacagcagcctgtgtgaccctgaaacctccaaatgccag
aattgtcaacatcacactgctggtgacttctgtgaacgatgtgctcttggatattatgga
attgtcagaggattgccaaatgactgtcagcaatgtgcttgccctctaatttcttccagc
aacaattttagcccttcttgtgtcacggaaggcctcgatgattaccgctgcactgcctgc
ccacggggatacgcaggccagtactgtgaacggtgtgcccctggctatactggcagcccc
agcagccctggaggttcctgccaagaatgtgagtgtgacccgcatggctcgctgcctgtc
ccctgtgactccatcacaggactctgcacgtgccgccccggagccacagggcagaagtgt
gacggctgcgagcactggcatgcacgcgagggcatggagtgtgtgttttgtggagatgaa
tgcacgggcctccttctcggtgacttggctcgcctggagcagatggccatgagtatcaac
ctcactggcccgctacctgccccgtataaaatgctgtatggcctcgaaaatacgactcaa
gaactaaagcacttgctctcaccccagcgggctccagagagactcattcagttggcagag
ggcaatctgaacacactcgtgtcggaaatgaatgagcttctgaccagggctactaaagtg
acagcagatggcgagcaaactggacaagatgctgagaggaccaacaggagagcagactcc
ttaggagaattcattaaggagcttgcccaggatgcagaagctgtaaatgaaaaagctgta
agactaaatgaaactctaggaactcccgacaaggcctttgagagaaatttgcaagggctt
cagaaagagattgatcagatgatgacagaactgaggaggaaaaatctagataaacaaaag
gaagttgctgaagacgagttagtagctgcagaaggccttctgaagaaggtaaagaagtta
tttggtgagtcccgagggaaaaacgaggaaatggagaagggtctccaggagatattggca
ggctacaaaaacaaagttgatgatgcttgggacttgttgagagaagccatggagaaaatc
agagaggctaatcgcttatctgcagcaaaccaaaaaaacatgactgctttggagaaaaag
aaggaggctattgaacgtggcaagcaacaaactgagaacactttaaaagaaggcagtgac
atacttgatgaagccaaccgtcttgcaggtgaaatcaactcagtcatagattatgttgaa
gacattcaaactaaattgccacccatgtctgaagagcttaaagataaaatcgatgacctt
gcccaggaaataaaggacaggaagcttgctgagaaggtgtcccaggctgagagccatgcg
gctcagttgaatgactcatctgctgtccttgatggaatccttgacgaggctaaaaacatc
tccttcaatgccactgcagccttcaatgcttacagccacattaaggactctattgatgaa
gctgagaaaattgcccaagaagctaaaggtcttgcacacgaagctacaaaactggcaaca
ggtcctcagggtttactaagggaagatgccaaagagtctcttcagaaaagctttgggatt
cttaatgaagccaagaagttagcaaatgccgttaaagaaaatgatgactatctgaatggc
ttaattaccagattagaaaatgcagatgttagaaatgaagatctcctgagagctttgaat
gacactttggggaagctgtcagccattccaaatgacacagctgctaaactgcaagccgtt
aaggacaaagcaagacaagccaacgacacagcaaaagacgtattggcacagattaaagat
ctccaccagaaccttgacggcctgaagaaaaattacaatcaactagcagacagcgtagcc
aaaacaaatgctgtggtaaaagatccttcgaagaacaaaatcattgcagatgctgatgcc
actgtgaaaaatctagaacaagaagctgatcgtctgatagataaactcaaacccatcaag
gaacttgaggataacctaaagaaaaacatttctgagataaaggaactgataaaccaagcc
cggaaacaagctaattctatcaaagtatctgtgtcttccggaggtgactgcattcgaaca
tacaagccggaaatcaagaagggaagctacaataatatcattgtcaacgtaaagacagct
gttgctgacaacctccttttttatcttggaagtgccaaatttattgactttctggctata
gaaatgcgtaaaggcaaagtaagcttcctctgggatgttggatctggagttggacgtgta
gagtatccagatttgactattgatgactcttactggtaccgtattgaggcatcaagaact
gggaggaatgggaccatttctgtgagagccctggatggacccaaagccagcattgtgccc
agtacctaccacgcggcgtctcccccggggtatactatcctcgatgtggatgctaacgcg
atgctattcgttggtggcttgactgggaaactgaagaaggccgatgctgtacgcgtgatt
acattcacaggctgtatgggagaaacatactttgacagcaaacctatagggctgtggaat
ttccgagaaaaagaaggtgactgcaaaggctgtactgtcagtcctcaggtagaagatagt
gaaggaactattcagtttgatggagaagggtatgcattggtcagccgccccattcgctgg
taccccaacatctcaactgtcatgttcaagttcaggacattttcttcaagtgctctcctg
atgtaccttgccacacgagacctgaaagatttcatgagtgtagagctcactgatgggcat
gtaaaagtcagctatgatctgggttcaggaatggcttccgttgtcagcaatcaaaaccat
aatgatgggaaatggaaatccttcaccctgtcaagaattcagaaacaagccaacatatca
attgtagatatagatactaaccaggaggagagcatagcaacttcatcttctggaaacaac
ttcggtcttgacttgaaagcagatgacaaaatatattttggtggtctgccaacactgaga
aacttgagtatgaaagcaaggccagaagtaaatttgaagaaatattctggctgcctcaag
gatattgaaatttcaagaaccccatataacatactcagtagtcctgattatgttggtgtt
accaaaggatgttcactagagaacgtttacacagtcagcttccccaagcctggtttcgta
gagcttccccctgtgccaattgacatagggacagaaatcaacctgtccttcagcaccaag
aacgagtctgggatcattctcttgggaagtggagggacgccagcacaacctcggaggaaa
cgaaggcaaactggacaggcctattacgcgatattcctgaacaagggccgtctggaagtg
catctctccatggggacacgaacaatgaggaaaattgtcgtcaaaccggagccgagtctg
tttcacgacgggagagaacattccgttcacgtcgagagaactaaaggcgtctttactgtt
caagtcgatgaagacagaaggcatatgcaaaacctgaccatagaacaggcgattgaagtt
aaaaagcttttcgttgggggtgccccacctgaatttcaacctaccccactcagaaatatt
cctccttttgaaggctgtgtgtggaaccttgttataaactctgtcccgatggactttgca
cagcctgtatccttcaaaaatgcagacattggtcgttgtgcccatcagaagccccgcgag
gatgacgatggagcagtgccggccgaaatagtgacccagccagagccagtccccaccccc
gccttcgctacacccaccccagttgtggcacatggtccttgtgcggcagaatcagcacca
gctctcttgatagggagcaagcagttcgggctttcgagaaacagccacattgcaatcgcg
tttgatgacaccaaagtgaaaaaccgtctcaccattgagttcgaagtgcgaacagaagct
gactccggcctgcttttttacatggcccgcatcaatcacgctgactttgctaccgtgcag
ctgagaaatggattgccctatttcagttatgacttgggaagtggtgacaccatcaccatg
atccccaccaaaatcaacgacggtcagtggcacaagattaagattatgcgaattaagcaa
gagggaattatttatgtagatgatgcctccaacagaaccatcagtcccaagaaggcggat
atcctggatgttgtgggaatgctgtatgttggcgggctacccatcaactacactacccga
agaattggtccagtgacctacagcattgacggctgcatcaggaatttccagatggcagag
gcccctgctgatcttgaacagccaacctccagcttccatgttggaacatgttttgctaat
gctcagaaaggaacatattttgatggaacaggttttgccaaagcagttggtggattcaaa
gtgggattggaccttcttgtagaatttgaattccgcacaaccagaaccactggagttctt
atgggaatcagcagtcagaaaatggatggaatgggtattgaaatgattgatgaaaagcta
atgtttcacgtggacaatggcgccggccgattcactgcggtctatggtgctgggacccca
gggcagttgtgtgacggacagtggcataaagtcacagccaacaagaccagacagcgcatt
gagctgacagtagacgggcatcaggtggaagcccagagcccaaaccaagcatctacatca
gctgatacaaatgaccctgtgtttgttggtggtttcccggatggcctcaaccagtttggt
ctgacaaccaacattaggtttcgaggttgcattcgatctctgaggctcaccaaaggcaca
ggcaagccactggaggtcaattttgccaaggccctggagctatggggtgttcaacctgta
tcatgcccagccaactaa

KEGG   Ursus arctos horribilis: 113260108
Entry
113260108         CDS       T05909                                 

Gene name
COL1A2
Definition
(RefSeq) collagen alpha-2(I) chain
  KO
K06236  collagen, type I, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04611  Platelet activation
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113260108 (COL1A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113260108 (COL1A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113260108 (COL1A2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    113260108 (COL1A2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113260108 (COL1A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    113260108 (COL1A2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    113260108 (COL1A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113260108 (COL1A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113260108 (COL1A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113260108 (COL1A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113260108 (COL1A2)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113260108 (COL1A2)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 113260108
NCBI-ProteinID: XP_026361636
UniProt: A0A3Q7VKW6
LinkDB
Position
Unknown
AA seq 1367 aa
MLSFVDTRTLLLLAVTSCLATCQSLQEATARKGPAGDRGPRGERGPPGPPGRDGDDGIPG
PPGPPGPPGPPGLGGNFAAQYDPGKGVGLGPGPMGLMGPRGPPGASGAPGPQGFQGPAGE
PGEPGQTGPAGARGPPGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGI
RGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGS
VGPVGPAGPIGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGPRGEVGLPGVSGPVGPPGN
PGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGN
KGEPGSVGPQGPPGPSGEEGKRGPNGEAGSAGPSGPPGLRGSPGSRGLPGADGRAGVMGP
PGPRGATGPAGVRGPNGDSGRPGEPGLMGPRGFPGAPGNVGPAGKEGPMGLPGIDGRPGP
IGPAGARGEPGNIGFPGPKGPSGEPGKAGEKGHAGLAGARGAPGPDGNNGAQGPPGPQGV
QGGKGEQGPAGPPGFQGLPGPAGTAGEAGKPGERGLPGEFGLPGPAGPRGERGPPGESGA
AGPSGPIGSRGPSGPPGPDGNKGEPGVLGAPGTAGPSGPGGLPGERGAAGIPGGKGEKGE
TGLRGDVGNPGRDGARGAPGAVGAPGPAGATGDRGEAGPAGPAGPAGPRGSPGERGEVGP
AGPNGFAGPAGAAGQPGAKGERGTKGPKGENGPVGPTGPVGSAGPSGPNGPPGPAGSRGD
GGPPGATGFPGAAGRTGPPGPSGITGPPGPPGAAGKEGLRGPRGDQGPVGRTGETGAHGP
PGFAGEKGPSGEPGTAGPPGTAGPQGLLGAPGILGLPGSRGERGLPGVSGSVGEPGPLGI
SGPPGARGPPGAVGAPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGA
VGAPGPHGPVGPTGKHGNRGEPGPAGAVGPVGAVGPRGPSGPQGVRGDKGEPGDKGPRGL
PGLKGHNGLQGLPGLAGQHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRIGHPGTVGPAGV
RGSQGSQGPAGPPGPPGPPGPPGPSGGGYDFGYEGDFYRADQPRSPPSLRPKDYEVDATL
KSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVHCDFST
GETCIRAQPENIPAKNWYRNSKVKKHIWLGETINGGTQFEYNVEGVTTKEMATQLAFMRL
LANHASQNITYHCKNSIAYMDEETGNLNKAVILQGSNDVELVAEGNSRFTYSVLVDGCSK
KTNEWGKTIIEYKTNKPSRLPILDIAPLDIGGADQEFRVDVGPVCFK
NT seq 4104 nt   +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcggactttgttgctgcttgcagtaacttcgtgcctagca
acatgccaatctttacaagaggcaactgcaaggaagggcccagctggagatagaggacca
cgtggagaaaggggtccaccaggcccaccaggcagagatggtgacgatggcatcccaggc
cctcctggtccacctggtcctcctggcccccctggtcttggcgggaactttgcagctcag
tacgatcctggaaaaggagttggccttggccctggaccaatgggtttgatgggacctcga
gggccacctggtgcgtctggagctcctggccctcaaggtttccaaggacctgctggtgag
cctggtgaacctggtcaaactggtcctgcgggtgctcgtggtccacctgggcctcctggc
aaggctggggaggatggtcaccctggaaaacctggacgacctggtgagagaggagttgtt
ggaccacagggtgctcgtggtttccctgggactcctggacttcctggcttcaaaggcatt
cggggacacaatggtttggatggattgaagggacagcccggtgctccaggtgtgaagggt
gaacctggtgcccctggtgaaaatggaactccaggtcaaacaggagcccgtgggcttcct
ggtgagagaggacgcgttggtgcccctggtccagctggtgcccgtggaagtgatggaagt
gtgggtcccgtgggtcctgctggtcccattgggtctgccggccctccaggcttcccaggt
gctcctggccccaagggtgaactcggacctgttggtaaccctggtcctgctggtcctgcg
ggtccccgtggtgaagtgggtcttccaggtgtctccggccccgttgggcctcctggtaac
cctggagccaatggcctgactggtgctaagggtgctgctggcctgcccggtgttgccggg
gctcccggcctccctgggccccgtggaattcccggtcctgttggtgctgctggtgctacc
ggtgccagaggactcgtcggtgagcctggtccagctggttccaaaggagagagtggcaac
aagggtgagcccggctctgttgggccccaaggtcctcctggtcccagtggtgaagaagga
aagagaggccccaatggtgaagccggatctgctggcccctctggacctcctgggctgaga
ggaagtcctggttctcgtggtcttcctggagctgatggcagagctggcgtcatgggccct
cccggccctcgtggtgcaaccggccctgctggtgtccgaggtcccaacggagattctggt
cgccctggagagcctggcctcatgggaccccgaggttttcctggtgcccctggaaatgtt
ggcccagctggtaaagaaggtcccatgggcctccctggtattgacggcaggcctggaccg
attggcccagctggagcaagaggagagcctggcaacatcggattccctggacccaaaggc
cccagtggtgaacctggcaaagctggtgagaaaggtcatgctggtcttgctggtgctcgg
ggtgctccaggtcctgatggaaataatggtgctcaggggcctcctggaccacaaggcgtc
caaggtggaaaaggcgaacagggtcctgctggtcctccaggcttccagggtctgcctggc
cctgcaggtacagctggtgaagctggcaaaccaggagaaaggggtctccctggtgaattt
ggtctccctggtcccgctggtccaagaggggagcgtgggccccctggggaaagtggtgct
gctggtccttctggtcctattggaagccgaggtccttctggaccccctgggcccgatgga
aacaagggtgagcctggtgtgcttggtgctccgggcaccgcgggtccatctggtcccggc
ggactcccaggagagaggggtgctgccggcatacctggaggcaagggagaaaagggtgaa
actggtctcagaggtgacgttggtaacccaggcagagatggtgcccgtggcgctcctggt
gccgtaggtgcccctggtcctgctggagccactggtgaccggggtgaagccggtcctgcc
ggtcctgctggtcctgctggtcctcgaggtagccctggtgaacgtggtgaggtcggtccc
gctggccccaatggatttgcgggtcccgctggtgctgctggtcaacctggtgctaaagga
gagagaggaaccaaagggcccaagggtgaaaatggtcctgttggtcccacaggccccgtc
ggatctgctggcccatctggtccaaatggtccccctggtcctgctggaagtcgtggtgat
ggtggcccccctggtgctactggtttccctggtgctgctggacgaactggtcctcctggg
ccctctggtatcaccggccctcctggtccccctggtgctgctggtaaagaaggactccgt
gggcctcggggtgaccaaggtccagtgggccgaacgggagaaacaggtgcacatggtccc
cctggctttgccggcgagaagggtccctctggagagcctggtaccgctggccctcctggc
accgcaggtcctcaaggtctccttggtgctcctggcattctgggtctcccaggctctcga
ggtgaacgtggtctaccaggtgtttctggatctgtgggggaacccggacctctcggcatc
tctggtccacctggggctcgtggtccccccggagctgtgggtgcccctggagtcaacggc
gctcctggtgaagctggtcgtgatggcaaccctgggaacgatggtcccccaggccgcgat
ggtcaacccggacacaagggagagcgcggttaccctggcaacattggacccgttggcgct
gtgggtgcacctggtcctcatggccctgtgggtcccactggcaaacatggaaaccgtggt
gaacctggtcctgctggtgctgtgggtcccgtcggagctgttggtccaagaggtcctagt
ggcccacaaggtgttcgaggtgataagggagagcctggtgacaaggggcccagaggtctt
cctggcttaaagggacacaatgggctgcaaggtcttcctggtcttgctggtcaacatggc
gatcaaggtgcacctggctctgtgggtcctgccggtcctaggggtcctgctggtccttct
ggccccgctggcaaagacggtcgcattggacatcctggtacagtcggacctgctggcgtc
cgtggctctcaggggagccaaggtcctgctggtcctcctggtccccctggtcctcctggc
cctcctggcccaagtggtggtggctatgactttggttatgaaggggacttctacagggct
gaccagcctcgctcaccaccttctctcagacccaaggattatgaagttgatgctactctg
aaatccctcaacaaccagatcgagacccttcttactcctgaaggctctaggaagaaccca
gctcgcacatgccgtgacttgagactcagccaccccgagtggagcagtggttactactgg
attgaccctaaccaaggatgcactatggatgccatcaaagtacactgtgatttctccact
ggtgaaacgtgcatccgggctcaacctgagaacatcccagccaagaactggtacagaaat
tccaaggtcaagaagcacatctggttaggagaaactatcaatggtggtacccagtttgaa
tataatgttgaaggagtaaccaccaaggaaatggccactcaactcgccttcatgcgcctg
ctggccaaccatgcctctcagaacatcacctaccactgcaagaacagcattgcctacatg
gacgaggagactggcaacctgaacaaggctgtcattctgcaaggctccaatgatgttgaa
ctggttgccgagggcaacagcaggttcacctacagcgttcttgtggacggctgctctaaa
aagacaaatgaatggggaaagacaatcattgaatacaaaacaaataagccatcccgcctg
cctatccttgatattgcgcctttggacatcggcggtgctgaccaagagttcagggtggac
gttggcccagtctgtttcaaataa

KEGG   Ursus arctos horribilis: 113260915
Entry
113260915         CDS       T05909                                 

Gene name
COL6A2
Definition
(RefSeq) collagen alpha-2(VI) chain isoform X1
  KO
K06238  collagen, type VI, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113260915 (COL6A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113260915 (COL6A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113260915 (COL6A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113260915 (COL6A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113260915 (COL6A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113260915 (COL6A2)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113260915 (COL6A2)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113260915 (COL6A2)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113260915 (COL6A2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   113260915 (COL6A2)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3
Other DBs
NCBI-GeneID: 113260915
NCBI-ProteinID: XP_026362952
UniProt: A0A3Q7XK34
LinkDB
Position
Unknown
AA seq 1022 aa
MGAAKMLRSPCSALLLWVLLGAVHAQQQEIIAPGNSERNSCPEKADCPINVYFVLDTSES
VTMQSPIDSLLYHMKQFVRQFISQLQDETYLEQVALSWRYGGLHFSDVVRVFSPPDSDRA
SFTKSLESIVSIRKGTFTDCALANMTQEIRQLKSKGGVHFAVVITDGYVTGSPCGGIKLQ
AERAREEGIRIFTVAPDQVPNEQGLRDMASMPLELYRNNYATVRPDLEIDQDTINRIIKV
MKHEAYGECYKVSCLEIPGPPGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEGPIGFP
GPKGVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGSRGPDGYA
GEAGSPGEQGDQGIKGDAGRPGRRGPPGENGAKGSKGYQGNNGSPGSPGVKGAKGGPGPR
GPKGEPGRRGDPGAKGSPGSNGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQ
GALGEPGKQGSRGDPGDAGPRGESGQPGPKGDPGRPGFSYPGPRGTPGDKGEPGPPGPEG
GRGDFGAKGEPGRKGEKGEPADPGPPGEPGPRGPRGSPGPEGEPGPPGDPGLTECDVMTY
VRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETG
TRVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKE
SRRQKTRVFAVVITDGRHDPRDDDLNLRALCNHDVTVTAIGIGDMFHERHESENLYSIAC
DKPQQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPDLPCQTELYVAQCTQRPVDIVFL
LDGSERLGEQNFHKARRFVEEVSRRLTLARRDDDPLNARVALLQFGGPGEQQVEFPLTSN
LTVIHQALEGARYLNSFSHVGAGIVHAINHVVRGARAGARRHAELAFVFLTDGVTGNDSL
EEAVHSMRKQNVVPTVVAVGGDVDTDVLSKISLGDATAVFREKDYDSLAQPGFFDRFIRW
IC
NT seq 3069 nt   +upstreamnt  +downstreamnt
atgggtgctgctaagatgctccgaagtccctgctccgccctcctgctctgggtgctcctg
ggggccgtccatgcccagcagcaggagatcattgcccctggcaactctgagagaaacagc
tgcccagagaaggccgactgccccatcaacgtgtacttcgtgctggacacctcggagagc
gtcaccatgcagtcccccatcgacagcctgctctaccacatgaaacagttcgtgcgtcag
ttcatcagccagctgcaggacgagacctacctggagcaggtggccctgagctggcgctac
ggcggtctgcacttctccgacgtggtgagggtgttcagcccgccggacagcgaccgggcc
tccttcaccaagagcctggagagcatcgtgtccatccggaaaggcaccttcacggactgc
gcgctggccaacatgacccaggagatccgccagctcaagagcaagggtggggtgcacttt
gcggtggttatcaccgatggctacgtcaccggcagcccgtgcgggggcatcaagctgcag
gctgagcgggcccgagaggagggcatccggatcttcactgtggccccagaccaggtcccg
aacgaacagggtctgcgggacatggccagcatgccccttgagctctaccgtaacaactat
gccaccgtgcgtcccgacttggagatcgaccaggacaccatcaaccgcatcatcaaggtc
atgaaacatgaagcctatggagagtgctacaaggtgagctgtctggagatccccgggccc
cccggccccaagggctaccgcggacagaagggcgccaagggcaacatgggcgagccagga
gagcctgggcagaagggacgacagggagacccaggcatcgaaggccccattggattccca
ggacccaagggtgttcctggtttcaaaggagagaagggtgaatttggagcagacgggcgg
aagggggcccctggcctggccggcaagaacgggaccgatggacagaagggcaagctgggg
cgcatcgggcctcctggctgcaagggagatcctgggagtcggggccccgatggatacgca
ggggaagccggcagccccggggagcagggtgaccagggcatcaagggagatgctggccgc
ccaggacgcagaggacccccaggagagaacggggccaaaggaagcaaggggtatcaaggc
aacaacggatccccgggaagtcccggtgtgaaaggagccaagggtggacctgggccccga
ggacccaaaggcgagcctgggcgcaggggggacccaggagccaaaggcagcccgggcagc
aacggccccaagggtgagaagggagaccctggccctgagggtccccggggtctggctgga
gaggttggcaacaaaggagccaagggagaccgaggcttgcctggacccagaggccctcag
ggggctctcggagaacccgggaagcagggatctcggggagaccccggtgacgctggtccc
cgtggagagtcaggacagcccggtcccaagggagaccccggccggcctggattcagctac
ccaggaccccgaggaacacccggagacaaaggcgagcctggcccacctggccctgagggg
ggcagaggtgactttggtgccaaaggagagcccgggaggaaaggagagaagggcgagcct
gcagatcccggtccccctggcgagcccggcccccgggggccaagaggatccccaggaccc
gagggagagcccggcccccctggagaccccggcctcacggagtgtgacgtcatgacctac
gtgagggagacgtgcgggtgttgtgactgcgagaagcggtgtggggccctggacgtggtg
ttcgtcattgacagctccgagagcattggctacaccaacttcaccctggagaagaacttt
gtcatcaacgtggtcaacaggctgggggccattgccaaggaccccaagtcagagacggga
acccgcgtgggcgtggtgcagtacagccacgagggcacctttgaggccatccagctggac
gatgagcgcatcgactcgctgtccagcttcaaggaggccgtcaagaacctggagtggatt
gccggcggcacctggacgccctcagccctcaagtttgcctacaaccagctcatcaaggag
agccggcgccagaagacccgtgtgtttgcggtggtcatcacggacggccgccacgacccc
cgggatgacgacctcaacctgcgggcgctgtgcaaccacgatgtcacggtgacagccatc
ggcatcggcgacatgttccacgagagacacgagagcgagaacctctactccatcgcctgt
gacaagccacagcaggtgcgaaacatgaccctcttctccgatctcgtggccgagaagttc
atcgatgacatggaggacgtcctgtgcccagaccctcagatcgtgtgcccggaccttccc
tgccaaacagagctgtacgtggcccagtgcacgcagcggcctgtggacatcgtcttcctg
ctggacggctccgagcggctgggcgagcagaatttccacaaggcgcggcgcttcgtggag
gaggtgtcccggcggctgacgctggcgcgcagggacgacgacccgctcaacgcgcgcgtg
gccctgctgcagttcgggggccctggcgagcagcaggtggaattcccgctcacctccaac
ctgacggtcatccaccaggcactggaaggcgcgcgctacctcaactccttctcgcacgtg
ggcgcgggcatcgtgcacgccatcaaccacgtggtgcgcggcgctcgggcgggggcgcgg
cgccacgctgagctggccttcgtgttcctcaccgacggcgtcacgggcaacgacagcctg
gaggaggccgtgcactccatgcgcaagcagaacgtggtgcccaccgtggtggccgtgggc
ggcgacgtggacacagacgtgctgtccaagatcagcctgggcgacgcgaccgccgtcttc
cgcgagaaggactatgacagcctggcccagcctggcttcttcgacaggttcatccgctgg
atctgctag

KEGG   Ursus arctos horribilis: 113261138
Entry
113261138         CDS       T05909                                 

Gene name
COL9A1
Definition
(RefSeq) collagen alpha-1(IX) chain isoform X1
  KO
K08131  collagen, type IX, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113261138 (COL9A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113261138 (COL9A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113261138 (COL9A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113261138 (COL9A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113261138 (COL9A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:uah00535]
    113261138 (COL9A1)
Proteoglycans [BR:uah00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   113261138 (COL9A1)
SSDB
Motif
Pfam: Collagen Laminin_G_3 Laminin_G_2 Laminin_G_1 Toxin_R_bind_N
Other DBs
NCBI-GeneID: 113261138
NCBI-ProteinID: XP_026362917
UniProt: A0A3Q7XK05
LinkDB
Position
Unknown
AA seq 921 aa
MKTQWKIPVFFLVCSFLGSWASAAVKRRPRFPVNSNSNGENELCPKVRIGQDDLPGFDLI
SQFQIDKAASRRAIQRVVGSTALQVAYKLGNNVDFRIPTRNLYPSGLPEEYSFLTTFRMT
GGTLEKNWSIWQIQDSSGKEQVGVKINGQTKSVAFSYKGLDGNLQTVTFPNLPSLFDSQW
HKIMIGVERSSATLFVDCNRIQSLPIKPRGQIDVDGFAVLGKLADNPQVSVPFELQWMLI
HCDPLRPRRETCHELPVRITPSETTDQRGPPGEQGPPGPPGPPGVPGIDGIDGDRGPKGP
PGPPGPAGEPGKPGAPGKPGTPGADGLTGPDGSPGSVGPRGQKGEPGVPGSRGFPGRGIP
GPPGPPGAAGLPGELGRVGPIGDPGRRGPPGPPGPPGPSGAIGFHEGDPLCPNSCPPGRS
GYPGLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGELGEVGAQGPPGAQGLRGITGIVGAK
GEKGARGLDGEPGPQGLPGAPGDQGQRGPPGEIGPKGDRGPQGSRGIPGLPGTKGDTGLP
GVDGRDGIPGMPGTKGEPGKPGPPGDAGLQGLPGVPGVPGAKGIAGEKGNTGAPGKPGQL
GNSGKPGQQGQPGEVGPRGPRGLPGSRGEIGPVGPPGPPGKLGSPGSPGLPGLPGPPGLP
GMKGDRGAVGEPGPKGEQGSSGEEGEAGERGELGEIGLPGPKGSVGNPGEPGLRGPEGSR
GLPGVEGPRGPPGPRGVQGEQGATGLPGIEGPPGRAPTDQHIKQVCMRVMQEHLAEMAAS
LKRPDSGASGLPGRPGPPGPPGPPGENGFPGQMGLRGLPGIKGPPGALGLRGPKGDLGER
GERGPPGRGPKGLPGATGLPGEPGPASYGRNGRDGERGPPGVAGIPGVPGPPGPPGPPGF
CEPASCTLQAGQRAFSKGPDQ
NT seq 2766 nt   +upstreamnt  +downstreamnt
atgaaaacacaatggaaaattccagttttcttccttgtgtgcagtttcctgggatcttgg
gcatctgcagctgtcaagcgtcgcccaaggttccctgttaattcaaattcgaatggtgaa
aatgaactctgtccaaaggtcaggattggccaagatgatttaccaggctttgacttgatt
tctcagttccagatagataaagcagcatctagaagagctattcagagggtagtgggatca
actgcattacaagtggcttacaaattgggaaataatgtagacttcaggattccaacaagg
aatttatatcccagtggtctgcctgaagaatactcctttttaactacttttcggatgact
ggaggcacacttgaaaagaactggagcatttggcagattcaggattcctcagggaaggag
caagttggcgtgaagattaatggccaaacaaaatctgttgcattttcatacaagggactg
gatggaaatctccagactgtaacctttccaaatctgccttccttatttgattcccagtgg
cataagatcatgattggtgtggaaaggagtagtgctactctttttgttgattgcaacagg
attcaatccttacctataaagccaagaggccaaatcgatgttgatggctttgctgtgctg
ggaaaacttgcagataatcctcaagtttctgttccgtttgaacttcagtggatgctgatc
cattgtgacccactacgccccaggagagaaacttgccatgagctgccagtccgaataact
cccagtgagaccaccgaccagagaggtcccccgggcgagcaggggcccccggggcccccg
ggcccccctggagttccaggcatcgatggtatcgacggtgaccgaggtccaaagggtccc
ccgggccctccgggtcctgctggagaaccgggcaagccaggagctccgggcaagccgggc
acgccgggcgccgacggattaacaggacctgatggatcccctggttctgttggaccaaga
ggacaaaaaggagaacccggtgttcctggatctcgtggatttccaggccgtggtattcct
gggccccctggtccccctggggcagcaggactccctggagagcttggccgtgttggacca
attggtgaccctgggagaagaggaccacctggcccccctggccctccaggaccgagtgga
gcaattggctttcatgaaggcgatccattgtgtcccaattcctgtccaccaggccgctca
ggatatccaggcttaccaggcatgaggggtcataaaggggctaaaggagaaattggtgaa
cccggaagacagggtcacaagggtgaagaaggtgaccagggggaactgggagaagttgga
gctcaaggacctccgggagctcaaggattacgaggcatcactggcatagttggggccaaa
ggggaaaaaggtgctcggggcttagatggagaacctgggcctcagggtcttcctggtgca
cctggtgatcaaggacagagaggacctccaggagaaataggtcccaagggagatagaggg
cctcaaggttctagaggaattcctggcctccctgggaccaaaggagacacgggcttgcca
ggtgtggatggccgtgacgggatacctggaatgccaggaacaaagggtgaaccggggaag
cctgggcctcctggtgatgcaggattgcagggcttacctggtgtacctggggttcctggt
gcaaagggtattgctggtgaaaagggtaacacaggtgctccagggaagcctggtcagctg
gggaattcaggcaaaccgggccaacaggggcagccaggagaggtgggaccccgaggaccc
cgggggcttcctggcagcagaggagaaataggaccagtgggacctccaggaccaccaggt
aaactgggttctcctggtagtcctggcctccctggcttgcctggcccccctggccttcct
ggaatgaaaggtgacaggggagcagtcggtgaacctgggccaaagggtgaacagggttcc
tccggtgaagaaggtgaagcaggagaaaggggcgaacttggagaaataggattacctggg
ccaaagggatctgtgggtaaccctggggagcccggcttgagagggcctgaaggaagtcgg
gggcttcctggagtggaaggaccaagaggaccgcctggaccacgaggcgtgcagggagag
cagggtgccaccggcctgcctggcatcgagggccctccaggtagagcaccgacagatcag
cacattaagcaggtttgcatgagagtcatgcaagagcatcttgctgagatggctgctagt
ctcaagcgaccggactcaggagcctctggcctccctggtcggcctggcccccctggcccc
ccaggaccacctggagagaatggtttcccaggccagatgggactgcgtggcctcccaggc
attaaaggcccccctggtgctcttggtttaagggggcctaaaggtgacttgggagaaagg
ggggaacgaggccctccaggaagaggtcctaagggcttgcccggagctacaggtctccca
ggtgaaccaggtcctgccagctacgggaggaatggccgggatggcgagcgaggcccccca
ggggtggcaggaattcccggggtacctggccccccgggccctcctggccctcctgggttt
tgtgagccagcctcctgcaccctgcaggctgggcaacgggcctttagcaaagggcccgat
cagtga

KEGG   Ursus arctos horribilis: 113262587
Entry
113262587         CDS       T05909                                 

Gene name
TNN
Definition
(RefSeq) tenascin-N
  KO
K06252  tenascin
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05165  Human papillomavirus infection
uah05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113262587 (TNN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113262587 (TNN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113262587 (TNN)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    113262587 (TNN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113262587 (TNN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113262587 (TNN)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113262587 (TNN)
SSDB
Motif
Pfam: fn3 Fibrinogen_C DUF4998 Pur_ac_phosph_N Interfer-bind EGF_2 EGF_Tenascin
Other DBs
NCBI-GeneID: 113262587
NCBI-ProteinID: XP_026364805
UniProt: A0A3Q7WID8
LinkDB
Position
Unknown
AA seq 1156 aa
MGLQGTFCFPLGILFGSVLLVASAPATLEPHGCSDKEQQVTVSHTYKIDVPKSALVQVET
DPQPLSDDGASLLALGEAEEQNIIFRHNIRLQTPQKDCELAGSVQDLLARVKKLEEEMAE
MKEKCDVQRCCQGAAGGSHHCSGHGTFSPETCTCRCEQGWEGAACERPACPGACSGHGRC
TAGRCECEPPYVGADCAYPACPEDCSGHGVCVRGVCQCHDDFTSEDCSERRCPGDCSGHG
FCDTGECYCEEGFRGLDCAQVVAPQGLQLLKSTEESLLVSWEPSSEVDHYLLSYYPLGQE
LSGKQVQVPKEQHTYEIVGLQPATKYIVTLRNVKKEVSSSPQHLLATTDLAVLGTAWVTD
ETENSLDVEWENPPTEVDYYKLQYGPLTGQEVAEVTVPKSNDPKSRYDITGLQPGTEYKI
TVVPMKGELEGKPILLNGRTEIDSPTNVVTDRVTEDTAVVSWVPVQAVIDKYVVRFTSTD
GDTKDMAVPREQSSTVLTGLKPGEAYRVYVWAEKGNQESKKADTNALTEIDSPTNLATDR
VTEDTATLSWNPVQADIDKYMVRYTSADGDTREVSVGKEQSSTVLTGLRPGVEYTVQVWA
QKGARESKKADTKAPTGNKRKMGHQNCVEYTVQVWAQKGARESKKADTKAPTDIDSPKNL
VTNQVTENTATISWDPVQAVIDRYMVRYTSLDGDTRDIPVGKEQSSTVLTGLRPGVEYTV
QVWAQKGARESKKADTKAPTEIDSPQNLVTNRVTEDTATVSWDPVRAVIDKYTVRYTSAD
GDTREVSVGKEQSSTVLTGLRPGVEYTVQVWAQKGARESKKADTKALTDIDPPRNLHPSA
VTQSGGVLTWTAPSAPIDGYILTYQFPDGTIKELQLGRGDERLELQGLEQGLTYPVSLVA
FKGDRRSKSVSTTLSTVGARFPHPSDCSQVQQNSNVASGLYTIYVHGDTSRPLQVYCDMD
TDGGGWIVFQRRNTGQLDFFKRWRTYVEGFGDPMKEFWLGLDKLHNLTTGTPTRYEVRVD
LQTANESAYAVYDSFQVASSKERYRLSVGKYRGTAGDALTYHNGWKFTTFDRDNDIALSN
CALTHHGGWWYKNCHLANPNGRYGETKHSEGVNWEPWKGHEFSIPYVELKIRPHGYSGEH
VLDRKKRTLGEKSRTF
NT seq 3471 nt   +upstreamnt  +downstreamnt
atgggtctccaggggacattctgcttccccctggggatcctatttggctctgtgctcttg
gtggcgtcagccccagccactctcgaacctcacggctgcagcgacaaggagcaacaggtc
actgtcagccatacctacaagattgacgtgcccaagtccgctctggtccaggtggagact
gaccctcagcccctgagtgatgacggggcctcgctcctggccctgggggaggctgaggaa
cagaacatcatcttcaggcacaacatccgcctgcagacgccgcagaaggactgtgagctg
gctggcagcgtccaagacctcctggcccgggtgaagaagctggaggaggagatggcggag
atgaaggagaagtgtgatgtccagcgttgctgccaaggagctgctggcggcagccaccac
tgcagcggccacgggaccttctcgccggagacgtgcacctgtcgctgcgagcagggctgg
gagggcgccgcgtgcgagcggcccgcctgtcccggagcgtgcagtggccacgggcgctgc
accgccggccgctgcgagtgcgagccgccgtacgtgggcgccgactgcgcctaccccgcc
tgccccgaagactgcagcgggcacggcgtgtgcgtgcgcggcgtgtgccagtgccacgac
gacttcacgtccgaggactgcagcgagcgccgctgccccggcgactgcagcggccacggc
ttctgcgacacgggcgagtgctactgcgaggagggcttccgaggcctcgactgcgcccag
gtggtggctccccagggcctgcagctgctcaagagcaccgaggagtccctgctggtgagc
tgggagccctccagcgaggtggatcactacctcctcagctactaccctctggggcaggag
ctctctgggaagcaggtccaagtgcccaaggagcagcacacctacgagatcgtcggtttg
cagcctgcaaccaagtacatagtcacgctacgcaacgtgaagaaagaggtttccagcagc
ccacaacatctacttgccaccacagatcttgctgtgctcggcaccgcgtgggtgacggat
gagactgagaactcgcttgatgtggagtgggagaaccccccgaccgaggtggactactac
aagctgcagtatggccccctgacagggcaggaggtggctgaggtcaccgtgcccaagagc
aatgaccccaagagccgatatgacatcaccggtctacagcccgggacggaatataagatc
accgttgttcccatgaaaggagaactggagggcaagccgattctcctgaatggcaggaca
gaaattgatagcccaaccaatgtggtcactgatcgagtgacagaggacacagcggtggtc
tcctgggtccccgtccaggctgtcatagataagtatgtggtgcgtttcacctccaccgat
ggggacacgaaggacatggcggtccccagggagcagagcagcaccgtcctgacgggcctg
aagccaggagaggcgtacagagtctacgtgtgggccgagaagggcaaccaggagagcaag
aaggccgacaccaacgccctcacggaaatcgacagcccaacaaacctggccaccgaccgg
gtgacagaggacacggccaccctctcctggaacccagtgcaggctgatatcgacaagtac
atggtgcgctacacgtctgcggacggagacaccagggaggtgtccgtggggaaggagcag
agcagcaccgtcctgacgggcctgaggccaggtgtggagtacacggtccaggtgtgggcc
cagaagggggcccgggagagcaagaaggccgacaccaaggccccaacaggtaacaagaga
aagatgggccatcagaattgtgtggagtacacggtccaggtgtgggcccagaagggggcc
cgggagagcaagaaggccgacaccaaggccccaacagacattgacagccccaaaaacctg
gtgacaaaccaggtgacagagaacacggccaccatctcctgggacccggtgcaggccgtc
atcgacaggtacatggtgcgctacacctctttggacggagacaccagggacattccggtg
ggaaaggagcagagcagcaccgtcctgacgggcctgaggccaggtgtggagtacacggtc
caggtgtgggcccagaagggggcccgggagagcaagaaggccgacaccaaggccccgaca
gaaattgacagcccccaaaacctggtgaccaaccgtgtgacagaggacacagccaccgtc
tcctgggacccggtgcgggcggtcatcgacaagtacacggtgcgctacacgtctgcggac
ggagacaccagggaggtgtccgtggggaaggagcagagcagcactgtcctgacgggcctg
aggccaggtgtggagtacacggtccaggtgtgggcccagaagggggcccgggagagcaag
aaggccgacaccaaggccctgacagacatcgaccctcccagaaaccttcatccatctgcc
gtcacacagtccggaggggtgctgacctggacagcgccctctgctccgatcgatggctac
attctcacctaccagttcccagatggcaccattaaggagttacagcttggaagaggggac
gagaggcttgagttgcaaggccttgagcagggactcacctaccctgtctccttggtcgcc
tttaagggcgatcgccggagcaagagcgtatctaccaccctttccacagttggtgcacgt
tttccacatccttcggactgcagtcaagttcagcagaacagcaatgtcgccagtggtctg
tacaccatctacgtgcacggtgacaccagccggcccctgcaggtgtactgcgacatggac
accgacggaggcggctggattgtcttccagaggcgcaacactgggcagctggatttcttc
aagcgctggcggacctacgtggaaggctttggggaccccatgaaggagttctggcttgga
cttgacaagctacacaatctcaccactggcacccctacccgctatgaggtgagagtggac
ctgcagactgccaacgaatccgcctacgccgtgtatgattccttccaggtggcctccagc
aaggagcggtacaggctctcggttgggaaatacagaggcaccgcaggggacgctcttact
taccacaatggatggaagtttacaacttttgacagagacaatgatattgccctcagcaac
tgtgccctgacccatcatggtggctggtggtacaagaactgtcacttggccaaccccaat
ggcagatacggggagaccaaacacagtgagggggtgaactgggagccgtggaaaggacat
gaattctccattccttatgtggagttgaaaatccgccctcatggctacagcggggagcat
gtcctggacagaaagaagcggacactgggagaaaagtcgagaacgttctga

KEGG   Ursus arctos horribilis: 113262643
Entry
113262643         CDS       T05909                                 

Gene name
LAMB3
Definition
(RefSeq) laminin subunit beta-3 isoform X1
  KO
K06244  laminin, beta 3
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113262643 (LAMB3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113262643 (LAMB3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113262643 (LAMB3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113262643 (LAMB3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113262643 (LAMB3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113262643 (LAMB3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113262643 (LAMB3)
   05145 Toxoplasmosis
    113262643 (LAMB3)
SSDB
Motif
Pfam: Laminin_N Laminin_EGF Laminin_II YlqD F5_F8_type_C NPV_P10
Other DBs
NCBI-GeneID: 113262643
NCBI-ProteinID: XP_026364839
UniProt: A0A3Q7VUR5
LinkDB
Position
Unknown
AA seq 1185 aa
MCGNLRGFPHWLKMRPLLLLCFVWPGLLCAQQACSRGACYPPVGDLLIGRTRFLRASSTC
GLTKPETYCTQYGEWQMKCCKCDSRLPHNYNSHRVENVVSSSGPMRWWQSQNDVNPVSLQ
LDLGKRFQLRDIMMDFKGPMPAGMLIERSSDFGNTWQVYQYLAADCTSAFPRVHQGQPQS
WQDVRCQSLPQRPNGRLDGGKVQLNIMDLASGIPATQSQKIQELAAITNLRVNFTRLAPV
PQRGYYPPSAYYAVSQLRLQGSCFCHGHADRCTPQPGASAGPSATVQVHDVCVCQHNTAG
PNCERCAPFYSNRPWRPADDQDPHECQRCDCNGHSESCHFDPAVFAASQGTNGGVCDNCQ
DHTEGKNCERCQLHYFRNRRPGAPIQETCIPCECDPDGAVPGVPCDPVTGQCVCKEHVQG
ERCDLCKPGFTGLTYANPQGCHRCDCSLLGSRRDMPCDEESGQCLCLPHVVGPKCDQCAP
YYWKLASGRGCEPCACDPHNSLGPQCNQFTGQCPCREGFSGLTCSAAAIRQCPDRTYGDA
GTTCRACDCDFRGTEGPGCDKASGRCLCRPGLTGPRCDQCQRGYCDRYPVCVACHPCFQI
YDADLRGQSLRLGSLRNATTSLRPGLSLDDPGLASRIANAKSKMEQIQAILSRPSVTEQE
VSQVANAIFSIRQTLQGLQPDLPLEEEPSSLSENLKNLDRIFNRLLVLYQSKKEQFERIS
GANPLGAFRVLTAAYQRSSQAAQQMADGSRLLLQLRNSRREAERLEGQLGGGAGVSSPQL
AALRLEMASLPDLTPTINKLCGGSRQTACTPGACPGELCPRDNGTACGPHCRGALPRAGG
AFRMAGQVAQQLQGFDAQLQQTRQMIKAASKAASKVQSDAQRLETQVSASRSQMEEDVRR
MRLFIQQVRDFLSGSDTDAATIQEVSEAVLALWLPTDSATVLRKMNEIQAIAARLPNVDL
VLSQTKKDIARAQRLQMEAEQARSRAHAVEGQVEDVVGNLRQGAVALQEAQDTMQGTSRS
LRLIQERVAEVQQVLGPAEGLVAGMTLQLGDFRARMEELSRRARQQQEQAASARQLTESA
EQRALSAQEGFERIKQKYAELKDQLGRSPMLGVQGSRILSVKTEAEELFGETMEMMDRMK
DMESELLRGSQAIKLRSADLTGLEKHVERIRDHINERVLYYATCK
NT seq 3558 nt   +upstreamnt  +downstreamnt
atgtgcggtaatctcaggggattcccccactggctgaagatgaggccactccttctcctg
tgctttgtctggcccggcctcctgtgtgcccagcaagcctgctcccgtggggcctgctat
ccacctgttggggacctgctcatcgggaggacccggtttctccgagcttcatccacctgc
ggactgaccaagcctgagacctactgcacccagtatggcgagtggcagatgaaatgctgc
aagtgtgactctaggttgcctcacaattacaacagtcaccgagtggagaatgtggtttca
tcctcaggccccatgcgctggtggcagtcacagaatgatgtgaaccccgtctctctgcag
ttagacctgggcaagaggttccagcttcgagacatcatgatggattttaaggggcccatg
cccgctgggatgctgattgagcgctcctcggacttcggcaacacctggcaggtgtaccag
tacctggctgccgactgcacctccgccttcccccgggtccaccagggccagcctcagagc
tggcaggatgttcggtgccagtccctgccccagaggcccaacgggcgcctggatgggggg
aaggttcaacttaacattatggatttagcatctgggattccggccactcaaagtcaaaaa
attcaagagctggcggcaatcacaaacttgagagttaacttcaccaggctggcccctgtg
ccccaaagaggctactacccccccagtgcctactacgccgtgtcccagctgcgtctgcaa
ggcagctgcttctgccacggccacgccgaccgctgcaccccgcagcccggagcctcggcc
ggcccctccgccaccgtgcaggtccacgatgtctgcgtctgccagcacaacactgctggc
cccaactgtgaacgctgtgcacccttctacagcaatcggccctggagacccgcagatgac
caggacccacacgaatgccagcggtgtgactgcaacgggcactcagagagctgtcacttc
gacccagctgtgtttgctgcgagccaggggacaaacggaggcgtgtgtgacaactgccag
gaccacactgagggcaagaactgtgagcggtgtcagctgcactatttccgcaaccggcgt
cccggcgctcccattcaggagacctgtatcccctgcgagtgtgatcccgatggggcagtg
ccgggggttccctgtgacccggtgactgggcagtgcgtgtgcaaggagcacgtgcagggg
gagcgctgtgacctgtgcaagccaggatttacgggactcacctatgccaacccacagggc
tgccaccgctgtgactgcagcctcctggggtctcgtcgggacatgccgtgcgatgaggag
agtgggcagtgcctgtgtctgccccacgtggtgggccccaaatgtgaccagtgcgctccc
tactactggaagctggctagcggtcggggctgtgagccgtgtgcctgtgaccctcacaac
tccctcggcccccagtgcaaccagttcacagggcagtgcccctgtcgagaagggtttagt
ggcctgacgtgcagcgccgcagccatacgccagtgtcccgaccggacgtacggagatgca
ggcacgacatgccgtgcctgtgactgtgacttccggggaaccgagggcccaggctgtgac
aaggcctcgggccgctgcctctgccgccctggcttgaccggaccccgctgtgaccagtgc
caacgaggttactgtgaccgctacccagtgtgcgtggcctgccacccttgcttccagatc
tacgatgccgatctccgggggcagtccctgcgcctcggcagcctccgaaatgccaccacc
agcctacggccgggtctgagcctggacgatcccggcctcgcctcccggattgccaatgca
aagagcaagatggagcagatccaagcaatcctctcccgtccctcggtcaccgagcaggag
gtatcccaggtggccaatgccattttctccatcaggcagaccctccagggcctgcagcct
gatctgcccttagaggaggagccctcgtccctctcggaaaacctgaagaatctggacaga
atcttcaatcgcctcctcgttctgtatcagagcaagaaggaacagtttgaaaggataagc
ggtgcgaatcctttaggggccttccgggtgctgaccgcagcctaccagcggtcctcccag
gctgctcagcagatggccgacggctctcgcctgctcttgcagctcaggaacagccggaga
gaggcagagaggctggagggtcagctgggaggaggggccggagtcagcagtccccagctg
gcggccctgaggctggagatggcttccttgcctgatctgacacccaccatcaacaagctc
tgcgggggctccaggcagacagcctgtaccccaggagcgtgccctggagagctgtgtccc
cgagacaatggcacggcatgtggccctcactgcagaggtgccctccccagggcgggtggg
gccttccggatggcggggcaggtggcccagcagctgcagggtttcgatgcccagctccag
cagacccggcagatgatcaaggccgcctcgaaggccgcctcgaaggtgcagtcagatgcc
cagcgcctggagacccaggtgagcgccagccgctcccagatggaggaagacgtcaggcgc
atgcggctcttcatccagcaggtgcgggacttcctgtcaggctctgacaccgatgcggcc
accatccaggaggtcagcgaggccgtgctggccctgtggctgcccacagactcggccacg
gtcctgcggaagatgaacgagatccaggccattgctgccaggctacccaacgtggacctg
gtgctgtctcagaccaagaaagacattgctcgggctcagaggctccagatggaggccgag
caggccaggagccgggctcacgccgtggagggccaggtggaggacgtggtggggaacctt
cggcagggcgcggtggcgctgcaggaagcccaggacaccatgcaaggcaccagccgatcc
ctccggcttatccaggagagggttgctgaggttcagcaggtcctggggccggcggaagga
ctggtggccggaatgaccttgcagctgggtgacttccgggcgcggatggaggagctcagc
cgcagggcgaggcagcagcaggagcaggcagcgtcggcccggcagctcacagagagcgct
gagcagcgagcgctgagcgcccaggagggattcgagagaataaagcaaaaatatgctgag
ttgaaggaccagttgggtaggagccccatgctgggggttcagggcagccggatcctgagc
gtcaagacggaggcagaggagctgtttggggagaccatggagatgatggacaggatgaaa
gacatggagtcagagctgcttcgggggagccaggccatcaagctccgctcagcagacctg
acggggctggagaagcacgtggagcggatccgcgaccacatcaacgagcgagtgctctac
tacgccacctgtaagtga

KEGG   Ursus arctos horribilis: 113262733
Entry
113262733         CDS       T05909                                 

Gene name
LAMC1
Definition
(RefSeq) laminin subunit gamma-1
  KO
K05635  laminin, gamma 1
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05020  Prion diseases
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113262733 (LAMC1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113262733 (LAMC1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113262733 (LAMC1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113262733 (LAMC1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113262733 (LAMC1)
  09164 Neurodegenerative disease
   05020 Prion diseases
    113262733 (LAMC1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113262733 (LAMC1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113262733 (LAMC1)
   05145 Toxoplasmosis
    113262733 (LAMC1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 113262733
NCBI-ProteinID: XP_026364982
UniProt: A0A3Q7XQ09
LinkDB
Position
Unknown
AA seq 1608 aa
MRGSERAAPPLRPQGRLWPVLAVLAAAAAGCARAAMDECTDEGGRSQRCMPEFVNAAFNV
TVVATNTCGAPPEEYCVQTGVTGVTKSCHLCDAGQPHLQHGAAFLTDYNNQADTTWWQSQ
TMLAGVQYPNSINLTLHLGKAFDITYVRLKFHTSRPESFAIYKRTREDGPWIPYQYYSGS
CENTYSKANRGFIRTGGDEQQALCTDEFSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQ
EWVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYAISDFAVGGRCKCNGHASECVKNEFD
KLVCNCKHNTYGVDCEKCLPFFNDRPWRRATAESASECLPCECNGRSQECYFDPELYRST
GHGGHCTNCQGNTDGANCERCRENFFRLGNNEACSPCHCSPVGSLSTQCDSYGRCSCKPG
VMGDKCDRCQPGYHSLTEAGCRPCSCNPSGSVDECNVETGRCVCKDNVEGFNCERCKPGF
FNLESSNPRGCTPCFCFGHSSVCTNAVGYSVYPITSTFQIDEDGWRVEQRDGSEASLEWS
SARQDITVISDSYFPRYFIAPAKFLGKQVLSYGQNLSFSFRVDRRDTRLSAEDLVLEGAG
LRVSVPLIAQGNSYPSETTVKYVFRLHEATDYPWRPPLTPFEFQKLLNNLTSIKIRGTYS
ERSAGYLDDVTLTSARPGPGVPATWVESCTCPVGYGGQFCEMCLSGYRRETPSLGPYSPC
VLCTCNGHSETCDPETGVCNCRDNTAGPHCERCGDGYYGDSTSGSSSDCQPCPCPGGSSC
AVVPKTQEVVCTSCPTGTTGKRCELCDDGYFGDPLGKNGPVRLCRLCQCNDNIDPNAVGN
CNRLTGECLKCIYNTAGFYCDRCKDGFFGNPLAPNPADKCKACACSPYGTAKQQSGCNPV
TGQCECLPHVTGRDCGACDPGFYNLQSGQGCERCDCHALGSTSGQCDIRTGQCECQPGVT
GQHCERCEVNHFGFGPEGCKPCDCHSEGSLSLQCKDDGRCECKQGFVGNRCDQCEENYFY
NRSWPGCQECPACYRLVKDKVADHRVKLQELENLIANLGTGEEVVTDQAFEDRLKEAERE
VMDLLREAQGVKDVDQNLMDRLQRVNNTLSSQISRLQNIRNTIEETGNLAEQARARVEST
EQLIEIATRELEKAKIAVANVSITQPESTGDPNNMTLLAEEARKLAERHKQEADDIVRVA
KTANETSTEALNLLLRTLAGENQTALEIEELNRKYEQAKNMSQDLEKQAARVHEEAKRAG
DKAVEIYASVAQLTPVDSEALENEANKIKKEAEDLDRLIDQKLKDYEDLREDMRGKELEV
KKLLEKGKTEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKDFDRRVNDNK
TAAEDALRRIPAINQTIIEANEKTREAQLALGNAAADATEAKSKAHEAERIASAVQKNAT
STKAEAERTFAEVTDLDNEVSSMLKQLQEAEKELKRKQEDADQDMMMAGMASQAAQEAEI
NARKAKNSVTSLLHLINDLLEQLGQLDTVDLNKLNEIEGTLNKAKDEMKVSDLDRKVSDL
ENEARKQEAAIMDYNRDIEEITKDIRNLEDIKKTLPSGCFNTPSIEKP
NT seq 4827 nt   +upstreamnt  +downstreamnt
atgaggggcagcgagcgggccgcgcctccgctgcggccccaggggcggctctggccggtg
ttggcggtgttggcggccgctgcggccggctgcgcccgggcagccatggacgagtgcacg
gacgagggcgggcggtcgcagcgctgcatgcccgagttcgtcaacgccgccttcaacgtg
accgtggtggccaccaacacgtgcggggctccgcccgaggagtactgtgtgcagaccggg
gtgaccggggtcaccaagtcctgtcacctgtgcgacgccgggcagccccacctgcagcac
ggggcagccttcttgaccgactacaacaaccaggccgacaccacctggtggcaaagccag
accatgctggccggggtgcagtaccccaactccatcaacctcacgctgcacctgggaaaa
gcttttgacatcacatacgtgcgcctgaagttccacaccagccgcccggagagcttcgcc
atttacaagcgcacgcgggaggacgggccctggattccttaccagtactacagcggctcc
tgtgagaacacctactcgaaggcaaaccgcggcttcatcaggacaggaggagacgagcag
caggccttgtgcacagatgaattcagtgacatttcccctctaaccgggggcaacgtggcc
ttttcaacactagaaggaaggcccagcgcctacaactttgacaacagccctgtgctgcag
gaatgggtaactgccactgacatccgagtaactctcaatcgcctgaacacctttggagat
gaagtgtttaacgaccccaaagttctcaagtcctattattatgcaatctctgattttgct
gtgggtggtaggtgtaaatgcaacgggcatgcaagtgagtgtgtgaagaatgaatttgac
aagctggtgtgtaattgcaaacataatacttacggggtagactgtgaaaagtgtctgcct
ttcttcaacgaccggccgtggaggagggcaactgccgagagcgccagcgaatgcctgccc
tgtgagtgtaatggccgatcacaggagtgctactttgaccccgaactataccggtccacc
ggccacggcggccactgtaccaactgccagggtaacacagatggcgccaactgcgagaga
tgccgggagaatttcttccgccttgggaacaacgaagcctgctctccctgtcactgtagt
cctgtgggttctctcagcacacagtgtgatagctatggtaggtgtagctgcaagccgggc
gtgatgggggataagtgtgaccgttgccagcccggataccattctctcacggaggcaggg
tgcaggccatgctcttgtaatccttctggcagcgtagatgaatgtaatgttgaaacagga
agatgtgtttgcaaagacaacgttgaaggcttcaattgtgagagatgcaaacctggattt
tttaatctggaatcatctaatcctaggggttgcacaccctgcttctgcttcgggcattct
tctgtctgtacaaatgctgttggctacagtgtttatcctataacctctacctttcagatc
gatgaagacgggtggcgtgtggaacaaagggatggctctgaagcgtcacttgagtggtcc
tctgcgaggcaggatatcaccgtcatctcagatagctactttcctaggtacttcattgcc
cccgcgaagttcttgggcaagcaggtgttgagttatggtcagaacctctccttctccttt
cgcgtggacaggcgagacactcgcctctccgcagaagacctagtgctcgagggagctggc
ctgagagtgtctgtgcccttgatcgctcagggcaattcctatccgagtgagaccactgtg
aagtatgtcttcaggctccatgaagccacagattacccttggaggcctcctcttacccct
ttcgaatttcagaagctcctaaacaatttgacctctatcaaaatccgtgggacgtatagt
gagagaagtgccggatatctggatgatgtcaccctaacaagtgctcgcccggggcccggg
gtgcctgccacttgggtggagtcctgcacctgtcctgtgggatacggagggcagttttgt
gagatgtgcctctcgggttacagaagagaaactccgagtctcgggccgtacagcccgtgt
gtgctttgtacctgcaatgggcacagtgagacctgtgaccccgagacaggtgtgtgtaac
tgcagagacaacacagccggcccccactgtgagaggtgcggcgatgggtactatggcgat
tccacctcgggcagctcgtccgactgccagccctgcccgtgccccgggggctcgagctgc
gccgtcgttccaaagacgcaggaggtggtgtgcaccagctgtcctaccggcaccaccggt
aaaaggtgtgagctctgtgacgatggctactttggagaccctctgggtaaaaacggccct
gtgaggctttgccgcctttgtcaatgcaatgacaacatcgaccccaacgcggttggaaat
tgcaatcgcttgacaggagaatgcctgaaatgcatctataacactgctggcttctactgt
gaccgatgcaaagacggattttttggaaatcccctggctcccaatccagcagacaaatgc
aaagcctgtgcgtgcagtccctacggaaccgcgaagcagcagagcggctgtaaccctgtg
accgggcagtgcgaatgtctgccccacgtgaccggccgggactgtggagcctgtgacccc
ggattctacaacctgcagagtgggcagggctgtgagaggtgcgactgccatgcgttgggt
tctaccagtggacagtgtgacatccgcactggccagtgtgagtgccagcctggcgtcacg
ggccagcactgtgagcgctgtgaggtcaaccactttgggtttggacccgaaggctgcaaa
ccctgtgactgtcattctgagggatctctctcactccagtgcaaagatgatggtcgctgt
gaatgtaaacagggttttgtgggaaatcgctgtgaccagtgtgaagagaactatttctac
aatcggtcttggcctggctgccaggagtgtccagcatgttaccggctggtaaaggataag
gttgctgatcatcgagtgaaactccaggaattagagaacctcatagcaaaccttggaact
ggggaagaggtggtgacagatcaagcctttgaggacagactaaaggaagcagagagagaa
gttatggacctccttcgagaggcccagggtgtcaaagatgtagaccagaatttgatggac
cgcctccagagagtgaataacaccttatccagccaaattagccgtttacagaatatccgc
aataccattgaagagactggaaacttggctgagcaagcacgtgccagggtggaaagcaca
gagcagttgattgaaatcgcgacccgagaacttgagaaagcaaagattgctgttgccaac
gtgtcaatcactcagccagagtctacaggggacccaaacaacatgactcttctggcagaa
gaggcacgaaagcttgctgaacgccataaacaagaagctgatgatattgtaagagtggcg
aagacagctaatgagacatcaactgaggcacttaatctgcttctgagaacactggcagga
gaaaatcaaacagcacttgagattgaagagcttaatagaaagtatgagcaagcgaagaac
atgtcacaggacctggagaagcaagctgcccgggtacacgaggaggccaagagggctgga
gataaagcggtggagatctatgccagcgttgcccagctgacgcccgtggactcggaagct
ctggagaatgaggctaataaaatcaagaaggaagctgaagatctggaccgtctgattgac
cagaaattgaaagattatgaggacctcagagaagacatgagggggaaggaacttgaagtc
aagaagcttctggagaagggaaagaccgaacagcagactgctgaccagctcctagcccgg
gcagatgccgccaaggctcttgcagaagaagctgcaaaaaagggacgaaacaccttacag
gaagccaatgacattctcaacaacctgaaggacttcgacaggcgagtgaatgataacaag
accgccgcagaggacgctttaaggagaatccctgccatcaaccagaccataattgaagcc
aatgaaaagacgagggaggcgcagctggcgctgggcaatgctgccgcggacgccactgag
gccaagagcaaggcccacgaggcggagaggatcgccagcgccgtccagaagaacgctacc
agcaccaaggcagaagccgaaaggacttttgcagaagttacagacctggataacgaggtg
agcagtatgttgaaacagctacaggaagcagaaaaggaactgaagagaaagcaagaggac
gccgaccaggatatgatgatggcagggatggcttcacaggctgctcaggaagccgagatc
aatgccagaaaggccaaaaactccgtgaccagcctcctccacctgattaatgaccttctg
gagcagctggggcagctggacacagtagacctgaacaagctcaatgagatcgaaggcacc
ctgaacaaagccaaagatgaaatgaaagtcagtgatctcgacaggaaagtgtctgatttg
gagaatgaagccaggaagcaggaggccgccatcatggactataacagagacatcgaggag
atcacgaaggacattcgcaacctggaagacatcaagaagaccttaccatctggctgcttc
aacaccccgtccatcgaaaagccctag

KEGG   Ursus arctos horribilis: 113262734
Entry
113262734         CDS       T05909                                 

Gene name
LAMC2
Definition
(RefSeq) laminin subunit gamma-2
  KO
K06246  laminin, gamma 2
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113262734 (LAMC2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113262734 (LAMC2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113262734 (LAMC2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113262734 (LAMC2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113262734 (LAMC2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113262734 (LAMC2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113262734 (LAMC2)
   05145 Toxoplasmosis
    113262734 (LAMC2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_B
Other DBs
NCBI-GeneID: 113262734
NCBI-ProteinID: XP_026364983
UniProt: A0A3Q7WTA4
LinkDB
Position
Unknown
AA seq 1191 aa
MPAPWLSGCLCLALLLPAARATSRREVCDCNGKSRQCVFDAELHRHTGNGFRCLNCNDNT
DGPRCERCKDGFYRQRERDRCLPCNCHTKGSLSARCDHSGRCSCKPGVTGDRCDRCLPGF
HSLTDAGCAQDQRLLDPKCDCDPAGVSGPCDEGRCVCKPAVTGERCDRCRPGYYHLDGGN
PEGCTQCFCYGHSANCHRSGDYSVHKITSTFYQDVDGWKAIQRNGSPAKLQWSQRHRDVF
SSARRSDPVYFVAPAKFLGNQQVSYGQTLSFDYRVDRGGRHPSAHDVILEGAGLRVTAPL
MPLGKTLPCGITKTYTFRLNERPSSHWSPQLSYFEYRRLLRNLTALRIRATYGEYSTGYL
DNVTLISARPISGAPAPWVEQCVCPVGYKGQFCQDCASGYKRDSARLGPFGACIPCNCQG
GGACDPDTGDCYSGDENLDIECADCPIGFYNDPHDPRSCKPCPCHNGFSCSVMPETEEVV
CNNCPHGVTGARCELCADGYFGDPFGERGPARPCQPCQCNNNVDPSASGNCDRLTGRCLK
CVHNTAGVHCDQCKAGYFGDPLAPNPADKCRACNCNPAGSEPGECRSDGSCVCKPGFGGP
SCDHAALTNCPACYSQVKIQMDQFMQQLQSLEALVSRAQGGGGAMPDAEMEGRMRQAEQA
LQDILQEAQISEGASKSLSLQLGKARSEEKSYWHRLNDLKMTVERMRALDSQYQNRVQDD
RRLVSQMHLSLEESEASLRRTNIPTSEHYVGPNGFKSLAQEATRLADSHVGSASNMERLV
RETEDYSKQALSLAHKALREGGGGGSLDSSAVQGLMGKLEKAKSLAQRLSGEATQTDIEA
DRSYQHSLRLLDSVSQLPGVNDQSFQAEVKRIKQKTDSLSSLVTQRMDEFKHVQDNLRNW
EEETQQLLQNGKNGRQTSDQLLSRANLAKSRAQEALSMGNATFYEVENILKNLREFDLQV
EDRKAEAEEAMKRLSYISQKVADASDKTKQAETALGGAAADAQRAKTAAREALEITDKIE
QEIGSLNLEANVTADGALAMEKGLATLKSEVREVEGELTRKEQEFDLDLDAVQMVITEAQ
RVDNRAKNAGATIQDALDTLDSILHLISQPGNVDEEKLISLEQKLFRAKTQINSQLRPLM
SELEERARRQQGHLRVLETSIDGILADVKNLENIRDNLPPGCYNTQALEQQ
NT seq 3576 nt   +upstreamnt  +downstreamnt
atgcctgcgccctggctgagcggctgcctctgcctggcgctcctcctgcctgcagcccgg
gccacctccaggagggaagtctgtgactgcaacgggaagtccaggcagtgcgtctttgat
gccgagctccacagacacacagggaatggattccgctgcctgaactgcaacgacaacacg
gatggccctcgctgcgagcggtgcaaggacgggttctaccgacagagggagagggaccgc
tgcctgccctgcaactgtcacaccaaaggttctcttagcgctcgatgtgaccactctgga
cggtgcagctgtaagccaggtgtgaccggagacaggtgtgaccgctgtctgccaggcttc
cactcgctcactgacgccgggtgcgcccaagaccaacggctactagaccccaagtgtgac
tgtgacccagctggcgtctcggggccctgtgacgagggccgctgtgtctgcaagccagct
gtcaccggagagcgctgtgacaggtgtcgaccaggttactatcacctggatgggggaaac
cctgagggctgtacccagtgtttttgctatgggcattcagccaactgccacagatctgga
gactacagtgtccataaaatcacctctactttctatcaagatgttgatggctggaaggct
atccaaagaaatgggtctcctgcaaagctccagtggtcccagcgtcatcgggacgtgttc
agctcagcacgacgatcagaccctgtctattttgttgctcctgccaaatttcttgggaat
caacaggtgagctatgggcaaaccctgtcttttgactaccgtgtggacaggggaggcaga
cacccatccgcccacgatgtgatcctggaaggtgctggtctacgggtcacagctcccttg
atgccactgggcaagacgctgccttgtgggatcaccaagacttacacgttcagattaaat
gaacgtccaagcagtcactggagcccccagctgagttacttcgagtaccgcaggttactg
cgtaacctcacagccctgcgaatccgcgccacttacggagaatacagcactgggtacctg
gacaacgtgaccctgatttcagcccgccccatctccggagccccagcaccctgggtcgaa
caatgcgtatgtcccgttggctacaagggacagttctgccaggattgtgcttccggctac
aaaagagattctgccagactgggaccttttggtgcttgtattccgtgtaactgccagggg
ggaggagcctgcgatccagacacaggagactgttattcaggggatgaaaacctcgacatc
gagtgtgccgactgccccatcgggttctacaacgacccacacgacccccgcagctgcaag
ccgtgtccctgccacaatgggttcagctgctctgtcatgccggagacagaggaggtggtg
tgcaataactgtccccacggggtcacgggtgcccgctgtgagctctgtgccgatggctac
tttggggacccctttggggaacgtggcccagcgaggccttgtcagccctgtcagtgcaac
aacaatgtggaccccagcgcctccgggaactgtgaccgcctgacaggcaggtgtctgaag
tgcgtccacaacacggcgggtgtccactgcgaccagtgcaaagcaggctactttggcgac
cctttggctcccaacccagcggacaagtgtcgagcctgcaactgcaacccagcgggctca
gagcctggggaatgtagaagtgatggcagctgtgtttgcaagcccgggtttggcggcccc
agctgtgaccatgcggcactaaccaactgtccagcttgctacagtcaagtgaagattcag
atggaccagtttatgcagcagctccagagcctggaggccctggtttcaagggctcagggt
ggtggcggagcgatgcccgacgcagagatggagggcagaatgcgacaggctgaacaggcc
cttcaggacattctgcaagaagcccagatttcagaaggtgctagtaaatctctcagtctc
cagttgggcaaggccaggagcgaagagaagagctactggcaccgcctcaatgacctcaag
atgactgtggaaagaatgcgggccctggacagtcagtatcagaaccgagttcaggacgat
cgcagactggtctctcagatgcacctgagcctggaggagagcgaggcctccctgagaaga
accaacattcctacctcagagcactatgtggggccaaatggtttcaaaagcctggcccag
gaggccacgagattggcagacagccatgttgggtcagccagtaacatggagcggctggta
agggaaaccgaggactattccaaacaggcgctgtcactggcacacaaggctctgcgggaa
ggaggcggcgggggcagcctggacagctccgcggtgcaagggcttatgggaaagttggag
aaagccaagtccctggcccagcgactgtcaggggaggccactcaaactgacattgaagca
gacaggtcttaccagcatagtcttcgccttctcgattctgtgtctcagctgcctggagtc
aatgatcagtcttttcaggcagaagtgaagaggatcaaacaaaaaactgattctctctca
agcctggtgacccagcgtatggatgagttcaagcatgtgcaggacaatctgagaaactgg
gaagaagaaacccagcagctcttacagaatggaaagaatgggagacagacatcagatcag
ctgctgtcccgtgccaaccttgctaaaagcagagcccaggaagcactaagtatgggcaat
gccactttttatgaagttgagaacatcctgaagaacctcagagagtttgacttgcaggtt
gaagacaggaaagcagaagcagaagaggccatgaagagactctcctacatcagccagaag
gtcgcagatgccagtgacaagaccaagcaagcagaaacagccctgggcggtgctgctgcc
gacgcccagagggcaaagaccgcagccagggaggccctggagatcactgacaagatagaa
caggagataggaagtctgaacttggaagccaatgtgacagcagatggagccttggccatg
gagaagggactggccactctgaagagtgaggtgagggaagtggaaggagagctgaccagg
aaggagcaagagtttgacctggatctggacgcggtgcagatggtaattacagaagcccaa
agagttgataacagagccaagaatgccggagctacgatccaagacgcactcgacacactg
gacagcatcctacacctcataagtcagcctggcaatgtggatgaagagaagctgatctca
ttagagcagaagctcttccgggccaagactcagatcaacagccagctgaggcccttgatg
tcagagctggaagagagagcacgtcggcagcagggccacctccgtgtgctggagacgagc
atagatgggattctggctgacgtaaagaacctggagaacatcagggacaacctgccccca
ggctgctacaacacccaggcgctggagcaacagtga

KEGG   Ursus arctos horribilis: 113262794
Entry
113262794         CDS       T05909                                 

Gene name
TNR
Definition
(RefSeq) tenascin-R
  KO
K06252  tenascin
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05165  Human papillomavirus infection
uah05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113262794 (TNR)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113262794 (TNR)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113262794 (TNR)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    113262794 (TNR)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113262794 (TNR)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113262794 (TNR)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113262794 (TNR)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_Tenascin EGF_2 Pur_ac_phosph_N Uroplakin_II EGF
Other DBs
NCBI-GeneID: 113262794
NCBI-ProteinID: XP_026365083
UniProt: A0A3Q7WTJ1
LinkDB
Position
Unknown
AA seq 1358 aa
MGVEGETVVLKSMLIGVNLILLGSLLKPSQCHLEVTTERVRRQAVEEEGGVANYSASGKE
RPMVFNHVYNINVPLDSLCSSGLEASAEQEVSAEDEALAEYTGQTSDHDSQVTFTHRINL
PKKACPCAGSARVLQELLSRIEMLEREVSLLRDQCASNCCPESAATGQLDYIPHCSGHGN
FSLQSCGCICNEGWFGKNCSEPYCPLGCSSRGVCVDGQCVCDSEYSGGDCSELRCPTDCS
SRGLCVDGECVCEEPYTGEDCSELRCPGDCSGKGRCANGTCFCQEGYVGDDCSQRRCPNA
CSGRGHCQEGLCFCEDGYQGPDCSAVAPPEDLRVAGISDRSIELEWDGPMAVTEYVISYQ
PTALGGLQLQRRVPGDWTGVTITELEPGLTYNISVYSVISSILSLPITAKVATHLSTPQG
LHFKTITETTVEVQWEPFSFSFDGWEISFIPKNNEGGVIAQLPSDVTSFNQTGLKPGEEY
IVNVVALKEQARSPPTSASVSTVIDGPTQILVRDVSDTVAFVEWTPPRAKVDFILLKYGL
VGGEGGKTTFRLQPPLSQYSVQALRPGSRYEVWVSAVRGTNESQSTTTQFTTEIDAPKNL
RVGSHTATSLDLEWDNSEAEVQEYKVVYSTLAGEQYHELLVPKSIGPTTRATLTDLVPGT
EYGVGISAVMNSQQSVPATMNARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITF
TPSSGIASEVTVPKDRTSYTLTDLEPGAEYIISITAERGRQQSFESTVDAFTGFRPISHL
HFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMTEVSLDATKRHTVLMGLQPATEY
IVNLVAVHGTVTSEPIVGSITTGIDPPKDITISNVTKDSVMVSWSPPVASFDYYRVSYRP
TQVGRLDSSVVPNTVTEFTITKLYPATEYEISLNSVRGREESERVCTLIHTAMDNPVDLT
ATNITPTEALLQWKAPVGGVENYVIILTHFAVAGETILVDGGSEEFQLAGLLPSTHYTVN
MYATSGPLTSGTVSTNFSTLLDPPANLTASEVTRQSALISWQPPRAEIENYILTYKSTDG
SRKELIVDAEDTWIRLEGLSESTDYTVLLQAAQDTSRSSLTSATFTTGGRVFPHPQDCAQ
HLMNGDTLSGVYTIFLNGELSRKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWAEYRVG
FGNLEDEFWLGLDNIHRITSQGRYELRVDMRDGQEAAFAYYDKFSVEDGRNLYKLRLGGY
NGTAGDSLSYHQGRPFSTEDRDNDVAVTNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQG
INWYHWKGHEFSIPFVEMKMRPYKHGLTAGRTRRSLRF
NT seq 4077 nt   +upstreamnt  +downstreamnt
atgggggtggagggggaaacggtggtcctgaagagcatgctcattggcgtcaacctgatc
cttctgggctccctgctcaagccctcccagtgtcacctggaggtcaccactgaaagggtc
cggagacaggcggtggaggaggaaggaggtgtggccaactacagcgcatccggcaaagag
cggccaatggtcttcaaccacgtgtacaacattaacgtgccgctggacagcctttgctcc
tccgggctggaggcctcggccgagcaggaggtgagcgccgaagacgaggcgctggcagag
tacaccgggcagacctcggaccacgacagccaggtcaccttcacccacaggatcaacctc
cccaaaaaggcttgcccgtgtgccgggtcggcccgggtgctgcaggagctgctgagccgg
atcgagatgctggagagggaggtgtctctgctccgggaccagtgcgccagcaattgctgc
ccagaaagcgcggccacaggacaactggactacatccctcactgcagtggccatggcaac
tttagcctccagtcctgcggctgcatctgcaatgaaggttggttcggcaagaactgctcc
gagccctattgccccctgggctgctccagccggggcgtgtgtgtggacggccagtgtgtc
tgtgacagcgagtacagcgggggcgactgctccgagctccggtgcccgacagactgcagc
tcccgggggctctgcgtggatggagagtgtgtctgtgaagagccctacacaggcgaggac
tgcagcgagctgaggtgccctggggactgttcagggaaggggagatgcgccaatggcacc
tgcttctgccaggagggctacgtgggggacgactgcagtcagcggcggtgcccgaacgcc
tgcagcgggcgaggacactgccaggaggggctctgcttctgcgaagacggctaccagggc
cccgactgctcggcagttgcccctccagaggacttgcgagtggctggtatcagcgacagg
tccattgagctggaatgggacgggccgatggcagtgacggaatatgtgatctcttaccag
ccgacggccctggggggcctccagctccagcggcgggtacctggagattggactggtgtc
accatcacggagctggagccaggtctcacctacaacatcagcgtctactctgtcattagc
agcatcctcagccttcccatcactgccaaggtggccacccatctctccactcctcaaggg
ctacacttcaagaccatcacagagaccaccgtggaggtgcagtgggagcccttctcattc
tccttcgatgggtgggagatcagcttcattccaaagaataatgaaggaggggtgattgct
cagctccccagtgacgttacatccttcaatcagacaggactaaagcctggggaggaatac
attgtcaatgtggtggctctgaaagagcaagcccggagcccccctacctcggccagcgtc
tctactgtcattgatgggcccacacagatcctggttcgagatgtctcggacactgtggcc
ttcgtggagtggaccccacctcgagccaaagtcgatttcattctcttgaagtatggcttg
gtgggcggggaaggcgggaagaccacttttcggctacagcctcccctgagccaatactca
gtacaggccctgcggcccggctcccgctacgaggtgtgggtcagtgccgtccgtgggacc
aacgaaagccagtccacgaccacccagttcaccacagagattgacgcccccaagaatctg
cgggtgggctcacacacggcgaccagccttgacctcgagtgggacaacagtgaagcagag
gtccaggagtacaaggtcgtgtatagcaccctggcgggagagcagtaccacgagctgctg
gtccccaagagcatcggtccaaccaccagagccaccctcacagatctggtgcctggcact
gagtatggagttggaatatctgctgtcatgaactcacagcaaagtgtaccagctaccatg
aatgccaggactgaacttgacagtccccgagacctcatggtgacggcctcctcagaaacc
tccatctccctcatctggaccaaggccagtggccccattgatcattaccgaattaccttt
accccatcctctggaattgcttcagaagtcactgtgcccaaggacaggacctcgtacaca
ctgacagatctagagccaggagcagagtatatcatttcgatcacggcggagaggggtcgg
cagcaaagcttcgagtccactgtggatgccttcacaggcttccgccccatctcccatttg
cacttttctcacgtgacctcctccagcgtaaacataacctggagtgacccatcccctcca
gcagacagactcattctgaactacagccctcgggatgaagaggaagagatgacagaggtc
tccctggatgctaccaagaggcatactgtcctgatgggtctgcagccagccactgagtac
atcgtgaacctggtagctgtccatggcacggtgacctctgagcccatcgtgggctccatc
accacaggaattgatccccccaaagacattacaattagcaatgtgaccaaggactcagtt
atggtctcctggagccctcctgttgcatcttttgattactaccgagtatcctatcggcca
acacaagtgggacggctggacagctcagtggtgcccaacacggtgacagaattcaccatc
accaagctatatccagccactgaatatgaaatcagtctcaacagtgtgaggggcagagag
gagagtgagcgcgtctgtaccctcattcacacagccatggacaaccccgtggatctgact
gctaccaacatcactccaacagaagccctgctgcagtggaaggcaccagtgggtggagtg
gagaactacgtcattattctcacacactttgcagttgctggagagaccatcctggttgat
ggaggcagtgaggaattccagcttgctggcctgcttcctagcacccactataccgtcaat
atgtatgccaccagtgggcctctcaccagtggcaccgtcagcaccaacttctctaccctc
ctggaccctcctgcaaacctgacagccagtgaagtcaccagacaaagtgccctgatctcc
tggcagcctcccagagccgagattgaaaactacatcttaacctacaaatccactgatgga
agccgcaaggagctgattgtggatgcagaggacacgtggatccgactggagggcctgtct
gagagcacagactacacagtgctcctgcaggcagcccaggacacctcccggagcagcctc
acctctgccaccttcaccacagggggccgggtgtttcctcatcctcaagactgtgcccag
catttgatgaacggagacactctgagtggggtttacaccatcttcctcaatggggagcta
agccggaagttacaggtgtactgtgacatgaccaccgacgggggtggctggattgtgttc
cagagacggcagaatggccaaactgattttttccggaagtgggctgaataccgtgtcggc
ttcgggaacctggaggatgagttttggctggggctggacaacatacacaggatcacatcc
cagggccgctacgagctgcgcgtggatatgcgggacggacaggaggctgccttcgcctac
tacgacaagttctcggtggaggacggcagaaatctgtacaagctgcgcctagggggctac
aatggcactgcaggggactccctcagctatcatcagggacgccctttctccacagaggac
agagacaatgatgttgcggttaccaactgcgccatgtcctacaagggggcctggtggtat
aagaactgccaccggaccaacctcaacgggaaatacggagagtccaggcacagtcagggg
atcaactggtaccattggaaaggccacgagttctccatcccttttgtggagatgaagatg
cgtccctacaagcacggtctcacggcggggaggacacggcggtccctgcggttctga

KEGG   Ursus arctos horribilis: 113263032
Entry
113263032         CDS       T05909                                 

Gene name
IBSP
Definition
(RefSeq) bone sialoprotein 2
  KO
K06253  integrin binding sialoprotein
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113263032 (IBSP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113263032 (IBSP)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113263032 (IBSP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113263032 (IBSP)
SSDB
Motif
Pfam: BSP_II Utp14 Pox_Ag35 Mpp10
Other DBs
NCBI-GeneID: 113263032
NCBI-ProteinID: XP_026365388
UniProt: A0A3Q7WUE4
LinkDB
Position
Unknown
AA seq 320 aa
MKTALILLSILGMACAFSMKNLHRRAKLEDSEENGVFKYRPRYSLYKHAYFYPPLKRFPV
QSSSDSSEEDGNGDSSEEEEEEEETSNEEENNEENANSDENEDEESEAENTTLTATPGYG
EETTPGTGYIGLAAIQLPKKSGDIGHKATKEEESDEEEEEDEENEENEAEVDENGQGING
TSSNSTEAENGNGSSDGDNGEGEEESVTEAHAEGTTVAGKQDNGGSKTTTSPDGGYERTT
PAPELYGTTTRPSGEATPTGYEGEYEQTGTNEYDNGYEVYENENGEPRGDNYRAYEDEYS
YYKGRSYDGYDGQDYYYRHQ
NT seq 963 nt   +upstreamnt  +downstreamnt
atgaagactgctttaattttgctcagcattttgggaatggcctgtgctttctcaatgaaa
aacttgcatcgaagagccaaattagaggattctgaagaaaatggggtctttaagtaccgg
ccacgatattctctttacaagcatgcctacttttatcctcctctaaaacgatttccagtt
cagagcagtagtgactcatctgaagaagatggaaatggtgacagctcagaagaggaggaa
gaagaagaggagacttcaaatgaagaagaaaacaatgaagagaatgcaaattctgatgaa
aatgaagatgaagagtctgaggctgagaacaccacccttactgccacacctggctatgga
gaggagaccacacctggaacaggttatataggtctagctgcaatccaacttcccaagaag
tctggggatataggacacaaggctacaaaagaggaggaaagtgatgaagaagaagaggaa
gatgaagaaaatgaagaaaatgaagcagaagtggatgaaaatgggcaaggcataaacggc
accagcagcaacagcacagaggcagaaaatggcaatggcagcagtgacggagacaatgga
gaaggtgaagaagaaagtgtcactgaagcccatgcagaaggaaccacagtggctggaaag
caggacaatggtggctctaagacaacaacctctccagatggtgggtatgaacgtacgact
ccagcaccagagctctatgggactaccacccggccatctggggaagccacccccaccgga
tatgagggagagtatgaacaaacaggcactaatgaatacgacaatggatatgaagtctat
gaaaatgagaatggggaacctcgtggggataattaccgagcctatgaggatgagtacagc
tactataaagggcgcagctacgatggctatgatggtcaagattactactatcgccatcag
tga

KEGG   Ursus arctos horribilis: 113263166
Entry
113263166         CDS       T05909                                 

Gene name
SPP1
Definition
(RefSeq) osteopontin isoform X1
  KO
K06250  secreted phosphoprotein 1
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04371  Apelin signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04620  Toll-like receptor signaling pathway
uah04929  GnRH secretion
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04371 Apelin signaling pathway
    113263166 (SPP1)
   04151 PI3K-Akt signaling pathway
    113263166 (SPP1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113263166 (SPP1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113263166 (SPP1)
 09150 Organismal Systems
  09151 Immune system
   04620 Toll-like receptor signaling pathway
    113263166 (SPP1)
  09152 Endocrine system
   04929 GnRH secretion
    113263166 (SPP1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113263166 (SPP1)
SSDB
Motif
Pfam: Osteopontin LBR_tudor Tudor_3
Other DBs
NCBI-GeneID: 113263166
NCBI-ProteinID: XP_026365541
UniProt: A0A3Q7VWW9
LinkDB
Position
Unknown
AA seq 298 aa
MRIAVICFCLLGVAFAIPIKQPDSGSSEEKQLYHKYPGAVATWLKPDPSQKQTFLALQNT
VLSQETDDFQQKTLSSKSDESHDDVDEDDGDDVDSQDSVDSNDVDDNSNQSDESDELVTD
FPTDFPATQFFTPAVPTRDSNDGRGDSVAYGLRSRSKKSHRYEVQYPDSTEEDLTSLVKS
GSMEDDFNAVLLSQTVRGTSDGDSHAKDSQETSQLDDHSMETKSRRHTREYKLRASDESD
RHSHEIDSQENSEVSSELVSQIVQSHEKGLVLDSKSEEDKHLKFHTSHELDSASSEVN
NT seq 897 nt   +upstreamnt  +downstreamnt
atgagaattgcagtgatttgcttttgcctcttgggcgttgccttcgccattcccattaag
cagcctgattccgggagctctgaggaaaagcagctttaccacaaatacccaggtgctgta
gctacttggctaaagcctgacccatctcagaagcagactttcctagcactacagaacact
gtgctctctcaagaaactgatgacttccaacaaaagaccctctcaagcaagtccgatgaa
agccacgatgatgtggatgaagatgacggagacgatgtggatagccaggattccgttgat
tctaatgacgtagatgacaactctaaccagtctgatgaatctgatgaactggtcacggat
tttcccacggactttccagcaacccaattcttcactccagctgtccccacaagagactca
aatgatggccgaggtgatagtgtggcttatggactgaggtccagatctaagaagtcccac
agatatgaagtccagtatcctgattctacagaggaggacttgacatcacttgtgaaaagt
gggagtatggaagatgacttcaatgccgtccttctttcccagacagtgcgggggacttcc
gacggggacagccatgcgaaagacagtcaggaaacaagtcagctggatgaccacagtatg
gaaaccaagagccgcaggcacaccagagagtataagctgagagcaagtgatgagagcgat
aggcattcccatgagattgatagtcaggaaaactctgaagtcagcagtgaacttgtcagc
caaatagttcaaagccatgaaaaggggcttgtcctagattccaagagtgaggaagataaa
cacctgaaatttcacacttctcatgaattagatagtgcatcctcggaggtcaattaa

KEGG   Ursus arctos horribilis: 113263194
Entry
113263194         CDS       T05909                                 

Gene name
COL6A1
Definition
(RefSeq) collagen alpha-1(VI) chain
  KO
K06238  collagen, type VI, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113263194 (COL6A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113263194 (COL6A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113263194 (COL6A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113263194 (COL6A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113263194 (COL6A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113263194 (COL6A1)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113263194 (COL6A1)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113263194 (COL6A1)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113263194 (COL6A1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   113263194 (COL6A1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3
Other DBs
NCBI-GeneID: 113263194
NCBI-ProteinID: XP_026365914
UniProt: A0A3Q7XHQ6
LinkDB
Position
Unknown
AA seq 1030 aa
MKLARALLPLLLQACWATAQDVSGNARAVAFQDCPVDLFFVLDTSESVALRLKPYGALVD
KVKAFTKRFIDGLRDRYYRCDRNLVWNAGALHYSDEVEIISPLRPMPSDRDALKASVDAV
KYFGKGTYTDCAIKKGLEELLVGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAK
HLGVKVFSVAITPDHLEPRLSIIATDHMYRRNFTAADWGQTRDAEEIIAQTIDTIIDMIK
NNVEQVPRLTFLFDLQPARGPPGPRGDPGYEGERGKPGLPGEKGEAGEPGRPGDLGPIGY
QGMKGEKGSRGEKGSRGAKGYKGEKGKRGIDGVDGMKGETGYPGLPGCKGSPGFDGAQGP
PGPKGDAGAFGRKGEKGAPGADGEPGRPGNTGPPGDEGEPGLPGPPGEKGEAGDEGNPGP
DGAPGERGGPGERGPRGTPGVRGPRGDPGEAGPQGDQGREGPVGIPGDPGEAGPAGPKGY
RGDEGPPGPEGSRGAPGPAGPPGDPGLMGERGEDGPPGNGTEGFPGFPGYPGSRGPPGIN
GTKGYPGLKGDEGEAGDPGEDNNDIAPPGVKGAKGYRGPEGPQGPPGHTGPPGPDECEIL
DIIMKMCSCCECKCGPIDILFVLDSSESIGLQNFEIAKDFVVKVLDRLSRDELVKFEPGH
SHAGVVQYSHNQMQEHVDLRDANIRNVQELKEAVKKLRWMAGGTFTGEALQYTRNQLLPP
TQNNRIALVITDGRSDTQRDTTPLSVLCGPDIQVVSVGIKDMFGYVAGSDQLNVISCQGL
APQGRPGISLVKENYAELLEDNFLKNVTTQICIDKKCPDYTCPITFSSPADITILLDGSA
SVGSHNFDTTKRFAKRLAERFLTAGRTDPSHDVRVAVVQYSGPGQQRPERGSLQFLQNYT
MLASTVDGMDFFNDATDVNDALSYVTRFYREASSREAKKRLLLFSDGNSQGATAAAIEKA
VQEAQRADIEIFVVVVGRQVNEPHIRVLVTGKTAEYDVAFGERHLFRVPSYQALLRGVFY
QTVSRKVALG
NT seq 3093 nt   +upstreamnt  +downstreamnt
atgaagctggcccgcgctctgctccccctgctgctgcaggcctgctgggccaccgctcag
gacgtctcgggcaacgccagggctgtggccttccaggactgccctgtggacctcttcttc
gtgctggacacctcggagagcgtggccttgaggctgaagccctacggagccctggtggac
aaggtcaaggccttcaccaagcgcttcatcgatggcctgagagacaggtactaccgctgt
gaccgcaacctggtgtggaacgcgggcgcgctgcactacagcgacgaggtcgagatcatc
agcccgctcaggcccatgcccagcgaccgcgacgcgctcaaggccagtgtggacgcggtc
aagtacttcggcaagggcacctacacggactgcgccatcaagaaggggctggaggagctg
cttgtggggggctcccacctgaaggagaacaagtacctgattgtggtgaccgatgggcac
cccctggagggctacaaggagccgtgtgggggcctggaggacgccgtcaacgaggccaag
cacctgggcgtcaaagtcttctcggtggccatcacccccgaccacctggagccacgtctg
agcatcatagccacggaccacatgtaccgacggaacttcaccgcggctgactgggggcag
acccgggacgcggaggagattatcgcccagaccatcgacaccatcattgacatgatcaaa
aacaatgtggaacaagtgcccaggctgacctttctctttgatttgcagcctgccagagga
cctcccgggccacggggagaccctgggtacgagggagaacgtgggaagccaggtctccca
ggagagaaaggagaagccggagaacccggcaggcccggggacctcggacccatcggctac
caggggatgaagggagaaaaagggagccgaggggagaagggctccaggggagccaagggc
tacaagggcgagaagggcaagcgtggcatcgatggtgtggacggcatgaagggcgagacg
gggtaccccggcctgccaggctgcaagggctcgcccggattcgacggcgctcaaggaccc
cccgggcccaagggtgacgcgggtgccttcggacggaaaggagagaagggtgccccggga
gctgacggggagcccgggaggccagggaacacggggcccccgggagatgagggcgagccg
ggcttgcctggtcccccaggagagaagggagaagccggtgacgagggaaacccgggaccg
gacggtgcccccggcgagaggggcggccctggggaaagaggaccacgggggaccccaggt
gtgcggggcccacggggagacccgggcgaagctggaccccaaggtgaccagggacgagaa
ggccctgtcggcatccctggagacccgggagaggctggccccgctggacctaaaggttac
cgaggtgatgaggggcccccggggcctgagggttccagaggagctccagggcccgcagga
ccccccggagaccccgggctgatgggtgaaaggggggaagacggtccacccggaaacggc
accgagggcttccctggcttccctggctatccaggcagcaggggcccgcccgggataaac
ggcactaaaggctacccaggcctcaaaggggatgagggagaagccggagaccctggagag
gacaacaatgacatcgcacccccaggtgtcaaaggagcaaaggggtaccggggacccgaa
ggcccccagggacccccaggacacacaggaccgccggggccagatgaatgtgagatcttg
gacatcatcatgaaaatgtgctcttgctgtgagtgcaagtgtggccccatcgacatcctc
tttgtgctggacagctcggagagcattggcctgcagaacttcgagatagccaaggacttt
gtcgtcaaggtcctcgaccggctgagcagagacgagctggtcaagtttgagcccggccac
tcgcacgcgggcgtggtgcagtacagccacaaccagatgcaggagcacgtggatctgagg
gacgccaacatcaggaacgtccaggagctcaaggaagccgtcaagaagctgcggtggatg
gcggggggcacgttcacgggagaggctctgcagtacacccggaaccagctgctgcccccc
acccagaacaaccggattgcgctcgtcatcaccgacggccgctctgacacccagagggac
accacgccgctcagcgtgctctgtggcccggacatccaggtggtctccgtgggcattaag
gacatgttcggctatgtcgcgggctctgaccagctcaacgtcatctcctgccaaggcctg
gcaccccagggccggccaggcatctcgctggtcaaggagaactacgcggagctgctggag
gacaacttcctaaagaacgtcaccacccagatctgtatagacaagaaatgtccagattac
acctgcccaatcacgttctcctccccggccgacatcaccatcctgctggacggctcggcc
agcgtgggcagccacaacttcgacaccaccaagcgcttcgccaagcggctggccgagcgc
ttcctgacagcgggcaggaccgacccgtcccatgacgtccgcgtggcggtggtgcagtac
agcggccccgggcagcagcggccggagcgaggctcactgcagtttctgcagaactacacc
atgctggccagcaccgtggacggcatggacttcttcaacgatgccaccgacgtcaacgac
gccctcagctacgtgacgcgcttctaccgtgaggcctcgtcccgcgaagccaagaagagg
ctgctgctcttctccgacggcaactcccagggggccacggcggcggccatcgagaaggcg
gtgcaggaggcccagcgggcggacatcgagatcttcgtggtggtggtgggccgccaggtg
aacgagccccacatccgcgtcctggtcacgggcaagacggccgagtacgacgtggccttt
ggcgagcgccacctgttccgcgtgcccagctaccaggccctgctgcgcggcgtcttctac
cagaccgtgtccagaaaggtggcgctgggctag

KEGG   Ursus arctos horribilis: 113264714
Entry
113264714         CDS       T05909                                 

Gene name
LAMA4
Definition
(RefSeq) laminin subunit alpha-4 isoform X1
  KO
K06241  laminin, alpha 4
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05143  African trypanosomiasis
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113264714 (LAMA4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113264714 (LAMA4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113264714 (LAMA4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113264714 (LAMA4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113264714 (LAMA4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113264714 (LAMA4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113264714 (LAMA4)
   05145 Toxoplasmosis
    113264714 (LAMA4)
   05143 African trypanosomiasis
    113264714 (LAMA4)
SSDB
Motif
Pfam: Laminin_G_2 Laminin_G_1 Laminin_I Laminin_II Laminin_EGF Laminin_G_3
Other DBs
NCBI-GeneID: 113264714
NCBI-ProteinID: XP_026367398
UniProt: A0A3Q7WZN6
LinkDB
Position
Unknown
AA seq 1817 aa
MAVSSARCSVLPLWLLWGAVCAAAAAGTDNAFPFDIEGSSAVGRQDPPETSEPRVAPGRL
PPAAKKCGAGFFLAQSGECMPCGCNGNSDECLDGSGFCMHCQRNTTGEHCEKCLDGYIGD
SIRGPPRFCQPCPCPLPHVANFAESCYRKNGAVRCICKENYAGPNCERCAPGYYGNPLLI
GSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARVAKNCAV
CNCGGGPCDSVTGECLEEGFEPPTGCDKCIWDLIDDLRLAALLIEESTSGLLSVSSGAAA
HRHVNELNSTIYLLKTKLSERENQYVLRKIQINNAESTMKSFLSDVEELAERESQVSRRG
KLAQKESMDTINHATQLAEQAHDMRDKIQEINNKMLYYGEEQELTSEEISEKLVLAQKML
EEIRRRQPFLTQRELVDEEADEAHELLSQAESWQRLYNDTRALFPVVLEQLDDYSAKLSD
LQESLDQALDHVRDAEDMNRAIAARQRDHEKQHERVREQMEGVNGSLKTSLDSLTTPRLT
LSELDDTIKNASGIYAEIDGAKNELQGKLSNLSNLRHDLVQEAVDHAQNLQQEADELSRN
LHSSDMNGLVQKALDASNVYENIANYVSEANETAELALNITDRIYDAVSGIDTQIIYHKD
ESENLLNQARELQAKADPGNDEAVADTNRRVDGALARKRALQNRLNDAIKRLQATERGDA
QQRLDQSKLITEEASKTTVGVQQAAAPMASNLTNWSQNLQSFDSSAYNTAVDSARDAVRN
LTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDGQSAVEVH
PKTSMDDLKTFTSLSLYMKPPPVKQPELAGATDRFVLYLGSKHAKKEYMGLAIKNDNLVY
IYNLGAKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKFIKKGEF
AGDDSLLDLDPEDTVFYVGGVPSNFKLPASLNLPGFVGCLELATLNNDVISLYNFKHIYN
MDPSKSVPCARDKLAFTQSRAASYFFDGSSYAIVRDITRRGKFGQVTRFDIEVRTPADNG
LVLLMVNGSMFFSLEMRNGYLHVFYDFGFSNGPVHLEDTSKRAQINDAKYHEISIIYHND
KKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRTLRAHLPLDISFRGCMKGFQ
FQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASVQKISFFDGFEGGFNFRTL
QPNGLLFYYASGSDVFSISLDNGTVVMDVKGIKVQSADKQYNDGLSHFVITSVSPARYEL
IVDKSRLGSKNPTKGKVEQTQAGEKKFYFGGSPISPQYANFTGCISNAYFTRLDRDVEVE
DFQRYLEKVHTSLYECPIESSPLFLLYKKGKNSSKPKTSQNKKGEKSKDAPAGDPAGLKF
PERNVPRDSYCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGEKSQFSIRLKTRSSHG
MIFYVSDQEENDFMTLFLAHGRLVFMFNVGHKKLKIRSQEKYNDGLWHDVIFIREKSSGR
LIIDGLRVLEESLPPTGAAWNIKGPIYLGGVAPGRAVKNVQINSVYSFSGCLGNLQLNGA
SITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRSSSGTLV
HGHSVNGEYLNVHMKKGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDSNVVQLD
VDSEVNHVVGPLNPKPVDHREPVFVGGVPESLLTPRLAPGRPFTGCIRHFVIDGRPVSFS
KAALVSGAVSINSCPAA
NT seq 5454 nt   +upstreamnt  +downstreamnt
atggctgtgagctcagcccgctgctccgtcctgccgctgtggctcctctggggcgctgtc
tgcgccgccgcggcggccggcaccgacaacgcttttccttttgacattgaagggagctca
gcggtcggcagacaagacccgcccgagacgagcgagccccgcgtggctcccggaaggctg
ccgcctgctgccaagaaatgtggtgctggattctttctcgcccagtcaggagaatgtatg
ccctgtggctgcaatggtaactctgacgagtgcttggatggctctggattctgtatgcac
tgccagcggaacacgacaggagagcactgtgagaaatgtctagatggttatatcggagat
tccatcagaggaccaccccggttttgccagccatgtccctgtcccctaccccatgtggcc
aattttgccgaatcctgctatagaaaaaatggagctgttcggtgtatctgtaaagaaaac
tatgctggacctaactgtgaaagatgtgcccctggttactatggaaaccccttgctgatt
ggaagcacctgtaagaaatgtgactgcagtggaaattcagaccccaacctgatctttgaa
gattgcgatgaggtcaccggccagtgtaggaactgcctacgcaacaccacgggattcaag
tgtgaacgctgtgcccccggctactatggggatgccagggtagccaagaattgtgcagtg
tgcaattgtgggggaggcccatgtgacagcgtaaccggagaatgcttggaagaaggtttt
gaaccccctacaggctgtgataagtgcatctgggatctgattgatgaccttcgattagcg
gcactcttgattgaagaaagcacatctggtctgttgagtgtctcgtctggtgcggctgct
cataggcatgtgaatgaactcaattccaccatctacctcctcaaaacaaaattgtcagaa
agagaaaaccagtatgtcctaagaaagatacaaatcaacaatgctgagagcacaatgaaa
agctttctgtctgacgtggaggaattagctgaaagggaaagtcaagtctcaagaaggggc
aagctggctcagaaggaaagcatggataccattaaccatgcaactcagctcgcggagcaa
gctcacgatatgagggataaaattcaagagatcaacaacaagatgctctattatggggaa
gagcaggaacttacctctgaggaaatctctgagaagctggtgttggcccagaagatgctt
gaggagatcagacgccgccaaccatttctcacacagagggagctagtggacgaggaagca
gatgaagcccatgaactgctgagccaggctgagagctggcagcgcctctacaatgatact
cgcgctctgtttcctgttgtcctggagcagctggacgactacagtgctaagttgtcagac
ctgcaggagtcactggaccaggcccttgaccatgtcagggatgccgaagacatgaacaga
gccatagcagccaggcagcgggaccatgagaaacaacatgagagagtgagggaacaaatg
gaaggggtgaacggttctctgaagacgtctttggattctctgacaacacctcgtctgacc
ctttcagagcttgatgatacaattaagaatgcatcagggatttatgcggaaatagatgga
gccaaaaatgaactacaaggaaaactatccaacctaagtaaccttcgtcatgatttagtc
caagaagctgttgaccatgcacagaatcttcaacaagaggctgatgaactgagcaggaat
ttgcacagttcagatatgaatgggctggttcagaaggctttggatgcttcaaatgtctat
gaaaatattgccaattatgttagtgaggccaatgaaacggcagaattggctttgaacatc
actgaccgaatttatgatgctgtgagtgggattgatactcaaatcatttaccataaggat
gagagtgagaacctcctcaatcaagccagagaactgcaagccaaggcagatcctggcaat
gatgaggcagtagctgacacaaacaggcgtgtggatggagccctagcaaggaagagggct
ctccaaaacagattgaacgatgccattaagcgactgcaagccacagagagaggtgacgcg
cagcagcgcctggatcagtcgaagctgatcaccgaggaggctagcaagaccacagtggga
gtccagcaggcagctgcgccgatggccagcaatctaaccaactggtcgcaaaatctgcag
agttttgactcttctgcttacaacactgcagtggactctgcccgagatgcagtaagaaat
ctgacagaggttgtccctcagctcctggatcagctccgtaccgtggaacagaagcggcct
gcgagcaacgtctctgccagcatccagaggatccgggagcttatcgctcagaccagaagc
gtggccagcaagatccaagtctccatgatgtttgacggccagtcggcagtcgaggtgcac
cccaaaaccagtatggatgacttaaagaccttcacatccctgagcctgtacatgaaaccc
cctcccgtgaaacagccggagctggccggggctacagaccggtttgtcctgtacctcgga
agcaaacacgccaaaaaagaatacatgggtcttgcaatcaaaaatgataacctggtgtac
atttacaatctgggagctaaggatgtagagattcctctggactccaaacccgtcagttcc
tggcctgcttacttcagcattgtcaagattgaaagggtaggaaaacatggaaaggtgttt
ttgacagtcccgagtctaagtagcactgcagaggaaaagtttattaaaaagggagaattt
gcgggagatgactccttgttggatctggaccctgaggacactgtgttttatgttggtggt
gtgccttcaaacttcaagctccctgcaagcttaaacctgcctggctttgttggctgcctg
gaattggccactttgaataatgatgtgatcagcttgtataattttaagcacatctataac
atggatccctccaagtcagtaccctgtgccagagataaactggccttcactcagagtcgg
gctgccagctatttcttcgatggctctagttatgccatagtaagggatatcacaaggaga
gggaaattcggtcaggtgactcgctttgacatagaagttcgaacaccagctgacaatggc
ctcgtgctcctgatggtcaatggaagtatgtttttcagcttggaaatgcgcaatggttac
ctgcatgtgttctatgactttggatttagcaacggccctgtgcatctggaggacacatca
aagagagctcaaattaatgatgcaaaataccatgagatctcaatcatttaccacaacgat
aagaaaatgattttggtggttgacagacgacatgtcaagagcatggacaatgaaaagatg
aagataccttttacagacatatacattggaggggctcccccagaaattttacaatccagg
accctcagggcacaccttcccctagatatcagcttcagaggatgcatgaagggtttccag
ttccaaaagaaagatttcaacttattagagcagacagaaaccctgggagttggttatgga
tgcccagaagactctctcatatctcgcagagcgtatttcaatgggcagagtttcattgct
tcagttcagaaaatctctttcttcgacggctttgaaggaggttttaatttccgaacgtta
cagccaaatgggttactattctattatgcttcaggatcggatgtgttctccatttcgttg
gataacggcaccgtcgtcatggatgtcaagggaatcaaggtgcagtcagccgataagcag
tacaatgacgggctgtcccacttcgtcattacctctgtgtcacccgcaagatatgaactg
atagtagataagagcagacttgggagtaagaaccctaccaaagggaaagtggaacagaca
caagcaggtgaaaagaagttttacttcggtggctcacccatcagtccccagtatgctaat
ttcactggctgtataagtaacgcctactttaccaggttggatagagatgtggaggttgaa
gatttccagcgatatttggaaaaggtccacacctctctttatgagtgtcctattgagtct
tcgccgttgttcctcctctacaaaaaagggaaaaattcctcaaagcctaaaacaagtcag
aataaaaagggagagaaaagcaaagatgcccctgcaggggaccctgccggcctgaagttc
ccagagaggaatgttccaagggattcttactgccacctttccaacagtcctagagcaata
gaacacgcctatcaatatggagggactgccaacagccgccaagagtttgagcacttaaaa
ggagattttggtgaaaagtctcagttctccattcgtctgaaaactcgttcctcccatgga
atgattttctatgtctcagatcaagaagagaatgacttcatgactctattcttagcccac
ggccgcttggttttcatgtttaatgttggccacaagaaactgaagattagaagccaagag
aaatacaatgatggattgtggcatgatgtgatatttattcgggaaaagagcagtggccga
ctgatcattgatggacttcgagtcctagaagagagtcttccccctaccggagctgcctgg
aacatcaagggtcctatttatttgggaggtgtggctcctggaagggctgtgaaaaatgtc
cagatcaactcggtctacagtttcagtggctgtctcggcaatcttcagctcaacggggcc
tccatcacctctgcttctcagacgtttagcgtgaccccttgttttgaaggtccgatggag
acagggacatacttttcaacagaaggaggatacgtggttctagatgagtctttcaatatt
ggattgaagtttgagattgcatttgaagtccgtcccagaagcagttccggaacccttgtt
catggccacagtgtcaacggagagtacctcaatgttcacatgaagaaggggcaggtcata
gtgaaagtcaacaatgggatcagggacttctccacctcagtaacacccaagcagagtctc
tgtgatggcagatggcacagaatcacagttattagagattccaatgtggttcagttggac
gtagactctgaagtgaaccatgtggttggacccctgaatccaaaaccagttgatcacagg
gagcctgtattcgttggaggggttccagagtctctactgacaccacgcttggcccccggc
agacccttcacaggctgcatccgtcactttgtgattgacgggcgccctgtgagcttcagc
aaagcagccctggtcagcggagccgtgagcatcaactcctgtcccgcagcctga

KEGG   Ursus arctos horribilis: 113265614
Entry
113265614         CDS       T05909                                 

Gene name
COL1A1
Definition
(RefSeq) collagen alpha-1(I) chain isoform X1
  KO
K06236  collagen, type I, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04611  Platelet activation
uah04926  Relaxin signaling pathway
uah04933  AGE-RAGE signaling pathway in diabetic complications
uah04974  Protein digestion and absorption
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113265614 (COL1A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113265614 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113265614 (COL1A1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    113265614 (COL1A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    113265614 (COL1A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    113265614 (COL1A1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    113265614 (COL1A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    113265614 (COL1A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113265614 (COL1A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113265614 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113265614 (COL1A1)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113265614 (COL1A1)
SSDB
Motif
Pfam: Collagen COLFI VWC TILa
Other DBs
NCBI-GeneID: 113265614
NCBI-ProteinID: XP_026368913
UniProt: A0A3Q7X3Q3
LinkDB
Position
Unknown
AA seq 1461 aa
MFSFVDLRLLLLVAATALLTHGQEEGQEEEDIPPVTCVQNGLRYYDRDVWKPEACRICVC
DNGNVLCDDVICDETKNCPGAQVPPGECCPVCPDGEASPTDQETTGVEGPKGDTGPRGPR
GPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQMSYGYDEKSTGGISVPGPMGP
SGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGE
RGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGP
RGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGARGS
EGPQGVRGEPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGP
SGPPGPKGNSGEPGAPGNKGDTGAKGEPGPTGIQGPPGPAGEEGKRGARGEPGPTGLPGP
PGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGS
PGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGP
PGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGV
PGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGA
PGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGE
AGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGP
AGPTGPPGPIGNVGAPGPKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGK
EGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGL
PGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGSPGAEGS
PGRDGSPGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPIGPVGA
RGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGP
PGSAGSPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSGGFDFSFLPQ
PPQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLK
MCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPHVAQKNWYISKNPKEKRH
VWYGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGN
LKKALLLQGSNEIEIRAEGNSRFTYSVTYDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVA
PLDIGAPDQEFGMDVGPVCFL
NT seq 4386 nt   +upstreamnt  +downstreamnt
atgttcagctttgtggacctccggctcctgctcctcgtagcggccaccgccctcctgacg
cacggccaagaggagggccaagaagaagaagacatcccaccagtcacctgcgtacagaac
ggcctcaggtactatgaccgagacgtatggaaacccgaggcctgccggatctgtgtctgc
gacaacggcaacgtgttgtgcgatgacgtgatctgcgacgaaaccaagaactgtcccggc
gcccaagtccccccgggcgagtgctgccccgtctgccccgacggcgaggcgtcacctacc
gaccaagaaaccacaggagtggagggacccaagggagacactggcccccgaggtccaagg
ggacctgccggcccccctggccgagatggcatccccggccagcctggacttcccggcccc
cccggacctcccggcccccccggacctcctggcctcggaggaaactttgctccccaaatg
tcttacggctatgatgagaaatcaactggaggaatctccgtgcctggccccatgggtcct
tctggtcctcgtggtctccctggcccccctggcgcacctggtccccaaggtttccaaggc
ccccctggtgagcctggcgagcctggagcctcaggtcccatgggtccccgtggtccccct
ggcccccctggcaagaacggagatgatggtgaagctggaaagcctggtcgtcctggtgag
cgtgggcctcctggacctcagggtgctcggggattgcctggaacagctggtcttcctgga
atgaagggacacagaggtttcagtggtttggatggtgccaagggagatgctggtcctgct
ggtcccaagggtgagcccggtagccctggtgaaaatggagctcctggtcagatgggcccc
cgtggtctgcctggtgagagaggtcgccctggagcccctggccctgctggtgctcgtgga
aacgatggtgctactggtgctgctgggccccctggtcccactggccccgctggtcctcct
ggcttccctggtgctgttggtgctaagggtgaagctggtccccaaggagcccgtggctct
gaaggtccccagggtgtgcgtggtgagcccggcccccctggccctgctggtgctgctggc
cctgctggaaaccctggtgctgacggacagcctggtgctaaaggtgctaatggcgctcct
ggcattgctggcgctcccggcttccccggtgcccgaggcccttctggaccccagggcccc
agcggtcctcctggccccaagggtaacagcggtgaacccggtgctcccggcaacaaagga
gacactggcgccaagggagagcccggccccactggtattcaaggcccccctggccctgct
ggggaagaaggaaagcgaggagcccgaggtgaacccggacccactggcctgcctggaccc
cccggcgagcgtggcggacctggtagccgtggtttccctggtgcagatggtgtcgctggt
cccaagggtcccgctggtgaacgtggctctcctggccccgctggccccaaaggttcccct
ggtgaagcgggtcgtcccggtgaagctggtctgcctggtgccaagggtctgactggaagt
cctggcagccccggtccagatggcaaaactggcccccctggtcccgctggtcaagatggt
cgccccggccccccaggcccccccggtgcccgtggtcaggctggcgtgatgggattccct
gggcctaaaggtgctgctggagagcctggcaaggctggagagcgaggtgtgcctggcccc
cctggtgctgttggtcctgctggcaaagacggagaagctggggctcagggaccccctggc
cctgctggccccgctggcgagagaggcgaacaaggcccagctggctcccccggattccag
ggtctccctggccccgctggtcctcccggtgaagcaggcaaacccggtgaacagggtgtt
cctggcgaccttggtgcccctggcccctccggagcaagaggcgagagaggtttccccggt
gaacgtggtgtgcaaggtccccccggccccgcaggtccccgtggagccaacggtgcccct
ggcaatgatggtgctaagggtgatgctggtgcccccggagccccaggtagccagggcgct
cctggccttcagggaatgcctggtgaacgaggcgcagctgggcttcccggccctaagggt
gacagaggcgatgctggtcccaaaggtgctgacggttctcctggcaaagatggtgtccgt
ggtctgactggacccattggtcctcctggccccgccggtgcccctggtgacaagggtgaa
gctggtcctagcggccctgctggtcccactggagctcgtggtgcccccggagaccgtggt
gagcctggtccccctggccctgctggcttcgctggcccccctggtgctgatggccaaccc
ggtgctaaaggcgaacccggtgatgctggtgctaaaggcgacgctggtccccctggcccc
gctggacccactggaccccctggccccattggtaacgttggtgctcctggacccaaaggt
gctcgcggcagtgccggtccccctggtgctactggtttccctggtgctgctggccgagtc
ggtccccccggcccctctggaaatgctggaccccctggcccccctggccctgctggcaaa
gaaggcggcaaaggcccccgtggtgagaccggccccgctggacgtcctggtgaagtcggt
ccccctggtccccctggccccgctggcgagaaaggatcccctggtgctgatggacctgct
ggtgctcccggcactcctggacctcaaggcattgctggacagcgtggtgtggtcggcctg
cccggtcagcgaggagaaagaggcttccccggtcttcccggcccctctggtgaacctggc
aagcaaggtccttccggagcaagtggggagcgtggcccccctggtcccatgggcccccct
ggattggctggaccccctggcgagtctggacgtgagggatctcctggtgctgaaggctcc
cctggacgagatggttctcccggccccaagggtgaccgtggtgagaccggccctgctgga
ccccctggtgcccctggtgctcctggtgcccctggccctgttggccccgctggcaagagc
ggcgaccgtggtgagactggtcctgctggtcctgctggcccgatcggccccgttggtgct
cgtggtcccgctggaccccaaggcccccgtggtgacaagggtgagacaggcgaacagggt
gacagaggcataaagggtcaccgtggcttctctggtctccagggtccccctggtcctccc
ggctctcctggtgaacaaggtccttctggagcttctggtcctgctggtccccgaggtccc
cctggctctgccggttctcctggcaaagacggactcaacggtctcccaggccccattggc
ccccctggtcctcgtggtcgtactggtgatgctggccctgttggtccccccggccctcct
ggaccccctggtccccccggtcctcccagcggcggtttcgacttcagcttcctgccccag
ccacctcaagagaaggctcacgatggcggccgctactaccgggccgatgatgccaacgtg
gtccgtgaccgtgacctcgaggtggacaccaccctcaagagcctgagccagcagatcgag
aacatccggagccctgaaggaagccgcaagaaccctgcccgcacctgccgggacctcaag
atgtgccactccgactggaagagcggagaatactggattgaccccaaccaaggatgcaac
ctggatgccatcaaggtcttctgcaacatggagacaggcgagacctgcgtgtaccccact
cagccccatgtggcccagaagaactggtacatcagcaagaaccccaaggaaaagaggcac
gtctggtacggcgagagcatgaccgacggattccagttcgagtacggtggccagggctcc
gatcctgccgatgtggccattcagctgacgttcctgcgcctgatgtccaccgaggcttcc
cagaacatcacctatcactgcaagaacagcgtggcctacatggaccagcagaccggcaac
ctcaagaaggccctgctcctccagggctccaacgagatcgagatccgggccgagggcaac
agccgcttcacctacagtgtcacctacgacggttgcacgagtcacaccggagcctggggc
aagacagtgatcgaatacaaaaccaccaagacctcccgtttgcccatcatcgatgtggca
ccattggacatcggcgccccagaccaggaattcggcatggatgttggccctgtctgcttc
ctgtaa

KEGG   Ursus arctos horribilis: 113265620
Entry
113265620         CDS       T05909                                 

Gene name
CHAD
Definition
(RefSeq) chondroadherin
  KO
K06248  chondroadherin
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113265620 (CHAD)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113265620 (CHAD)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113265620 (CHAD)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113265620 (CHAD)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:uah00535]
    113265620 (CHAD)
Proteoglycans [BR:uah00535]
 Extracellular matrix (ECM) proteoglycans
  Small leucine-rich proteoglycan (SLRP) family
   class IV
    113265620 (CHAD)
SSDB
Motif
Pfam: LRR_8 LRR_4 LRR_5 LRR_1 LRRNT LRR_9 LRRCT
Other DBs
NCBI-GeneID: 113265620
NCBI-ProteinID: XP_026368921
UniProt: A0A3Q7XZ74
LinkDB
Position
Unknown
AA seq 359 aa
MARPLLWLSLGLLAGLLPALAACPQNCHCHGDLQHVICDKVGLQKIPKVSEKTKLLNLQR
NNFPVLAANSFRAMPNLVSLHLQHCQIREVAAGAFRGLKQLIYLYLSHNDIRVLRAGAFD
DLTELTYLYLDHNKVTELPRGLLSPLVNLFILQLNNNKLRELRAGAFQGAKDLRWLYLSE
NALTSLQPGALDDVENLAKFHLDKNQLSSYPAATLSKLRVVEELKLSHNPLKSIPDDAFQ
SFGRYLETLWLDNTNLEKLSDGAFLGVTTLKHVHLENNRLSQLPSNFPFDNLETLTLTNN
PWKCTCQLRGLRRWLEAKTSRPDATCASPAKFKGQHIRDTDAFRGCKFPTKRSKKAGRH
NT seq 1080 nt   +upstreamnt  +downstreamnt
atggcccgtccgctgctctggctcagcctcggcctcctggccggcctgttgccggccctg
gccgcctgcccccagaactgccactgccacggcgacctgcagcacgtcatctgcgacaag
gtggggctgcagaagatccccaaggtgtcagagaagaccaagctgctcaacctgcagcgc
aacaacttcccggtgctggccgccaactcgttccgggccatgccgaacctcgtgtcgctg
cacctgcagcactgtcagatccgggaggtggccgccggcgccttccggggcctcaagcag
ctcatctacctgtacctgtcccacaacgacatccgggtgctgcgcgccggcgccttcgac
gacctgaccgagctcacctacctctacctggaccacaacaaggtgaccgagctgccccgg
gggctgctctccccgctcgtcaacctcttcatcctgcagctcaacaacaacaagctccgc
gagctgcgcgccggggccttccagggcgccaaggacctgcgctggctctacctgtccgaa
aacgcgctcacttcgctgcagcccggcgcgctggacgacgtggagaacctcgccaagttc
cacctggacaagaaccagctgtccagctaccccgcggccactctcagcaagctgcgggtg
gtggaggagctgaagctgtcccacaaccccctgaaaagcattcctgacgatgccttccag
tctttcggcaggtacctggagaccctctggctggacaacaccaacctggagaagctctcg
gacggcgccttcctgggtgtgaccacactgaaacacgtgcatctggagaacaaccgcctg
agccagctaccctccaacttcccttttgacaacctagagaccctcaccctcaccaacaac
ccctggaaatgtacctgccagctccggggacttcggaggtggctggaagccaagacttcc
cgccctgatgccacttgtgcctcgcctgccaagttcaaaggccagcatatccgtgacacg
gacgccttccgcggctgcaagttccccactaagaggtccaagaaagccggccgccattaa

KEGG   Ursus arctos horribilis: 113266339
Entry
113266339         CDS       T05909                                 

Gene name
TNC
Definition
(RefSeq) tenascin isoform X1
  KO
K06252  tenascin
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05165  Human papillomavirus infection
uah05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113266339 (TNC)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113266339 (TNC)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113266339 (TNC)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    113266339 (TNC)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113266339 (TNC)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113266339 (TNC)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113266339 (TNC)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_2 EGF_Tenascin Pur_ac_phosph_N EGF DUF2369 fn3_2 hEGF
Other DBs
NCBI-GeneID: 113266339
NCBI-ProteinID: XP_026369636
UniProt: A0A3Q7W985
LinkDB
Position
Unknown
AA seq 1655 aa
MGAMTRLLAGILLASLTLTTEGGVLKKVIRHKRQSGVNVTLPEEHQPVVFNHVYNIKLPV
GSQCSVDLESASGEKDLAPPSEPSESFQEHTVDGENQIVFTHRINIPRRACGCASAPDVK
ELLSRLEELENLVSSLREQCTTGAGCCLQPAEGRLDTRPFCSGRGNFSTEGCGCVCEPGW
KGPNCSEPECPGNCHLRGQCLDGQCICDEGFTGEDCGQPACPGDCNDQGKCVSGVCVCFE
GYSGADCSQEVCPVACSEEHGRCVDGRCVCQDGFAGEDCNEPLCLNNCHSHGRCVENECV
CDEGFTGEDCGELICPNDCFDRGRCVNGTCHCEQGFTGEDCGQLSCPNACTGRGRCEQGQ
CVCEPGFAGPDCSEKRCPSDCHHHGRCVDGQCECDAGFTGADCGELQCPNGCSGHGRCVN
GQCVCDEGHTGEDCGQLRCPNDCHSRGRCVQGQCVCEAGFQGYDCGDMSCPNDCHQHGRC
VNGMCVCDDGYTGEDCRDLRCPRDCSNRGRCVDGRCECEHGFTGPDCVELACPGDCHGQG
RCVNGQCVCHEGFMGAACKERRCPGDCHGRGRCEDGQCVCQEGFAGPDCRRRSCPNDCSG
WGQCVEGRCICIEGHAGEDCSDVSPPKDLIVTEVTEETVNLAWDNEMRVTEYLVMYTPTH
EDGLEMQFRVPGDQTSTTIRELEPGVEYFIRVFAILENKKSIPVSARVATYLPAPEGLKF
KSIKETSVEVEWDPLDIAFETWEIIFRNMNKEDEGEITKSLRRPETTYRQTGLAPGQEYE
ISLHIVKNNTRGPGLKRVTTTRLDAPSQIEVKDVTDTTALITWFKPLAEIDGVELSYGIK
DVPGDRTTIDLTHDENQYSIGNLKPDTEYEVSLISRRGDMSSNPAKETFTTGLDAPRNLR
RVSQTDNSITLEWKNGKAAVDSYRIKYAPISGGDHAEVEVPRSQQATTRTTLTGLRPGTE
YGIGVSAVKADKESDPATINAATDIDAPKDLRVSETTETNLTLLWRRPSAKFDHYRLNYS
LPSGQPVEVKLPRDTTSYVLRGLEPGKEYSILLTAEKGRHKSKPARVKSSTEAEPEVDNL
LVSDATPDGFRLSWTADEGVFDSFVLKIRDTKKQSEPLEITLLAPERTRDITGLREATEY
EIELYGISNGRRSQPVRALATTAMGSPKEISFSDITEDSATVRWMAPSAQVESFRITYVP
IAGGTPSVVTVDGTKTQTRLVRLLPGAEYLVNVIAMKGFEESEPVSGSFATALDGPSGLV
TANITDTEALAMWQPAIAPVDSYVISYTGERVPEITRTVSGNTVEYALTDLEPATEYTLR
IFAEKGPQKSSIITTKFTTDVDSPRDFTATEVQSETALLTWRPPRAPVTGYLLVYESVDG
TVKEVILGPETTSYSLAELSPSTHYTAKIQALNGPLRSKLVQTIFTTIGLLYPFPRDCSQ
AMLNGDTTSGLYTIYLNGDKAQALEVFCDMTSDGGGWIVFLRRKNGREDFYRNWKAYAAG
FGDRREEFWLGLDNLHKITAQGQYELRVDLRDHGKTAYAVYDKFSVGDAKTRYRLKVEGY
SGTAGDSMAYHNGRSFSTFDKDTDSAITNCALSYKGAFWYKNCHRVNLMGRYGDNNHSQG
VNWFHWKGHEYSIQFAEMKLRPSNFRNLEGRRKRA
NT seq 4968 nt   +upstreamnt  +downstreamnt
atgggggccatgacccggctgttggcgggcatcctcctagcctcgctcaccctcactacc
gaaggtggtgtcctcaagaaggtcatccggcacaagcgacagagcggggtgaatgtcact
ctgccggaggaacaccagccagtggtgtttaatcatgtctacaacattaagctgcctgtc
ggttcccagtgctcggtggatctggaatcagccagtggggagaaagacctggccccgcca
tcagagcccagcgagagcttccaggagcacacggtcgatggggaaaaccagatcgtcttc
acacaccgcatcaacatcccgcgtcgggcctgtggttgcgcctcggctcctgacgtcaag
gagcttctgagcagactggaggagctggagaatctggtgtcttccctgcgggagcagtgc
accacaggagctggctgctgtctccagcctgccgaaggccgcctggacaccaggcccttc
tgcagcggccggggcaacttcagcacggaaggatgcggctgcgtgtgcgaacccggctgg
aaaggccccaactgctcagagcccgaatgtcccggcaactgccacctgcgaggccagtgc
ctggacgggcagtgcatctgcgatgagggcttcacaggcgaggactgtggccagcccgcc
tgccccggtgactgcaacgaccagggcaagtgcgtgagcggggtctgcgtgtgtttcgaa
ggctactcgggcgccgactgcagccaggaggtgtgcccggtggcgtgcagcgaggagcac
ggcaggtgcgtggacggccgctgcgtgtgccaggacggcttcgcgggcgaggactgcaac
gagccgctctgcctcaacaactgccacagccatgggcggtgcgtggagaacgagtgcgtg
tgtgacgagggcttcacgggcgaggactgcggcgagctcatctgccccaacgactgcttc
gaccggggccgctgtgtcaacggcacctgccactgcgagcagggcttcacgggcgaagac
tgcggccagctcagctgcccgaacgcctgcaccggccggggccgctgcgagcagggccag
tgcgtgtgcgagccgggcttcgcggggcccgactgcagcgagaagaggtgtccctccgac
tgccaccaccacggtcgctgcgtggacgggcagtgtgagtgtgacgccggcttcacggga
gctgactgcggcgagctccagtgtcccaacggctgcagcggccacggccgctgcgtcaac
gggcagtgcgtgtgcgacgagggccacactggggaggactgcggccagctgcggtgcccc
aacgactgtcacagccggggccgctgcgtccagggccagtgcgtgtgcgaggcaggcttc
cagggctacgactgcggtgacatgagctgccccaacgactgccaccagcacggccgctgc
gtgaacggcatgtgcgtctgtgatgacggctacaccggggaagactgccgggacctgcgc
tgccccagggactgcagcaaccgtggccgctgcgtggacgggcggtgcgagtgtgagcac
ggcttcactggccccgactgcgtggaactcgcgtgcccgggggactgccacggccaaggc
cgctgtgtgaacgggcagtgcgtgtgccacgagggcttcatgggtgccgcgtgcaaggag
cggaggtgtcctggcgactgtcacggccggggccgctgcgaggacgggcagtgcgtctgc
caggagggctttgcaggccccgactgccggaggcgctcctgccccaacgactgcagcggc
tggggccagtgcgtggagggccgctgcatctgcattgagggccatgctggggaggactgc
tccgacgtgtcccctcccaaagacctcatcgtgacagaagtgacggaagagaccgtaaac
ctggcctgggacaatgagatgcgggtcacagagtacctggtcatgtacacacccacccac
gaggacggcctggaaatgcagttccgcgtgcccggggaccagacgtccaccaccatccgg
gagctggagcccggcgtggagtactttatccgtgtcttcgccatcctggagaacaagaag
agcattcctgtcagcgccagggtggccacgtacttgcctgcacctgaaggcctaaagttc
aagtccatcaaggagacatctgtggaagtggaatgggatcccctggacattgcttttgaa
acgtgggagatcatcttccggaatatgaataaagaagatgagggagagatcaccaaaagc
ctgaggaggccagagaccacataccggcagactggcctagccccggggcaagaatatgag
atctctctgcacatcgtgaaaaacaatacccggggcccgggcctgaagagggtgaccacc
acccgcttggacgcccccagccagattgaggtgaaagatgtcacggacaccacagccctg
atcacttggttcaaacccctggccgagattgatggcgttgagctctcctatggaatcaaa
gatgtgcctggcgaccgcaccaccatcgatctcactcacgatgagaaccagtactccatt
gggaacctgaagccggacaccgaatatgaggtgtccctcatctcccgcaggggtgacatg
tccagcaacccggccaaggagaccttcacgacaggcctggatgctcctagaaatctccgc
cgcgtctcccagacggacaacagcatcaccctggaatggaagaatggcaaggcggccgtt
gacagttatagaattaagtacgcccccatctccggaggtgaccatgccgaggtcgaagtc
ccaaggagccagcaagccaccaccagaaccacgctcacaggtctgaggccgggaaccgaa
tatgggatcggagtgtccgctgtgaaggcagacaaggagagcgatccggccaccatcaac
gcggccacagacatagacgcacccaaggacctgcgggtttctgaaaccacagagaccaac
ctgaccctgctctggaggaggccttcggccaagtttgaccattaccgcctcaactacagc
ctgccctcaggccagccagtggaggtgaaacttccaagagacaccacttcctatgtcctg
agaggcctggaacccgggaaggaatacagcatcctccttaccgcggagaagggcaggcac
aagagcaaacccgcacgagtgaagtcatccacagaagccgagccagaggtggacaacctt
ctggtttcagatgccaccccagatggtttccgtctgtcctggacagctgatgaaggggtc
ttcgacagttttgttctcaaaatcagagataccaaaaagcagtctgagccactggaaata
accctacttgcccccgaacgaaccagggacataactggtctcagagaggccactgagtat
gaaattgaactgtatggaataagcaatggaaggcgatcccaaccagtccgtgccctagca
accacagccatgggctctccgaaggagatcagtttctcagatatcaccgaagattcggcc
actgtccgctggatggcgccctctgcccaggtggagagcttccggattacctacgtgccc
attgcaggagggacgccgtcagtggtaaccgtggatggaaccaagactcagaccaggctg
gtgaggctcttacctggagccgaataccttgtcaacgtcatcgccatgaagggcttcgag
gaaagcgaacccgtctcaggatcgttcgctacagctctggatggcccatctggcctggtg
acagccaacatcaccgacacggaagccttggccatgtggcagccagccatcgcccctgtg
gatagttacgtcatctcctacacaggggagagagtgccagaaattacgcgcacggtgtcc
gggaacacagtggagtacgctctgaccgacctcgagcctgccacggagtacacgctgagg
atctttgcagagaaagggccccagaagagctcaatcatcactaccaagttcacaacagat
gtcgattctccaagagacttcactgctactgaggttcagtcagaaactgccctcctcact
tggagacccccccgggcacctgtcaccggttatctgttggtgtatgaatccgtcgatggt
acagtcaaggaagtcattctgggtccagaaaccacctcctatagcctggcggagctgagc
ccctccacccactacacagccaagatccaggcattgaatgggcccctgaggagcaagctg
gtccagaccatctttaccacaattggactcctgtacccattccccagggactgctcccaa
gcaatgctgaatggagacacgacctctggcctctacaccatttatctgaatggcgataag
gcccaggctctggaagtcttctgtgacatgacctctgatgggggtggatggatcgtgttc
ctgagacgcaagaatggacgtgaggatttctaccgcaactggaaagcctatgctgctggg
tttggggaccgcagagaagaattctggctcgggctggacaacctacacaagatcacagcc
caagggcagtacgagctccgggtggacctgcgcgaccacgggaagaccgcgtacgccgtc
tacgacaagttcagcgtgggggatgccaagactcgctacaggctgaaggtggagggctac
agcgggaccgcaggcgactccatggcttatcacaacggcagatccttctccaccttcgac
aaggacacggactcagccatcaccaactgtgctctgtcctacaagggggctttctggtac
aagaactgtcaccgcgtcaacctgatggggaggtacggggacaacaaccacagtcagggc
gttaactggttccactggaagggccacgaatattccatccagtttgcggagatgaagttg
agacccagcaacttccgaaatctcgaaggcaggcgcaagcgggcgtaa

KEGG   Ursus arctos horribilis: 113266526
Entry
113266526         CDS       T05909                                 

Gene name
LAMC3
Definition
(RefSeq) laminin subunit gamma-3 isoform X1
  KO
K06247  laminin, gamma 3
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah05145  Toxoplasmosis
uah05146  Amoebiasis
uah05165  Human papillomavirus infection
uah05200  Pathways in cancer
uah05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113266526 (LAMC3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113266526 (LAMC3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113266526 (LAMC3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    113266526 (LAMC3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    113266526 (LAMC3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113266526 (LAMC3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    113266526 (LAMC3)
   05145 Toxoplasmosis
    113266526 (LAMC3)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 113266526
NCBI-ProteinID: XP_026369983
UniProt: A0A3Q7X6P5
LinkDB
Position
Unknown
AA seq 1590 aa
MAAAALLLGLALLAPRAAGAGMGACYDDAGRPQRCLPVFENAAFGRRAEASHTCGRPPED
FCPHVGAPGAGAQCQRCDAADPRRRHDAAYLTDFHSQDDSTWWQSPSMAFGVQYPTSVNI
TLRLGKAYEITYVRLKFHTSRPESFAIYKRTQASGPWEPYQYYSASCQKTYGRPEGQYLR
PGEDERVAFCTSEFSDISPLSGGNVAFSTLEGRPSAYNFEESPVLQEWVTSTELLISLDR
LNTFGDDIFKDPKVLQSYYYAVSDLSVGGRCKCNGHASECGPDAAGRLVCRCQHNTTGTD
CERCLPFFQDRPWARGTAEAANECVPCNCSGHSEECAFDRELFRSTGHGGRCLRCRDHTA
GPHCERCQEDFYRWSPRTLCQPCDCHPAGSLRLQCDASGTCVCKPTVTGWKCDRCRPGFH
SLSEGGCRPCACSAAGSLGTCDPHGGRCPCKENVEGHLCDRCRPGTFNLQPHNPAGCTSC
FCYGHSKVCAAAAQFREHHILSDLRQGAQGWRTGSVGGPEHPARWSPRGLLLSPADEEEL
TLPEKFLGDQRFSYGQPLTLTFWVPPGGSPLPVQLRLEGAGLALTLQHSSLSGPPEAGQP
GEVRLRFELQETSEDVDPLLPPFHFQRLLANLTALRIRAGGQSPSPSGQVFLTEVRLTSA
QRGLSPPASWVETCSCPKGYTGQFCESCAPGYKRETPLGGPYTNCVPCTCNQHGTCDPHT
GICLCGHHTEGPSCERCLPGFYGNPFTGQADDCQPCPCPGQSACTAIPESREVVCTHCPL
GQRGRRCEICDDGYFGDPLGLSGAPQPCRQCQCSGNVDPNAVGNCDPLSGHCLRCLHNTT
GAHCERCQEGFYGSALAPRPADRCTPCGCHPKGSVSEQRACDLVTGQCPCLPHVTGRDCG
RCSPGFYDLQPGRGCRSCKCHPLGSQEDQCHPKTGQCPCRPGVEGQACDRCQLGFFGFSI
KGCRACRCSPLGAASAQCHKNGTCVCKPGFMGYKCDRCQDNFFLTAGGTRCQECPSCYAL
VKEEAAKLKARLTLMEGWLQGSDCGQPWGPLHILQGEAPRGDIYQGHHLLQGAREAFLEQ
VTGLEGAVKAAREQLRVLGRSARCAQTRAEKTCFQLAELDAVLESSEEEILQAAIILKSL
AIPQEGPSQPTTWSHLATEARALARSHRDTTAKIEATARRALVASNTSYALLWSLVEGRV
ALEAQRELEDRYQEVQAAQKALGTAVAEALPEAERALAAMQRVDADAALRLASLAAPAAL
PSCQALPEPTVSLQPQKSQARALSLKVQALEKMVASREHKATDAARALQATAQAMLHKTE
PLTQLRQEARAALTRASSSVQAATVTVTGARTLLADLEEMKRQFPRPRDQATVGRKAGIV
SGRLLADVTKKTKQAERMLGNAASVSSSARKKGREAEMLAKDSAKLAKALLGKGKQEHRR
AGRLSSQTRATLRQASREVLASQARRQELGKAEQMGAGLNALEWQIRESRTSLEKDIQAL
LELLAGLGSLDTHQAPARALNKTQRALERLRLQLSPPGTLQGKLRLLEQESEQQELQIQS
FENDLAEIRADKENLEAILHSLPKSCASWQ
NT seq 4773 nt   +upstreamnt  +downstreamnt
atggcggcggccgcgctcctgctgggcttggcgctgctggcgccgcgggcggccggcgcg
ggcatgggcgcgtgctacgacgacgcggggcggccgcagcgctgcctgccggtgttcgag
aacgcggcgttcggccggcgcgccgaggcctcgcacacgtgcggccggccgcccgaggac
ttctgcccgcacgtgggcgcgccgggcgcgggggcgcagtgccagcgctgcgacgccgcc
gacccccggcgccgccacgacgccgcctacctcaccgacttccacagccaggacgacagc
acctggtggcagagcccgtccatggccttcggtgtgcagtaccccacctcggtcaacatc
accctccgcctggggaaggcttacgagatcacctacgtgaggctgaaattccacaccagc
cgccccgagagctttgccatctacaagcgcacccaggccagcggcccgtgggagccctac
cagtactacagtgcctcctgccagaagacctacggcaggcccgagggccagtacctgcgc
cctggcgaggacgagcgcgtggccttctgcacctccgagttcagcgacatctccccgctg
agtggcggtaacgtggccttctccacgctggagggccggcccagcgcttacaacttcgag
gagagccccgtgttgcaggagtgggtcaccagcaccgagctcctcatctccctggaccgg
ctcaacacgtttggggacgacatcttcaaggaccccaaggtgctgcagtcctactactat
gccgtgtccgacctctctgtgggtggcaggtgcaagtgcaacgggcacgccagtgagtgt
ggtcccgacgcggcgggccggctggtctgccggtgccagcacaacaccacgggcaccgac
tgcgagcgctgcctgcccttcttccaggaccgcccgtgggcccggggcacggccgaggct
gccaacgagtgtgtgccctgcaactgcagtggccactcggaggagtgcgccttcgaccgg
gagctcttccgcagcacgggccacggcgggcgctgcctccgctgccgggaccacacagcc
gggccacactgcgagcgctgccaggaggacttctatcgctggagccctcggacgctgtgc
cagccctgtgactgccacccggcaggctccctgcgcctccagtgtgacgcctcggggacc
tgcgtctgcaagcccacggtgacgggctggaagtgtgaccgctgccggcccgggttccac
tcgctcagtgagggaggctgcagaccctgcgcctgcagtgcggctggcagcctgggcacc
tgtgacccccatggcggacgctgcccctgcaaggagaacgtggaaggccacctgtgtgac
agatgccgccccgggacgttcaacctgcagccccacaacccggccggctgtaccagctgc
ttctgctacggccactccaaggtgtgtgcggctgctgcccagtttcgggagcaccacatc
ctctccgacctccgccagggagcccagggctggcggaccggaagtgtggggggcccagag
catcctgcacgatggagcccgagggggctcctcctgagtccagcagatgaagaggagctt
acgctgccagagaagttcctgggagatcagcggttcagctacggacagcccctcacactg
accttctgggtcccccccgggggctccccactccccgtgcagctgaggctggaaggggcg
ggcctggccctgactctgcagcactccagcctttccgggcccccggaagccgggcagccc
ggggaagtacggctcaggtttgagttgcaggagacctccgaggacgtggaccctctgctg
ccccccttccacttccagcggctgctcgccaatctgacggctctgcgcatccgggccggt
ggccagagcccaagcccttctggccaggtgttcctgaccgaggtccggctcacatcagcc
cagcgggggctctccccgccagcctcctgggtagagacctgctcgtgtcccaaggggtac
acaggccagttctgtgaatcctgtgctccaggatacaagagggagacaccactggggggt
ccctataccaactgtgtcccctgcacctgcaaccagcatggcacttgtgacccccacaca
gggatctgcctgtgtggtcaccacaccgagggcccgtcctgtgagcgctgcttgccaggt
ttctatggcaaccccttcacaggccaagccgacgattgccagccctgtccgtgccccgga
cagtcggcctgcacagccatcccagagagcagggaggtggtgtgtacccactgcccccta
ggccagagagggcggcgctgtgagatctgtgatgacggctactttggggaccctctgggg
ctctctggggctccccagccctgccggcagtgccagtgcagtgggaacgtggaccccaac
gccgtgggcaactgtgaccccctgtctggccactgcctgcgttgcctgcacaacacgaca
ggtgcccactgtgagcgctgtcaggaaggcttctacgggagcgccctggcccctcggccc
gcagacagatgcacgccctgcggctgccaccccaagggctcagtcagtgagcagagagcc
tgtgacctggtgacgggccagtgcccctgcctgcctcatgtgacgggacgggactgcggc
cgctgcagccccggcttctacgacctccagcccgggaggggctgccggagctgcaagtgt
cacccgctgggctcccaggaggaccagtgccaccccaagaccgggcagtgcccctgtcgc
ccgggggttgagggccaggcctgtgacagatgccagctgggcttcttcggcttctccatc
aagggctgccgggcctgcaggtgctccccgttgggcgccgcctcagcccagtgccacaag
aacggcacgtgtgtgtgcaagcccggcttcatgggctacaagtgtgaccgctgtcaggac
aacttcttcctcacggccggcggcacacgctgtcaggagtgcccgtcctgctacgccctg
gtgaaggaggaggccgccaagctgaaggccaggctgaccctgatggaggggtggctgcag
gggtctgactgtggccagccctggggcccactgcacattctacagggagaggccccacgg
ggggacatctaccagggccaccacctgctgcaaggagcccgggaggccttcctggagcag
gtgacaggcctcgagggtgcggtgaaggctgcccgggaacagctgcgggtgctgggcagg
agtgcccgctgtgcccagacccgggcagagaagacgtgcttccagctggcagagctcgac
gcagtgctagagtcctcggaagaagagattctgcaggcggccatcatcctcaaatcgctg
gcgattcctcaggaagggcccagccagcccaccacctggagccacctggccacagaagcc
cgtgccctcgccaggagccacagggataccactgccaagatcgaggccactgctcggagg
gccctggttgcctccaacaccagttacgcgcttctctggagtttggtggagggcagagtg
gccttggaggcccagcgggaactagaggacaggtaccaggaggtccaggcggcccagaag
gcgctgggcacagctgtggcagaggctctgcccgaagcggagagggcgctggccgccatg
cagcgagtcgatgcagacgctgccctgcgcctggcctcgctggccgcccctgcagcactg
ccttcctgccaggccctgcctgagcccaccgtctccctgcagcctcagaagtcccaggcc
agggccctgagcctgaaggtgcaggccctggaaaagatggtcgcatccagagagcacaag
gccactgacgctgcccgggccctccaggccactgcccaggccatgctgcacaagaccgag
cccctcacgcagctacgccaggaggccagagccgccctgacccgggcttcctcctccgtc
caggctgccacagtgactgtcacgggagccaggaccctgctggccgacctggaagaaatg
aagcggcagtttcctcggcccagggaccaggccacggtggggaggaaggcaggcatcgtc
agcggcaggctcctcgcagacgtgacgaagaagaccaagcaggcggagaggatgctgggg
aacgcagcgtctgtctcctccagtgcccggaagaagggcagggaagccgagatgctggcc
aaggacagtgctaagcttgccaaggccttgctcgggaagggcaagcaggagcaccgccgg
gcaggccggctctccagccagacgcgggcgacactccgacaggcctcccgggaggtgctg
gcctcacaagcccgcagacaggagctggggaaagctgagcagatgggtgccgggctgaat
gccctggagtggcagatccgggaatcgcgcacctccctggagaaggacatccaggccttg
ttggagctgctcgctgggctggggtcactggacacccatcaagccccagcccgggccctg
aacaagacccagagggcactggagcgcctgaggctgcaactgagcccgccagggaccctg
caggggaaactgaggctgttggagcaggagtccgagcagcaggagctgcagatccaaagc
ttcgagaatgacctcgccgagatccgtgctgacaaggagaacctggaggccattctgcat
agcctgcccaagagctgtgccagctggcagtga

KEGG   Ursus arctos horribilis: 113268232
Entry
113268232         CDS       T05909                                 

Gene name
COL9A2
Definition
(RefSeq) collagen alpha-2(IX) chain
  KO
K08131  collagen, type IX, alpha
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04974  Protein digestion and absorption
uah05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113268232 (COL9A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113268232 (COL9A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113268232 (COL9A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    113268232 (COL9A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113268232 (COL9A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:uah00535]
    113268232 (COL9A2)
Proteoglycans [BR:uah00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   113268232 (COL9A2)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 113268232
NCBI-ProteinID: XP_026372349
UniProt: A0A3Q7XYG4
LinkDB
Position
Unknown
AA seq 688 aa
MAAAAAPRSLLLLLQVLGLALAQIRGPPGEPGPPGPPGPPGVPGSDGIDGDKGPPGKAGP
PGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPVGIPGVKGQPGLPGPPGLPGPGFAGPP
GPPGPVGLPGEIGITGPKGDPGPEGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTNCPAG
VKGPPGLQGVKGHPGKRGALGDSGRQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMEG
PKGETGPHGYKGMVGSIGAAGSPGEEGPRGPPGRAGEKGDVGSQGVRGPQGITGPKGATG
PPGIDGKDGTPGIPGLKGNAGQAGRPGNQGHQGLAGVPGQPGTKGGPGDKGEPGQQGLPG
FSGPPGKEGEPGPQGEIGPQGILGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQG
LPGIKGDKGSPGKTGPRGSVGDPGVAGLRGEKGEKGESGEPGPKGQQGVRGEPGYPGPSG
DAGAPGVQGYPGPPGPRGLAGDRGVPGLPGRQGVAGRDASDQHIVTVMMKMMQEQLAEVA
VSAKREALGAVGMVGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRG
EKGDQGEMGRGHPGMPGPPGIPGLPGRPGQAINGKDGARGSPGAPGEAGRPGLPGPVGLP
GFCEPAACLGASAYASARLTEPGSIKGP
NT seq 2067 nt   +upstreamnt  +downstreamnt
atggccgccgcggccgccccccgcagcctcctgctgctcctccaggtgctcgggctcgcc
ctggcgcagatcagaggtccgcccggagaaccggggcccccgggtcccccagggccgccg
ggagtgcctggatctgacggcatcgacggtgacaaggggccccccgggaaagctggccct
ccgggacctaagggagagcctggcaaagcagggccggatgggccagacgggaagcctggg
attgacggtctaactggagccaagggggaacctggccccgtggggatccctggagtcaag
ggccagcccgggctcccaggtccccctggcctgccgggccctggctttgctggacctcct
gggccacctggacctgtcggcctcccgggtgagattggaatcacaggccccaagggggat
cctggaccagagggaccatcagggcccccagggccacccggcaaaccgggccgacccgga
accatccagggcctggaaggcagcgcggatttcttgtgtccaaccaactgtccagcgggg
gtgaaaggccccccagggctgcagggagtgaaggggcatcctggcaaacgcggggctctg
ggagattctggccgccaggggaagccgggtcccaagggagatgtgggtgcctctggagag
caaggcatccccggaccaccgggtccccagggcatcaggggctaccccggcatggaggga
cccaaaggagagacgggtcctcatgggtacaaaggcatggtgggctccattggtgctgct
gggtcaccaggtgaggaaggtccacgggggccaccaggccgagctggggagaagggtgat
gtgggcagccaaggtgttcgaggaccccagggaataacaggcccgaagggagcaaccggc
cccccaggcattgatggcaaggacgggaccccaggcatacctggcttaaagggcaatgca
ggacaggcggggcggccaggaaaccaaggccaccagggcctagcgggtgtgccgggccag
cctgggacaaaaggaggcccaggagacaagggtgaaccaggccagcagggcctccctgga
ttctctggtcctcctgggaaggaaggagagccaggacctcaaggagaaatcggaccccaa
ggcatcctagggcagaagggtgaccagggtgagaggggaccagtggggcagccaggccct
caaggacggcagggccccaagggggagcaggggccccccggaattccagggccccaaggc
ttgccaggcatcaagggagacaagggctccccggggaagaccggaccccgcggcagcgtg
ggcgacccgggggtggccggcctccggggagagaaaggcgagaagggcgagtctggcgag
ccggggcccaaggggcagcaaggagtccgcggagagcccggctacccgggccccagcggg
gatgcgggcgccccgggggtgcagggctaccccgggccccccggccctcgaggactggct
ggagaccgaggcgtgccgggactgcccgggagacagggcgtggcgggccgagacgccagt
gaccagcacatcgtgaccgtgatgatgaagatgatgcaagagcaactggcagaggtcgct
gtgagtgccaagcgggaggccctgggtgcagtcgggatggtgggtcctccaggaccccct
gggcctcccggatatccgggcaagcagggaccccatgggcaccctggccctcggggagtt
cctggcatcgtgggagccgtgggtcagattggcaacacagggcccaagggaaaacgtgga
gaaaagggtgaccagggagagatgggacgcgggcatcccgggatgcctgggcccccgggg
atcccaggactccctggccggcccggccaggcaatcaacggcaaggatggagctcgaggg
tccccaggggccccgggagaagcaggccgaccaggcctgccaggccccgtggggctgcca
ggcttttgtgagcctgcggcctgccttggagcctcagcctacgcctctgcacgcctcacg
gagcctggatccatcaaagggccatga

KEGG   Ursus arctos horribilis: 113269030
Entry
113269030         CDS       T05909                                 

Gene name
VTN
Definition
(RefSeq) vitronectin
  KO
K06251  vitronectin
Organism
uah  Ursus arctos horribilis
Pathway
uah04151  PI3K-Akt signaling pathway
uah04510  Focal adhesion
uah04512  ECM-receptor interaction
uah04610  Complement and coagulation cascades
uah05165  Human papillomavirus infection
uah05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:uah00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    113269030 (VTN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    113269030 (VTN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    113269030 (VTN)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    113269030 (VTN)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    113269030 (VTN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    113269030 (VTN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:uah04147]
    113269030 (VTN)
   00536 Glycosaminoglycan binding proteins [BR:uah00536]
    113269030 (VTN)
Exosome [BR:uah04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   113269030 (VTN)
Glycosaminoglycan binding proteins [BR:uah00536]
 Heparan sulfate / Haparin
  Extracellular matrix molecules
   113269030 (VTN)
SSDB
Motif
Pfam: Hemopexin Somatomedin_B
Other DBs
NCBI-GeneID: 113269030
NCBI-ProteinID: XP_026373485
UniProt: A0A3Q7X7V5
LinkDB
Position
Unknown
AA seq 470 aa
MASPRPLLTLALLAWVVLADQESCKGRCTEGFTADRKCQCDELCSYYQSCCEDYVAECKP
QVTRGDVFTLPEDEYGAFDYSEGTREGDLTEPESTTLSPGPQTQPEEAPAQVPILIVEEE
APGPGQETSGPETHDEDIPESPAEEELCSGKPFDAFTDLKNGSLFAFRGQYCYELDEEAV
RPGYPKLIRDVWGVEGPIDAAFTRINCQGKTYLFKGNQYWRFEDGVLDPDFPRNISEGFK
GIPDNVDAALALPAHSYNGRERVYFFKGRQYWEYEFQQQPSQEECEDSSLSAVFEHFALL
QRDSWESLFELLFWSRPSGGAGQPRFISQDWPGVPTQVDAAMAGRIYISGSAPRSWAKKP
KSKRRNRKRYRSRRNRHRGRGRSQNPHRQSRSTWPSWFSSEESGLGTYNYDYDMDWLVPA
TCEPIQSVYFFSRDKYYRVNLRTRRVDTVSPPYPRSIAQYWLGCSVSGHQ
NT seq 1413 nt   +upstreamnt  +downstreamnt
atggcatccccaaggcctctactgacgctagccctgctggcgtgggttgttctggctgac
caagagtcctgcaagggccgctgcacagagggcttcactgctgacaggaagtgtcagtgc
gatgagctctgctcttactaccagagctgctgcgaggactacgtggccgagtgcaaaccc
caagtgactcgtggggatgtattcactctgccagaagatgagtacggggcctttgactac
tctgaggggactagagaaggcgacctcacagagcccgagagcaccaccctgagccctggc
ccgcagacccagcctgaagaggctcctgcccaggtacctattctgatcgttgaggaagag
gctccaggacctgggcaggagacctcagggcccgagacgcatgatgaagacatccctgag
tccccagcagaggaggagctatgcagtgggaagccctttgacgccttcactgacctcaag
aatggctccctctttgccttccgagggcagtactgttatgagctggatgaagaggcagtg
aggcctggataccccaagcttatccgagatgtctggggcgttgagggccccattgatgct
gccttcacccgcatcaactgtcaggggaagacctacctcttcaagggtaaccaatactgg
cgctttgaggatggtgtcctggacccagatttcccccgcaacatctctgaaggcttcaag
ggcattccggacaacgtggatgcagccttggctctccctgctcatagctacaatggccgg
gagcgggtctacttcttcaaggggagacagtactgggagtatgagttccagcagcagccc
agtcaggaggagtgtgaagacagctccctgtctgccgtgtttgaacactttgccctgctg
cagagggacagctgggagagcctctttgagcttctcttctggagcagaccctctggtggt
gctggacagccccggttcatcagccaggactggcccggtgtgcccacgcaggtggatgcg
gccatggcgggccgcatctacatctcaggctcagctccccgatcctgggccaagaaaccc
aagtccaagaggcgcaaccgtaagcgctatcgttcacgccgtaaccgtcaccgtggccga
ggccgcagccagaacccccaccggcaatcccgttcaacctggccttcctggttctccagt
gaggagagcggcctgggcacctacaactatgactacgacatggactggcttgtgcctgcc
acctgcgagcccatccaaagtgtttacttcttctcacgagacaagtactaccgagtcaac
cttcgcacacggcgagtggacactgtgagccctccttacccacgctccatcgcccagtac
tggctgggctgctcagtttctggccaccagtag

DBGET integrated database retrieval system