KEGG   Macaca mulatta (rhesus monkey): 100429807
Entry
100429807         CDS       T01028                                 

Gene name
COL4A2
Definition
(RefSeq) collagen alpha-2(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100429807 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100429807 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100429807 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100429807 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    100429807 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100429807 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100429807 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100429807 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100429807 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100429807 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    100429807 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    100429807 (COL4A2)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100429807 (COL4A2)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100429807 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100429807
NCBI-ProteinID: XP_014976929
Ensembl: ENSMMUG00000022282
UniProt: A0A5K1VIA9
LinkDB
Position
17
AA seq 1712 aa
MGRDQRAVAGPALRRWLLLGTVTVGFLAQGVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGITGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPREERD
RYRGEPGEPGLVGFQGPPGRPGPVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIVAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGPRGPPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDLGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGPRGPPGLSIGDEDQRRGLPGEMGPKGFIGDPGIPAL
YPGPPGPDGKPGPPGPPGLPGPPGPDGFLFGRKGAKGTPGFPGLSGSPGARGPKGWKGDA
GDCRCAEGDEAVRGLPGLPGPKGFAGINGEPGRKGDKGDPGQHGLPGFPGLKGVPGNVGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRNGLDGFPGLPGPPGDGIKGPP
GDAGYPGIPGTKGTPGEMGPPGLGLPGFKGQRGFPGDAGLPGPPGFPGPPGPAGTPGQID
CDTDVKRPIGGDRQEAVQPGCIGGPKGLPGLPGPPGPSGAKGLRGIPGFSGADGGPGPKG
LPGDPGREGFPGPPGFIGPRGSKGAVGLPGPDGPPGPIGLPGPDGPPGDRGLPGEVLGAQ
PGPRGDAGVPGQPGFKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLSGPPGQ
HGFPGAPGQEGPLGLPGIPGLEGLPGDRGDPGDIGAPGPVGMKGVSGDRGDAGLAGERGH
PGSPGFKGIDGMPGTPGLKGERGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPIILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGAPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGPTGDFGDIGDTINLPGRPGLKGEQGTAGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGEPGRIGLPGGKGDDGWPGIAGLPGFPGLRGIRGLHG
LPGTKGFPGSPGADIHGDPGFPGPPGDRGDPGDANTLPGPVGVPGQKGEQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDAGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPAGAVGDRGPKGPKGDPGFPG
APGIVGAPGIAGIPQKIAIPPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPVGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTEQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAVAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5139 nt   +upstreamnt  +downstreamnt
atggggagagaccagcgcgcggtggccggccctgccctacggcggtggctgctgctgggg
acagtgaccgtggggttccttgcccagggcgtcttggcgggtgtgaagaagtttgatgtg
ccctgcggaggaagagattgcagcgggggctgccagtgctaccctgagaaaggtggacgt
ggtcagcctgggccagtgggcccccaggggtacaacgggccaccaggattacaaggattc
ccaggactgcagggacgcaaaggagacaagggtgaaaggggagcccccggaataacagga
cccaagggtgacgtgggagcaagaggcgtttctggattccctggtgccgatggaattcct
ggacacccggggcaaggtgggcccaggggaaggcccggctatgatggctgcaacggaacc
cagggagactcaggtccacaggggccccccggctctgaggggttcaccgggcctcccggg
ccccaaggaccaaaagggcagaaaggtgagccttatgcactgcctagagaggaacgcgac
agatatcggggcgaacctggggagcctggattggtcggtttccagggacctcccggccgc
cctgggcctgtggggcagatgggtccagttggagctccagggagaccaggaccacccgga
ccccctggaccaaaaggacagcaaggcaacagaggacttggtttctacggagttaagggt
gaaaagggtgacgtagggcagccgggacccaatgggattccgtcggacaccctccacccc
atcgtcgcgcccacgggagtcaccttccacccagatcagtacaagggtgaaaaaggcagt
gagggggaaccaggaataagaggcatttccttgaagggagaagaaggaatcatgggcttt
cctggaccgaggggtccccctggcttgagtggtgaaaaaggatcaccaggacagaaggga
agccgaggcctggatggctaccaaggccctgatgggccccggggacctaagggagaagcc
ggagacctagggccccccggactacctgcctactcccctcacccttccctagcaaaaggt
gccagaggtgacccgggattcccaggggcccaaggggagccaggaagccagggtgagcca
ggagacccgggcccccgaggaccccctggcctctccatcggagatgaagatcagaggaga
ggcctgccgggtgagatgggacccaagggcttcatcggagaccccggcatccctgcgctc
tacccgggcccacctggacctgatggaaagccagggcctccaggcccccccgggctccct
ggaccacctggacctgatggcttcctgtttgggcggaaaggagcaaaaggaacaccaggc
ttccctgggctttccggctcccctggagcccgcggaccgaagggatggaagggtgacgct
ggggactgcagatgtgcagaaggcgacgaagctgtcagaggtcttccgggactgccagga
cccaagggctttgcaggcatcaacggggagccggggaggaaaggggacaaaggagacccc
ggccaacacggcctccctgggttcccagggctcaagggagtccctggcaacgttggtgct
cctgggcccaaaggagcaaaaggagattccagaacaatcacgaccaaaggtgagcgggga
cagcccggcgtcccaggtgtgcccgggatgaaaggtgatgatggcagcccaggccgcaat
gggctcgatggattccccggcctcccaggccctcccggcgatggcatcaagggccctccc
ggggacgcgggctatccaggaattcctggaacgaaaggtactccgggagaaatgggtccc
ccaggactgggccttcccggcttcaaaggccagcgtggtttccctggagatgccggctta
cctggaccaccaggcttcccgggccctcctggccccgcagggaccccaggacaaatagat
tgtgacacagatgtgaaaaggcccattggaggtgacagacaggaggccgtccagccaggt
tgcataggagggcccaagggactgccaggcctgccaggacccccaggaccctcaggtgcc
aaaggcctccgaggaatcccaggcttctcaggagctgatggaggaccagggcccaagggt
ttgccaggagacccaggtcgtgaagggttcccaggacccccagggttcataggaccccga
ggatccaaaggtgcggtgggcctccctggcccagatggacccccaggtcccatcggcctg
ccagggccagacgggccccctggggacaggggccttcctggagaagtcctgggagcccag
cccgggccacggggagatgctggtgtgcctggacagcctgggtttaaaggccttcccgga
gacagaggcccccctggatttagaggaagtcaagggatgcctgggatgccagggctgaag
ggccagccgggcctcccgggaccttccggccagccaggcctgtccgggcctccaggacag
cacggattcccaggagctcctggccaagaggggcccttggggctgccaggaatcccaggt
cttgaaggtctgcctggagatagaggggaccctggggacataggcgctcccggccctgtg
ggcatgaaaggggtctctggtgacagaggagatgctggcttggcaggggagcgaggccat
ccaggaagccctggatttaaaggaattgatggaatgcctgggacccccgggctaaaagga
gagagaggctcacctgggatggatggtttccaaggcatgcctggactcaaagggagaccc
gggtttccagggagcaaaggcgaggctggatttttcggaatacccggtctgaagggtctg
gccggtgagccaggttttaaaggcagccgaggcgaccctgggcccccaggaccacctccc
atcatcctgccaggaatgaaagacattaaaggagagaaaggagacgaagggcctatggga
ctgaaaggatacctgggcgcaaaaggtatccaaggaatgccaggcatcccaggactgtca
ggaatccctgggctgcctgggaggcccggccacatcaaaggagtcaagggagacatcgga
gccccgggcatccctggtttgccaggattccctggggtggctggcccccctggaattaca
gggttcccaggattcataggaagccggggtgacaaaggcgccccggggagagcaggcctg
tatggcgagatcggcccgaccggtgatttcggtgacattggggacactataaatttacca
ggaagaccaggcctgaagggggagcagggcaccgctggaatcccaggtctgaagggattc
tttggagagaagggaacagaaggtgacatcggcttccctgggataacaggcgtgactgga
gtccaaggccctcctgggcttaaaggacaaacaggctttccagggctaactgggcctcca
gggtcgcagggagagccggggcggattggactgcctggtggcaaaggagatgatggctgg
ccgggaattgcaggcttaccaggttttccgggactccgtgggatccgcggcttacacggc
ttgccaggcaccaaaggctttccaggatccccaggtgccgacatccacggagacccaggc
ttcccaggccctcctggggacagaggtgacccaggagacgccaacacccttccaggccct
gtgggagtcccaggacagaaaggagagcaaggagctccaggagaacgaggcccacctggg
agtccaggacttcaggggttccctggcatcacgcccccttccaacatctctggggcacct
ggtgacaaaggggcgccagggatatttggcctgaaaggttatcggggcccgccagggcca
cccgggtctgctgctcttcctggaagcaaaggtgacgcagggaacccaggagctccagga
accccagggaccaaaggatgggccggggactccgggccccagggcaggccgggtgtgttc
ggtctcccgggagaaaaagggcctaggggtgaacaaggctttatggggaacactggaccg
gccggtgcggtgggcgacagaggccccaagggacccaagggagacccaggattccctggt
gcccccgggattgtgggagcccccgggattgcaggaatcccccagaagattgccatccca
ccagggacagtgggtccgcaggggaggcgaggcccccctggggcaccaggggagatgggg
ccccagggcccccccggagaaccaggtttccgtggggctccagggaaagccgggccccaa
ggaagaggtggcgtgtctgctgttcccgggttccggggagatgaaggacccgtaggccac
caggggccgattggccaagaaggtgcaccaggccgtccagggagcccgggcctgccgggt
atgccaggccgcagcgtcagcatcggctacctcctggtgaagcacagccagacggaacag
gagcccatgtgcccggtgggcatgaacaagctttggagtgggtacagcctgctgtacttt
gagggccaggagaaggcacacaaccaggacctggggctggcgggctcctgcctggcgcgg
ttcagcaccatgcccttcctgtactgcaaccctggtgatgtctgctactatgccagccgg
aacgacaagtcctactggctgtctaccacggctccgctgcccatgatgcctgtggccgag
gatgagatcaagccctacatcagccgctgctcggtgtgtgaggccccagccgtcgccatc
gccgtccacagtcaggatgtctccattccacactgcccagctgggtggcggagcttgtgg
atcggatattccttcctcatgcacacggcggcgggagacgaaggcggcggccagtccctg
gtgtcaccgggcagctgtctggaggacttccgcgccacgccattcatcgagtgcaatgga
ggccgcggcacctgccactactacgccaataagtacagcttctggctgaccaccattccc
gagcagagcttccagggctcgccctccgctgacacgctcaaggccggcctcatccgcaca
cacatcagccgctgccaggtgtgcatgaagaacctgtga

KEGG   Macaca mulatta (rhesus monkey): 100430603
Entry
100430603         CDS       T01028                                 

Gene name
LAMA5
Definition
(RefSeq) laminin subunit alpha-5
  KO
K06240  laminin, alpha 3/5
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100430603 (LAMA5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100430603 (LAMA5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100430603 (LAMA5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100430603 (LAMA5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100430603 (LAMA5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100430603 (LAMA5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100430603 (LAMA5)
   05145 Toxoplasmosis
    100430603 (LAMA5)
SSDB
Motif
Pfam: Laminin_EGF Laminin_I Laminin_N Laminin_G_2 Laminin_G_1 Laminin_II Laminin_B Laminin_G_3
Other DBs
NCBI-GeneID: 100430603
NCBI-ProteinID: XP_028684513
Ensembl: ENSMMUG00000031416
LinkDB
Position
10
AA seq 3694 aa
MAKRLCAGSALCVRGPRGPAPLLLVGLALLGTARAREEAGGGFSLHPPYFNLAEGARIAA
SATCGEEAPARGSPRPTEDLYCKLVGGPVAGGDPNQTIQGQYCDICTAANSNRAHPASNA
IDGTERWWQSPPLSRGLEYNEVNVTLDLGQVFHVAYVLIKFANSPRPDLWVLERSMDFGR
TYQPWQFFASSKRDCLERFGPQTLERITRDDAAICTTEYSRIVPLENGEIVVSLVNGRPG
AMNFSYSPLLREFTKATNVRLRFLRTNTLLGHLMGKALRDPTVTRRYYYSIKDISIGGRC
VCHGHADACDAKDPTDPFRLQCTCQHNTCGGTCDRCCPGFNQQPWKPATANSANECQSCN
CHGHATDCYYDPEVDRRRASQSLDGTYQGGGVCIDCQHHTTGVNCERCLPGFYRSPDHPL
DSPHVCCRCNCESNFTDGTCEDLTGRCYCRPNFSGQRCDVCAEGFTGFPSCYPTPSSSND
TREQVLPAGQIVNCDCSAAGTQGNACRKDPRVGRCVCKPNFQGTHCELCVPGFYGPGCQP
CQCSSPGVADDRCDPDTGRCRCREGFEGATCDRCAPGYFRFPLCQLCGCSPAGTLPEGCD
EAGRCPCQPGFAGPHCDQCHPGYHGFPNCQPCTCDPRGALDQLCGAGGLCRCRPGYTGTA
CQECSPGFHGFPSCVPCHCSAEGSLHAACDPRSGQCSCRPRVTGLRCDTCVPGAYNFPYC
EAGSCHPAGLALVDPALPEAQAPCVCRANVEGPSCDRCKPGFWGLSPSNPEGCTRCSCDP
RGTLGGVAECQPGTGQCFCKPHVCGQACAACKDGFFGLDQADYFGCRSCRCDAGGALGQS
CEPRTGACRCRPNTQGPTCSRPVRDHYLPDLHHLRLELEEAATPEGHAVRFGFNPLEFEN
FSWRGYAQMAPVQPRIVARLNLTSPDLFWLIFRYVNRGAVSVSGRVSVREEGRLAACANC
TAQSQPVAFPPSTEPAFVTVPQRGFGEPFVLNPGTWALLVEAEGVLLDYVVLLPSAYYEA
ALLQLRVTEACTYRPSTQQSGDNCLLYTHLPLDGFPSAAGLEALCRQDNSLPRPCPTEQL
SPSHPPLVTCTGSDVDVQLQVAVPQPGRYALVVEYANEDARQEVGVAMHTPQRAPQQGLL
SLHPCLYSTLCRGTARDTQDRLAVFHLDSEANVRLTAEQAHFYLHGVTLVPMEEFSPEFV
EPRVRCISSHGAFGPSSAACLPSRFPKPPQPIILRDCQVIPLPPGLPLTQAQDLMPATSP
TGPQPRPPTAVDPDVEPTLLREPQATVVFTTHVPTLGRYAFLLHGYQPAHPTFPVEVLIN
AGRVWQGHANASFCPHGYGCRTLVVCEGQALLDVTHSELTVTVRVPEGRWLWLDYVLVVP
ENVYSFGYLREEPLDKSYDFISHCAAQGYHISPGSSSLFCRNAAASLSLFYNNGARPCGC
HEVGATGPTCEPFGGQCPCRAHVIGRDCSRCATGYWGFPNCRPCDCGARLCDELTGQCIC
PPRTVPPECLLCQPQTFGCHPLVGCEECNCSGPGVQELTDPTCDTDSGQCKCRPNVTGRR
CDTCSPGFHGYPHCRPCDCHEAGTAPGVCDPLTGQCYCKENVQGPRCDQCSLGTFSLDAA
NPKGCTRCFCFGATERCRSSSYTRQEFVDMEGWVLLSTDRQVVPHERRPGTEMLRADLRH
VPEAVPEAFPELYWQAPPSYLGDRVSSYGGTLRYELHSETQRGDVFVPTESRPDVVLQGN
QMSITFLEPAYPMPGHIHRGQLQLVEGNFRHTETRNTVSREELMMVLASLEQLQIRALFS
QISSAVSLRRVALEVASPAGQGTLASNVELCLCPASYRGDSCQECAPGFYRDVKGLFLGR
CVPCQCHGHSDRCLPGSGICVDCQHNTEGAQCERCQAGFVSSREDPSAPCVSCPCPLSVP
SNNFAEGCVLRGGRTQCLCKPGYAGASCERCAPGFFGNPLVLGSSCQPCDCSGNGDPNLL
FSDCDPLTGACRGCLRHTTGPRCEICAPGFYGNALLPGNCTRCDCAPCGTEACNPHSGQC
LCKAGVTGRRCDRCQEGHFGFEGCGGCRPCACGPAAEGSECHPQSGQCHCRPGTMGPQCR
ECAPGYWGLPEQGCRRCQCPGGRCDPHTGRCTCPPGLSGERCDTCSQQHQVPVPGGPGDH
SVHCEVCDHCVVLLLDDLERAGTLLPAIREQLRGINASSVAWARLHRLNASITDLQSQLR
SPLGPRHETAQQLEVLEQQSTSLGQDAQQLDSQAAGTREQASRLLAGTEATLGRAKTLLA
AIRAVDRTLSELMSQTGHLGLANASALSGEQLRRTLAEVERLLWEMRARDLGAPRAAAEA
ELAAAQRLLARVQEQLSNLWEENQALATHTRDRLAQHEASLMDLREALNRAVDVTREAQE
LNSRNQERLEEALQRKQELSRDNATLKATLHAASDTLASVFRLLHSLDQAKEELERLAAS
LDGARTPLLQRMQNFSPAGSKLRLVEAAEAHAQQLDQLALNLSSIILDINQDRLTQRAIE
ASSAYSRILQAVQAAEDAAGQALQQADHTWATVVRQGLVDRARQLLANSSALEEAVLREQ
RRLGLVWAALQGAGTQLRDVRAKKDQLEARIQAAQAMLAMDTDETSKKIAHAKAVAAEAQ
DVAARVETQLQAVQESVERWQGQYEGLRGQDLGQAVLDAGRSVSSLEKTLPQLLAKLSVL
ENRGAHNASLVLSASIGRVRQLIAQARGAASKVKVPMKFNGRSGVQLRTPRDLADLAAYT
ALKFYLQGPEPEPGQGTEDHFVMYMGSRQATGDYMGVSLRDKKVHWVYRLGEAGPAVLSI
DEDIGEQFAAVSLDRTLQFGHMSVTVERQMIQETKGDTVAPGAEGLLNLRPDDFVFYVGG
YPSTFTPPPLLRFPGYRGCIEMDTLNEEVVSLYNFERTFQLDTAVDRPCARSKSTGDPWL
TDGSYLDGTGFARISFDGQISTTKRFEQELRLVSYSGVLFFLKQQSQFLCLAVREGSLVL
LYDFGAGLREAVPLQTPPPLTSASKAIQVFLLGGSRKRVLVRVERATVFSVEQDNVLELA
DAYYLGGAPPDQLPPSLRRLFPMGGSVRGCVKGIKALGKYVDLKRLNTTGVSAGCTADLL
VGRAMTFHGHGFLHLALSNVAPLTGDVYSGFGFHSAQDSALLYYRASPDGPCQVSLRQGH
VTLQLLRTEVIAEGAFADGAPHYVAFYSNATGVWLYVDDQLQQMKPHRGPRPEPQPQPEG
PPSLLLGGLPEPDTFHFSGCISNIFVRRLLGPQRVFDLQQNLGSVNVSTGCAPTLQAQTL
DLGPRGLRTATRKASRRSRQPTQDPACMLPPYLRTTQDTYQFGGSLSSHLEFVGILAPHR
NWPSLSMRVLPRSPRGLLLFAASLRPSSPSLALFLSNGHFIAQMEGLGTRLRAQSRQRSW
PGHWHKVSVRWEKNRILLVTDGARAWSQEGPRWQHQGAEHPQPHTLFVGGLPASSHSSKL
PVTVGFSGCVKRLRLHGRPLGAPTRMAGVTPCISGPLEAGLFFPGSGGLITLDLPGATLP
DVGLELEVRPLAVTGLIFHLGQARTPPYLQLQVTEKQVLLRADDGAGEFSTSVTRPSVLC
DGQWHRLAVMKGGNVLRLEVDAQSNHTVGPSLAAAAGAPAPLHLGGLPEPTAMQPWPPAY
RGCMRRLTVNRAPVTMTRSAEVHGAVGASGCPAA
NT seq 11085 nt   +upstreamnt  +downstreamnt
atggcgaagcggctctgcgcggggagcgcactgtgtgtccgcggcccccggggccccgcg
ccgctactgctggtggggctggcgctactgggcacggcgcgggcgcgggaggaggcgggc
ggcggcttcagcctgcacccgccctacttcaacctggccgagggcgcccgcatcgccgcc
tccgcaacctgcggggaggaggccccggcgcgcggctccccgcgccccaccgaggacctt
tactgcaagctggtggggggccccgtggccggcggcgaccccaaccagaccatccagggc
cagtactgtgacatctgtacggctgcgaacagcaacagggcacaccccgcgagcaacgcc
atcgatggcacagagcgctggtggcagagcccgcccctgtcccgcggcctggagtacaac
gaggtcaacgtcaccctggacctgggccaggtcttccacgtggcctacgtcctcatcaag
tttgccaactcaccccggccggacctctgggtgctggagcggtccatggacttcggccgc
acctaccagccctggcagttctttgcctcctccaagagggactgtctggagcggtttggg
ccacagactctggaacgcatcacacgggatgacgcggccatctgcaccaccgagtactcg
cgcatcgtgcccctggagaatggagagattgtggtgtccctggtgaatgggcgcccaggc
gccatgaatttctcctactcgccgctgctgcgtgagttcaccaaggccactaacgtccgc
ctgcgcttcctgcgcaccaacacgctgctgggccacctcatggggaaggcgctgcgagac
cccacggtcacccgccggtattattacagcatcaaggacatcagcatcggaggccgctgt
gtctgccacggccacgcggatgcctgtgatgccaaagaccccacagacccgttcaggctg
cagtgcacctgccagcacaacacctgcgggggcacctgtgaccgctgctgccccggcttc
aaccagcagccgtggaagcctgcgactgccaacagtgccaatgagtgccagtcctgtaac
tgccacggccacgccaccgactgttactacgaccctgaggtggaccggcgccgtgccagc
cagagcctggatggcacctatcagggcgggggtgtctgtatcgactgccagcaccacacc
accggcgtcaactgtgagcgctgcctgccaggcttctaccgctctcctgaccacccgctt
gactcgccccacgtctgctgccgctgcaactgcgagtccaacttcacggatggcacctgc
gaggacctgacgggtcgctgctactgccggcccaacttctccgggcagcggtgcgacgtg
tgtgccgagggcttcacgggcttcccaagttgctacccgacaccctcatcctccaatgac
accagggagcaggtgctgccagccggccagattgtgaattgtgactgcagtgcggcaggg
acccagggcaacgcctgccggaaggacccgagagtggggcgctgtgtgtgcaaacccaac
ttccaaggcacccattgtgagctctgcgtgccagggttctatggccccggctgccagccc
tgccagtgttccagccctggagtggccgatgaccgctgtgaccctgacacaggccggtgc
aggtgccgagagggcttcgagggggccacatgtgatcgctgtgcccctggctacttccgc
ttccctctctgccagttgtgcggctgcagccctgcaggaaccctgcccgagggctgcgac
gaggccggccgctgcccatgccagcctgggtttgctggacctcattgtgaccagtgccac
cctggctaccacggtttccccaactgccaaccgtgcacctgcgatccccggggagccctg
gaccagctctgcggggcaggaggtttgtgccgctgccgccccggctacacaggcaccgcc
tgccaggaatgcagccccggcttccatggcttccccagctgtgtcccctgccactgctct
gccgaaggctccctgcacgcagcctgcgacccccggagtgggcaatgcagctgccggccc
cgcgtgacggggctgcggtgtgacacatgtgtgccaggtgcctacaacttcccttactgc
gaagctggctcttgccatcctgctggtctggctctggtggatcctgcccttcctgaggca
caggctccctgtgtgtgccgggctaacgtggaggggccaagctgtgaccgctgcaaacct
gggttctggggactaagccccagcaaccccgagggctgcacccgctgcagctgcgacccc
aggggcacgctgggtggagttgctgagtgccagccaggcaccggccaatgcttctgcaag
ccccacgtgtgtggccaggcctgcgcagcctgcaaggacggcttctttggactggatcag
gctgactatttcggctgccgcagctgccggtgtgacgctggcggtgccctggggcagagc
tgtgaaccgaggacgggtgcctgccgctgccgccccaacacccagggccccacctgcagc
cggcctgtgagggaccactacctcccggacctgcaccacctgcgcctggagctggaggag
gccgccacacctgagggtcacgctgtgcgctttggcttcaacccccttgagttcgagaac
ttcagctggaggggctacgcgcaaatggcacctgtccagcccaggatcgtggccaggctg
aacctgacctcccccgaccttttctggcttatcttccgatacgtcaaccggggggccgtg
agtgtgagcgggcgagtctctgtgagagaggagggcaggttggccgcctgcgccaactgc
acagcgcagagccagcccgtggccttcccgcccagcacagaacctgccttcgtcaccgtg
ccccagaggggcttcggagagcccttcgtgctgaaccctggcacctgggccttgcttgtg
gaggctgagggggtgctcctggactacgtggttctgctgcctagcgcatactacgaggcg
gcactcctgcagctgcgggtgaccgaggcctgcacgtaccgcccctctacccagcagtct
ggcgacaactgcctcctctacacacacctccccctggatggcttcccctcggccgcgggg
ttggaggccctgtgtcgccaggacaacagcctgccccggccctgccccacggagcagctc
agcccatcgcacccaccactggtcacctgcacaggcagtgatgtggatgtccagcttcag
gtggcagtgccacagccgggccgctatgccctggtggtggagtatgccaatgaggatgcc
cgccaggaggtgggtgtggccatgcacaccccacagcgggcgccccagcaggggctgctc
tccctgcacccctgcctgtacagcaccctgtgccggggcactgcccgggatacccaggac
cgcctggccgtcttccacctggactcagaggccaacgtgaggctcacagctgagcaggcg
cacttctacctgcacggggtcactctggtgcccatggaggagttcagcccggagttcgtg
gagccccgggtccgctgcatcagcagccacggagcctttggtcccagcagtgccgcctgt
ctgccctcgcgcttcccaaagccgccccagcccatcatcctcagggactgccaggtgatc
ccgctgccgcccggtctcccgctgacccaggcacaggacctcatgccagccacgtcccca
actggaccccagcctcggccccccaccgccgtggaccctgatgtggagcccactttgctg
cgtgagccccaggccaccgtggtcttcaccacccatgtgcccacgctgggccgctacgcc
ttcctgctgcacggctaccagccagcccaccccaccttccccgtggaagtcctcatcaac
gccggccgcgtgtggcagggccatgccaacgccagcttctgtccacatggctacggctgc
cgcaccctggtggtgtgtgagggccaggccctgctggacgtgacccacagcgagctcact
gtgactgtgcgtgtgcccgagggccgctggctctggctggattatgtcctcgtggtccct
gagaatgtctacagctttggctacctccgggaggagcccctggacaaatcctatgacttc
atcagccactgtgcagcccagggctaccacatcagccccggcagctcatccctgttctgc
cgaaatgctgctgcctccctctccctcttctataacaacggagcccggccatgtggctgc
cacgaagtaggtgctacaggccccacgtgtgagcccttcgggggccagtgtccctgccgt
gcccacgtcattggccgtgactgctcccgctgcgccaccggatactggggcttccccaac
tgcaggccctgtgactgcggcgcccgcctctgtgacgagctcacgggccagtgcatctgc
ccaccacgcaccgtcccgcctgagtgcctgctgtgccagccccagacctttggctgccac
cccctggtcggctgtgaggagtgtaactgctcagggcccggtgtccaggagctcacagac
cctacctgtgacacagacagcggccagtgcaagtgcagacccaacgtgaccgggcgtcgc
tgtgatacctgctctccgggcttccatggctacccccactgccgcccctgtgactgtcac
gaggcgggcactgcgcctggtgtgtgtgaccccctcacaggacagtgctactgtaaggag
aatgtgcagggccccagatgtgaccagtgcagccttgggaccttctccctggatgctgcc
aaccccaaaggttgcacccgctgcttctgttttggggccacagagcgctgccggagctcg
tcctacacccgccaggagttcgtggacatggagggctgggtgctgctgagcactgaccgg
caggtggtgccccacgagcggcggccagggacggagatgctccgtgcagacctgcggcac
gtgcccgaggcggtgcctgaggccttccccgagctgtactggcaggccccaccctcctac
ctgggggaccgggtgtcatcctacggtgggaccctccgttatgaactgcactcggagacc
cagcggggcgacgtctttgtccccacggagagcaggccggatgtggtgctgcagggcaac
cagatgagcatcacattcctggagccggcataccccatgcccggccacattcaccgtggg
cagctgcagctggtggaggggaacttccggcacacggagactcgcaacactgtgtcccgt
gaggagctcatgatggtgcttgccagcctggagcagctgcagatccgtgccctcttctcg
cagatctcctcggctgtctccctgcgcagggtggcactggaggtggccagcccagcaggc
cagggtaccctggccagcaatgtggagctgtgcctgtgccccgctagctaccggggggac
tcatgccaggaatgtgcccccggcttctatcgggacgtcaaaggtctcttcctgggccga
tgtgtcccttgtcagtgccacggacactcagaccgctgcctccctggctctggcatctgt
gtggactgccagcacaacaccgaaggagctcaatgtgagcgctgccaggctggctttgtg
agcagcagggaggaccccagtgccccctgtgtcagctgcccctgccccctctcagtgcct
tccaacaactttgccgagggctgtgtcctgcgaggtggccgcacccagtgcctctgcaaa
cctggttatgcaggcgcctcctgtgagcggtgtgcgcccggtttcttcgggaacccactg
gtgctgggcagctcctgccagccttgtgactgcagcggcaatggtgaccccaacctgctc
ttcagcgactgcgaccccctgacgggtgcctgccgtggctgcctgcgccacaccactggg
ccacgctgcgagatctgtgcccccggcttctatggcaacgcgctgctgcccggcaactgc
acccggtgcgactgtgccccatgtgggacagaggcctgcaacccccatagtgggcaatgc
ctgtgcaaggcgggtgtgactgggcggcgctgtgaccgctgtcaggagggacattttggc
ttcgaaggctgcgggggctgccgcccgtgtgcttgtggaccggccgccgagggctccgag
tgccacccccagagcggacagtgccactgccgaccagggaccatgggaccccagtgccgc
gagtgtgcccctggctactgggggctccctgagcagggctgcaggcgctgccagtgccct
gggggccgctgtgaccctcacacgggccgctgcacctgccccccagggctcagtggggag
cgctgcgacacctgcagccagcagcatcaggtgcctgttccaggtgggcctggggaccac
agcgtccactgtgaagtgtgtgaccactgtgtggtcctgctcctggacgacctggaacgg
gccggcaccctcctccccgccatccgcgagcaactgcgtggcatcaatgccagctctgtg
gcctgggcccgtctgcacaggctgaacgcctccatcactgacctacagagccagctccgg
agccccctgggcccccgccacgagacggcacagcagctggaggtgctggagcagcaaagc
acaagccttgggcaggatgcacagcagctggacagccaggccgcagggacccgagagcag
gcgagccgattgctggccggcaccgaggccacgctgggccgggcgaagacgctgttggcg
gctatccgggctgtggaccgcaccctgagcgagctcatgtcccagaccggccacctgggg
ctggccaatgcctcggctctgtcaggcgagcagctgcgccggacgctggccgaggtggag
cggctgctctgggagatgcgggcccgggacctgggggccccgcgggcagctgctgaggct
gagttggctgcagcgcagagactgctggcccgagtgcaggagcagctgagcaacctctgg
gaggagaaccaggcactggccacacacacccgcgaccggctggcccagcatgaggccagt
ctcatggaccttcgagaggccttgaaccgggcagtggacgtcacgcgggaggcccaggag
ctcaacagccgcaaccaggagcgcctggaggaagccctgcaaaggaagcaggagctgtcc
cgggacaatgccaccctgaaggccactctacatgcagctagtgacaccctggccagcgtc
ttcagattgctgcacagcctggaccaggccaaggaggagctggagcgcctcgccgccagc
ctggacggggctcggaccccactgctgcagaggatgcagaacttctccccagcgggcagc
aagctgcgtctggtggaggccgccgaggcccacgcacagcagctggaccagctggcgctc
aacctgtccagcatcatcctggacatcaaccaggaccgccttacccagagggccatcgag
gcctccagcgcttacagccgcatcctgcaggccgtgcaggctgccgaggatgctgctggc
caggccctgcagcaggcagaccacacgtgggcgacagtggtgcggcagggcctggtggac
cgagcccggcagctcctggcaaacagcagtgcactggaggaggctgtgctccgggaacag
cggaggctgggcctcgtgtgggctgccctccagggtgccgggacccagctccgagatgtc
cgggccaagaaggaccaactggaggcgcgcatccaggcggcgcaggccatgcttgccatg
gacacagacgagacgagcaagaagatcgcacacgccaaggctgtggccgctgaagcccag
gacgtggccgcccgtgtggagacgcagctgcaggccgtgcaggagagcgtggagcggtgg
cagggccagtatgagggcctgcggggccaggacctgggccaggcggtgctcgacgcaggc
cgctcagtgtccagcctggagaagacgctgccccagctgctggccaagctgagtgtcctg
gagaaccgtggggcgcacaacgccagcctggtcctgtccgccagcatcggccgtgtgcgg
cagcttattgcccaggcccggggcgctgccagcaaggtcaaggtgcccatgaagttcaat
gggcgctcaggggtgcagctgcgcaccccacgggatcttgccgaccttgctgcctacact
gccctcaagttctacctgcagggcccagagcctgagcctgggcagggtaccgaggatcac
tttgtgatgtacatgggcagccgccaggccactggggactacatgggtgtgtctctgcgt
gacaagaaggtgcactgggtgtatcggctgggcgaggcgggccctgcagtcctgagcatc
gacgaggacattggggagcagtttgcagctgtcagcctggacaggactctccagtttggc
cacatgtctgtcacagtggagagacagatgatccaggaaaccaagggtgacacggtggcc
cctggggcagaggggctgctcaacctgcggccagacgactttgtcttctacgtcgggggt
taccccagtaccttcacgccccctcccctgcttcgcttccctggctaccggggctgcatc
gagatggacacgctgaatgaggaggtggtcagcctctacaacttcgagaggaccttccag
ctggacacggctgtggacaggccttgtgcccgctccaagtcgaccggggacccgtggctc
acagacggctcctacctggatggcaccggttttgcccgcatcagcttcgatggtcagatc
agcaccaccaagcgctttgaacaggagctgcgcctcgtatcctacagtggggtcctcttc
ttcctgaagcagcagagccagttcctgtgcttggccgtgcgagaaggcagcctcgtgctg
ctgtatgactttggcgctggcctgagagaggctgtcccgctgcagaccccaccgcccctg
acctcggccagcaaggcgatccaggtgttcctgctggggggcagccgcaagcgtgtgctg
gtgcgtgtggagcgggccacggtgttcagcgtggagcaggacaatgttctggaactggcc
gacgcctactacctggggggcgcgccgcccgaccagctgcccccgagcctgcgacggctc
ttccccatgggaggctcagtccgtggctgcgtcaaagggatcaaggccctgggcaagtat
gtggacctcaagcggctgaacacaacaggcgtgagcgccggctgcaccgctgacctgctg
gtggggcgcgccatgactttccacggccacggcttccttcacctggcgctctcgaacgtg
gcacccctcactggcgacgtctactccggctttggcttccacagtgcccaggacagtgcc
ctgctctactaccgggcatccccggatgggccatgccaggtgtccctgcggcagggccac
gtgaccctacagctcctgagaactgaagtgatagctgagggggccttcgccgatggcgcc
ccccattacgtcgccttctacagcaatgccacaggggtctggctctatgtcgacgaccag
ctccagcagatgaagccccaccggggaccacgcccagagccccagccgcagcctgagggg
cccccgagtctcctcctgggaggcctgcctgagcctgacacctttcacttcagtggctgc
ataagcaacatctttgtgcggcggctcctgggcccacagcgcgtattcgacctgcagcag
aacctgggcagcgtgaacgtgagcacgggctgtgcacccactctgcaagcccagaccctg
gacctggggcctagaggactgcggaccgccacccggaaggcttcccgccgcagtcgtcag
cccacccaggatcccgcctgcatgctgcccccatacctcaggaccacccaagacacctac
cagtttgggggttccctgtccagtcacctggagtttgtgggcatcctggccccacatagg
aactggcccagcctctccatgcgcgtcctcccgcgaagcccccgaggcctcctgctcttc
gctgcgagcctgaggcccagcagcccctccctggcactcttcctgagcaatggacacttc
atcgcacagatggaaggccttgggactcggctccgcgcccagagccgccagcgctcctgg
cctggccactggcacaaggtctccgtgcgctgggagaagaaccggatcctgctggtgacg
gacggggcccgggcctggagccaggaggggccacgctggcagcaccagggggcagagcac
ccccagccccacaccctctttgtggggggcctcccggccagcagccacagctccaaactt
ccggtgaccgtcgggttcagcggctgtgtgaagagactgaggctgcacggaaggcccctg
ggggcccccacacggatggcaggggtcacaccctgcatctcgggccccctggaggcgggc
ctgttcttcccaggcagcgggggacttatcactttagacctcccaggagctacactgcct
gacgtgggcctggaactggaggtacggcccctggcagtcaccggcctgatcttccacttg
ggccaggcccggacacccccctatttgcagctgcaggtgactgagaagcaagtcctgctg
cgggcggacgatggagcaggggagttctccacatcagtgacccgcccctcagtgctgtgt
gacggccagtggcaccggctggcagtgatgaaaggcggaaacgtgctccggctagaggtg
gacgcacagagcaaccacaccgtgggcccctcgctggcggctgcggctggtgccccagcc
cctctgcacctcgggggcctgcctgagcccacggccatgcagccctggccccccgcctac
cgtggctgcatgaggaggctgacagtgaaccgggcccccgtcaccatgactcgctctgca
gaggtccacggggcagtgggggccagtggctgcccagccgcctag

KEGG   Macaca mulatta (rhesus monkey): 574201
Entry
574201            CDS       T01028                                 

Gene name
COL1A1
Definition
(RefSeq) collagen alpha-1(I) chain isoform X1
  KO
K06236  collagen type I alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04611  Platelet activation
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    574201 (COL1A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    574201 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    574201 (COL1A1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    574201 (COL1A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    574201 (COL1A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    574201 (COL1A1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    574201 (COL1A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    574201 (COL1A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    574201 (COL1A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    574201 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    574201 (COL1A1)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   574201 (COL1A1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 574201
NCBI-ProteinID: XP_014974803
LinkDB
Position
16
AA seq 1498 aa
MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDRDVWKPEPCRI
CVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESPTDQETTGVEVISCPRLLS
LRRPCLLTVLPSLTWPLVSSSPIPQGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPGPPGP
PGPPGPPGLGGNFAPQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPG
EPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKG
HRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDG
ATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAG
NPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKGDTG
AKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKG
PAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPG
PPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAG
PAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERG
VQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRG
DAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGETGPSGPAGPTGARGAPGDRGEPG
PPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNVGAPGPKGARG
SAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPG
PPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQG
PSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGPKGDRGETGPAGPPG
APGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRG
IKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGTPGKDGLNGLPGPIGPPG
PRGRTGDAGPVGPPGPPGPPGPPGPPSGGFDFSFLPQPPQEKAHDGGRYYRADDANVVRD
RDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLDA
IKVFCNMETGETCVYPTQPSVAQKNWYISKNPKDKRHVWFGESMTDGFQFEYGGEGSDPA
DVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKALLLQGSNEIEIRAEGNSRF
TYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCFL
NT seq 4497 nt   +upstreamnt  +downstreamnt
atgttcagctttgtggacctccggctcctgctcctcttagcggccaccgccctcctgacg
cacggccaagaggaaggccaagtcgagggccaagacgaagacatcccaccaatcacctgc
gtacagaacggcctcaggtaccatgaccgagacgtgtggaaacccgagccctgccggatc
tgcgtctgcgacaacggcaaggtgttgtgcgatgacgtgatctgtgacgagaccaagaac
tgccccggcgccgaagtccccgagggcgagtgctgtcccgtctgccccgacggctcagaa
tcacccaccgaccaagaaaccaccggcgtcgaggtaatctcctgccctcgacttttgtcc
ctccgccgcccgtgcctcctcacggtcctcccttctctaacctggcctcttgtttcttct
tccccaatcccacagggacccaagggagacactggcccccgaggcccaaggggacccgca
ggcccccctggccgagatggcatccctggacagcctggacttcccggaccccccggaccc
cccggacctcccggaccccctggcctcggaggaaacttcgctccccagttgtcttatggc
tatgatgagaaatcaaccggaggaatttctgtgcctggtcccatgggtccctctggtccc
cgtggtctccctggcccccctggtgcacctggtcctcaaggcttccaaggtccccctggt
gagcctggcgagcctggagcttcaggtcccatgggtccccgaggtccccctggcccccct
ggaaagaacggagatgatggggaagctggaaaacctggtcgtcctggtgagcgtgggcct
cctgggcctcagggtgctcgaggattgcccggaacagctggcctccctggaatgaaggga
cacagaggtttcagtggtttggatggtgccaagggagatgctggtcctgctggtcccaag
ggtgagcctggcagccctggtgaaaatggagctcctggtcagatgggcccccgtggtctg
cctggtgagagaggtcgccctggagcccctggccctgctggtgctcgcggaaatgatggt
gctactggtgctgccggaccccctggtcctactggccccgctggtcctcctggcttccct
ggtgctgttggtgctaagggcgaagctggtccccaaggtccccgaggctctgaaggtccc
cagggtgtgcgtggtgagcctggcccccctggccctgctggtgctgctggccctgctgga
aaccctggtgctgatggacagcctggcgctaaaggtgccaatggtgctcctggtattgct
ggtgctcctggcttccctggtgcccgaggcccctctggaccccagggccccggcggccct
cccggtcccaagggtaacagcggtgaacctggtgctcctggcagcaaaggagacactggt
gctaagggagaacccggccccgttggtgttcaaggacctcctggccctgctggagaggaa
ggaaagcgaggagctcgaggtgaacccggacccactggcctgcccggaccccctggcgag
cgtggtggacctggtagccgtggtttccctggtgcagatggtgttgctggtcccaagggt
cccgctggtgaacgtggttctcctggccctgctggccccaaaggatctcctggtgaagct
ggtcgtcctggtgaagctggtctgcctggtgccaagggtctgactggaagccctggcagc
cctggtcctgatggcaaaactggcccccctggtcccgccggtcaagatggtcgccccgga
cccccaggcccccctggtgcccgtggtcaggctggtgtgatgggattccctggacctaaa
ggtgctgctggagagcccggcaaggctggagagcgaggtgttcccggaccccctggcgct
gtcggtcctgctggcaaagatggagaggctggagctcagggaccccctggccctgctggt
cccgctggcgagagaggtgaacaaggccctgctggctcccccggcttccagggtctccct
ggccccgctggtcctcccggtgaagcaggcaaacctggtgaacagggtgttcctggagac
cttggtgcccccggcccctctggagcaagaggcgagagaggattccctggcgagcgtggt
gtgcaaggtccccctggtcctgctggtccccgaggggccaatggtgctcccggcaatgat
ggtgctaagggtgatgctggtgcccctggagctcccggtagccagggcgcccctggcctt
cagggaatgcctggtgaacgtggtgcagctggtcttccagggcctaagggtgacagaggt
gatgctggtcccaaaggtgctgatggctctcctggcaaagatggcgtccgtggtctgact
ggccccattggtcctcctggtcctgctggtgcccctggtgacaagggtgaaactggtccc
agcggccctgctggtcccactggagctcgtggtgcccccggagaccgtggtgagcctggt
ccccccggccctgctggcttcgctggcccccctggtgctgatggccaacccggtgctaaa
ggcgaacctggtgatgctggtgctaaaggcgatgctggtccccctggccctgccggaccc
gctggaccccctggccccattggtaatgttggtgctcctggacccaaaggtgctcgtggc
agcgctggtccccctggtgctactggtttccctggtgctgctggccgagtcggtcctcct
ggcccctctggaaatgctggaccccctggccctcctggtcctgctggcaaagaaggtggc
aaaggtccccgtggtgagactggccctgctggacgtcctggtgaagttggtccccctggt
ccccctggccccgctggcgagaaaggatcccctggtgctgatgggcctgctggtgctccc
ggtactcccgggcctcaaggtattgctggacagcgtggtgtggtcggcctgcctggtcag
agaggagaaagaggcttccctggtcttcctggcccctctggtgaacctggcaaacaaggt
ccctctggagcaagtggtgaacgtggtccccctggtcccatgggcccccctggattggcc
ggaccccctggtgaatctggacgtgagggagctcctggtgccgaaggttcccctggacga
gatggttctcctggccccaagggtgaccgtggtgagactggccccgctggacctcctggt
gctcctggtgctcctggtgcccctggccccgttggccctgctggcaagagtggtgatcgt
ggtgagactggtcctgccggtcctgctggtcctgtcggccctgttggcgcccgtggcccc
gctggaccccaaggcccccgtggtgacaagggtgagacaggcgaacaaggcgacagaggc
ataaagggtcaccgtggcttctctggcctccagggtccccctggccctcctggctctcct
ggcgaacaaggtccctctggagcctctggtcctgctggtccccgaggtccccctggctct
gctggtactcctggcaaagatggactcaacggcctccctggccccattgggccccctggt
cctcgcggtcgcactggtgatgctggtcctgttggtccccctggccctcctggccctccc
ggtccccctggtcctcccagcggtggtttcgacttcagcttcctgccccagccacctcaa
gagaaggctcacgatggcggccgctactaccgggctgatgatgccaatgtggttcgtgac
cgtgacctcgaggtggacaccaccctcaagagcctgagccagcagatcgagaacatccgg
agccccgagggcagccgcaagaaccccgcccgcacctgccgcgacctcaagatgtgccac
tctgactggaagagcggagagtactggattgaccccaaccaaggctgcaacctggatgcc
atcaaagtcttctgcaacatggagactggtgagacctgcgtgtaccccactcagcccagt
gtggcccagaagaactggtacatcagcaagaaccccaaggacaagaggcacgtctggttc
ggcgagagcatgaccgacggattccagttcgagtatggcggcgagggctccgaccctgcc
gacgtggccatccagctgaccttcctgcgcctgatgtccaccgaggcctcccagaacatc
acctaccactgcaagaacagcgtggcctacatggaccagcagactggcaacctcaagaag
gcactgctcctccagggctccaacgagatcgagatccgcgccgagggcaacagtcgcttc
acctacagcgtcactgtcgatggctgcacgagtcacaccggagcctggggcaagacagtg
atcgaatacaaaaccaccaagacctcccgcctgcccatcatcgatgtggcccccttggac
gttggtgccccagaccaggaattcggcttcgacgttggccctgtctgcttcctgtaa

KEGG   Macaca mulatta (rhesus monkey): 574276
Entry
574276            CDS       T01028                                 

Gene name
THBS4
Definition
(RefSeq) thrombospondin-4
  KO
K04659  thrombospondin 2/3/4/5
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04145  Phagosome
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05144  Malaria
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    574276 (THBS4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    574276 (THBS4)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    574276 (THBS4)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    574276 (THBS4)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    574276 (THBS4)
  09174 Infectious disease: parasitic
   05144 Malaria
    574276 (THBS4)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:mcc04131]
    574276 (THBS4)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    574276 (THBS4)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    574276 (THBS4)
   04990 Domain-containing proteins not elsewhere classified [BR:mcc04990]
    574276 (THBS4)
Membrane trafficking [BR:mcc04131]
 Endocytosis
  Phagocytosis
   Opsonins
    574276 (THBS4)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   574276 (THBS4)
  Exosomal proteins of colorectal cancer cells
   574276 (THBS4)
  Exosomal proteins of bladder cancer cells
   574276 (THBS4)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   574276 (THBS4)
Domain-containing proteins not elsewhere classified [BR:mcc04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   574276 (THBS4)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA cEGF EGF_3 EGF
Other DBs
NCBI-GeneID: 574276
NCBI-ProteinID: XP_014995819
Ensembl: ENSMMUG00000003143
UniProt: F7EPG2
LinkDB
Position
6
AA seq 961 aa
MLAPRGAAVLLLHLVLQPWLAAGAQATPQVFDLLPSSSQRVNPGALLPVLTDQALNDLYV
ISTFKLQTKSSATIFGLYSSTDNSKYFEFTVMGRLNKAILRYLKNDGKVHLVVFNNLQLA
DGRRHRLLLRLSNLQRGAGSLELYLDCIQVDSIHNLPRAFAGLSQNPETIELRTFQRKPQ
DFLEELKLVVRGSLFQVASLQDCFLQQSEPLAATGTGDFNRQFLGQMTQLNQLLGEVKDL
LRQQVKETSFLRNTIAECQACGPLKFQSPTPSTVVPPAPPAPPTRPPRRCDSNPCFRGVQ
CTDSRDGFQCGPCPEGYTGNGITCIDVDECKYHPCYPGVRCVNLSPGFRCDACPVGFTGP
MVQGVGISFAKSNKQVCTDIDECRNGACVSNSICINTLGSYRCGPCKPGYTGDQIRGCKM
ERNCRNPELNPCSVNARCIEERQGDVTCVCGVGWAGDGYVCGKDVDIDSYPDEELPCSAR
NCKKDNCKYVPNSGQEDADRDGIGDACDEDADGDGILNEQDNCVLIHNVDQRNSDKDIFG
DACDNCLSVLNNDQKDTDGDGRGDACDDDMDGDGIKNILDNCPRFPNRDQRDKDGDGVGD
ACDSCPDISNPNQSDVDNDLVGDSCDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKDGIG
DECDDDDDNDGIPDLVPPGPDNCRLVPNPAQEDSNSDGVGDICESDFDQDQVIDRIDVCP
ENAEVTLTDFRAYQTVVLDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVD
FEGTFHVNTQTDDDYAGFIFGYQDSSSFYVVMWKQTEQTYWQATPFRAVAEPGIQLKAVK
SKTGPGEHLRNSLWHTGDTSDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVRFYE
GSELVADSGVTIDTTMRGGRLGVFCFSQENIIWSNLKYRCNDTIPEDFQEFQTQNFDRLD
N
NT seq 2886 nt   +upstreamnt  +downstreamnt
atgctggccccgcgcggagccgccgtcctcctgctgcacctggtcctgcagccgtggctg
gcggccggcgcccaggccaccccccaggtctttgaccttctcccatcttccagtcagagg
gtgaacccaggcgctctgctgccagtcctgacagatcaagccctgaatgatctctatgtg
atttccaccttcaagctgcagactaaaagttcagccaccatcttcggtctttactcttca
actgacaacagtaaatattttgaatttactgtgatgggacgcttaaacaaagccatcctc
cgttacctgaagaacgacgggaaggtccatttggtggttttcaacaacctgcagctggca
gacggaaggcggcacaggctcctcctgaggctgagcaatttgcagcgaggggctggctcc
ctagagctctacctggactgcatccaggtggattccattcacaatctccccagggccttt
gccggcctctcccagaatcctgagaccattgaattgaggactttccagaggaagccacag
gacttcttggaagagctgaagctggtggtgagaggctcactgttccaggtggccagcctg
caagactgcttcctgcagcagagcgagccgctggctgccacaggcacaggggactttaac
cggcagttcttgggtcaaatgacacaattaaaccaactcctgggagaggtgaaggacctt
ctgcgacagcaggttaaggaaacatcatttttgcgaaacaccatagctgaatgccaggct
tgcggtcctctcaagtttcagtctccaaccccaagcaccgtggtacccccagcaccccct
gcaccgccaacacgcccacctcgccggtgtgactccaacccatgtttccgaggtgtccaa
tgtaccgacagcagagatggcttccagtgtgggccctgccccgagggctacacaggaaac
gggatcacctgcattgatgtggatgagtgcaaataccatccctgctacccaggcgtgcgc
tgcgtgaatttgtctcctggcttcagatgtgacgcctgcccagtaggcttcacagggccc
atggtgcagggtgttgggatcagttttgccaagtcaaacaagcaggtctgcactgacatt
gatgagtgtcgaaatggagcatgcgtttccaactcgatctgcattaatactttgggatct
taccgctgtgggccttgcaagccagggtatactggtgatcagataagaggatgcaaaatg
gaaagaaactgcagaaacccagagctgaacccttgcagtgtgaatgcccggtgcattgaa
gagaggcagggggatgtgacatgtgtgtgtggagtcggttgggctggcgatggctatgtc
tgtggaaaggatgtggacatcgacagttaccctgatgaagaactgccatgctctgccagg
aactgtaagaaggacaactgcaagtatgtgccaaattctggccaagaagatgcagacaga
gatggcattggcgacgcttgtgacgaggatgctgatggagacgggatcctgaatgagcag
gataactgtgtcctgattcacaatgtggaccaaaggaacagcgataaagatatctttgga
gatgcctgtgataactgcctgagtgtcttaaataatgaccagaaagacacagatggggat
ggaagaggagatgcctgtgatgatgacatggatggagatggaataaaaaacattctggac
aactgcccaagatttcccaatcgtgaccaacgggacaaggatggtgatggtgtgggggac
gcctgtgacagttgtcctgatatcagcaaccctaaccagtctgatgtggataatgatctg
gttggggactcctgtgacaccaatcaggacagtgatggagatgggcaccaggacagcaca
gacaactgccccaccgtcattaacagtgcccagctggacaccgataaggatggaattggt
gacgagtgtgatgatgatgatgacaatgatggtatcccagacctggtgccccctggacca
gacaactgccggctggtccccaacccagcccaggaggatagcaacagcgacggagtggga
gacatctgtgagtctgactttgaccaggaccaggtcatcgatcggatcgatgtctgccca
gagaacgcagaggtcaccctgaccgacttcagggcttaccagaccgtggtcctggatcct
gaaggggatgcccagatcgatcccaactgggtggtcctgaaccagggcatggagattgtg
cagaccatgaacagtgatcctggcctggcagtggggtacacggcttttaatggagttgac
ttcgaagggaccttccatgtaaatacccagacagatgatgactatgcgggctttatcttt
ggctaccaagatagctccagcttctacgtggtcatgtggaagcagacggagcagacatat
tggcaagccaccccgttccgagcggttgcagaacctggcattcagctcaaggctgtgaag
tctaagacaggtccaggggagcatctccggaactccctgtggcacacgggggacaccagt
gaccaggtcaggctgctgtggaaggattccaggaatgtgggctggaaggacaaggtgtcc
taccgctggttcctacagcacaggccccaggtgggctacatcagggtacgattttatgaa
ggctccgagttggtggctgactctggagtcaccatagacaccacaatgcgtggaggccga
cttggcgttttctgcttctctcaagaaaacatcatctggtccaacctcaagtatcgctgc
aatgacaccatccctgaggacttccaagagtttcaaacccagaatttcgaccgcttggat
aattga

KEGG   Macaca mulatta (rhesus monkey): 613269
Entry
613269            CDS       T01028                                 

Gene name
FN1
Definition
(RefSeq) fibronectin isoform X1
  KO
K05717  fibronectin 1
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04810  Regulation of actin cytoskeleton
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc05100  Bacterial invasion of epithelial cells
mcc05135  Yersinia infection
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05205  Proteoglycans in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    613269 (FN1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    613269 (FN1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    613269 (FN1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    613269 (FN1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    613269 (FN1)
   05205 Proteoglycans in cancer
    613269 (FN1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    613269 (FN1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    613269 (FN1)
  09171 Infectious disease: bacterial
   05135 Yersinia infection
    613269 (FN1)
   05100 Bacterial invasion of epithelial cells
    613269 (FN1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    613269 (FN1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    613269 (FN1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:mcc04131]
    613269 (FN1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    613269 (FN1)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    613269 (FN1)
   04990 Domain-containing proteins not elsewhere classified [BR:mcc04990]
    613269 (FN1)
Membrane trafficking [BR:mcc04131]
 Endoplasmic reticulum (ER) - Golgi transport
  Forward pathways
   ER-Golgi intermediate compartment (ERGIC) proteins
    613269 (FN1)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of bladder cancer cells
   613269 (FN1)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   613269 (FN1)
Domain-containing proteins not elsewhere classified [BR:mcc04990]
 Fibronectin (FN) domain-containing proteins
  Fibronectin type I domain-containing proteins
   613269 (FN1)
SSDB
Motif
Pfam: fn3 fn1 Pur_ac_phosph_N fn2 DUF2369
Other DBs
NCBI-GeneID: 613269
NCBI-ProteinID: XP_014966387
Ensembl: ENSMMUG00000012321
LinkDB
Position
12
AA seq 2478 aa
MLRGPGPGLLLLAVLCLGTAVPSTGASKSKRQAQQMIQPQSPVAVSQSKPGCYDNGKHYQ
INQQWERTYLGNALICTCYGGSRGFNCESKPEAEETCFDKYTGNTYRVGDTYERPKDSMI
WDCTCIGAGRGRISCTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCK
PIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTCLGEGSGRITCTSRNRCNDQDTRTSY
RIGDTWSKKDNRGNLLQCICTGNGRGEWKCERHTTVQTTSSGSGPFTDVREAVYQPQPHP
QPAPYGHCVTDSGVVYSVGMQWLKTQGNKQMLCMCLGNGVSCQETAVTQTYGGNSNGEPC
VLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTRGGNSNGALC
HFPFLYNNHNYTDCTSEGRRDNMKWCGTTQNYDADQKFGFCPMAAHEEICTTNEGVMYRI
GDQWDKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVNDTFHKRHEEGHM
LNCTCFGQGRGRWKCDPVDQCQDSETGTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQ
PLQTYPSSSGPVQVFITETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIP
GHLNSYTIKGLKPGVVYEGQLISIQQYGHREVTRFDFTTTSTSTPVTSNTVTGETTPLSP
LVATSESVTEITASSFVVSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTATSVNIPDL
LPGRKYIVNVYEISEDGEQSLILSTSQTTAPDAPPDPTVDQVDDTSIVVRWSRPQAPITG
YRIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIFAVEENQESTPVFIQQETTG
TPRSDTVPSPRDLQFVEVTDVKVTIMWTPPESAVTGYRVDVIPVNLPGEHGQRLPISRNT
FAEVTGLSPGVTYYFKVFAVNHGRESKPLTAEQTTKLDAPTNLQFVNETDSTVLVRWTPP
RARITGYRLTVGLTRRGQPRQYNVGPSVSKYPLRNLQPGSEYTVSLVAIKGNQESPKATG
VFTTLQPGSSIPPYNTEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIVV
SGLTPGVEYVYTIQVLRDGQERDAPIVNKVVTPLSPPTNLHLETNPDTGVLTVSWERSTT
PDITGYRITTTPTNGQQGYSLEEVVHADQSSCTFDNLSPGLEYNVSVYTVKDDKESVPIS
DTIIPEVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSV
GYYTVTGLEPGIDYDISVITLINGGESAPTTLTQQTAVPPPTDLRFTNIGPDTMRVTWAP
PPSIDLTNFLVRYSPVKNEEDVAELSISPSDNAVVLTNLLPGTEYVVSVSSVYEQHESTP
LRGRQKTGLDSPTGIDFSDITANSFTVHWIAPRATITGYRIRHHPEHMSGRPREDRVPPS
RNSITLTNLTPGTEYVVSIVALNGREESPLLIGQQSTVSDVPRDLEVVAATPTSLLISWD
APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGVDYTITVYAVTGRGDSPA
SSKPISINYRTEIDKPSQMQVTDVQDNSISVKWLPSSSPVTGYRVTTTPKNGPGPTKTKT
AGPDQTEMTIEGLQPTVEYVVSVFAQNPNGESQPLVQTAVTNIDRPKGLAFTDVDVDSIK
IAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTAELQGLRPGSEYTVSVVALHDDM
ESQPLIGTQSTAIPAPTDLKFTQVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEIN
LAPDSSSVVVSGLMVATKYEVSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETT
ITISWRTKTETITGFQVDAVPANGQTPVQRTIKPDVRSYTITGLQPGTDYKIYLYTLNDN
ARSSPVVIDASTAIDAPSNLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSSPREVVP
RPRPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELPQLVTLPHPNLHGPE
ILDVPSTVQKTPFITHPGYDTGNGIQLPGTSGQQPTVGQQMIFEEHGFRRTTPPTTATPI
RHRPRPYPPNVGEEIQIGHIPREDVDYHLYPHGLGLNPNASTGQEALSQTTISWAPFQDT
SEYIISCHPVGTDEEPLQFRVPGTSTSATLTGLTRGATYNIIVEALKDQQRHKVREEVVT
VGNSVNEGLNQPTDDSCFDPYTVSHYAVGDEWERMSESGFKLLCHCLGFGSGHFRCDSSR
WCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDPHEATCYDDGKTYHVGEQWQ
KEYLGAICSCTCFGGQRGWRCDNCRRPGGEPSPEGTTGQSYNQYSQRYHQRTNTVSALVP
RVCSLILDSPFYRYLPSE
NT seq 7437 nt   +upstreamnt  +downstreamnt
atgctcaggggtccggggcccgggctgctgctgctggccgtcctgtgcctggggacagcg
gtgccctccacgggagcctcgaagagcaagaggcaggctcagcaaatgattcagccccag
tctccggtggctgtcagtcaaagcaagcccggttgttatgacaatggaaaacactatcag
ataaatcaacagtgggagcggacctacctgggcaatgccttgatttgtacttgttatgga
ggaagccgaggttttaactgcgagagtaaacctgaagctgaagagacttgctttgacaag
tacactgggaacacttaccgagtgggtgacacttatgagcgtcctaaagactccatgatc
tgggactgtacctgcatcggggctgggcgagggagaataagctgcaccatagcaaaccgc
tgccatgaagggggtcagtcctacaagattggtgacacctggaggagaccacacgagact
ggtggttacatgttagagtgtgtgtgtcttggtaatggaaaaggagaatggacctgcaag
cccatagctgagaaatgttttgatcatgctgctgggacttcctatgtggtcggagaaacc
tgggagaagccctaccaaggctggatgatggtagattgtacttgcctgggagaaggcagc
ggacgcatcacgtgcacctctagaaatagatgcaacgatcaggacacgaggacgtcctat
agaattggagacacctggagcaagaaggataatcgaggaaacctgctccagtgcatctgc
acaggcaacggccgaggagagtggaagtgtgagaggcacaccactgtgcagaccacatcg
agcggatctggccccttcaccgatgttcgtgaagctgtttaccaaccgcagcctcacccc
cagcctgccccctatggccactgtgtcacagacagtggtgtggtctactctgtggggatg
cagtggctgaagacacaaggaaataagcaaatgctttgcatgtgcctgggcaatggagtc
agctgccaagagacagctgtaacccagacttatggtggcaactcaaatggagagccatgt
gtcttaccattcacctacaatggcaggacgttctactcctgcaccacagaagggcgacag
gacggacatctctggtgcagcacaacttcgaattatgagcaggaccagaaatattctttc
tgcacagaccatactgttttggttcagactcgaggaggaaattccaatggtgccttgtgc
cacttccccttcctatacaacaaccacaattacactgattgcacttctgagggcagaagg
gacaacatgaagtggtgtgggaccacacagaactatgatgccgaccagaagtttggattc
tgccccatggctgcccatgaggaaatctgcacaaccaatgaaggggtcatgtaccgcatt
ggagatcagtgggataagcagcatgacatgggccacatgatgaggtgcacgtgtgttggg
aacggtcgtggggaatggacatgcattgcctactcgcagcttcgagatcagtgcattgtt
gatgacatcacttacaatgtgaatgacacattccacaagcgtcatgaagagggacacatg
ctgaactgtacatgcttcggtcagggtcggggtagatggaagtgtgatcccgtcgaccaa
tgccaggattcagagactgggacattttatcaaattggagattcatgggagaagtatgtg
catggtgtcagataccagtgctactgctatggccgtggcattggggagtggcattgccaa
cctttacagacctatccaagctcaagtggtcctgtccaagtatttatcactgagactccg
agtcagcccaactctcaccccatccagtggaatgcaccacagccgtctcacatttccaag
tacattctcaggtggaggcctaaaaattctgtaggccgttggaaggaagctaccatacca
ggccacttaaactcctacaccatcaaaggcctgaagcctggtgtggtatacgagggccag
ctcatcagcatccagcagtacggccaccgagaagtgactcgctttgacttcaccaccacc
agcaccagcacacctgtgaccagcaacaccgtgacaggagagacaactcccctttctccc
cttgtggccacttctgaatctgtcactgaaatcacagccagcagctttgtggtctcctgg
gtctcagcttccgacactgtgtcaggattccgggtggaatatgagctgagtgaggaggga
gacgagccacagtatctggatcttccaagcacagccacttccgtgaacatccctgacctg
cttcctggccgaaaatacattgtaaatgtctatgagatatctgaggatggggagcagagt
ttgatcctgtctacttcacaaacaacagcgcccgatgcccctcctgacccgactgtggac
caagttgatgacacctcaattgttgttcgctggagcagaccccaggctcccatcacaggg
tacagaatagtctattcaccgtcagtagaaggcagcagcacagaactcaaccttcctgaa
actgcaaactccgtcaccctcagtgacttgcaacctggcgttcagtataacatcactatc
tttgccgtggaagaaaatcaagaaagtacacctgttttcattcaacaagaaaccactggc
accccacgttcagatacagtgccctctcccagggacctgcagtttgtggaagtgacagac
gtgaaggtcaccatcatgtggacaccgcctgagagtgcagtgactggctaccgcgtggat
gtgatccccgtcaacctgcctggggagcacgggcagaggttgcccatcagcaggaacacc
ttcgcagaagtcaccgggctgtcccctggggtcacctattacttcaaagtctttgccgtg
aaccacgggagggagagcaagcctctgactgccgaacagacaaccaaactggatgctccc
actaacctccagtttgtcaatgaaacggattctactgtcctggtgagatggactccacct
cgggcccggataacaggataccgactgactgtgggcctgacccgaagaggccagcccagg
cagtacaatgtgggtccctctgtctcgaagtacccactgaggaatctgcagcctggatca
gagtacaccgtatcccttgtggccataaagggcaaccaagagagccccaaagccactgga
gtctttaccacactgcagcctgggagctccattccaccttacaacacggaggtgaccgag
accaccattgtgatcacatggacacctgctccaagaattggttttaagctgggtgtacga
ccaagccagggaggagaggcaccacgagaagtgacttcagactcaggaagcatcgttgtg
tctggcttgactccaggagtggaatatgtctacaccatccaagtcctgagagacggacag
gaaagagatgcgccaattgtaaacaaagtggtgacaccattgtctccaccaacaaacttg
catctggagacaaaccctgacactggagtgctcacagtctcctgggagaggagtaccacc
ccagacattactggttatagaattaccacaacccctacaaacggccagcagggatattct
ttggaagaagtggtccatgctgatcagagctcctgcacttttgataacctgagtcccggc
ctggagtacaatgtcagtgtttacactgtcaaggatgacaaggaaagtgtccctatctct
gataccatcatcccagaggtgccccaactcactgacctaagctttgttgatataaccgat
tcaagcatcggcctgaggtggaccccgctaaactcttccaccattattgggtaccgcatc
acagtagttgcggcaggagaaggtatccctatttttgaagattttgtggactcctcagta
ggatactacacagtcacagggctggagccgggcattgactatgatatcagcgttatcact
ctcattaatggcggcgagagtgcccctactacactgacacaacaaacggctgttcctcct
cccactgacctgcgattcaccaacattggtccagacaccatgcgtgtcacctgggctcca
cccccatccattgatttgaccaacttcctggtgcgttactcacctgtgaaaaatgaggaa
gatgttgcagagttgtcaatttctccttcagacaatgcagtggtcttaacaaatctcctg
cctggtacagaatatgtagtgagtgtctccagtgtctacgaacaacacgagagcacacct
cttagaggaagacagaaaacaggtcttgattccccaactggcattgacttttctgatatt
actgccaactcttttactgtgcactggattgctcctcgagccaccatcactggctacagg
atccgccatcaccccgagcacatgagcgggagacctcgagaagatcgggtgcccccctct
cggaattccatcaccctcaccaacctcactccaggcacagagtatgtggtcagcatcgtt
gctcttaatggcagagaggaaagtcccttattgattggccaacaatcaacagtttctgat
gttccgagggacctggaagttgttgctgcgacccccaccagcttgctgatcagctgggat
gctcctgctgtcactgtgagatattacaggatcacttatggagaaacaggaggaaatagc
cctgtccaggagttcactgtgcctgggagcaagtctacagctaccatcagtggccttaaa
cctggagtcgattataccatcactgtgtatgctgtcactggccgtggagacagccccgca
agcagcaagccaatttccattaattaccgaacagaaattgacaaaccatcacagatgcaa
gtgaccgatgttcaggacaacagcattagtgtcaagtggctgccttcaagttcccctgtt
actggttacagagtaaccaccactcccaaaaatggaccaggaccaacaaaaactaaaact
gcaggtccagatcaaacagaaatgactattgaaggcttgcagcccacagtggagtatgtg
gttagtgtctttgctcagaatccaaacggagagagtcagcctctggttcagactgcagta
accaacattgatcgccctaaaggactggcattcactgatgtggatgtcgattccatcaaa
attgcttgggaaagcccacaggggcaagtttccaggtacagggtgacctactcgagccct
gaggatggaatccatgagctattccctgcacctgatggtgaagaagacactgcagagctg
caaggcctcagaccgggttctgagtacacagtcagtgtggttgccttgcacgatgatatg
gagagccagcccctgattggaacccagtccacagctattcctgcaccaactgacctgaag
ttcactcaggtcacaccaacgagcctgagcgcccagtggacaccacccaatgttcagctc
actggatatcgagtgcgggtgacccccaaggagaaaaccggaccaatgaaagaaatcaac
cttgctcctgacagctcatccgtggttgtatcaggacttatggtggccaccaaatatgaa
gtgagtgtctatgctcttaaggacactctgacaagcagaccagctcagggagttgtcacc
actctagagaatgtcagcccaccaagaagggctcgtgtgacagatgctactgagaccacc
atcaccattagctggagaaccaagaccgagacgatcactggcttccaagttgatgctgtt
ccagccaatggccagactccagtccagagaaccatcaagccagatgtcagaagctacacc
atcacaggtttacaaccaggcactgactacaagatctacctgtacaccctgaacgacaat
gcccggagctcccctgtggtcatcgacgcctccactgccattgatgcaccatccaacctg
cgtttcctggccaccacacccaattccttgctggtatcgtggcagccgccacgtgccagg
atcaccggctacatcatcaagtatgagaagcctgggtcttctcccagagaagtggtccct
cggccccgccctggtgtcacagaggctactattactggcctggaaccgggaaccgaatat
accatttatgtcattgccctgaagaataatcagaagagcgagcccctgattggaaggaaa
aagacagacgagcttccccaactggtaacccttccacaccccaatcttcatggaccagag
atcttggatgttccttccacagttcaaaagacccctttcatcacccaccctgggtatgac
actggaaatggtattcagcttcctggcacttctggtcagcaacccactgttgggcaacaa
atgatctttgaggaacatggttttaggcggaccacacctcccacaacagccacccccata
aggcataggccaagaccatacccgccgaatgtaggtgaggagattcaaattggtcacatc
cccagggaagacgtagactatcacctctacccacatgggctgggactcaatccaaatgcc
tctacaggacaagaagctctctctcagacaaccatctcatgggccccattccaggacact
tctgagtacatcatttcatgtcatcctgttggcactgacgaagagcccttacagttcagg
gttcctggaacttctaccagtgccactctgacaggcctcaccagaggtgccacctacaac
atcatagtggaggcactgaaagaccagcagaggcataaggttcgggaagaggttgttacc
gtgggcaactctgtcaatgaaggcttgaaccaacctacggatgactcatgctttgacccc
tacacggtttcccattacgccgttggagatgagtgggaacgaatgtctgaatcaggcttt
aaactgttgtgccactgcttaggctttggaagtggtcatttcagatgtgattcatctaga
tggtgccatgacaatggtgtgaactacaagattggagagaagtgggaccgtcagggagaa
aatggccagatgatgagctgcacgtgtcttgggaatggaaaaggagaattcaagtgtgac
cctcatgaggcaacgtgttatgatgatgggaagacgtaccacgtaggagaacagtggcag
aaggaatatctcggtgccatttgctcctgcacatgctttggaggccagcggggctggcgc
tgtgacaactgccgcagacctgggggtgaacccagtcccgaaggcactactggtcagtcc
tacaaccagtattctcagagataccatcaaagaacaaacactgtaagtgcattggtgcca
cgagtgtgttccctcatactagacagtcctttctacaggtatcttccttcagaatga

KEGG   Macaca mulatta (rhesus monkey): 693357
Entry
693357            CDS       T01028                                 

Definition
(RefSeq) collagen alpha-4(VI) chain-like isoform X1
  KO
K06238  collagen type VI alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    693357
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    693357
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    693357
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    693357
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    693357
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    693357
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    693357
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   693357
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   693357
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   693357
SSDB
Motif
Pfam: VWA VWA_2 Collagen Kunitz_BPTI SRPRB DUF1194 VWA_CoxE
Other DBs
NCBI-GeneID: 693357
NCBI-ProteinID: XP_028699516
Ensembl: ENSMMUG00000002827
LinkDB
Position
2
AA seq 2348 aa
METWKIFWGIILFETGFGFIKSQSIVCREASLGDVVFLVDANINPQHARSVRNFLYILVN
SFNVSSKTIRVGLAQYSDVPHSEFLLSTYHRKADVLRHIRQFKFKPGGKKMGLALKFILD
HHFQEASGSRASQGVPQIAMVISSGPVEDHVHGPAEALRRAGILLYATGVRDAVWAELRE
IASSPQENFTSFVPYFSGLSNLAQKLRQELCDMLAKAAPRVDHVSPACREAALADIVFLV
DSSTSIGPQNFQKVKNFLYSVVLGLDISSDHVRVGLAQYNDNIYPAFQLNQHPLKSTVLE
QIQNLPYRTGGTNTGSALEFIRTNYLTEESGSRAKDRVPQIVILVTDGESNDEVQEVADR
LKEDGVVVYVVGVNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAVEGK
IKEFIKAYADVVFLADTSQDTSWASFQRMQNFLSRVVGMLEVGRDKYQIGLAQYGDQGHT
EFLLNTYKNQKEMIAHIHERFVPLGGSRRTGKALQYLLQTFFQEEAGSRYLQGIPQYAVV
INSGKSKDEVQDAAQRLREKGVKVMSVGVQDFDRRELEGMGSPDLVYDMQREDEVRHIVE
DVNVVIQGTGQQQHRITANEEAVGACTTAIRADLVFLIEEFSRVRQPNFQQVVNFLKTIV
SSLSIHPDTVRFGLVFYSEEPRLEFSLDTFQNPAKILEHLDKLTYRERRGRTKTGAALDF
LRNEVFIQEKGSRSNHGVQQIAVVIMEGFSQDSVSRPASHLRRAGITIYAVGTQNVSESK
ELEKIASYPHWKYSVPLESFLQLSVVRSKLINQLCSEMVDSKVSFRGMSYPLQEGCVHVE
KADIYFLIDGSGSINPQDFLEMKAFIKEVIKMFQIGPNRVQFGVIQYSDKIQSQFILSQY
PSVAELKVAIDNIQQGGGGTATGEALNNMTQVFADTGRINVARYLIVITDGKSSDPVAEA
AEGLRENGVIIYAIGVREANIDELKEIAKDKIFFVYEFDLLKDIQKEVVQDICSSEACKT
SKADIIFLIDGSESISPEDFEKMKRFVASMVNQSNIGTDGIQIGLLQFSSIPQEEFRLNQ
YSSKVDIYSAIFDVQQMRDGTRTGKALNFTLPFFDSSKGGRPSVQQYLIVITDGVAQDNV
IIPAKALRDKNIIIFAIGVGEAKKSQLLEITNDEDKVYHDVNFEALQNLEKEILSKVCDP
QGCNLDLSVGIDISTHSKQVQQKLQGLLPELMQQLTLLSNISCGIPDQINPRFRYLVPGS
NGKLIFDSGFEKYSYEIIQKFLVHQAAKNNNMDVNFLQSLGDNAIHLSSAKVKVLLVFTD
GLDDDLKRLKKTSEFLHSRGFSGLLIIGLEGVHKLEELQELEFGRGFAYKQPLSITLQSL
PSVLLKQLDTIVERTCCNIYAKCFGEDGYRGDYGSSGRKGEKGFDGLPGHPGEEGRYGER
GPQGLPGLRGEEGCPGMRGPKGARGFSGEKGNPGEEGVDGLDGEQGNRGIPGSSGEKGNR
GNRGLTGPPGQPGERGEPGLRGDPGDPGTNSYVQGPKGEKGRRGHQGSSNFDGPQGEPGN
VGPWGTIGRRGLPGLKGVPGESGEQGYQGELGYPGSQGSRGRQGPPGTSGQKGLVGAQGN
PGPPGPKGSKGKAGPRGVKGELGDVGERGPQGRQGPRGQPGLPGSDGYGHPGRKGTKGER
GFPGYPGVQGEDGDLGHQGEKGAKGVRGKRGNAGFPGLVGTLGYQGPPGQMGIKGPKGLV
EMTPCKIVDFTRENCPCSRGISKCPVFPTEVAFALDMSNDVSPLDFERMRNILLSLLMKI
EISDSNCPTGARVAIVSYSAKTNYLVRFSDYKEKAALLEAVRNIPLEHSSGRRNLGATMR
FLARHVFKRVRSGLLMRKVAVFFQVGWAYDAASINTATLELAAADIIPAVITFTKGHNLP
DTLLMDGTNRFYLFIWETEHQQDVEHVARCILCYDKCRPDPECPAGAPGPQEMDVDLVFL
VDSSYGVDADVYRGSLSLVDATLEDLEVAEQPSASHGGARVALVMHTTPNFWPGVGRRPV
LEGFRLTTFSNRKQMQRLVHEAAAHPLQGTPALGHALEWTLENVLLAAPLPRRAQVLFAI
VASETSSWDREKLWTLSLEAKCKGITLFVLALGPGVGTHELAELAELVSAPSEQHLLRLQ
GVSEPEVNYARGFTRAFLTLLKSGTNQYPPPELTEECGGLHRGDTLLQLVTPVNRLPRHQ
FGMSGLADDLEALEATGIFLEEKRKDMTTSITQQEVLENYENNKYDIEENEQETPAKQKE
TGKEINAGTTYGPCSMDPIEGECQDHTLKWYYNKEERVCQQFWYGSCGGNANRFETKEEC
EAQCVPIQ
NT seq 7047 nt   +upstreamnt  +downstreamnt
atggagacttggaagatattttgggggatcatcctctttgaaactggttttggcttcatc
aagtcacagagtattgtctgcagggaggcctctctgggagatgttgtgtttctggtggac
gccaacatcaacccccaacatgcacgcagtgtgcggaacttcttgtacattctggtaaac
agtttcaatgtcagcagcaagaccatccgtgtgggtctggcccagtatagtgatgtgccc
cattcagagttcttgctttccacctaccaccgcaaagctgatgtgttgagacacattcgg
cagtttaaatttaagcctgggggcaagaagatgggcctggccctgaagtttattctagac
caccacttccaggaagcatcagggagccgggcaagccaaggggtacctcagattgccatg
gtgatcagcagtgggccagttgaggaccatgtccacggacctgcggaggcattgaggcgg
gcaggcatcctgctttatgctactggcgtcagagatgcagtttgggcagagctcagggaa
attgccagtagtcctcaggagaattttacctcctttgttccttacttctctggtctgagc
aatctcgcccagaagctgcggcaggagctctgtgatatgttggcaaaggcagctccacgt
gttgaccacgtctctccagcttgcagagaagcagccctggcagacattgtgtttctagta
gacagttcaactagcatcggaccccaaaacttccagaaagtcaagaacttcctttactcc
gtcgtcttggggcttgacatcagcagtgaccatgtccgagtgggacttgcccagtataat
gacaacatctacccagccttccagctgaaccagcaccctctgaagagcacggtcttggag
cagatccagaatctgccataccgcacaggaggcacaaacacagggagtgccctggagttt
atcaggaccaactatttgactgaggagtctggcagtcgggccaaggacagagttcctcag
atagttatcctggtgacagatggggagtcaaacgatgaggtacaggaggtggctgaccgg
ttgaaagaagatggagttgttgtgtatgtggtaggggtcaatgtccaggatgtccaagag
ttgcaaaaaatagccagtgaaccatttgagaagtttctcttcaacactgaaaacttcaat
atcctgcaggatttttcaggaagtattcttcagactctgtgctcagcagtggagggtaag
ataaaagaattcatcaaagcctatgcagatgtggtctttcttgctgacacatcacaggac
acatcatgggccagtttccagaggatgcagaatttcctctccagagtggttggcatgctg
gaggttggcagggacaagtaccaaattgggctggctcagtatggtgatcaaggtcacact
gagtttttgctcaatacctacaagaaccagaaagagatgatagctcacattcatgagcgt
tttgtgcccctgggtggctccaggaggacaggcaaggcactgcaataccttcttcagaca
ttcttccaggaggaagcaggaagccggtatctccagggcattccccagtatgcagtggtc
attaactcaggcaaatctaaggatgaagtccaggatgctgcacagagactaagagagaaa
ggcgtgaaagttatgtctgtgggtgtgcaggactttgacaggagagaactggaaggaatg
gggtctccagaccttgtctatgatatgcagagagaagatgaagtcagacacatagtagaa
gatgtgaacgtggtgatccaaggcactggacagcagcagcacaggattacagccaacgag
gaggctgtaggagcatgcacaactgctatccgggctgacttagtattcctcattgaggaa
tttagcagggttaggcaacccaatttccaacaagttgtcaatttcctaaagaccattgtc
agttctttaagcattcatcctgatactgtgagatttggcttggtcttctacagtgaggaa
ccacgactcgaattttcactggatacatttcagaatccagccaaaatcttggagcatttg
gacaaattaacctaccgggaaagaagaggaaggacgaagactggtgctgcattagatttc
ctgagaaacgaggttttcattcaggagaagggcagccggtccaaccacggtgtgcagcag
atagctgtggtcatcatggaaggcttctcccaagacagtgtatctagacccgcttctcac
ctccgaagggcaggcatcaccatctatgcagtgggcacccagaatgtctcagagagtaag
gaactggagaagatagcatcatatcctcattggaagtattcagtccccctggaatccttt
ttgcagctctctgtagtacgaagcaagcttataaaccagctctgctctgagatggtggac
agtaaagtttcctttagagggatgagctatcccctacaagaaggttgtgtgcacgtcgaa
aaggcagatatttacttccttattgatgggtctggcagcatcaatccacaagattttctt
gagatgaaggcgttcattaaagaagtgataaaaatgttccagatcggacccaacagagta
cagtttggagtcattcagtactcagacaaaattcaaagtcaatttatcctcagccagtat
cccagtgtggcagagctgaaggtagccattgataacatccagcaggggggaggtggcacc
gcaaccggtgaggccttgaacaacatgactcaggtttttgcagacactggccgaatcaat
gttgctcgatatcttatagtcatcactgatggtaaatcttcagacccggtggctgaggct
gcagagggattgagagaaaatggagttataatttatgccattggagtaagagaagctaac
attgatgagcttaaggaaatagctaaagacaagatattttttgtgtatgagtttgattta
ctgaaggacatccaaaaagaagtggtacaggacatctgctcttcagaggcctgtaagact
agtaaagctgacattatcttcctgatagatggttcagaatccatctccccagaagacttt
gaaaagatgaaaagatttgtggcgagcatggtgaaccaatctaatattggtactgatgga
attcaaattggccttttgcagttcagctcaatcccccaggaagaattcaggctcaaccaa
tactcctcaaaggtggacatttacagtgccatctttgatgttcagcagatgagggatggc
acccgcactgggaaagctctgaatttcactctgcctttttttgacagttcgaaaggaggg
agacccagtgttcaacaatatttgattgtgatcaccgatggggtcgcccaagataatgta
atcataccagccaaagccctcagggacaaaaacataattatttttgccattggggtggga
gaagccaaaaaatcacagcttttggagatcactaatgatgaggacaaagtgtaccatgat
gtaaattttgaggccctgcagaacctggagaaggaaattctttctaaggtctgtgatcca
caaggatgcaacctggatttgtctgtaggaattgatatctcaactcattcaaagcaagtt
cagcagaagcttcaagggttactgccagagctgatgcaacagttgactttgctttctaac
atcagctgtggcattcctgatcagatcaacccaagattccgctacttggttcctggctca
aatggcaaacttatctttgactcaggctttgaaaaatacagttatgagatcatccagaaa
ttcttggttcatcaggctgccaagaataacaatatggatgtgaactttttgcagtccttg
ggagataatgctattcatctctcttctgctaaagtaaaggttcttttagtgtttacagat
ggactggatgatgatttaaagagactgaaaaaaacatcggagtttctccacagcagagga
ttctctggactcctaatcattggcctggaaggtgtccataaattagaagagctccaggag
ctggaatttggcagagggtttgcatataagcaacctctgagcatcacactacaatccctc
ccaagcgtcttactgaagcaacttgatacaattgtggaaagaacatgctgcaatatatat
gcaaaatgttttggagaggatgggtacagaggtgattatgggagttctgggaggaaggga
gagaagggttttgatgggttacccggtcatcctggtgaagaaggcagatatggagaacga
ggcccccagggtcttcctggactccgaggtgaggaaggatgccctggtatgaggggacct
aagggagcaagaggattttcaggagagaagggcaaccctggtgaggaaggagttgatggc
ttggatggagaacagggtaatcgtggaatcccagggtcatctggagaaaaaggaaatagg
ggaaatcggggcttgacgggaccgccaggacaacctggagagcgtggagagcctggatta
aggggagatcctggggatcctggaactaatagctatgtccaaggccctaagggagaaaaa
ggaaggcgtgggcatcagggaagttctaattttgatggacctcagggagaacctggaaat
gtcggcccttgggggacaataggaaggcgaggtctgccaggattaaagggtgtaccagga
gaatccggtgaacagggttaccaaggagagcttgggtatccaggctcacagggatcaaga
ggaaggcaaggaccaccaggaacttctggacaaaaaggcttagtgggcgctcaggggaat
cctgggcctccagggccaaagggttcaaaaggaaaagctggaccaagaggagtgaaggga
gagcttggtgatgtaggagaaagaggcccacagggtcgacaaggaccaagagggcaacct
ggtcttcctggttctgatggatatggacatccaggaagaaaaggaacaaagggtgaacgt
ggattccctggctatcctggtgtccaaggagaagatggtgacctgggccatcaaggagaa
aagggggcaaagggagtaagagggaagaggggtaatgctggctttcctggattagttgga
actttaggttaccaaggcccaccaggacaaatgggtatcaagggccccaaaggtttggta
gaaatgacgccttgcaaaattgttgatttcacgcgtgaaaactgcccttgttcaagaggt
atttccaaatgcccagtgttcccaaccgaagtggcctttgccttggacatgtcaaatgat
gtctccccgttggattttgagaggatgagaaacattttattgtctctgttgatgaagatt
gaaataagtgacagtaactgcccaacaggtgcccgtgtggccattgtttcctacagtgcc
aaaaccaattacttggttcgtttctcagactacaaggagaaggctgcactcctggaggct
gtcaggaacatccccctggagcactcatctggccgcaggaaccttggggccaccatgagg
tttctggcgagacacgtgttcaaacgtgtacgctcaggtcttctcatgaggaaggtggct
gtgttcttccaggtgggctgggcctatgatgcagcttccatcaatacagccacactggag
cttgctgcagcggacatcatccccgcggtcatcaccttcacaaaggggcacaacctccca
gacactctgctgatggatggaaccaacagattttacttgttcatctgggagacagagcac
cagcaggatgtggagcatgtggctcgctgcattctctgctatgacaaatgcagaccagat
ccagaatgcccggctggcgcgccggggccccaggagatggacgtggatttggtgttcctg
gtggacagctcctatggagtggatgccgacgtgtaccgcgggtctttgagtctagtggat
gccacgctagaagacctggaggtggctgagcagcccagcgcgtcccacggtggggcgcgc
gtggccctggtgatgcacacgacacccaacttctggccgggtgtgggtcgccgccctgtg
ctcgagggcttccgcctgactacctttagcaaccggaagcagatgcagagacttgtgcac
gaagctgcagcccaccccttgcagggaacccccgccctgggccacgccctggagtggacg
ctggagaatgtgctcctggcagcccctctgccgcggagggcacaagtcctcttcgccatc
gtggccagcgagacaagtagctgggacagggagaagctatggactctgtccctggaggcc
aaatgcaagggcatcactctctttgtgctggccttgggtccgggtgtggggacgcatgag
ttagccgagttagccgagctggtcagtgccccctctgagcagcatctactgcgcctacaa
ggggtctcggagccagaggttaactacgctcggggattcactcgggccttcctgaccctc
ctaaaaagtgggacaaaccagtacccacccccagaactcaccgaagaatgtgggggccta
caccgcggggacactctgctgcaattagtcacgcctgtcaacaggttgcccaggcaccag
tttggtatgtctggcttggctgatgatttggaagcacttgaagcaacaggcatttttcta
gaagagaagagaaaagacatgacaacatctataactcagcaagaagtgcttgaaaattat
gaaaataacaaatatgacattgaagaaaacgaacaagaaacaccagcaaaacaaaaagaa
actggaaaagaaataaatgcaggcactacctatggtccttgttccatggatccaatagaa
ggcgagtgccaggatcacaccctgaagtggtattacaacaaggaggaacgggtttgccag
cagttctggtatggcagctgtgggggtaatgccaaccggtttgaaaccaaggaagaatgt
gaggctcagtgcgtcccaatacagtaa

KEGG   Macaca mulatta (rhesus monkey): 694248
Entry
694248            CDS       T01028                                 

Gene name
COL9A2
Definition
(RefSeq) collagen alpha-2(IX) chain isoform X2
  KO
K08131  collagen type IX alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    694248 (COL9A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    694248 (COL9A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    694248 (COL9A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    694248 (COL9A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    694248 (COL9A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mcc00535]
    694248 (COL9A2)
Proteoglycans [BR:mcc00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   694248 (COL9A2)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 694248
NCBI-ProteinID: XP_014991967
Ensembl: ENSMMUG00000018664
LinkDB
Position
1
AA seq 677 aa
MQKVLNTWEFTEERGPPGERGPPGPPGPPGVPGSDGIDGDKGPPGKAGPPGPKGEPGKAG
PDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGPPGPPGPVGLPGE
IGIPGPKGDPGPDGPSGPPGPPGRPGRPGTIQVLEGSADFLCPTNCPPGMKGPPGLQGVK
GHAGKRGVLGDPGRQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMAGPKGETGPHGYK
GMVGAIGAIGPPGEEGPRGPPGRAGEKGDVGSPGIRGPQGITGPKGATGPPGINGKDGTP
GMPGMKGSAGQAGRPGSPGHQGLAGVPGQPGTKGGPGDQGEPGPQGLPGFSGPPGKEGEP
GPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGSPGIPGPQGLPGIKGNKGFP
GKTGPRGGVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPSGDAGAPGVQGYP
GLPGPRGLVGNRGVPGQPGRQGVAGRDATDQHIVDVALKMLQEQLAEVAVSAKREALGAV
GMVGPPGPPGPPGYPGKQGPHGHPGPRGVPGIVGAVGQIGNTGPKGKRGEKGDPGEVGRG
HPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGLPGFCEPAACLGA
SAYASARLAEPGSIKGP
NT seq 2034 nt   +upstreamnt  +downstreamnt
atgcagaaggttctcaatacgtgggagttcacggaggagagaggtccaccgggagaacgg
ggccccccgggccccccgggaccgccgggagtgcctggatccgacggcatcgacggtgac
aaggggccccctggcaaagctggccctccgggacccaagggtgagcctggcaaagccggg
ccagatggtccagacgggaagcctgggattgatggtttaactggagccaagggggagcct
ggccccatggggatccctggagtcaagggccagcccgggcttcctggtcctcccggcctt
ccaggccctggttttgctggacctcctggaccgcctggacctgttggcctccctggtgag
attggaatcccaggccccaagggggaccctggaccagatggaccatcggggcccccagga
cctccagggagacctggtcgcccgggaaccatccaggttctggaaggcagtgcggatttc
ctgtgtccaaccaactgtccaccggggatgaaaggtccaccagggctgcagggagtgaag
ggacatgcgggcaaacgcggggttctgggtgatcctggccgccaggggaagccgggtccc
aagggagatgtgggtgcctctggagagcaaggcatccctggaccaccgggtccccagggc
atcaggggctacccgggcatggcagggcccaagggagagacgggccctcatggatataaa
ggcatggtgggcgccatcggtgccattgggcccccgggtgaggaaggtcctaggggaccg
ccaggccgagctggggagaagggcgacgtgggcagcccaggtattcgtggaccccagggg
atcacaggcccgaagggagcaacgggtcccccaggcatcaacggcaaggacgggacccca
ggcatgcctggcatgaagggcagtgcaggacaggcgggacggccaggaagcccaggccac
cagggcctagcgggtgtgccgggccagcctgggacaaaaggaggccctggagaccagggt
gaaccgggcccgcagggccttcctgggttctctggtccccctgggaaagagggagagcca
gggcctcgaggagaaattggtcctcagggcatcatgggacagaagggtgaccagggcgag
aggggtccagtggggcagccgggccctcagggaaggcagggccctaagggcgagcagggc
tcccctggaattccagggccccaaggcttgccaggcatcaaaggaaacaagggcttccca
gggaagaccgggcctcgcggcggagtgggtgacccgggggtggccggccttcctggagag
aaaggcgagaagggcgagtccggcgagccggggcccaagggacagcaaggagtacgtgga
gaacccggctaccccggccccagtggggatgcgggcgccccaggggttcagggctaccct
ggtctccccggccctcgaggactggtggggaaccgaggcgttccaggacagcccgggaga
cagggcgtggcgggccgggatgccactgaccagcacatcgtggatgtggcgctgaagatg
ctgcaagagcaactggctgaggtcgccgtgagtgccaagcgggaagccctgggtgcggtg
ggcatggtgggtcccccaggacctcctgggccccctggatacccaggcaagcagggaccc
catgggcaccctggccctcggggtgttcctggcatcgtgggagccgtgggtcagatcggg
aacacggggcccaagggaaaacgtggagagaagggtgatccaggagaagtgggacggggg
caccccgggatgcctgggcccccagggatcccaggactccctggccggcctggccaggca
atcaacggcaaggatggcgatcgagggtccccaggggctccaggagaggcaggtcgacct
gggctgccaggccccgtagggctgccaggcttctgtgaacctgcagcctgccttggagct
tcggcctatgcctctgcccgccttgcagagcccggatccatcaaggggccttga

KEGG   Macaca mulatta (rhesus monkey): 694701
Entry
694701            CDS       T01028                                 

Gene name
COL6A3
Definition
(RefSeq) collagen alpha-3(VI) chain isoform X4
  KO
K06238  collagen type VI alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    694701 (COL6A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    694701 (COL6A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    694701 (COL6A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    694701 (COL6A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    694701 (COL6A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    694701 (COL6A3)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    694701 (COL6A3)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   694701 (COL6A3)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   694701 (COL6A3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   694701 (COL6A3)
SSDB
Motif
Pfam: VWA VWA_2 Collagen Kunitz_BPTI VWA_3 fn3
Other DBs
NCBI-GeneID: 694701
NCBI-ProteinID: XP_001084624
Ensembl: ENSMMUG00000014049
LinkDB
Position
12
AA seq 2969 aa
MRKHRHLPLVAVFCLFLSGFPTTHAQQQQAAQDSADIIFLIDGSNNTGSVNFAVILDFLV
NLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLAL
DFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVALKQASVFSFGLGAQAASR
AELQHIATDDNLVFTVPEFHSFGDLQEKLLPYIVGVAQRHIVLKPPTIVTQVIEVNKRDI
VFLVDGTSALGLANFNAIRDFVAKVIQRLEIGQDLIQVAVAQYADTVRPEFYFNTHPTKR
EVITAVRKMKPLDGSALYTGSALDFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDEIS
QPAQELKRSSIMAFAIGNKGADQAELEEIAFDSSLVFIPAEFRAAPLQGMLPGLLAPLRT
LTGTPEVHANKRDIIFLLDGSANVGKTNFPYVRDFVMNLVNSLDVGNDNIRVGLVQFSDT
PVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIREHVPQL
LLLLTAGQSEDSYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSLVYLMDDFSSLP
ALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILFLFDGSANLVGQFPVVRDFLYKIIDEL
DVKPEGTRIAVAQYSDDVKVESRFDEHQSKPEILNLVKRMKIKTGKALNLGYALDYAQRY
IFVKSAGSRIEDGVLQFLVLLVAGRSSDRVDGPASNLKQSGVVPFIFQAKNADPAELEQI
VLSPAFILAAESLPKIGDLQPQIVNLLKSVHNGAPAPVSGEKDVVFLLDGSEGVRSGFPL
LKEFVQRVVESLDVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVNAVRQLTLLGGPIP
NTGAALEFVLRNILVSSAGSRITEGVPQLLIVLTAERSGDDVRNPSVVLKRGGAVPIGIG
IGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISDRVTQLTREELSRLQPVLQPLPRP
GVGGKRDVVFLIDGSQSAGPEFQHIRTLIERLVDYLDVGFDTTRVAVIQFSDDPKVEFLL
NAHSSKDEVQNAVQRLRPKGGRQINVGSALEYVSRNIFKRPLGSRIEEGVPQFLVLISSG
KSDDEVDDPAVELKQFGVAPLTIARNADQEELVKISLSPEYVFSVNTFRELPSLEQKLLT
PITTLTSEQIQQLLASTRYPPPAVESDAADIVFLIDSSEGVRPDGFAHIRDFVSRIVRRL
NIGPSKVRVGVVQFSNDVFPEFYLKTYRSQALVLDAIRRLRLRGGSPLNTGRALEFVARN
LFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGVGDRNIDRAELQTI
TNDPRLVFTVREFRELPNIEERIMTSFGTSAATPAPPGVATPSPSRPEKKKADIVFLLDG
SINFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAI
NKVVYKGGRHANTKVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSAEDAQDVSLAL
TQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSELSEQVLETLHDAMHETLCP
GVTDVAKACNLDVILGFDGSRDQNVFVAQKGLESKVDAILNRISQMHRVSCSGGRSPTVR
VSVVANTPSGPVEAFDFDEYQPEMLEKFRNMRSQHPYVLTADTLKVYQNKFRQSSPDSVK
VVIHFTDGADGDLADLHRASEDLRQEGVHALILVGLERVANLERLMHLEFGRGFMYDRPL
RLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQRGDRGPIGSIGPKGIPGEDGYRGYPGD
EGGPGERGPPGVNGTQGFQGCPGQRGIKGSRGFPGEKGEIGEIGLDGLDGEDGDKGLPGS
SGEKGNPGRRGDKGPRGEKGERGDVGIRGDPGNPGQDSQQRGPKGETGDLGPMGVPGRDG
VPGGPGETGKNGGFGRRGPPGAKGNKGGPGQPGFQGEQGTRGAQGPAGPAGPPGLIGEQG
ISGPRGSGGAAGAPGERGRTGPLGRKGEPGEPGPKGGIGNRGPRGETGDDGRDGVGSEGR
RGKKGERGFPGYPGPKGNPGEPGLNGTPGPKGIRGRRGNSGPPGIVGQKGDPGYPGPAGP
KGNRGDSIDQCALIQSIKDKCPCCYGPLECPVFPTELAFALDTSEGVNQDTFSRMRDVVL
SIVNDLTIAESNCPRGARVAVVTYNNEVTTEIRFADSKRKSVLLDKIKNLQVALTSKQQS
LETAMSFVARNTFKRVRNGFLMRKVAVFFSNRPTTASPQLREAVLKLSDAGITPLFLTSQ
EDRQLVNALQINNTAVGHALVLPAARDLTDFLENVRSCHVCLDICNIDPSCGFGSWRPSF
RDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVRQLDMSPDPKASQHFARVAV
VQHAPYESMGNASVSPVKVEFSLTDYGSKEKLVDFLSRGMTQLQGTRALGSAIEYTIENV
FESAPNPRDLKIVVLMLTGEVQEEQLEEAQRVILQAKCKGYFFVVLGIGRKVNIKEVYTF
ASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENAFYLSPDIRKQCDWFQGDQPTK
NLVKFGHKQINVPNNVTSSPTSNPVTTTKPVTTTKPVSTTTKPVTIVNQPSVKPAAAKPA
PEKPAPAKPVAAKPVATKTATVRPPVAVKPATAAKPVAAKPAAVRPPAAAAAKQVAAKPE
APRPQAAKPAATKPATTKPVVKVSREVQVSEITENSARLHWERPEPPSPYFYDLTITSAR
DQSLVLKQNLTVTDRVVGGLLAGQMYHVAVVCYLRSQVRATYHGSFSTKKSQPPPPQPAR
SASSSTINLMVSTEPLALTETDICKLPKDEGTCRDFILKWYYDPNTKSCARFWYGGCGGN
ENKFGSQTECEKVCAPVLAKPGVINVMGT
NT seq 8910 nt   +upstreamnt  +downstreamnt
atgaggaaacatcggcacttgcccttagtggccgtcttttgcctctttctctcaggcttt
cccacaacacatgcccagcagcagcaagcagcacaagactctgctgatattattttcctt
attgatggatcaaacaacaccggaagtgtcaatttcgcagtcattctcgacttccttgta
aatctccttgagaaactcccaattggaactcagcagatccgagtgggggtggtccagttt
agcgatgagcccagaaccatgttctccttggacacctactccaccaaggcccaggttctg
ggtgcagtgaaagccctcgggtttgctggtggggagttggccaatattggcctcgccctt
gatttcgtggtggagaatcacttcacccgggcagggggcagccgtgtggaggaaggggtt
ccccaggtgctggtcctcataagtgccgggccttctagtgacgagattcgctacggggtg
gtagcactgaagcaggctagcgtgttctcattcggcctcggagcccaggccgcctccagg
gcagagcttcagcacatagctaccgatgacaacttggtgtttactgtcccggaattccat
agctttggggacctccaggagaaattactgccgtacattgttggcgtggcccaaaggcac
attgtcttgaaaccgccaaccattgtcacacaagtcattgaagtcaacaagagagacata
gtcttcctggtggatggcacgtctgcactgggactggccaacttcaatgccatccgagac
ttcgttgctaaagtcatccagaggctggaaatcggacaggatcttatccaggtggcagtg
gcccagtacgcagacactgtgaggcccgaattttatttcaatacccatcccacaaaaagg
gaagtcataaccgctgtgcggaaaatgaagcccctggacggctcggccctgtacacgggc
tccgctctagactttgttcggaacaacctgttcacaagttcagccggctaccgggctgcc
gaggggattcctaagcttttggtgctgatcacaggtggtaagtccctagatgaaatcagc
cagcccgcccaggagctgaagagaagcagcataatggcctttgccatcgggaacaagggc
gccgatcaggctgagctggaagagattgctttcgactcctccctggtgttcatcccagct
gagttccgtgccgccccattgcaaggcatgctgcctggcttgctggcacctctcaggacc
ctcactggaacccctgaagttcacgcaaacaaaagggatatcatcttccttttggatgga
tcagccaacgttggaaaaaccaatttcccttatgtgcgcgactttgtaatgaacctagtt
aacagccttgatgttggaaatgacaatattcgtgttggtttagtgcaatttagtgacact
cctgtaacggagttctctttaaacacataccagaccaagtcagatatccttggtcacctg
aggcagctgcagctccagggaggttctggtctgaacacaggctcagccctaagctatgtc
catgccaaccacttcacggaagctggcggcagcaggatccgtgaacacgtgccgcagctc
ctgcttctgctcacagccgggcagtctgaggactcctatttgcaagctgccaacgccttg
acacgcgcgggcatcctgaccttttgtgtgggagctagccaggcaaataaggcagagctt
gagcagattgcttttaacccaagcctggtgtatctcatggatgatttcagctccctgcca
gctttgcctcagcagctgattcagcccctaaccacatatgttagtggaggtgtggaggaa
gtaccactcgcccagccagagagcaagcgagacattctgttcctctttgacggctcagct
aatcttgtgggccagttccctgttgtccgtgactttctctacaagattatcgatgagctc
gatgtgaagccagaagggacccgaattgcggtggctcagtacagcgatgatgtcaaggtg
gagtcccgttttgatgagcaccagagtaagcctgagatcctgaatctcgtgaagagaatg
aagatcaagacgggcaaagccctcaacctgggctacgccctggactacgcacagaggtac
atttttgtgaagtctgctggcagccggattgaggatggagtgcttcagttcctggtgctg
ctggtcgcgggaaggtcatctgaccgtgtggacgggccagcaagtaacctgaagcagagt
ggggtcgtgcctttcatcttccaagccaagaatgcggaccctgctgagttagagcagatc
gtgctgtctccagcgtttatcctggctgcggagtcgcttcccaagattggagatcttcaa
ccacagattgtgaatctcttaaaatcagtgcacaatggagcgccagcaccagtttcaggc
gaaaaggatgtggtgtttctgcttgacggctctgagggcgtcaggagcggcttccctctg
ttgaaagagttcgtccagagagtggtggagagcctggacgtgggccaggaccgggtccgc
gtggccgtggtgcagtacagcgaccggaccaggcccgagttctacctgaattcatacatg
aaccagcaggacgtcgtcaacgccgtccgccagctgaccctgctgggagggccgatcccc
aacaccggggctgccctggagtttgtcctgaggaacatcctggtcagctctgcgggaagc
aggataacagaaggcgtgccccagctgctgatcgtcctcacggccgaaaggtctggggat
gatgtgcggaacccctccgtggtcctgaagaggggcggggctgtgcccatcggcattggc
atcgggaacgctgacatcacagagatgcagaccatctccttcatcccggactttgccgtg
gccattcccacctttcgccagctgggtaccgtccaacaggtcatctccgacagggtgacg
cagctcacccgcgaggagctgagcaggctgcagccggtgttgcagcctctgccgcgccca
ggtgttggcggcaagagggacgtggtctttctcatcgatggttcccaaagtgccgggcct
gagttccagcacatccgcaccctcatagagaggctggttgactacctggatgtgggcttt
gacaccacccgggtggccgtcatccagttcagtgatgaccccaaggtggagttcctgctg
aacgcccattccagcaaggacgaagtgcagaacgcggtgcagcggttgaggcccaaggga
gggcggcagatcaacgtgggcagtgccctggagtatgtgtccaggaacattttcaagagg
cccctggggagccgcatcgaagagggcgtcccacagttcctggtcctcatctcgtccgga
aagtccgacgatgaggtggatgacccggcggtggaactcaagcagtttggcgtggcccct
ttgacgatcgccagaaacgcagaccaggaggagctggtgaagatctcactgagccccgaa
tatgtgttctcggtgaacaccttcagggagctgcccagcttggagcagaaactgctgacg
cccatcacgaccctgacctcagagcagatccagcagctcctagccagcactcgctatccg
cctccagcagttgagagtgatgctgcagacatcgtctttctgatcgacagctctgagggc
gttaggccagatggctttgcacatattcgagactttgttagcaggattgttcgaagactc
aacattggccccagtaaagtgagagttggggttgtgcagttcagcaatgatgtcttccca
gaattctacctgaagacctacagatcccaggccctggtgctggacgccatacggcgcctg
aggctcagaggggggtccccactgaacactggcagggctctcgaatttgtggcaagaaac
ctctttgttaagtctgcggggagtcgcatagaagacggggtgccccaacacctggtcctg
gtcctgggtggaaaatcccaggatgacgtgtccaggtttgcccaggtgatccgttcctcg
gggattgtgagtttaggagtaggagaccggaacatcgacagagcagagctgcagaccatc
accaatgaccccagactggtcttcacagtgcgagagttcagagagcttcccaacatagaa
gaaagaatcatgacctcatttggaacctctgcggccactcctgcacctcccggggttgcc
accccttctccttcgcggccagagaagaagaaagcagacattgtgttcctgttggatggt
tccatcaacttcagaagggacagtttccaggaagtgcttcgttttgtgtctgaaatcgtg
gacacagtttatgaagatggtgactccatccaagtggggctggtccagtacaactctgac
cccactgacgaattcttcctgaaggacttctccaccaagaggcagattatcgacgccatc
aacaaagtggtctacaaagggggaagacatgccaacaccaaggtgggccttgagcacctg
cgggtgaaccacttcgtgcctgaggcaggcagccgcctggaccagcgggtccctcagatt
gcctttgtgatcacgggaggaaagtcggcggaagatgcacaggacgtgagcctggccctc
acccagaggggggtcaaagtatttgcagttggagtgaggaatatcgactcggaggaggtt
ggaaagatagcatccaacagcgccacggcattccgtgtgggcaacgtccaggagctgtcc
gaactgagcgagcaagttttagaaactttgcatgatgcgatgcacgaaaccctgtgccct
ggtgtgactgacgttgccaaagcttgtaatctggatgtgattctggggtttgatggttct
agagaccagaatgtatttgtggcccagaaaggcttagagtccaaggtggacgccatcttg
aacagaatcagccagatgcacagggtcagctgcagcggcggccgctcgcccaccgtgcgt
gtgtcagtggtggccaacacgccctcgggcccggtggaggcctttgactttgatgagtac
cagccagagatgctcgagaagttccggaacatgcgcagccagcacccctatgtcctcacg
gcggacacgctgaaagtctatcagaacaagttcagacagtcctcgccggacagtgtgaag
gtggtcattcattttactgatggagcagatggagatctggctgatttacacagagcatct
gaggacctccgccaagaaggtgtccacgctttgatcctggtgggccttgaacgagtggcc
aacttggagcggctaatgcatctggagtttgggcgagggttcatgtatgacaggcccctg
aggcttaacttgctggacttggattatgaactagcggagcaacttgacaacattgccgag
aaagcttgctgtggggttccctgcaagtgctctgggcagaggggagaccgtgggcccatc
ggcagcattgggccaaagggtattcctggagaagacggctaccgaggctatcctggtgat
gagggtggacccggtgagcggggtccgcctggtgtgaacggcactcaaggtttccagggc
tgcccgggccagagaggaataaagggctctcggggattcccaggagagaagggcgaaata
ggagaaattggactggatggtctagacggtgaagatggagacaaaggattgcctggttct
tctggagagaaagggaatcctggaagaaggggtgataaaggacctcgaggagagaaagga
gaaagaggagacgtggggattcgaggggacccgggtaacccaggacaagacagccagcag
agaggacccaaaggagaaaccggcgacctcggccccatgggtgtcccagggagagacgga
gtacctggaggacctggagaaactgggaagaatggtggctttggccgaaggggacccccc
ggagctaagggcaacaagggtggtcctggccagccgggctttcagggagagcaggggacc
agaggtgcacagggtccagctggtcctgctggtcctccagggctgataggagaacaaggc
atttctggacctcggggaagcggaggtgccgcaggtgctcctggagaacgaggcagaacc
ggtcccctgggaagaaagggtgagcccggagagccaggaccaaaaggaggaattggaaac
cggggccctcgtggggagacgggagatgacgggagagacggagttggcagtgaaggacgc
agaggcaaaaaaggagaaagaggattccctggctacccaggaccaaagggtaaccccggt
gagcctgggctaaatggaacaccaggacccaaaggcatccgaggccgaaggggaaattcg
ggacctccagggatagttggacagaagggagaccctggctacccaggaccagctggtccc
aagggcaacagaggtgactccatcgatcaatgtgccctcatccaaagcatcaaagataaa
tgtccttgctgctatgggcccctggagtgccccgtcttcccgacagaactagcctttgct
ttagacacctctgagggggtcaaccaagacacgttcagccggatgcgagatgtggtcttg
agtattgtgaatgacctgaccattgctgagagcaactgcccacggggggcccgggtggcc
gtagtcacctacaacaacgaggtgaccacagagatccggtttgctgactccaagaggaag
tcggtcctcctggacaagattaagaaccttcaggtggctctgacatccaaacagcagagt
ctggagaccgccatgtcatttgtggccagaaacacatttaagcgtgtgcggaacggattc
ctaatgaggaaagtagccgttttcttcagcaacaggcccacaacagcatccccacagctc
agagaggctgtgctcaagctctcagatgcggggatcacccccttgttccttacgagccag
gaagaccggcagctcgtcaacgctttgcagatcaataacacagcagtggggcatgcgctc
gtcctgcctgcggcgagagacctcacagacttcctggagaatgtccgctcgtgtcatgtt
tgcctggacatctgcaacattgacccatcctgcggatttggcagttggaggccttccttc
agggacaggagagcggcagggagcgatgtggacatcgacatggctttcatcttagacagc
gctgagaccaccaccctgttccagttcaatgagatgaagaagtacatagcgtacctggtc
agacaactggacatgagcccagaccccaaggcctcccagcacttcgccagagtggccgtt
gtgcagcatgcaccctatgagtccatgggcaatgccagcgtgtcgcctgtgaaggtggaa
ttctccctgactgactatggctccaaggagaagctggtggacttcctcagcaggggaatg
acacagttgcagggaaccagggccttgggcagtgccattgaatacaccatagagaatgtc
tttgaaagtgctccaaacccacgggacctgaaaattgtggttctgatgctgacgggcgag
gtgcaggaggagcagctggaggaggcccagagagtcatcctgcaggccaaatgcaagggc
tacttcttcgtggtcctgggcattggcaggaaggtgaacatcaaggaggtatacaccttc
gccagtgagccaaatgatgtcttcttcaaattagtggacaagtccactgagcttaacgag
gaacctttgatgcgcttcgggaggctgttgccatccttcgtcagcagcgaaaatgctttt
tacttgtccccagatatcaggaaacagtgtgattggttccaaggggaccaacccacaaag
aatcttgtgaagtttggtcacaaacaaataaatgttccaaataatgttacttcaagtcct
acatccaacccagtgacgaccacgaagccggtgactacaacgaagccagtgtccaccaca
acaaagcctgtaactatcgtaaatcagccatctgtgaagccagctgctgcaaagccggcc
cctgagaaaccagcccctgcaaaacctgtggctgccaagcctgtggccacaaagacggcc
actgttagacccccggtcgcggtgaagccagcaacggcagcgaagcctgtagcagcaaag
ccagcagctgtgagaccccctgctgctgctgccgcaaaacaagtggcggccaagcctgag
gcccctaggccacaggcagccaaaccagctgccaccaagccagccaccactaagcccgtg
gttaaggtgtcccgagaagtccaggtgtctgagatcacagagaacagcgccagactccac
tgggagaggcctgagccccccagtccttatttttatgacctcaccatcacctcagcccgt
gatcagtccctggttctgaagcagaatctcacggtcacggaccgcgtcgtcggaggcctg
ctcgctgggcagatgtaccacgtggccgtggtctgctacctgaggtctcaggtcagagcc
acctaccacggaagtttcagtacaaagaaatctcagcccccacctccacagccagcaagg
tcagcttctagttcaaccatcaatctaatggtgagcacagaaccgttggctctcactgaa
acagatatatgcaagttgccaaaagacgaaggaacttgcagggatttcatattaaaatgg
tactatgatccaaacaccaaaagctgtgcaagattctggtatggaggttgtggtggaaac
gaaaacaaatttggatcacagacagaatgtgaaaaggtttgcgctcctgtgctcgccaaa
cccggagtcatcaatgtgatgggaacctaa

KEGG   Macaca mulatta (rhesus monkey): 696372
Entry
696372            CDS       T01028                                 

Gene name
RELN
Definition
(RefSeq) reelin isoform X3
  KO
K06249  reelin [EC:3.4.21.-]
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05017  Spinocerebellar ataxia
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    696372 (RELN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    696372 (RELN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    696372 (RELN)
 09160 Human Diseases
  09164 Neurodegenerative disease
   05017 Spinocerebellar ataxia
    696372 (RELN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    696372 (RELN)
Enzymes [BR:mcc01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.21  Serine endopeptidases
    3.4.21.-  
     696372 (RELN)
SSDB
Motif
Pfam: BNR EGF_2 EGF_Tenascin Reeler EB
Other DBs
NCBI-GeneID: 696372
NCBI-ProteinID: XP_014990065
Ensembl: ENSMMUG00000048420
LinkDB
Position
3
AA seq 3346 aa
MSDHQFGNQFMCSVVASHVSHLPTTNLSFIWIAPPAGTGCVNFMATATHRGQVIFKDALA
QQLCEQGAPTEATMHPHLAEIHSDSIILRDDFDSYHQLQLNPNIWVECNNCETGEQCGAI
MHGNAVTFCEPYGPRELITTGLNTTTASVLQFSIGSGSCRFSYSDPSIIVLYAKNNSADW
IQLEKIRAPSNVSTIIHILYLPEDAKGENVQFQWKQENLRVGEVYEACWALDNILIINSA
HRQVILEDSLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSTE
DIQEQWSEEFESQPTGWDVLGAVIGTECGTIESGLSMVFLKDGERKLCTPYMDTTGYGNL
RFYFVMGGICDPGNSHENDIILYAKIEGRKEHIALDTLSYSSYKVPSLVSVVINPELQTP
ATKFCLRQKNHQGHNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVSLEFS
TNHGRSWSLLHTECLPEICAGPHLPHSTVYSSENYSGWNRITIPLPNAALTRNTRIRWRQ
TGPILGNMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISES
FGSSRLSSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTL
RLGSKSVLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYLSYHEPRIISVELPDDARQ
FGIQFRWWQPYHSSQGEDVWAIDEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPCC
GHDWTLCFTGDSKLASSMRYVETQSMQIGACYMIQFSLVMGCGQKYTPHMDNQVKLEYST
NHGLTWHLIQEECLPSMPSCQEFTSASIYHASEFTQWRRVIVLLPQKTWSSATRFRWSQS
YYTAQDEWALDSIYIGQQCPNMCSGHGSCDHGVCRCDQGYQGTECHPVAALPSTIMSDFE
NQNGWESDWQEVIGGEIVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWVDFVQFYI
QIGGESASCNKPDSREEGVLLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPC
TRFRWWQPVFSGEDYDQWAVDDIIILSEKQKQIIPVINPTLPQNFYEKPAFDYPMNQMSV
WLMLANEGMVKNETFCAATPSAMIFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCANQFS
NTAPVLLQYSHDAGMSWFLVKEGCYPASAGKGCEGNSRELSEPTMYHAGDFEEWTRITIV
IPRSLASSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCISGVCFCDLGYT
AAQGTCVSNVPNHNEMFDRFEGKLSPLWYKITGAQVGTGCGTLNDGKSLYFNGPGKREAR
TVPLDTRNIRLVQFYIQIGSKTSGITCIKPRTRNEGLIVQYSNDNGILWHLLRELDFMSF
LEPQIISIDLPQEAKTPATAFRWWQPQHGKHSAQWALDDVLIGMNDSSQTGFQDKFDGSI
DLQANWYRIQGGQVDIDCLSMDTALIFTENIGKPRYAETWDFHVSASTFLQFEMSMGCSK
PFSSSHSVQLQYSLNNGKDWHLVTEECVPPTIGCLHYTESSIYTSERFQNWKRITVYLPL
STISPRTRFRWIQANYTVGADSWAIDNVVLASGCPWMCSGRGICDAGHCVCDRGFGGPYC
VPVVPLPSILKDDFNGNLHPDLWPEVYGAERGNLNGETIKSGTSLIFKGEGIRMLISRDL
DCTNTMYVQFSLRFIAKGTPERSHSILLQFSINGGITWHLMDEFYFPQTTNILFINVPLP
YTAQTNATRFRLWQPYNNGKKEEIWIVDDFIIDGNNLNNPMMLLDTFDFGPREDNWFFYP
GGNIGLYCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLNVNENTIIQFEINVGCSTDSS
SADPVRLEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHHPSSTYYAGTMQGWRREVVHF
GKLHLCGSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCNGQGSCINGTKCICDPGY
SGPTCKISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLM
TRDLDLSHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGR
YIALEIPLKARSGSTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFTTLDSRK
WLLHPGGTKMPVCGSTGDALVFIEKASTRYVVSTDIAVNEDSFLQIDFAASCSVTDSCYA
IELEYSVDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPPYTRSQAT
RFRWHQPAPFDKQQTWAIDNVYIGDGCIDMCSGHGRCIQGNCFCDEQWGGLYCDDPETSL
PTQLKDNFNRAPSNQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEF
IQFYFMYGCLITPNNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEI
ATRFRWWQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGR
IAFDMFMEDKTSVNEHWLFHDDCTVERFCESPDGVMLCGSHDGREVYAVTHDLTPTEGWI
MQFKISVGCKVSEKIAQNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGTVSQPSVFFPTK
GWKRITYPLPESLVGNPVRFRFYQKYSDMQWAIDNFYLGPGCLDNCRGHGDCLREQCICD
PGYSGPNCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTV
RQAITQDLDLRGAKFLQYWGRIGSENNMTSCHRPICRKEGVLLDYSTDGGITWTLLHEMD
YQKYISVRHDYILLPEDALTNTTRLRWWQPFVISNGIVVSGVERAQWALDNILIGGAEIN
PSQLVDTFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTHNALSSRELII
QPGYMMQFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQF
HEATIYNSVNSSSWKRITIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPKL
CSGHGYCTTGAICICDESFQGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIGSG
CGQLAPYAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTSQTDSCNSDLSGPHT
ADKAVLLQYSVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGTG
HDQWALDHVEVVLVSTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP
NT seq 10041 nt   +upstreamnt  +downstreamnt
atgtctgaccaccagtttggtaaccagtttatgtgcagtgtggtggcctctcacgtgagt
cacctgcccacaaccaacctcagcttcatctggattgctccacctgcgggcacaggctgt
gttaatttcatggctacagcaacacaccggggccaggttattttcaaagatgctttagcc
cagcagttgtgtgaacaaggagctccaacagaggccactatgcacccacatctagctgaa
atacatagtgacagcattatcctgcgagatgactttgactcctaccaccaactgcaatta
aatccaaatatatgggttgaatgtaacaactgtgagactggagaacagtgtggtgcgatt
atgcatggcaatgctgtcaccttctgtgaaccatatggtccacgagaactgattaccaca
ggccttaatacaacaacagcttctgtcctccaattttccattgggtcaggttcatgtcgc
tttagttattcagaccccagcatcatcgtgttatatgccaagaataactcagcggactgg
attcagctagagaaaattagagccccttccaatgtcagcacaatcatccatatcctctac
cttcctgaggacgccaaaggggagaatgtccagtttcagtggaagcaggaaaatcttcgt
gtaggtgaagtgtatgaagcctgctgggccttagataatatcctgatcatcaattcagct
cacagacaagtcattttagaagatagtctcgacccagtggacacaggcaactggcttttc
ttcccaggagctacagttaagcatagctgtcagtcagatgggaactccatttatttccat
ggaaatgaaggcagcgagttcaattttgccaccacccgggatgtagatctttccacagaa
gatattcaagagcaatggtcagaagaatttgagagccagcctacaggatgggatgtcttg
ggagctgtcattggtacggaatgtggaacaatagaatcaggcttatcaatggtcttcctc
aaagatggagagaggaaattatgcactccatatatggatactaccggttacgggaacctg
aggttttactttgtgatgggaggaatttgtgaccctggaaattctcatgaaaatgacata
atcctgtatgcaaaaattgaaggaagaaaagagcacatagcactggataccctttcctat
tcctcatataaggttccatctttggtttctgtggtcatcaatcctgaacttcagactcct
gctaccaaattttgtctcaggcaaaagaatcatcaaggacataataggaatgtctgggct
gtagactttttccatgtcttgcctgttctcccttctacaatgtctcacatgatacagttt
tccatcaatctgggatgtggaacacatcagcctggtaacagtgtcagcttggagttttct
accaaccatggacgttcctggtccctccttcacactgaatgcttgcctgagatctgtgct
ggaccccacctcccccatagcactgtctactcctctgaaaactacagtgggtggaaccga
ataacaattccccttcctaatgcagcactaacccggaacaccaggattcgctggagacaa
acaggaccaatccttggaaacatgtgggcaattgataatgtttatattggcccatcatgt
ctcaaattctgttccggcagaggacagtgcactagacatggctgcaagtgtgaccctgga
ttttctggcccagcttgtgagatggcatcccagacattcccaatgtttatttctgaaagc
tttggcagttctaggctctcctcttaccataacttttactctatccgtggtgctgaagtc
agctttggttgtggtgtcttggccagcggtaaggccctggttttcaacaaagatgggcgg
cgtcagctaattacatccttccttgacagctcacaatccaggtttctgcagttcacactg
agactggggagcaaatctgttctgagcacatgcagagcccctgaccagcctggtgaagga
gttctgttgcattattcttatgataatgggataacttggaaactcctggagcactattca
tatctcagctatcatgagcccagaataatctctgtagaactaccagatgatgcaaggcag
tttggaattcagttcaggtggtggcagccgtatcattcttcccaaggagaagatgtatgg
gctattgatgagattatcatgacgtctgtacttttcaatagcattagtcttgactttacc
aatcttgtggaggtcactcagtctctgggattctaccttggaaatgttcagccgtgctgt
ggccatgactggaccctttgttttacaggagattctaaacttgcctcaagtatgcgctat
gtggaaacacaatcaatgcagataggagcgtgctatatgattcagttcagtttggtgatg
ggatgtggccagaaatacaccccacacatggacaaccaggtgaagctggagtactcaacc
aaccacggccttacctggcacctcatccaagaagaatgtcttccaagtatgccaagttgt
caggaatttacatcagcaagtatttaccatgccagtgagttcacacagtggagaagagtc
atagtgcttcttccacagaaaacttggtccagtgccacccgcttccgttggagccagagc
tattacacagctcaagacgagtgggctttggacagcatttacattgggcaacagtgcccc
aacatgtgcagtgggcatggctcatgtgatcatggcgtatgcaggtgtgaccagggatat
caaggcactgaatgccacccggtagctgcccttccttccacaattatgtcagattttgag
aaccagaatggctgggagtctgactggcaagaagttattgggggagaaattgtaaaacca
gaacaagggtgtggtgtcatctcttctggatcatctctgtacttcagcaaggctgggaaa
aggcaattggtgagttgggacctggatacttcttgggtggactttgtccagttctacatc
cagataggtggagagagtgcttcatgcaacaagcccgacagcagagaagagggcgtcctc
cttcagtacagcaacaatgggggcatccagtggcacctgctagcagagatgtacttttca
gacttcagcaaacccagatttgtctatctggagcttccagctgctgccaagaccccttgc
accaggttccgctggtggcagcccgtgttctcgggagaggactatgaccagtgggcagtc
gatgacatcatcattctgtccgagaagcagaagcagatcatcccagttatcaatccaact
ttacctcagaacttttatgagaagccagcttttgattatcctatgaatcagatgagtgtg
tggttgatgttggctaatgaaggaatggttaaaaatgaaaccttctgtgctgccacacca
tcagcaatgatatttggaaaatcagatggagatcgatttgcagtaactcgagatttgacc
ctgaagcctggatatgtgctacagttcaagctaaacatagggtgtgccaatcaattcagc
aatactgctccagttcttcttcaatactctcatgatgctggtatgtcctggtttctggtg
aaagaaggctgttacccagcttctgcaggcaaaggatgtgaaggaaactccagagaacta
agtgagcccaccatgtatcacgcaggggactttgaagaatggacaagaatcaccattgtt
attccaaggtctcttgcatccagcaagaccagattccgatggatccaggagagcagctca
cagaaaaacgtgcctccatttggtttagatggagtgtacatatctgagccctgtcccagt
tactgcagtggccatggggactgcatttcaggagtgtgtttctgtgacctggggtatact
gcggcacaaggaacttgtgtgtcaaatgtccccaaccacaatgagatgttcgataggttt
gaggggaagctcagccctctgtggtacaagataacaggcgcccaggttggaactggctgt
ggaacacttaacgatggcaaatctctctacttcaatggccctgggaaaagggaagccagg
acggtccctctggacaccaggaacatcagactcgttcaattttatatacaaattggaagc
aaaacttcaggtattacctgcatcaaaccaagaactagaaatgaagggcttattgttcag
tattcaaatgacaatgggatactctggcatttgcttcgagagttggacttcatgtccttc
ctggaaccacagatcatttccattgacctgccacaggaggcgaagacacctgcaacagca
tttcgatggtggcagccacaacatgggaagcattcagcccagtgggctttggatgatgtt
cttataggaatgaatgacagctctcaaactggatttcaagacaaatttgatggctctata
gatttgcaagccaactggtatcgaatccaaggaggtcaagttgatattgactgtctctcc
atggacactgctctgatattcactgaaaacataggaaaacctcgttatgctgagacctgg
gattttcatgtgtcagcatctacctttctgcagtttgaaatgagcatgggttgtagcaag
cccttcagcagctcccacagtgtacagctccagtattctctgaacaatggcaaggactgg
catcttgtcaccgaagagtgtgttcctccaaccattggctgtctgcactacacggaaagt
tcaatttatacctcagaaagattccagaattggaagcggatcactgtctaccttccactc
tccaccatttctcccaggacccggttcagatggattcaggccaactacactgtgggggct
gattcctgggcgattgataatgttgtactggcctcagggtgcccttggatgtgctcagga
cgagggatttgtgatgctggacactgtgtgtgtgaccggggctttggtggaccctattgt
gttcctgttgttcctctgccctcgattcttaaagacgatttcaatgggaacttgcatcct
gacctttggcctgaagtgtatggtgcagagagggggaatctgaatggtgaaaccatcaaa
tctggaacatctctaatttttaaaggggaaggaataaggatgcttatttcaagagatctg
gattgtacaaatacgatgtatgtccagttttcacttagatttatagcaaaaggtacccca
gagagatctcactctattctattacaattctccatcaatggaggaatcacttggcacctg
atggatgaattttactttcctcaaacaaccaatatacttttcatcaatgttcccttgcca
tatactgcccaaaccaatgctacaagattcagactctggcaaccttataataatggtaag
aaagaagaaatctggattgttgacgacttcattatcgatggaaataatttaaacaacccc
atgatgctcctggatacattcgactttgggcccagagaagacaattggtttttctatcct
ggtggtaacattggtctttattgtccatattcttcaaagggggcaccggaagaagattca
gctatggtgtttgtttccaatgaagttggtgagcattccattaccacccgtgacctcaat
gtgaatgagaacaccatcatacaatttgagatcaatgtcggctgctcgactgatagctca
tccgcggatccagtgagactggaattttcaagggacttcggggcgacctggcacctgctg
ctgcccctctgctaccacagcagcagccacgtcagctccttatgctccaccgagcaccac
cccagcagcacctactacgcgggaaccatgcagggctggaggagggaggttgtgcacttt
gggaagctgcacctttgtggatctgtccgtttcagatggtaccagggattttaccctgct
ggctctcagccagtgacatgggccattgataatgtctacatcggtccccaatgtgaggag
atgtgtaatggacaggggagctgtatcaatggaaccaaatgtatatgtgaccctggctac
tcaggtccaacctgtaaaataagcaccaaaaatcctgattttctcaaagatgatttcgaa
ggtcagctagaatctgatagattcttattaatgagtggtgggaaaccatctcgaaagtgt
ggaatcctttctagtggcaacaacctctttttcaatgaagatggcttgcgcatgttgatg
acacgagacctggatttatcacatgcaagatttgtgcagttcttcatgagactgggatgt
ggtaaaggcgttcctgaccccaggagccaacccgtgctcctacagtattctctcaacggt
ggcctctcgtggagtcttcttcaggagttccttttcagcaattccagcaatgtgggcagg
tacattgccctggagatacctttgaaagcccgttctggttctactcgccttcgctggtgg
caaccgtctgagaatgggcacttctacagcccctgggttatcgatcagattcttattgga
ggaaatatttctggtaatacggtcttggaagatgatttcacaacccttgatagtaggaaa
tggctgcttcacccaggaggcaccaagatgcccgtgtgtggctctactggtgatgccctg
gtcttcattgaaaaggccagcacccgttacgtggtcagcacagacattgccgtgaatgag
gattccttcctacagatagacttcgctgcctcctgctcagtcacagactcttgttatgcg
attgaattggaatactcagtggatcttggattgtcatggcacccattggtaagggactgt
ctgcctaccaatgtggaatgcagtcgctatcatctgcagcggatcctggtgtcagacact
ttcaacaagtggaccagaattactctgcctctccctccttataccaggtcccaagccact
cgtttccgctggcatcaaccagctccctttgacaagcagcagacgtgggcaatagataat
gtctatatcggggatggctgcatagacatgtgtagtggccatgggagatgcatccaggga
aactgcttctgtgatgaacagtggggtggcctgtactgtgatgaccccgagacctctctt
ccaacccaactcaaagacaatttcaatcgagctccatccaaccagaactggctgactgtg
aatggagggaaattgagtacagtgtgtggagccgtggcgtcaggaatggctctccatttc
agtgggggttgtagtcgattattagtcactgtggatctaaacctcactaatgctgagttc
atccagttttacttcatgtatgggtgcctgattacaccaaacaaccgtaaccaaggtgtt
ctcttggaatattctgtcaatggaggcattacctggaacctgctcatggagattttctat
gaccagtacagtaagcccggatttgtgaatatccttctccctcctgatgctaaagagatt
gccactcgcttccgctggtggcagccaagacatgatggcctggatcagaacgactgggcc
attgacaatgtcctcatctcaggctctgctgaccaaaggaccgttatgctggacaccttc
agcagcgccccagtgccccagcacgagcgctcccctgcagacgccggccctgtcgggagg
attgcctttgacatgtttatggaagacaaaacttcagtgaatgagcactggctattccat
gatgattgcacagtagagagattctgtgagtcccctgatggtgtgatgctctgtggcagt
catgacggacgggaggtatatgcagtgacccacgacctgactcccactgaaggctggatt
atgcaattcaagatctctgttggatgtaaggtgtctgaaaaaattgcccagaatcaaatt
catgtgcagtactctactgactttggtgtgagctggaattatctggtccctcagtgcttg
cctgctgacccaaaatgctctggaactgtttctcagccatctgtattcttcccaactaaa
gggtggaaaaggatcacctacccacttcctgaaagcttagtgggaaatccagtgaggttt
aggttctatcagaagtactcagacatgcagtgggcaatcgataatttctacctgggccct
ggatgcttggacaactgcaggggccatggagattgcttaagggaacagtgcatctgtgat
ccgggatactcagggccaaactgctacttgacccacactctgaagactttcctgaaggaa
cgctttgacagtgaagaaatcaaacctgacttatggatgtccttagaaggtggaagtact
tgcactgagtgtggaattcttgctgaggacactgcactctattttgggggatctactgtg
agacaagcgattacacaagatttggatcttcgaggtgcaaagttcctgcaatactggggg
cgcattgggagtgagaacaacatgacctcttgccatcgtcccatctgccggaaggaaggc
gtgctgttggactactctaccgatggaggaattacctggactttgctccatgagatggat
taccagaaatacatttctgttagacatgactacatacttcttcctgaagatgcacttacc
aacacaactcgacttcgttggtggcagccttttgtgatcagcaatggaattgtggtctct
ggggtggagcgtgctcagtgggcactggacaatattttgattggtggagcagaaatcaat
cccagccaattggtggacacttttgatgatgaaggcacttcccatgaagaaaactggagt
ttttaccctaatgctgtaagaacagcaggattttgtggcaatccatccttccacctctat
tggccaaataaaaagaaggacaagactcacaatgctctctcctcccgagaactcattata
cagccaggatacatgatgcagtttaaaattgtggtgggttgtgaagccacttcttgcggt
gaccttcattctgtaatgttggaatacactaaggatgcaagatcagattcctggcagctt
gtacagacccagtgccttccttcctcttctaacagcattggctgctcccctttccagttc
catgaagccaccatctacaactctgtcaacagctcaagctggaaaagaatcaccatccag
ctgcctgaccatgtctcctccagtgcgacacagttccgctggatccagaagggagaagaa
actgagaagcaaagctgggcaattgaccacgtgtacattggagaggcttgccccaagctc
tgcagcgggcacggatactgcacgaccggcgccatctgcatctgcgatgagagcttccaa
ggtgatgactgctctgttttcagtcacgaccttcccagttatattaaagataattttgag
tccgcaagagtcaccgaggcaaactgggagaccattcaaggtggagtcataggaagtggc
tgtgggcagctggccccctacgcccatggagactcactgtactttaatggctgtcagatc
aggcaagcagctaccaagcctctggatctcactcgagcaagcaaaatcatgtttgtcttg
caaattgggagcacgtcgcagacggacagctgcaacagtgacctgagtggcccccacact
gcggacaaggcggtgctgctgcagtatagcgtcaacaacgggatcacctggcacgtcatc
gcccagcaccagccaaaggacttcacacaagctcagagagtgtcttacaatgtccccctg
gaggcacgaatgaaaggagtcttactgcgctggtggcaaccacgccacaatggaacaggt
catgatcagtgggctttggaccatgtggaggttgtcctagtaagcactcgcaaacaaaat
tacatgatgaatttttcacgacaacatgggctcaggcacttctacaacagaagacgaagg
tcacttaggcgatacccatga

KEGG   Macaca mulatta (rhesus monkey): 696685
Entry
696685            CDS       T01028                                 

Gene name
LAMA4
Definition
(RefSeq) laminin subunit alpha-4 isoform X1
  KO
K06241  laminin, alpha 4
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05143  African trypanosomiasis
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    696685 (LAMA4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    696685 (LAMA4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    696685 (LAMA4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    696685 (LAMA4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    696685 (LAMA4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    696685 (LAMA4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    696685 (LAMA4)
   05145 Toxoplasmosis
    696685 (LAMA4)
   05143 African trypanosomiasis
    696685 (LAMA4)
SSDB
Motif
Pfam: Laminin_G_2 Laminin_I Laminin_G_1 Laminin_II Laminin_EGF Laminin_G_3
Other DBs
NCBI-GeneID: 696685
NCBI-ProteinID: XP_014993041
Ensembl: ENSMMUG00000013782
LinkDB
Position
4
AA seq 1823 aa
MALSSAWRSVLPLWLLWSAACSRAASGDDNAFPFDIEGSSAVGRQDPPETSEPRVALGRL
PPAAEKCNAGFFRTLSGECVPCDCNGNSNECLDGSGFCVHCQRNTTGEHCEKCLDGYIGD
SIRGAPRFCQLCPCPLPHLANFAESCYRKNGAVRCICKENYAGPNCERCAPGYYGNPLLI
GSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCERCAPGYYGDARIAKNCAV
CNCGGGPCDSVTGECLEEGLEPPTGTDCSTISCDKCVWDLTDDLRLAALSIEEGKSGVLS
VSSGAAAHRHVNEINATIYLLKTKLSERENQYALRKIQINNAENTMKSLLSDVEELVEKE
NQASRKGQLIQKESMDTIKHASQLVEQAHDMRDKIQEINNKMLYYGEEHELSPKEISEKL
VLAQKMLEEIRRRQPFFTQRELVDEEADEAHELLSQAESWQRLHNETRTLFPVVLEQLDD
YNAKLSDLQEALDQALNHVRDAEDMNRATAARQRDHEKQQERVREQMEAVNMSLNTSADS
LTTPRLTLSELDDIIKNASGIYAEIDGAKNELQVKLSNLSNLSHDLVQEAIDHAQELQQE
ANELSRKLHSSDMNGLVQKALDASNVYENIVNYVSEANETAEFALNTTDRIYDAVSGIDT
QIIYHKDESENLLNQARELQAKAESSSDEAVADTSRRVGGALARKSALKTRLSDAVKQLQ
AAERGDAQQRLGQSRLITEEANRTTMEVQQATAPMANNLTNWSQNLQHFDSSAYNTAVDS
ARDAVRNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDG
QSAVEVHSRTSMDDLKAFTSLSLYMKPPVRQPELTETADQFILYLGSKNAKKEYMGLAIK
NDNLVYIYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEKF
IKKGEFSGDDSLLDLDPEDTVFYVGGVPSNFKLPNSLNLPGFVGCLELATLNNDVISLYN
FKHIYNMDPSTSVPCARDKLAFTQSRAASYFFDGSGYAAVRDITKRGKFGQVTRFDIEVR
TPADNSLILLMVNGSMFFRLEMRNGYLHVFYDFGFSSGPVHLEDTLKKAQINDAKYHEIS
IIYHNDKKMILVVDRRHVKSMDNEKMKIPFTDIYIGGAPPEILQSRTLRAHLPLDINFRG
CMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEGG
FNFRTLQPNGLLFYYASGSDVFSISLDNGTVVMDVKGIKVESVDKQYNDGLSHFVITSVS
PTRYELIVDKSRVGSKNPTKGKIEQTQAGEKKFYFGGSPISAQYANFTGCISNAYFTRVD
RDVEVEDFQRYTEKVHTSLYECPIESSPLFLLHKKGKNLSKPKASQNKKGGKSKDAPSWD
PVALKFPERNTPRNSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGAKSQFSIRLR
TRSSHGMIFYVSDQEENDFMTLFLAHGRLVYMFNVGHKKLKIRSQEKYNDGLWHDVVFIR
ERSSGRLVIDGLRVLEESLPPTEATWKIKGPIYLGGVAPGKAVKNVQINSIYSFSGCLSN
LQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPRS
SSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTSVTPKQSLCDGRWHRITVIRDS
NVVQLDVDSEVNHVVGPLNPKPIDHREPVFVGGVPESLLTPRLAPSKPFTGCIRHFVIDG
HPVSFSKAALVSGAVSINSCPAA
NT seq 5472 nt   +upstreamnt  +downstreamnt
atggctttgagctcagcctggcgctcggtcctgcctctgtggctcctctggagcgctgcc
tgctcccgcgccgcgtccggggacgacaacgcttttccttttgacattgaagggagctca
gcggttggcaggcaagacccgcctgagacgagcgagccccgcgtggctctgggacgcctt
ccgcctgctgccgagaaatgcaatgctggattctttcgcaccctgtcgggggaatgtgtg
ccctgcgactgtaatggcaattccaacgagtgtttggacggctcaggattctgtgtgcat
tgccagcggaacacaacaggagagcactgtgaaaaatgtctggatggttatatcggagat
tccatcaggggagcaccccgattctgccagctgtgcccctgtcccctgccccatttggcc
aattttgcagaatcctgctataggaaaaatggagctgttcggtgcatttgtaaagaaaat
tatgctggacctaactgtgaaagatgtgctcctggttactatggaaaccccttactcatc
ggaagcacctgtaagaaatgtgactgcagtggaaattcagatcccaacctgatctttgaa
gattgtgatgaagtcactggccagtgtaggaactgcttacgcaacaccaccggattcaag
tgtgaacgctgtgctcctggctactatggggacgccagaatagccaagaactgtgcagtg
tgcaactgcgggggaggcccatgtgacagtgtaaccggagaatgcttggaagaaggtttg
gaaccccctacaggcacggactgctcaaccataagctgtgataagtgcgtctgggacctg
actgatgacctgcggttagcagcactctccattgaggaaggcaaatccggggtgctgagc
gtatcctctggggccgccgctcataggcacgtgaatgaaatcaacgccaccatctacctc
ctcaaaacaaaattgtcagaaagagaaaaccaatacgccctaagaaagatacaaatcaac
aatgctgagaacacgatgaaaagccttctgtctgacgtagaggaattagttgaaaaggaa
aatcaagcctccagaaaagggcaacttattcagaaggaaagcatggacaccattaaacac
gcaagtcagcttgtagagcaagcccatgacatgagggataaaatccaagagatcaacaac
aagatgctctattatggggaagagcatgaacttagccccaaggaaatctctgagaagctg
gtgttggcccagaagatgcttgaggagattagacgccgtcagccatttttcacccaacgg
gagcttgtcgatgaggaggcagatgaggctcacgaactgctgagccaggctgagagctgg
cagcggctgcacaatgagacccgcactctgtttcctgtcgtcctggagcagctggacgac
tacaatgctaagttgtcagatctccaggaagcacttgaccaggcccttaaccacgtcagg
gatgccgaagacatgaacagggccacagcagccaggcagcgggaccacgagaaacaacag
gaaagagtgagggaacaaatggaagcggtgaacatgtctctgaacacatctgcggactct
ctgacaacacctcgcctcactctttcagaacttgatgatataataaagaatgcatcaggg
atttatgcagaaatagatggagctaaaaatgaactacaagtaaaactatctaacctaagt
aacctcagccatgatttagtccaagaagctattgaccatgcacaggaacttcaacaagaa
gctaatgaactgagcaggaagttacacagttcagatatgaacgggctggtacagaaggct
ttggatgcatcaaatgtctatgaaaatatcgttaattatgttagtgaagccaatgaaaca
gcagaatttgctttgaacaccactgaccgaatttatgatgcggtgagtgggattgatact
caaatcatttaccataaagatgaaagtgagaacctcctcaatcaagccagagaactgcaa
gcaaaggcagagtctagcagtgatgaggcagtggctgacactagcaggcgtgtgggtgga
gccctagcaaggaaaagtgcccttaaaaccagactcagtgatgccgttaagcaactacaa
gcagcagagagaggggatgcccagcagcgcctggggcagtccaggctgatcaccgaggaa
gccaacaggacgacgatggaggtgcagcaggccactgcccccatggccaacaatctaacc
aactggtcacagaatcttcaacattttgactcttctgcttacaacactgcagtggactct
gctagggatgcagtaagaaatctgacagaagttgtgcctcagctcctggatcagcttcgt
acggttgagcagaagcggcctgcaagcaacgtgtctgccagcatccagaggatccgagag
ctcattgctcagaccagaagtgttgccagcaagatccaagtctccatgatgtttgatggc
cagtcagctgtggaagtgcactcaagaaccagtatggatgacttaaaggctttcacatct
ctgagcctgtacatgaaaccccctgtgaggcagccagaactgaccgagactgcagatcag
tttatcctgtaccttggaagcaaaaacgccaaaaaagagtatatgggtcttgcaatcaaa
aatgataacctggtatacatctataatttgggaaccaaagatgtggagattcccctggac
tccaagcccgtcagttcctggcctgcttacttcagcattgtcaagattgaaagggtggga
aaacatggaaaggtgtttttaacagtcccgagtctaagtagcacagcagaggaaaagttc
attaaaaagggggaattttcaggagatgactccctgctggacctggaccccgaggacacg
gtgttttatgttggtggagtgccttccaacttcaagctccctaacagcttaaacctgcct
ggcttcgttggctgcctggaactggccactttgaataatgatgtgatcagcttatacaac
tttaagcacatctataatatggacccctccacatcagtgccctgtgcccgagataagctg
gccttcactcagagtcgggctgctagttacttcttcgatggctccggctatgccgcggtg
agagacatcacaaagagagggaagtttggtcaggtgactcgctttgacatagaagttcga
acaccagctgacaacagccttattctcctgatggtcaatggaagtatgtttttcagactg
gaaatgcgcaatggttacctacacgtgttctatgattttggattcagcagtggccctgtg
catcttgaagatacgttaaagaaagctcaaattaatgatgcaaaataccatgagatctca
atcatttaccacaatgataagaaaatgatcttggtagttgacagaaggcatgtcaagagc
atggataatgaaaagatgaaaataccttttacagatatatacattggaggagctccccca
gaaatcttacaatccaggaccctcagagcacaccttcccctagatatcaacttcagagga
tgcatgaagggcttccagttccaaaagaaagatttcaatttactggagcagacagaaacc
ctgggagttggttatggatgcccagaagactctcttatatctcgcagagcatatttcaat
ggacagagcttcattgcttcaattcagaaaatatctttctttgatggctttgaaggaggt
tttaatttccgaacattacaaccaaatgggttactattctattatgcttcaggatcagac
gtgttctccatctcactggataacggtactgtcgtcatggatgtaaagggaatcaaagtt
gagtcagtagataagcagtacaatgatgggctgtcccacttcgtcattacctctgtctca
cccacaagatatgaactgatagtagataaaagcagagttgggagtaagaatcctaccaaa
gggaaaatagaacagacacaagcaggtgaaaagaagttttacttcggtggctcaccaatc
agtgctcagtatgctaatttcactggctgcataagtaatgcctactttaccagggtggat
agagatgtggaggttgaagatttccaacggtatactgaaaaggtccacacttctctttat
gagtgtcccattgagtcttcaccattgtttctcctccataaaaaaggaaaaaatttatcc
aagcctaaagcaagtcagaataaaaagggagggaaaagtaaagatgcaccttcatgggat
cctgttgccctgaaattcccagagcggaatactccaagaaactctcattgccacctttcc
aacagccctagagcaatagagcacgcctatcaatatggaggaacagccaacagccgccaa
gagtttgaacacttaaaaggagattttggtgcaaaatctcagttttccattcgtctgaga
actcgttcctctcatggcatgatcttctatgtctcagatcaagaagagaatgactttatg
actctatttttggcccacggccgcttggtttacatgtttaatgttggtcacaagaaactg
aagattagaagccaggagaaatacaatgatggcctgtggcatgatgtggtatttattcga
gaaaggagcagtggccgactggtaattgatggtcttcgagtcctagaagaaagtcttcct
cctactgaagctacctggaaaatcaagggtcccatttatttgggaggtgtggctcctgga
aaggctgtgaaaaatgttcagattaactccatctacagttttagtggctgtctcagcaat
ctccagctcaatggggcctccatcacctctgcttctcagacattcagtgtgaccccttgc
tttgaaggccccatggaaacaggaacttacttttcaacagaaggaggatatgtggttcta
gatgaatctttcaatattggattgaagtttgaaattgcatttgaagtccgtcccagaagc
agttccggaaccctggtccacggccacagtgtcaatggggagtacctaaatgttcacatg
aaaaatggacaggtcatagtgaaagtcaataatggcatcagagatttttccacctcagta
acacccaagcagagtctctgtgatggcagatggcacagaattacagttattagagattct
aatgtggttcagttggatgtggactctgaagtgaaccacgtggttggacccctgaatcca
aaaccaattgatcacagggagcctgtgtttgttggaggtgttccagaatctctactgaca
ccacgcttggcccccagcaaacccttcacaggctgcatacgccactttgtgattgacgga
cacccagtgagcttcagtaaagcagccctggtcagcggcgccgtaagcatcaactcctgt
ccagcagcctga

KEGG   Macaca mulatta (rhesus monkey): 700281
Entry
700281            CDS       T01028                                 

Gene name
COL4A1
Definition
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    700281 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    700281 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    700281 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    700281 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    700281 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    700281 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    700281 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    700281 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    700281 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    700281 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    700281 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    700281 (COL4A1)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   700281 (COL4A1)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   700281 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 700281
NCBI-ProteinID: XP_014976931
Ensembl: ENSMMUG00000022280
LinkDB
Position
17
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGL
PGNPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLVGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGIPGPRGEPGP
KGFPGLPGQPGPPGPPLPGLAGAPGFPGERGEKGDRGFPGASLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGLIGEIGEKGQKGESCLICDTTGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGRPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPTGPIGDKGQAGFPGGPGSPGLPGPKGEPGKVVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGIPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGVGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGSQGLPGVTGQSGLPGLPGQQGSPGIPGFPGSKGEMGVMGTPGQPGSPGPVGVPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLQGEKGAKGAKGQAGPPGIGIPGLPGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
VPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPAGLKGLQGPPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPTGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggcgaaagaggcctcccagggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccgcagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacccccgggagcatctggc
taccctggaaacccgggacttcccggtattcctggccaagacggcccaccagggccccca
ggtattccaggatgcaatggcacaaagggagagagagggccgctcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccagggttaccaggaatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggactc
ccagggaatccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttaccggaccaccaggtcccccaggtcctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggacccaaaggtgacaagggtgaccaaggggtcagtgggcctccg
ggagtaccaggacaggctcaagttcaagaaaaaggagattttgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcagggaatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggaaaacctggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcgtaggccgccagggcccacaggga
gaaaagggtgaagcaggtcctcccggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagcggggctaccctggaattccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcccccctctacctgggctg
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttccgggt
gcatctctgccaggaccaagtggaagagacgggctcccgggtcctcctggttcccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggattgataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatacaaccggatatcgggggcctcccggg
ccacagggacccccaggagaaataggtttcccaggacagccaggggccaagggcgacaga
ggtttgcctggcagagacggtgttgcaggagtgcctgggcctcaaggtacaccagggctg
atcggccagccaggagccaagggggagcctggtgagatttatttcgacctgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccgtccgggtcttcctggccccaagggctcaccgggttctgtagga
ttgaaaggagagcgtggtccccctggaggagttggattcccaggcagtcgtggtgacacc
ggtccccctgggcctccaggatatggtcctactggtcccattggtgacaaaggacaagca
ggctttcctggaggccctgggtccccaggcctgccaggtccaaagggtgaaccaggaaaa
gttgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcctggaaccccaggaaggccaggcctgccaggagag
aagggtgctgtgggccagccaggaattggatttccagggccccctggccccaaaggtgtt
gatggcttacctggagacatggggcctccagggactccaggtcgcccgggatttaatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctaccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacctggggagaaggggagcatt
ggggtaccaggcattcctggagaacacggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccggggcctcctggattgccaggctccgtggggtctccaggagttccagga
gttggcccccctggagctaggggcccccctggaggacagggaccaccggggttgtcgggc
cctcctggaataaaaggagagaagggtttccccggattccccggactggacatgccaggc
cctaaaggagataaagggtctcaaggacttcctggcgtaacaggacagtcagggctccct
ggccttcctggacagcaggggtctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgttccggga
ttaccgggtgaaaaaggggaccatggcttcccgggctcctcaggacccaggggagaccct
ggcctgaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaagggagaccaaggagaaaaaggacaaattgga
ccaattggtgagaaaggatcccgaggagaccctgggaccccaggagtgcctggaaaggat
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacag
ggagaaaaaggtgcaaaaggagcgaaagggcaggcaggcccacctggcataggcatccca
gggctgcctggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggagag
aagggagaaaaaggaagcatcgggatcccaggaatgccagggtccccaggccttaaaggg
tctcctgggagtgttggctatccagggagccctgggctgcctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggacg
cctggccccacaggcccagctggccagaaaggggagccaggcagtgacggaatcccgggc
tcggcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagctgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccagggccccagggacag
ccggggttaccaggatccccaggccatgccacggaggggcccaaaggagaccgtggacct
cagggccagcctggcttgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggcattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
gttcctggcctccagggaattaaaggtgatcaaggcgatcaaggtgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccgcagggctgaaagggcttcagggacctccaggccccaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttc
gacggtgcccctggccagaaaggagagatgggacctaccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccagggtccatggggccgccaggcacccca
tctgttgatcacggcttccttgtgaccaggcacagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccatgggtactctttgctctacgtgcaaggcaacgaa
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaacattaacaacgtgtgcaactttgcatcacgaaatgactactca
tactggctgtccacccccgagcccatgcccatgtcaatggcacccatcacgggggacaac
ataagaccctttattagtaggtgtgctgtgtgtgaggcgcccgccatggtgatggccgtg
cacagtcagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcatcc
cctggctcctgtctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgtaattactacgcaaatgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcccacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

KEGG   Macaca mulatta (rhesus monkey): 700359
Entry
700359            CDS       T01028                                 

Gene name
COL1A2
Definition
(RefSeq) collagen alpha-2(I) chain precursor
  KO
K06236  collagen type I alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04611  Platelet activation
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    700359 (COL1A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    700359 (COL1A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    700359 (COL1A2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    700359 (COL1A2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    700359 (COL1A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    700359 (COL1A2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    700359 (COL1A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    700359 (COL1A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    700359 (COL1A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    700359 (COL1A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    700359 (COL1A2)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   700359 (COL1A2)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 700359
NCBI-ProteinID: NP_001253266
Ensembl: ENSMMUG00000039654
UniProt: H9Z2D1
LinkDB
Position
3
AA seq 1366 aa
MLSFVDTRTLLLLAVTSCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPAGSRGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEVGSAGPPGPPGLRGGPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQ
GGKGEQGPAGPPGFQGLPGPSGPAGEVGKPGERGLPGEFGLPGPAGARGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAAGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPSGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPSGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGVVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNNGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLP
GLKGHNGLQGLPGLAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGANQEFFVDIGPVCFK
NT seq 4101 nt   +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcgaactttgttgctgcttgcagtaacctcatgcctagca
acatgccaatctttacaagaggaaactgtaagaaagggcccagccggagatagaggacca
cgtggagaaaggggtccaccaggccccccaggcagagatggtgaagatggtcccacaggc
cctcctggtccacctggtcctcctggcccccctggtcttggtgggaactttgctgctcag
tatgatggaaaaggagttggacttggccctggaccaatgggtttaatgggacctagaggc
cctcctggtgcagctggagccccaggccctcaaggtttccaaggacctgctggtgagcct
ggtgaacctggtcaaactggtcctgcaggctctcgtggtccagctggccctcctggcaag
gctggtgaagatggtcaccctggaaaacctggacgacctggtgagagaggagttgttgga
ccacagggtgctcgtggtttccctggaactcctggacttcctggcttcaaaggcattagg
ggacacaatggtctggatggattgaagggacagcccggtgctcctggtgtgaagggtgaa
cctggtgcccctggtgaaaatggaactccaggtcaaacaggagcccgtgggcttcctggt
gagagaggacgtgttggtgcccctggccctgctggtgcccgtggcagtgatggaagtgtg
ggtcccgtgggtcctgctggtcccattgggtctgctggccctccaggcttcccaggtgcc
cctggtcccaagggtgaaattggagctgttggtaacgctggtcctgctggtcccgctggt
ccccgcggtgaagtgggtcttccaggcctctccggccccgttggacctcctggtaatcct
ggagcaaacggcctcactggtgccaagggtgctgcaggccttcccggtgttgctggggct
cctggcctccctggaccccgcggtattcctggccctgttggtgctgctggtgctactggt
gccagaggacttgttggtgagcctggtccagctggctccaaaggagagagcggtaacaag
ggtgagcccggctctgctgggccccaaggtcctcctggtcccagtggtgaagaaggaaag
agaggccctaatggggaagttggatctgctggccctccaggacctcctgggctgagaggt
ggtcctggttctcgtggtcttcctggagctgatggcagagctggcgtcatgggccctcct
ggtagtcgtggtgcgagtggccctgctggagttcgaggccccaatggagatgctggtcgc
cctggggagcccggtctcatgggacccagaggtcttcctggttcccctggaaatatcggc
cccgctggaaaagaaggtcctgtcggcctccctggcatcgacggcaggcctggcccaatt
ggcccagctggagcaagaggagagcctggcaacattggattccctggacccaaaggcccc
actggtgatcctggcaaaaacggtgataaaggtcatgctggtcttgctggtgctcggggt
gctccaggtcccgatggaaacaatggtgctcagggacctcctggaccacagggtgtccaa
ggtggaaaaggtgaacagggtcccgctggtcctccaggtttccagggtctgcctggcccc
tcagggccagctggtgaagttggcaaaccaggagaaaggggtctccctggtgagtttggt
ctccctggtcctgctggtgcaagaggggaacgcggtcccccaggtgagagtggtgctgcc
ggtcctactggtcctattggaagccgaggtccttctggacccccagggcctgatggaaac
aagggtgaacctggtgtggttggtgctgcaggcactgctggtccatctggtcctagtgga
ctcccaggagagaggggtgctgctggcatacctggaggcaagggagaaaagggtgaacct
ggtctcagaggtgaaattggtaaccctggcagagatggtgctcgtggtgctcctggtgct
gtaggtgcccctggtcctgctggagccacaggtgaccggggtgaagctggtgctgctggt
cctgctggtcctgctggtcctcggggaagccctggtgaacgtggtgaggtcggtcctgct
ggccccaatggatttgctggtcctgctggtgctgctggtcaacctggtgctaaaggagaa
agaggagccaaagggcctaagggtgaaaacggtgtcgttggtcccacaggccccgttgga
gctgctggcccatctggtccaaatggtccccctggtcctgctggaagtcgtggtgatgga
ggcccccctggtatgactggtttccctggtgctgctggacggactggtcccccaggaccc
tctggtatttctggccctcctggtccccctggtccttctgggaaagaagggcttcgtggt
cctcgtggtgaccaaggtccagttggccgaactggagaagtaggtgctgttggtccccct
ggctttgctggtgagaagggtccctctggagaggccggtactgctggacctcctggcact
ccaggtcctcagggtcttcttggtgctcctggtattctgggtctccctggctcaagaggt
gaacgtggtctaccaggtgttgctggtgttgtgggtgaacctggtcctcttggcattgcc
ggccctcctggggcccgtggtcctcctggtgctgtgggtagtcctggagtcaatggtgct
cctggtgaagctggtcgtgatggcaaccctgggaacgatggtcccccaggtcgcgatggt
caacccggacacaagggagagcgtggttaccctggcaacaatggtcccgttggtgctgca
ggtgcacctggtcctcatggccccgtgggtcctgctggcaaacatggaaaccgtggtgaa
actggtccttctggtcctgttggtcctgctggtgctgttggtccaagaggtcctagtggc
ccacaaggcattcgtggtgataagggagagcccggtgataaggggcccagaggtcttcct
ggcttaaagggacacaatggattgcaaggtctgcccggtctcgctggtcaccatggtgat
caaggtgctcctggctccgtgggtcctgctggtcctaggggccctgccggtccttctggc
cctgctggaaaagatggtcgcactggacatcctggtacagttggacctgctggcattcga
ggccctcagggtcaccaaggtcctgctggcccccctggtccccctggcccacctggacct
ccaggtgtaagtggtggtggttatgactttggttacgatggagacttctacagggctgac
cagcctcgctcagcaccttctctcagacccaaggattatgaagttgatgctactctgaag
tctctcaacaaccagattgagacccttcttactcctgaaggctctagaaagaacccagct
cgcacatgccgtgacttgagactcagccacccagagtggagcagcggttactactggatt
gaccctaaccaaggatgcactatggatgctatcaaagtatactgtgatttctctactggc
gaaacctgtatccgggcccaacctgaaaacatcccagccaagaactggtataggagctct
aaggacaagaaacacgtctggctaggagaaactatcaatgctggcagccagtttgaatac
aatgtagaaggagtgacttccaaggaaatggctacccaacttgccttcatgcgcttgctg
gccaactatgcctctcagaacatcacctaccactgcaagaacagcattgcatacatggat
gaggagactggcaacctgaaaaaggctgtcattctacagggctctaatgatgttgaactt
gttgctgagggcaacagcaggttcacttacactgttctcgtagatggctgctctaaaaag
acaaatgaatggggaaagacaatcattgaatacaaaacaaataagccatctcgcctgccc
ttccttgatattgcacctttggacatcggtggtgccaaccaggaattctttgtggacatt
ggcccagtctgtttcaaataa

KEGG   Macaca mulatta (rhesus monkey): 701313
Entry
701313            CDS       T01028                                 

Gene name
LAMA3
Definition
(RefSeq) laminin subunit alpha-3 isoform X1
  KO
K06240  laminin, alpha 3/5
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    701313 (LAMA3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    701313 (LAMA3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    701313 (LAMA3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    701313 (LAMA3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    701313 (LAMA3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    701313 (LAMA3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    701313 (LAMA3)
   05145 Toxoplasmosis
    701313 (LAMA3)
SSDB
Motif
Pfam: Laminin_EGF Laminin_I Laminin_G_2 Laminin_N Laminin_G_1 Laminin_II Laminin_B Laminin_G_3
Other DBs
NCBI-GeneID: 701313
NCBI-ProteinID: XP_014977167
Ensembl: ENSMMUG00000004321
LinkDB
Position
18
AA seq 3336 aa
MAAAARPRGRAPGPVLPPTPLLLLVLQLLPAWGATARDPRAAAGLSLHPPYFNLAEAARI
WATATCGERGPSERRPRPELYCKLVGGPTAPGSGHTIQGQFCDYCNSEDPRKAHPVTNAI
DGSERWWQSPPLSSGTQYNRVNLTLDLGQLFHVAYILIKFANSPRPDLWVLERSVDFGST
YSPWQYFAHSKVDCLKEFGREANMAVTRDDDVLCVTEYSRIVPLENGEVVVSLINGRPGA
KNFTFSHTLREFTKATNIRLRFLRTNTLLGHLISKAQRDPTVTRRYYYSIKDISIGGRCV
CNGHAEVCNANNPEKLFRCQCQHHTCGETCDRCCTGYNQRRWQPAAWEQSHECEACNCHG
HASDCYYDSDVERQQASLNTQGIYAGGGVCINCQHNTAGVNCEQCAKGYYRPYGVPVDAP
DGCIPCSCDPEHADGCEQGSGRCHCKPNFHGDNCEKCAVGYYNFPFCLKIPIFPVSTPSP
EDPVAGDIRGCDCNPEGVLPEICDVHGRCLCRPGVEGPRCDTCRSGFYSFPICQACWCSA
LGSYQMPCSSVTGQCECRPGVTGQRCDRCLSGAYDFPHCQGSSSACDPAGTIDSNLGSCP
CKLHVEGPTCSICKPLYWNLDKENPNGCSECQCHKAGTVSGTGECRQGDGDCHCKSHVGG
DSCDTCEDGYFALEKSNYFGCQGCQCDIGGALSSVCSGPSGVCQCREHVVGKACQRPENN
YYFPDLHHMKYEIEDGSTPNGRDLRFGFDPLAFPGFSWRGYAQMTSVQNDVRITLNVGKS
SGSLFHVILRYINPGTEAVSGHITIYPSWGKASAAQSKEIIFLPSKEPAFVTVPGNGFAD
PFSITPGIWVACIKAEGVLLDYLVLLPRDYYEASVLQLPVTEPCAYAGPPQENCLLYQHL
PVTRFPCTLACEARHFLLDGEPRPVAVRQPTPAHPVMVDLSGREVELHLRLRVPQVGRYV
VVLEYSTEAAQLFVADVNVKSPGSVRAGQVNIYSCKYSVLCRSAVIDHMSRIAVYELLAD
ADIQLKGHMARFLLHQVCIIPIEEFSAEYVRPQVHCIASYRRFVNQSATCVSLAHETPPT
ALILDVVSGRPFPHLPQQSSPSVDVLPGVTLKAPQNQVTLRGRAPHLGRYVFVIHFYQAA
HPTFPAQVSVDGGWPRAGSFRASFCPHVLGCRDQVIAEGQIEFDISEPEVAATVKVPEGK
SLVLVRVLMVPTENYDYQILHKKSMDKSLEFITNCGKNSFYIDPQTASRFCKNSARSLVA
FYHKGALPCECHPAGATSPHCSPEGGQCPCRPNVIGRQCTRCATGHYGFPHCKPCSCGLR
LCEEMTGQCRCPPRTVRPQCEVCETHSFSFHPMAGCEGCNCSRRGTSEAAMLQCDRDSGQ
CRCKPRITGRQCDRCASGFYRFPECVPCNCNRDGTEPGVCDPGTGACLCKENVEGTECHV
CREGSFHLDPANPKGCTSCFCFGVNNQCHSAHKRRAKFVDMLGWRLETADRMDIPVSFNP
GSNSMVADLQELPATVHSASWVAPASYLGDKVSSYGGYLTYQAKSFGLPSDMVLLEKMPD
VQLTGQHMSVIYEETNTPRPDRLHHGRVQVVEGNFRHASSRAPVSREELMTVLSRLADMR
IQGLYFTETQRLTLSEVGLEEASDTGSGRIAYAVEMCACPPAYAGDSCQGCSPGYYRDHK
GLYTGRCVPCNCNGHSNRCQDGSGICVNCQHNTAGEHCERCQEGYYGNAVHGSCRACPCP
HTNRFATGCVVNGGDVRCSCKAGYTGIQCERCAPGYFGNPQKFGGSCQPCSCNNNGQLGS
CDPLTGDCINQEPKDSSPAEECDDCDSCVMTLLNDLATMGEQLRLVKSQLQGLSASAGLL
EQMRHMETQAKDLRNQLLNYRSTISNHGSKIEGLERELTDLNQEFETLQEKAQVNSRKAQ
TLYNNVNRAIQSAKELDMKIKTVIRNVHILLKQISGPDGEGNNVPSGEFSREWAEAQRMM
RELRNRNFGKHLREAEADKRESQLLLNRIRTWQKTHQGENNGLANSIRDSLNEYEAKLSD
LRARLQEAAARAKQANGLNQENERALGAIQRQVKEINSLQSDFTKYLTSADSSLLQTNIA
LQLMEKSQKEYEKLAASLNEARQELSDKVRELSRSAGKTSLVEEAEKHAQSLQELAKQLE
EIKRNASGDELVRCAVDAATAYENILNAIKAAEDAANRAASASESALQTVIKEDLPRKAK
TLSSNSDKLLNEAKMTQKKLKQEVSPALSNLQQTLNIVTVQREVIATNLTTLRDGLRGIQ
RGDIDAMISSAKSMVRKANDITDEVLDGLNPIQTDVERIKDTYGSTQNEDFKKALTDADN
SVNKLTNKLPDLWRKIESINQQLLPLGNISDNMDRIRELIQQARDAASKVAVPMRFNGKS
GVEVRLPNDLEDLKGYTSLSLFLQRPNSRENGGTENMFVMYLGNKDASRDYIGMAVVDGQ
LTCVYNLGDRETELQVDQILTESETQEAVMDRVKFQRIYQFARLNYTKGATSSKPEIPGV
YDMDGRNSNTLLNLDPENVVFYVGGYPPDFKLPSRLRFPPYKGCIELDDLNENVLSLYNF
KKTFNLNTTEVEPCRRRKEESDKNYFEGTGYARVPTQPHASIPTFGQTIQTTVDRGLLFF
AENGDRFISLNIEDGKLMVRYKLNSEPPKERGVGDAINNGRDHSIQIKIGKVQKRMWINV
DFQNTIIDGEVFDFSTYYLGGIPIAIRERFNISTPAFRGCMKNLKKTSGVVRLNDTVGVT
KKCSEDWKLVRSASFSRGGQLSFTNLGSPLTDHLQASFGFQTFQPSGILLSHQTWTSTLQ
VTLEDGHIELSTRDSGSPIFKSPQTYMDGLLHYVSVISDNSGLRLLIDDQPLRNNQRLNR
ISSSQQSLRLGGSNFEGCISNVFVQRLSLSPEVLDLASNSLKRDVSLGGCSLNKPPFLML
LQGSTRFNKAKTFRINQLLQDTPVASPRSMKVWQDACSPLPRTQANHGALQFGDIPTSHL
LFKLPQELLKPRSQFAVDVQTTSSRGLVFHTGTKNSFMALYLSKGRLVFALGTDGKKLRI
KSKEKCNDGKWHTVVFGHDGEKGRLVVDGLRAREGSLPGNSTVSLRGPVYLGSPPSGKPK
SLPTNSFVGCLKNFQLDSKPLDTPSSSFGVSSCLGGPLEKGIYFSEEGGHVILAHSVLLG
PEFKLVFSIRPRSLTGILIHIGSQPGKHLGVYLEAGKVTASMDSGAGGTSTSVTPKQSLC
DGQWHSVAVTIKQHILHLELDTDSSYTAGQIPFPPASTQEPLHLGGAPANLTTLRIPVWK
SFFGCLRNIFVNHIPVPVTEALEVQGPVSLNGCPDH
NT seq 10011 nt   +upstreamnt  +downstreamnt
atggcggcggccgcgcggcctcggggtcgggcaccggggccagtcctgccgccgacgccg
ctgctgctgctggtgctgcagctgctgccagcctggggggcgaccgctcgggatccccgg
gccgcggccgggctcagccttcacccgccttacttcaacctggccgaggcagcgaggatt
tgggccactgccacctgcggggagaggggacccagcgagaggaggccccggcccgagctc
tactgcaagttggtcgggggccccaccgccccaggcagcggccacaccatccagggccag
ttctgtgactattgcaattctgaggaccccaggaaggcgcatcctgtcaccaatgctatt
gatggatctgaacgttggtggcaaagtcctcccctgtcctcaggcacacagtacaacaga
gtcaacctcaccttggatctggggcagctcttccatgtggcctatattttaatcaaattt
gcaaattctcctcgccctgatctttgggtcttggaaagatctgtagattttggaagcacc
tactcaccatggcaatactttgctcattctaaagtagactgtttaaaagaattcgggcgg
gaggcaaatatggctgtcacccgggatgatgatgtactttgtgttactgaatattcccgt
attgtacctttggaaaatggtgaggttgtggtgtccttgataaacggtcgtccaggtgca
aaaaattttactttctctcacaccctgagggagtttaccaaggcaacaaacatccgcttg
cgttttcttcgaaccaatacgcttcttggacacctcatctccaaagcccagcgagatcca
actgtcactcggcggtattattacagcataaaggacatcagcattggtgggcggtgtgtt
tgcaatggccatgctgaagtgtgcaatgcaaacaatcctgaaaaactgtttcggtgtcaa
tgccagcaccacacctgtggggagacgtgtgatcgctgctgcacagggtacaaccagagg
cgctggcagcctgccgcgtgggagcagagccatgagtgtgaagcatgcaactgccacggc
catgccagtgattgttactatgattcagatgttgagcggcagcaggcaagcttgaatacc
cagggcatctatgctggtggaggggtctgcattaactgtcagcacaacacagctggagta
aactgtgaacagtgtgctaagggctattaccgcccttatggggttccagtggatgcccct
gatggctgcatcccctgcagctgtgaccctgagcatgcggatggctgtgaacagggctca
ggccgctgtcactgcaagccaaatttccatggagacaactgtgagaagtgtgcggttgga
tactacaattttccattttgcttgaaaattcccatttttcctgtttctactccaagtcca
gaagatccagtagctggagacataagagggtgtgactgtaatccggaaggcgttctcccc
gaaatatgtgatgtccacggacggtgcctgtgccgccctggggttgagggccctcgatgt
gatacctgccgctctggtttctactcgttccctatttgccaagcctgctggtgttcagcc
cttggatcctaccagatgccctgcagctcagtgactggacagtgtgaatgtcggccagga
gttacaggacagcggtgtgacaggtgtctctcaggagcctatgatttcccccactgccaa
ggttccagcagtgcttgtgacccagctggtaccatcgactccaatttggggtcttgccca
tgcaagcttcacgttgaaggccctacttgcagcatctgcaaaccgttgtattggaatctg
gacaaagaaaaccccaatggatgttcagaatgccagtgccataaggcgggaacagtgagt
ggaactggagagtgtaggcagggagacggtgactgtcactgcaagtcccacgtgggtggt
gattcctgcgacacctgtgaagatggatattttgctttggaaaagagcaattactttggg
tgtcaagggtgccagtgtgacattggtggggcgttgtcctctgtgtgcagcgggccctcc
ggagtgtgccagtgccgagagcatgtcgtgggaaaggcgtgccagcggcctgaaaacaac
tactatttcccagatttgcatcatatgaagtacgagattgaagacggcagcacacctaat
gggagagaccttcggtttggatttgatccgctggcatttcctgggtttagctggagagga
tatgcccaaatgacctcagtacagaatgatgtaagaataacattgaatgtggggaagtca
agtggctccttgtttcacgttattctgagatacattaacccgggaactgaagcagtatct
ggccatataactatttatccatcctggggtaaggcaagtgctgctcaaagcaaagagatc
atcttcctgccgagtaaggagccagcctttgtcaccgtccctggaaatggttttgcagat
ccattttcaatcacaccaggaatatgggttgcttgtattaaggcagaaggagtccttctg
gattacctggtgctgctccccagggactactatgaagcctctgtactgcagctgccagtc
acagaaccgtgtgcctacgcaggacctccccaagaaaattgcttgctctaccagcatttg
ccagtgaccagattcccctgtaccctggcttgtgaggccagacacttcctgcttgatggg
gagcccagacccgtggcagtgaggcagcccacacctgcacaccctgtcatggtggacctc
agtgggagagaggtggaattgcatctgcggctgcgcgtcccacaggttggccgctacgtg
gttgtgcttgaatattccacagaggcagcccagctgtttgtggctgatgtgaacgtgaag
agccccgggtctgttcgggcaggccaggtgaacatttacagctgcaagtacagcgttctc
tgccggagtgctgtgattgatcacatgagccgcatcgctgtgtatgagctgttggcagat
gcagacattcagctcaagggacacatggcccgattccttctgcatcaagtttgtatcata
cccattgaagaattctcagctgagtatgtgaggccgcaagtccactgcattgccagttac
aggcgatttgtcaatcaaagtgccacctgtgtctccctggcccatgaaactcctccaaca
gcattaattttggatgttgtaagtggcaggcctttccctcacctgccccagcagtcttca
ccttctgttgatgttcttcctggggtcaccttgaaggcaccacagaatcaagtgaccctg
agaggacgtgcaccacacctgggccgatacgtctttgtcatccatttttaccaagcagcg
cacccgacgtttcccgcgcaggtgtcagtggacggcgggtggccacgggcaggctccttc
cgtgcctctttttgcccccatgtgctcggctgccgggatcaagtgattgccgaaggccag
attgagtttgacatctcagagcctgaagtggctgcaaccgtgaaggttccagaaggaaag
tccttggttttggtccgtgttctaatggtgcctacagaaaactatgactaccaaatactt
cacaaaaaatccatggacaagtcactcgagtttatcaccaattgtggaaaaaacagcttt
tacattgacccccagacagcctccagattctgtaagaattctgccaggtccctggtggcc
ttttaccacaaaggtgccctgccttgtgagtgccaccccgccggagccaccagccctcac
tgcagccctgaaggtgggcagtgcccatgccggcccaacgtcattgggcggcagtgcacc
cgctgtgcaacaggccactatggattcccacactgcaagccgtgcagctgtggcctgcgc
ctttgtgaagagatgacggggcagtgccgctgccctccccgcacggtcaggccccagtgc
gaagtgtgtgagacacactcattcagcttccaccccatggccggctgcgaaggctgcaac
tgttccaggaggggcaccagcgaggctgccatgctgcagtgtgaccgggacagcgggcag
tgcagatgcaagcccagaatcacagggcggcagtgtgaccgatgtgcttccgggttttac
cgattccctgagtgtgttccctgcaattgcaaccgagatgggactgagccaggagtgtgt
gacccagggaccggggcttgcctctgcaaggaaaatgtagaaggcacagagtgtcatgtg
tgtcgagaaggctcattccatttggacccagccaatcccaagggttgtaccagctgtttc
tgttttggagtaaataatcaatgtcacagcgcacataagcgaagggctaagtttgtggat
atgctgggctggcgcctggagacggcagacagaatggacatccccgtctcattcaacccg
ggcagcaacagcatggtggcggatctccaggagctgcccgcaactgtccacagcgcgtcc
tgggtcgcacccgcgtcctacctgggggacaaggtttcttcatatggtggctacctcact
taccaagccaagtccttcggcttgcctagcgacatggttcttctggaaaagatgccggat
gtacagctcactggtcagcacatgtccgtcatctatgaggagacaaacaccccacggcca
gaccggctgcatcacgggcgagtgcaggtggtcgagggaaacttcagacatgccagcagc
cgtgccccggtgtctagggaggagctgatgacagtgctgtctagactggcagatatgcgc
atccaaggcctctacttcacagagactcaaaggctcaccctgagtgaggtggggctagag
gaagcctctgacacaggaagtggacgcatagcatacgctgtggaaatgtgtgcctgcccc
cctgcctacgctggcgactcttgtcagggttgtagccctggatattatcgggatcacaaa
ggcttgtataccggacggtgtgttccctgcaattgcaacggacattcaaatcgatgccag
gatggctcaggcatatgtgttaactgtcagcacaacaccgcgggggagcactgtgagcgc
tgccaggagggctactatggcaacgccgtccacggatcctgcagggcctgcccgtgtcct
cacaccaacaggtttgccaccggctgtgtggtgaacgggggagacgtgaggtgctcctgc
aaagctgggtacacaggaatacagtgtgaaaggtgtgcaccgggatatttcgggaatccc
cagaaattcggaggtagctgccaaccatgcagttgtaacaacaatggccagctgggcagt
tgtgatcccctgactggagactgcataaaccaagaacccaaagatagcagccctgcagaa
gaatgtgatgattgcgacagctgtgtgatgaccctcctgaacgacctggccaccatgggc
gagcagctccgcctggtcaagtctcagctgcagggcctgagtgccagcgcggggcttctg
gagcagatgaggcacatggagacccaggccaaggacctgaggaatcagttgctcaactac
cgttctaccatttcaaatcatggatcaaaaatagaaggcctggaaagagaactgactgat
ttgaatcaggaatttgagactttgcaagaaaaggctcaagtaaattccagaaaagcacaa
acattatacaacaatgttaatcgggcaatacaaagcgcaaaagaactggatatgaagatt
aaaactgtcatccggaatgtgcacattctcttaaagcagatctcggggccagatggagag
ggaaacaacgtgccttcaggtgaattttccagagagtgggctgaagcccagcgcatgatg
agggaactgcggaacaggaactttggaaagcacctgagagaagcagaagctgataaaagg
gagtcgcagctcttgctgaaccggataaggacctggcagaaaacccaccagggggagaac
aatgggcttgctaacagtatacgggattctttaaatgaatacgaagccaaactcagtgac
cttcgcgctcggctgcaggaggcagctgcccgagccaagcaggcaaatggcttgaaccaa
gaaaacgagagagctttgggagccatccagagacaagtaaaagaaataaattccctgcag
agtgatttcaccaagtatctgacctctgctgactcatctttgttgcaaaccaacattgca
ctgcagctgatggagaaaagccagaaggaatatgaaaaattagctgccagtttaaatgaa
gcaagacaagaactaagtgacaaagtaagagaactttccagatctgctggcaaaacatcc
cttgtggaggaggcagaaaagcatgcgcagtccttacaagagctggcaaagcagctggaa
gagatcaagagaaatgccagcggggatgagctggtgcgctgtgctgtggatgccgccacc
gcctatgagaacattctcaacgccatcaaagcggccgaggacgcggccaacagggctgcc
agtgcatctgaatctgccctccagacagtgataaaggaagatctgccaagaaaagctaaa
accctgagttccaacagtgataaactgttaaatgaagccaagatgacacaaaagaagcta
aagcaagaagtcagtccggctctcagcaacctacagcaaaccctgaatattgtgacagtt
cagagagaagtgatagccaccaatctcacaactctccgagatggtcttcgtgggatacag
agaggtgatattgatgctatgatcagtagtgcaaagagcatggtcagaaaggccaacgac
atcacagatgaggtcctggatgggctgaaccccattcagacagatgtggaaagaattaag
gacacctatgggagcacacagaacgaagacttcaaaaaggctctgacggatgcagataac
tcggtgaataagttaaccaacaaactacctgatctttggcgcaagattgaaagtatcaac
caacagctgttgcccttgggaaacatctctgacaacatggacagaatacgagaattaatt
cagcaggccagagatgctgccagtaaggttgctgtccccatgagattcaatggtaaatct
ggagttgaagtccgactgccaaatgacctggaagatttgaaaggatacacatctctgtcc
ttgtttctccaaaggcccaactcaagagaaaatgggggtactgagaatatgtttgtgatg
taccttggaaataaagatgcctcccgggactacatcggcatggcagttgtggatggtcag
ctcacctgtgtctacaacctgggagaccgtgagactgaactccaagtggaccagatcttg
accgagagtgagactcaggaggcagttatggatcgggtgaaatttcagagaatttatcag
tttgcaaggcttaattacaccaaaggagccacatccagtaaaccagaaatacccggagtc
tatgacatggatggtagaaacagcaatacactccttaatctggatcccgaaaatgttgta
ttttatgttggaggttacccacctgattttaaacttcccagtcgactaaggttccctcca
tacaaaggttgtattgaattagatgacctcaatgaaaatgttctgagcttatacaacttc
aaaaaaacattcaatctcaacacaactgaagtggagccttgtagaaggaggaaggaagag
tcagacaaaaattattttgaaggtacgggctatgctcgagttccaactcaaccacatgct
tccatcccaacctttgggcagacaattcagaccaccgtggatagaggcttgctgttcttc
gcagaaaatggggatcgcttcatatctctaaatatagaagatggcaagctcatggtgaga
tacaaactgaattcagagccaccaaaagagagaggagttggagacgccataaacaacggc
agagaccattcgattcagatcaaaattggaaaagtccaaaagcgtatgtggataaatgtg
gactttcaaaacactataattgatggtgaagtatttgatttcagcacatattatctggga
ggaattccaattgcaatcagggaaagatttaacatttctacgcctgctttccgaggctgc
atgaaaaatttgaagaaaaccagtggtgtcgttagattgaacgatactgtgggagtaacc
aaaaagtgctcggaagactggaagcttgtgcgatctgcctcattctccagaggaggacaa
ttgagtttcactaatctgggctcaccgcttactgaccaccttcaggcctcatttggattt
cagacttttcaacccagtggcatattattaagtcatcaaacatggacaagcaccctgcag
gtcactctggaagatggtcacattgaattgagcaccagggatagcggcagcccaattttt
aaatctccacagacgtatatggatggtttgctgcactatgtatctgtaataagcgacaac
tctggactacggcttctcatcgatgaccagcctctgagaaataaccaaaggctaaatcgc
atttcaagttcccagcagtctctgcgtctgggcgggagcaattttgagggttgtatcagc
aatgtttttgtccagaggttatcactgagtcccgaagtcctagatttggccagtaactcc
ctcaagagagatgtgtccctgggaggctgcagtttaaacaaaccaccttttctaatgttg
cttcagggttctaccaggtttaacaaggccaagactttccgtatcaatcagctgttgcag
gacacaccagtggcctccccaaggagcatgaaggtgtggcaagatgcctgttcaccactt
cccaggacccaggccaatcatggagccctccagtttggggacattcccaccagccacttg
ctattcaagcttcctcaggagctgctgaaacccaggtcacagtttgctgtggacgtgcag
acaacatcctccagaggactggtgtttcacacgggcactaagaactcctttatggctctt
tatctttcaaaaggacgtctggtctttgcactggggacagatgggaaaaaattgaggatc
aaaagcaaggagaaatgcaatgatgggaagtggcacacggtggtgtttggccatgatgga
gaaaaggggcgcttggttgtggatggactgagggcccgggagggaagtttgcctggaaac
tccaccgtcagcctcagaggaccagtttacctgggatcacctccatcagggaaaccaaag
agcctccccacaaacagctttgtgggatgcctgaagaactttcagctggattcaaaaccc
ttggataccccttcttcaagcttcggggtgtcttcttgcttgggtggtcctttggagaaa
ggcatttatttctccgaagaaggaggtcatgtcatcttggctcactctgtgttgctgggg
ccagaatttaagcttgttttcagcatccgcccaagaagtctcacagggatcctaatacac
atcggaagtcagccggggaagcacttaggtgtttacctggaggccggaaaggtcacggcc
tctatggacagtggggcaggcgggacctcaacctcagtcacaccaaagcagtctctgtgt
gatggacagtggcactcggtggcagtcaccataaaacaacatatcctgcacctggaactg
gacacagacagtagctacacagctggacagatccccttcccacctgccagcactcaagag
ccactacaccttggaggtgctccagccaatttgacgacactgaggatccctgtgtggaaa
tcattctttggctgtctgaggaatatttttgtcaaccacatccctgtccctgtcactgaa
gccttggaagtccaggggcctgtcagtctgaatggctgtcctgaccactaa

KEGG   Macaca mulatta (rhesus monkey): 701835
Entry
701835            CDS       T01028                                 

Gene name
IBSP
Definition
(RefSeq) bone sialoprotein 2
  KO
K06253  integrin binding sialoprotein
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    701835 (IBSP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    701835 (IBSP)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    701835 (IBSP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    701835 (IBSP)
SSDB
Motif
Pfam: BSP_II Mpp10 Utp14 CDC45 SAPS
Other DBs
NCBI-GeneID: 701835
NCBI-ProteinID: XP_028704013
Ensembl: ENSMMUG00000011111
LinkDB
Position
5
AA seq 318 aa
MKTALILLSILGMACAFSMKNLHRRVKIEDSEENGVFKYRPQYYLYKHAYFYPPLKRFPV
QGSSDSSEENGDDSSEEEEEEEETSNEGENNEESNEDEDSEAENTTLSATTLGYGEDATP
GTWYTGLAAIQLPKKAGDVTNKATKKEESDEEEEEEEEENENEESEAEVDENEQGINGTS
TNSTEAENGNGSSGGDNGEEEGEEESVTGASAEGTTATGGQGKGSSKTTTSPNGGFEPTT
PPPVYRTTSLPFGKTTTVEYEGEYEYTGANEYDNGYEVYESENGEPRGDNYRAYEDEYSY
FKGQGYDSYDGQNYYHHQ
NT seq 957 nt   +upstreamnt  +downstreamnt
atgaagactgctttaattttgctcagcattttgggaatggcctgtgctttctcaatgaaa
aatttgcatcgaagagtcaaaatagaggattctgaagaaaatggggtctttaagtacagg
ccacaatattatctttacaagcatgcctacttttatcctcctttaaaacgatttccagtt
cagggcagtagtgactcatctgaagaaaatggagatgatagttcagaagaggaggaggaa
gaagaggagacttcaaatgaaggagaaaacaatgaagaatcaaatgaagatgaagactct
gaggctgagaacactacactttctgctacaacactgggctatggagaggacgccacacct
ggcacatggtatacagggttagctgcaatccagcttcccaagaaggctggggatgtaaca
aacaaagctacaaaaaaggaggaaagtgatgaagaagaagaggaggaagaggaagaaaat
gaaaacgaagaaagcgaagcagaagtggatgaaaacgaacaaggcataaacggcaccagt
accaacagcacagaggcagaaaacggcaacggcagcagcggcggagacaacggagaagaa
gaaggggaagaagaaagtgtcactggagccagtgcagaaggcaccacagcaaccggaggg
cagggcaagggcagctctaagacaacaacctctccaaatggtgggtttgaacctacaacc
ccaccaccagtctatagaaccacctccctaccttttgggaaaaccaccaccgtcgaatac
gagggggagtacgaatatacgggcgccaatgaatacgacaatggatatgaagtctatgaa
agtgagaacggggaacctcgtggggacaattaccgagcctacgaggatgagtacagctac
tttaaagggcaaggctacgacagctatgatggtcagaattactaccatcaccagtga

KEGG   Macaca mulatta (rhesus monkey): 702111
Entry
702111            CDS       T01028                                 

Gene name
LAMB1
Definition
(RefSeq) laminin subunit beta-1 isoform X2
  KO
K05636  laminin, beta 1
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    702111 (LAMB1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    702111 (LAMB1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    702111 (LAMB1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    702111 (LAMB1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    702111 (LAMB1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    702111 (LAMB1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    702111 (LAMB1)
   05145 Toxoplasmosis
    702111 (LAMB1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N CorA F5_F8_type_C
Other DBs
NCBI-GeneID: 702111
NCBI-ProteinID: XP_001090393
Ensembl: ENSMMUG00000006567
LinkDB
Position
3
AA seq 1786 aa
MGLLQVLAFSFLALCGARVRAQEPEFSYGCAEGSCYPATGDLLIGRAQRLSVTSTCGLHK
PEPYCIVSHLQEDKKCFICNSQDPYHETLNPDSHLIENVVTTFAPNRLKIWWQSENGVEN
VTIQLDLEAEFHFTHLIMTFKTFRPAAMLIERSSDFGKTWGVYRYFAYDCEASFPGISTG
PMKKVDDIICDSRYSDIEPSTEGEVIFRALDPAFKIEDPYSPRIQNLLKITNLRIKFVKL
HTLGDNLLDSRMEIREKYYYAVYDMVVRGNCFCYGHASECAPVDGFSEEVEGMVHGHCMC
RHNTKGLNCELCMDFYHDLPWRPAEGRNSNACKKCNCNEHSSSCHFDMAVYLATGNVSGG
VCDDCQHNTMGRNCEQCKPFYYQHPERDIRDPNFCERCTCDPAGSQNEGICDSYTDFSTG
LIAGQCRCKLHVEGEHCDVCKEGFYDLSGEDPFGCKSCACNPLGTIPGGNPCDSETGHCY
CKRLVTGQHCDQCLPEHWGLSNDLDGCRPCDCDLGGALNNSCSAESGQCSCRPHMIGRQC
NEVEPGYYFATLDHYLYEAEEANLGPGVSIVERQYIQDRIPSWTGAGFVRVPEGAYLEFF
IDDIPYSMEYDILIRYEPQLPDHWEKAVITVQRPGKIPPSSRCGNTIPDDDNQVVSLSPG
SRYVVLPRPVCFEKGTNYTVRLELPQYTSSDSDVESPYTLIDSLVLMPYCKSLDIFTVGG
SGDGVVTNSAWETFQRYRCLENSRSVVKTPMTDVCRNIIFSISALLHQTGLACECDPQGS
LSSVCDPNGGQCQCRPNVVGRTCNRCAPGTFGFGPSGCKPCECHLQGSVNAFCNPVTGQC
HCFQGVYARQCDRCLPGHWGFPSCQPCQCNGHADDCDPVTGECLNCQDYTMGHNCERCLA
GYYGDPIIGSGDHCRPCPCPDGPDSGRQFARSCYQDPVTLQLACVCDPGYIGSRCDDCAS
GFFGNPSEVGGLCQPCQCHNNIDTTDPEACDKETGRCLKCLYHTEGEHCQFCRFGYYGDA
LRQDCRKCVCNYLGTVQEHCNGSDCQCDKATGQCLCLPNVIGQNCDRCAPNTWQLASGTG
CDPCNCNTAHSFGPSCNEFTGQCQCMPGFGGRTCSECQELFWGDPDVECRACDCDPRGIE
TPQCDQSTGQCVCVEGVEGPRCDKCTRGYSGVFPDCTPCHQCFALWDVIIAELTNRTHRF
LEKAKALKISGVIGPYRETVDSVERKVSEIKDILAQSPAAEPLKNIGNLFEEAEKLIKDV
TEMMAQVEVKLSDTTSQSNSTAKELDSLQTEAESLDNTVKELAEQLEFIKNSDIRGALDS
ITKYFQMSLEAEERVNASTAEPNSTVEQSALMRDRVEDVMMERESQFKEKQEEQARLLDE
LAGKLQSLDLSAAAEMTCGTPPGASCSETECGGPNCRTDEGEKKCGGPGCGGLVTVAHNA
WQKAMDLDQDVLSALSEVEQLSKMVSEAKLRADEAKQSAEDILLKTNATKEKMDKSNEDL
RNLIKQIRNFLTQDSADLDSIEAVANEVLKMEMPSTPQQLQNLTEDIRERVESLSQVEVI
LQQSAADIARAEMLLEEAKKASKSATDVKVTADMVKEALEEAEKAQVAAEKAIQQADEDI
QGTQNLLTSIESETAASEETLFNASQRISELERNVEELKRKAAQNSGEAEYIEKVVYTVK
QSAEDVKKTLDGELDEKYKKVENLIAKKTEESADARRKAEMLQNEAKTLLAQANSKLQLL
KDLERKYEDNQKYLEDKAQELARLEGEVRSLLKDISQKVAVYSTCL
NT seq 5361 nt   +upstreamnt  +downstreamnt
atggggcttctccaggtgttagctttcagtttcttagccctgtgcggagcccgagtgcgc
gctcaagaacccgagttcagctacggctgcgccgaaggcagttgctaccccgccacgggc
gaccttctcatcggccgagcacagaggctttctgtgacctcgacgtgcgggctgcacaag
cccgaaccctactgtatcgtcagccacttgcaggaggacaaaaaatgcttcatatgcaat
tcccaagatccttatcatgagaccctgaatcctgacagccatctcattgaaaatgtggtc
actacctttgcgccaaaccgccttaagatttggtggcaatctgaaaatggtgtggaaaat
gtaactatccaactggatttggaagcagaattccattttactcatctcataatgactttc
aagacattccgtccagctgccatgctgatagaacgatcgtccgactttgggaaaacctgg
ggcgtgtacagatacttcgcctatgactgtgaggcctcgtttccaggcatttcaactggc
cccatgaaaaaagtcgatgacataatttgtgattcccgatattctgacattgaaccctca
actgaaggagaggtgatatttcgtgctttagatcctgctttcaaaatagaagatccttat
agcccgaggatacagaatttattaaaaattaccaacttgagaatcaagtttgtgaaactg
catactttgggagataaccttctggattccagaatggaaatcagagaaaagtattattat
gcagtttatgatatggtggttcgaggaaattgcttctgctatggtcatgccagtgaatgt
gcccctgtggatggattcagtgaagaagtggaaggaatggttcatggacactgcatgtgc
aggcataacactaagggcttaaattgtgaactctgcatggatttctaccatgatttacct
tggagacctgctgaaggccgaaatagcaacgcctgtaaaaaatgtaactgcaatgaacat
tccagctcatgtcactttgacatggctgtttacctggccacggggaacgtcagcggaggc
gtgtgtgatgactgtcagcacaacaccatggggcgcaactgtgagcaatgcaagccgttt
tactaccagcacccagagagggacatccgagatcctaatttctgtgaacgatgtacatgt
gacccagctggctctcaaaatgagggaatttgtgacagctatactgatttttctactggt
ctcattgctggccagtgtcggtgcaaattacatgtggaaggagaacattgtgatgtttgc
aaagaaggcttctatgatttaagtggtgaagatccatttggttgtaaatcttgtgcttgc
aatcctctgggaacgattcctggagggaatccttgtgattccgagacaggtcactgctac
tgcaagcgtctggtgacaggacagcattgtgaccagtgcctgccagagcactggggctta
agcaatgatttggatggatgtcgaccatgtgactgtgaccttgggggagcgttaaacaat
agttgctctgcagagtcaggccagtgctcatgtcggcctcacatgattggacgtcagtgc
aatgaagtggaacctggttactactttgccaccctggatcactacctctatgaagcagag
gaagccaacttggggcccggggttagcatagtggagcggcagtacatccaggaccggatt
ccctcctggactggagccggcttcgtccgagtgcctgaaggggcttatttggagtttttc
attgacgacataccatattccatggagtatgacatcctaattcgctatgagccacagcta
cccgaccactgggaaaaagctgtcatcacagtgcagcgacctggaaagattccacccagc
agccgatgtggtaataccatccctgatgatgacaaccaggtggtgtcattatcaccaggc
tcaagatatgtcgtccttcctcggccggtgtgctttgagaagggaacgaactacacggtg
aggttggagctgcctcagtacacttcctctgatagcgacgtggagagcccctacacgctg
atcgattctcttgttctcatgccatactgtaaatcactggacatcttcaccgtgggaggt
tcaggagatggggtggtcaccaacagtgcctgggaaacctttcagagataccgatgtcta
gagaacagcagaagcgttgtgaaaacgccgatgacagatgtttgcagaaacatcatcttt
agcatttctgccctgttacaccagacaggcctggcgtgtgaatgcgaccctcagggttca
ttgagttctgtgtgtgatcccaacggaggccagtgccagtgccggcccaacgtggttgga
agaacctgcaacagatgtgcacctggaacttttggctttggccccagtggatgcaaacct
tgtgagtgccatctgcaaggatctgtcaatgccttctgcaatcctgtcactggccagtgc
cactgtttccagggagtgtatgctcggcagtgtgatcggtgcttacctgggcactgggga
tttccaagttgccagccctgccagtgcaatggccatgccgatgactgcgacccagtgacg
ggggagtgcttgaactgccaggactacaccatgggtcataactgtgagaggtgcttggct
ggttactatggcgaccccatcattgggtcaggagatcactgccgcccttgcccttgcccc
gatggtcccgacagtggacgccagtttgccaggagctgctaccaagatcctgttacttta
cagcttgcctgtgtttgtgatcctggatacattggttccagatgtgacgactgtgcctca
ggattctttggcaatccatcggaagttggggggttgtgtcagccttgccagtgtcacaac
aacattgacacgacagacccagaagcctgtgacaaggagaccgggaggtgcctcaagtgc
ctgtaccacaccgaaggggaacactgtcagttctgccggtttggatactatggtgacgcc
ctccggcaggactgtcgaaagtgtgtctgtaattacctgggcaccgtgcaagagcactgt
aacggctctgactgccagtgcgacaaagccactggtcagtgcttgtgtcttcctaatgtg
atcgggcagaactgtgaccgctgtgcacctaatacctggcagctggccagtggcactggc
tgtgacccatgcaactgcaacactgctcattccttcgggccatcttgcaatgagttcaca
gggcagtgccagtgcatgcctgggtttggaggccgcacctgcagcgagtgccaggaactc
ttctggggagaccctgacgtggagtgccgagcctgtgactgtgaccccaggggcattgag
acgccacagtgtgaccagtccacgggccagtgtgtctgcgtggagggtgttgagggtcca
cgctgtgacaagtgcacgcgagggtactcgggggtcttccctgactgcacaccctgccac
cagtgctttgctctctgggacgtgatcattgctgagctgaccaacaggacacacagattc
ctggagaaagccaaggccttgaagatcagtggtgtgatcgggccttaccgtgagactgtg
gactcggtggagaggaaagtcagcgagataaaagacatcctggcgcagagccccgcagca
gagccactgaaaaacattgggaatctctttgaggaagcagagaaactgattaaagatgtt
acagaaatgatggctcaagtagaagtgaaattatctgacacaacttcccaaagcaacagc
acagccaaagaactggattctctacagacagaagcggaaagcctagacaacactgtgaaa
gaacttgctgaacaactggaatttatcaaaaactcagatattcggggtgccttggatagc
attaccaagtatttccagatgtctctcgaggcagaggagagggtgaatgcctccaccgca
gaacccaacagtactgtggaacagtcagccctcatgagagacagagtagaagatgtgatg
atggagcgagaatcccagttcaaggaaaaacaagaggagcaggctcgcctccttgatgaa
ctggcaggcaagctgcaaagcctagacctttcagccgctgccgaaatgacctgtggaaca
cccccaggggcctcctgttccgagactgaatgtggcgggccaaactgcagaactgatgaa
ggagagaagaagtgtggggggcctggctgtggtggtctggttactgttgcacacaacgcc
tggcagaaagccatggacttggaccaagatgtcctgagtgccctgtctgaagtggaacag
ctctccaagatggtctctgaagcaaaactgagggcagatgaggcaaaacagagtgctgaa
gacattctgttgaagacaaatgctaccaaagaaaaaatggacaagagcaatgaggatctg
agaaatctaatcaagcaaatcagaaactttttgacccaggatagtgctgatttggacagc
attgaagcagttgctaatgaagtactgaaaatggaaatgcctagcaccccacagcaatta
cagaacttgacagaagatatacgtgaacgagttgaaagcctttctcaagtagaggttatt
ctacagcagagtgctgctgacattgccagagctgagatgttgttagaagaagctaaaaaa
gcaagcaaaagtgcaacagatgttaaagtcactgcagatatggtaaaggaagctctggaa
gaagcagaaaaggcccaggtcgcagcagagaaggcaattcaacaagcagatgaagacatt
caaggaacccagaacctgctaacttcgattgagtctgaaactgcagcttctgaggaaacc
ttgttcaacgcgtcccagcgcatcagcgagttagagaggaatgtggaggaacttaagcgg
aaagctgcccaaaactctggggaggcagaatatattgaaaaagtagtatatactgtgaag
caaagcgcagaagatgttaagaagactttagatggtgaacttgatgaaaaatataaaaaa
gtagaaaatttaattgccaaaaaaactgaggagtcagctgatgccagaaggaaagctgaa
atgctacaaaatgaagcaaaaactcttttagctcaagcaaacagcaagctgcaactgctc
aaagatttagaaagaaaatatgaagacaatcaaaaatacttagaagataaagctcaagaa
ttagcaagactggaaggagaagtccgttcactccttaaggatataagccagaaagttgct
gtttatagcacgtgcttgtaa

KEGG   Macaca mulatta (rhesus monkey): 702225
Entry
702225            CDS       T01028                                 

Gene name
LAMB4
Definition
(RefSeq) laminin subunit beta-4 isoform X1
  KO
K06245  laminin, beta 4
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    702225 (LAMB4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    702225 (LAMB4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    702225 (LAMB4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    702225 (LAMB4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    702225 (LAMB4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    702225 (LAMB4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    702225 (LAMB4)
   05145 Toxoplasmosis
    702225 (LAMB4)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N
Other DBs
NCBI-GeneID: 702225
NCBI-ProteinID: XP_014990160
Ensembl: ENSMMUG00000021289
LinkDB
Position
3
AA seq 1762 aa
MQFQLTLFLHLGWLSYSKAQDDCNRGACHPTTGDLLVGRNTQLTASSTCGLSRAQKYCIL
SYLEGEQKCFICDSRFPYDPYTQPNSHTIENVIVSFEPDREKKWWQSENGLDHVSIRLDL
EALFRFSHLILTFKTFRPAAMLVERSTDYGHTWKVFKYFAKDCATSFPNITSGQAQGVGD
IVCDSKYSDIEPSTGGEVVLKVLDPSFEIENPYSPYIQDLVTLTNLRINFTKLHTLGDTL
LGRRQNDSLDKYYYALYEMVVRGSCFCNGHASECRPMQKVRGDVFSPPGMVHGQCVCQHN
TDGPNCERCKDFFQDAPWRPAAELQDNACRSCSCNSHSSRCHFDMTMYLASGGLSGGVCE
DCQHNTEGQHCDRCRPLFYRDPLKTISDPYACIPCECDPDGTISGGICVSHSDPALGSVA
GQCLCKENVEGAKCDQCKPNHYGLSATDPLGCQPCDCNPLGSLPLLTCDVDTGQCLCLSY
VTGAHCEECTVGYWGLGNHLHGCSPCDCDIGGAYSNMCSPKNGQCECRPHVTGRSCSEPA
PGYFFAPLNFYLYEAEEAMPLQGLAPLGSETFGQSPAVHIVLGEPVPGNPVTWTGPGFAR
VLPGAGLRFAVNNIPFPVDFTIAIRYETQSAADWTVQIMVNPTRGSEHCVPKTSQSKPQS
FALPAATRIMLLPTPICLEPDVQYSIDVYFSQPLERESYTHSHILVDSLGLIPQINSLEN
FCSKQDLDEYQLHNCVEIASAMGPQVLPGACERLIISMSAKLHDGAVACKCHPQGSVGSS
CSRLGGQCQCKSLVVGRCCDRCSPGSYGLGHHGCHPCHCHPQGSKDTVCDLVTGQCPCHG
EVSGRRCDRCLAGYFGFPSCRSCPCNGFAELCDPETGSCFNCGGFTTGRNCERCIDGYYG
NPSSGQPCRPCLCPDDPSSNQYFAHSCYQNLWSSDVICNCLQGYTGTQCGECSTGFYGNP
RISGAPCQPCPCNNNIDVTDPESCSRVTGECLRCLHNTQGANCQLCKPGHYGSALNQTCR
RCSCHASGVSPVECPPGGGACLCDPVTGACPCLPNVTGLACDRCADGYWNLVPGRGCQSC
DCDPRTSQSSHCDQLTGQCPCKLGYGGKRCSECQENYYGDPPGRCIPCDCNRAGTQKPIC
DPDTGMCRCREGVSGQRCDRCARGHSQEFPTCLQCHLCFDQWDHTISSLSKAVQGLMRLA
ANMEDKRETLPVCKADFKGLRENMSEIERILKHPVFPSGKFLKVKDYHDSVRKQIMQLNE
QLKAVYEFQDLKHTIERAKNEADLLLEDLQKEIDLQFSVLNASIVDSSENIKKYYHISSS
AEKKINETSSTINTSANTRNDFLTILNTLTSKGNLSLEKLKQIKIPDIQILNEKVCGDPG
NVPCVVPLPCGGAVCMGQEGHRKCGGAGCHGSLTLSTNALQKAQEAKSVIRNLDKQVRGL
KNQIKNISKLAEVSKNNALQLRGKLGNTRNQSDSEEEKLNLFIKKVKNFLLEENVPPEDI
EKVANGVLDIHLPIPSQNLMDEVNKIQKLMQLCEDYRSDENRLSEEEDGAQKLLVKAKAA
EKAANVLLNLDKTLNQLQQAQITHGRANSTITQLTANITKIKKNVLQAENQTREMKNELE
SAKQRSGLEDGLALLQTKLQRHEDHAVNVKVQAESAQHQAGSLEKEFVELKKQYAVLQHK
TSTTGLTKETLGKVKQLKDAAEKLAGDTEDKIRRITDLERKIQDLNLSRQAKADQLRILE
DQVVAIKNEIVEQEKKYATCYS
NT seq 5289 nt   +upstreamnt  +downstreamnt
atgcaatttcaactgaccctttttttgcaccttgggtggctcagttactcaaaagctcaa
gatgactgcaacaggggtgcctgtcatcccaccaccggcgatctcctggtgggcaggaac
acacagcttacggcttcttctacctgtgggctgagcagagcccagaaatactgcatcctc
agttacctggagggggaacaaaaatgcttcatctgtgactctagatttccatatgatcca
tacacccaacccaatagccacaccattgagaatgtcattgtaagttttgaaccagacaga
gaaaagaaatggtggcagtctgaaaatggtcttgatcatgtcagcatcagactggactta
gaggcattatttcggttcagccaccttatcttgacctttaagacttttcggcctgctgca
atgttagttgaacgttccacagactatggacacacctggaaagtgttcaaatattttgca
aaagactgtgccacttcctttcctaacatcacatctggccaggcccagggagtgggagac
attgtttgtgactccaaatactcggatattgaaccctcaacaggtggagaggttgtttta
aaagttttggatcccagttttgaaattgaaaacccttatagcccctacatccaagacctt
gtgacattgacaaacctgaggataaactttaccaagctccacacccttggggatactttg
cttggaaggaggcaaaatgattcccttgataaatactactatgctctgtacgagatggtt
gttcggggaagctgcttttgcaatggccatgctagcgaatgtcgccctatgcagaaggtg
cggggagatgttttcagccctcctggaatggttcacggtcagtgtgtctgtcagcacaat
acagatggtccaaactgtgagagatgcaaggacttcttccaggatgctccttggaggcca
gctgcagaactccaggacaacgcttgcagatcgtgcagctgtaacagccactccagccgc
tgtcacttcgacatgactatgtacctggcaagcggtggcctcagcgggggcgtgtgtgaa
gactgccagcacaacactgaggggcagcactgcgaccgctgcaggcccctcttctacagg
gaccccctcaagaccatctcggatccctacgcgtgcattccttgtgaatgtgaccctgat
gggaccatatctggtggtatttgtgtgagccactctgatcctgccttagggtctgtggcc
ggccagtgcctttgtaaagaaaacgtggaaggagccaaatgcgaccagtgcaagcccaac
cactacggactgagcgccactgaccccttgggctgccagccctgcgactgtaaccccctt
gggagtctgccactcttgacctgtgatgtggatacaggccaatgcttgtgcctgtcatat
gtcaccggagcacactgcgaagaatgcactgttggatactggggcctgggaaatcatctc
catgggtgttctccctgtgactgtgatattggaggtgcttattctaacatgtgttcaccc
aagaatgggcagtgtgaatgccgcccacatgtcactggccgtagctgctctgaaccagcc
cctggctacttctttgctcctttgaatttctatctctacgaggcggaggaagccatgcca
ctccaaggactggcgcctttgggctcggaaacttttggccagagtcctgctgttcacatt
gttttaggagagccagttcctgggaaccctgttacatggactggccctggatttgccagg
gttctccctggggctggcttgagatttgctgtcaacaacattccctttcctgtggacttc
accattgccattcgctatgaaacccagtctgcagctgactggactgtccagatcatggtg
aaccccactcgagggagtgagcactgcgtacccaagacttcacagtcaaagcctcagtct
tttgccttaccagcggctaccagaatcatgctgcttcccacacccatctgtttagaacca
gatgtacaatattccatagacgtctatttttctcagcctttggaaagagagtcctatact
cattcacacatcctggtggactcccttggccttattccccaaatcaattcattggagaat
ttctgcagcaagcaggacttagatgagtatcaacttcacaactgtgttgaaattgcctca
gcaatgggacctcaagtgctcccgggtgcctgtgaaaggctgatcatcagcatgtccgcc
aagctgcatgatggggctgtggcctgcaagtgtcacccccagggctcagtcggatccagc
tgtagtcgacttggaggccagtgccagtgtaaatctcttgtggtcgggcgctgctgtgat
aggtgctcacctggaagctatggtttggggcatcatggctgtcacccatgtcactgccat
cctcaaggatcaaaggacactgtgtgtgacctagtaacaggacagtgcccctgccatgga
gaggtgtctggccgccgctgtgatcgctgcctggcaggctactttggatttcccagctgc
cggtcttgcccttgtaatgggtttgctgaactttgtgatcctgagacggggtcatgcttc
aattgtggaggctttacaactggcagaaactgtgaaaggtgtattgatggttactatgga
aatccttcttcaggacagccctgtcgtccttgcctgtgtccggatgatccctcaagcaat
cagtattttgcccattcctgttatcagaatctgtggagctcagatgtaatctgcaattgt
cttcaaggttatacgggtactcagtgtggagaatgctctactggtttctatggaaatcca
agaatttcaggagcaccttgccaaccatgtccctgcaacaacaacatagatgtaaccgat
ccagagtcctgcagccgggtaacaggggagtgccttcgatgtttgcacaacactcagggt
gcaaactgccagctctgcaaaccaggtcactatggatcagccctcaatcagacctgcaga
agatgctcctgccatgcttctggcgtgagtcccgtggagtgtccccctggtgggggagct
tgcctctgtgaccctgtcactggtgcatgcccttgtctgccgaatgtcacaggcctggcc
tgtgatcgctgtgctgatggatactggaatctggtccctggcagaggatgtcagtcatgt
gactgtgaccctaggacctctcaaagtagccactgtgaccagcttacaggccagtgtcca
tgtaaattaggctacggcgggaaacgttgcagtgagtgccaggaaaattattatggtgat
ccacctgggagatgcattccatgtgattgcaacagggcgggtacccagaagcccatctgt
gatccagacacaggcatgtgccgctgccgggagggtgtcagcggccagagatgtgatcgc
tgtgcccggggtcacagccaggaattccctacttgtcttcaatgtcacttgtgctttgat
caatgggatcacaccatttcttccctctccaaagcggtgcaagggttaatgagactggct
gctaacatggaagataaaagagagaccctgcctgtctgtaaggcagacttcaaaggcctc
agagagaacatgtctgaaatagaaaggattttgaaacatcctgttttcccatctgggaaa
ttcttaaaagtcaaggattatcatgactctgttagaaaacaaatcatgcagctaaatgaa
caattgaaagcagtgtatgaatttcaagatctgaaacatacaatagaaagagcaaagaat
gaagcagacctcttacttgaagatcttcagaaagaaattgatttgcaattcagtgtcctt
aatgcaagcatcgtggactcctcagaaaacatcaagaaatattatcacatatcatcatct
gctgaaaagaaaattaatgaaactagttctaccattaatacctctgcaaataccaggaat
gactttcttaccatcttaaatacattaacctccaaaggaaacttgtcattggaaaaatta
aagcagattaagataccagatatccaaatattgaatgaaaaggtgtgtggagatccagga
aatgtgccatgtgttgtgcccttgccctgtggtggtgctgtctgcatgggccaggaaggg
cacaggaagtgtgggggtgccggctgtcatggctccctgaccctctcaacgaatgccctc
caaaaagcccaggaagcaaaatctgttattcgtaatttagacaaacaggttcgtgggttg
aagaatcagatcaaaaatataagtaaactggcagaagtctccaaaaacaatgccttacag
ctaaggggaaaactgggaaatacaagaaaccaaagtgactctgaagaagaaaaactcaat
cttttcatcaaaaaagtgaaaaactttttgttagaggaaaacgtgcctccagaagacatc
gagaaggttgcaaatggtgtacttgacattcacctaccaattccatcccaaaatctaatg
gatgaagttaacaaaatacagaaacttatgcaactctgtgaggattacaggtcagatgaa
aacaggctaagtgaagaagaagatggagcccaaaagcttctggtgaaggctaaagcagct
gagaaagcagcaaatgttctattaaatcttgacaaaacattgaaccagttgcaacaagct
caaatcactcatggacgggcaaactctaccattacacagctgactgccaatataacaaaa
ataaaaaagaatgtgctgcaggctgaaaatcaaaccagggaaatgaagaatgagctggag
tcagcaaagcagcgatcagggctggaggatggacttgccctgctgcagaccaagttgcaa
aggcatgaagaccacgctgtcaatgtgaaagttcaggctgaatctgcccaacaccaggct
gggagtcttgagaaggaatttgttgagctgaaaaaacaatatgctgtgctccagcataag
acaagcactacaggactaacaaaggaaacattaggaaaagttaaacagctaaaagatgct
gcagaaaaattggctggagatacagaggacaagataagaagaataacagatttagaaagg
aaaatccaagatttgaatctaagtagacaagcaaaagctgatcaactgagaatattggaa
gatcaagttgttgccattaaaaatgaaatcgttgaacaagaaaaaaaatatgctacgtgc
tatagctag

KEGG   Macaca mulatta (rhesus monkey): 702601
Entry
702601            CDS       T01028                                 

Gene name
CHAD
Definition
(RefSeq) chondroadherin
  KO
K06248  chondroadherin
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    702601 (CHAD)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    702601 (CHAD)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    702601 (CHAD)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    702601 (CHAD)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mcc00535]
    702601 (CHAD)
Proteoglycans [BR:mcc00535]
 Extracellular matrix (ECM) proteoglycans
  Small leucine-rich proteoglycan (SLRP) family
   class IV
    702601 (CHAD)
SSDB
Motif
Pfam: LRR_8 LRR_4 LRR_5 LRR_1 LRRNT LRR_9 LRRCT
Other DBs
NCBI-GeneID: 702601
NCBI-ProteinID: XP_014974797
Ensembl: ENSMMUG00000017238
UniProt: A0A5F8A6P0
LinkDB
Position
16
AA seq 359 aa
MASPMLLLSLGLLAGLLPALAACPQNCHCHGDLQHVICDKVGLQKIPKVSEKTKLLNLQR
NNFPVLAANSFRAMPNLVSLHLQHCQIREVAAGAFRGLKQLIYLYLSHNDIRVLRAGAFD
DLTELTYLYLDHNKVTELPRGLLSPLVNLFILQLNNNKIRELRAGAFQGAKDLRWLYLSE
NALSSLQPGALDDVENLAKFHVDRNQLSSYPSAALSKLRVVEELKLSHNPLKSIPDNAFQ
SFGRYLETLWLDNTNLEKFSDGAFLGVTTLKHAHLENNRLNHLPSNFPFDSLETLTLTNN
PWKCTCQLRGLRRWLEAKASRPDATCASPAKFKGQHIRDTDAFRSCKFPTKRSKKAGRH
NT seq 1080 nt   +upstreamnt  +downstreamnt
atggccagcccaatgctcttgctcagcctcggcctcctggctggtctgctgccggcgctg
gccgcctgcccccagaactgccactgccacggcgacctgcagcacgtcatttgcgacaag
gtggggctgcagaagatccccaaggtatcagagaagacaaagctgctcaacctacagcgc
aacaacttcccggtgctggctgccaactcgtttcgggccatgccgaacctcgtgtcgctg
cacctgcagcactgccagattcgcgaggtggccgccggtgccttccgcggcctcaagcag
cttatctacttgtacctgtcccataacgacatccgcgtgctgcgcgcaggtgccttcgac
gacctgaccgagctgacctacctctacctggaccacaacaaggtcaccgagctgccccgg
gggttgctctccccgctggtcaacctcttcatcttgcagctcaacaacaacaaaatccgt
gagctgcgcgcaggcgccttccagggagccaaggacctgcgctggctctacctgtcggaa
aacgcgctgagctccctgcagcccggggccctggacgacgtggagaacctcgccaaattc
cacgtggacaggaaccagctgtccagctacccctcggctgccctgagcaagctacgggtg
gtggaggagctgaagctgtcccacaaccccctgaaaagcatcccggacaatgccttccag
tcctttggcagatacctggaaaccctctggctggacaacactaacctggagaagttctca
gatggtgccttcctgggtgtaaccacactgaaacacgcccatttggagaacaaccgcctg
aaccacctgccttccaacttccctttcgacagcctggagaccctcacccttaccaacaac
ccctggaagtgtacctgccagctccggggtcttcggcggtggctagaagccaaggcctcc
cgcccagatgccacctgtgcctcacctgccaagttcaagggccaacacatccgtgacaca
gacgccttccgcagctgcaagttccctaccaagaggtccaagaaagctggccgccattaa

KEGG   Macaca mulatta (rhesus monkey): 702720
Entry
702720            CDS       T01028                                 

Gene name
TNC
Definition
(RefSeq) tenascin isoform X1
  KO
K06252  tenascin
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05165  Human papillomavirus infection
mcc05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    702720 (TNC)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    702720 (TNC)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    702720 (TNC)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    702720 (TNC)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    702720 (TNC)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    702720 (TNC)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   702720 (TNC)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_Tenascin EGF_2 EGF Pur_ac_phosph_N hEGF fn3_2 DUF2369 DUF4998
Other DBs
NCBI-GeneID: 702720
NCBI-ProteinID: XP_014972457
Ensembl: ENSMMUG00000003356
LinkDB
Position
15
AA seq 2201 aa
MGAMTRLSAGVLLALLALLTEGGVLKKVIRHKRQSGVNITLPEENQPVVFNHVYNIKLPV
GSQCSVDLESASGEKDLAPPSEPSESFQEHTVDGENQIVFTHRINIPRRACGCAAAPDVK
ELLSRLEELENLVSSLREQCAAGAGCCLQSAIGRVDTRPFCSGRGNFSTEGCGCVCEPGW
KGPNCSEPECPGNCNLRGQCIDGQCICDEGFTGEDCSQLACPGDCNDQGKCVNGVCICFE
GYAGADCSREICLVPCSEEHGTCVDGLCVCHEGFAGDDCNKPLCLNNCYNRGRCVENECV
CDEGFTGEDCSELICPNDCFDRGRCINGTCYCEEGFTGEDCGKLTCPHACHGQGRCEEGQ
CVCDEGFAGVDCSEKRCPADCHNRGRCVDGQCECDDGFTGADCGELKCPNGCSGHGRCVN
GQCVCDEGYTGEDCSQLRCPNDCHSRGRCVQGKCVCEQGFKGYDCSDMSCPNDCHQHGRC
VNGMCVCDDGYTGEDCRDRQCPRDCSNRGRCVDGQCVCEDGFTGPDCAELSCPNDCHGRG
RCVNGQCVCHEGFMGKDCKEQRCPSDCHGQGRCMDGQCICHEGFTGLDCGQRSCPSDCSN
LGQCVSGRCICNEGYAGEDCSEVSPPKDLIVTEVTEETVNLAWDNEMRVTEYLVVYTPTH
EGGLEMQFRVPGDQTSTIIRELEPGVEYFIRVFAILENKKSIPVSARVATYLPAPEGLKF
KSIKETSVEVEWDPLDIAFETWEIIFRNMNKEDEGEITKSLRRPETSYRQTGLAPGQEYE
ISLHIVKNNTRGPGLKRVTTTRLDAPSQMEVKDVTDTTALITWFKPLAEIDGIELTYGIK
DVPGDRTTIDLTEDENQYSIGNLKPDTEYEVSLISRRGDMSSNPAKETFTTGLDAPRNLR
RVSQTDNSITLEWRNGKAAIDSYRIKYAPISGGDHAEVDVPKSQQATTKTTLTGLRPGTE
YGIGVSAVKEDKESNPATINAATELDTPKDLRVSETAETSLTLFWKTPLAKFDRYRLNYS
LPTGQWVGVQLPRNTTSYVLRGLEPGQEYNVLLTAEKGRHKSKPARVKASTEQAPELENL
TVTEVGWDGLRLNWTAADQAYEHFVIQVQEANKVEAAQNLTVPGNLRAVDIPGLKAATPY
TVSIYGVIQGYRTPVLSAEASTGETPNLGEVMVAEVGWDALKLNWTVPEGAYEYFFIQVQ
EADTVEAAQNHTVPGGLRSTDLPGLKAATHYTITIRGVTQDFSTTPLSVEVLTEEVPDMG
NLTVTEVSWDALRLNWTTPDGTYDQFTIQVQEADQVEEAHNLTVPGSLRSVEIPGLRAGT
PYTITLHGEVRGHSTRPLALEVITEELPQLGDLAVSEVGWDGLRLNWTAADQAYEHFVIQ
VQEVNKVEAAQNLTVPGSLRAVDIPGLKAATPYTVSIYGVIRGYRTPVLSAEASTAKEPE
IGNLNVSNITPESFSLSWTATDGIFETFTIEIIDSNRLLEIVEYNISGAERTAHISGLPP
STDFIVYLSGLAPSFRTKTISATATTEALPLLENLTISDINPYGFTVSWMASENAFDSFL
VTVVDSGKLLDPQEFTLSGTQRKLELRGLITGIGYEVMVSGFTQGHQTKPLRAEIVTEAE
PEVDNLLVSDATPDGFRLSWTADEGIFDNFVLKIRDTKKQSEPLEITLLAPERTRDITGL
REATEYEIELYGISKGRRSQPVSAIATTAMGSPKEVIFSDITENSATVSWRAPTAQVESF
RITYVPITGGTPSMVTVDGTKTQTRLVKLIPGVEYLVNIIAMKGFEESEPVSGSFTTALD
GPSGLVTANITDSEALARWQPAIATVDSYVISYTGEKVPEITRTVSGNTVEYALTDLEPA
TEYTLRIFAEKGPQKSSTVTAKFTTDLDSPRDLTATEVQSETALLTWRPPRASVTGYLLV
YESVDGTVKEVIVGPDTTSYSLVDLSPSSHYTAKIQALNGPLRSNMIQTIFTTIGLLYPF
PKDCSQAMLNGDTTSGLYTIYLNGDKAQALEVFCDMTSDGGGWIVFLRRKNGHENFYQNW
KAYAAGFGDRREEFWLGLDNLNKITAQGQYELRVDLRDHGETAFAVYDKFSVGDAKTRYK
LKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAITNCALSYKGAFWYRNCHRVNLMGRYGD
NNHSQGVNWFHWKGHEHSIQFAEMKLRPSNFRNLEGRRKRA
NT seq 6606 nt   +upstreamnt  +downstreamnt
atgggggccatgactcggctgtcggcaggtgtccttcttgctttgcttgccctccttacc
gaaggtggggtcctcaagaaagtcatccggcacaagcgacagagtggggtgaacatcacc
ctgccagaggagaaccagccagtggtgtttaaccacgtctacaacatcaagctgccagtg
ggatcccagtgttcggtggatctggagtcagccagtggggagaaagacctggccccacct
tcagagcccagcgaaagcttccaggagcacacagtggatggggaaaaccagattgtgttc
acacatcgcatcaacatcccccgccgggcctgtggctgtgccgcagcgcctgatgttaag
gagctgctgagcagactggaggagctggagaacctggtgtcttccctgagggagcaatgt
gctgcaggagcaggctgctgtctccagtctgccataggccgcgtggacaccaggcccttc
tgcagcggtcggggcaacttcagcactgaaggatgtggctgtgtctgcgaacctggctgg
aaaggccccaactgctctgagcccgaatgtccaggcaactgtaaccttcgaggccagtgc
attgatgggcagtgcatctgtgacgagggcttcacgggcgaggactgcagccagcttgct
tgccccggcgactgcaatgaccagggcaagtgcgtgaatggagtctgcatctgttttgaa
ggctacgccggggctgactgcagccgggaaatctgcctggtgccctgcagtgaggagcac
ggcacgtgtgtagatggcttgtgtgtgtgccacgaaggctttgcaggtgatgactgcaac
aagcctctgtgtctcaacaattgctacaaccgtgggcgatgcgtggagaatgagtgcgtg
tgtgatgagggtttcacgggtgaagactgcagtgagctcatctgccccaatgactgcttc
gaccggggccgctgcatcaatggcacctgctactgcgaagaaggcttcacaggtgaagac
tgcgggaaactcacctgcccgcatgcctgccatggccagggccggtgcgaggaggggcag
tgcgtatgcgatgagggctttgccggtgtggattgcagcgagaagaggtgtcctgctgac
tgtcacaatcgtggccgctgtgtagacgggcagtgtgagtgtgatgatggtttcacggga
gctgactgtggggagctcaagtgtcccaatggctgcagtggccatggccgctgtgtcaat
gggcagtgtgtgtgtgatgagggctatactggggaagactgcagccagctacggtgcccc
aatgactgtcacagtcggggccgctgtgtccagggcaaatgtgtatgtgagcaaggcttc
aagggctatgactgcagtgacatgagctgccccaatgactgtcaccagcacggccgctgt
gtgaatggcatgtgtgtttgtgatgacggctacacaggtgaagactgccgggatcgccag
tgcccccgggactgcagcaacaggggccgctgtgtggacggacagtgcgtctgtgaggac
ggcttcaccggccctgactgcgcagaactctcctgtccaaatgactgccatggccggggt
cgctgtgtgaacgggcagtgtgtgtgccatgaaggattcatgggcaaagattgcaaggag
caaagatgtcccagtgactgtcatggccagggccgctgcatggacggccaatgcatctgc
cacgagggcttcacaggcctggactgtggccagcgctcctgccccagtgactgcagcaac
ttaggacaatgcgtctcaggccgctgcatctgcaacgagggctacgccggagaagactgc
tcagaggtgtctcctcccaaagacctcatcgtgacagaagtgacggaagagacggtcaac
ctggcctgggacaatgagatgcgggtcacagagtaccttgtcgtgtacacgcccacccac
gagggcggcctggaaatgcagttccgtgtgcctggggaccagacgtctaccatcatccgg
gagctggagcctggtgtggagtactttatccgtgtatttgccatcctggagaacaagaag
agcattcctgtcagtgccagggtggccacgtacttgcctgcacctgaaggcctgaaattc
aagtctatcaaggagacatctgtggaagtggagtgggatcctctagatatcgcttttgaa
acctgggagatcatcttccggaatatgaataaagaagatgagggagagatcaccaaaagc
ctgaggaggccagagacctcttaccggcaaactggcctggctcctgggcaagagtatgag
atatctctgcacatagtgaaaaacaatacccggggccctggcctgaagagggtgaccacc
acacgcttggatgcccctagccagatggaggtgaaagatgtcacagacaccactgctttg
atcacctggttcaagcccctggctgagatagacggcattgagctgacctacggcatcaaa
gacgtgccaggcgaccgtaccaccatcgatctcacagaggacgagaatcagtactccatc
gggaacctgaagcccgacactgagtacgaggtgtccctcatctcccgcagaggcgacatg
tcaagcaacccagccaaagagaccttcacaacaggcctcgatgctcccaggaaccttcga
cgtgtttcccagacagataacagcatcaccctggaatggaggaatggcaaagcagctatt
gacagttacagaattaagtatgctcccatctctggaggggaccacgctgaggttgatgtt
ccaaagagccaacaagccacaaccaaaaccacactcacaggtctgaggccggggactgaa
tatgggattggagtttctgctgtgaaggaagacaaggagagcaatccagcgaccatcaac
gcagccacagagttggacacacccaaggaccttcgggtttctgaaactgcagagaccagc
ctgaccctgttctggaagaccccgttggccaagtttgaccgctaccgcctcaattacagc
ctccccacaggccagtgggtgggagtgcagcttccaagaaacaccacttcctatgtcttg
agaggcctggaaccgggacaggagtacaatgtcctcctgacagccgagaaaggcaggcac
aaaagcaagcccgcacgcgtgaaggcatccactgaacaagcccctgaactggaaaacctc
accgtgactgaggttggctgggatggcctcagactcaactggaccgcagctgaccaggcc
tatgagcactttgtcattcaagtgcaagaggccaacaaggtggaggcagctcagaacctc
accgtgcctggcaacctccgggctgtggacatcccgggcctcaaggctgccacgccttat
acagtctccatctatggggtgatccagggctatagaacaccagtgctctctgctgaggcc
tccacaggggaaactcccaatttgggagaggtcatggtggccgaggtgggctgggatgcc
ctcaaactcaactggactgttccggaaggggcctatgagtactttttcattcaggtgcag
gaggctgacacagtagaggcagcccagaaccacacggtcccaggaggactgaggtccacg
gacctgcctgggctcaaagcagccactcattataccatcaccatccgcggggtcactcag
gacttcagcacaacccccctctctgttgaagtcttgacagaggaggttccagatatggga
aacctcacagtgaccgaggttagctgggatgctctcagactgaactggaccacgccagat
ggaacctatgaccagtttaccattcaggtccaggaggctgaccaggtggaagaggctcac
aatctcaccgttcctggcagcctgcgctccgtggaaatcccaggcctcagggctggcact
ccttacacaatcaccctgcacggcgaggtcaggggccacagcactcgaccccttgcttta
gaggtcatcacagaggagctcccacagctgggagatttagccgtgtctgaggttggctgg
gatggcctcagactcaactggaccgcagctgaccaggcctatgagcactttgtcattcag
gtgcaggaggtcaacaaagtggaggcagcccagaacctcacggtgcctggcagcctccgg
gctgtggacatcccgggcctcaaggctgccacgccttatacagtctccatctatggggtg
atccggggctatagaacaccagtactctctgctgaggcttccacagccaaagaacctgaa
attggaaacttaaatgtttctaacataactcccgagagcttcagtctctcctggacagct
accgatgggatcttcgagacctttaccattgaaattattgattccaataggttgctggag
attgtggaatataatatctctggtgctgaacgaactgcccatatctcagggctaccccct
agtactgattttattgtctacctctctggacttgctcccagcttccggaccaaaaccatc
agtgccacagccacaacagaggccctgccccttctggaaaacctaaccatttccgacatt
aatccctacgggttcacagtttcctggatggcatcggagaatgcctttgacagctttcta
gtaacggtggtggattctgggaagctgctggacccccaggaatttacactttcaggaacg
cagaggaagctggagcttagaggcctcataactggcattggctatgaggttatggtctct
ggcttcacccaagggcaccaaaccaagcccttgagggctgagattgttacagaagccgaa
ccagaagttgacaaccttctggtttcagatgccaccccagacggtttccgtctgtcctgg
acagctgatgaagggatcttcgacaattttgttctcaaaatcagagataccaaaaagcag
tctgagccactggaaataaccctacttgcccccgaacgtaccagggacataacaggtctc
agagaggctactgaatacgaaattgaactgtatggaataagcaaaggaaggcgatcccag
ccagtcagtgctatagcaacaacagccatgggctctccaaaggaagtcattttctcggac
atcactgaaaattcggctactgtcagctggagggcacccactgcccaggtggagagcttc
cggattacctatgtgcccattacaggaggtacaccctccatggtaactgtggatggaacc
aagactcagaccaggctggtgaaactcatacctggggtggagtaccttgtcaacatcatt
gccatgaagggctttgaggaaagtgaacctgtctcagggtcgttcaccacagctctggat
ggcccatctggcctggtgacagccaatatcactgactcagaagccttggccaggtggcag
ccggccattgccactgtggacagttatgtcatctcctacacaggggagaaagtgccagaa
attacacgcacggtgtccgggaacacagtggagtatgctctgaccgacctcgagcctgcc
acggaatacacactgagaatctttgcagagaaagggccccagaagagctcaaccgtcact
gccaagttcacaacagacctggattctccaagagacttgactgctactgaggttcagtca
gaaactgccctccttacctggcgaccaccccgggcatcagtcactggttacctgctggtc
tatgaatcggtggatggcacagtcaaggaagtcattgtgggtccagacaccacctcctac
agcctggtagacctaagcccatcctcccactacaccgccaagatccaggcactcaatggg
cccctaaggagcaatatgatccagaccatcttcaccacaattggactcctgtaccccttc
cccaaggactgctcccaagcaatgctgaatggagacacgacctctggcctctacaccatt
tatctgaatggtgataaggcccaggcgctggaagtcttctgtgacatgacctctgatggg
ggtggatggattgtgttcctgagacgcaaaaacggacatgagaacttctaccaaaactgg
aaggcctatgctgctggatttggggaccgcagagaagaattctggcttgggctggacaac
ctgaacaaaatcacagcccaagggcagtacgagctccgggtggacctgcgggaccatggg
gagacagcctttgctgtctatgacaagttcagcgtgggagatgccaagactcgctacaag
ctgaaggtggaggggtacagcgggacagcaggtgactccatggcctaccacaacggcagg
tccttctccaccttcgacaaggacacggattcagccatcaccaactgtgctctgtcctac
aagggggctttctggtacaggaactgtcaccgtgtcaacctgatggggagatatggggac
aataaccacagtcagggcgttaactggttccactggaagggccatgaacactcaatccag
tttgctgagatgaaactgagaccaagcaacttcagaaatcttgaaggcaggcgcaaacgg
gcataa

KEGG   Macaca mulatta (rhesus monkey): 703707
Entry
703707            CDS       T01028                                 

Gene name
COL4A6
Definition
(RefSeq) collagen alpha-6(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    703707 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    703707 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    703707 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    703707 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    703707 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    703707 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    703707 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    703707 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    703707 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    703707 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    703707 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    703707 (COL4A6)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   703707 (COL4A6)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   703707 (COL4A6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 703707
NCBI-ProteinID: XP_001098448
Ensembl: ENSMMUG00000007205
LinkDB
Position
X
AA seq 1687 aa
MHPGLWLLLVTLCLTEELAGAGEKSYGKPCGGQDCSGSCQCFPEKGARGRPGPIGIQGPS
GPQGFTGSTGLSGLKGERGSPGLLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGPP
GLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPGLPGLDG
ITGPQGAPGSSGAVGPAGPPGLQGPPGPPGPPGPDGNMGLGFQGEKGVKGDVGLPGPAGP
PPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGIPGLPGPRGP
MGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGEKGDIGLPGPDVFIDIDGAVISGNPGDP
GVPGLPGLKGDEGIQGLRGPSGVAGLPALSGVPGALGPQGFPGLKGDQGNPGRTTIGAAG
LPGRDGLPGPPGPPGPPGPEFEAETLHNKEPGFPGLRGEQGPKGNPGLKGIKGDSGFCAC
DGGVPNTGPPGEPGPPGPRGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGSPGPSGPKGK
KGEPILSTISGMPGDRGDSGSQGFPGVIGKPGNDGVPGLPGLPGLPGDGGQGFPGEKGLP
GLPGEKGHPGPPGLPGIGLPGLPGPRGLPGDKGKDGLPGQQGPPGSKGITLPCIIPGSYG
PSGFPGTPGFPGPKGSRGLPGTPGQPGSSGNKGKPGSPGLVHLPELPGFPGPRGEKGLPG
FPGLPGKDGLPGTIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGDKGLLGDSGLPGL
KGVYGKPGLLGPKGERGSPGTPGPVGQPGTPGSSGPYGIKGKSGLPGAPGFPGTSGHPGK
KGTRGEKGPPGSIVKKGLPGLKGLPGNPGLIGLKGSPGSPGVAGLPALSGPKGEKGSVGF
VGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGHAGTPGEKGDRGNPGPVGIPGPRRPMSN
LWLKGDKGSQGSAGSDGFPGPRGDKGEAGQPGPPGLPGAPGLPGTIKGVSGKPGPPGFMG
IRGLPGLKGSSGITGFPGMPGESGSQGIRGSSGLPGTSGLPGLKGDNGQTLEISGSPGPK
GQPGESGFKGTKGRDGPIGNIGFPGNKGEDGKVGVSGDVGLHGAPGFPGVAGMRGEPGLP
GSSGHQGAIGPLGPPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPAGLPG
PKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGDPGRP
GLDGERGRPGPPGPPGPTGPSSNQGNTGDPGFPGIPGPKGPKGDQGIPGFSGLPGELGLK
GMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGPQGAPGQTPTAEAVQVPPGPLGL
PGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQGPPGF
EGAPGQQGPFGMPGMPGQSARVGYTLVKHSQSEQVPLCPIGMSQLWVGYSLLFVEGQEKA
HNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQIPQY
ISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVSPGSC
LEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETLKAGQLHTRVSRC
QVCMKSL
NT seq 5064 nt   +upstreamnt  +downstreamnt
atgcaccctgggttgtggctgctcctggttacgttgtgcctgacggaggaactggcagga
gcgggagagaagtcttatggaaagccgtgtgggggccaagactgcagtgggagctgtcag
tgttttcctgagaaaggagcgagagggcgacctggaccaattggaattcaaggcccatca
ggtcctcaaggattcactggctctactggtttatcgggattgaaaggagaaaggggttcc
ccaggccttctgggaccttatggacccaaaggagataagggtcccatgggagttcctgga
tttcttggcatcaatgggattccgggccatcctggacagccgggccccagaggcccacct
ggtctggatggctgtaatggaactcaaggagctgtcggatttccaggccctgatggctat
cctgggcttctcggaccacctgggcttcctggtcagaaaggatcaaaaggtgaccctgtc
cttgctccaggtagtttcaaaggaatgaagggggatcctgggctgcctggactggatgga
atcactggcccacaaggagcacccggatcttctggagctgtaggacctgcaggaccacca
ggattacaaggtcctccagggcctcctggtcctcctggtcctgatgggaatatggggcta
ggttttcaaggagagaaaggagtcaagggggatgttggcctccctggcccagcagggcct
ccaccatctactggagagctggaattcatgggattccccaaagggaagaaaggatccaag
ggtgaaccagggcctaagggttttccaggcataagtggccctccaggcttcccgggcctt
ggaactactggagaaaagggagaaaagggaatccctggtttgccaggacctaggggtccc
atgggttcagaaggagtccaaggccctccagggcaacagggcaagaaagggaccctggga
tttcctgggcttaatggattccaaggaattgagggtgaaaagggtgacattggcctgcca
ggcccagatgttttcatcgatatagatggtgctgtgatctcaggtaatcctggagaccct
ggtgtacctggcctcccaggccttaaaggagatgaaggcatccagggcctgcgtggcccg
tctggtgtcgctggcttgccagcattatcaggtgtcccaggagccctagggcctcaggga
tttccagggctgaagggggaccaaggaaacccaggccgtaccacaattggagcagctggc
ctccctggcagagatggtttgccaggcccaccaggtccaccaggcccacctggtccagaa
tttgaggccgaaactctacacaacaaagagccagggttccctggtctccgaggagaacaa
ggtccaaaaggaaacccaggcctcaaaggaataaaaggagactcaggtttctgtgcttgt
gacggtggtgttcccaacactggaccacccggggaaccaggcccacctggtccacggggt
ctcataggccttccaggccttaaaggagccagaggagatcgaggctctgggggtgcacag
ggcccagcaggggctccaggcttagttgggtctccgggtccttcaggacccaaaggaaag
aagggggaaccaattctcagtacaatctcaggaatgccaggggatcggggtgattctggc
tcccagggcttccctggtgtaataggaaaaccaggcaacgatggagtaccaggtctacca
ggtctgccaggacttccgggtgatggtgggcagggcttcccaggtgaaaaggggttacct
ggacttcctggtgaaaaaggccatcctggtccacctggcctcccgggaattgggttacca
ggactgcctggaccccgtgggcttcctggagataaaggcaaggatggattaccgggacaa
caaggccctcctggatctaagggaatcaccctgccgtgtattattcctgggtcatacggt
ccatcaggatttccaggcactcccggattcccaggccctaaagggtcccgaggcctccct
gggaccccaggccagcctggatcaagtggaaataaaggaaagccagggagtccaggattg
gttcatcttcctgaattaccaggatttcctggacctcgtggggagaagggcttgcctggg
tttcctgggctccctggaaaagatggcttgcctgggacgattggcagtccaggtttacct
ggttccaagggagccactggtgacatctttggtgctgaaaatggtgctccgggggagcaa
ggcctacaaggattaacaggcgacaaaggacttcttggagactctggccttccaggactc
aagggtgtgtacgggaagcctggcttgctaggccccaaaggtgagcggggcagccctgga
acaccaggaccggtgggacagccaggcaccccaggatctagtggtccgtatggcatcaag
ggcaaatctgggctcccaggagcaccaggctttccaggcacctcagggcatcctggaaag
aaaggaacaagaggcgagaaaggccctcctggatcaattgtaaagaaagggctgccaggg
ctaaaaggccttcctggaaatccaggcctaatagggctgaaaggaagcccaggctctcca
ggggtcgctgggttgccagccctctctggacccaagggagagaaggggtctgttggattc
gtaggttttccaggaataccaggtctgcctggtattcctggaacaagaggattaaaggga
attccaggatcaaccggaaaaatgggaccatctggacacgctggcactcctggtgaaaag
ggagacagaggcaatccggggccagtcggaatacctggtccaagacgtccaatgtcaaac
ctttggctcaaaggagacaaaggctctcaaggctcagccggatccgatggatttccaggg
ccaagaggtgacaaaggagaggctggtcaacctgggccaccaggcctgcctggagctcct
ggcctcccaggcactatcaaaggagttagtggaaagccaggtccccctggcttcatggga
atccggggcttacctggcttgaaggggtcctctgggataacaggtttcccaggaatgcca
ggagaaagtggttcacaaggtatcagagggtcgtctggactcccaggaacatctggtctc
ccaggcctgaaaggagacaatggccagacacttgaaatttccggtagcccaggaccgaag
ggacaacctggtgaatctggttttaaaggcacaaaaggaagagatggaccaataggcaat
ataggcttccctggaaacaaaggtgaagatggaaaagttggtgtttctggagatgttggc
cttcatggagctccagggtttccaggggttgccggcatgagaggagaaccaggacttcca
ggctcttctggtcatcaaggggcaattgggcccctaggaccccccggattaataggaccc
aaaggcttccctggatttcctggtttacatggactgaatgggcttccgggcaccaagggt
acccatggcactccaggacctagtatcactggtgtgcctgggcctgctggtctccctgga
cccaaaggagaaaaaggatatccaggaattggcatcggagctccagggaagccgggcctg
agagggcaaaaaggtgatcgaggtttcccaggtctccagggccctgctggtctccccggt
gccccaggcatctccttgccctcactcatagcaggacagcctggtgaccccgggcgacca
ggcctagatggagaacgaggccgcccaggcccccctggacccccaggtcccactgggcca
tcctcgaatcaaggcaacaccggagaccctggcttccctggaattcctggacctaaaggg
cctaagggagaccaaggaattccaggtttttctggccttcctggagagctaggactgaaa
ggcatgagaggtgagcctggcttcatggggactccaggcaaggttgggccacctggagac
ccaggatttcccgggatgaaggggaaggcagggccaagaggctcttctggcccccaaggt
gctcctggacaaacaccaactgcagaagctgtccaggttcctcctggacccttgggtctg
ccagggatcgatggcatccctggcctcactggggaccctggggctcaaggccctgtaggc
ctacaaggctccaaaggtttacctggcatccccggcaaagatggtcccagtgggctccca
ggcccacctggggctcttggtgatcctggtctgcctggactgcaaggccctccaggattt
gaaggagctccagggcagcaaggccccttcgggatgcctggaatgcctggccagagcgcg
agagtgggctacacgttggtaaagcacagccagtcggaacaggtgcccctgtgtcccatc
gggatgagccagctatgggtggggtacagcttactgttcgtggaggggcaagagaaagcc
cataaccaggacctgggctttgctggctcctgtctgccccgcttcagcaccatgcccttc
atctactgcaacatcaacgaggtgtgccactatgccaggcgcaatgataaatcctactgg
ctctccactactgcccctatccccatgatgcccgtcagccagacccagattccccagtac
atcagccgctgctctgtgtgtgaggcaccctcgcaagccattgctgtgcacagccaggac
atcaccatcccgcagtgccccctgggctggcgcagcctctggattgggtactccttcctc
atgcacactgccgctggtgccgagggtggaggccagtccctggtctcacctggctcctgc
ctagaggactttcgggccacccctttcattgagtgcagtggtgcccgaggcacctgccac
tactttgcaaacaaatacagtttctggttgaccacagtggaggagaggcagcagtttggg
gagttgcctgtgtctgaaacactgaaagctgggcagctccacactcgagtcagtcgctgc
caggtgtgtatgaaaagcctgtag

KEGG   Macaca mulatta (rhesus monkey): 703814
Entry
703814            CDS       T01028                                 

Gene name
COL4A5
Definition
(RefSeq) collagen alpha-5(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    703814 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    703814 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    703814 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    703814 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    703814 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    703814 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    703814 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    703814 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    703814 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    703814 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    703814 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    703814 (COL4A5)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   703814 (COL4A5)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   703814 (COL4A5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 703814
NCBI-ProteinID: XP_014983488
Ensembl: ENSMMUG00000007235
LinkDB
Position
X
AA seq 1691 aa
MKLRGVSQAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDIGPPGPPGLVIPRPGTGITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVVGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPPGPHIPPSDEICEAGPPGPPGSPGDKGLQGEQGMKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGSKGLPGIPGAPGAPGF
PGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGVQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGLPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGTLGPKGDRGFP
GPPGPPGRTGIDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGSIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPAGEKGSKGEPGLPGPPGPMDPNLL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPPGAPGQPGSPGLPGQKGDKGDPGISSIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPG
NPGLPGETGPVGGGGRPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGPTGFQGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRP
GLNGMKGDPGLPGVPGFPGMKGPSGIPGSAGPEGEPGLTGPPGPPGLPGPSGQSIIIKGD
AGPPGIPGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGT
RGLDGPPGPDGLQGPPGPPGTSSIAHGFLITRHSQTTDAPQCPQGTLQIYEGFSLLYVQG
NKRAHGQDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPVSMQPLKG
QSIQPFISRCAVCEAPAVVIAVHSQTIQIPRCPQGWDSLWIGYSFMMHTSAGAEGSGQAL
ASPGSCLEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTR
ISRCQVCMKRT
NT seq 5076 nt   +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagccaggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcggcttgctatgggtgttctccaggatcaaagtgtgactgc
agtggcataaaaggggaaaagggagagagagggtttccaggtttggaaggacacccagga
ttacctggatttccaggtccagaagggcctccagggcctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccatgatggggccccaggacctcaaggt
attcctggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttccc
ggtttacagggtcctccaggaccccctgggatcccaggtatgaagggtgaaccaggtagt
ataattatgtcatcactgccaggaccaaagggtaatccaggatatccaggtcctcctgga
atacaaggcctacctggtcccactggtataccagggccaattggtcccccaggaccacca
ggtttgatgggccctcctggtccaccaggacttccaggaccaaaggggaatatgggctta
aatttccagggacccaaaggtgaaaagggtgagcaaggtcttcagggcccacctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtccc
ccaggtggtgagaaaggtgagaagggtgagcaaggagagccaggcaaaagaggtaaacca
ggcaaagatggagaaaacggccaaccaggaattcctggtttgcctggtgatcctggttac
cctggtgaacccggaagggatggtgaaaagggccaaaaaggtgacattggcccacctgga
cctcctggacttgtaattcctagacctgggactggtataactataggagaaaaaggaaac
attgggttgcctggcttgcccggggaaaaaggagagcgaggatttcctggaatacagggt
ccacctggccttcctggacctccaggggctgcagttgtgggtcctcctggccctcctgga
tttcctggagaaaggggtcagaaaggtgatgaaggaccacctggaatttccattcctgga
cctcctggacttgatggacagcctggggctcctgggcttccagggcctcctggccctcct
ggacctcacatccctcctagtgatgagatatgtgaagcaggccctccaggccccccagga
tctccaggtgataaaggactccaaggagaacaaggaatgaaaggtgacaaaggtgacact
tgcttcaactgcattggaactggtatttcagggcctccaggtcaacctggtttgccaggt
ctcccaggtcctccaggatcccttggtttccctggacagaaaggtgaaaaaggacaagct
ggtgcaactggttccaaaggattaccaggcattccaggagctccaggtgctccaggcttt
cctggatctaaaggtgaacctggtgatatcctcacttttccaggaatgaagggtgacaaa
ggagagttgggttcccctggagctccagggcttcctggtttacctggcactcctggacag
gatggattgccagggcttcctggcccgaaaggagagcctggtggaattacttttaagggt
gaaagaggtccccctgggaacccaggtttaccaggcctcccagggaatatagggcctatg
ggtcctcctggttttggccctccaggcccagtaggtgaaaaaggcgtacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggggatccaggtcagactataacc
cagccagggaagcctggcttgcctggtaacccaggcagagatggtgaagtaggtcttcca
ggtgaccctggactcccagggcaaccaggcttgccagggatacctggtagcaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggctttcctggaatt
ccaggacctccaggagcacctgggacacctggaagaattggtctagaaggccttcctggg
ccacctggctttccaggaccaaagggtgaaccaggatttgcattacctgggccacctggg
ccaccaggacttccaggtttcaaaggaacacttggtccaaaaggtgatcgtggtttccca
ggacctccaggtcctccaggacgcactggcatagatgggctccctggaccaaaaggtgat
gttggaccaaatggacaacctggaccaatgggacctcctgggctgccaggaataggtgtt
caaggaccaccaggaccaccagggattcctgggccaataggtcaacctggtttacatgga
ataccaggagagaagggggatccaggacctcctggacttgatgttccaggacccccaggt
gaaagaggcagtccagggatccctggagcacctggttctataggacctccaggatcacca
gggcttccaggaaaagcaggtgcctctggatttccaggtaccaaaggtgaaatgggtatg
atgggacctccaggcccaccaggacctttgggaattcctggcaggagtggtgtacctggt
cttaaaggtgatgatggcttgcagggtcagccaggacttcctggccctgcaggagaaaaa
ggtagtaaaggagagcctggccttccaggccctcctggaccaatggatccaaatcttctg
ggctcaaaaggagagaagggggaacctggcttaccaggtatacctggagtttcagggcca
aaaggttatcagggcttgcctggagacccagggcaacctggactgagtggacaacctgga
ttaccaggaccaccaggtcccaaaggtaaccctggtctccctggacagccaggtcttata
ggacctcctggacttaaaggaaccatcggcgatatgggttttccagggcctcagggtgtg
gaagggcctcctggacctcctggagctcctggacagcctggctccccaggattacctgga
cagaaaggagacaaaggtgatcctggtatttcaagcattggtcttccaggtcttcctggt
ccaaagggtgagcctggtctgcctggatacccagggaaccctggtatcaaaggttctgtg
ggagatcctggtttgcctggattaccaggaacccctggagcaaaaggacaaccaggcctt
cctggattcccaggaaccccaggccctcctggaccaaaaggtattagtggccctcctggg
aaccctggccttccaggagaaactggtcctgtaggtggtggaggtcgtcctgggcaacca
gggcctccaggcgaaaaaggcaaacctggtcaagatggtattcctggaccagctggacag
aagggtgaaccaggtcaaccaggctttggaaacccaggaccccctggacttccaggactt
tctggccaaaagggtgatggaggattacctggaattccaggaaatcctggccttccaggt
ccaaagggcgaaccaggctttcatggtttccctggtgtgcagggtcccccaggccctcct
ggttctccgggtccagctctggaaggacctaaaggcaaccccgggccccaaggtcctcct
gggagaccaggtcctacaggttttcaaggtctaccaggtccggaaggtcccccaggtctc
cctggaaatggaggtattaaaggagagaagggaaacccaggccaacctgggctacctggc
ttgcctggtttgaaaggagatcaaggaccaccaggactccagggtaatcctggtcggccg
ggtctcaatggaatgaaaggagatcctggtctccctggtgttccaggattcccaggcatg
aaaggacccagtggaatacctggatcagctggtcctgagggggaaccgggacttactggt
cctccaggtcctcctggattacctggtccttcaggacagagtatcataatcaaaggagat
gctggtcctccaggaatccctggacaacctgggttaaaaggtctaccaggaccccaagga
cctcaaggtttaccaggtccaactggccctccaggagatcctggacgcaatggactccct
ggctttgatggtgcaggagggcgcaaaggagacccaggtctgccaggacagccaggtact
cgtggtttggatggtccccccgggccagatggattgcaaggtcccccaggtccccctgga
acctcctctattgcacatggatttcttattacacgccacagccagacaacggatgcacca
caatgcccgcagggaacacttcagatctatgaaggcttttctctcctgtatgtacaagga
aataaaagagcccatggtcaagacttggggacggctggcagctgccttcgtcgctttagt
accatgcctttcatgttctgcaacatcaataatgtttgcaacttcgcttcaagaaatgac
tattcttactggctctctaccccagagcccatgccagtgagcatgcaacccctaaagggc
cagagcatccagccattcattagtcgatgtgcagtatgtgaagctccagctgtggtgatc
gcagttcacagtcagacgatccagattccccgttgtcctcagggatgggattctctgtgg
attggttattccttcatgatgcatacaagtgcaggggcagaaggctcaggtcaagcccta
gcctctcctggttcctgcttggaagagtttcgttcagctcccttcatcgaatgtcatggg
aggggtacctgtaactactatgccaactcctacagcttttggctggcaactgtagatgtg
tcagacatgttcagtaaacctcagtcagaaacgctgaaagcaggagacttgaggacacga
attagccgatgtcaagtgtgcatgaagaggacataa

KEGG   Macaca mulatta (rhesus monkey): 704851
Entry
704851            CDS       T01028                                 

Gene name
COL2A1
Definition
(RefSeq) collagen alpha-1(II) chain isoform X2
  KO
K19719  collagen type II alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    704851 (COL2A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    704851 (COL2A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    704851 (COL2A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    704851 (COL2A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    704851 (COL2A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    704851 (COL2A1)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   704851 (COL2A1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 704851
NCBI-ProteinID: XP_001100739
Ensembl: ENSMMUG00000011444
UniProt: F7B3R3
LinkDB
Position
11
AA seq 1487 aa
MIRLGAPQTLVLLTLLVAAVLRCQGQDVQEAGSCVQDGQRYNDKDVWKPEPCRICVCDTG
TVLCDDIICEDVKDCLSPEIPFGECCPICPTDLATASGQPGPKGQKGEPGDIKDIVGPKG
PPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFA
AQMAGGFDEKAGGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMG
PRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKG
EAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVG
PAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGSPGSPGPAGASGNPGTDGIPGAKG
SAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQG
APGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAG
PKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPG
VMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPG
PSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGSQGLQGARG
LPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPG
KDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPG
ADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPG
AAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGDPGLQGPAGPPGEKGEPG
DDGPSGADGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPG
PVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGSPGPPGSPGPAG
PTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQG
LPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAG
PPGNPGPPGPPGPPGPGIDMSAFAGLGPREKGPDPLQYMRADQAAGGLRQHDAEVDATLK
SLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETG
ETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRL
LSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDDCTK
HTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL
NT seq 4464 nt   +upstreamnt  +downstreamnt
atgattcgcctcggggctccccagacgctggtgctgctgacgctgctcgtcgccgctgtc
cttcggtgtcagggccaggatgtccaggaggctggcagctgtgtgcaggatgggcagagg
tataatgataaggatgtgtggaagccggagccctgccggatctgtgtctgtgacactggg
actgtcctctgcgacgacataatctgtgaagacgtgaaagactgcctcagccctgagatc
cccttcggagagtgctgccccatctgcccaactgacctcgccactgccagtgggcaacca
ggaccaaagggacagaaaggagaacctggagacatcaaggatattgtaggacccaaagga
cctcctgggcctcagggacctgcaggggaacaaggacccagaggcgatcgtggtgacaaa
ggtgaaaaaggtgcccctggacctcgtggcagagatggagaacctgggacccctggaaat
cctggcccccctggtcctcctggcccccctggtccccctggtcttggtggaaactttgct
gcccagatggctggaggatttgatgaaaaggctggtggcgcccagatgggagtaatgcag
ggaccaatgggccccatgggacctcgaggacctccaggccctgcaggtgctcctgggcct
caaggatttcaaggcaatcctggtgaacctggtgaacctggtgtctctggtcccatgggt
ccccgtggtcctcctggtccccctggaaagcctggtgatgatggcgaagctggaaaacct
ggaaaagctggtgaaagaggtccgcctggtcctcagggtgctcgcggtttcccaggaacc
ccaggccttcctggtgtcaaaggtcacagaggttatccaggcctggacggtgctaaggga
gaggcgggtgcccctggtgtgaagggtgagagtggttccccgggtgagaacggatctccg
ggcccaatgggtcctcgtggcctgcctggtgaaagaggacggactggccctgctggtgct
gcgggtgcccgaggcaacgatggtcagccaggccccgcagggcctccgggtcctgtcggt
cctgctggtggtcctggcttccctggtgctcctggagccaagggtgaagccggccccact
ggtgcccgtggtcctgaaggtgctcaaggtcctcgtggtgaacctggtagtcctgggtcc
cctgggcctgctggtgcctctggtaaccctggaacagatggaattcctggagccaaagga
tctgctggtgctcctggcattgctggtgctcctggtttccctgggccacggggacctcct
ggccctcaaggtgcaactggtcctctgggcccgaaaggtcagacgggtgaacctggtatt
gctggcttcaaaggtgaacaaggccccaagggagaacctggccccgctggcccccaggga
gcccctggacccgctggtgaagaaggcaagagaggtgcccgtggagagcctggtggtgtt
gggcccattggtccccctggagaaagaggtgctcctggcaaccgcggtttcccaggtcaa
gatggtctggcaggtcccaagggagctcctggagagcgagggcccagtggtcttgctggt
cccaagggagccaatggtgaccctggccgtcctggagaacctggacttcctggagcccgg
ggtctcactggccgccctggtgatgctggtcctcaaggcaaagttggcccttctggagcc
cctggtgaagatggtcgtcctggacctccaggtcctcagggggctcgcgggcagcctggt
gtcatgggtttccctggccccaaaggtgccaatggcgagcctggcaaagctggtgagaag
ggactgcctggtgctcctggtctgagaggccttcctggcaaagatggtgagacaggtgct
gcaggaccccctggccctgctggacctgctggtgaacgaggcgagcagggtgctcctggg
ccatctgggttccagggacttcctggccctcctggtccgccaggtgaaggtggaaaacca
ggtgaccagggtgttcccggtgaagctggagcccctggcctcgtgggtcccaggggtgaa
cgaggtttcccaggtgaacgtggctctcccggttcccaaggcctccaaggtgcccgtggc
ctccctggcactcctggcactgatggtcccaaaggtgcatctggcccagcaggcccccct
ggggctcagggccctccaggtcttcagggaatgcccggtgagaggggagcagctggtatc
gctgggcccaagggtgacaggggtgatgttggtgagaaaggccctgagggagcccctgga
aaggatggtggacgaggcctgacaggtcccattggcccccctggcccggctggtgctaac
ggcgagaagggagaagttggacctcctggtcctgcaggaagtgctggtgctcgtggcgct
ccgggtgaacgtggagagactgggccccccggaccagcaggatttgctgggcctcccggt
gctgatggccagcctggggccaagggtgagcaaggagaggccggccagaaaggcgacgct
ggtgcccctggtcctcagggcccctctggagcgcctgggcctcagggtcctactggagtg
actggtcctaaaggagcccgaggtgcccaaggccccccgggagccaccggattccctgga
gctgctggccgcgttggacccccaggctccaatggcaaccctggaccccctggtccccct
ggtccttctggaaaagatggtcccaaaggtgctcgaggagacagcggcccccctggccga
gctggtgaccctggcctccaaggtcctgctggcccccctggcgagaaaggagagcctgga
gatgatggtccctctggtgccgatggtccaccaggtccccagggtctggcgggacagaga
ggcatcgtcggtctgcctgggcagcgtggtgagagaggattcccaggcttgcctggccca
tcgggtgagcccggcaagcagggcgctcctggagcatctggagacagaggtcctcctggc
cccgtgggtcctcctggcctgaccggtcctgcaggtgaacctggacgagagggaagcccc
ggcgctgatggcccccctggcagagatggcgcagctggagtcaagggtgatcgtggtgag
actggtgctgtgggagctcctggatcccctgggccccctggctctcctggccccgctggt
ccaactggcaagcagggagacagaggagaagctggtgcacaaggccccatgggaccctca
ggaccagctggagcccggggaatccagggtcctcaaggcccccgaggtgacaaaggagag
gctggagagcctggcgagagaggcctgaagggacaccgtggcttcactggtctgcagggt
ctgcccggccctcctggtccttctggagaccaaggtgcttctggtcctgctggtccttct
ggccctagaggtcctcctggccctgtcggtccctctggcaaagatggtgctaatggaatc
cctggtcccattgggcctcctggtccccgtggacgatcaggcgaaaccggccctgctggt
cctcctggaaatcctggaccccctggtcctccaggtccccctggccctggcatcgacatg
tccgcctttgctggcctaggcccgagagagaagggccccgaccccctgcagtacatgcgg
gctgaccaggcagccggtggcctgagacagcatgatgccgaggtggatgccacactcaag
tccctcaacaaccagattgagagcatccgcagccccgagggctcccgcaagaaccctgca
cgcacctgcagagacctgaaactctgccatcctgagtggaagagtggagactactggatt
gaccccaaccaaggctgcaccttggacgccatgaaggttttctgcaacatggagactggt
gagacttgcgtctaccccaacccagcaaacgttcccaagaagaattggtggagcagcaag
agcaaggagaagaaacatatctggtttggagaaaccatcaatggtggcttccacttcagc
tatggagatgacaacctggctcccaacactgccaacgtccagatgaccttcctacgcctg
ctttccacggaaggctcccagaacatcacctaccactgcaagaatagcattgcctacctg
gacgaagcagctggcaacctcaagaaggccctgctcatccagggctccaacgacgtggag
atccgggcagagggcaatagcaggttcacgtacactgccctgaaggatgactgcacgaaa
cacaccggtaagtggggcaagactgtcatcgagtaccggtcacagaagacctcacgcctc
cccatcattgacattgcacccatggacataggagggcccgagcaggaattcggtgtggac
atagggccggtctgcttcttgtaa

KEGG   Macaca mulatta (rhesus monkey): 704930
Entry
704930            CDS       T01028                                 

Gene name
SPP1
Definition
(RefSeq) osteopontin precursor
  KO
K06250  secreted phosphoprotein 1
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04371  Apelin signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04620  Toll-like receptor signaling pathway
mcc04929  GnRH secretion
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04371 Apelin signaling pathway
    704930 (SPP1)
   04151 PI3K-Akt signaling pathway
    704930 (SPP1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    704930 (SPP1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    704930 (SPP1)
 09150 Organismal Systems
  09151 Immune system
   04620 Toll-like receptor signaling pathway
    704930 (SPP1)
  09152 Endocrine system
   04929 GnRH secretion
    704930 (SPP1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    704930 (SPP1)
SSDB
Motif
Pfam: Osteopontin
Other DBs
NCBI-GeneID: 704930
NCBI-ProteinID: NP_001248735
Ensembl: ENSMMUG00000008793
UniProt: F7F5M6 G8F1F1
LinkDB
Position
5
AA seq 314 aa
MRIAVISFCLLGIAYALPVKQADSGSSEEKQLYNKYPDAVATWLKPDPSQKQNLLAPQNA
VSSEETNDFKQETLPSKSNESHDHMDDVDDEDDDDHVDSQDSIDSNDSDEVDDTDDSHQS
DESHHSDESDELVTDFPTDLPATEVFTPVVPTVDIYDGRGDSVAYGLRSKSKKFRRPDIQ
YPDATDEDITSHVESEELNGAYKAIPVVQGLNVPSDWDSRGKDSHETSQLDDHSAETHSH
KHSRLYKRKASDDSNEHSDVIDSQERSKISHEFHSHEFHSHEDMLVVDPKSKEEDKHLKF
RISHELDSASSEVN
NT seq 945 nt   +upstreamnt  +downstreamnt
atgagaattgcagtgatttccttttgcctcttaggcatcgcctatgccctgccagttaaa
caggctgattctggaagttctgaggaaaagcagctttacaacaaatacccagatgctgtg
gccacatggttaaagcctgacccatctcagaagcagaatctcctagccccacagaatgct
gtgtcttccgaagaaaccaatgactttaaacaagagacccttccaagtaagtctaacgaa
agccatgaccacatggatgatgtggatgatgaagatgatgatgaccatgtggacagccag
gactccatcgactcgaatgactctgatgaagtagatgacactgatgattctcaccagtct
gatgagtctcaccattctgatgaatccgatgaactggttactgattttcccacagacctg
ccagcaactgaagttttcactccagttgtccccacagtagacatatatgatggccgaggt
gatagtgtggcttatggactgaggtcaaaatctaagaagttccgcagacctgacatccag
taccctgatgctacagacgaggacatcacctcacatgtggaaagcgaggagttgaatggt
gcatacaaggccatcccagttgtccagggcctgaacgtgccttctgattgggacagccgt
gggaaggacagtcatgaaacgagtcagctggatgaccacagtgctgaaacccacagccac
aagcactccagattatataagcggaaagccagtgatgacagcaatgagcattccgatgtg
attgacagtcaggaacgttccaaaatcagccatgaattccacagccatgaatttcacagc
catgaagatatgctggttgtagaccctaaaagtaaggaagaagataaacacctgaaattt
cgcatttctcatgaattagatagtgcatcttctgaggtcaattaa

KEGG   Macaca mulatta (rhesus monkey): 705413
Entry
705413            CDS       T01028                                 

Gene name
THBS1
Definition
(RefSeq) thrombospondin-1
  KO
K16857  thrombospondin 1
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04015  Rap1 signaling pathway
mcc04115  p53 signaling pathway
mcc04145  Phagosome
mcc04151  PI3K-Akt signaling pathway
mcc04350  TGF-beta signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05144  Malaria
mcc05165  Human papillomavirus infection
mcc05205  Proteoglycans in cancer
mcc05206  MicroRNAs in cancer
mcc05219  Bladder cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04015 Rap1 signaling pathway
    705413 (THBS1)
   04350 TGF-beta signaling pathway
    705413 (THBS1)
   04151 PI3K-Akt signaling pathway
    705413 (THBS1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    705413 (THBS1)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    705413 (THBS1)
  09143 Cell growth and death
   04115 p53 signaling pathway
    705413 (THBS1)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    705413 (THBS1)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    705413 (THBS1)
   05205 Proteoglycans in cancer
    705413 (THBS1)
  09162 Cancer: specific types
   05219 Bladder cancer
    705413 (THBS1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    705413 (THBS1)
  09174 Infectious disease: parasitic
   05144 Malaria
    705413 (THBS1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:mcc04131]
    705413 (THBS1)
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    705413 (THBS1)
   04990 Domain-containing proteins not elsewhere classified [BR:mcc04990]
    705413 (THBS1)
Membrane trafficking [BR:mcc04131]
 Endocytosis
  Phagocytosis
   Opsonins
    705413 (THBS1)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   705413 (THBS1)
Domain-containing proteins not elsewhere classified [BR:mcc04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   705413 (THBS1)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC EGF_3 TSP1_ADAMTS EGF_CA Laminin_G_3 cEGF Laminin_G_2 TSP1_CCN EGF Laminin_G_1
Other DBs
NCBI-GeneID: 705413
NCBI-ProteinID: XP_014997402
Ensembl: ENSMMUG00000006035
UniProt: F6WIY2
LinkDB
Position
7
AA seq 1170 aa
MGLAWGLGVLFLLHVCGTNRIPESGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFR
IEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVV
SNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELD
VPIQSVFTRDLASIARLRIAKGGVNDNFQGVLQNVRFVFGTTPEDILRNKGCSSSTSVLL
TLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRK
VTEENKELANELRRPPLCYHNGVQYRNNEEWTVDSCTECRCQNSVTICKKVSCPIMPCSN
ATVPDGECCPRCWPSDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQT
RTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARE
TKACKKDACPINGGWGPWSPWDICSVTCGGGVQKRSRLCNNPTPQFGGKDCIGDVTENQI
CNKQDCPIDGCLSNPCFAGVKCTSYPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACF
NHNGEHRCKNTDPGYNCLPCPPRFTGSQPFGQGVEYATANKQVCKPRNPCTDGTHDCNKN
AKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNEDLVCVANATYHCKKDNCPN
LPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYN
HNPDQADTDSNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEH
NPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDN
DGIPDDRDNCRLVPNPDQKDSDGDGRGDACKDDFDHDNVPDIDDICPENVDISETDFRRF
QMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDD
DYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNAL
WHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYD
KTYAGGRLGLFVFSQEMVFFSDLKYECRDP
NT seq 3513 nt   +upstreamnt  +downstreamnt
atggggctggcctggggactaggcgtcctgttcctgttgcatgtgtgcggcaccaaccgc
attccagagtctgggggagacaacagcgtgtttgacatctttgaactcaccggggctgcc
cgcaaggggtctgggcgccgactggtgaagggtcctgacccttctagcccagctttccgc
atcgaggatgccaacctgatcccccctgtgcctgatgacaagttccaagacctggtggat
gctgtgcgggccgagaagggtttcctgcttctggcctccctgaggcagatgaagaagacc
cggggcacgctgctggccctggagcggaaagaccactctggccaggtcttcagcgtggtg
tccaacggcaaggcgggcaccctggacctcagcctgaccgtccaaggaaagcagcacgtg
gtgtctgtggaagaagctctcctggcaaccggccagtggaagagcatcaccctgtttgtg
caggaagacagggcccagctgtacatcgactgtgaaaagatggagaatgccgagctggac
gtccccatccaaagcgtcttcaccagagacctggccagcatcgccagactccgcatcgca
aaggggggcgtcaatgacaatttccagggggtgctgcagaatgtgaggttcgtctttgga
accacaccagaagacatcctcaggaacaaaggctgctccagctctaccagtgtcctcctc
acccttgacaacaacgtggtgaatggttccagccctgccatccgcactaactacattggc
cacaagacaaaggatctgcaagccatctgcggcatctcctgtgatgagctgtccagcatg
gtcctggaactcaggggcctgcgcaccatcgtgaccacgctgcaggacagcatccgcaaa
gtgactgaagagaacaaagagttggccaatgagctgaggcggcctcccctgtgctatcac
aacggagttcagtacagaaataacgaggaatggactgttgatagctgcaccgagtgtcgc
tgtcagaactcagttaccatctgcaaaaaggtgtcctgtcccatcatgccctgctccaat
gccacagttcccgatggagaatgctgtcctcgctgttggcccagcgactctgcggacgat
ggctggtctccatggtctgagtggacgtcctgttctacaagctgtggcaacggaattcag
cagcgtggccgctcctgtgatagcctcaacaaccgatgtgagggctcctcggtccagaca
cgcacctgccacattcaggagtgcgacaagagatttaaacaggatggtggctggagccac
tggtccccgtggtcatcttgttctgtgacttgtggtgacggtgtgatcacaaggatccgg
ctctgcaactctcccagcccccagatgaacgggaaaccctgtgaaggcgaagcacgggag
accaaagcctgcaagaaagatgcctgccccatcaatggaggctggggtccttggtcacca
tgggacatctgttctgtcacctgtggaggaggggtacagaaacgtagccggctctgcaac
aaccccacaccccagtttggaggcaaggactgcattggtgacgtgacagaaaaccagatc
tgcaacaagcaggactgtccaattgatggatgcctgtccaatccctgctttgccggtgtg
aagtgtactagctaccctgatggcagctggaaatgtggtgcttgtccgcctggttacagt
ggaaatggcatccagtgcacagatgttgatgagtgcaaagaagtgcctgatgcctgcttc
aaccacaacggagagcacaggtgtaagaatacggaccccggctacaactgcctgccctgc
cccccgcgcttcaccggctcacagccctttggccagggcgtcgaatatgccacagccaac
aaacaggtgtgcaagccccgtaacccctgcacggatggaacccacgactgcaacaagaac
gccaagtgcaactacctgggccactacagcgaccccatgtaccgctgcgagtgcaagccc
ggctacgctggcaatggcatcatctgcggggaggacacagacctggatggctggcccaat
gaggacctggtgtgcgtggccaatgcgacttaccactgcaaaaaggataattgtcccaac
ctccccaactcaggccaggaagactatgacaaggatggaattggtgatgcctgtgatgat
gacgatgacaatgataaaattccagatgacagggacaactgtccattccattacaaccca
gctcagtatgactatgacagagatgacgtgggagaccgctgtgacaactgtccctacaac
cacaacccagatcaggcagacacagacagcaatggggaaggagacgcctgtgctgcagac
attgacggggacggtatcctcaatgaacgggacaactgccagtacgtctacaatgtggac
cagagagacactgacatggatggggttggagatcagtgtgacaactgccccttggaacac
aatccagatcagctggactctgactcagaccgcattggagatacctgtgacaacaatcag
gatattgatgaagatggccaccagaacaatctggacaactgtccctatgtgcccaatgcc
aaccaggctgaccatgacaaagatggcaagggagatgcctgtgaccacgatgatgacaac
gatggcattcctgatgacagggacaactgcagacttgtgcccaatcccgaccagaaggac
tctgacggtgatggtcgaggtgatgcttgcaaagatgattttgaccatgacaatgtgcca
gacattgatgacatctgtcctgagaatgttgacatcagtgagaccgatttccgccgattc
cagatgattcctctggatcccaaagggacatcccaaaatgaccctaactgggttgtacgc
catcagggcaaagaactcgtccagacggtcaactgtgaccctggactcgctgtaggttat
gacgagtttaatgctgtggacttcagtggcaccttcttcatcaacaccgaaagggacgat
gactatgctggatttgtctttggctaccagtccagcagccgcttctatgttgtgatgtgg
aagcaagtcacccagtcctactgggacaccaaccccacgagggctcaggggtactcaggc
ctttctgtgaaagttgtgaactccaccacggggcctggcgagcacctgcggaatgccctg
tggcacacaggaaacacccctggccaggtgcgcaccctgtggcatgaccctcgtcacata
ggctggaaagatttcaccgcctacagatggcgtctcagccacaggccaaagacgggtttc
attagagtggtgatgtatgaagggaagaaaatcatggctgactcaggacccatctatgat
aaaacctatgctggtggtagactagggttgtttgtcttctctcaagaaatggtgttcttc
tccgacctgaaatacgaatgtagagatccctaa

KEGG   Macaca mulatta (rhesus monkey): 707892
Entry
707892            CDS       T01028                                 

Gene name
LAMB2
Definition
(RefSeq) laminin subunit beta-2 precursor
  KO
K06243  laminin, beta 2
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    707892 (LAMB2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    707892 (LAMB2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    707892 (LAMB2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    707892 (LAMB2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    707892 (LAMB2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    707892 (LAMB2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    707892 (LAMB2)
   05145 Toxoplasmosis
    707892 (LAMB2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N F5_F8_type_C Sec5
Other DBs
NCBI-GeneID: 707892
NCBI-ProteinID: NP_001181397
Ensembl: ENSMMUG00000002952
UniProt: I0FRM8 G7MIN2
LinkDB
Position
2
AA seq 1798 aa
MELTSREGGRGQPLPWELRLGLLLSVLAATLAQAPAPDVPGCSRGSCYPATGDLLVGRAD
RLTASSTCGLNGPQPYCIVSHLQDEKKCFLCDSRRPFSARDNPHSHRIQNVVTSFAPQRR
AAWWQSENGIPAVTIQLDLEAEFHFTHLIMTFKTFRPAAMLVERSADFGRTWHVYRYFSY
DCGADFPGVPLAPPRHWDDVVCESRYSEIEPSTEGEVIYRVLDPAIPIPDPYSSRIQNLL
KITNLRVNLTRLHTLGDNLLDPRREIREKYYYALYELVVRGNCFCYGHASECAPAPGAPA
HAEGMVHGACICKHNTRGLNCEQCQDFYHDLPWHPAEDGHSHACRKCECHGHTHSCHFDM
AVYLASGNVSGGVCDGCQHNTAGRHCELCRPFFYRDPTKDLRDPAVCRSCDCDPMGSQDG
GRCDSHDDPALGLVSGQCRCKEHVVGTRCQQCRDGFFGLSISDPLGCRRCQCNARGTVPG
STPCDPNSGSCYCKRLVTGRGCDRCLPGHWGLSHDLLGCRPCDCDVGGALDPQCDEATGQ
CHCRQHMVGRRCEQVQPGYFRPFLDHLTWEAEDTRGQVLDVVERLVTPGETPSWTGSGFV
RLREGQALEFLVASVPKAMDYDLLLRLEPQVPEQWAELELTLQRPGPVPAHSLCGHVLPK
DDRIQGTLQPHTRYMVFPNPVCLEPAISYKLHLKLVRTGGSAQPETPYSGPGLLIDSLVL
LPRALVLEMFSGGNAAALERRATFERYQCHEEGLVPSKTSPSEACAPLLLSLSTLIYNGA
LSCQCNRQGSLSSECNPHGGQCLCKPGVVGRRCDLCAPGYYGFGPTGCQACQCSPEGALS
SLCEKTSGQCPCRTGTFGLHCDRCQRGQWGFPSCRPCVCNGHADECDTHTGACLGCRDHT
GGEHCERCIAGFHGDPQLPYGGQCRPCPCPEGPGSQRHFATSCHRDEYSQKIVCHCRAGY
TGLRCEACAPGHFGDPSRPGGRCQLCECSGNIDLMDPDACDPHTGQCLRCLHHTEGPHCA
HCKPGFHGQAARQSCHRCTCNLLGTNPQQCPSPDQCHCDRSSGQCPCLPNVQGPSCDHCA
PNFWNFTSGHGCHPCACHPRQARGPTCNEFTGQCYCRAGFGGRTCSECQELHWGDAGLQC
RACDCDPRGTDTPQCHRSTGHCSCRPGVSGVRCDQCARGFSGFFPACHPCHACFGDWDRV
VQDLAARTRRLEQRAQELQQTGVLGAFESSFWHMQEKLGIVQGIVGARNTSAASTAQLVE
ATEELRREIGEATEHLTQLEAELTDVQDENFNANHALSGLERDRLALNLTLRQLDQHLDL
LKHSNFLGAYDSIRHAHSQSAEAERCANTSALAVPSPVSNSASARHRTEALMDAQNEDFN
SKHMANQRALRKLSAHTHTLSLTGINELVCGAPGDAPCATSPCGGAGCRDEDGQPRCGGL
NCNGAVATADLALGRARHTQAELQRALAEGGSILSRVAETRRQASEAQQRAQAALDKANA
SRGQVEQANQELRELVQSVKDFLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAIA
ERVRSLADVDAILARTVGDVRRAEQLLQDARRARSRAEGEKQKAETVQAALEEAQRAQGV
AQGAIRGAVADTRDTEQTLYQVQERMTGAEQALRSAGERARQLDALLEALKLKWAGNSLA
ASTAEETAGSAQGRAQEAEQLLRGPLGDQYQTVKALAERKAQGVLAAQGRAEQLRDEARD
LLQAAQDKLQRLQELEGTYEENERALEGKAAQLDGLEARMRSVLQAINLQVQIYNTCQ
NT seq 5397 nt   +upstreamnt  +downstreamnt
atggagctgacctcaagggaaggagggaggggacagcctctgccctgggaacttcgactc
ggcctactgctaagcgtgctggctgccacactggcacaggcccctgccccggatgtgcct
ggctgttcgaggggaagctgctaccccgccacgggcgacctgctggtgggccgagctgac
agactgactgcctcatccacctgtggcctgaatggtccccagccctactgcatcgtcagt
cacttgcaggatgaaaagaagtgcttcctttgcgactcccggcgccccttctctgctaga
gacaacccacacagccatcgcatccagaatgtagtcaccagctttgcaccacagcggcgg
gcagcctggtggcagtcagagaatggtatccctgcagtcaccatccaactggacctggag
gctgagtttcatttcacacacctcattatgaccttcaagacatttcgccctgctgccatg
ctggtggaacgctcagcagactttggccgcacctggcatgtgtaccgatatttctcctat
gactgtggggctgacttcccaggagtcccactagcccctccacggcactgggatgatgta
gtctgtgagtcccgctactcagagattgagccatccactgaaggcgaggtcatctatcgt
gtgttggaccctgccatccctatcccagacccctacagctcacggattcagaacctgttg
aagatcaccaacttacgggtgaacctgactcgtctacacacattgggagacaacctactt
gacccacggagggagatccgagagaagtactactatgccctctatgagctggttgtacgt
ggcaactgcttctgctatggacacgcctcagagtgtgcacctgccccaggggcaccagcc
catgctgagggcatggtgcacggagcttgcatctgcaaacacaacacacgtggcctcaac
tgcgaacagtgtcaggatttctatcatgacctgccctggcatccggctgaggacggccat
agtcatgcctgtaggaagtgtgagtgccatgggcacacccacagctgccacttcgacatg
gctgtatacctggcatctggcaatgtgagtggaggtgtatgtgatggatgtcagcataac
acagctgggcgccactgtgagctctgtcggcccttcttctaccgtgacccaaccaaagac
ctgcgggatccagctgtgtgccgctcctgtgattgtgaccccatgggttctcaagacggc
ggtcgctgtgattcccatgatgaccctgcactgggactggtctcaggccagtgtcgctgc
aaagaacatgtggtgggcactcgctgccaacaatgccgtgatggcttctttgggctcagc
atcagtgaccctctgggctgccggcgatgtcaatgtaatgcacggggcacagtgcctggg
agtactccttgtgaccccaacagtggatcctgttattgcaaacgtctggtgactggacgt
ggatgtgaccgctgcctgcctggccactggggcctgagccacgacctactcggctgccgc
ccctgtgactgcgacgtgggtggtgccttggatccccagtgtgatgaggccacaggtcaa
tgccactgccgccagcacatggtcgggcgacgctgtgagcaggtgcaacctggctacttc
cggcccttcctggaccacctaacttgggaggctgaggacacccgagggcaggtgctcgat
gtggtggagcgcctggttactcctggggaaactccatcctggactggctcaggctttgtg
cggctacgggaaggtcaggccctggagttcctcgtggcctctgtgccgaaggccatggac
tatgatctgctgctgcgcttagaaccccaggtccctgagcaatgggcagagttggaactg
actttgcagcgtccagggcctgtgcctgctcacagcctgtgtgggcatgtgctgcccaag
gatgaccgcatccaagggactctgcaaccacacaccaggtacatggtgtttcccaatcct
gtctgccttgagcctgctatctcctacaagctgcatctgaagctggtacgcacaggggga
agtgcccagcctgagactccctactctggacccggcctgctcattgactcgctggtgctg
ctgccccgtgccctggtgctagagatgtttagtgggggtaatgctgctgccctggagcgc
cgggccacctttgaacgctaccaatgccatgaggagggtctggtgcccagcaagacttct
ccctctgaggcctgcgcgccgctcctcctcagcctgtccaccctcatctacaatggtgcc
ctgtcatgtcagtgcaaccgtcaaggttcactgagttctgagtgcaaccctcatggcggt
cagtgcctgtgcaagcctggagtggttgggcgccgctgtgacctctgtgcccctggctac
tatggctttggccccacaggctgtcaagcctgccagtgcagccccgagggggcgctcagc
agtctctgtgaaaagaccagtgggcaatgtccctgtcgaactggtacctttgggcttcac
tgtgaccgctgccagcgtggccagtggggattccctagctgccggccatgtgtctgcaat
gggcatgcagatgagtgcgacacccacacaggcgcttgcctgggctgccgtgatcacaca
gggggtgagcactgtgaaaggtgcattgctggtttccacggggacccacaactgccatat
gggggccagtgccggccctgtccatgtcctgaaggccctgggagccaacggcactttgct
acttcttgccaccgggatgaatattcccagaagattgtgtgccactgccgggcaggctac
acggggctgcgatgtgaagcttgtgcccctgggcactttggggacccatcaaggccaggt
ggccggtgccaactgtgtgagtgcagtgggaacattgacctaatggatcctgatgcctgt
gacccccacacggggcaatgcctgcgctgtttacaccacacagagggtccacactgtgcc
cactgcaagcctggcttccacgggcaggctgcccgacagagctgtcaccgctgcacctgc
aacctgctgggcacaaatccccagcagtgcccgtctcctgaccagtgccactgtgatcga
agcagtgggcagtgcccatgcctccccaatgtccagggccctagttgtgaccactgtgcc
cccaacttctggaacttcaccagtggccatggttgccatccttgtgcctgccacccaaga
caggccagaggccccacctgcaatgagttcacagggcagtgctactgccgtgccggcttt
ggagggcggacttgttctgagtgccaagagctccactggggagacgctgggttgcagtgt
cgtgcctgtgactgtgaccctcgtggaacagatacacctcagtgtcaccgctccacaggt
cactgcagctgccgcccaggcgtgtctggtgtgcgctgtgaccagtgtgcccgtggcttc
tcaggattctttcctgcctgccatccctgccatgcatgcttcggggattgggaccgagtg
gtgcaggacttggctgcccgcacacggcgcctagagcagcgggcacaggagttgcagcag
acgggtgtgctgggtgcctttgagagcagcttctggcatatgcaggagaagctgggcatt
gtgcagggcatcgtaggtgcccgcaacacctcagccgcctccactgcacagcttgtggag
gccacagaggagctgcggcgtgaaattggggaggccactgagcacctgactcaactggaa
gcagagttgacagatgtgcaggatgagaacttcaatgccaaccatgcactaagtggtctg
gagcgagacaggcttgcacttaatctcacactgcggcagctggaccagcatcttgacttg
ctcaaacattcaaacttcctgggtgcctatgacagcatccggcatgcccatagccagtct
gcagaggcagaacgttgtgccaacacctcggccctggcagtacctagccctgtgagcaac
tcggcaagtgctcggcatcggacagaggcactgatggatgcccagaatgaggacttcaac
agcaaacacatggccaaccagcgggccctgcgcaagctctctgcccatacccacaccctg
agcctgacaggcataaacgagctggtgtgtggggcaccaggggatgcaccctgtgctaca
agcccttgtgggggtgccggctgtcgagatgaggatgggcagccgcgctgtgggggcctc
aactgcaatggggcagtggctacagcagacctagcactgggccgggcccggcacacacag
gcagagctgcagcgggcactggcagaaggtggtagcatcctcagcagagtggctgagact
cgtcggcaggcaagcgaggcacagcagcgggcccaggcagccctggacaaggctaatgct
tccaggggacaggtggaacaggccaaccaggaacttcgagaacttgtccagagtgtgaag
gacttcctcaaccaggaaggggctgatcctgatagcattgagatggtggccacacgggtg
ctagagctctccatcccagcttcagctgagcagatccagcacctggcgggcgcaattgca
gagcgagtccggagcctggcggatgtggacgcgatcctggcacgtactgtaggagatgtg
cgtcgtgccgagcagttattgcaggatgcacggcgggcaaggagccgggctgagggtgag
aaacagaaggcggagacagtacaggcagcactggaggaggcccagcgggcacagggtgtt
gcccagggtgccatccggggggcagtggctgacacacgggacacagagcagaccctatac
caggtacaggagaggatgacaggtgcagagcaggcactgagatctgcaggtgaaagggct
cggcagttggatgctctcctggaggctctgaagttgaaatgggcgggaaatagtctggca
gcctctacagcagaagaaacggcaggcagtgcccagggtcgtgcccaggaggctgagcag
ctgctacggggtcctctgggtgatcagtaccagacggtgaaggccctggctgagcgcaag
gcccagggtgtgctagctgcacagggaagggcagagcaactgcgggatgaggctcgggac
ctgttgcaggctgctcaggacaagctgcagcggctacaggaactggaaggcacctatgag
gagaatgagcgggcactggagggtaaggcagcccagctggacgggctggaggccaggatg
cgcagcgtgcttcaagccatcaacttgcaggtgcagatctacaacacctgccagtga

KEGG   Macaca mulatta (rhesus monkey): 708165
Entry
708165            CDS       T01028                                 

Gene name
THBS2
Definition
(RefSeq) thrombospondin-2
  KO
K04659  thrombospondin 2/3/4/5
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04145  Phagosome
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05144  Malaria
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    708165 (THBS2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    708165 (THBS2)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    708165 (THBS2)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    708165 (THBS2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    708165 (THBS2)
  09174 Infectious disease: parasitic
   05144 Malaria
    708165 (THBS2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:mcc04131]
    708165 (THBS2)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    708165 (THBS2)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    708165 (THBS2)
   04990 Domain-containing proteins not elsewhere classified [BR:mcc04990]
    708165 (THBS2)
Membrane trafficking [BR:mcc04131]
 Endocytosis
  Phagocytosis
   Opsonins
    708165 (THBS2)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   708165 (THBS2)
  Exosomal proteins of colorectal cancer cells
   708165 (THBS2)
  Exosomal proteins of bladder cancer cells
   708165 (THBS2)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   708165 (THBS2)
Domain-containing proteins not elsewhere classified [BR:mcc04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   708165 (THBS2)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC EGF_3 TSP1_ADAMTS EGF_CA EGF cEGF TSP1_CCN Laminin_G_2
Other DBs
NCBI-GeneID: 708165
NCBI-ProteinID: XP_014993209
LinkDB
Position
4
AA seq 1172 aa
MVWGLVLLALWVWPSTQAGHQDKGTTFDLFSISNINRKTIGAKQFRGPDPGVPAYRFVRF
DYIPPVNAEDLSKITRSMRQKEGFFLTAQLKQDGKSRGTLLALEGPGLSQRQFEIVSNGP
ADTLDLIYWIDGTRHVVSLEDVGLADSQWKNVTVQVAGETYSLHVGCDLIDSFALDEPFY
EHLQAEKSRMYVAKGSARESHFRGLLQNVHLVFENSVEDILSKKGCQQGQGAEINAISEN
TETLRLGPHVTTEYVGPSSERRPEVCERSCEELGNMVQELSGLHVLVNQLSENLKRVSND
NQFLWELIGGPPKTRNMSACWQDGRFFAENETWVVDSCTTCTCKKFKTICHQITCPPATC
ASPSFVEGECCPSCLHSVDGEEGWSPWAEWTECSVTCGSGTQQRGRSCDVTSNTCLGPSI
QTRACSLSKCDTRIRQDGGWSHWSPWSSCSVTCGVGNITRIRLCNSPVPQMGGKNCKGSG
RETKACQGAPCPIDGRWSPWSPWSACTVTCAGGIRERTRVCNSPEPQYGGKACVGDVQER
QMCNKRSCPVDGCLSNPCFPGAQCSSFPDGSWSCGSCPVGFLGNGTHCEDLDECALVPDI
CFSTSKVPRCVNTQPGFHCLPCPPRYRGSQPTGVGLEAAKTEKQVCEPENPCKDKTHNCH
KHAECIYLGHFSDPMYKCECQTGYAGDGLICGEDSDLDGWPNLNLVCATNATYHCIKDNC
PHLPNSGQEDFDKDGIGDACDDDDDNDGVTDEKDNCQLLFNPRQADYDKDEVGDRCDNCP
YVHNPAQIDTDNNGEGDACSVDIDGDDVFNERDNCPYVYNTDQRDTDGDGVGDHCDNCPL
VHNPDQTDMDNDLVGDQCDNNEDIDDDGHQNNQDNCPYISNANQADHDNDGQGDACDHDD
DNDGVPDDRDNCRLVFNPDQEDLDGDGRGDICKDDFDNDNVPDIDDVCPENNAISETDFR
NFQMVPLDPKGTTQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDR
DDDYAGFVFGYQSSSRFYVVMWKQVTQTYWEDQPTRAYGYSGVSLKVVNSTTGTGEHLRN
ALWHTGNTPGQVRTLWHDPRNIGWKDYTAYRWHLTHRPKTGYIRVLVHEGKQVMADSGPI
YDQTYAGGRLGLFVFSQEMVYFSDLKYECRDV
NT seq 3519 nt   +upstreamnt  +downstreamnt
atggtctgggggctggtcctgctggctctgtgggtgtggcccagcacacaagctggtcac
caggacaaaggcacgaccttcgaccttttcagtatcagcaacatcaaccgcaagaccatt
ggcgccaagcagttccgcgggcccgaccccggcgtgccggcttaccgcttcgtgcgcttc
gactacatcccgccggtgaacgcagaggacctcagcaagatcaccaggagcatgcggcag
aaggagggcttctttctcacggcccagctcaagcaggacggcaagtccaggggcacgctg
ctggctctggagggccccggtctctcccagaggcagttcgagatcgtctccaatggccct
gcggacacgctggacctcatctactggatcgacggcacccggcacgtggtctccctggag
gacgtgggcctggctgactcgcagtggaagaacgtcaccgtgcaggtggctggagagacc
tacagcctgcacgtgggctgcgacctcatagacagcttcgccctggacgagcccttctac
gagcacctgcaggcggaaaagagccggatgtacgtggccaaaggctcagccagagagagt
cacttcaggggtttgcttcagaatgtccacctcgtgtttgaaaactctgtggaagatatt
ctaagcaagaagggttgccagcaaggccagggagctgagatcaacgccatcagtgagaac
acggagacactgcgcctgggtccgcatgtcaccaccgagtacgtgggccccagctcggag
aggaggcctgaggtgtgcgaacgctcctgcgaggagctggggaacatggtccaggagctt
tccgggctccacgtcctcgtgaaccagctcagcgagaacctcaagagagtgtcgaacgat
aaccagtttctctgggagctcattggtggccctcctaagacaaggaacatgtcagcttgc
tggcaggatggccggttctttgcggaaaatgagacgtgggtggtggacagctgcaccacg
tgtacctgcaagaaatttaaaaccatttgccaccaaatcacctgcccgcccgcgacctgc
gccagcccgtccttcgtggaaggcgaatgctgcccctcctgcctccactcggtggacggt
gaggagggctggtctccgtgggcagagtggaccgagtgctcggtgacgtgtggctctggg
acgcagcagagaggccggtcctgtgacgtcaccagcaacacctgcttggggccctccatc
cagacacgggcttgcagcctgagcaagtgtgacacccgcatccggcaggatggcggctgg
agccactggtcaccttggtcttcatgctctgtgacctgtggagtcggcaacatcacacgc
attcgtctctgcaactccccagtgccccagatggggggcaagaattgcaaagggagtggt
cgggagaccaaagcctgccagggtgccccatgcccaatcgatggccgctggagcccctgg
tccccatggtcggcctgcactgtcacctgtgccggtgggatccgggagcgcacccgggtc
tgcaacagccctgagcctcagtacggagggaaggcctgcgtgggggacgtgcaggaacgc
cagatgtgcaacaagaggagctgccccgtggatggctgtttatccaacccctgcttcccg
ggagcccagtgcagcagcttccccgacgggtcctggtcctgcggctcctgccccgtgggc
ttcttgggtaatggcacccactgtgaggacctggacgagtgtgccctggtccccgacatc
tgcttctccaccagcaaggtgcctcgctgtgtcaacactcagcccggcttccactgcctg
ccctgcccaccccgatacagagggagccagcccactggggttggcctggaggcagccaag
acggaaaagcaagtgtgtgagcccgaaaacccatgtaaggacaagacacacaactgccac
aagcacgcagagtgcatctacctgggccacttcagcgaccccatgtacaaatgcgagtgc
cagacaggctacgcgggcgacgggctcatctgcggggaggactcggacctggacggctgg
cccaacctcaacctggtctgcgccaccaacgccacctaccactgcatcaaggataactgc
ccccatctgccgaattctgggcaggaagactttgacaaggacgggattggcgatgcctgt
gatgatgacgacgacaatgatggtgtgaccgatgagaaggacaactgccagctcctcttc
aatccccgccaggctgactacgacaaggatgaggttggggaccgctgtgacaactgtcct
tacgtgcacaaccccgcccagatcgacacagacaacaacggagagggtgacgcctgctcc
gtggacattgacggggacgatgtcttcaatgaacgagacaattgtccctacgtctacaac
actgaccagagggacacggacggtgacggtgtgggggatcactgtgacaactgccccctg
gtgcacaaccccgaccagactgacatggacaacgaccttgtcggggaccagtgtgacaac
aatgaggacatagatgacgacggccaccagaacaaccaggacaactgcccctacatctcc
aacgccaaccaggccgaccacgacaacgacggccaaggcgacgcctgcgatcacgatgat
gacaacgatggcgtccccgacgacagggacaactgccggcttgtgttcaacccagaccag
gaggacttggacggtgatggacggggtgatatttgtaaagatgattttgacaatgacaac
gtcccagatattgatgatgtgtgtcctgaaaacaatgccatcagtgagacagacttcagg
aacttccagatggtccccttggatcccaaagggaccacccaaattgatcccaactgggtc
attcgccatcaaggcaaggagctggttcagacggccaactcggaccctggcatcgctgta
ggttttgacgagtttgggtccgtggacttcagtggcacgttctatgtaaacactgaccgg
gatgacgactatgccggcttcgtctttgggtaccagtcaagcagccgcttctacgtggtg
atgtggaagcaggtgacgcagacctactgggaggaccagccgacacgggcctatggctac
tccggcgtgtccctcaaggtggtgaactccaccacggggacgggcgagcacctgaggaac
gcgctgtggcacacggggaacacaccggggcaggtgcgaaccttatggcatgaccccagg
aacattggctggaaggactacacagcctacaggtggcacctgactcacaggcccaagact
ggctacatcagagtcttagtgcatgaaggaaaacaggtcatggcagactcaggacctatc
tatgaccaaacctacgctggcgggcggctgggtctatttgtcttctctcaagaaatggtc
tatttctcagacctcaagtacgaatgcagagatgtttaa

KEGG   Macaca mulatta (rhesus monkey): 708287
Entry
708287            CDS       T01028                                 

Gene name
COL4A4
Definition
(RefSeq) collagen alpha-4(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    708287 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    708287 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    708287 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    708287 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    708287 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    708287 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    708287 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    708287 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    708287 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    708287 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    708287 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    708287 (COL4A4)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   708287 (COL4A4)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   708287 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 708287
NCBI-ProteinID: XP_001110249
Ensembl: ENSMMUG00000013233
LinkDB
Position
12
AA seq 1692 aa
MSSLHVVLMRCSFRLTKSLATGPWSLILILFSVQYVHGSGKKYVGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPFGPLGAPGPTGLSGEKGMRGDRGPPGTAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGKCGYNGSRGDPGFPGGRGALGPGGPPGHPGEKGEKGNSVFILG
AIKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKG
QMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMIGLPGPPGHKGESGIGAKGE
KGIPGFPGPWGDPGSYGSPGFPGLKGELGLVGDPGPFGFLGPKGDPGDRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGAPGLQGLPGSSVTYCSVGNPGPQGIKGKVGPPGGR
GSKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGL
PGKPGASGPPGNKGAKGDVVISRVKGHKGERGPDGPPGFPGQPGSHGLDGHAGEKGDPGL
PGDHEDAIAGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGRPGVRGPDGLKG
QKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP
PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPG
IKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGDPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPG
PPGPKGPRGLPGFPGFPGERGKPGAEGCPGTKGEPGEKGMSGFPGDQGVRGAKGTIGPPG
DEGEMAIISKKGKPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFYRGEPGEK
GQPGPPGPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPAPA
SHFGPPGPKGEPGSSGCPGHFGASGEQGLPGVQGPRGSPGWPGPPGSSGPPGCPGDRGMP
GLRGQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVG
PPGPVGMPGLKGERGDPGSPGISPPGPCGEKGLPGPPGRSGPPGPAGATGRVPKDIPDPG
PSGDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGLPGLPGPPGPPGYKGFPGCD
GKDGQKGPMGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADADDCPRI
PGLPGVPGLRGPEGAMGLPGMRGPPGPGCKGEPGLDGRRGMDGIPGSPGPPGRKGDTGED
GYPGGPGPPGPTGDPGPKGFGAGYLSGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEG
QEKAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEA
IRPYVSRCAVCEAPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMS
PGSCLEDFRAAPFLECQGRQGTCHFFANEYSFWLTTVKADLQFSSAPAPDTLKESQAQRQ
KISRCQVCVKYS
NT seq 5079 nt   +upstreamnt  +downstreamnt
atgtcgtctctgcacgtagtactaatgaggtgctccttcagattgaccaagtcattggcc
acaggtccctggtcacttatactcattctcttttctgtacaatacgtacatgggagtgga
aagaaatacgttggtccttgtggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccggggccacagggtccatttgggcccctggga
gccccaggacccactgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggaacagcaggggacaaaggagataagggcccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggtaagtgtggc
tacaatggctcaagaggtgacccagggtttccaggaggaagaggagctcttggtccagga
ggccccccaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccattaaaggtattcagggagacagaggcgacccaggactgcccggcttaccaggatct
tggggtgcaggaggaccggcgggccccacagggtatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggagtaaagggg
caaatgggagacccgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcctggaatg
attggactgccaggaccaccaggacacaagggagaatctggtattggggcaaaaggagaa
aaaggcattcctggatttccagggccttggggggatcctggttcctacggatctccaggt
ttcccaggattaaagggagaactaggactggttggagatcctgggccatttggatttctt
ggcccaaagggggatcctggagaccgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccaggattccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagcaccaggcctgcagggcctcccaggatcaagtgtgacatactgt
agtgttgggaaccctggaccacaaggaataaaaggcaaagtgggtcccccaggaggaaga
ggctcaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggtcccatgggc
ccccctggacctccaggacttcccgggaggcaggggagtaagggagacttggggctccct
ggctggcttggaacaaaaggtgacccaggacctcctggtgctgaaggacctccagggcta
ccaggaaagcctggtgcctctggaccacccggcaacaaaggggcgaagggtgacgtggtc
atatcaagagttaaaggacacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcaaccaggatcgcatggtctggatggacatgctggagaaaaaggggatccaggactc
ccaggggatcatgaagacgcgattgcaggtggtaaaggatttcctggacctctgggtccc
ccaggcaaagcaggacctgtggggcccccgggactgggatttcctggtccaccaggagag
cgaggccacccaggagttccaggccgcccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacagtttcttgcaatgtgacctatcctgggaggcaaggccctccaggt
tttgatggacctccaggtccaaagggatttccaggtccccaaggtgcccctgggctgagc
ggttcagatgggcataaaggcagacctggcacaccaggaacatcggaaataccaggtcca
cctggttttcgtggtgacataggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcctgggtcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggacccccgggaaagaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatatccaggtgctgaagggccagctggcattcct
ggattcccaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccacctggccattcctgtgaaagaggtgctccaggaataccagggcaaccggga
ctccctggggatccaggtagcccaggtgctccaggtgggaaaggacagccgggagatgtg
gggcctcctgggccagctggaatgaaaggtctccctggactcccaggacggcctggggca
cacggtcccccgggcctcccaggaatcccaggtccctttggggatgatgggctacctggt
cctccaggtccaaaaggaccccgggggctgcctggtttcccaggttttcccggagaacga
ggaaagcctggtgcagagggatgtcctggcacaaagggagaacccggagagaagggcatg
tctggctttcccggagaccagggagtgagaggggccaaaggaaccataggacctcctgga
gatgaaggagaaatggctatcatttccaaaaagggaaaacctggggaacctggacctcct
ggagatgatggattcccaggagaaagaggtgataaaggaacacccgggatgcaagggaga
agaggagagccgggaagatatggaccacctggattttacagaggggaacctggtgagaaa
ggtcagccagggcctcctggacccccaggccctccaggctcaactggtctaagagggttc
attggttttccaggacttccaggtgaccagggtgagccaggttctccaggtccccctgga
ttttcaggaatcgatggagcaagaggacctaaaggaaacaaaggtgaccctgcccctgcc
agtcactttggtccacctggtccaaagggtgagccaggtagctctggatgtccagggcat
tttggagcatccggagagcagggcttgcctggcgttcaagggcccagaggatcacctgga
tggccaggaccacctggctcctctggaccaccagggtgcccaggtgatcgggggatgcct
gggctgaggggacagccaggagaaatgggagaccctgggccaagaggcctccagggggat
ccagggataccaggtcctccaggaataaaaggtccttccgggtcacctggcctgaatggc
ttgcatggattgaagggtcagaaaggaaccaaaggtgcttcaggtttgcatgatgtgggg
ccacctggtcctgtgggaatgcctgggctaaaaggggagagaggagatcctgggagccca
ggaatctctcctccaggtccttgtggagaaaaaggtctcccaggtcccccagggagatca
ggaccacctggtcctgcaggtgccacaggaagagtgcctaaggacattcctgacccgggt
ccatctggagatcagggacctcctggtcccgatggcccaagaggagcacctgggcctcca
ggcctccctgggagtgttgacctcctgagaggggagccaggtgactgtggtctaccaggg
ctaccaggtctccccggcccaccaggccctccaggatacaaaggctttccaggatgtgat
ggaaaagatggccagaaaggaccaatgggattcccaggaccgcagggaccacatggattt
cctgggccacctggagagaagggtttacctggacctccagggagaaaaggacccactggt
cttccaggtcccagaggtgaaccggggccacctgcagatgcggatgactgtccccgaatc
ccagggcttcctggggtgccaggcctgagaggaccagaaggagccatgggactccctgga
atgagaggccccccaggaccagggtgcaaaggagagcctgggctggatggcaggaggggt
atggatggcattcctgggtctcctgggcctcccggacgtaaaggtgacacaggagaagac
ggctaccccggaggaccagggcctcctggtcccactggggatcctgggcccaaagggttt
ggcgctggatacctcagtggcttcctcctggttctccacagtcagacggaccaggagccc
acctgccccctgggcatgcccaggctctggactgggtatagtctgttatacctggaaggg
caagagaaagctcacaatcaagaccttggtctggcagggtcttgccttcccgtgtttagc
acgctgccctttgcctactgcaacatccaccaggtgtgccactatgcccagagaaacgac
agatcctactggctggccagcgctgcacccctccccatgatgccactctctgaagaggcg
atccgcccctacgtcagccgctgtgcggtatgcgaggccccggcccaggcggtggcggtg
cacagccaggaccagtccatccccccatgtccgcagacctggaggagcctctggattggg
tattcattcctgatgcacacaggagctggggaccaaggaggagggcaggccctcatgtca
ccgggtagctgcctggaagatttcagagcagcaccattccttgaatgccaaggccggcag
ggaacttgccacttttttgcaaatgagtatagcttctggctgacaacagtgaaagcagac
ttgcagttttcctctgctccagcaccagacaccttaaaagaaagccaggcccaacgccag
aaaatcagccggtgccaggtctgcgtgaagtatagctag

KEGG   Macaca mulatta (rhesus monkey): 708499
Entry
708499            CDS       T01028                                 

Gene name
VTN
Definition
(RefSeq) vitronectin
  KO
K06251  vitronectin
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04610  Complement and coagulation cascades
mcc05165  Human papillomavirus infection
mcc05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    708499 (VTN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    708499 (VTN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    708499 (VTN)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    708499 (VTN)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    708499 (VTN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    708499 (VTN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    708499 (VTN)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    708499 (VTN)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   708499 (VTN)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   708499 (VTN)
SSDB
Motif
Pfam: Hemopexin Somatomedin_B
Other DBs
NCBI-GeneID: 708499
NCBI-ProteinID: XP_001106884
Ensembl: ENSMMUG00000019148
LinkDB
Position
16
AA seq 479 aa
MAPLRPLLMLALLAWVAVADQESCKGRCTEGFNADKKCQCDELCSYYQSCCTDYTAECKP
QVTRGDVFTMPEDEYVTYDYGEEKNNATIHEQPEGSSPSPDLQAQPEGTPEQATVLNPEE
ESPAPEVGISKPEGTDSRPETLRPGTPELPAEEELCSGKPFDAFTDLKNGSLFAFRGQYC
YELDEKAVRPGYPKLIRDVWGIEGPIDAAFTRINCQGKTYLFKGSQYWRFEDGVLDPDYP
RNISDGFDGMPDNVDAALALPAHSYSGRERVYFFKGKQYWEYQFQHQPSQEECEGSSRSA
VFEHFAVMQRDSWEDIFELLFWGRTSAGMRQPQLISRDWHGVPGQVDAAMAGRIYVSGMA
PHPSLAKKQKFRRRNRKGYRSDRGHSRGRNQNSRRPSRPTWRLSLFSSEESNLGANNYDD
YRMDWLVPATCEPIQSVFFFSGDKYYRVNLRTRRVDAVDPPYPRSIAQYWLGCPAPGHL
NT seq 1440 nt   +upstreamnt  +downstreamnt
atggcacccctgagaccccttctcatgctggccctgctggcatgggttgctgtggctgac
caagagtcatgcaagggccgctgcactgagggcttcaatgctgacaagaagtgccagtgt
gatgagctctgctcttactaccagagctgctgcacagactatacggctgagtgcaagccc
caagtgactcgtggggatgtgttcactatgccggaggatgagtatgtgacctatgactac
ggtgaggagaaaaacaatgccaccatccatgaacagccggagggctcctccccgagccct
gacctccaggcccagcccgaagggactcctgagcaggcaactgttctgaaccctgaggaa
gaatcccctgcacctgaggtgggcatctctaagcctgaggggacggactcaaggcccgag
acccttcgtccagggacccctgagctcccagcagaggaggagctatgcagtgggaaaccc
ttcgacgccttcaccgacctcaagaacggttccctctttgccttccgagggcagtactgc
tatgaactggatgaaaaggcagtgaggcctgggtaccccaagctcatccgagatgtctgg
ggcatcgagggccccattgatgccgccttcacccgcatcaactgtcaggggaagacctac
ctcttcaagggtagtcagtactggcgctttgaggatggtgtcctggaccctgattacccc
cgaaatatctctgatggcttcgatggcatgccagacaatgtggacgcagccttggccctc
cctgcccatagctacagtggcagggagcgggtctacttcttcaaggggaaacagtactgg
gagtaccagttccagcaccagcccagtcaggaggagtgtgaaggcagctcccggtcggct
gtgtttgagcactttgccgtgatgcagcgggacagctgggaggacatcttcgagcttctt
ttctggggcagaacctctgctggtatgagacagccccagctcattagccgggactggcat
ggtgtgccagggcaagtggacgcagccatggctggccgcatctacgtctcaggcatggca
ccccacccctccttggccaagaagcaaaagtttaggcgtcgcaaccgcaaaggctaccgt
tcagaccgaggccacagccgtggccgcaaccagaactcccgccggccatcccgccccacg
tggcggctgtccttgttctccagtgaggagagtaacttgggagccaacaactatgatgac
tacaggatggactggcttgtgcctgccacctgtgaacccatccagagtgtcttcttcttc
tctggagacaagtactaccgagtcaaccttcgcacacggcgagtggacgctgtggaccct
ccctacccacgctccatcgcccagtactggctgggctgcccagctcctggccatctgtag

KEGG   Macaca mulatta (rhesus monkey): 708587
Entry
708587            CDS       T01028                                 

Gene name
TNN
Definition
(RefSeq) tenascin-N isoform X1
  KO
K06252  tenascin
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05165  Human papillomavirus infection
mcc05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    708587 (TNN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    708587 (TNN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    708587 (TNN)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    708587 (TNN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    708587 (TNN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    708587 (TNN)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   708587 (TNN)
SSDB
Motif
Pfam: fn3 Fibrinogen_C DUF4998 Interfer-bind Pur_ac_phosph_N EGF_2 EGF_Tenascin
Other DBs
NCBI-GeneID: 708587
NCBI-ProteinID: XP_028706456
Ensembl: ENSMMUG00000000545
LinkDB
Position
1
AA seq 1299 aa
MSLQEMFRFPLGLLLGSVLLVASAPATLEPPSCNNKEQQVTVSHTYKIDVPKSALVQVEA
DPQPLSDDGASLLALGEAGEEQNIVFRHNIRLQTPQKDCELAGTVQDLLARVKKLEEEMV
EMKEQCSAQRCCQGATDLSRRCSDHGTFSLETCSCHCEEGREGPACEQLACPGACSGHGR
CVDGRCLCHEPYVGADCGYPACPENCSGHGECVRGVCQCHEDFTSEDCSERRCPGDCSGH
GFCDTGECYCEEGFTGLDCAQVVAPQGLQLLKSTEESLLVSWEPSSQVDHYFLNYYPLGK
ELSGKQIRVPKEQHSYEILGLLPGTKYIVTLRNVKKDIFSSPQHLLATTDLAALGTAWVT
EETEHSLDVEWENPSTEVDYYKLRYGPMTGQEVTEVTVPKSSDPKSRYDITGLHPGTEYK
ITVVPMRGTLEGKPILLNGRTEIDSPTNVVTDRVTEDTATVSWKPVQAVIDKYVVRYTSA
DGDTKEMAVHKDESSTVLTGLKPGEAYKVYVWAERGNQGSKKADTNALTEIDSPANLVTD
RVTENTATISWDPVQATIDKYVVHYTSADNQETREVPVGKEQSSTVLTGLRPGVEYTVHV
WAQKGDRESKKANTNAPTDIDSPKNLVTDRVTENMATVSWDPVQATIDKYMVRYTSADGE
TREVPVGKEHSSTILTGLRPGMEYMVHVWAQKGDQESKKTDTKAQTDIDSPQNLVTDWVT
ENMATVSWDPVQATIDRYVVRYTSAKDGETREVPVGKEQNSTVLTGLRPGVEYTVHVWAQ
KGAQESKKADTKAQTDIDSPKNLVTDRVTENTATVSWDLVQATIDRYVVRYTSANGETRE
VPVGKEQSSTVLTGLRPGVEYTVHVWAQKGNQESKKADTKAQTEIDSPKNLVTDRVTENM
ATVSWDPVQATIDKYVVRYTSADGETREVPVGKEHSSTVLTGLRPGMEYMVHVWAQKGDQ
ESKKADTKAQTEIDPPRNLRPSAVTQSGGVLTWTPPSAQIHGYILTYQFPDGTVKEIQLG
REDQRFELQGLEQGATYPVSLVAFKGGLRSRNVSTTLSTVGARFPHPSDCSQVQQNSNAA
SGLYTIYLHGDSSRPLQVYCDMDTDGGGWIVFQRRNTGQLDFFKRWRTYVEGFGDPTKEF
WLGLDKLHNLTTGTPTRYEVRVDLQTANESAYAIYDFFQVASSKERYKLTVGKYRGTAGD
ALSYHNGWKFTTFDRDNDIALSNCALTHHGGWWYKNCHLANPNGRYGETKHSEGVNWEPW
KGHEFSIPYVELKIRPHGYSKEPVLGRKKRTLGGRLRTF
NT seq 3900 nt   +upstreamnt  +downstreamnt
atgagtctccaggagatgttccgcttccccttggggctcctgcttggctctgtgcttctg
gtggcttcggccccagccactctggagcctcccagctgcaacaacaaggagcaacaggtc
actgtcagccacacctacaagatcgatgtgcctaagtctgccttggttcaggtggaggct
gaccctcagcccctcagtgacgatggggcttcgcttttggccctgggtgaggccggggag
gaacagaacatcgtcttcaggcacaacatccgccttcagacgccacagaaggactgcgag
ttggcaggcactgtccaggacctcctggcccgggtgaaaaagctggaggaagagatggtg
gagatgaaggaacagtgtagtgcccagcgctgctgccagggagctactgatctgagccgc
cgctgcagcgaccacgggaccttctccctggagacctgcagctgccactgcgaggagggc
agggagggccccgcctgcgagcagctggcttgccccggggcgtgcagcggccacgggcgt
tgcgtggacgggcgctgcctgtgccacgagccctacgtgggtgccgactgcggctacccg
gcctgccctgagaactgcagcgggcatggcgagtgcgtgcgcggcgtgtgccagtgccac
gaagacttcacgtctgaggactgcagcgagcgacgctgtcctggcgactgcagcggccac
ggcttctgcgacacgggcgagtgctactgcgaggagggcttcacaggcctggactgtgcc
caggtggtcgccccacagggcctgcagctgctcaagagcacggaggaatctctgctggtg
agctgggagccctccagccaggtggatcactatttcctcaactactaccccctggggaag
gagctgtctgggaagcagatccgagtgcccaaggagcaacacagctatgagattctcggt
ttgctgcctggaaccaaatacatagtcaccctgcgtaacgtcaagaaagacatttttagc
agcccacagcatctacttgccaccacagaccttgctgcacttggcactgcctgggtgaca
gaggagactgagcactccctcgacgtggagtgggaaaacccctcgaccgaggtggactac
tacaagctgcgatatggccccatgacagggcaggaggtaactgaggtcactgtgccgaag
agcagtgaccccaagagccgatatgacatcactggtctgcacccagggaccgaatataag
atcacggtggtgcccatgcgaggaacgctggagggcaagccgatcctcctgaatggcagg
acagaaattgacagtccaaccaatgttgttactgatcgagtgactgaagacacagcaact
gtctcctggaaaccagtgcaggctgtcatagacaagtatgtggtgcgctacacctctgct
gatggggacaccaaggaaatggcagtgcacaaggatgagagcagcactgtcctgacgggc
ctgaagccaggagaggcatacaaggtctacgtatgggctgaaaggggcaaccaggggagc
aagaaagctgacaccaatgccctcacagaaattgacagcccagcaaacctggtgactgac
cgggtgactgagaatactgccaccatctcctgggacccggtgcaggccaccattgacaag
tacgtggtgcactacacctctgctgacaaccaagagaccagagaggttccggtggggaag
gagcagagcagcactgtcctgacaggcctgaggccaggtgtggagtacacagtgcacgtc
tgggcccagaagggggaccgagagagcaagaaggccaacaccaacgccccaacagacatt
gacagccccaaaaacctggtgactgaccgggtgacagagaatatggccactgtctcctgg
gacccggtgcaggccaccattgacaagtatatggtgcgctacacctctgcggacggagag
accagggaggttccggtggggaaggagcacagcagcaccatcctgacgggcctgagaccg
ggcatggagtacatggtgcacgtgtgggcccagaagggggaccaggagagcaagaagact
gacaccaaggcccagacagacattgacagcccccaaaacctggtgactgactgggtgaca
gagaatatggccactgtctcctgggacccagtgcaggccactattgacaggtatgtggtg
cgctacacctctgccaaggacggagagaccagggaggttccggtggggaaggagcagaac
agcactgtcctgacaggcctgaggcctggtgtggagtacacggtgcacgtgtgggcccag
aagggggcccaggagagcaagaaggctgacaccaaggcccagacagacattgacagcccc
aaaaacctggtcactgaccgggtgacagagaatacagccactgtctcctgggacctggtg
caggccaccattgacaggtatgtggtgcgctacacatctgccaatggagagaccagggag
gttccagtggggaaggagcagagcagcaccgtcctgacgggcctgaggccaggcgtggag
tacacagtgcacgtgtgggcccagaaggggaaccaggagagcaagaaggctgacaccaag
gcccagacagaaattgacagccccaaaaacctggtgactgaccgggtgacagagaatatg
gccactgtctcctgggacccggtgcaggccacgattgacaagtatgtggtgcgctacacc
tctgcggatggagagaccagggaggttccggtggggaaggagcacagcagcaccgtcctg
acgggcctgagaccgggcatggagtacatggtgcacgtgtgggcccagaagggggaccag
gagagcaagaaggccgacaccaaggcccagacagaaattgaccctcccagaaaccttcgt
ccatctgctgtaacgcagtctggtggtgtattgacctggacacccccctctgctcagatc
catggctacattctgacctaccagttcccagacggcacagttaaggagatacagctggga
cgagaagaccagaggtttgagttgcaaggcctggagcaaggtgccacctatcctgtctcc
cttgttgcctttaagggtggtctccggagcagaaatgtatccaccaccctctccacagtt
ggtgcccgtttcccacacccttcggactgcagtcaggttcagcagaacagcaatgccgcc
agtggtctgtacaccatctacctgcatggcgattccagccggcccctgcaggtgtactgc
gacatggacacggacggaggtggctggattgtcttccagaggcggaacactgggcagctg
gatttcttcaagcgatggaggacctatgtggaaggctttggggaccccacgaaggagttc
tggcttggacttgacaagctacacaacctcaccactggcactccaacgcggtatgaggtg
agagtggatttacagactgccaatgaatccgcctatgctatatatgatttcttccaagtg
gcctccagcaaggagcggtataagctgacagttgggaaatacagaggcacagcgggggat
gctctttcttaccacaatggatggaagtttacaactttcgacagagacaacgatattgca
ctcagcaactgtgccctgacacatcatggtggctggtggtataagaactgccacttggcc
aaccccaatggcagatatggggagaccaagcacagtgagggggtgaactgggagccttgg
aaaggacatgaattctccattccttacgtggagttgaaaatccgccctcatggctacagc
aaggagcctgtcctgggcagaaagaagcggacgctgggaggaaggctgcgaacgttctga

KEGG   Macaca mulatta (rhesus monkey): 708707
Entry
708707            CDS       T01028                                 

Gene name
TNR
Definition
(RefSeq) tenascin-R
  KO
K06252  tenascin
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05165  Human papillomavirus infection
mcc05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    708707 (TNR)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    708707 (TNR)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    708707 (TNR)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    708707 (TNR)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    708707 (TNR)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    708707 (TNR)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   708707 (TNR)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_Tenascin EGF_2 Pur_ac_phosph_N Uroplakin_II EGF Lzipper-MIP1
Other DBs
NCBI-GeneID: 708707
NCBI-ProteinID: NP_001247930
Ensembl: ENSMMUG00000015723
UniProt: H9ER94 G7MFE3
LinkDB
Position
1
AA seq 1358 aa
MGADGETVVLKNMLIGVNLILLGSMLKPSECQLEVTTERVQRQSVEEEGGIANYNTSSKE
QPVVFNHVYNINVPLDSLCSSGLEASAEQDVSAEDETVAEYTGQTSDHESQVTFTHRINL
PKKACPCASSAQMLQELLSRIEMLEREVSVLRDQCNANCCQESAATGQLDYIPHCSGHGN
FSFESCGCICNEGWFGKNCSEPYCPLGCSSRGVCVDGQCICDSEYSGDDCSELRCPTDCS
SRGLCVDGECVCEEPYTGEDCRELRCPGDCSGKGRCANGTCLCEEGYVGEDCGQRQCLNA
CSGRGQCEEGLCICEEGYQGPDCSAVAPPEDLRVAGISDRSIELEWDGPMAVTEYVISYQ
PTALGGLQLQQRVPGDWSGVTITELEPGLTYNISVYAVISNILSLPITAKVATHLSTPQG
LQFKTITETTVEVQWEPFSFSFDGWEISFIPKNNEGGVIAQVPSDVTSFNQTGLKPGEEY
IVNVVALKEQARSPPTSASVSTVIDGPTQILVRDVSDTVAFVEWIPPRAKVDFILLKYGL
VGGEGGRTTFRLQPPLSQYSVQALRPGSRYEVSVSAVRGTNESDSATTQFTTEIDAPKNL
RVGSRTATSLDLEWDNSEAEVQEYKVVYSTLAGEQYHEVLVPKGIGPTTRTTLTDLVPGT
EYGVGISAIMNSQQSVPATMNARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITF
TPSSGIASEVTVPKDRTSYTLTDLEPGAEYIISITAERGRQQSLESTVDAFTGFRPISHL
HFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMTEVSLDATKRHAVLMGLQPATEY
IVNLVAVHGTVTSEPIVGSITTGIDPPKDITISNVTKDSVMVSWSPPVASFDYYRVSYRP
TQVGRLDSSVVPNTVTEFTITRLYPATEYEISLNSVRGREESERICTLVHTAMDNPVDLT
ATNITPTEALLQWKAPVGEVENYVIVLTHFAVAGETILVDGVSEEFRLVDLLPSTHYTAT
MYATNGPLTSGTISTNFSTLLDPPANLTASEVTRQSALISWQPPRAEIENYVLTYKSTDG
SRKELIVDAEDTWIRLEGLLENTDYTVLLQAAQDTAWSSITSTAFTTGGRVFPHPQDCAQ
HLMNGDTLSGVYPIFLNGELSQKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVG
FGNLEDEFWLGLDNIHRITSQGRYELRVDMRDGQEAAFASYDRFSVEDNRNLYKLRIGSY
NGTAGDSLSYHQGRPFSTEDRDNDVAVTNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQG
INWYHWKGHEFSIPFVEMKMRPYSHRLMAGRKRQSLQF
NT seq 4077 nt   +upstreamnt  +downstreamnt
atgggggcagatggggaaacagtggttctgaagaacatgctcattggcgtcaacctgatc
cttctgggctccatgctcaagccttcggagtgtcaactggaggtcaccacagaaagggtc
cagagacagtcagtggaggaggaaggaggcattgccaactacaacacatctagcaaagaa
cagcctgtggtcttcaaccacgtgtacaacattaacgtgcccctggacagcctctgctcc
tcggggctagaggcctctgctgagcaggacgtgagcgcagaagacgagactgtggcagag
tacacgggccagacctcagaccacgagagccaggtcacctttacacacaggatcaacctc
cccaaaaaggcctgcccatgtgccagttcggcccagatgctgcaggagctgctgagccgg
atcgagatgctggagagagaggtgtcagtgctgcgagaccagtgcaacgccaactgctgc
caagaaagtgctgccacaggacaactggactatatccctcactgcagtggccacggcaac
tttagctttgagtcctgtggctgcatctgcaacgaaggctggtttggcaagaactgctcg
gagccctactgcccgctgggttgctccagccggggggtgtgtgtggatggccagtgcatc
tgtgacagcgagtacagcggggatgactgttccgaactccggtgcccaacagactgcagt
tcccgggggctctgcgtggacggggagtgtgtctgtgaagagccctacactggcgaggac
tgcagggaactgaggtgccccggggactgttcggggaaggggagatgtgccaacggtacc
tgtttatgcgaggagggctacgttggtgaggactgcggccagcggcagtgtctgaatgcc
tgcagtgggcgaggacaatgtgaggaggggctctgcatctgtgaagagggctaccagggc
cctgactgctcagcagttgcccctccagaggacttgcgagtggctggtatcagcgacagg
tccattgagctggaatgggacgggccgatggcagtgacggaatatgtgatctcttaccag
ccgacggccctggggggcctccagctccagcagcgggtgcctggagattggagtggtgtc
accatcacggagctggagccaggtctcacctacaacatcagcgtctacgctgtcattagc
aacatcctcagccttcccatcactgccaaggtggccacccatctctccactcctcaaggg
ctacaatttaagacgatcacagagaccaccgtggaggtgcagtgggagcccttctcattt
tcctttgatgggtgggaaatcagcttcattccaaagaacaatgaagggggagtgattgct
caggtccccagtgatgttacatcctttaaccagacgggactaaagcctggggaagaatac
attgtcaatgtggtggctctgaaagaacaggcccgaagcccccctacctcggccagtgtc
tccacagtcattgacggccccacgcagatcctggttcgagacgtctctgacaccgtggct
tttgtggagtggatcccccctcgagccaaagtcgatttcattctcttgaaatatggcctg
gtgggcggggaaggcgggaggaccaccttccggctgcagcctcccctgagccaatactca
gtgcaggccctgcggcccggctcccgctacgaggtgtcagtcagtgctgtccgagggacc
aacgagagtgattctgccaccactcagttcacaacagagattgatgcccccaagaacttg
cgagttggttctcgcacagcaaccagccttgacctcgagtgggataacagtgaggccgaa
gttcaggagtacaaggttgtatacagcaccctggcgggtgagcaatatcatgaggtactg
gtccccaagggcattggtccaaccaccaggaccaccctgacagatctggtacctggcact
gagtatggagttggaatatctgccatcatgaactcacaacaaagcgtaccagccaccatg
aatgccaggactgaacttgacagtccccgagacctcatggtgacagcctcctccgagacc
tccatctccctcatctggactaaggccagtggacccattgatcactaccgaattaccttt
accccatcctctgggatcgcctcagaagtcactgtacccaaggacaggacctcgtacaca
ttgacagatctagagcctggggcagagtacatcatttccatcactgctgagaggggtcgg
cagcagagcttggagtccactgtggatgctttcacaggcttccgtcccatttctcatctg
cacttttctcatgtgacctcctccagtgtgaacatcacttggagtgacccatctccccca
gcagacagacttattctgaactacagccccagggatgaggaagaagagatgacagaggtc
tccctggatgccaccaagaggcatgctgtcctgatgggcctgcaaccagccacagagtat
attgtgaaccttgtggcagtccatggcacagtgacctctgagcccattgtgggctccatc
accacaggaattgatcccccaaaagacatcacaattagcaatgtgaccaaggactcagtg
atggtctcctggagccctcctgttgcatcttttgattactaccgagtatcatatcgaccc
acccaagtgggacgattagacagctcagtggtgcccaacactgtgacagaattcaccatc
accagactgtacccagctaccgaatacgaaatcagcctcaacagcgtgcggggcagggag
gaaagcgagcgcatctgtactctcgtgcacacagccatggacaaccctgtggatctgact
gctaccaacatcactccaacagaagccctgctgcagtggaaggcgccagtgggtgaggtg
gagaactacgtcattgttctcacacactttgcagttgctggagagaccatcctggttgac
ggagtcagtgaggaatttcggcttgttgacctgcttcctagcacccactatactgccacc
atgtatgccaccaatgggcctctcaccagtggcaccatcagcaccaacttctctactctc
ctggaccctccagcaaacctgacagccagtgaagtcaccagacaaagtgccctgatctcc
tggcagcctcccagggcagagattgaaaattatgtcttgacctacaaatccaccgatgga
agccgcaaggagctgattgtggatgcagaagacacctggattcgactggagggcctgttg
gagaacacagactacacggtgctcctgcaggcagcacaagatactgcgtggagcagcatc
acctccaccgccttcaccacagggggccgggtgttccctcatcctcaagactgtgcccag
catttgatgaatggagacactttgagtggggtttaccccatcttcctcaatggggagctg
agccagaagttacaagtgtactgtgatatgaccaccgacgggggcggctggattgtattt
cagaggcggcagaatggccaaactgattttttccggaaatgggctgattaccgtgttggc
ttcgggaacctggaggatgagttctggctggggctggacaacatacacaggatcacatcc
cagggccgctatgagctgcgcgtggatatgcgtgatggccaggaggccgccttcgcctcc
tacgacaggttctctgtcgaggacaacagaaacctgtacaaactccgcataggaagctac
aacggcactgcgggggactccctcagctatcatcaaggacgccctttctctacagaggat
agagacaatgatgttgcagtgaccaactgtgccatgtcatacaagggagcgtggtggtat
aagaactgccaccggaccaacctcaatgggaagtacggggagtccagacacagtcagggc
atcaactggtaccattggaaaggacatgagttctccatcccctttgtggaaatgaagatg
cgcccctacagtcaccgtctcatggcaggaagaaaacggcagtccttacagttctga

KEGG   Macaca mulatta (rhesus monkey): 709493
Entry
709493            CDS       T01028                                 

Gene name
COL6A2
Definition
(RefSeq) collagen alpha-2(VI) chain isoform X2
  KO
K06238  collagen type VI alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    709493 (COL6A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    709493 (COL6A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    709493 (COL6A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    709493 (COL6A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    709493 (COL6A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    709493 (COL6A2)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    709493 (COL6A2)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   709493 (COL6A2)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   709493 (COL6A2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   709493 (COL6A2)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3 Ins134_P3_kin_N FAM180
Other DBs
NCBI-GeneID: 709493
NCBI-ProteinID: XP_014988164
Ensembl: ENSMMUG00000022317
LinkDB
Position
3
AA seq 918 aa
MLQGPCSVLLLWGILGATQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQ
SPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFIK
SLQGISSFRRGTFTDCALANMTEQIRQHGTKGTVHFAVVITDGHVTGSPCGGIKLQAERA
REEGIRLFAVAPNRNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKH
EAYGECYRVSCLEIPGPPGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEGPIGFPGPK
GVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGNRGPDGYPGEA
GSPGERGDQGGKGDPGRPGRRGPPGDIGAKGSKGYQGNNGAPGSHGVKGAKGGPGPRGPK
GEPGRRGDPGTKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQGAL
GEPGNQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPGDKGEPGPRGPEGGRG
DFGLKGEPGRKGEKGEPADPGPPGEPGPRGPRGVPGPEGEPGPPGDPGLTECDVMTYVRE
TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV
GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRR
QKTRVFAVVITDGRHDPRDDDLNLRALCDRDVTVTAIGIGDMFHEKHESENLYSIACDKP
QQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPDLPCQTDAPWPGGESPVTFLRTEEGP
DATFPRTIPLIQQLLNATELTQDPAAYSQLVAVLVYTAERAKFATGVERQDWMELFIDTF
KLVHRDIVGDPETALALC
NT seq 2757 nt   +upstreamnt  +downstreamnt
atgctccagggcccctgctccgtgctcctgctctggggaatcctgggggccacccaggcc
cagcagcaggaggtcatctcgccagatactacggagagaaacaacaactgcccagagaag
acggactgccccatccatgtgtacttcgtgctggacacctcggagagcgtcaccatgcag
tcccccaccgacatcctgcttttccacatgaagcagttcgtgccgcagttcatcagccag
ctgcagaacgagttctacctggaccaggtggcactgagctggcgctacgggggcctgcac
ttctccgaccaggtggaggtgttcagcccaccgggcagcgaccgggcctccttcatcaag
agcctgcagggcatcagctccttccgccgcggcaccttcactgactgcgcgctggccaac
atgacggagcagatccggcagcacggcaccaagggcaccgtccacttcgccgtggtcatc
accgacggccacgtcaccggcagcccctgcgggggcatcaagctgcaggccgagcgggcc
cgcgaggagggcatccggctcttcgccgtggcccccaaccggaacctgaaggagcagggc
ctgcgggacatcgccagcacgccgcacgagctctaccgcaacgactacgccaccatgctg
cccgactccaccgagatcgaccaggacaccatcaaccgcatcatcaaggtcatgaaacac
gaagcctacggagagtgctacagggtgagctgcctggaaatccctgggccccccggcccc
aagggctaccgtggacagaagggtgccaagggcaacatgggtgagccgggagagcctggc
cagaagggaagacagggagacccaggcatcgaaggccccattggattcccaggacccaag
ggcgttcctggcttcaaaggagagaagggtgaatttggagccgacggtcgcaagggggcc
cctggcctggctggcaagaatgggaccgatggacagaagggcaagctggggcgcatcgga
cctcctggttgcaagggagaccccggaaaccggggccccgatggttacccaggggaagca
gggagtccgggggagcgaggagaccaaggcggcaagggggaccctggccgcccaggacgc
agagggcccccgggagacatcggggcgaagggaagcaaggggtatcaaggcaacaatgga
gccccaggaagtcatggtgtgaagggagccaagggtgggcctgggccccgtggacccaaa
ggcgagccggggcgcaggggagaccccggcaccaagggcagcccaggcagcgacggcccc
aagggggagaagggggaccctggccctgaggggccccgcggcctggctggagaggttggc
aacaaaggagccaagggagaccgaggcttgcctggacccagaggcccccagggggctctt
ggggagcccggaaaccagggatctcggggagaccccggtgatgcaggaccccgtggagac
tcaggacagccaggccccaagggagaccccggcaggcctggattcagctacccaggaccc
cgaggagcacctggagacaaaggcgagcccggcccacgcggccccgagggcggccgaggc
gactttggcttgaaaggagaacctgggaggaaaggagagaaaggagagcctgcggatcct
ggtccccctggtgagccaggccctcgggggccaagaggagtcccaggacccgagggtgag
cctggaccccctggagaccccggcctgacggagtgtgacgtcatgacctacgtgagggag
acctgcgggtgctgcgactgcgagaagcgctgtggcgccctggacgtcgtcttcgtcatc
gacagctccgagagcatcgggtacaccaacttcacgctggagaagaactttgtcatcaac
gtggtcaacaggctgggcgccatcgcgaaggaccccaagtccgagacagggacgcgcgtg
ggcgtggtgcagtacagccacgagggcaccttcgaggccatccagctggacgacgaacgc
atcgactccctgtcgagctttaaggaggccgtcaagaacctcgagtggatcgcgggcggc
acctggacaccctcggccctcaagtttgcctacgaccgtctcatcaaggagagccggcgc
cagaaaacacgcgtgtttgcggtggtcatcacggacgggcgccacgaccctcgggatgat
gacctcaacctgcgggcgctgtgtgaccgcgacgtcacagtgacggccatcggcattggc
gacatgttccacgagaagcacgagagcgagaacctgtactccatagcctgcgacaagcca
cagcaggtgcgcaacatgacgctgttctccgacctggtggctgagaagttcattgacgac
atggaggacgtcctctgcccggaccctcagatcgtgtgcccagaccttccctgccaaaca
gatgcaccgtggcctggcggcgagtccccggtcaccttcctccgcacggaagaggggccg
gacgccaccttccccaggaccatccccctgatccaacagttgctaaacgccacggagctc
acacaggacccggccgcctactcccagctggtggccgtgctggtctacaccgccgagcgg
gccaagttcgccaccggggtagagcggcaggactggatggagctgttcattgacaccttt
aagctggttcacagggacatcgtgggcgaccctgagaccgcgttggccctctgctga

KEGG   Macaca mulatta (rhesus monkey): 712444
Entry
712444            CDS       T01028                                 

Gene name
COL4A3
Definition
(RefSeq) collagen alpha-3(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04926  Relaxin signaling pathway
mcc04933  AGE-RAGE signaling pathway in diabetic complications
mcc04974  Protein digestion and absorption
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    712444 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    712444 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    712444 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    712444 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    712444 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    712444 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    712444 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    712444 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    712444 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    712444 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    712444 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    712444 (COL4A3)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   712444 (COL4A3)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   712444 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 712444
NCBI-ProteinID: XP_028686995
Ensembl: ENSMMUG00000013235
LinkDB
Position
12
AA seq 1670 aa
MSPRTAPRSQVLLLQLLLVLLAAAPTASKGCVCKDKGQCFCDGAKGEKGERGFPGPPGSP
GQKGFTGPEGLLGPQGPNGSPGLPGLRGPKGVRGITGLPGFSGSPGLPGTPGNTGPYGLV
GVPGCNGSKGERGFPGLPGTLGYRGILGAAGLKGQKGAPAEGEDIELDAKGDPGLPGAPG
PQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGQKGERGVKGLTGPPGP
PGTVTVTLTGPDNRTDLKGEKGDKGAIGQPGPPGPSGLPGESYRSEKGAPGDPGPQGKPG
KDGVPGFPGSEGVKGNRGFPGLAGEDGIKGQKGDTGPPGFRGPTEYYDTYQEKGDEGIPG
PPGPKGVRGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSP
GCPGSPGPPGSPGPPGPPGDIVFRKGPPGDRGLPGYLGSPGITGVDGPKGEPGLLCTQCP
YIPGSPGLPGLPGLHGVKGIPGRPGAAGLKGSPGSPGNTGLPGIPGFPGAKGDRGLKGEK
GETLQPEGQVGAPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGD
PGSPGSPGPAGPAGPPGYGPQGEPGPQGTQGVPGAPGPRGEAGPRGELSVSTPVPGPPGP
PGPPGHPGPQGPPGMPGSMGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSL
GCPGNLGEPGLPGKPGVPGAKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPG
TPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKMGPKGD
PGIPGLDRSGFPGETGSPGMSGHQGEMGPPGQKGYPGNPGILGLPGEDGVVGMMGFPGAT
GPPGPPGNPGMPGQRGSLGIPGVKGQRGTPGAKGEQGDKGNPGPSQISHVIGHKGEPGLK
GFAGNPGEKGNRGVPGIPGLKGLKGLPGPAGPPGPRGDLGSTGNAGEPGLRGIPGSMGNM
GMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGEMG
KKGEMGQPGPPGHSGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIRGFPGPPGIKGPPG
LPGFPGSPGPMGVRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGSPGFP
GLPGRKGAMGDAGPRGPTGIEGLPGPPGRPGAIIPGRKGNHGPPGLRGSPGEPGPPGPPG
SHVTGIKGDKGSMGHPGPKGPPGTVGDMGPPGHLGAPGTPGLPGLRGDPGFQGFPGVKGE
KGNPGFLGSIGPPGPIGPIGPPGIRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGP
PGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPPGSDGLPGLKGKRGDSGSPAT
WTTRGFVFTRHSQTTAIPSCPEGTAPLYSGFSFLFVQGNERAHGQDLGTLGSCLQRFTTM
PFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEAPAIAIAV
HSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgagcccccggacggcgcccagatcgcaggtgctcctgctgcagctcctgctggtgctc
ctggcggcggcgcccacggccagcaagggctgtgtctgtaaagacaaaggccagtgcttc
tgtgatggagccaaaggggagaagggggagagaggctttcctggaccccctggttctcct
ggccagaaaggattcacaggtcctgaaggcttgcttggaccacagggacccaacggctct
ccaggacttccaggactcagaggtcccaaaggtgtaaggggaataactggattgccagga
ttttccggttctcctggacttccaggcaccccaggcaatactgggccttatggactcgtc
ggtgtaccaggatgcaatggttctaagggtgagcgggggtttccaggactcccagggaca
ttgggctaccgagggatcctgggtgctgctggtttgaaaggacaaaagggtgctcctgct
gaaggagaagatatagaacttgatgcaaaaggtgaccccgggttgccaggggctccagga
ccccagggtttgccaggccctccaggttttcctgggcctgttggcccacctggtcctccg
ggattctttggctttccaggagccatgggacctagaggacctaagggtcacatgggtgaa
agagtgataggacaaaaaggagagcggggtgtgaaagggttaacaggacccccgggacca
ccaggaacagttactgtgaccctaactggcccagataacagaacggacctcaagggggaa
aagggagacaagggagccatcggccagcctggacctcctggaccctcaggactgcctgga
gaatcatatagatctgaaaagggtgctcctggagaccccggcccgcagggaaaacccggt
aaagacggcgttcctggcttccctggaagtgagggagtcaagggcaacaggggtttccct
gggttagcaggtgaagatggcattaagggacagaaaggggacaccggccctccaggattt
cgtggtccaacagaatattatgacacataccaggaaaagggagatgaaggcattccaggc
ccaccagggcccaaaggagttcgtggcccacaaggtcccagtggtccccccggagttcct
ggaagtcctggatcatcaaggcctggcctcagaggagcccctggatggccaggcttgaaa
ggaagtaaaggggaacgaggccgcccaggaaaggatgccatggggactcctgggtcccca
ggttgtcccggttcaccaggccctccaggatcgccgggacctccaggaccaccaggtgac
atcgtttttcgcaagggtccacctggagatcgcggactgccaggctatctagggtctcca
ggaatcacaggcgttgacgggcccaaaggagaaccaggcctgctgtgtacacagtgccct
tatatcccagggtcccctggtctcccagggttgccagggttacatggtgtaaaaggaatc
ccaggaagaccaggtgcagctggcttgaaaggaagcccagggtcccccggaaatacaggt
cttccaggaattccaggtttcccaggtgccaagggtgaccgaggacttaaaggagaaaaa
ggtgaaacacttcagcctgaggggcaagtgggtgccccaggggacccggggctcagaggc
caacctgggagaaagggcttggatggaattcctggaactccaggagtgaaaggattacca
ggacctaaaggcgaactggctctgagtggtgagaagggggaccaaggtcctccaggggat
cctggctcccctgggtccccaggacctgcaggaccagctggaccacctggctacggaccc
caaggagaacctggtccccagggcacgcaaggagttcctggagcccctggaccacgcgga
gaagccggtcctaggggagaactcagtgtttcaacaccagttccaggcccaccaggacct
ccagggccccctggccatcctggcccccaaggtccacctggtatgcctggatctatgggg
aaatgtggagatcctggtcttccaggacctgatggtgaaccaggaattccaggaattgga
tttcctgggcctcctggacctaagggagaccaaggttttccaggtacaaaaggatcactg
ggttgtcctggaaatttgggagagcctgggttacctggaaagccaggcgtcccaggagcc
aagggagaaccagcagtagccatgcctggaggaccaggaacaccaggttttccaggagaa
agaggcaattctggggaacatggagaaattggactccctggacttccgggtctccctgga
actccaggaaatgaaggtcttgatggaccacgaggagatccagggcagcctggaccacct
ggagaacaaggacccccaggaaggtgcatagagggtcctaggggagcccaaggacttcca
ggcttaaatggattgaaagggcaacaaggcagaagaggtaaaatggggccaaagggagac
ccaggaattccaggcttggatagatcaggatttcctggagaaactggatcaccgggaatg
tcaggtcatcaaggcgagatgggaccaccgggtcaaaaaggatatccaggaaatccagga
attttagggctaccgggtgaagatggagtggttgggatgatgggctttcctggagccact
ggccctccgggaccccctgggaacccaggcatgccagggcagagggggagccttggaatt
ccaggagtaaagggccagagaggaaccccaggagccaaaggggaacaaggagataaagga
aaccccgggccttctcagatatcccacgtgatagggcacaaaggagaaccaggtctcaaa
ggattcgcaggaaatccaggtgagaaaggaaacagaggcgttccagggataccaggttta
aaaggcctcaaaggactacctggaccagcaggaccaccaggccccagaggagatttgggc
agcactgggaatgctggagaaccaggactacgtggtataccaggaagcatggggaacatg
ggcatgccaggttctaaaggaaaaaggggaactttgggattcccaggtcgagcaggaaga
ccaggcctcccaggtattcacggtctccagggagataagggagagccgggttattcagaa
ggtacaaggccaggaccaccaggaccaacgggggatccaggactgccgggtgaaatggga
aagaaaggagaaatggggcaacctggcccacctggacattcggggcctgctggacctgag
ggagcccctggaagtcctggaagtcctggcctcccaggaaagccaggtcctcatggtgac
ttgggttttaaaggaatcagaggcttcccgggccctccaggaatcaaaggccctccaggt
cttccaggattcccaggatctcctggaccaatgggtgtaagaggtgaccaaggacgtgat
ggaattcctggtccagccggagaaaagggagaaacgggtttattgagggcccctccaggc
ccaagaggaaacccgggtgctcaaggagctaaaggagacaggggatccccaggtttccct
ggcctccctggcagaaaaggggccatgggagatgctgggcctcgagggcccacaggcata
gaaggactcccagggccaccaggtcggcccggtgcaattatccctggccggaaaggaaat
catggtccaccaggcttaagaggaagcccaggtgagcctggtccccctggacctccaggg
agtcacgtaacaggcataaaaggagacaaagggtctatgggccaccctgggccaaaaggt
ccacctggaactgtaggagacatgggaccaccaggtcatctgggagcaccaggtactcca
ggtcttccaggactcagaggtgatcctggattccaggggtttccaggcgtgaaaggagaa
aagggtaatcctggatttctaggatccattggacctccaggaccaattgggccaatagga
ccacctggtatacgtggagaccctggcacacttaagattatctcccttccaggaagccca
gggccacctggcacacctggagaaccagggatgcagggagaacctgggccaccagggcca
cctggaaacctaggaccctgtgggccaagaggtaagccaggcaaggatggaaaaccagga
actcctggaccagctggagaaaaaggcaacaaaggttctaaaggagagccaggaccacct
ggatcagatggattgccaggtttgaaaggaaaacgtggagacagtggatcacctgcaacc
tggacaacaagaggctttgtcttcacccgacacagtcaaaccacagcaatcccttcatgt
ccagaggggaccgcgccactctacagtgggttttcttttctctttgtacaagggaatgaa
cgagcccacggacaagacctcggaactctgggcagctgcctgcagcgatttaccacaatg
ccattcttattctgcaatgtcaatgatgtatgtaatttcgcatctcgaaatgattattca
tactggctgtcaacaccagctctgatgccaatgaacatggctcccattactggcagggcc
cttgagccttatataagcagatgcactgtttgtgaagctcctgcgatcgccatagctgtt
cacagccaaaccactgacattcctccatgtcctcacggctggatttctctctggaaagga
ttttcgttcatcatgttcacaagtgcaggttctgagggcaccgggcaagcactggcctcc
cccggctcctgcctggaagaattccgagccagcccatttctagaatgtcatggaagagga
acgtgcaactactattcaaattcctacagtttctggctggcttcattaaacccagaaaga
atgttcagaaagcctattccatcaactgtgaaagctggagaattagaaaaaataataagt
cgctgtcaggtgtgcatgaagaaaagacactaa

KEGG   Macaca mulatta (rhesus monkey): 714809
Entry
714809            CDS       T01028                                 

Gene name
COL9A1
Definition
(RefSeq) collagen alpha-1(IX) chain isoform X4
  KO
K08131  collagen type IX alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    714809 (COL9A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    714809 (COL9A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    714809 (COL9A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    714809 (COL9A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    714809 (COL9A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mcc00535]
    714809 (COL9A1)
Proteoglycans [BR:mcc00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   714809 (COL9A1)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 714809
NCBI-ProteinID: XP_001111519
Ensembl: ENSMMUG00000005577
UniProt: I2CTW6
LinkDB
Position
4
AA seq 678 aa
MARTARDRGALGLLLSGLCLCAAQRGPPGEQGPPGPPGPPGVPGIDGIDGDRGPKGPPGP
PGPAGEPGKPGAPGKPGTPGADGLTGPDGSPGSVGPKGQKGEPGVPGSRGFPGRGIPGPP
GPPGTAGLPGELGRVGPVGDPGRRGRPGPPGPPGPRGTIGFHDGDPLCPNACPPGRSGYP
GLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGELGEVGAQGPPGAQGLRGITGIVGDKGEK
GARGLDGEPGPQGLPGAPGDQGQRGPPGEAGPKGERGAQGARGIPGLPGPKGDTGLPGVD
GRDGIPGMPGTKGEPGKPGPPGDAGLQGLPGVPGIPGAKGVAGEKGSTGAPGKPGPMGNS
GKPGQQGPPGEVGPRGPRGLPGSRGELGPVGSPGLPGKLGSLGSPGLPGLPGPPGLPGMK
GDRGVVGEPGPKGEQGASGEEGETGERGELGDIGLPGPKGSAGNPGEPGLRGPEGSRGLP
GVEGPRGPPGPRGVQGEQGATGLPGVQGPPGRAPTDQHIKQVCMRVIQEHFAEMAASLKR
PDSGASGLPGMPGPPGPPGPPGENGFPGQMGMRGLPGIKGPPGALGLRGPKGDLGEKGER
GPPGRGPKGLPGAVGLPGDPGPASYGRNGRDGERGPPGVAGIPGVPGPPGPPGLPGFCEP
ASCTMQAGQRAFSKGPDP
NT seq 2037 nt   +upstreamnt  +downstreamnt
atggccaggactgcgcgggaccgcggggccctggggctgctgctgtcagggctctgcttg
tgcgcggctcaaagaggtccccccggcgagcagggtcctccagggcctccgggcccccct
ggagttccgggcatagacggcatcgacggtgaccgaggtcctaagggtcccccgggtccc
ccgggtcctgcaggtgaaccgggaaagccaggagctccaggcaagcctggcacacctggc
gctgatggattaacaggacctgatggatcccctggctccgttgggccaaagggacaaaaa
ggagaacctggtgtgcctggatctcgtggatttccaggccgtggtattcctggaccccct
ggtcctcctgggacagcaggactccctggagagcttggccgtgtgggacctgttggtgac
cctgggagaagaggacgacctggcccccctggccccccaggaccccgtggaacaattggc
tttcatgatggagatccattgtgtcccaatgcctgtccaccaggtcgctcaggatatcca
ggcctaccaggcatgaggggtcataaaggggccaaaggagaaattggtgaaccaggaaga
caaggacacaagggtgaagaaggtgaccagggagaactgggagaagttggagctcaagga
cctccaggagcccagggtttgcgaggcatcaccggcatagttggagacaaaggggaaaaa
ggtgctcggggcttagatggcgaacctgggcctcagggtcttcctggtgcacctggtgat
caaggacagcgaggacctccaggagaagcaggtcccaaaggagagagaggggctcaaggt
gctagaggaattcctggtctccctgggcccaaaggagacacgggtttgccaggtgtggat
ggccgtgatgggatccctggaatgcctggaacaaagggtgaaccaggaaaacctgggcct
cctggtgatgcaggattgcaggggttaccaggtgtacctggaattcctggtgcaaagggt
gttgctggtgaaaagggtagtacaggtgctccagggaagcctggtccgatgggaaattca
ggcaaaccgggccaacaggggcctccaggagaggtgggaccccgaggacccagggggctt
cctggcagtagaggagaattaggaccagtgggatccccaggcctaccaggtaaactgggt
tctctgggtagccctggcctccctggcttgcctgggccccctggacttcctggaatgaaa
ggtgacaggggtgtagtcggtgaaccgggtccaaagggtgaacagggtgcctctggtgaa
gaaggtgaaacaggagaaagaggggaacttggagatataggattacctggcccaaaggga
tctgcaggtaatcctggggagcctggcttgagagggcctgagggaagtcgggggcttcct
ggagtggaaggaccaagaggaccacctggaccccggggcgtgcagggagaacagggtgcc
accggcctgcctggtgtccagggccctccgggtagagcaccgacagatcagcacattaag
caggtttgcatgagagtcatacaagagcattttgctgagatggctgctagtcttaagcgt
ccagactcgggtgcctctgggctccctggaatgcctggtcctcctggtccccctggccct
cctggagagaatgggttcccaggccagatgggaatgcgtggccttccgggcattaaaggg
ccccctggtgcccttggtttgaggggacctaaaggtgacttgggagaaaagggggagcgt
ggccctccaggaagaggtcccaaaggtttgcctggagctgtaggtctcccaggtgaccca
ggccctgccagctatgggagaaatggccgagatggtgaacgaggcccccccggggtggca
ggaattcctggcgtgcctggacccccgggacctcctgggcttcccggtttctgtgagcca
gcctcctgcaccatgcaggctggtcagcgagcatttagcaaagggcctgacccgtga

KEGG   Macaca mulatta (rhesus monkey): 715212
Entry
715212            CDS       T01028                                 

Gene name
LAMC2
Definition
(RefSeq) laminin subunit gamma-2
  KO
K06246  laminin, gamma 2
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    715212 (LAMC2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    715212 (LAMC2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    715212 (LAMC2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    715212 (LAMC2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    715212 (LAMC2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    715212 (LAMC2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    715212 (LAMC2)
   05145 Toxoplasmosis
    715212 (LAMC2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_B
Other DBs
NCBI-GeneID: 715212
NCBI-ProteinID: XP_014977667
Ensembl: ENSMMUG00000001479
LinkDB
Position
1
AA seq 1193 aa
MPALWLGCCLCFSLLLPAARATSRREVCDCNGKSRQCIFDQELYRQTGNGFRCLNCNDNT
DGIHCERCKDGFYRHRERDRCLPCNCNSKGSLSARCDNSGRCSCKPGVTGARCDRCLPGF
HMLTDAGCTQDQRLLDSKCDCDPAGIAGPCDAGRCVCKPAVTGERCDRCRSGYYNLDRGN
PEGCTQCFCYGHSASCRSSADYSVHKITSTFNQDVDGWKAVQRNGSPAKLQWSQRHQDVF
SSARRPDPVYFVAPAKFLGNQQVSYGQSLSFDYRVDRGGRHPSAHDVILEGAGLRITAPL
MPLGKTLPCGLTKTYTLRLNEHPSSNWSPQLSYFEYRRLLRNLTALRIRATYGEYSTGYI
DNVTLISARPVSGAPAPWVEQCICPVGYKGQFCQDCASGYKRDSAKLGPFGTCIPCNCQG
GGACDPDTGDCYSGDENPDIECADCPIGFYNDPHDPRSCKPCPCHNGFSCSVMPETEEVV
CNNCPPGVTGARCELCADGYFGDPFGEHGPVRPCQPCQCNNNVDPSASGNCDRLTGRCLK
CIHNTAGIYCDQCKAGYFGDPLAPNPAEKCRACNCNPMGSEPVECRSDGTCVCKPGFGGP
NCEHGAFSCPACYNQVKIQMDQFMQQLQRMEALISKAQGGDGAVLDTELEGRMQQAEQAL
RDILRDAQISEGASRSLGLQLAKVRSQENSYRSRLDDLKMTVERVRALGSQYQNQVRDTH
RLITQMQLSLAESEASLGNTNIPASDHYVGPNGFKSLAQEATRLAESHVESASNMEQLTK
ETEDYSKQALSLVHKALREGGGSGSGSLDGAVVQGVVEKLEKTKSLAQQLTREATQAETE
ADRSYQHSLRLLDSVSQLQGVNDQTFQVEEAKRIKQKADSLSSLVTRHMDEFKRAQKNLG
DWKEETQQLLQNGKNGREKSDQLLSRANLAKSRAQEALSMGNATFYEVESILKNLREFDL
QVDNRKAEAEEAMKRLSYISQKVSDASDKTQQAERALGSAAADAQRAKNGAREALKISSE
IEQEIGSLNLEANVTADGALAMEKGLASLKSEMREVEGELERKELEFDTNMDAVQMVITE
AQRVDTRAKNTGVTIQDTLNTLDGLLYLMDQPVSIDEEGLILLEQKLSRAKNQINSQLRP
MMSELEERARRQRGHLHLLETSIDGILADVKNLENIRDNLPPGCYNTPVLEQQ
NT seq 3582 nt   +upstreamnt  +downstreamnt
atgcctgcgctctggctgggctgctgcctctgcttctcgctcctcctgcccgcagcccgg
gccacctccaggagggaagtctgtgattgcaatgggaagtccaggcagtgtatctttgat
caggaactttacagacaaactggtaatggattccgctgcctcaactgcaatgacaacact
gatggcattcactgcgagaggtgcaaggacggcttttaccggcacagagaaagggaccgc
tgtttgccctgcaattgtaactccaaaggttctcttagtgctcgatgtgacaactccgga
cggtgcagctgtaaaccgggcgtgacaggagccagatgcgaccgatgtctgccaggcttc
cacatgctcacggatgcggggtgcacccaagaccagagactgctagactccaagtgtgac
tgtgacccagctggcatcgcagggccctgtgacgcgggccgctgtgtctgcaagccagct
gtcactggagaacgctgtgataggtgtcgatcaggttactataatctggatagggggaac
cctgagggctgcacccagtgtttctgctatgggcattcagccagctgccgcagctctgca
gactacagtgtccataagatcacctctacctttaatcaagatgttgatggctggaaggct
gtccaacgaaatgggtctcctgcaaagctccaatggtcacagcgccatcaagatgtgttt
agctcagcccgacgaccagaccctgtctattttgtggctcctgccaaatttcttgggaat
caacaggtgagctatgggcaaagcctgtcctttgactaccgtgtggacagaggaggcaga
cacccatctgcccacgatgtgatcctggaaggtgctggtctacggatcacagctcccttg
atgccacttggcaagacactgccttgtgggctcaccaagacttacacactcaggttaaat
gagcatccaagcagtaattggagcccccagctgagttactttgagtatcgaaggttactg
cggaatctcacagccctccgcatccgagctacatatggagaatacagtactgggtacatt
gacaatgtgaccctgatttcagcccgacccgtctctggagccccggcaccctgggttgaa
cagtgtatatgtcctgttgggtacaaggggcaattctgccaggattgtgcttctggctac
aagagagattcagcgaaactggggccttttggcacctgtattccttgtaactgtcaaggg
ggaggggcctgtgatccagacacaggagactgttattcaggggatgagaatcctgacatt
gagtgtgctgactgcccaattggtttctacaacgatccgcacgacccccgcagctgcaag
ccctgtccctgtcataacgggttcagctgctcagtgatgccggagacagaggaggtggtg
tgcaataactgccctcccggggtcaccggtgcccgctgtgagctctgtgctgatggctac
tttggggacccctttggtgaacatggcccagtgaggccttgtcagccctgtcaatgcaac
aacaatgtggaccccagtgcctctgggaactgtgaccggctgacaggcaggtgtctgaag
tgtatccacaacacagccggcatctactgtgaccagtgcaaagcaggctacttcggggac
ccgttggctcccaatccagcagaaaagtgtcgagcttgcaactgcaaccccatgggctca
gagcctgtagaatgtcgaagtgatggcacctgtgtttgtaagccaggattcggtggcccc
aactgtgagcacggagcattcagctgtccagcttgctataatcaagtgaagattcagatg
gatcagtttatgcagcagcttcagagaatggaggccctgatttcaaaggctcagggtggt
gatggagcagtacttgatacagagctggaaggcaggatgcagcaggctgagcaggccctt
cgggacattctgagagatgcccagatttcagaaggtgctagcagatcccttggtctccag
ttggccaaggtgagaagccaagagaacagctaccggagccgcctggatgacctcaagatg
actgtggaaagagttcgggccctgggaagtcagtaccagaaccaagttcgggatactcac
aggctcatcactcagatgcagctgagcctggcagaaagcgaagcttccttgggaaacact
aacattcctgcctcagaccactacgtggggccaaatggctttaaaagtctggctcaggag
gccacgagattagcagaaagccacgttgagtcagccagtaacatggagcaactgacaaaa
gaaactgaggactattccaaacaagccctctcactggtgcacaaggccctgcgtgaagga
ggcggaagcggaagcggtagccttgacggtgctgtggtgcagggggttgtggaaaaattg
gagaaaaccaagtccctggcccagcagttgacaagggaggccactcaagcggaaactgaa
gcagataggtcttatcagcacagtctccgcctcctggattcagtgtctcagcttcaggga
gtcaatgatcagacctttcaggtggaagaagcaaagaggatcaaacaaaaagctgattca
ctctcaagcctggtgaccaggcatatggatgagttcaaacgtgcacaaaagaatctggga
gactggaaagaagaaacacagcagctcttacagaatggaaagaatgggagagaaaaatca
gatcagctgctttcccgtgccaatcttgctaaaagcagagcacaagaagcactaagtatg
ggcaatgccactttttatgaagttgagagcatccttaaaaacctcagagagtttgacctg
caggtggacaacagaaaagcagaagctgaagaagccatgaagagactctcctacatcagc
cagaaggtttcagatgccagtgacaagacccagcaagcagaaagagccctggggagtgct
gctgctgatgcacagagggcaaagaatggggccagggaggccctgaagatctccagtgag
attgaacaggagattgggagtctgaacttggaagccaatgtgacagcagatggagccttg
gccatggaaaagggactggcctctcttaagagtgagatgagggaagtggaaggagagctg
gaaaggaaggagctggagtttgacacgaatatggatgcagtacagatggtgattacagaa
gcccagagggttgacaccagagccaagaacactggggttacaatccaagacacactcaac
acattggatggcctcctgtatctgatggaccagcctgtcagtatagatgaagaggggctg
atcttattggagcagaagctttcccgagccaagaaccagatcaacagccaactgcggccc
atgatgtcagagctggaagagagggcacgtcggcagaggggccacctccatttgctggag
acaagcatagatgggattctggctgatgtgaagaacttagagaacattagggacaacctg
cccccaggctgctacaataccccggttcttgagcaacagtga

KEGG   Macaca mulatta (rhesus monkey): 715265
Entry
715265            CDS       T01028                                 

Gene name
LAMC1
Definition
(RefSeq) laminin subunit gamma-1
  KO
K05635  laminin, gamma 1
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05020  Prion disease
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    715265 (LAMC1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    715265 (LAMC1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    715265 (LAMC1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    715265 (LAMC1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    715265 (LAMC1)
  09164 Neurodegenerative disease
   05020 Prion disease
    715265 (LAMC1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    715265 (LAMC1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    715265 (LAMC1)
   05145 Toxoplasmosis
    715265 (LAMC1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 715265
NCBI-ProteinID: XP_002808304
Ensembl: ENSMMUG00000015084
UniProt: F6WVB2
LinkDB
Position
1
AA seq 1608 aa
MRGSQRAAPALRRRGRLWPVLAVLAAAAAGCAQAAMDECTDEGGRPQRCMPEFVNAAFNV
TVVATNTCGTPPEEYCVQTGVTGVTKSCHLCDAGQPHLQHGAAFLTDYNNQADTTWWQSQ
TMLAGVQYPSSINLTLHLGKAFDITYVRLKFHTSRPESFAIYKRTREDGPWIPYQYYSGS
CENTYSKANRGFIRTGGDEQQALCTDEFSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQ
EWVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYAISDFAVGGRCKCNGHASECMKNEFD
KLVCNCKHNTYGVDCEKCLPFFNDRPWRRATAESASECLPCDCNGRSQECYFDPELYRST
GHGGHCTNCQDNTDGAHCERCRENFFRLGNNEACSPCHCSPVGSLSTQCDSYGRCSCKPG
VMGDKCDRCQPGFHSLTEAGCRPCSCDPSGSIDECNVETGRCVCKDNVEGFNCERCKPGF
FNLESSNPRGCTPCFCFGHSSVCTNAVGYSVYSISSSFQIDEDGWRVEQRDGSEASLEWS
SERQDIAVISDSYFPRYFIAPAKFLGKQVLSYGQNLSFSFRVDRRDTRLSAEDLVLEGAG
LRVSVPLIAQGNSYPSETTVKYVFRLHEATDYPWRPALTPFEFQKLLNNLTSIKIRGTYS
ERSAGYLDDVTLASARPGPGVPATWVESCTCPVGYGGQFCEMCLSGYRRETPSLGPYSPC
VLCTCNGHSETCDPETGVCNCRDNTAGPHCEKCSDGYYGDSTAGTSSDCQPCPCPGGSSC
AVVPKTKEVVCTNCPTGTTGKRCELCDDGYFGDPLGRNGPVRLCRLCQCNDNIDPNAVGN
CNRLTGECLKCIYNTAGFYCDRCKDGFFGNPLAPNPADKCKACSCNPYGTVKQQSNCNPV
TGQCECLPHVTGRDCGACDPGFYNLQSGQGCERCDCHALGSTNGQCDIRTGQCECQPGIT
GQHCERCEVNHFGFGPEGCKPCDCHPEGSLSLQCKDDGRCECREGFVGNRCDQCEENYFY
NRSWPGCQECPACYRLVKDKVADHRVKLQELESLIANLGTGDEMVTDQAFEDRLKEAERE
VMDLLREAQDVKDVDQNLMDRLQRVNNSLSSQISRLQNIRNTIEETGNLAEQARAHVENT
ERLIEIASRELEKAKVAAANVSVTQPESTGDPNNMTLLAEEARKLAERHKQEADDIVRVA
KTANDTSTEAYNLLLRTLAGENQTAFEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAG
DKAVEIYASVAQLSPLDSETLENEANNIKMEAENLERLIDQKLKDYEDLREDMRGKELEV
KNLLEKGKTEQQTADQLLARADAAKALAEEAAKKGRDTLQEANDILNNLKDFDRRVNDNK
TAAEEALRKIPAINQTIIEANEKTREAQQALGNAAADATEAKNKAHEAERIASAVQKNAT
STKAEAERTFAEVTDLDNEVNNMLKQLQEAEKELKRKQDDADQDMMMAGMASQAAQEAEI
NARKAKNSVTSLLSIINDLLEQLGQLDTVDLNKLNEIEGTLNKAKDEMKVSDLDRKVSDL
ENEAKKQEAAIMDYNRDIEEIMKDIRNLEDIRKTLPSGCFNTPSIEKP
NT seq 4827 nt   +upstreamnt  +downstreamnt
atgagggggagccagcgggccgcgccggccctgcggcgccgggggcggctctggcccgtg
ttggccgtgctggcggccgccgcggcgggctgtgcccaggcagccatggacgagtgcacg
gacgagggcgggcggccgcagcgctgcatgcccgagttcgtcaacgccgccttcaacgtg
accgtggtggctaccaacacgtgtgggactccgcccgaggagtactgtgtgcagaccggg
gtgaccggggtcaccaagtcctgtcacctgtgcgacgccgggcagccccacctgcagcac
ggggcagccttcctgaccgactacaacaaccaggccgacaccacctggtggcaaagccag
accatgctggccggggtgcagtaccccagctccatcaacctcacgctgcacctgggaaaa
gcttttgacatcacctatgtgcgtctcaagttccacaccagccgcccagagagctttgcc
atttacaaacgcacgcgggaagatgggccctggattccttaccagtactacagtggttcc
tgcgagaacacctactccaaggcaaaccgcggcttcatcaggacaggaggggacgagcag
caggccttgtgtactgatgaattcagtgacatttctcccctcactgggggcaatgtggcc
ttttctaccctggaaggaaggccaagtgcctataactttgacaatagccctgtgctgcag
gaatgggtaactgccactgacatcagagtaactcttaatcgcctgaacacttttggagat
gaagtgtttaacgatcccaaagttctcaagtcctattattatgcgatctctgattttgct
gtgggtggcagatgtaaatgtaatggacacgcaagcgagtgtatgaagaacgaatttgat
aagctggtgtgtaactgcaaacataacacatatggagtagactgtgaaaagtgtcttcct
ttcttcaatgaccggccatggaggagggcaactgcagaaagtgccagtgaatgcctgccc
tgtgactgcaatggtcgatcccaggaatgctacttcgaccctgaactctatcgttctact
ggccatgggggtcactgtaccaactgccaggataacacagatggcgcccactgtgagagg
tgccgggagaacttcttccgccttggaaacaatgaagcctgctctccatgccactgtagt
cctgtgggctctctaagcacacagtgtgatagttacggcagatgcagctgtaagccagga
gtaatgggggacaaatgtgaccgttgccagcctggattccattctctcactgaggcagga
tgcaggccatgctcttgtgatccctctggcagcatagatgaatgtaatgttgaaacagga
agatgtgtttgcaaagacaatgtcgaaggcttcaattgtgaaagatgcaaacctggattt
tttaatctggaatcatctaatcctaggggttgcacaccctgcttctgctttgggcattct
tctgtctgtacaaatgctgttggctacagtgtttattctatctcctcttcctttcagatt
gatgaggatgggtggcgtgtggaacagagagatggctctgaagcatctctggaatggtcc
tctgagaggcaagatattgccgtgatctcagacagctactttcctaggtacttcattgct
cctgcaaagttcttgggcaagcaggtgttgagttatggtcagaacctctccttctccttt
cgagtggacaggcgagatactcgcctctctgcagaagaccttgtgcttgagggagctggc
ttaagagtatctgtgcccttgatcgctcagggcaattcctatccaagtgagaccactgtg
aagtatgtcttcaggctccatgaagcaacagattacccttggaggcctgctcttacccct
tttgaatttcagaagctcctaaacaacttgacctctatcaagatccgtgggacatacagt
gagagaagtgctggatatttggatgatgtcaccctggcaagtgctcgtcctgggcctgga
gtccctgcaacttgggtggagtcctgcacctgtcctgtgggatatggagggcagttttgt
gagatgtgcctttcaggttacagaagagaaactcctagtcttggaccatacagtccatgt
gtgctttgcacctgcaatggacacagcgagacctgtgatcctgagacaggtgtttgtaac
tgcagagacaatacggctggcccgcactgtgagaagtgcagtgatgggtactatggagat
tcaactgcaggaacctcctctgattgccagccctgtccgtgtcccggaggctcaagttgt
gctgttgtccccaagacaaaagaggtggtgtgcaccaactgtcctactggcaccactggt
aagagatgtgagctctgtgatgatggctactttggagaccccctaggtagaaacggccct
gtgagactttgccgcctgtgccagtgcaatgacaacattgatcccaacgcagttggaaat
tgcaatcgcttgacaggagaatgcctgaagtgcatctataatactgctggcttctattgt
gaccggtgcaaagacggattttttggaaatcccctggctcccaatccagcagacaaatgc
aaagcctgcagttgcaatccgtatgggacagtgaagcagcagagcaactgtaaccccgtg
acggggcaatgtgaatgtttgcctcatgtgactggccgggactgtggtgcttgcgaccct
ggattctacaacctgcagagtgggcaaggctgtgagaggtgtgactgccatgccttgggc
tccaccaatgggcagtgtgacatccgcaccggccagtgtgagtgccagcctggcatcacc
ggtcagcactgtgagcgctgtgaggtcaaccactttgggtttggacctgaaggctgcaaa
ccctgtgactgtcatcccgaggggtctctttcacttcagtgcaaagacgatggtcgctgt
gaatgcagagaaggctttgtgggaaatcgctgtgaccagtgtgaagaaaactatttctac
aatcggtcttggcctggctgccaggaatgtccagcttgttaccgtttggtaaaggataag
gttgctgatcatcgagtgaagctccaggaattagagagtctcatagcaaaccttggaact
ggggatgagatggtgacggatcaagccttcgaggatagactaaaggaagcagagagggaa
gttatggacctccttcgtgaggcccaggatgtcaaagatgttgaccagaatttgatggat
cgcctacagagagtgaataacagtctgtccagccaaattagccgtttacaaaatatccgg
aataccattgaagagactggaaacttggctgaacaagcacgtgcccatgtagagaacaca
gagaggttgattgaaatcgcgtccagagaacttgaaaaagcaaaagtagctgctgccaat
gtgtcagtcactcagccagaatctacaggggacccaaacaacatgactcttttggcggaa
gaggctcgaaagcttgctgaacgtcataaacaggaagctgatgacattgttcgagtggca
aagacagccaatgatacatcaactgaggcatacaacctgcttttgaggacattggcagga
gaaaatcaaacagcatttgagattgaagagcttaataggaagtatgaacaagcgaagaac
atctcacaagatctggaaaaacaagctgcccgagtacatgaggaggccaaaagggccggt
gacaaagctgttgagatctatgccagcgtggctcagctgagccctttggactctgagaca
ctggagaatgaagcaaataacataaagatggaagctgagaatctggaacgtctgattgac
cagaagttgaaagattatgaggacctcagagaagacatgagagggaaggaacttgaagtc
aagaaccttctggagaaaggaaagactgaacagcagaccgcagaccaactcctagcccga
gctgacgctgccaaggccctcgccgaagaggctgcaaagaagggacgtgataccttacaa
gaagctaatgacattctcaacaacctgaaagattttgataggcgcgtgaacgataacaag
acggccgcagaggaggcactaaggaagattcctgccatcaaccagaccatcattgaagcc
aatgaaaagaccagggaagcccagcaggccctgggcaacgctgcggcggacgccacagag
gccaagaacaaggcccatgaggcggagaggatcgcgagcgctgtccaaaagaatgccacc
agcactaaggcagaagctgaaagaacttttgcagaagttacagatctggataatgaggtg
aacaatatgttgaagcaactacaggaagcagaaaaagagctaaagcgaaaacaagatgac
gctgaccaggacatgatgatggcagggatggcttcacaggctgctcaagaagccgagatc
aatgccagaaaagccaagaactctgttactagcctcctcagcattattaatgacctcttg
gaacagctggggcagcttgacacagtggacctgaataagctaaacgagattgaaggcacc
ctaaacaaagccaaagatgaaatgaaggtcagcgatcttgataggaaagtgtctgacctg
gagaatgaagccaagaagcaggaggctgccatcatggactataatcgggatatcgaggag
atcatgaaggacattcgcaatctggaagacatcaggaagaccttaccatctggctgcttc
aacaccccgtccattgaaaagccctag

KEGG   Macaca mulatta (rhesus monkey): 715394
Entry
715394            CDS       T01028                                 

Gene name
LAMA2
Definition
(RefSeq) laminin subunit alpha-2 isoform X1
  KO
K05637  laminin, alpha 1/2
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
mcc05410  Hypertrophic cardiomyopathy
mcc05412  Arrhythmogenic right ventricular cardiomyopathy
mcc05414  Dilated cardiomyopathy
mcc05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    715394 (LAMA2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    715394 (LAMA2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    715394 (LAMA2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    715394 (LAMA2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    715394 (LAMA2)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    715394 (LAMA2)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    715394 (LAMA2)
   05414 Dilated cardiomyopathy
    715394 (LAMA2)
   05416 Viral myocarditis
    715394 (LAMA2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    715394 (LAMA2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    715394 (LAMA2)
   05145 Toxoplasmosis
    715394 (LAMA2)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_N Laminin_B Laminin_II Laminin_G_3 LXG DUF3450 DUF948
Other DBs
NCBI-GeneID: 715394
NCBI-ProteinID: XP_014992959
Ensembl: ENSMMUG00000022136
LinkDB
Position
4
AA seq 3226 aa
MPGAAGVLLLLLLSGGLRGGQAQRPQQQRRPQAHQQRGLFPAVLNLASNALITTNATCGE
KGPEMYCKLVEHVPGQPVRNPQCRICNQNSSNPNQRHPITNAIDGKNTWWQSPSIKNGIE
YHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDDVEYKPWQYHAVTDTECLTLY
NIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARY
IRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDP
ATNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDENVA
RRSLSLNIHGKYIGGGVCINCTQNTAGINCETCIDGFFRPKGVSPNYPRPCQPCHCNPVG
SLNEVCVKDEKHARRGLAPGSCHCKTGFGGVSCDRCARGYTGYPDCKACNCSGLGSKNED
PCFGPCNCKENVEGGDCSRCKSGFFNLQEDNWKGCDECFCSGVSDRCQSSYWTYGKIQDM
SGWYLTDLSGHIRVAPQQDDSDSPQQISISNAEARQALPHSYYWSAPASYLGNKLPAVGG
QLTFTISYDLEEEEEDTEHVLQFMIILEGNDLRISTAQDEVYLHPSEEHVNVLLLKEESF
TIHGTHFPVSRKEFMTVLANLKRVLLQITYSFGMDAIFRLSSVNLESAVSYPTDGSVAAA
VEVCQCPPGYTGSSCEKRHQANASSNLIPANNAKSCWPRHRRVNGTIFGGICEPCQCFGH
AESCDDVTGECLNCKDHTDGPYCDKCLPGFYGDPTKGTSEDCQPCACPLNIPSNNFSPTC
HLDRSLGLICDGCPVGYTGPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPGSCDSL
SGSCLICKPGTTGRYCELCADGYFGDAVDARNCQPCRCNANGSFSEICHSQTGQCECRAN
VQGQRCDKCKPNMWWDPEKRFCVLCDCDPVGSVSPQCDITGRCVCKSDFVGKQCNLGRQV
HQQEEQPRRAQRVLGSPQRWAIGSSSGCPRGAYRAPAPPGTFGLQSARGCVPCNCNSFGS
KSFDCEESGQCWCQPGVTGKKCDRCAHGYFSFQEGGCTACECSHLGNNCDPKTGQCICPP
NTIGEKCSKCAPNTWGHSITTGCKACNCSTVGSLDFQCNVNTGQCNCHPKFSGAKCTECS
RGHWNYPRCNLCDCFLPGTDATTCDSETKKCSCSDQTGQCTCKVNVEGIHCDRCQPGKFG
LDAKNPLGCSSCYCFGTTTQCSEAKGLIRTWVTLKAEQTILPLVDEALQHTTTKGIVFQH
PEIVAHMDLMREDLHWEPFYWKLPEQFEGKKLMAYGGKLKYAIYFEAREETGFSTYNPQV
IIRGGTPTHARIIVRHMAAPLIGQLTRHEIEMTEKEWKYYGDDPRVHRTVTREDFLDILY
DIHYILIKATYGNFMRQSRISEISMEVAEQGRRTAVTPPAHLIEKCDCPLGYSGLSCEAC
LPGFYRLRSQPGGRTPGPTLGTCVPCQCNGHSSLCDPETSICQNCQHHTAGDFCERCALG
YYGIVKGLPNDCQQCACPLISSSNNFSPSCVTEGLDDYRCTACPRGYEGQYCERCAPGYT
GSPSSPGGSCQECECDPYGSLPVPCDPVTGICTCRPGATGRKCDGCKHWHAREGWECVFC
GDECTGLLLGDLARLEQMVMSINLTGPLPAPYKMLYGLENMTQELKHLLSPQRAPERLIQ
LAEGNLNTLVTEMNELLTRATKVTADGEQTGQDAERTNTRAKSLGEFIKELARDAEAVNE
KAIKLNETLGTRDEAFERNLEGLQKEIDQMIKELRRKNLETQKEIAEDELVAAEGLLKKV
KKLFGESRGKNEEMEKDLREKLADYKNKVDDAWDLLREATNKIREANRLFAVNQKNMTAL
EKKKEAVESGKRQIENTLKEGNDILDEANRLADEINSIIDYVEDIQTKLPPMSEELNNKI
DDLSQEIKDRKLAEKVSQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFKAYSNIKDY
IDEAEKVAKEAKDLAHEATKLATGPRGLLKEDAKGSLQKSFRILNEAKKLANDVKENEDH
LNGLKTRIENADARNGDLLRALNDTLGKLSAIPNDTAAKLQAVKDKARQANDTAKDVLAQ
IKELHQNLDGLKKNYDKLADSVAKTNAVVKDPSKNIADADATVKNLEQEADRLIDKLKPI
KELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTYKPEIKKGSYNNIVVNVKT
AVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDSYWYRIMASR
TGRNGTISVRALDGPKASIVPSTYHSTSPPGYTILDVDANAMLFVGGLTGKLKKADAVRV
ITFTGCMGETYFDNKPIGLWNFREKEGDCKGCTVSPQVEDSEGTIQFDGEGYALVSRPIR
WYPNISTVMFKFRTFSSSALLMYLATRDLRDFMSVELTDGHIKVSYDLGSGMASVVSNQN
HNDGKWKSFTLSRIQKQANISIVDIDTNQEENIATSSSGNNFGLDLKADDKIYFGGLPTL
RNLSMKARPEVNLKKYSGCLKDIEISRTPYNILSSPDYVGVTKGCSLENVYTVSFPKPGF
VELSPVPIDVGTEINLSFSTKNESGIILLGSGGTPAPPRRKRRQTGQAYYAILLNRGRLE
VHLSTGARTMRKIVVRPEPNLFHDGREHSVHVERTRGIFTVQVDENRRYMQNLTVEQPLE
VKKLFVGGAPPEFQPSPLRNIPPFEGCVWNLVINSVPMDFARPVSFKNADIGRCVHQKLH
EDEDGAAPAETVIQPEPVPTPAFPTPTAVLTHGPCAAESEPALLMGSKQFGLSRNSHIAI
AFDDTKVKNHLTIELEVRTEAESGLLFYMARINHADFATVQLRNGLPYFSYDLGSGDTNT
MIPTKINDGQWHKIKIMRSKQEGILYVDGASNRTISPRKADILDVVGMLYVGGLPINYTT
RRIGPVTYSIDGCIRNLHMAEAPADLEEPTSSFHIGTCFANAQRGTYFDGTGFAKAVGGF
KVGLDLLVEFEFRTTRTTGVLLGISSQKMDGMGIEMIDEKLMFHVDNGAGRFTAVYDAGV
PGHLCDGQWHKVTANKIKHRIELTVDGNQVEAQSLNPASTSADTNDPVFVGGFPDDLKQF
GLTTSIPFRGCVRSLKLTKGTGKPLEVNFAKALELRGVQPVSCPAN
NT seq 9681 nt   +upstreamnt  +downstreamnt
atgccgggagccgccggggtcctcctcctcctgctgctctccgggggcctcaggggcggt
caggcgcagcggccgcagcagcagcggcggccacaggcacatcagcaaagaggtttattc
cctgctgtcctgaatcttgcgtctaatgctcttatcacaaccaatgcaacatgtggagaa
aagggacctgaaatgtactgcaaattggtggaacatgtccctgggcagcctgtgaggaat
ccccagtgtcgaatctgcaatcaaaacagcagcaatccaaaccagagacacccgattaca
aatgctattgatggaaagaacacttggtggcagagtcccagtattaagaatggaatcgaa
taccattacgtgacaattacactggatttacagcaggtgttccagattgcgtatgtgatt
gtgaaggcagctaactccccccggcctggaaactggattttggaacgctctcttgatgat
gttgaatacaagccctggcagtatcatgctgtgacagacacggagtgcctaaccctttac
aatatttatccccgcactgggccaccgtcatatgccaaagacgatgaggtcatctgcact
tcattttactccaagatacaccccttagaaaatggagagattcacatctctttaatcaat
gggagaccaagtgccgatgatccttctccagaactgctggagtttacctctgctcgctat
attcgcctgagatttcagaggatccgcacactgaatgctgacttgatgatgtttgctcac
aaagacccaagagaaattgaccccattgtcaccagaagatattactactcggtcaaggat
atttcagttggagggatgtgcatctgctatggtcatgccagggcttgtccacttgatcca
gcgacaaataaatctcgctgtgagtgtgaacataacacatgtggcgatagctgtgatcag
tgctgtccaggattccatcagaaaccctggagagctggaacttttctaactaaaactgaa
tgtgaagcatgcaattgtcatggaaaagctgaagaatgctattatgatgaaaatgttgcc
agaagaagtctgagtttgaatatacatggaaagtacattggagggggtgtttgcattaat
tgtacccaaaatactgctggtataaactgtgagacatgtattgatggcttcttcagaccc
aaaggggtatctccaaattatccaaggccatgccagccatgtcattgcaatccagttggt
tccttaaatgaagtctgtgtcaaggatgagaaacatgctcgacgaggtttggcacctgga
tcctgtcattgcaaaactggttttggaggtgtgagctgtgatcggtgtgccaggggctac
actggctacccagactgcaaagcctgtaactgcagtggcttagggagcaaaaatgaggac
ccttgttttggcccctgtaactgcaaggagaatgttgaaggaggagactgtagtcgttgc
aaatctggcttcttcaatttgcaagaggataattggaaaggctgcgatgagtgtttctgt
tcaggtgtttcagacagatgtcagagttcctactggacctatggcaaaatacaagatatg
agtggctggtatctgactgacctttctggccacattcgagtggctccccagcaggacgac
tcggactcacctcagcagatcagcatcagtaacgcggaggcccggcaagccctgccgcac
agctactactggagcgcaccggcttcctatctgggaaacaaactcccagcagtaggagga
cagttgacatttaccatatcatatgacctggaagaagaggaagaagatacagaacatgtt
ctccagtttatgattatcttagagggtaatgacttgagaatcagcacagcccaagatgag
gtgtacctgcacccatctgaagaacatgttaatgtattgttgcttaaagaagaatcattt
accatacatggcacacattttccagtcagtagaaaggaatttatgacagtgcttgcgaat
ttgaagagagtcctcctacaaatcacatacagctttgggatggatgccatcttcaggttg
agctctgttaaccttgaatccgctgtctcctatcctactgatggaagcgttgcagcagct
gtagaagtatgtcagtgcccaccagggtataccggctcctcttgtgaaaagcgacaccaa
gctaatgcttcctctaatctcatacctgccaataacgctaagtcttgttggcctaggcac
aggcgagttaacggcaccatttttggtggcatctgtgagccatgtcagtgctttggtcat
gcggagtcctgtgatgacgtcactggagaatgcctgaactgtaaagatcacacagatggc
ccatattgtgataaatgtcttcctggtttctatggcgatcctactaaaggaacctctgaa
gactgtcaaccgtgcgcctgtccactcaatatcccatccaataactttagcccaacgtgc
catttagatcggagtcttggattgatctgtgatggatgccctgtcgggtacacaggacca
cgctgtgagaggtgtgcagaaggctattttggacaaccctctgtacctggaggatcatgt
cagccatgccaatgcaatgacaaccttgacttctccatccctggcagctgtgacagcttg
tctggctcctgtctgatatgtaaaccaggtacaacaggccgatactgtgagctctgtgct
gatggatattttggagacgcagttgatgcaaggaactgtcagccctgtcgctgtaatgcc
aatggctctttctctgagatttgccacagtcaaactggacagtgtgagtgcagagccaac
gtgcagggtcagaggtgtgacaaatgcaagcccaatatgtggtgggatccagagaagcga
ttctgtgtgctatgtgactgcgatcccgttggctctgtgtcaccgcaatgtgatattaca
ggaagatgtgtctgtaaatcagacttcgtagggaaacagtgcaacctcggcaggcaggtg
caccaacaggaggagcagccgcggagagcgcaacgggtgctgggttctcctcagaggtgg
gctatcggcagctccagcgggtgccctcggggagcctatcgggccccagctccgcctggg
acctttggtctgcaatcagcaaggggctgtgttccctgcaactgcaattcttttgggtct
aagtcattcgactgtgaagagagtggacaatgttggtgccaacccggagtcacagggaag
aaatgtgaccgctgtgcccacggctatttcagcttccaagaaggaggctgcacagcttgt
gaatgttctcatctgggtaataattgtgacccaaagactgggcaatgcatttgccctccc
aataccattggagagaaatgttctaaatgcgcacccaatacctggggccacagcattacc
actggttgtaaggcttgtaactgcagcacagtgggatccttggatttccaatgcaatgta
aatacaggccaatgcaactgtcatccaaaattctctggtgcaaaatgtacagagtgcagt
cgaggtcactggaactaccctcgctgcaatctctgtgactgcttcctccctgggacggat
gccacaacctgtgattcagagactaaaaaatgctcctgtagtgatcaaactgggcagtgc
acttgtaaggtgaatgtggaaggcatccactgtgacagatgccagcctggcaaattcgga
ctcgatgccaagaatccacttggctgcagcagctgctattgcttcggcactactactcag
tgctctgaagcaaaaggactgatccggacgtgggtgactctgaaggctgagcagaccatt
ctacccctggtagatgaggctctgcagcacacgaccaccaagggcattgtttttcaacat
ccagagattgttgcccacatggacctgatgagagaagatctccactgggaacctttttat
tggaaacttccagaacaatttgaaggaaagaagttgatggcctatgggggcaaactcaag
tatgcaatctatttcgaggctcgggaagaaacaggtttctctacatataatcctcaagtg
atcattcgaggtgggacacctactcatgctagaattatcgtcaggcatatggctgctcct
ttgattggccaattgacaaggcatgaaattgaaatgacagagaaagaatggaaatattat
ggggacgatcctcgagttcatagaactgtgacccgagaagacttcttggatatactatat
gatattcattacattcttatcaaggctacttatggaaatttcatgcgacaaagcaggatt
tctgaaatctcaatggaggtagctgaacaaggacgtagaacagcagtgactcctccagct
cacttgattgaaaaatgtgattgccccctgggctattctggcctgtcctgtgaggcatgc
ttgccaggattttatcgactgcgttctcaaccaggtggccgcacccctggaccaaccctg
ggcacctgtgttccatgtcaatgtaatggacacagcagcctgtgtgaccctgaaacatcg
atatgccagaattgtcaacatcacactgctggtgacttctgtgaacgatgtgctcttgga
tactatggaattgtcaagggattgccaaatgactgtcagcaatgtgcctgccctctgatt
tcttccagtaacaatttcagcccctcttgtgtcacagaaggccttgatgactaccgctgc
actgcttgtccacggggatacgaaggccagtactgtgaaaggtgtgcccctggctatact
ggcagtccaagcagccctggaggctcctgccaagaatgtgagtgtgatccctatggctca
ctgcctgtgccctgtgaccctgtcacaggaatctgcacgtgccgacccggagccacggga
aggaagtgtgacggctgcaagcactggcatgcacgcgagggctgggagtgtgttttctgt
ggagatgagtgcaccggccttcttctcggtgacttggctcgcctggagcagatggtcatg
agcatcaacctcactggtccgctgcctgcgccatataaaatgctgtatggtcttgaaaat
atgactcaggagctaaagcacttgctctctcctcagcgggccccagagaggcttattcag
ctggcagagggcaatctgaatacactcgtgaccgaaatgaacgagctgctgaccagggct
accaaagtgacagcagatggcgagcagactggacaggatgctgagaggaccaacacaaga
gcaaagtccctgggagaattcattaaggagcttgcccgggatgcagaagctgtaaatgaa
aaagctataaaactaaatgaaactctaggaactcgagacgaggcctttgagagaaatttg
gaaggccttcagaaagagattgaccagatgattaaagaactgaggaggaaaaatctagag
acacaaaaggaaattgctgaagacgagttggtagctgcagaagggcttctgaaaaaagtg
aagaagctgtttggagagtcccggggaaaaaatgaagaaatggagaaggatctccgagaa
aaactggcagactacaaaaacaaagttgatgacgcttgggacctgttgagagaagccacg
aataaaatcagagaagctaatcgcttatttgcagtaaaccagaaaaacatgactgcgttg
gagaaaaagaaggaggctgttgaaagtgggaaacgacaaattgagaacactttaaaagag
ggcaatgacatactcgatgaagccaaccgtcttgcagatgaaatcaactcgatcatagac
tatgttgaagacatccaaactaaattgccacctatgtctgaggagcttaacaacaaaata
gatgacctctcccaagaaataaaggacaggaagcttgccgagaaggtgtcccaagctgag
agccatgcagctcagttgaatgactcatctgctgtccttgatggaatccttgatgaggct
aaaaacatctccttcaatgccactgcagccttcaaagcttacagcaatattaaggactat
attgatgaagctgagaaagttgccaaagaagccaaagatcttgcacacgaagctacgaaa
ctggcaacaggtcctcggggtttattaaaggaagatgccaaaggctctcttcagaaaagc
ttcaggattcttaacgaagccaagaagttagcaaatgatgtaaaagaaaatgaagaccat
ctaaatggcttaaaaaccaggatagaaaatgctgatgctagaaatggggatctcttgaga
gctttgaatgacactttgggaaagttatcagctattccaaatgacacagctgctaaactg
caagctgtcaaggacaaagccagacaagccaacgacacagctaaagatgtactggcacag
attaaagagctccaccagaacctcgatggcctgaagaagaattacgataaactagcagat
agcgtcgccaaaacgaatgctgtggttaaagatccctccaagaacattgccgacgcagat
gccactgtcaaaaatctagaacaggaagctgaccggctaatagataaactcaaacccatc
aaggaacttgaggataatctaaagaaaaacatctctgagataaaggaattgataaaccaa
gcccggaaacaagccaattctatcaaagtatctgtgtcttcgggaggtgactgcattcga
acgtacaagccagaaatcaagaaaggaagttacaataatattgttgtcaatgtaaagaca
gctgttgctgacaacctcctcttttatcttggaagtgccaaatttattgactttctggct
atagaaatgcgtaaaggcaaagtcagcttcctctgggatgttggatctggagttggacgt
gtagagtacccagatttgactattgatgactcatattggtaccgtattatggcatcaaga
actgggagaaatggaactatttctgtgagagccctggatggacccaaagccagcattgta
cccagcacataccattcgacgtctcctccagggtacactattctagatgtggatgcaaat
gcaatgctgtttgttggtggcctgactgggaaattaaagaaggctgatgctgtacgtgtg
attacattcaccggctgcatgggagaaacatactttgacaacaaacctataggtttgtgg
aatttccgagaaaaagaaggtgactgcaaaggatgcactgtaagtcctcaggtggaagat
agtgaggggactattcagtttgatggagaaggttatgcattggtcagccgccccattcgc
tggtaccccaacatctctaccgtcatgttcaagttcagaacattttcttcgagtgctctc
ctgatgtatcttgcaacacgagacctgagagatttcatgagtgtggagctcactgatggg
cacataaaagtcagttatgatctgggttcaggaatggcttctgttgtcagcaatcaaaac
cataatgatggaaaatggaaatcattcaccctgtcaagaattcaaaaacaagccaatata
tcaattgtagatatagatactaatcaggaggagaacatagcaacttcatcttctggaaac
aactttggtcttgatttgaaagcagatgacaaaatatattttggtggcctgccaacgttg
agaaacttgagtatgaaagcaaggccagaagtaaatttgaagaaatattccggctgtctc
aaagatattgaaatttcaagaacgccgtacaatatactcagtagtcccgattatgttggt
gttaccaaaggatgttccctggagaatgtttacacagttagctttcctaagcctggtttt
gtggagctctcccctgtgccaattgatgtaggaacagaaatcaacctgtccttcagcacc
aagaatgagtctggcatcattcttctgggaagtggagggacaccagcaccacctaggaga
aaacgaaggcagactggacaggcctattatgcaatactcctcaacaggggccgtctggaa
gtgcatctctccacaggggcacgaacaatgaggaaaattgtagtcagaccagagccgaat
ctgtttcatgatggaagagaacattccgttcatgtagagcgaactagaggcatctttaca
gttcaagtcgatgaaaacagaagatatatgcaaaacctgacagttgaacagcctcttgaa
gttaaaaagcttttcgttggaggtgctccgcctgaatttcaaccttccccactcagaaat
attcctccttttgaaggctgtgtgtggaatcttgttattaactctgtccccatggacttt
gcaaggcctgtatccttcaaaaacgctgacattggtcgctgtgtccatcagaaactccat
gaagatgaagatggagcagctccagctgaaacagttatccagcctgagccagttcccacc
ccagcctttcctacacccaccgcagttctgacacatggtccttgtgctgcagaatcagaa
ccagctcttttgatggggagcaagcagttcgggctttcaagaaacagtcacattgcaatt
gcatttgatgacaccaaagttaaaaaccatctcacaattgagttggaagtaagaaccgaa
gctgaatctggcttgcttttttacatggctcgcatcaatcatgctgattttgccacagtt
cagctgagaaatggattgccctacttcagctatgacttggggagtggggacaccaacacc
atgatccccaccaaaatcaatgatggccagtggcacaagattaagataatgagaagtaag
caagaaggaattctttatgtagatggggcctccaacagaaccattagtcccagaaaagct
gacatcctggatgttgtgggaatgctgtatgttggtgggttacccatcaactacactacc
cgaagaattggtccagtgacctatagcattgacggctgcatcaggaatctccacatggca
gaggcccctgctgatctggaagagcccacctccagcttccatatcgggacatgttttgca
aatgctcagaggggaacatattttgatggaacaggttttgccaaagcagttggtggattc
aaagtgggattggaccttcttgtagaatttgaattccgcacaactagaacaactggagtt
cttctggggatcagtagtcaaaaaatggatggaatgggtattgaaatgattgatgaaaag
ctgatgtttcatgtggacaatggcgccggcagattcactgctgtctacgatgctggggtt
ccagggcatttgtgtgatggacaatggcataaagtcactgccaacaagatcaaacaccgc
atcgagctcacagtcgatgggaaccaggtggaagcccaaagcctaaacccagcatctaca
tcagctgacacaaatgaccctgtgtttgttggaggcttcccagatgacctcaagcagttt
ggcctgaccaccagcattccgttccgaggttgcgtcagatccctgaagctcaccaaaggc
acgggcaagccactggaagttaattttgccaaggccctggaactgaggggcgttcaacct
gtatcatgcccagccaactaa

KEGG   Macaca mulatta (rhesus monkey): 716998
Entry
716998            CDS       T01028                                 

Gene name
TNXB
Definition
(RefSeq) tenascin-X isoform X7
  KO
K06252  tenascin
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05165  Human papillomavirus infection
mcc05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    716998 (TNXB)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    716998 (TNXB)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    716998 (TNXB)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    716998 (TNXB)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    716998 (TNXB)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    716998 (TNXB)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   716998 (TNXB)
SSDB
Motif
Pfam: fn3 Fibrinogen_C Pur_ac_phosph_N EGF_2 EGF_Tenascin DUF2369 DUF4998 hEGF CBX7_C
Other DBs
NCBI-GeneID: 716998
NCBI-ProteinID: XP_014991500
Ensembl: ENSMMUG00000000758
LinkDB
Position
4
AA seq 4248 aa
MMPAQYALTSSLALLVLLGTARAGPFSSRSNVTLPAPRPPPQPGGRTVGAGGGRPSSQLY
EHTVEGGEKQVVFTHRINLPPSTGCGCPPGTEPPVPASEVQALRVRLEILEELVKGLKEQ
CTGGCCPTAAQAGTGQTDVRTLCSLHGVFDLSRCTCSCEPGWGGPTCSDPTDAEIPPSSP
PSTSGSCPDDCNDQGRCVRGRCVCFPGYTGPSCGWPSCPGDCQGRGRCVQGVCVCRAGFS
GPDCSQRSCPRGCSQRGRCEDGRCVCDPGYTGEDCGMRSCPRGCSQRGRCENGRCVCNPG
YTGEDCGVRSCPRGCSQRGRCEDGRCVCDPGYTGEDCGTRSCRWDCGEGGRCVDGRCVCW
PGYTGEDCSTRTCPRDCRGRGRCEDGECICDTGYSGDDCGVRSCPGDCNQRGRCEDGRCV
CWPGYTGLDCGSRACPRDCRGRGRCENGVCVCNAGYSGEDCGVRSCPGDCRGRGRCESGR
CVCWPGYTGRDCGTRACPGDCRGRGRCVDGRCVCNPGFTGEDCGSRRCPGDCRGHGSCED
GVCVCDAGYSGEDCSTRSCPGGCRGRGQCIDGRCVCEDGYSGEDCSVRRCPNDCSQHGVC
QDGVCICWEAYVGEDCSIRTCPSNCHGRGRCEEGRCLCDPGYTGPTCATRMCPADCRGRG
RCVQGVCMCHVGYGGEDCGQEEPPASACPGGCGPRELCQAGQCVCVEGFRGPDCAIQTCP
GDCRGRGECHDGSCICKDGYAGEDCGEEVPAIEGMRMHLLEETTVRTEWTPAPGPVDAYE
IQFIPTTEGASPPFTARVPSSASAYDQRGLAPGQEYQVTVRALRGTSWGPPASKTITTMI
DGPQDLRVVAVTPTTLELGWLRPQAEVDRFVVSYVSAGNQRVRLEVPPEADGTLLTDLMP
GVEYVVTVTAERGRAVSYPASVRANTGSSPSGLLGTTDEPPPSGPSTTQGAQAPLLQQRP
QELGELRVLGRDETGRLRVVWTAQPDTFAHFQLRLRVPEGPGAHEEVLPGDVRQALVPPP
APGAPYELSLRGVPPGGKPSDPVIYQGIMDKDEEKPGKSSGPPRLGELTVTDRTSDSLLL
RWTVPEGEFDSFVIQYKDRDGQPQVVPVEGPQRSAVITSLDPGRKYKFVLYGFVGKKRHG
PLVTEAKILPQSDPRPGTPPRLGNLWVTDPTPDSLHLSWTVPEGQFDTFMVQYRDKDGRP
QVVPVEGPERSFVVSSLDPDHKYRFTLFGIANKKRYGPLTADGTTAPERKEEPPRPESLE
QPLLGELTVTGVTPDSLRLSWTVAQGPFDSFMVQYKDAQGQPQAVPVAGDENEVTVPGLD
PDRKYKMNLYGLRGRQRVGPESVVAKTAPQEDVDETPSPTELGTEAPEPSEEPLLGELTV
TGSSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQAVRVGGEESEVTVGGLEPGHKYKMHL
YGLHEGQRVGPVSAVGVTAPQQEETPPATKPPLEPRLGELTVTDVTPNSVGLSWTVPEGQ
FDSFMVQYKDKDGQPQAVPVAADQQEVTIYNLEPERKYKMNMYGLHDGQRVGPLSVVIMT
APLPPAPATEASEPPLEPRLGELTVTDVTPDSVGLSWTVPEGEFDSFVVQYKDRDGQPQV
VPVAADQREVTIPGLEPSRKYKFLLFGIQDGKRRSPVSVEAKTAARGDASPGAPPRLGEL
WVTDPTPDSLHLSWTVPEGQFDSFVVQFKDKDGPQVVPVEGHERSVTIAPLDAGRKYRFL
LYGLLGKKRHGPLTADGTTEARSAMDDTGTKRPPKPRLGEELQVTSVTQNSVGLSWTVPE
GQFDSFVVQYKDRDGQPQVVPVEGSLREVSVPGLDPAHRYKLLLYGLHSGKRVGPISAVV
ITARREDTETETTAPTPPAPEPRLGELTVEEATPHTLHLSWMVTEGEFDSFEIQHTDREG
QLQTVRTGGDRNDITLSGLESDHRYLVTLYGFHDGKHVGPVHVEALTVPEEEEPSEPPTA
TPEPPITPHLGELTVTDATPDSLSLSWTVPEGQFDHFLVQYRNGDGRPKAVRVPGHEDGV
TISGLEPDHKYKMNLYGFHGGQRVGPVSAIGVTAAEEETPSPTEPSTEAPEPPEEPLLGE
LTVTGSSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQVVRVGGEESEVTVGGLEPGNKYK
MHLYGLHEGRRVGPVSAVGVTAPKEESPDAPLVKPRLGEMTVRDATPDSLSLSWTVPEGQ
FDHFLVQYKNGDGQPKAVRVPGHEDGVTISGLEPDHKYKMNLYGFHGGQRMGPVSAVGLT
APGKDEEMAPAPTEPPAPEPPIKPRLGELTMTDATPDSLSLSWTVPEGQFDHFLVQYKNG
DGQPKTTRVPGHEDRVTISGLEPDHKYKMNLYGFHGGQRVGPVSAIGVTAAEEETPSPTE
PSTEAPEPSEEPLLGELTVTGSSPDSLSLSWTIPQGRFDSFTVQYKDRDGRPQVVRVGGE
ESEVTVGSLEPGNKYKMHLYGLHEGRRVGPVSAVGVTAPREDVDETPSPTEPGTEAPEPP
EEPLLGELTVTGSSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQVVRVGGEESEVTVGGL
EPGRKYKMHLYGLHEGQRLGPVSAVGVTAPEDEAEATQAVPTTTPEPPIKPRLGELTVTD
ATPDSLSLSWTVPEGQFDHFLVQYRNGDGQPKAVRVPGHEDGVTISGLEPDHKYKMNLYG
FHGGQRVGPMSVIGVTAAEEETPSPTEPSTEAPEPPEEPLLGELTVTGSSPDSLSLSWTV
PQGRFDSFTVQYKDRDGRPQVVRVGGEESEVTVGGLEPGRKYKMHLYGLHEGQRVGPVSA
VGVTAPEDEAETTQAAPTTTPEPPIKPRLGELTVTDATPDSLSLSWTVPEGQFDHFLVQY
RNGDGQPKAVRVPGHEDGVTISGLEPDHKYKMNLYGFHGGQRVGPVSVIRVTAAEEETPA
PTEPSTEAPEPPEEPLLGELTVTGSSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQVVRV
GGEESEVTVGGLEPGRKYKMHLYGLHEGRRVGPVSAVGVTAPEDEAEATQAVPTTTPEPP
IKPRLGELTVTDATPDSLSLSWTVPEGQFDHFLVQYRNGDGQPKAVRVPGHEDGVTISGL
EPDHKYKMNLYGFHGGQRVGPMSVIGVTVAEEETPSPTEPSTEAPEPPEEPLLGELTVAG
SSPDSLSLSWTVPQGRFDSFTVQYKDRDGQPQAVRVGGEESEVTVRGLEPGRKYKMHLYG
LYEGRRVGPVSAVGVTVPLPTPPPVEPRLGELTVAAVTSDSVGLSWTVAQGPFDSFLVQY
RDTQGQPQAVPVSGDLRAVTVSGLDPARKYKFLLFGLQNGKRHGPVPVEARTAPDTKLSP
RLGELTVTDATPDSVGLSWTVPEGEFDSFVVQYKDKDGRLQVVPVAADQREVTVPGLEPS
RKYKFLLYGLAGRKRMGPISADGTTASLEKEPPPRLGELTVTEETSSSLHLSWTVAQGPF
ESFVVQYRDTDGQPRAVPVAADQHTVTIEDLEPGKKYKFLLYGLVGGKRLGPVSALGMTA
PEEDTPAPELASEAPEPPEEPRLGVLAVTDTAPDSMRLSWSVAQGPFDSFVVQYEDTNGQ
PQALLVDGDQSKILISGLEPSTPYRFLLYGLHEGKRLGPLSAEGTTGPAPAGQTSGEPGP
RLSQLSVTDVTTSSLRLNWEAPPGAFDSFLLRFGVPSPSTLEPHPRPLLQRELMVPGTRH
SAVLRDLRPGTLYSLTLYGLRGPHKADSIQGTARTLSPVLESPRDLQFSEIRETSAKVNW
MPPPSRADSFKVSYQLADGGEPQSVQVDGRARTQKLQGLIPGARYEVTVVSVRGFEESEP
LTGFLTTVPDGPTQLRALNLTEGFAVLHWKPPQNPVDTYDVQVTAPGAPPLQAEAPGSAV
DYPLHDLVLHTNYTATVRGLRGPNLTSPASITFTTGLEAPRDLEAKEVTPRTALLTWTEP
QVRPTGYLLSFDTPGGRTQEILLPGGVTSHQLLGLFPSTLYNARLQAMWGESLLPPVSTS
FTTGGLRIPFPRDCGEEMQNGAGASRTTTIFLNGNRERPLNVFCDMETDGGGWLVFQRRM
DGQTDFWRDWEDYAHGFGNISGEFWLGNEALHSLTQAGDYSMRVDLRAGDEAVFAQYDSF
RVDSAAEYYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNNLLISCAVSYRGAWWYRNCH
YANLNGLYGSTVDHQGVSWYYWKGFEFSVPFTEMKLRPRNFRSPAGGG
NT seq 12747 nt   +upstreamnt  +downstreamnt
atgatgccagcccagtatgctctaacctccagcctggctctcctggtgctactgggcaca
gccagagcaggccccttctcttcacggtccaatgtgacactgccagccccccggccccct
ccccagccagggggccgcacggtgggggcaggagggggaaggccctcttctcagctttat
gagcacacagtggaaggaggggagaagcaggtggtattcacccaccgcattaacctgccc
ccttccactggctgtggttgtcccccaggcactgagcccccagtccctgcttcagaggtg
caggccctgagggtccggctagagatcctggaggagttggtgaaggggctcaaggaacag
tgcactgggggatgttgtcctactgctgcccaggctggcacaggtcagacagatgtgcgg
accctctgcagtctccatggtgtgtttgatctgagccgctgcacctgttcctgtgagcca
ggctggggtgggcccacctgctcagaccccacagatgctgagattcctccctcctcccca
ccctcaacctcggggtcctgcccagatgactgcaatgatcagggtcgctgtgtccgcggt
cgttgcgtgtgctttcccggctacactggccccagctgtggctggccatcctgtcccggg
gactgccaaggccgtgggcgctgcgtgcagggcgtgtgcgtgtgccgggcaggtttctca
ggccccgactgcagccagcgctcctgccctcggggttgcagccagaggggacgatgtgag
gatgggcgctgcgtgtgtgatccaggctacactggtgaggactgtggcatgaggagctgc
cctcgcggttgcagtcagagggggcgctgtgagaatgggcgctgcgtgtgcaaccctggc
tacactggcgaggactgtggggtgaggagctgccctcggggctgcagccagaggggacgc
tgcgaggacgggcgctgcgtgtgtgaccccggctacactggcgaggactgtggtacgcgg
agctgccgctgggactgtggtgagggcgggcgctgcgtggacggccgctgcgtgtgctgg
cccgggtacacaggcgaggactgcagcacgcggacatgtccgagggactgccggggccgc
gggcgctgcgaggacggcgaatgcatttgcgacacgggctacagcggggacgattgcggc
gtgcgcagctgccctggcgactgcaaccaaaggggccgctgtgaggacggccgctgcgtg
tgctggccagggtacactggactcgactgcggctcgcgcgcctgcccacgcgactgtaga
ggtcgcgggcgctgcgagaacggcgtatgtgtttgcaacgcgggctacagcggcgaggac
tgcggtgtgcgcagctgtcctggagactgtcgtggccggggccgctgtgagagtggccgc
tgcgtgtgttggccaggctacacaggccgggactgtggcacgcgcgcctgtcctggcgac
tgtcgcgggcgcgggcgctgcgtggatggccgctgcgtgtgtaacccgggcttcaccggt
gaggactgtgggagccgtcgctgtcccggggactgccgtgggcacggcagttgcgaggat
ggcgtgtgcgtgtgtgacgcaggctactcaggggaagactgcagcacgcgcagctgcccc
gggggctgccgaggccgcggccagtgcatagatgggcgctgtgtgtgcgaggacggctac
tctggtgaggattgcagtgtgaggcggtgcccgaatgactgcagccagcacggcgtgtgc
caggacggtgtgtgcatctgttgggaagcctacgtgggtgaggactgcagcatccgcacc
tgcccctccaactgccatgggcggggccgctgtgaggaggggcgctgcctgtgcgaccca
ggctacactggccccacctgtgccacccgcatgtgcccagctgactgccggggacgtggg
cgctgtgtgcagggagtgtgcatgtgccacgtgggctatggcggtgaggactgcgggcag
gaagagcctccagccagcgcctgccctggaggctgcgggccccgggaactgtgccaggca
ggccagtgtgtgtgtgtagagggcttccgaggccctgactgtgccatccagacatgccca
ggggactgccgtggccgaggagagtgtcacgatggcagctgtatctgcaaagatgggtat
gctggtgaagactgcggggaagaggtgccagccattgagggcatgaggatgcatctcttg
gaggagacaacagttcggacagagtggaccccggctcctggccctgtggatgcctatgaa
attcaattcatccccacgacagagggggcgagccccccattcacagcacgggttccaagc
tctgcctcagcctatgaccagagaggactggcccctggacaggagtaccaggtcactgtc
cgagcccttcgagggaccagctggggccctcctgcctccaagaccatcaccaccatgatc
gatgggccccaggacctccgagtggtggctgtgacgccgacaacactggagcttggctgg
ctgcgtccccaggccgaggtagaccgatttgtggtgtcctacgtcagtgccggcaaccag
agggtgcggctggaagtgccccctgaagcagacgggacgctgctgactgacctgatgcca
ggcgtggaatatgtggtgactgtcacagcggagcggggccgggcagtcagctacccagct
tccgtcagggccaacacagggtcctcaccctcgggcctcttggggactaccgatgagcct
cctccctcaggcccctcaacgacccaaggggcccaggctcctctcttgcagcagcgcccc
caggagctgggagagttgagggtgctgggcagagatgagacagggcgcctccgtgtggtc
tggaccgcccagcccgacacctttgcccacttccaactgcgcctgcgggtgcccgagggg
ccgggggcacatgaggaagtgctgccaggggacgtccgccaggctctggtgcctccaccc
gctcctggagccccgtacgagctgtcacttcgtggggtccctcctgggggcaagccctct
gaccctgtcatctaccaaggcattatggacaaggatgaggagaagcctgggaagtcctca
ggcccaccacgcctgggtgagctgacggtgacagacaggacctccgactccttgctcttg
cgctggacagtccccgagggcgagtttgactccttcgtgatccagtacaaagacagggac
gggcagccccaggtggtgcccgtggaggggccccagcgctcagccgtcatcacctccctg
gatcctggccgcaagtacaaatttgtcctgtatgggttcgttggcaagaagaggcatggt
cccctggtgactgaagccaagatcttgcctcagagtgacccacgtccagggactccaccc
cgtctgggaaacctgtgggtgacagaccctaccccagactcactgcacctctcctggact
gtccctgagggccagttcgacaccttcatggtccagtacagggacaaggatggacggccc
caggtggtacctgtggaagggcctgagcgttcatttgttgtctcctcactggaccctgac
cacaagtacagattcactctgtttggaatcgcgaataagaagcggtatggccccctcacg
gccgatggcaccactgctccagagaggaaagaggagcccccccgccccgagtccctggag
cagcccctcctgggggaactgacagtgaccggtgtgaccccagactctttgcgcctctca
tggacggtggcccagggccccttcgactcgttcatggtccagtacaaagatgcacagggg
cagccccaggcagtgcctgttgcgggggatgagaatgaggttaccgtccccggcctggat
cccgaccggaagtataagatgaacctctacgggcttcgtggcaggcagcgtgtggggccc
gagtctgtggtggccaagactgctcctcaggaggatgtggacgagacccccagccccaca
gaactgggcacggaggccccggagccctctgaggagccgctcctgggggagctgacagtg
acaggatcctcccctgactcgctgagcctctcctggaccgtcccccagggccgcttcgac
tccttcaccgtgcagtacaaagacagggatgggcggccccaggcggtgcgtgttgggggt
gaggagagtgaggtcaccgtggggggcctggagcccgggcacaagtacaagatgcacctg
tacggcctccacgagggacagcgcgtgggcccggtgtctgccgtgggcgtgacagcccca
caacaagaagagacccctccagccaccaagccccctctggagccacgcctaggagagctg
acagtgacagacgtgacccccaactctgtgggcctctcctggacagtccctgagggccag
tttgactccttcatggtccagtacaaagacaaggacgggcagcctcaggcggtgccggtg
gctgcagaccagcaggaggtcacaatctacaacctggagcctgagagaaaatacaagatg
aacatgtatggactacatgatgggcaacgcgtgggccccctgtctgtggtcatcatgacg
gctcccctcccaccagccccagccacggaggcctccgagcctcccctggagccacgccta
ggggagctgacagtgacggatgtaacccctgactctgtgggcctctcgtggacagtccct
gagggtgaattcgactcctttgtggttcagtacaaggacagggacgggcagccgcaggtg
gtgcctgtggctgcagaccagcgggaggtcaccatccctggcttggaaccctcccgcaag
tacaagttcctgctctttgggatccaggacgggaaacgacgcagcccagtctctgtggag
gcaaagacggctgcccgaggtgacgccagcccaggggccccacctcgccttggggagctg
tgggtgacagaccccaccccagactcactgcacctctcctggacggtccctgagggccag
ttcgactcttttgtggtccagttcaaagacaaagacgggccccaggtggtgcccgtggag
ggccacgagcgctctgtcaccatcgcccctctggacgccggccgcaagtacagattcctc
ctctatggcctcctgggcaagaagcgccatggccccctcactgccgacggcaccacggag
gcccggagtgctatggacgatactggaacaaagcgtcccccaaaaccccgtctgggggag
gagctgcaggtgaccagcgtgacccagaactccgtgggcctctcctggacagtccctgag
ggccaatttgactcctttgtggtccagtacaaggacagggacgggcagccccaggtggtg
cccgtggagggcagcctcagggaggtcagcgtgccgggcctggaccccgcccacaggtac
aagctgctgctctacggcctgcacagcggcaagcgtgtgggccccatctcagccgtcgtc
ataactgcccgcagggaagacacggaaactgagaccacggccccgacccctccagcgcct
gagccccgcctcggggagctgacagtggaggaggccacgccgcacaccctgcatctctcc
tggatggtgactgagggagaatttgactccttcgaaatccagcacacagatagagaaggg
caactccaaacggtccgtacaggaggtgaccggaatgacatcaccctctctggcctggaa
tccgaccacagatacctggtgaccctgtatggtttccacgatgggaagcatgtgggtcct
gtccatgtcgaggccctgacagtcccggaggaggaggagccttcggaacctcccaccgca
acccccgagccccccatcacgcctcacctgggggagctgaccgtgacagacgccaccccc
gactccctcagcctgtcctggacagttcctgagggccagtttgaccacttcctggtccag
tacaggaatggagatgggcggcccaaggctgtgcgggtgccagggcatgaggacggggtc
accatctcgggcctggagccagaccacaaatacaagatgaacctgtacggtttccacggt
ggccagcgcgtgggccctgtgtctgcaatcggggtgacggctgcagaggaagagaccccc
agccccacagaacccagcacggaggccccggagccccccgaggagccgctcctgggggag
ctgacagtgacaggatcctcccctgactcgctgagcctctcctggaccgtcccccagggc
cgcttcgactccttcaccgtgcagtacaaggacagggacgggcggccccaggtggtgcgt
gtcgggggcgaggagagcgaggtcactgtagggggcctggagcccgggaacaagtacaag
atgcacctgtacggcctccacgaggggcggcgcgtgggcccggtgtccgctgtgggcgtc
acggcccccaaagaggagtcccctgatgctcctcttgtgaagccgcgcctaggggagatg
acggtgagagacgccacccccgactccctcagcctgtcctggacagtccccgagggccag
tttgaccatttcttggtccagtacaagaatggggacgggcagcccaaggcggtgcgggtg
ccgggacacgaggacggggtcaccatctcgggcctggagccagaccacaagtacaagatg
aacctgtatggcttccacggtggccagcgcatgggccctgtgtctgctgttggtttaact
gccccaggaaaggatgaagaaatggccccagccccgacagaacctcccgcccccgaaccc
cccatcaagcctcgcctgggggagctgaccatgacagatgccacccctgactccctcagc
ctctcctggacggttcctgagggacagtttgaccacttcctggtccagtacaagaatggg
gatgggcagcccaagacaacgcgggtgccaggacatgaggacagggtcaccatctccggc
ctggagccagaccataaatacaaaatgaacctgtatgggttccacggtggccagcgtgtg
ggccccgtgtctgccatcggggtgacggctgcagaggaagagacccccagccccacagaa
cccagcacggaggccccggagccctctgaggagccgctcctgggggagctgacagtgaca
ggatcctcccctgactcgctgagcctctcctggaccatcccccagggccgcttcgactcc
ttcaccgtgcagtacaaggacagggacgggcggccccaggtggtgcgtgttgggggcgag
gagagcgaggtcaccgtggggagcctggagcccgggaacaagtacaagatgcacctgtac
ggcctccacgaggggcggcgcgtgggcccagtgtccgccgtcggcgtgactgccccacga
gaggatgtggatgagacccccagccccacagaaccaggcacagaggccccagagcccccc
gaggagccgctcctgggggagctgacagtgacaggatcctcccctgactcgctgagcctc
tcctggaccgtcccccagggccgcttcgactccttcaccgtgcagtacaaggacagggac
gggcggccccaggtggtgcgtgtcgggggcgaggagagtgaggtcaccgtggggggcctg
gagcccggacgcaagtacaagatgcacctgtacggcctccacgaggggcagcgcctgggc
ccggtgtctgccgtgggcgtcacagctccagaggatgaagccgaggccacccaagcagtg
cccaccacgacccctgagccccccatcaagcctcgcctgggggagctgaccgtgactgac
gccacccccgattccctcagcctgtcctggacggtccccgagggccagtttgaccacttc
ctggtccagtacaggaatggggacgggcagcccaaggcggtgagggtgccggggcacgag
gacggggtcaccatctcaggcctggagccagaccacaagtacaagatgaacctgtacggc
ttccacggtggccagcgcgtgggtcccatgtctgtcatcggggtgacagctgcagaggaa
gagacccccagccccacagaacccagcacggaggccccggagccccctgaggagccgctc
ctgggggagctgacagtgacaggatcctcccctgactcgctgagcctctcctggaccgtc
ccccagggccgcttcgactccttcaccgtgcagtacaaggacagggatgggcggccccag
gtggtgcgtgttgggggcgaggagagcgaggtcaccgtggggggcctggagccggggcgc
aagtacaagatgcacctgtacggcctccacgaggggcagcgcgtgggcccggtgtctgct
gtgggtgtgacagctccagaggatgaagccgagaccacccaagcagcacccaccacgacc
cctgagccccccatcaagcctcgcctgggggagctgaccgtgactgacgccacccccgat
tccctcagcctgtcctggacggtccccgagggccagtttgaccacttcctggtccagtac
aggaatggggacgggcagcccaaggcggtgagggtgccggggcacgaggacggggtcacc
atctcaggcctggagccagaccacaagtacaagatgaacctgtacggcttccacggtggc
cagcgcgtgggtcccgtgtctgtcatcagggtgacggctgcagaggaagaaactcccgcc
cccacagaacccagcacggaggccccagagccccctgaggagccgctcctgggggagctg
acagtgacaggatcctcccctgactcgctgagcctctcctggaccgtcccccagggccgc
ttcgactccttcaccgtgcagtacaaggacagggacgggcggccccaggtggtgcgcgtt
gggggcgaggagagcgaggtcaccgtggggggtctggagcccggacgcaaatacaagatg
cacctgtacggcctccacgaggggcggcgcgtgggcccggtgtctgccgtgggcgtgaca
gctccagaggatgaagccgaggccacccaagcagtgcccaccacgacccctgagcccccc
atcaagcctcgcctgggggagctgaccgtgactgacgccacccccgattccctcagcctg
tcctggacggtccccgagggccagtttgaccacttcctggtccagtacaggaatggggac
gggcagcccaaggcggtgagggtgccggggcacgaggacggggtcaccatctcaggcctg
gagccagaccacaagtacaagatgaacctgtacggcttccacggtggccagcgcgtgggt
cccatgtctgtcatcggggtgacggttgcagaggaagagacccccagccccacagaaccc
agcacggaggccccggagccccccgaggagccgctcctgggggagctgacagtggcagga
tcctcccctgactcgctgagcctctcctggaccgtcccccaggggcgcttcgactccttc
accgtgcagtacaaggacagggacgggcaaccccaggcggtgcgtgtcgggggcgaggag
agcgaggtcactgtgaggggcctggagcctgggcgcaagtacaagatgcacctgtacggc
ctctacgaggggaggcgcgtgggcccggtgtccgctgtgggcgtcacggtgcccctgccc
acaccgccgcccgtggagccccgcctgggggagttgacggtggcagccgtgacctcggac
tcagtgggcctctcgtggaccgtagcccaggggccctttgactccttcctggtgcagtac
agggacacgcaggggcagccccaggcagtgcctgtgagcggagacctccgagcagtcacc
gtctcagggctggacccggcccgcaagtacaagttcctgctctttggactccagaatggg
aaacgccacggcccggtccctgtggaggccaggaccgccccagacacgaaactgtctccc
cgcctgggggagctgactgtgacagatgcgacccctgactccgtgggcctctcgtggacg
gtccctgagggggaattcgactccttcgtggtccagtacaaagataaggatggtcggctc
caggttgtgccagtggcagccgaccagcgggaggtcacagtcccaggcctggagcccagt
aggaaatacaagttcctgctgtatggtctggcaggcaggaagcgaatgggccccatctct
gctgacggcaccacagcttccctggagaaggagccacctccccgcctgggggaactgacc
gtgaccgaggagacctccagctccctgcacctgtcctggacggtagcccagggccccttt
gaatccttcgtggtccagtacagggacacggatgggcagcccagggcagtgcctgtggcc
gcagaccagcacacggtcaccatagaggacctggagcctggcaagaagtacaagtttctg
ctctacgggctcgttgggggaaagcgcctgggcccggtctccgccctgggaatgacagcc
ccagaagaggacacgccagccccagagttagcctcagaggcccctgagcctcctgaagag
ccccgcctaggggtgctggccgtgaccgacacagccccagactccatgcgcctctcgtgg
agcgtggcccagggcccctttgattccttcgtggtccagtatgaggacacgaacgggcag
ccccaggccttgctcgtggacggtgaccagagcaagatcctcatctcaggcctggagccc
agcaccccctacaggttcctcctctatggcctccatgaagggaagcgcctggggcccctc
tcagctgagggcaccacagggccggcccctgctggtcagacctcaggggagccaggaccc
cgcctgtcccagctgtctgtgactgatgtgaccaccagttcactgaggctcaactgggag
gccccacctggggccttcgactccttcctgctccgctttggggttccatcaccaagcact
ctggagccgcatccccgtcctctgctgcagcgcgagctgatggtgccggggacgcggcac
tcagccgtgcttcgggacctgcgtccggggacgctgtacagcctgacactgtatgggctg
cgaggaccccacaaggccgatagcatccagggcactgctcgcaccctcagcccagttctg
gagagcccccgtgacctccaattcagtgaaatcagggagacctcagccaaggtcaactgg
atgcccccgccgtcccgggcggacagcttcaaagtctcctaccagctggcggacggaggg
gagccacagagcgtgcaggtggacggccgggcccggacccagaaactccaggggctgatc
ccaggcgctcgctacgaggtgaccgtggtctccgtccgaggctttgaggagagtgagcct
ctcacaggcttcctcaccacggttcctgacggccccacccagttgcgtgcactgaacttg
accgaggggttcgccgtgctgcactggaagcccccccagaatcccgtggacacctatgac
gtccaggtcacagcccctggggccccgcctctgcaggccgaggccccgggcagcgcggtg
gactaccccctgcatgacctcgtcctgcacaccaactacaccgccaccgtgcgcggcctg
cggggccccaacctcacttccccagccagcatcaccttcaccacagggctagaggcccct
cgggacttggaggccaaggaagtgaccccccgcaccgccctgctcacttggactgagccc
caagtccggcccacaggctacctgctcagcttcgacacccctggtggccggacccaggag
atcctgctcccgggaggggtcacatctcaccagctcctcggcctctttccctccaccctc
tacaatgcacggctccaggccatgtggggcgagagcctcctgccgcccgtgtccacctct
ttcaccaccggtgggctgcggatccccttccccagggactgcggggaggagatgcagaac
ggagccggtgcctccaggaccaccaccatcttcctcaacggcaaccgcgagcggcccctg
aacgtgttttgtgacatggagactgacgggggcggctggctggtgttccagcgccgcatg
gatggacagacagacttctggagggactgggaggactatgcccatggttttgggaacatc
tctggggagttctggctgggcaacgaggccctgcacagcctgacgcaggcaggtgactac
tccatgcgcgtggacctgcgggctggggacgaggctgtgttcgcccagtacgactccttc
cgcgtagactcggctgcggagtactaccgcctccacctggagggctaccacggcaccgca
ggggactccatgagctaccacagcggcagtgtcttctctgcccgtgatcgggaccccaac
aacttgctcatctcctgcgccgtctcctaccgaggggcctggtggtacaggaactgccac
tacgccaacctcaacgggctctacgggagcacagtggaccatcagggagtgagctggtac
tactggaagggcttcgagttctcggtgcccttcacggaaatgaagctgagaccaagaaac
tttcgctccccagccgggggaggctga

KEGG   Macaca mulatta (rhesus monkey): 717572
Entry
717572            CDS       T01028                                 

Gene name
THBS3
Definition
(RefSeq) thrombospondin-3 precursor
  KO
K04659  thrombospondin 2/3/4/5
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04145  Phagosome
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05144  Malaria
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    717572 (THBS3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    717572 (THBS3)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    717572 (THBS3)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    717572 (THBS3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    717572 (THBS3)
  09174 Infectious disease: parasitic
   05144 Malaria
    717572 (THBS3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:mcc04131]
    717572 (THBS3)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    717572 (THBS3)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    717572 (THBS3)
   04990 Domain-containing proteins not elsewhere classified [BR:mcc04990]
    717572 (THBS3)
Membrane trafficking [BR:mcc04131]
 Endocytosis
  Phagocytosis
   Opsonins
    717572 (THBS3)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   717572 (THBS3)
  Exosomal proteins of colorectal cancer cells
   717572 (THBS3)
  Exosomal proteins of bladder cancer cells
   717572 (THBS3)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   717572 (THBS3)
Domain-containing proteins not elsewhere classified [BR:mcc04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   717572 (THBS3)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA cEGF EGF_3 EGF Laminin_G_2 EGF_MSP1_1
Other DBs
NCBI-GeneID: 717572
NCBI-ProteinID: NP_001247443
Ensembl: ENSMMUG00000003902
UniProt: I0FUM9 F6ZVD5
LinkDB
Position
1
AA seq 956 aa
METQELRGALALLLLCSFASASQDLQVIDLLTVGESRQMVAVAEKIRTALLTAGDIYLLS
TFRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQREDGKVHAVNLQQAGLADG
RTHTVLLRLRGPSRPSPALHLYVDCKLGDQHAGLPALAPIPPAEVDGLEIRTGQKAYLRM
QGFVESMKIILGGSMARVGALSECPFQGDESIHSAVTNALHSILGEQTKALVTQLTLFNQ
ILVELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRSHCSPNPCFRGVDCMEVYEYPGYRC
GPCPPGLQGNGTHCSDINECVHADPCFPGSSCINTMPGFHCEACPRGYKGTQVSGVGIDY
ARASKQVCNDIDECNDGNNGGCDPNSICTNTVGSFKCGPCRLGFLGNQSQGCLPARTCHS
PAHSPCHVHAHCLFERNGAVSCQCNVGWAGNGNVCGTDTDIDGYPDQALPCMDNNKHCKQ
DNCLLTPNSGQEDADNDGVGDQCDDDADGDGIKNVEDNCRLFPNKDQQNSDTDSFGDACD
NCPNVPNNDQKDTDGNGEGDACDNDVDGDGIPNGLDNCPKVPNPLQTDRDEDGVGDACDS
CPEMSNPTQTDVDSDLVGDVCDTNEDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECD
GDDDNDGIPDYVPPGPDNCRLVPNPNQKDSDGNGVGDVCEDDFDNDAVVDPLDVCPESAE
VTLTDFRAYQTVVLDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGT
FHVNTVTDDDYAGFLFSYQDSGRFYVVMWKQTEQTYWQATPFRAVAQPGLQLKAVTSVSG
PGEHLRNALWHTGHTPDQVRLLWTDPRNVGWRDKTSYRWQLLHRPQVGYIRVKLYEGPQL
VADSGVIIDTSMRGGRLGVFCFSQENIIWSNLQYRCNDTVPEDFEPFRRQLLQGRV
NT seq 2871 nt   +upstreamnt  +downstreamnt
atggagacgcaggaacttcggggggccctggctcttctcctcctttgctctttcgcatct
gccagtcaggacctgcaggtaattgacctgctgactgtgggcgagtctcggcagatggta
gctgtggcagagaagatccggacagccctgctcactgctggggacatctacctcttgtcc
accttccgcctgccccccaagcagggtggtgtcctctttggcctctattctcgccaagac
aacacgcgatggctggaggcctctgttgtaggcaagatcaacaaagtactggtgcgatac
cagcgggaggatggcaaagtccacgccgtgaacctacagcaagcaggcctggccgatggt
cgcacacacacagttctcctgcgactccgaggtccctccagacccagccctgctctgcat
ctctatgtggactgcaaactgggtgaccaacatgcaggccttccagcactggcccccatt
cctccagcagaggtcgatgggctggagattaggactggacagaaggcatatttgaggatg
cagggctttgtggaatctatgaaaattattctgggtgggtccatggcccgggtaggagcc
ctgagtgagtgtccattccaaggggacgagtccatccacagtgcagtgaccaatgcactg
cactccattctaggggagcagaccaaggcgctggtcacccaactcaccctcttcaaccag
atcctggtggagctgcgggatgatatacgagaccaggtgaaggaaatgtccctgatccga
aacaccattatggagtgtcaggtgtgcggcttccatgagcagcgttcccactgcagcccc
aatccctgcttccgaggcgtggactgcatggaagtgtacgagtacccaggctaccgctgt
gggccctgtccccccggcctacagggcaacggcacccactgcagtgacatcaatgagtgt
gttcacgccgacccctgtttcccgggctccagctgcatcaacaccatgcccggcttccac
tgtgaggcctgtcctcgagggtacaagggcacacaggtgtctggtgtgggcattgactat
gcccgggccagcaaacaggtctgcaatgacatcgatgaatgcaacgatggcaacaatggt
ggctgtgacccaaactccatctgcaccaacactgtgggctctttcaagtgtggtccctgc
cgcctgggtttcctgggcaaccagagccagggctgcctcccagcccggacctgccacagc
ccagcccacagtccctgccacgtccatgctcattgtctctttgaacgcaatggtgcagta
tcctgccagtgtaacgtgggctgggctgggaatgggaacgtatgtgggactgacacagac
atcgatggctatccagaccaagcactgccctgcatggacaacaacaaacactgcaaacag
gacaactgccttttgacacccaactctgggcaggaagatgctgataatgatggtgtgggg
gaccagtgtgatgacgatgctgatggggatgggatcaagaatgttgaggacaactgccgg
ctgttccccaacaaagaccaacagaactcagatacagattcatttggtgatgcctgtgac
aattgccccaacgttcccaacaatgaccagaaggacacagatggcaatggggaaggagat
gcctgtgacaacgacgtggatggggatggcatccccaatggattggacaattgccctaaa
gtccccaacccactacagacagacagggatgaggacggggtgggagatgcttgcgacagc
tgtcctgaaatgagcaatcctacccagacagatgtagacagcgacttggtgggggatgtc
tgtgataccaatgaagacagtgatggggatgggcatcaggacaccaaggacaactgccca
cagctgccaaatagctcccagctggactcagacaacgatggacttggagatgagtgtgat
ggggatgatgacaatgatggcatcccagattacgtgcctcctggtcctgataactgccgc
ctggtacccaatcccaatcagaaggactcagatggcaatggcgttggtgatgtgtgtgag
gatgactttgacaatgatgctgtggtcgaccccctggatgtgtgtcccgaaagtgcagag
gtaacgcttacggattttcgggcctatcagaccgtcgtcctggatcctgagggtgatgct
cagattgacccaaactgggttgtgctcaaccagggcatggaaatcgttcagaccatgaac
agtgaccctggcttggcagttggatacacggccttcaatggtgtggactttgaaggcacc
ttccatgtgaacacagtgactgacgatgactacgcaggctttctcttcagttatcaagac
agtggccgattctacgtagtcatgtggaagcagactgagcagacctactggcaggctaca
ccctttcgggcggttgcccagcccgggctgcagctcaaggcagtgacatcagtgtctggc
ccaggtgagcacctccgaaatgccctgtggcatactggccacacccctgatcaggtgcga
ctactgtggacggacccacgaaatgtgggctggcgggacaagacctcctatcgctggcag
cttctgcaccggcctcaagttggctacattcgggtgaagctctatgagggaccccagctt
gtggcggattctggggtgattattgacacatccatgcgaggggggcgtcttggtgtattc
tgcttttcccaagaaaacatcatttggtccaatctccagtatcgatgcaatgacacagtg
cctgaggactttgagccattccggaggcagctgctccagggaagggtgtga

KEGG   Macaca mulatta (rhesus monkey): 719319
Entry
719319            CDS       T01028                                 

Gene name
COL6A6
Definition
(RefSeq) collagen alpha-6(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    719319 (COL6A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    719319 (COL6A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    719319 (COL6A6)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    719319 (COL6A6)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    719319 (COL6A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    719319 (COL6A6)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    719319 (COL6A6)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   719319 (COL6A6)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   719319 (COL6A6)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   719319 (COL6A6)
SSDB
Motif
Pfam: VWA VWA_2 Collagen Radical_SAM Integrin_beta
Other DBs
NCBI-GeneID: 719319
NCBI-ProteinID: XP_014987688
Ensembl: ENSMMUG00000040631
LinkDB
Position
2
AA seq 2262 aa
MKMLLILFLMITCSHISVSQDSGPKYADVVFLVDSSDRLGSRSFPFVKMFITKMINSLPI
EADKYRVALAQYSDKLHSEFHLSTFKGRSPMLNHLKKNFGFIGGSLQIGKALQEAHRTYF
SATTNGRDKKQFPPILVVLASSESEDDVEKASKALQKDGVKIISVGVQKASEENLKAMAT
SQFHFNLRTVRDLSMFSQNMTQIIKDVAKYKEGAVDDIFVEACQGPSMADVVFLLDMSIN
GSDENLDYLKGFLEESVSALDIKENCMRIGLVAYSNETKVINSLSMGINKSEVLQHIQNL
SPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREG
VTIFTLGIKGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFS
ERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKIFLSEVVGMFNIAPHKVRVGAVQ
YADSWDLEFEINKYSNKQDLGKAIENIRQLGGNTNTGAALNFTLSLLQKAKKQRGNKVPC
HLVVLTNGMSKDSILEPANRLREEHIRVYAIGIKEANQTQLREIAGEEKRVYYVHDFDAL
KDIRNQVVQEICAEEACKEMKADIMFLVDSSGSIGPENFSKMKTFMKNLVSKSQIGPDRV
QIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGSR
PSVRKFLILITDGEAQDIVKEPAVALRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVE
NFDILQRIEDDLVFGICSPREECKRIEVLDVVFVIDSSGSIDYDEYNIMKDFMIDIVRKA
DVGMNRVRFGALKYADDPEVLFYLDNFGTKPEVISVLQTDQAMGGNTYTAEALGFSDHMF
TEARGSRLNKGVPQVLIVITDGDSHDADKLNATAKALRDKGILVLAVGIADANPVELLAM
AGSSDKYFFVETFGGLKGIISDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQQTDFKKM
KEFLVSVVQDFDVSPNRVRIGAAQFSDAYRPEFPLGTFIGAKEISIQIENITQIFGNTHI
GAALRKVEHYFRPDMGSRINTGTPQVLLVLTDGQSQDEVAQAAEALRRRGIDIYSVGIGD
VDDQQLMQITGTAEKKLTVHNFDELKKVNKRIVRNICTTGGESNCFMDVVVGFDVSTQEK
GQTLLEGQPWMETYLQDILRAISSLNGVSCEVGTETQVSVAFQVTNAMEKYSPKFEIYSE
NILNSLKDITVKGPSFLNANLLDSLWDTFQNKSAARGKVVLLFSDGLDDDAEKLEQKSDE
LRKEGLNALITVALDGPADSSDLADLPYIEFGKGFEYRTQLSIGMRDLGSRLSKQLVNIA
ERTCCCLFCKCIGGDGTMGDPGPPGKRGPPGFKGSEGYLGEEGIAGERGAPGPVGEQGTK
GCYGTKGPKGNRGLNGQEGEAGESGIDGLNGEQGDYGLPGRKGEKGDEGSQGSPGKRGIP
GDHGAKGLRGDPGTPGVDNSIEGPTGLKGERGREGRRGWPGPPGTPGSRRRTAAHGRRGH
TGPQGIAGIPGPDGLEGSLGLKGPQGPRGEAGVKGEKGGVGSKGPQGPPGPGGEAGNQGH
LGSQGNKGEPGDLGEKGAVGFPGPRGLQGNDGSPGYGSVGRKGAKGQGGFPGESGPKGEI
GDPGGPGETGLKGARGKTISAGLPGEMGSPGEPGPPGRKGVKGAKGLASFSTCELIQYVQ
DRSPGRRGKPECPVHPTELVFALDQSRDVTEREFERMKEMMAFLVKDIKVRENSCPVGAR
IAILSYNSHARHLVRFSDAYKKNQLLREIEAIPYERSSSSREMGRAMRFISRNVFKRTLP
GAHTRRIATFFSSGQSADSQSITTAAMEFSALEIIPVVITFSNVPSVRRAFAIDDTGTFQ
VIVVPSGADYTPALERLQRCTFCYDVCKPDASCDQARPPPVQSYMDAAFLLDGSRNVGSA
EFEDIRAFLGALLDHFEITPEPETSVTGDRVALLSHAPPDFLPNTQKSPVRAEFNLTTYS
SKRLMKRHVDESVKQLNGDAFIGHALQWTLDNVFLTAPNLRRNKVIFVISAGETSHLDGE
ILKKESLRAKCQGYALFVFSLGPTWDDKELEDLASHPLDHHLVQLGRIHKPDHSYGVKFV
KTFINSIRRAINRYPPINLKIKCNRLNPADPRQPPRHFRSFVPGPYKATLKEDILQKAKF
FQDQKYLSRVARSGRDDAIQNFIRNTSHTFKNGRRVTKSAPK
NT seq 6789 nt   +upstreamnt  +downstreamnt
atgaaaatgttgctaattttgttcctcatgataacttgttcccatatttctgtgagccaa
gattctggccctaagtatgcagatgtcgtgtttctggtggacagctccgatcgcctggga
tccaggtccttcccatttgtgaaaatgttcatcaccaaaatgatcaacagtctccccata
gaggctgacaaataccgcgtggccctcgcccagtacagtgataaacttcacagtgaattc
cacctaagcaccttcaaaggcaggagccccatgctgaaccacctcaagaagaactttgga
ttcattggcgggtccctgcagataggaaaggctcttcaggaggctcacaggacttatttc
tctgcaaccacaaatgggagagacaagaaacagtttcccccaattctagtggtcctggct
tcgtctgagtctgaggatgatgtggaaaaggcttcgaaggccctgcagaaagacggagtg
aaaatcatctccgtaggggtgcagaaagcttctgaggaaaacctgaaggccatggccacg
tctcagtttcatttcaaccttcggacagtcagagacctcagcatgttttcccaaaacatg
acacagatcatcaaggatgtagcaaagtacaaggagggagcagttgatgacatctttgta
gaagcttgccaaggcccttctatggccgatgttgtgttcctattggatatgtcaatcaat
ggaagtgacgagaacttagactatcttaaaggattcttggaagaaagtgtatctgccctt
gacataaaggaaaattgcatgaggattggccttgtggcctatagcaatgagacaaaagtg
ataaattcactgagcatgggcataaataagtcagaggttctccagcatatacagaacctt
tctccccggactgggaaggcctataccggagctgccatcaaaaagctcaggaaggaagtc
tttagtgcacggaatggcagtcggaagaatcagggggtgccccagattgccgtgctggtg
acccaccgagattcggaagacaacgtgacaaaagcagctgtgaacctccgacgggagggt
gtgaccatcttcaccctgggcataaagggggccagtgacacccagttggaaaagatagca
tcccaccctgctgagcagtatgtctccaaactgaagaccttcgctgacctggctgctcac
aaccagacgtttctgaagaagctgcggaaccaaataacacacacagtctctgtcttttca
gagaggaccgaaacgctcaaatctggttgtgtggacactgaggaagcagacatctatctg
ctcatcgatggctcagggagcacccaggccacagatttccatgaaatgaagatcttcctg
tcagaggtggtagggatgttcaacattgccccccataaggtgcgggttggggccgttcag
tatgctgacagctgggacttggaattcgagatcaataaatactccaacaagcaggatttg
ggaaaggccattgagaatatcaggcagctgggtgggaatacaaatacaggtgcagcactg
aatttcacactgagtctgttgcaaaaagcaaagaaacagcgaggaaacaaagttccatgt
caccttgttgtcctgacaaatggcatgtccaaggatagcatcttggagcctgcaaacaga
ctgagagaagagcacatccgagtttatgctatcgggattaaggaggccaaccaaacacag
ctgcgagaaattgcaggagaggaaaagagagtgtattacgtgcatgactttgatgcattg
aaagacataagaaaccaagttgttcaagaaatctgtgctgaagaagcttgcaaagagatg
aaagctgacatcatgtttctggtggacagttctggaagtataggacctgaaaacttcagc
aaaatgaaaacgtttatgaaaaacctggtgagcaagtctcagattggaccagatcgggtg
caaattggtgtagtccagttcagcgacatcaataaggaggagtttcagctcaacagattc
atgtcccaaagcgacatttcaaatgcaatagaccaaatggctcacattggacaaaccacc
ctgactggtagtgccctgagctttgtgtctcagtacttcagccccaccaagggctcccgg
cccagtgtcagaaagtttctcatccttatcacggatggtgaagctcaggacatagtaaag
gaaccagcggtagcgcttcggcaagaaggtgtaatcatctattctgtgggagtgtttggc
tccaatgtcacccagcttgaggagatcagcgggaggcccgagatggttttttatgttgag
aattttgacattctgcagcgcattgaagatgatcttgtttttggaatatgcagcccccgt
gaagaatgcaagcggattgaagttttagatgttgtgtttgtcattgatagctctggcagc
attgactatgatgagtataatatcatgaaggattttatgattgacatagtgagaaaagct
gatgtgggcatgaatcgggtccggtttggggctctgaagtatgctgatgacccagaggtg
ctgttttatctggataactttggcacaaaaccggaggtaatttcagtgctccagactgac
caagccatgggtggcaatacttatactgctgaggcactgggcttctcagaccacatgttc
actgaagcccggggcagtcgcctgaacaagggggtcccccaagtcctcattgtgatcacc
gatggggattcccatgatgctgataaactcaatgccacggcaaaggccttgcgagacaaa
ggcattcttgtcctggctgtgggtattgctgatgccaatcctgtggaactgttagccatg
gcaggatcaagcgacaagtacttcttcgtggagacttttggaggtctgaagggaataatt
tcagatgtgacagccagtgtctgcaactcttcaaaagtggattgtgaaattgacaaagta
gatctggttttccttatggatggttcaactagcattcagcaaactgacttcaagaaaatg
aaggaatttctggtatccgttgttcaagactttgacgtcagccccaacagagtgcgtata
ggagcggcccagtttagcgatgcctatcgcccagagtttccactgggaactttcataggt
gcaaaagagatatcaattcagattgaaaacatcacgcagatctttggaaacacacacatc
ggtgccgcactcaggaaggtggaacattacttcaggccagacatgggcagcaggataaat
acaggtaccccacaggtgttgctggtccttacagatggccagtcccaagacgaggtggcc
caggccgcggaagccctgagacgcagaggtatcgacatctactctgtgggcattggggat
gtggatgaccagcagctcatgcagatcaccgggactgcagagaaaaaactgacagtgcac
aactttgatgaactgaagaaggtcaataaaaggatcgttcgcaacatctgtaccacaggg
ggtgaaagcaactgtttcatggatgttgtggtgggatttgatgtctcaactcaggagaaa
gggcagactttgcttgaaggtcagccttggatggaaacctaccttcaagacatcttacgt
gccatcagctccctcaatggagtaagctgtgaggtgggcacagagactcaggtcagtgtg
gcttttcaagtgacaaatgccatggaaaaatattctcccaagtttgagatctacagtgaa
aacatactgaatagcttgaaggatataacagttaaaggaccatcttttctcaatgcaaac
ctcttggattctctatgggatacatttcagaataaatcggctgctcgaggaaaggtggtc
cttttattttcagatggattggatgatgatgctgagaaacttgaacaaaaatctgatgaa
cttagaaaagaaggcctgaatgctctcataactgttgctctggatggacctgccgattca
agtgacttggctgatcttccctatattgaatttgggaaaggatttgagtacaggacgcag
ctctctattggcatgagagatcttgggagccgtctatcaaagcaactggtcaacattgct
gaaagaacatgctgctgtttgttctgcaagtgcattggaggagatggcacaatgggagat
cctggaccaccagggaaaaggggacctccaggttttaaaggcagtgaaggttacctggga
gaggagggaatcgctggagaaagaggagcccctggaccagtgggagagcaaggtactaag
ggatgctatggcaccaaaggtcctaagggaaacaggggactaaatggacaggagggagaa
gctggggaaagtggaattgatggactaaatggagaacagggtgattatggtcttcctgga
agaaaaggagaaaagggtgatgagggatcccagggaagcccagggaagagagggattcct
ggtgaccatggagcaaagggcctgcgaggggatcccgggactcctggagttgacaatagt
atagaaggacccacaggcttgaaaggagaacgtggaagagaaggtagaagaggctggcca
ggcccccccgggacaccaggctccagaagaaggacagcagctcatggcagaaggggacat
acaggcccacagggaatagcaggcatcccaggaccagatggacttgaaggctccctggga
cttaagggccctcagggcccaagaggagaggctggtgtgaaaggagaaaaaggaggtgtg
ggaagtaaaggtccccagggacctccaggacccggaggagaggcagggaatcaaggccat
ttgggaagccaaggaaataaaggagaacctggagatctgggagaaaaaggagctgttggc
tttcctggtcctcgtggcttgcagggcaatgatggcagtccaggttatggtagtgtcgga
cgcaaaggagcaaagggacaaggaggattccctggagaaagtggacctaagggtgagatt
ggggaccctggtggtccaggagagactgggctgaagggagctagaggcaaaacgatatct
gctgggcttccaggagagatgggatcccctggggaaccaggacctcctggacgtaagggt
gtgaaaggagccaaaggcttggcttcattttctacatgtgagctcattcagtatgtgcaa
gaccgcagtcctggcagacgtggaaaacccgaatgcccagtgcacccaaccgagttggtg
tttgccctggaccagtcccgggatgtcactgagcgggaatttgagaggatgaaggagatg
atggctttcctggtgaaagacattaaagtccgggagaacagctgccccgtgggagcgcgc
atcgccatcctctcctataactcccacgccaggcacctcgtgcgcttctcagatgcctac
aagaagaatcaacttctcagggaaattgaagctattccttatgagagatcctcttccagc
cgggagatgggcagagcaatgcggtttatttccaggaatgtcttcaagcggacgcttccg
ggggcgcacacgagaagaattgccacatttttcagcagcggtcagtctgcggattcccag
tccatcactacggctgccatggagttcagcgcgcttgaaatcattcccgtggtgatcact
ttcagcaacgtgccctcggtcaggcgtgcatttgcgattgacgacactggcacatttcaa
gtaatagtggttccctctggagctgactacacaccagcattagagagactccagcggtgc
actttctgctatgatgtatgcaagccagatgcttcttgtgatcaagccagaccaccccct
gtgcagtcttacatggatgctgctttccttctggatggctcccggaacgtgggaagtgct
gaatttgaagacataagagctttcctgggagcactattagatcactttgaaatcacccca
gagccagagacttctgtcactggagaccgggtggccctattgagccatgctccccccgac
ttcctacccaacactcagaagagtccagttagagctgagttcaatctcaccacctatagc
agtaagcgcctcatgaagaggcacgtggatgagtcagttaaacaactaaatggagatgct
tttattggtcatgccttacagtggactctggacaatgtctttttaactgcacccaatctg
agaagaaacaaagtcatatttgtgatatctgctggggaaaccagccacttagatggggaa
atcttaaagaaggaatctttgcgagccaaatgtcaagggtatgccctatttgtgttttcc
cttggccctacttgggatgacaaggaactggaggatctcgctagccaccctttggatcac
cacctggtccagcttggccgaattcataaacctgaccacagttatggtgtgaagtttgtg
aagacctttataaactcaatcaggcgtgcaatcaacagatacccaccaataaacttaaaa
ataaagtgcaatagacttaaccctgcagatccaaggcagcccccgcgacatttccgaagc
tttgttcctggaccatataaagctaccctcaaagaagatatattacagaaggcaaaattc
tttcaagatcaaaaatatctttcaagagtagcaagaagtggcagagatgatgctattcaa
aattttatcagaaacacctcccatacctttaagaatggaagaagggtgacaaaaagtgct
cccaaatga

KEGG   Macaca mulatta (rhesus monkey): 719331
Entry
719331            CDS       T01028                                 

Gene name
COL6A5
Definition
(RefSeq) collagen alpha-5(VI) chain
  KO
K06238  collagen type VI alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    719331 (COL6A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    719331 (COL6A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    719331 (COL6A5)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    719331 (COL6A5)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    719331 (COL6A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    719331 (COL6A5)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    719331 (COL6A5)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   719331 (COL6A5)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   719331 (COL6A5)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   719331 (COL6A5)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 VWA_CoxE Copine vWA-TerF-like
Other DBs
NCBI-GeneID: 719331
NCBI-ProteinID: XP_014987694
Ensembl: ENSMMUG00000020377
LinkDB
Position
2
AA seq 2613 aa
MKILLIIFVLIIWTETLADQSPGPGPEYADVVFLVDSSDHLGPKSFPFVKTFISKMINSL
PIETNKYRVALAQYSDEFHSEFHLNTFKGRSPMLNHLKMNFQFIGGSLQIGKALQEAHRT
YFSVPTNGRDRKQFPPILVVLASAESEDEVEEASKALQKDGVKIISVGVQKASEENLKAM
ATSHFHFNLRTIRDLSTFSQSMTQIIKDVTKYKEGAIDADTQVPFPISCQKDSLADLVFL
VDESLGTRGNLRHLQTFLENITSSMDVKENCMRLGLISYSSSAKTISFLKSSTTQSEFKQ
QIKNLSIQVGKSNTGAAIEHMRRDGFSESYGSRRAQGVPQIAVLVTHRPSDDKVHDAALN
LRLEDVTMFALSIQGANNTQLEEIVSYPPEQTISMLKSYADLETYSTKFLKKLQNEIWSQ
ISTYAEQRNLDKTGCVDTKEADIHFLIDGSSSIQEKEFEQIKRFMLEVTEMFSIGPDKVR
VGVVQYSDNAEVEFYISDYSNDIGLRKAIFNIKQLTGRTYTGKALDYILQIIKNGSKDRM
SKVPCYLIVLTDGMSMDRVVEPGKRLRAEQITIHAVGIGAANKIELQEIAGKEERVHFGQ
NFDALKSIKNEVVREICTEKGCEDMKADIMFLVDSSWSIGNENFRKMKIFMKNLLTKIQI
GADKTQIGVVQFSDTTKEEFQLNRYFTRQEISDAIDRMSLINKGTLTGKALNFVGQYFTH
SKGARLGAKKFLILITDGVARDYVRDPARILRGKNVTIFSVGVYNANRSQLEEISGDGSL
VFHVENFDHLKALERKLVFRVCALHDCKRITGLDVVFVLDHSGSIKKQYQDHMINLTIHL
VKKADVGRNGVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGSTYTAKALKHA
NALFTEEHGSRIKENVKQMLIVITDGKSHDHDQLNDTASELRDKGITILAVGVGKANQKE
LEGMAGNKNNTIYVDNFDKLKDVFTLVQESMCTEAPEVCSLQEADVIFLCDGSDRVSNSD
FVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKNQIQNVSKSSG
FPRIDFALKKVSNMFNLYAGGRRNAGVPQTLVVITSGDPHYNVADAVKTLKDLGICVLVL
GIGDVSKEQLLLITGNSEKIITFQDFDKLKNVDVRKRIVREICQSCGKTNCFMDIVVGFD
ISTHMQGQPLFQGHPQLESYLPGILEDISSIRGVSCGAGTEVQVSLAFKVNNDQEFPAKF
QIYQKAVFDSLLQVNVAGPTHLNAQFLQSLWDTFKNKSASRGQVLLIFSDGLQSESNIML
ENQSDRLREAGLDALLVVSLNTTAHHEFSSFEFGKGFNYRTHLTIGMRELGKKLSQYLGN
IAERTCCYTFCKCPGIPGPHGTRGPQAMKGSQGLKGSKGHRGEDGDPGAQGDAGPQGDRG
IAGCPGEQGEKGPRGFSGPKGGHGDDGIDGLDGEEGSHGFPGIKGEKGDPGSQGSPGSRG
APGQYGEKGFPGDPGNPGQNSNIKGQKGSKGEQGRQGRTGQKGVQGSPSSTGSRGREGRR
GLQGVSGEPGNPGPTGTLGAEGLQGPQGSQGNPGRKGEKGSQGHKGPQGSPGLMGAKGST
GRPGLLGKKGEPGLLGDPGPVGQTGQRGRQGDSGIPGYGQMGRKGIKGPRGFPGDVGQKG
DIGDPGIPGGPGPKGFRGLALTVGLRGEEGSQGLPGPPGQRGIKGMAGQPVYSQCDLIRF
LRERSPCWKEKCPVYPTELVFALDNSYDVTEESFNKTRDIITSIVNDLNIRENNCLVGAR
VAVVSYNSDTSYLIRWSDYNRKKQLLQQLSQIKYQDTTEPRDVGNAMRFVARNVFKRTYA
EANMRRVAVFFSNGQMAGRSSIITATMEFSALDISPTVFAFDERVFLEAFGFDNTGTFQV
IPVPPNGENQTLERLRRCTLCYDKCFPNACIQEAFLPEDSYMDVAFLIDNSQNIAKDEFK
AVKALVSSVIDNFNIASDPLISDFGDRIALLSYSPWESSRTKMGTVKTEFDFTTYNNQLL
MKNHIQTSFQQLNGEATIGDALMWTTENLFPGTPYLRKHKVIFVVSAGENYERKEFVKMM
ALRAKCQGYVIFVISLGPTCKDDVEELASYPLDQHLIQLGRIHKPDLNYILKFLKPFLYS
VRWGFNQYPPPMLEDACSLINLGENDRNDGFQFTTELQEDFSGENDFIGQELNFGRESSF
VKIEDNGSDYLVYLQSQMFEPQKLMINYEKDKKSAEIASLTSGHENYGRKEEPDHTNEPG
DVSLQEYYMDVAFLIDASQRVGSDEFKEVKAFITSVLDYFHIAPDPLTSTLGDRVAVLSY
SPPGYMPNTEDCPVYLEFDLVTYNSIHQMKHHLQDSQQLNGDVFIGHALQWTIDNVFVGT
PNLRKNKVIFVVSAGETNQLDKDVLRNVSLRVKCQGYSIFVFSFGPKHNDKELEELASHP
LDHHLVQLGRTHKPDWNYIIKFVKPFVHLIRRAINKYPTKDMKATCVNITSPNPENGGTE
NTVLLIPGIYEIKTENGELFDEFDSQVQHFLVLGNNHSNGSGTATDLMQKLYLLFSTEKL
ATKDKEKAHLEEISALVVDKQQEKEDGEDTRSS
NT seq 7842 nt   +upstreamnt  +downstreamnt
atgaagatcctactaattatatttgtcctaatcatttggactgaaacattggcagaccag
agcccagggccaggccctgagtatgcagatgttgtgtttctggtggacagctccgatcac
ctgggacctaagtcgttcccatttgtcaaaacgttcatcagcaaaatgatcaacagtctc
cccatagagaccaacaaataccgtgtagccctggcccagtacagcgatgagtttcacagt
gaattccatctgaacacctttaaaggcaggagccccatgctgaaccacctcaagatgaac
tttcagttcattggcgggtccctgcagataggaaaggctcttcaggaggctcacaggacc
tatttctctgtacccacaaatgggagagacaggaaacaatttcccccaattttggtggtc
ctggcttcggctgagtctgaggatgaagtggaagaggcttcaaaagccctgcagaaagac
ggggtgaaaattatctccgtgggggtgcagaaagcttctgaggaaaatctgaaggccatg
gccacgtctcatttccatttcaaccttcggacaatcagagacctcagcacattttcccaa
agcatgacacagatcatcaaggatgtaaccaagtacaaggagggagccatcgatgctgat
acgcaagttcccttccccatatcctgtcagaaagactcacttgctgacctcgtgttcctg
gtggatgagtcacttgggaccagaggaaatttaaggcatcttcagaccttcctggagaac
attaccagctccatggatgtgaaggaaaattgcatgcgacttggactgataagttacagc
agtagtgccaagactatttcttttcttaaatcaagcacaacccaatctgaatttaagcag
caaatcaagaatctttccatccaagttgggaaatccaatacaggggctgccattgagcac
atgagaagagatggcttctcagagtcatatggcagcagaagagcacagggagtgcctcag
attgcagtcctggtcacccacagaccatcagatgataaggtgcatgatgctgcactgaac
cttcggctggaagatgttacaatgtttgccttgagcatccaaggggctaacaatacccag
ttagaagaaatagtgtcttaccctccagaacagacaatttccatgctgaagtcctatgca
gacttagaaacttatagtacaaagttcctgaaaaaactccagaatgaaatatggtcccaa
atttctacttatgctgaacaaaggaatcttgataaaactggctgtgtggataccaaagag
gctgatatccatttcctcattgatggctcaagcagcatccaggagaaagagtttgagcaa
atcaagagatttatgttggaagtgacagaaatgtttagcattggcccagacaaagtccga
gttggagttgtgcagtattcagataatgcagaagtagaattttatatcagtgactattct
aatgacattggcttaagaaaggctatttttaacattaagcaactaactggtagaacttat
actgggaaagctctggattacatactacaaataataaaaaatggaagtaaggatagaatg
agcaaggttccctgttacctcattgtgttgactgatgggatgtccatggacagagtcgtg
gaacctggtaagagactaagggctgagcaaatcaccattcacgcagttggcattggggca
gctaataaaatagaactgcaagaaattgctgggaaagaagaaagggttcactttgggcag
aactttgatgctttgaaaagcataaaaaatgaagttgttcgtgaaatctgcactgaaaaa
ggatgtgaagacatgaaggctgacatcatgtttctggtggacagttcttggagtatagga
aatgaaaattttaggaaaatgaaaatcttcatgaaaaacctgttaactaagattcaaatt
ggtgcagacaaaacccagattggcgttgttcagttcagtgatacaactaaggaagagttc
cagctcaatagatactttacacggcaagaaatttctgatgcaatagatagaatgtctctc
atcaataaaggcactttaactggaaaggcactaaattttgtaggtcaatacttcacccac
tccaagggggcccgtttgggggccaaaaagtttctcatcctcatcacagatggagtagct
cgggattatgtgagagatcctgctagaattcttcggggcaaaaatgtgaccatcttctct
gtaggagtatacaatgccaatagatctcagctagaagagatcagtggggatggcagccta
gtttttcacgttgagaacttcgatcatctgaaggcactagaaaggaaacttgtctttcgt
gtgtgtgctctccatgattgtaaaaggattacaggactagacgttgtgtttgtgctggat
cattcaggtagtataaaaaaacaatatcaagatcacatgattaacctaactatccatctg
gtgaagaaagcagatgttggcagaaacggagttcagtttggagccctcaaatactctgac
cagcctaacatccttttctaccttaatacgtactcgaacagatcagcaataattgaaaat
ctgaggatgcgcagggacactggagggagcacctacactgccaaggctctcaagcatgca
aatgccctgtttacagaggagcatggcagccgcatcaaggaaaatgtgaagcagatgctg
attgtcatcactgatgggaaatctcatgaccatgatcagctcaatgacacagcatcagaa
ctgagagacaagggcatcaccatccttgcagtgggtgtaggaaaggccaaccaaaaggaa
cttgagggtatggcagggaataaaaacaatactatctatgtagataattttgataaactg
aaagatgttttcacacttgttcaagaaagtatgtgtactgaagcaccagaagtctgtagt
cttcaggaagctgatgtgattttcctttgcgatggctctgacagggtatctaattcagat
tttgttaccatgacaactttcttgtcagacttaattgataattttgacattcaatctcaa
agaatgaaaattggtatggctcaatttggaagcaactaccagagtattattgagttgaaa
aactctctgactaaaacccagtggaagaatcaaattcagaatgtctccaagagcagtggg
tttccgagaattgactttgcccttaaaaaagtgagcaacatgtttaatctatatgctggt
gggagaagaaatgctggtgtcccccaaactttggttgttatcacatctggagatcctcac
tataatgtggcagatgcagtaaaaaccctgaaggaccttggaatttgtgtcctggttttg
ggcataggagatgtttctaaggaacagcttctgctaataacaggcaattctgaaaaaata
atcacttttcaagactttgataagttaaagaatgtggatgtgagaaaaaggatcgtccgt
gaaatttgccagagctgcgggaaaaccaattgctttatggacatagtggttgggttcgac
atctccactcatatgcaggggcagcctttgttccagggccacccccagctggaatcctac
ctcccaggcatcttagaagacatcagctctatcaggggtgtgagctgtggggctggcaca
gaggtgcaggtgagtttggcgtttaaggtgaacaatgatcaagaattccctgccaagttc
cagatctatcagaaagcagtgtttgacagcttgctgcaagtcaacgtcgctgggccaact
catctgaacgcacagttcttgcagtctctttgggacacatttaagaataaatctgcatcc
cggggccaggtgctgcttattttttcagatggtctccagagtgaaagcaacataatgctt
gaaaatcaatcagacaggctcagagaagcaggacttgatgctctgctggtagtgtcgctt
aacacaactgctcatcatgagttttctagctttgaatttggaaaaggattcaattaccgg
actcatctgactattggaatgagagagctgggcaaaaaactatcacagtacctgggaaac
attgcagaaaggacttgctgctatacattctgcaaatgtccaggaattccaggacctcat
gggacccgaggaccacaagccatgaagggttctcaaggtctgaaaggcagcaaaggacac
aggggagaggacggagaccccggagcacaaggagacgcaggaccccaaggagatagaggg
attgcaggatgtccaggggagcagggtgagaagggacccagaggattttctggacctaag
ggaggacatggagacgatgggattgatggacttgacggggaagagggctctcatggattt
cctggaataaaaggagaaaaaggtgatccaggatctcagggcagcccaggttccagaggt
gctcctgggcagtatggagagaagggcttcccaggagatccaggtaatccaggacaaaac
agtaacatcaaaggacaaaagggctccaaaggagaacaaggaagacaaggtagaactgga
cagaaaggggtgcaaggcagtcctagttccacaggaagcaggggaagagaaggtcgaagg
ggactccaaggtgtctcaggagaaccaggaaatcctggacctacaggcacattgggagct
gaaggattacaaggcccacaggggtcacagggaaatcctggcaggaaaggagaaaaagga
agccaagggcacaaaggacctcagggttctcctgggctaatgggagccaaagggagcact
ggaagacctggacttttggggaaaaaaggagagcctggacttcttggagatccagggcca
gtggggcaaactgggcagcgaggaagacagggagattctggcatcccaggctatggtcag
atgggacgaaaaggaataaagggcccaagaggattccctggagatgtggggcagaagggt
gatattggtgatcctggaattcctgggggacctggacccaaaggatttaggggactagca
ctcactgtaggcttgagaggtgaagagggatctcaaggactcccaggccctcctggacag
agaggcattaaaggcatggcagggcagcctgtatattctcaatgtgatctgatccggttt
ttacgggaacgtagtccttgctggaaagaaaaatgtccggtatatccaacagagttagta
tttgctctggacaattcctatgatgtcacagaagagagctttaataaaacacgggacatc
atcacttccattgtcaatgaccttaacatcagggaaaataattgtcttgtgggagcaaga
gttgccgtggtttcctacaactcagacaccagctatctcatccgttggtctgactacaat
aggaagaagcaactcctccagcagctttcccaaataaaatatcaagacaccacagagccc
cgagatgttggtaatgcaatgaggtttgtggcccgcaacgtgttcaaacggacatatgca
gaagccaacatgaggagagttgctgtgttttttagcaatggtcaaatggccggtaggtcg
tccatcatcacggccaccatggagttcagtgccctggatatcagtccaacagtctttgct
tttgatgagagagttttccttgaagcttttgggtttgacaacactggaacatttcaggtg
attccggtccctccaaatggggaaaatcaaacattagaaagacttcggcgctgtacactt
tgctatgataaatgttttccaaatgcttgcattcaagaggctttcttacctgaagattca
tacatggatgtagccttcctcatagacaattctcagaatatagcaaaggatgagtttaag
gctgtgaaagccttggtgagctcagtgattgacaacttcaacattgcttcagacccttta
atctcagactttggtgataggattgccttattgagctattctccttgggaaagttccagg
acaaagatgggtacagtaaaaacagagtttgattttaccacttataacaaccaactccta
atgaagaatcacatccagacttccttccaacagctaaatggagaagccacaattggtgat
gccctaatgtggaccactgaaaatctctttccaggaacaccctatctaagaaaacacaag
gtcatctttgtggtctcagctggagaaaattatgagagaaaagaatttgtaaaaatgatg
gctttgagggctaagtgtcaaggctacgtcatatttgtgatttctctgggccctacatgt
aaggatgacgtggaggagttagccagctacccacttgatcagcacctgatacagcttggg
agaatacataaaccagatctgaattatattctgaagttcttaaagccatttttatactca
gtcaggtggggattcaatcagtacccaccaccgatgcttgaggatgcctgtagtctcatc
aatttaggagagaatgatcgaaatgatggtttccaatttactactgagctacaagaggat
ttttcaggagagaatgacttcattggccaagaattaaattttgggagagaatcatctttt
gtaaagatagaggacaatggaagtgactatttggtttaccttcaaagccaaatgtttgag
ccacaaaaattaatgatcaattatgaaaaagataaaaaatctgcagaaattgcaagtctc
acttctggacatgaaaattatggcagaaaagaagaaccagatcatactaatgaacctgga
gatgtctctcttcaagaatattacatggatgtggctttccttatagatgcttcccaaaga
gtaggaagtgatgagtttaaggaagtaaaagcttttataacctcagtgcttgattacttt
cacatcgcccctgatccactgacctccaccttaggagacagggttgctgtcctgagctac
tctcccccaggctatatgcctaacactgaagattgccctgtctacctggaatttgatttg
gttacttataacagtatacaccaaatgaaacatcatctccaagactctcaacagctcaat
ggagatgtttttattggccatgccttgcagtggacaattgacaatgtctttgtaggaacc
cccaatctaaggaaaaacaaagttatctttgtagtatctgctggcgaaaccaaccagtta
gacaaagatgtcttaaggaatgtgtctctgagagttaagtgtcaaggctactccatattt
gtgttttcctttggccctaaacacaatgacaaagaattggaagaattagccagccaccct
ctggatcatcacttggtccaacttggccgaacccacaagccagattggaactatatcatc
aagtttgtcaagccatttgtccatttaatcagacgtgccatcaacaaatatcccaccaaa
gatatgaaagccacatgtgttaacatcacctctcccaacccagagaacggtggcacagaa
aacactgtattacttattcctgggatatatgaaataaagacagaaaatggtgagctgttt
gatgaatttgattcccaggtgcaacatttccttgtattagggaacaatcatagtaatggt
tctgggacagctactgatttgatgcagaagttatacctgctcttttcaactgagaaattg
gcaacgaaagacaaggaaaaagcacatttagaagaaatttcagctctagtggttgataaa
cagcaagaaaaagaagatggtgaagacacaaggtcatcatag

KEGG   Macaca mulatta (rhesus monkey): 719943
Entry
719943            CDS       T01028                                 

Gene name
COL9A3
Definition
(RefSeq) collagen alpha-3(IX) chain isoform X2
  KO
K08131  collagen type IX alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    719943 (COL9A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    719943 (COL9A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    719943 (COL9A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    719943 (COL9A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    719943 (COL9A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:mcc00535]
    719943 (COL9A3)
Proteoglycans [BR:mcc00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   719943 (COL9A3)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 719943
NCBI-ProteinID: XP_015004116
LinkDB
Position
10
AA seq 684 aa
MARPPALAPLLLLLLLGELLAAAGAQKVGLPGPPGPPGPPGKPGQDGIDGETGPPGLPGP
PGPKGAPGKPGKSGEAGLPGLPGVDGLTGRDGPPGPKGAPGERGSLGPPGPPGLGGKGLP
GPPGEAGVSGPPGGIGLRGPPGPSGLPGLPGPPGPPGPPGHPGVLPEGATDLQCPSICPP
GPPGPPGMPGFKGPTGYKGEQGEVGKDGEKGDPGPPGPAGLPGSVGLQGPRGLQGLPGPL
GPPGDRGPIGFRGPPGIPGAPGKAGDRGDRGPEGFRGPKGDLGRPGPKGIPGVAGPSGEP
GMPGKDGQNGVPGLDGQKGEAGRSGAPGEKGPNGLPGLPGRAGSKGEKGERGRAGELGEA
GPSGEPGVPGDAGMPGERGEAGHRGSAGALGPQGPPGAPGIRGFQGQKGSMGEPGLPGPQ
GFRGDVGDRGPGGTAGPKGDQGIAGSDGLPGDKGELGASGPVGPKGESGSRGELGPKGIQ
GPNGTSGVQGVPGPPGPLGLQGVQGVPGITGKPGVPGKEASEQRIRELCGGLISEQIAQL
ATHLRKPLAPGSIGRPGPAGPPGPPGPPGSIGHPGARGPPGYRGPTGELGDPGPRGSQGD
RGDKGVAGAGLDGPDGDQGPQGPQGVPGTSKDGQDGAPGEPGPPGDPGLPGAIGAQGTPG
ICDTSACQGAVLGGVGEKSGSRSS
NT seq 2055 nt   +upstreamnt  +downstreamnt
atggcccggccgcccgcgctcgccccgctgctgctcctgctcctgctcggggagctcctg
gcggccgccggggcgcagaaagtgggactcccaggccctccaggccccccagggccgccg
gggaagcccggccaggacggcattgacggcgaaactggacctccaggtctgcctgggccc
ccgggaccaaagggggccccaggaaagccggggaaatcaggagaggccgggctgccggga
ctgccgggtgtggacggtctgactgggcgagatggaccccctggacccaagggtgcccct
ggggaacggggaagtctgggacccccggggccgcccgggctggggggcaaaggcctccct
ggaccccccggagaggcaggagtgagcggccccccaggtgggatcggcctccgcggcccc
ccgggaccgtctggactccccggcctccctggccccccgggacctcccggaccccctgga
cacccaggagtcctccctgaaggtgccactgaccttcagtgcccaagtatctgcccccca
ggtcccccagggccccctggaatgccggggttcaagggacccactggctacaaaggcgag
cagggggaagtcggcaaggacggcgagaagggcgaccctggcccccctgggcccgccggc
ctcccgggcagcgtggggctgcagggtccgcgggggttacaaggactgccagggccactc
ggaccccctggggacaggggtcccatcgggttccgagggccacctgggatcccaggagcg
cctgggaaagcgggtgaccgaggcgacaggggcccagaagggttccgtggccccaagggt
gacctcggcagacctggtcccaagggaatccccggagtggccgggccaagtggggagccg
ggcatgccgggcaaggacggccagaatggtgtgccaggactcgatggccagaagggagag
gctggtcgcagcggtgctccgggagagaagggccccaacgggctgccgggcctccccgga
cgagcagggtccaaaggcgagaagggagaaaggggcagagctggggagctgggtgaggct
ggcccctcaggagagccaggcgtccctggagatgctggcatgcctggggagcgtggtgag
gctggccaccggggctcagcgggggccctcggcccacaaggccctcccggagcccctggc
atccgaggcttccagggccagaagggcagcatgggagaacccggccttccaggcccccag
ggcttccgaggtgatgtgggcgaccggggtccgggaggcaccgcaggccctaagggagac
cagggcattgcaggttccgacggtcttcctggggataaaggagaactgggtgccagcggc
ccggtcggacccaagggagagtctggcagtcgaggggagctgggccctaaaggcatccag
ggtcccaatggcaccagcggcgttcagggtgtccccgggccccccggtcctctgggcctc
cagggcgtccagggtgttcctggcatcacggggaagccgggcgttccggggaaggaggcc
agcgagcagcgcatcagggagctgtgtggggggctgatcagcgaacaaattgcacagtta
gccacgcacctgaggaagcctttggcaccggggtccattggccggcccggtccagctggc
ccccctggacccccaggacccccaggctccattggtcaccctggcgctcgaggacccccc
ggataccgcggtcccactggggagctgggagaccccgggcccagaggaagccagggtgac
agaggagacaaaggcgtggcaggagcagggctggacgggcctgacggagaccaggggccc
caaggaccccaaggcgtgcccggcaccagcaaggatggccaggacggtgctcccggcgag
cctgggcctcccggagatcctgggcttccaggtgccattggcgcccaggggacaccgggg
atctgcgacacctcagcctgccaaggagccgtgttaggaggggtcggggaaaaatcaggc
tctagaagctcataa

KEGG   Macaca mulatta (rhesus monkey): 720286
Entry
720286            CDS       T01028                                 

Gene name
COMP
Definition
(RefSeq) cartilage oligomeric matrix protein
  KO
K04659  thrombospondin 2/3/4/5
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04145  Phagosome
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05144  Malaria
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    720286 (COMP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    720286 (COMP)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    720286 (COMP)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    720286 (COMP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    720286 (COMP)
  09174 Infectious disease: parasitic
   05144 Malaria
    720286 (COMP)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:mcc04131]
    720286 (COMP)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    720286 (COMP)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    720286 (COMP)
   04990 Domain-containing proteins not elsewhere classified [BR:mcc04990]
    720286 (COMP)
Membrane trafficking [BR:mcc04131]
 Endocytosis
  Phagocytosis
   Opsonins
    720286 (COMP)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   720286 (COMP)
  Exosomal proteins of colorectal cancer cells
   720286 (COMP)
  Exosomal proteins of bladder cancer cells
   720286 (COMP)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   720286 (COMP)
Domain-containing proteins not elsewhere classified [BR:mcc04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   720286 (COMP)
SSDB
Motif
Pfam: TSP_C TSP_3 EGF_CA COMP EGF_3 cEGF EGF_MSP1_1 EGF hEGF
Other DBs
NCBI-GeneID: 720286
NCBI-ProteinID: XP_014978841
Ensembl: ENSMMUG00000006511
LinkDB
Position
19
AA seq 757 aa
MAPDTACVLLLTLAALGASGQSQIPSGSDLGPQMLRELQETNAALQDVRELLRQQVREIT
FLKNTVMECDACGMQQSVRTGLPSVRPLLHCAPGFCFPGVACIQTESGARCGPCPAGFTG
NGSHCTDVNECNAHPCFPRVRCINTSPGFRCEACPPGYSGPTHEGVGLAFAKANKQVCTD
INECETGQHNCVPNSVCINTRGSFQCGPCQPGFVGDQESGCQRRAQRFCPDGSPSECHEH
ADCVLERDGSRSCVCAVGWAGNGILCGRDTDLDGFPDEKLRCPERQCRKDNCVTVPNSGQ
EDVDRDGIGDACDPDADGDGVPNEKDNCPLVRNPDQRNTDEDKWGDACDNCRTQKNDDQK
DTDQDGRGDACDDDIDGDRIRNQADNCPRIPNSDQKDSDGDGIGDACDNCPQKSNPDQGD
VDHDFVGDACDSDQDQDGDGHQDSRDNCPTVPNSAQQDSDHDGQGDACDNDDDNDGVPDS
RDNCRLVPNPGQEDADRDGVGDVCQGDFDADKVVDKIDVCPENAEVTLTDFRAFQTVVLD
PEGDAQIDPNWVVLNQGREIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFI
FGYQDSSSFYVVMWKQMEQTYWQANPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDT
DSQVRLLWKDPRNVGWKDKKSYRWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGG
RLGVFCFSQENIIWANLRYRCNDTIPEDYETHQLRRA
NT seq 2274 nt   +upstreamnt  +downstreamnt
atggcccccgacaccgcctgcgttcttttgctcaccctggctgccctcggcgcgtccgga
cagagccagatcccgtcgggctcagacttgggcccgcagatgcttcgggaactgcaggaa
accaatgcggcgctgcaggatgtgcgggagctgctgcggcagcaggtcagggagatcacg
ttcctgaaaaacacggtgatggagtgtgacgcgtgcgggatgcagcagtcagtgcgcacc
ggcctacccagcgtgcggcccctgctccactgcgcgcccggcttctgcttccctggcgtg
gcctgcatccagacggagagcggcgcgcgctgcggcccctgccccgcgggcttcacgggc
aacggctcgcactgcaccgacgtcaacgagtgcaacgcccacccctgcttcccgcgagtc
cgctgtatcaacaccagcccggggttccgctgcgaggcttgcccgccggggtacagcggc
cccacccacgagggcgtggggctggctttcgccaaggccaacaagcaggtttgcacggac
atcaacgagtgtgagaccgggcaacataactgcgtccccaactccgtgtgcatcaacacc
cggggttccttccagtgcggcccgtgtcagcccggcttcgtgggcgaccaggagtccggc
tgccagcggcgcgcacagcgcttctgccccgacggctcgcccagcgagtgccacgagcat
gcggactgcgttctggagcgcgatggctcgcggtcgtgcgtgtgtgccgtcggctgggca
ggcaacgggatcctctgtggccgcgacactgacctagacggcttcccggacgagaagttg
cgttgcccagagcgccagtgccgtaaggacaactgcgtaactgtgcccaactcagggcag
gaggatgtggaccgcgatggcatcggagacgcctgcgacccggatgccgacggggatggg
gtccccaatgaaaaggacaactgcccgctggtgcggaacccagaccagcgcaacacggac
gaggacaagtggggcgatgcgtgcgacaactgccggacccagaagaacgacgaccaaaag
gacacagaccaggacggccggggcgatgcgtgcgacgacgacatcgacggcgaccggatc
cgcaaccaggccgacaactgccccaggatacccaactcagaccagaaggacagtgatggc
gatggtataggggatgcctgtgacaactgtccccagaagagcaacccggatcagggggat
gtggaccacgactttgtgggagatgcttgtgacagcgatcaagaccaggacggagatgga
catcaggactctcgggacaactgtcccacggtgcctaacagtgcccagcaggactcagac
cacgacggccagggtgatgcctgcgacaacgacgacgacaatgacggagtccctgacagt
cgggacaactgccggctggtgcccaaccctggccaggaggacgcagacagggacggcgtg
ggcgacgtgtgccagggcgactttgatgcagacaaggtggtagacaagatcgacgtgtgt
ccggagaacgctgaagtcaccctcaccgacttccgggccttccagacagtcgtgctggac
ccggagggtgacgcgcagattgaccccaactgggtggtgctcaaccagggaagggagatc
gtgcagacaatgaacagcgacccaggcctggctgtgggttacactgccttcaatggcgta
gacttcgagggcacgttccatgtgaacacggtcacggatgacgactatgcgggtttcatc
tttggctaccaagacagctccagcttctacgtggtcatgtggaagcagatggagcaaacg
tactggcaggcgaaccccttccgtgcggtggccgaacccggcatccaactcaaggctgtg
aaatcttccacgggccctggggaacagctgcggaacgcactgtggcatacaggagacaca
gactcccaggtgcggctgctgtggaaggacccgcgaaatgtgggttggaaggacaagaag
tcctatcgttggttcctgcagcaccggccccaagtgggctacatcagggtgcgattctat
gagggtcctgagctggtggccgacagcaacgtggtcttggacacaaccatgcggggtggc
cgcctgggggtcttctgtttctcccaggagaacatcatctgggccaacctgcgttaccgc
tgcaatgacaccattccagaggactatgagacccatcagctgcggcgagcctag

KEGG   Macaca mulatta (rhesus monkey): 720945
Entry
720945            CDS       T01028                                 

Gene name
LAMB3
Definition
(RefSeq) laminin subunit beta-3 isoform X1
  KO
K06244  laminin, beta 3
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    720945 (LAMB3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    720945 (LAMB3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    720945 (LAMB3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    720945 (LAMB3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    720945 (LAMB3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    720945 (LAMB3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    720945 (LAMB3)
   05145 Toxoplasmosis
    720945 (LAMB3)
SSDB
Motif
Pfam: Laminin_N Laminin_EGF Laminin_II UPF0242 ATG16 F5_F8_type_C BNR_6
Other DBs
NCBI-GeneID: 720945
NCBI-ProteinID: XP_014973102
Ensembl: ENSMMUG00000016925
LinkDB
Position
1
AA seq 1172 aa
MRSLFLLCFALPGLLHAQQACSRGACYPPVGDLLVGRTRFLRASSTCGLTKPETYCTQHG
EWQMKCCKCDSRQPHNYYSHRVENVASSSGPMRWWQSQNDVNPVSLQLDLDRRFQLQEVM
MEFQGPMPAGMLIERSSDFGKTWRVYQYLAADCTSTFPRVRQGRPQSWQDVRCQSLPQRP
NVRLNGGKVQLNLMDLASGIPATQSKKIQELGEITNLRVNFTRLAPVPQRGYHPPSAYYA
VSQLRLQGSCFCHGHADRCAPKPGASAGPSTTVQVHDVCVCQHNTAGPNCERCAPFYNNR
PWRPAEGRDTHECQRCDCNGHSETCHFDPAVFAASQGAYGGVCDNCRDHTEGKNCERCQL
HYFRNRRPGASIQETCIPCECDPDGAVPGAPCDPVTGQCVCKEHVQGERCDLCKPGFTGL
TYANPQGCHRCDCNILGSRRDVPCDEESGRCLCLPNVVGPKCDQCAPYHWKLASGQGCEP
CACDPHNSLSPQCNQFTGQCPCREGFGGLTCSAAAIRQCPDRTYGDAATGCRACDCDFRG
TEGPGCDKASGRCLCRAGLTGPRCDQCQRGYCNRYPVCVACHPCFQTYDADLREQALRLG
SLRNATASLWSGPGLEDRGLASRILDAKSKIEQIRAVLSSAAVTEQEVAQVASAILSLRR
TLQGLQPDLPLEEETLSLLRDLESLDRSFNGLLTMYQRKREQFEKLSSADPSGAFRMLTT
AYEQSAQAAQQVSDSSHLLDQLRDSRREAERLAQQAGGGGGTGSPQLVALRLEMSSLPDL
TPTFNKLCGSSRQMACTPGSCPGELCPHDNGTACGSYCRGVLPRAGGAFWTAGQVAEQLR
GFNAQLQQTRQMIRAAEESASQIQSNAQRLETQVSTSRSQMEEDVRRTRLLIQQVRDFLT
DPETDAATIQEVSEAVLALWLPTDSATVLQKMNEIQAIAARLPNVDLVLSQTKQDIARAR
RLQAEAEEARSRAHAVEGQVEDVVGNLRQGTVALQEAQDTMQGTSRSLRLIQDRVAEVQQ
VLQPAEKLVTSMTKQLGDFWTRMEELRRQARQQRAQAVQAQQLAEGASEQALSAQEGFER
IKQKYAELKDRLGQSPTLGEQGARIQSVKTEAEELFGETMEMMDRMKDMELELLRGSQAI
MLRSADLTGLEKRVEQIRDHINERVLYYATCK
NT seq 3519 nt   +upstreamnt  +downstreamnt
atgagatcactcttcctcttgtgttttgccttacctggcctcctgcatgcccagcaagcc
tgctcccgtggggcctgctatccacctgtcggggacctgcttgttgggaggacccggttt
ctccgagcttcatctacctgtggactgaccaagcctgagacctactgcacccagcatggt
gagtggcagatgaaatgctgcaagtgtgactccaggcagcctcacaactactacagtcac
cgagtagagaatgtggcttcatcctccggccccatgcgctggtggcagtcacagaatgat
gtgaaccctgtctctctgcagctggacctggacaggagattccagcttcaggaagtcatg
atggagttccaggggcccatgcccgccggcatgctgattgagcgctcctcagacttcggt
aagacctggcgagtgtaccagtacctggctgctgactgcacctccaccttccctcgggtc
cgccagggccggcctcagagctggcaggatgttcggtgccagtccctgccccagaggcct
aatgtgcgcctaaatggggggaaggtccaacttaaccttatggatttagcatctgggatt
ccagcaactcaaagtaaaaaaattcaagagctgggggagatcacaaacttgagagtcaac
ttcaccaggctggcccctgtgccccagaggggctaccaccctcccagcgcctactatgcc
gtgtcccagctccgtctgcaggggagctgcttctgtcacggccatgctgaccgctgcgcc
cccaagcctggggcctctgctggcccctccaccaccgtgcaggtccacgatgtctgcgtc
tgccagcacaacactgccggtccaaattgtgaacgctgtgcgcccttctacaacaaccgg
ccctggagacctgcagagggccgggacacccatgaatgccaaaggtgcgactgcaatggg
cactcagagacatgtcactttgaccccgctgtgtttgccgccagccagggggcatatgga
ggtgtgtgtgacaattgccgggaccacaccgaaggcaagaactgtgagcggtgtcagctg
cactatttccggaaccggcgcccgggagcttccattcaggagacgtgtatcccctgcgag
tgtgatccggatggggcagtgccaggggctccctgtgacccagtgaccgggcagtgtgtg
tgcaaggagcatgtgcagggagagcgctgtgacctatgcaagccaggcttcactggactc
acctacgccaacccgcagggctgccaccgctgtgactgcaacatcctgggatcccggagg
gacgtgccgtgtgacgaggagagtgggcgctgcctgtgtctgcccaacgtggtgggcccc
aagtgtgaccagtgcgctccctaccactggaagctggccagtggccagggctgtgaaccg
tgtgcctgcgacccacacaactccctcagcccccagtgcaaccagttcacagggcagtgc
ccctgtcgggaaggctttggtggcctgacgtgcagcgctgcagccatccgccagtgtcca
gaccggacctatggagacgcagctacaggatgccgagcctgtgactgtgatttccgggga
acagagggcccaggctgcgacaaggcatcgggccgctgcctctgccgcgctggcttgact
gggccccgctgtgaccagtgccagcgaggctactgcaaccgctacccggtgtgtgtggcc
tgccacccttgcttccagacctatgatgcggacctccgggagcaggccctgcgccttggt
agcctccgcaatgccaccgccagcctgtggtcagggccagggctggaggaccgtggcctg
gcctcccggatcctagacgcaaagagtaagattgagcagatccgagcagttctcagcagc
gccgcagtcacagagcaggaggtggctcaggtggccagtgccatcctctccctcaggcga
actctccagggcctgcagccggatctgcccctagaggaagagacgttgtcccttctgaga
gacctggagagtctggacagaagcttcaatggcctcctcactatgtatcagaggaagagg
gagcagtttgaaaaactaagcagtgctgatccttcaggagccttccggatgctaaccaca
gcctacgagcagtcagcccaggctgctcagcaggtctccgacagctcacaccttttggac
cagctcagggacagccggagagaggcagagaggctggcgcagcaggcgggaggaggagga
ggcaccggcagcccccagcttgtggccttgaggctggagatgtcttcgttgcctgacctg
acacccaccttcaacaagctctgtggcagctccaggcagatggcttgcaccccaggatca
tgccctggtgagctgtgtccccatgacaatggcacagcctgtggctcctactgcaggggt
gtccttcccagggctggtggggccttctggacggcagggcaggtggctgagcagctgcgg
ggcttcaatgcccagctccagcagaccaggcagatgattagggcagccgaggaatcagcc
tcgcagattcaatccaatgcccagcgcctggagacccaggtgagcaccagccgctcccag
atggaggaagatgtcagacgcacgcggctcctcatccagcaggtccgggatttcctaaca
gaccccgaaactgatgcagccactatccaggaggtcagcgaggctgtgctggccctgtgg
ctgcccacagactcagctaccgttctgcagaagatgaatgagatccaggccattgcagcc
aggctccccaacgtggacctggtgctgtcccagaccaaacaggacattgcgcgggcccgc
aggctgcaggctgaggctgaggaagccaggagccgagcccatgcagtggagggccaggtg
gaggatgtggttgggaacctgcggcagggcacagtggcactgcaggaagctcaggacacc
atgcaaggcaccagccgctcccttaggcttatccaggatagggttgctgaggttcagcag
gtactgcagccagcagaaaagctggtgacaagcatgaccaagcagctgggtgacttctgg
acacggatggaggagctccgccgccaagcccggcagcagcgggcacaggcagtccaggcc
cagcagcttgcagaaggtgccagcgagcaggcactgagtgcccaagagggatttgagaga
ataaaacaaaagtatgctgagttgaaggaccggttgggtcagagtcccacgctgggtgaa
cagggtgcccggatccagagtgtgaagacagaggcagaggagctatttggggagaccatg
gagatgatggacaggatgaaagacatggagttggagctgctgcggggcagccaggccatc
atgctgcgctcggcggacctgacaggactggagaagcgtgtggagcagattcgtgaccac
atcaatgagcgcgtgctctactatgccacctgcaagtga

KEGG   Macaca mulatta (rhesus monkey): 721855
Entry
721855            CDS       T01028                                 

Gene name
COL6A1
Definition
(RefSeq) collagen alpha-1(VI) chain
  KO
K06238  collagen type VI alpha
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04974  Protein digestion and absorption
mcc05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    721855 (COL6A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    721855 (COL6A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    721855 (COL6A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    721855 (COL6A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    721855 (COL6A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    721855 (COL6A1)
   00536 Glycosaminoglycan binding proteins [BR:mcc00536]
    721855 (COL6A1)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   721855 (COL6A1)
Glycosaminoglycan binding proteins [BR:mcc00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   721855 (COL6A1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   721855 (COL6A1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3
Other DBs
NCBI-GeneID: 721855
NCBI-ProteinID: XP_014988172
UniProt: F6ZXR6
LinkDB
Position
3
AA seq 1028 aa
MRAARALLPLLLQACWTAGQDEPETPRAVAFQDCPVDLFFVLDTSESVALRLKPYGALVD
KVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDDVEIIQGLTRMPGDRDTLKSRVDAI
KYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAK
HLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGQSRDAEEVISQTIDTIVDMIK
NNVEQVCCSFECQPARGPPGLRGDPGFEGERGKPGLPGEKGEAGDPGRPGDLGPVGYQGM
KGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVKGEMGYPGLPGCKGSPGFDGIQGPPGP
KGDPGAFGLKGEKGEPGADGEAGRPGSSGPPGDEGEPGEPGPPGEKGEAGDEGNPGPDGA
PGERGGPGERGPRGTPGTRGPRGDPGEAGPQGDQGREGPVGVPGDPGEAGPIGPKGYRGD
EGPPGSEGARGAPGPAGPPGDPGLMGERGEDGPPGNGTEGFPGFPGYPGNRGPPGINGTK
GYPGLKGDEGEAGDPGDDNNDIAPRGVKGAKGYRGPEGPQGPPGHPGPPGPDECEILDII
MKMCSCCECKCGPIDLLFVLDSSESIGLQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA
GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKNLQWMAGGTFTGEALQYTRDQLLPPSPN
NRIALVITDGRSDTQRDTTPLNVLCSPGVQVVSVGIKDVFDFVPGSDQLNVISCQGLAPS
QGRAGLSLVKENYAELLEDAFLKNVTAQICIDKKCPDYTCPITFSSPADITILLDSSASV
GSHNFDTTKRFAKRLAERFLMAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTAL
ASAVDAMDFINDATDVNDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQ
EAQRAGIEIFVVVVGRQVNEPHIRVLVTGKTAEYDVAYGERHLFRVPSYQALLRGVFHQT
VSRKVALG
NT seq 3087 nt   +upstreamnt  +downstreamnt
atgagggcagcccgcgctctgctgcccctgctgttgcaggcctgctggacggccgggcag
gatgagccggagacccccagggcggtggccttccaggactgccccgtggacctgttcttt
gtgctggacacctcggagagtgtggccctgaggctgaagccctacggggccctcgtggac
aaggtcaagtccttcaccaagcgtttcatcgacaacctgagggacaggtactaccgctgt
gaccgaaacctggtgtggaacgcgggtgcactgcactacagcgacgacgtggagatcatc
caaggcctcacgcgcatgcccggtgaccgcgacacgctcaaaagcagagtggacgccatc
aagtattttgggaagggcacctacactgactgtgccatcaagaaggggctggagcagctc
ctcgtggggggctcccacctaaaggagaacaagtacctgattgtggtgaccgacgggcac
cccctggagggctacaaggagccatgcggggggctggaggacgcagtgaacgaggccaag
cacctgggcgtcaaagtcttctcagtggccatcacgcccgaccacctggagccgcgtctg
agcatcatcgccacggaccacacgtaccggcgcaacttcacggcggctgactggggccag
agccgcgacgcggaggaggtcatcagccagaccatcgacaccatcgtggacatgatcaaa
aataacgtggagcaagtgtgctgctccttcgaatgccagcctgcaagaggacctccgggg
ctccggggcgaccctggcttcgagggagaacgaggcaagccagggctcccgggagagaag
ggagaagccggagatcctggaagacccggggacctcggacctgttgggtaccaggggatg
aagggagaaaaagggagccgtggggagaagggctccaggggacccaagggctacaaggga
gagaagggcaagcgtggcatcgacggggtggacggcgtgaagggggagatggggtaccca
ggcctgccaggctgcaagggctcgcccgggtttgacggtattcaaggaccccccggaccc
aagggagaccccggcgcctttggactgaaaggagaaaagggtgagcctggagctgacggg
gaggcggggagaccagggagctcgggaccacctggagacgagggcgagccaggagagcct
ggtcctcccggagagaaaggagaggcgggtgacgaggggaacccaggacctgacggtgcc
cccggggagcggggcggccctggagagagaggaccacgggggaccccaggcacgcgggga
ccaagaggagaccctggtgaagctggcccacaaggtgatcagggaagagaaggccccgtt
ggtgtccctggagacccgggcgaggctggccctatcggacctaaaggctaccgaggcgat
gagggtcccccaggctccgagggtgccagaggagccccaggacctgctggaccccctgga
gaccctgggctgatgggtgagaggggtgaagacggcccccctggaaatggcaccgagggc
ttccctggcttccccgggtatccgggcaacaggggccctcccgggataaacggcacgaag
ggctaccccggcctcaagggggacgagggggaagccggggaccccggagatgacaacaac
gacattgcaccccgaggagtcaaaggagcaaaggggtaccggggccccgagggaccccag
ggacccccaggacacccaggaccgcccgggccagacgaatgcgagattttggacatcatc
atgaaaatgtgctcttgctgtgaatgcaagtgcggccccatcgacctgctcttcgtactg
gacagctcagagagcatcggcctgcagaacttcgagattgccaaggacttcgtcgtcaag
gtcatcgaccggttgagccgggacgagctggtcaagtttgagccgggacagtcgtacgca
ggtgtggtgcagtacagccacagccagatgcaggagcacgtgagcctgcgcagccccagc
atccggaacgtgcaggagctcaaggaagccatcaagaacctgcagtggatggcgggcggc
actttcacaggagaggccctgcagtacacgcgggaccagctgctgccacccagcccgaac
aaccgcatcgctctagtcatcactgacgggcgctcagacacccagagggacaccacgccg
ctcaacgtgctctgcagtccaggcgtccaggtggtctccgtgggcatcaaggacgtgttt
gacttcgtcccgggctccgaccagctcaatgtcatttcttgccaaggcctggcaccatcc
cagggccgggccggcctctcgctggtcaaggagaactatgcagagctgctggaggacgcc
ttcctgaagaatgtcaccgcccagatctgcatagataagaagtgtccagattacacctgc
cccatcacgttctcctcccccgctgacatcaccatcctgctggacagctctgccagcgtg
ggcagccacaactttgacaccaccaagcgcttcgccaagcgcctggccgagcgcttcctc
atggcgggcaggacggaccccgcccacgacgtgcgggtggctgtggtgcagtacagcggc
acaggccagcagcggccagagcgggcgtcgctgcagttcctgcagaactacacggccctg
gccagtgctgtcgatgccatggactttatcaacgacgccacggacgtcaacgacgccctg
ggctatgtgacccgcttctaccgcgaggcctcgtctggcgctgcaaagaagaggctgctg
ctcttctcagatggcaactcacagggcgccacgcccgcggccatcgagaaggccgtgcag
gaggcccagcgagcgggcatcgagatcttcgtggtggtcgtgggccgccaggtgaatgag
ccccacatccgcgtcctggtcaccggcaagacggccgagtacgacgtggcctacggcgag
cgccacctgttccgtgtccccagctaccaggccctgctgcgcggcgtcttccaccagacc
gtctccaggaaggtggcactgggctag

KEGG   Macaca mulatta (rhesus monkey): 722019
Entry
722019            CDS       T01028                                 

Gene name
VWF
Definition
(RefSeq) von Willebrand factor precursor
  KO
K03900  von Willebrand factor
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc04610  Complement and coagulation cascades
mcc04611  Platelet activation
mcc04613  Neutrophil extracellular trap formation
mcc05165  Human papillomavirus infection
mcc05171  Coronavirus disease - COVID-19
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    722019 (VWF)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    722019 (VWF)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    722019 (VWF)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    722019 (VWF)
   04611 Platelet activation
    722019 (VWF)
   04613 Neutrophil extracellular trap formation
    722019 (VWF)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease - COVID-19
    722019 (VWF)
   05165 Human papillomavirus infection
    722019 (VWF)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03110 Chaperones and folding catalysts [BR:mcc03110]
    722019 (VWF)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:mcc04147]
    722019 (VWF)
Chaperones and folding catalysts [BR:mcc03110]
 Intramolecular chaperones
  Others
   722019 (VWF)
Exosome [BR:mcc04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   722019 (VWF)
SSDB
Motif
Pfam: VWD VWA C8 VWA_N2 TIL VWA_2 VWC Pacifastin_I MCR_beta_N
Other DBs
NCBI-GeneID: 722019
NCBI-ProteinID: NP_001230015
Ensembl: ENSMMUG00000022893
UniProt: F5XVB6
LinkDB
Position
11
AA seq 2813 aa
MIPARLAGVLLALALVLPGTLCAEGTRGRSSMARCSLFGSDFINTFDGSMYSFAGYCSYL
LAGDCQKRSFSIIGDFQNGKRVSLSVYLGEFFDIHLFVNGTVTQGDKSVSMPYASKGLYL
ETEAGYYKLSGEAYGFVARIDGSGNFQVLLSDRYFNKTCGLCGNFNIFAEDDFMTQEGTV
TSDPYDFANSWALSSGEQWCERASPPSSSCNISSGEVQKGLWEQCQLLKSTSVFARCHPL
VDPEPFVALCEKTSCECAGGLECTCPAFLEYTRTCAQEGMVLYGWTDHSACSPVCPAGME
YKQCVSPCARTCQSLHINEVCQERCVDGCSCPEGQLLDEGLCVESTECPCMHSGKRYPPG
ASLSRDCNTCICRNSQWICSNEECPGECLVTGQSHFKSFDNRYFTFSGICQYLLARDCED
HSFSIVIETVQCADDPDAVCTRSVTVRLPGLHNSLVKLKHGGGVAMDGQDVQLPLLKGDL
RIQHSVTASVRLSYGEDLQMDWDGRGRLLVKLSPVYAGKTCGLCGNYNGNQGDDFLTPSG
LAEPRVEDFGNAWKLHADCQDLQKQHSDPCALNPRMTRFSEEACAVLTSPTFEACHRAVS
PLPYLRNCRYDVCSCSDGRECLCGALASYAAACAGRGVRVAWREPGRCELKCPKGQVYLQ
CGTPCNLTCRALSYPDEECNEACLEGCFCPPGLYMDEMGDCVPKAQCPCYYDGEIFQPED
IFSDHHTMCYCEDGFMHCTMSGVPGSLLPDAVLSSPLSHRSKRSLSCRPPMVKLVCPADN
PRAEGLECAKTCQNYDLECMSMGCVSGCLCPPGMVRHENRCVALERCPCLHQGKEYAPGE
AVKIDCNTCVCRDRKWNCTDHVCDATCSTIGMAHYLTFDGLKYMFPGECQYVLVQDYCGG
NPGTFRILVGNEGCSHPSVKCKKHVTILVEGGEIELFDGEVNVKRPMKDETHFEVVESGR
YIILLLGKAISVVWDRHLSISVVLKQTYQEKVCGLCGNFDGIQNNDLTSSNLQVEEDPVD
FGNSWKVSSQCADTRKVPLDSSPATCHNNLMKQTMVDSSCRILTSDVFQDCNKLVDPEPF
LDVCIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVTWRTATLCPQSCEERNLRENGY
ECEWRYNSCAPACRVTCQHPEPLPCPVQCVEGCHAHCPPGKILDELLQTCVSPEDCPVCE
AAGRRFASGKKVTLNPSDPEHCQICHCDGVNLTCEACEEPGGLVVPPTDAPVSPTTPYVE
DISEPPLHDFYCSRLLDLVFLLDGSSRLSEAEFEVLKAFVVDMMERLRISQKWVRVAVVE
YHDGSHAYIGLKDRKRPSELRRIASQVKYAGSQVASTSEVLKYTLFQIFGKIDRPEASRI
ALLLMASQEPQRMSRNFVRYVQGLKKKKVIVIPVGIGPHANLKQIHLIEKQSPENKAFVL
SGVDELEQQRDEIVSYLCDLAPEAPPPTLSPDMAQVTVGPGLLGVSTLGPKRNSMVLDVA
FVLEGSDKIGEADFNRSKEFMEEVIQRMDVGQDGIHVTVLQYSYTVAVEYPFSEAQSKGD
ILQRVREIRYQGGNRTNTGLALQYLSEHSFLVSQGDREQAPNLVYMVTGNPASDEIKRLP
GDIQVVPIGVGPHANVQELERIGWPNAPILIQDFETLPREAPDLVLQSCCSGEGLKIPTL
SPAPDCSQPLDVILLLDGSSSFPAAYFDEMKSFAKAFISKANIGPHLTQVSVLQYGSITT
IDVPWNVAPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTSEMHGARPGASKAVVILV
MDVSVDAVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGPAGNSNVVKLQRIEDLPTM
VTLGNSFLHKLCSGFVRICMDEDGNERRPGDIWTLPDQCHTVTCQPDGQTLLESHRVNCD
RGLRPSCPNSQSPVKVEETCGCRWTCPCVCTGSSTRHIVTFDGQNFKLTGSCSYVLFQNK
EQDLEVILHNGACSPAARQGCMKSIEVKHAALSVELHSDMEVTVNGRLVSVPYVGGNMEV
NVYGAIMHEVRFNHLGHIFTFTPQNNEFQLQLSPKTFASKTYGLCGICDENGANDFMLRD
GTVTTDWKTLVQEWTVQRPGQTCQPILEEQCLVPNSSQCQVLLSALFAECHKVLAPATFY
AICQQDSCHREQVCEVIASYAHLCRTNGVCVDWRTPDFCAMSCPPSLVYNPCERGCPRHC
NGNVSSCGDHPSEGCFCPPNKVMLEGSCVPEEACTQCIGEDGVQHQFLEAWVPDHQPCQI
CTCLSGRKANCTMQPCPTAKAPTCGLCEVARLRQNADQCCPEYECVCDPESCDLPPVPRC
EGGLQPTLTNPGECRPNFTCACRKEECKRVSPPSCPPHRLPTLRKTQCCDEYECACNCVN
STVSCPLGYLASTATNDCGCTTTTCLPDKVCVHRSTIYPVGQFWEEGCDVCTCTDMEDAV
MGLRVVQCSQKPCEDSCRSGFTYVPREGECCGRCLPSACEVVTGSPRGDSQSSWKSVGSH
WASPENPCLINECVRVKEEVFVQQRNVSCPQLEVPVCPSGFQLSCKTSACCPSCRCEPVE
ACMLNGTMIGPGKSVMIDACTTCRCIVQVGIISGFKLECRKTICNPCPLGYKEENNTGEC
CGRCLPTVCTIRLRGGQIMTLKRDETLQDGCDTHFCKVNERGEYFWEKRVTGCPPFDEHK
CLAEGGKIMKIPGTCCDTCEEPECSDITARLQYVKVGSCKSEVEVDIHYCQGKCASKAMY
SIDINDVQDQCSCCSPTRTEPMQVPLHCTNGSVVYHEVLNAMQCECSPRKCSK
NT seq 8442 nt   +upstreamnt  +downstreamnt
atgattcctgccagattggccggggtgctgcttgctctggccctcgtcttgccagggacc
ctttgtgcagaagggactcgcggcaggtcatccatggcccgatgcagcctcttcggaagt
gacttcatcaacacctttgatgggagcatgtacagctttgcgggatactgcagttatctc
ctggcaggggactgccagaaacgctccttttcgattattggggacttccagaatggcaag
agagtgagcctctccgtgtatcttggggaattttttgacatccatttgtttgtaaatggt
actgtgacacagggggacaaaagcgtctccatgccctatgcctccaaagggctgtatcta
gaaactgaggctgggtactacaagctgtccggtgaggcctatggctttgtggccaggatc
gatggcagcggcaactttcaagtcctgctgtcagacagatacttcaacaagacctgcggg
ctgtgtggcaactttaacatctttgctgaagatgacttcatgacccaagaagggaccgtg
acctcagacccctatgactttgccaactcatgggctctgagcagtggagaacagtggtgt
gaacgggcatctcctcccagcagctcgtgcaacatctcctctggggaagtgcagaagggc
ctgtgggagcagtgccagcttctgaagagcacctcagtgtttgcccgctgccaccctctg
gtggatcccgagcctttcgtggccctgtgtgagaagacttcgtgtgagtgtgctgggggg
ctggagtgcacgtgccctgccttcctggagtacacccggacctgtgcccaggagggaatg
gtgctgtatggctggactgaccacagcgcgtgcagcccagtgtgccctgcgggtatggag
tataagcagtgtgtgtccccttgcgccaggacctgccagagcctgcacatcaatgaagta
tgtcaggagcgatgcgtggatggctgcagctgccctgagggacagctcctcgatgaaggc
ctctgcgtggagagcaccgagtgtccctgcatgcattccggaaagcgctaccctcctggt
gcctccctctctcgagactgcaacacctgcatttgccgaaacagccagtggatctgcagc
aatgaagaatgtccaggggagtgcctcgtcacgggtcaatcccatttcaagagctttgac
aacagatacttcaccttcagtgggatctgccagtacctgctggcccgggattgcgaggac
cactccttctccattgtcattgagaccgtccagtgtgctgacgaccctgatgctgtgtgc
acccgttccgtcaccgtccggctgcctggcctgcacaacagccttgtgaagctgaagcat
gggggaggagttgccatggatggccaggacgtccagctcccactcctgaaaggtgacctc
cgcatccagcattcagtgacggcctccgtgcgcctcagctacggggaggacctgcagatg
gactgggacggccgcgggagactgctggtgaagctgtcccccgtctacgccgggaagacc
tgcggcctgtgtgggaattacaatggcaaccagggcgatgacttccttaccccttctggg
ctggcggagccccgcgtggaggacttcgggaacgcctggaagctgcacgcggactgccag
gacctgcagaagcagcacagcgatccctgcgccctcaacccgcgcatgaccaggttctcc
gaggaggcatgcgcggtcctgacgtcccccacgttcgaggcctgccatcgtgccgtcagc
ccgctgccctacctgcggaactgccgctacgacgtgtgctcctgctccgatggtcgcgag
tgcctgtgcggcgccctggccagctatgccgcggcctgcgcggggagaggcgtgcgcgtc
gcgtggcgcgagcctggtcgctgcgagctgaaatgcccgaaaggccaggtgtacctgcag
tgcgggaccccctgcaacctgacctgccgcgcgctctcttacccggatgaggaatgcaat
gaggcctgcctggaaggctgcttctgccccccagggctctacatggatgagatgggggac
tgcgtacccaaggcccagtgcccctgttactatgatggtgagatcttccagccggaagac
atcttctcagaccatcacaccatgtgctactgtgaggatggcttcatgcactgcaccatg
agtggagtccccggaagcttgctgcctgacgctgtcctcagcagtcccctgtctcaccgc
agcaaaaggagcctatcctgtcggcctcccatggtcaagctggtgtgtcccgctgacaac
ccgcgggctgaagggctcgagtgtgccaagacgtgccagaactacgacctggagtgcatg
agcatgggctgtgtctctggctgcctctgcccaccgggcatggtccggcatgaaaacaga
tgtgtggccctggagaggtgtccctgcttgcatcagggcaaggagtatgcccccggagaa
gcagtgaagattgactgcaacacgtgtgtctgtcgggaccggaagtggaactgcacagac
cacgtgtgtgacgccacgtgctccacgatcggcatggcccactacctcaccttcgacggg
ctcaaatacatgttccccggggagtgccagtacgttctggtgcaggattactgtggtggt
aaccctgggacctttcggatcctggtggggaatgagggatgcagccacccttcagtgaaa
tgcaagaaacatgtcaccatcctggtggagggaggagagattgagctgttcgacggggag
gtgaatgtgaagaggcccatgaaggatgagactcactttgaggtggtggagtctggccgg
tacatcattctgctgttgggcaaagccatctccgtggtctgggaccgccacctgagcatc
tctgtggtcctgaagcagacataccaggagaaagtgtgtggcctgtgtgggaattttgat
ggcatccagaacaatgacctcaccagcagcaaccttcaagtggaggaagaccccgtggac
tttgggaactcctggaaagtgagctcgcagtgtgctgacaccagaaaagtgcctctggac
tcatcccctgccacctgccataacaacctcatgaagcagacgatggtggattcctcctgt
agaatccttaccagtgatgtcttccaggactgcaacaagctggtggaccccgagccattc
ctggatgtctgcatatacgacacctgctcctgtgagtccattggggactgcgcctgcttc
tgcgacaccattgctgcctatgctcacgtgtgtgcccagcacggcaaggtggtgacctgg
aggacagccacattgtgcccccagagctgcgaggagaggaatctccgggagaacgggtat
gagtgtgaatggcgctataacagctgcgcacctgcctgtcgagtcacgtgccagcaccct
gagccgctgccctgccctgtgcagtgtgtggagggctgccatgcccactgccctccaggg
aaaatcctggatgagcttttgcagacctgcgtcagccccgaagactgcccggtgtgtgag
gcggctggccggcgttttgcctcaggaaagaaagtcaccttgaatcccagcgaccctgag
cactgccagatttgccactgtgatggtgtcaacctcacctgtgaagcctgcgaggagccg
ggaggccttgtggtgcctcccacagatgctccggtgagccccaccactccgtatgtggag
gacatctcggaaccgcccttgcacgatttctactgcagcaggctactggacctggtcttc
ctgctggatggctcctccaggctgtccgaggctgagtttgaagtgctgaaggcctttgtg
gtggacatgatggagcggctgcgcatctcccagaagtgggtccgcgtggccgtcgtggag
taccatgatggctcccatgcctacatcgggctcaaggaccggaaacgaccatcagagctg
cggcgcattgccagccaggtgaagtatgcaggcagccaggtggcctccaccagtgaggtc
ttgaaatacacactgttccaaatctttggcaagatcgaccgccctgaagcctcccgcatc
gccctgctcctgatggccagccaggagccccaacggatgtcccggaactttgtccgctat
gtccagggcctgaagaagaagaaggtcattgtaatcccggtgggcattgggccccatgcc
aacctcaagcagatccacctcattgaraagcagtcccctgagaacaaggccttcgtgctg
agcggtgtggatgagctggagcagcaaagggatgagattgttagctacctctgtgacctt
gcccctgaagcccctcctcctactctgtcccccgacatggcacaagtcactgtgggccca
gggctcttgggggtttcaaccctggggcccaagaggaattccatggttctggatgtggca
tttgtcctggaaggatcggacaaaattggtgaagccgactttaacaggagcaaggagttc
atggaggaggtgattcagcggatggatgtgggccaggacggcatccacgtcacggtactg
cagtactcctatacagtggcggtggagtaccccttcagcgaggcacagtccaaaggggac
atcctgcagcgggtgcgagagatccgctaccagggcggcaacaggaccaacactgggctg
gccctgcagtacctctccgagcacagcttcttggtcagccagggtgaccgggagcaggcg
cccaacctggtctacatggtcaccggaaatcctgcctctgatgagatcaagaggttgcct
ggagacatccaggtggtgcccattggagtgggccctcatgccaatgtacaggagctggag
aggattggctggcccaatgcccccatcctcatccaggactttgagacacttccccgagag
gctcctgacctggtgctgcagagctgctgctccggagaggggctgaagatccccaccctc
tcccctgcccctgactgcagccagcccctggacgtgatccttctcctagatggctcctcc
agtttcccagctgcttattttgatgaaatgaagagttttgccaaggctttcatttcaaaa
gccaatatagggcctcatctcactcaggtgtcagtgctccagtatggaagcatcaccacc
attgatgtgccgtggaacgtggccccggagaaagcccatttgctgagccttgtggacgtc
atgcagcgggagggaggccccagccaaattggggatgctttgggctttgctgtgcgatac
ttgacttcagaaatgcatggtgccaggccgggagcctcgaaggcggtggtcatcctggtc
atggacgtctctgtggatgcggtggatgcagcagctgatgccgccagatccaacagagtg
acagtgttccctattggaattggagatcgctacgatgcagcccagctaaggatcttggca
ggcccagcaggtaactctaacgtggtgaagctccagcgaatcgaagacctccctaccatg
gtcaccttgggcaattcgttcctccacaaactgtgctctggatttgttaggatttgcatg
gatgaggatgggaatgagaggaggcctggggacatctggaccttgccagaccagtgccac
accgtgacttgccagccagatggccagaccttactggagagtcatcgggtcaactgtgac
cgggggctgaggccttcatgccccaacagtcagtcccctgttaaagtggaagagacctgt
ggctgccgctggacctgcccctgcgtgtgcacaggcagctccactcggcacatcgtgacc
tttgatgggcagaatttcaagctgactggcagctgttcttacgtcctatttcaaaacaag
gagcaggacctggaggtgattctccataatggtgcctgcagccctgcagcaaggcagggc
tgcatgaaatccatcgaggtgaagcacgctgccctctctgtcgagctgcacagtgacatg
gaggtgacagtgaatgggagactggtctctgttccttacgtgggtgggaacatggaagtc
aacgtttatggtgccatcatgcatgaggtcagattcaaccatcttggccacatcttcaca
ttcactccacaaaacaatgagttccaactgcaactcagccccaagacttttgcttcaaag
acgtatggtctctgtgggatctgtgatgagaatggagccaatgacttcatgctgagggat
ggtacagtcaccacagactggaaaacacttgttcaggaatggactgtgcagcggccaggg
cagacatgccagcccattctggaagagcagtgtcttgtccccaacagctcccagtgccag
gtgctcctctcagcactgtttgctgaatgccacaaggtcctggctccagccacattctat
gccatctgccagcaggacagttgccaccgggagcaagtgtgtgaggtgatcgcctcttat
gcccacctctgtcggaccaatggggtctgcgttgactggaggacacctgatttctgtgct
atgtcatgcccaccatccctggtctacaacccctgtgagcgtggctgtccccggcactgc
aatggcaacgtgagctcctgtggggaccatccctcggaaggctgtttctgtcctccgaat
aaagtcatgttggaaggcagctgtgtccctgaagaggcctgcactcagtgcattggtgag
gatggagtccagcaccagttcctggaagcctgggtcccggaccaccagccctgtcagatc
tgcacgtgcctcagcgggcggaaggccaactgcacaatgcagccctgccccacggccaaa
gctcccacctgcggcctgtgtgaggtggcccgcctccgccagaatgcagaccagtgctgc
cctgagtatgagtgcgtgtgtgatccggagagctgtgacctgcccccagtgcctcgctgt
gaaggtggcctccagcccacactgaccaaccctggcgagtgcagacccaacttcacctgc
gcctgcaggaaggaggagtgcaaaagggtgtcgccgccctcctgccccccgcaccgtttg
cccacccttcggaagacccagtgctgtgatgagtatgagtgtgcctgcaactgtgtcaac
tccacagtgagctgtccccttgggtacttggcctcaaccgccactaatgactgtggctgc
accacaaccacctgccttcccgacaaggtgtgtgtccaccgaagcaccatctacccggtg
ggccagttctgggaagagggctgcgacgtgtgcacctgcaccgacatggaggatgctgtg
atgggcctccgcgtggtccagtgctcccagaagccctgtgaagacagctgtcggtcgggc
ttcacttacgttccgcgtgaaggcgagtgctgtggaaggtgcctgccgtctgcctgtgaa
gtggtgaccggctcaccacggggggactcccagtcttcctggaagagcgtcggctcccac
tgggcctccccggagaacccctgcctcatcaacgagtgtgtccgagtgaaggaggaggtc
tttgtacaacaaaggaacgtctcctgcccccagctggaggtccctgtctgcccctcgggc
tttcagctgagctgtaagacctcagcgtgctgcccgagctgtcgctgtgagcctgtggag
gcctgcatgctcaatggcactatgattgggcccgggaagagcgtgatgatcgatgcatgc
acgacgtgccgctgcatcgtgcaggtggggatcatctctggattcaagctggagtgcagg
aagaccatctgcaatccctgccccctgggttacaaggaagaaaataacacaggtgaatgt
tgtgggagatgtttgcctacagtttgcaccattcggctaagaggaggacagatcatgaca
ctgaagcgtgatgagacgctccaggatggctgtgatactcacttctgcaaggtcaatgag
agaggagagtacttctgggagaagagggtcacgggctgcccgccctttgatgaacacaag
tgtctggctgagggaggtaaaattatgaaaattccgggcacctgctgtgacacatgtgag
gagcccgagtgcagtgacatcactgccaggctgcagtatgtcaaggtgggaagctgtaag
tccgaagtagaggtggatatccactactgccagggcaaatgtgccagcaaagccatgtac
tccattgacatcaacgatgtgcaggaccagtgctcctgctgctctccaacacggacggag
cccatgcaggtgcccctgcactgcaccaatggctctgttgtgtaccacgaggttctcaat
gccatgcagtgcgaatgttcccccaggaagtgcagcaagtga

KEGG   Macaca mulatta (rhesus monkey): 722430
Entry
722430            CDS       T01028                                 

Gene name
LAMC3
Definition
(RefSeq) laminin subunit gamma-3
  KO
K06247  laminin, gamma 3
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    722430 (LAMC3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    722430 (LAMC3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    722430 (LAMC3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    722430 (LAMC3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    722430 (LAMC3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    722430 (LAMC3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    722430 (LAMC3)
   05145 Toxoplasmosis
    722430 (LAMC3)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 722430
NCBI-ProteinID: XP_014972034
Ensembl: ENSMMUG00000002267
LinkDB
Position
15
AA seq 1575 aa
MAAAALLLGLALLAPRAAGSGMGACYDGAGRPQRCLPVFENAAFGRLAQASHTCGSPPED
FCPHVGAAGAGAQCQRCDAADPQRHHNASYLTDFHSQDESTWWQSPSMAFGVQYPTSVNI
TLRLGKAYEITYVRLKFHTSRPESFAIYKRSRADGPWEPYQFYSASCQKTYGRPEGQYLR
PGEDERVAFCTSEFSDISPLSGGNVAFSTLEGRPSAYNFEESPGLQEWVTSTELLISLDR
LNTFGDDIFKDPKVLQSYYYAVSDFSVGGRCKCNGHASECGPDAAGQLACRCQHNTTGTD
CERCLPIFQDRPWARGTAEAAHECLPCNCSGRSEECTFDRELFRSTGHGGRCHHCRDHTA
GPHCERCQENFYHWDPRMPCQPCDCHSAGSLHLQCDDTGTCTCKPTVTGWKCDRCLPGFH
SLSEGGCRPCTCNPAGSLDTCDPHSGRCPCKENVEGNLCDRCRPGTFNLQPHNPVGCSSC
FCYGHSKVCAATAQFQVHHIFTDFHQGTEGWWARSVGGPEHPPQWSPNGVLLSPEDEEEL
TAPEKFLGDQRFSYGQPLILTFQVPPRSSPLPVQLRLEGAGLALSLRHSSLSGPQDTGHP
REVELRFHLQETSEDVAPPLPPFHFQRLLTNLTSLRLRVSPGPSPAGQVFLTEVRLTSAR
PGLSPPASWVETCSCPSGYMGQFCESCAPGYKRETPQGGPYASCVPCTCNQHGTCDPNTG
ICVCSHHTEGPSCERCLPGFYGNPFAGQADDCQPCPCPGQSACTTIPESGEVACTHCPPG
QRGRRCEVCDDGFFGDPLGLSGPPQPCYQCQCSGNVDPNAVGNCDPLSGHCLRCLHNTAG
DHCERCREGFYGSALVPRPADKCKPCSCHPQGSVSEQMPCDPVTGQCSCLPHVTGRDCSR
CYPGFFDLQPGRGCRSCKCHPLGSQEDQCHPKTGQCTCRPGVTGQACDRCQLGFFGFSIK
GCRACRCSPLGAASAQCHENGTCICRPGFEGYKCDRCHDNFFLTADGTHCQECPSCYALV
KEEAAKLKAKLTLMEGWLQGSNCGSPWGPLDILLGEAPRGDIYQGHHLLPGVREAFLEQM
MGLEGAVKASREQLQRLSKGAHCAQAGAQKTCTQLADLEAVLESSEEEILHAAAILASLE
IPQDGPGQPTKWSHLATEARALAGSHRDTATKIAATAQRALLASNTSYALLWSLLEGRMA
LETQRDLEDRYQEVQAAQKALGTAMAEVLPEAESVLATVQQVGADTAPYLALLASPGALP
QKSWAEDLSLKAKILEKTVASQQHMATKAARTLHAAAQAALRQTEPLTQLHQEARTALTQ
ASSSVQAATVTVMGARTLLADLEGMKLQFPRPKDQAALQRKADSVRDRLLADMRKKTKQA
ERMLGNAAPLSSSAKKKGREAEVLAKDSVKLAKALLREGNQVHRRASRLTSQTQATLQQA
SRQVLDSEARRQELEEAEQVGAGLSEMEQQIRESRISLEKDIEALSELLARLGSLDTHQA
PAQALNETQWALERLRLQLDSPGALQRKLRLLEQESEQQELQIQGFESDLAEIRADKRNL
EAILHSLPENCASWQ
NT seq 4728 nt   +upstreamnt  +downstreamnt
atggcggcggctgcgcttctgctagggctggcgctgctggcacctcgggcggccggctcg
ggcatgggcgcgtgctacgacggcgcggggcgcccgcagcgctgcctgcccgtgttcgag
aacgcggccttcggccggctagcccaggcctcgcacacgtgcggcagcccgcctgaggac
ttctgcccccacgtgggcgccgcgggcgcgggggcgcagtgccagcgctgcgacgcagcc
gacccccagcgccaccacaatgcctcctacctcaccgacttccacagccaggacgagagc
acctggtggcagagcccgtccatggccttcggcgtgcagtaccccacctcggtcaacatc
accctccgtctggggaaggcttacgagatcacgtatgtgaggctaaagttccacaccagt
cgccctgagagcttcgccatctacaagcgcagccgcgccgacggcccgtgggagccctac
cagttctacagtgcctcctgtcagaagacttacggccggcccgagggccagtacctgcgc
cctggcgaggatgagcgcgtggccttctgcacctcagagttcagcgacatctccccgctg
agtgggggcaatgtggccttctccaccctggagggccggcccagcgcctacaacttcgag
gagagccctgggctgcaggagtgggtcaccagcaccgaactcctcatctctctagaccgg
ctcaacacgtttggggatgacatcttcaaggaccccaaggtgctccagtcctactactat
gccgtgtccgacttctctgtgggcggcaggtgcaagtgcaatgggcatgccagcgagtgt
ggccccgacgcggcgggccagttggcctgccggtgccagcacaacaccaccggcacagac
tgtgagcgctgcctgcccatcttccaggaccgtccgtgggcccggggcactgccgaggct
gcccacgagtgtctgccctgcaactgcagtggccgctccgaggaatgcacgttcgatcgg
gagctcttccgcagcacaggccacggcgggcgctgtcaccactgccgtgaccacacagcc
gggccacactgtgagcgctgtcaggagaatttctatcactgggacccgcggatgccatgc
cagccctgtgactgccattcggcaggctctctgcacctccagtgcgatgacacaggcacc
tgcacctgcaagcccactgtgactggctggaagtgtgatcgctgtctgcccgggttccac
tcgctcagcgagggaggctgcagaccctgcacctgcaatcccgctggcagcctggacacc
tgtgacccccacagtgggcgctgcccctgcaaagagaatgtggaaggcaacctatgtgac
agatgtcgccccgggacttttaacctgcagccccacaatccggtcggctgcagcagctgt
ttctgctatggccactccaaggtgtgcgcggctactgcccagttccaagtgcatcacatc
ttcaccgatttccaccagggaaccgaaggctggtgggccagaagtgtggggggccccgag
caccccccacaatggagcccaaatggggtcctcctgagcccagaagacgaggaggagctc
acagcaccagagaaattcctgggagatcagcggttcagctatgggcagcccctcatcctg
accttccaggtgccccccaggagctccccactccctgtgcagctgaggctggaaggggcg
ggcttggccctgtccctgaggcactctagcttgtctggcccccaggataccgggcatccc
agggaggtagagctcaggttccacctgcaggagacctctgaggatgtggcccctccgctg
ccccccttccacttccagcggctcctcaccaacctgaccagcctccgcctccgtgtcagt
cccggccccagccctgctggccaagtgttcctgactgaggtccggctcacatccgcccgg
ccagggctttccccgccagcctcctgggtggagacttgttcatgtcccagtggctacatg
ggccagttctgtgaatcctgtgctccgggatataagagggagacgccacaggggggtccc
tatgccagctgtgtcccctgcacctgtaaccagcatggcacctgtgaccccaacacaggg
atctgcgtctgcagccaccataccgagggcccatcctgtgaacgctgcttgccgggtttc
tatggcaaccccttcgcgggccaagccgatgactgccagccctgtccctgccctggccag
tcggcctgcacgaccatcccagagagcggggaggtggcatgcacccactgccccccgggc
cagagagggcggcgctgtgaggtctgtgatgatggcttttttggggacccgctggggctc
tcggggcccccccagccctgctaccagtgccagtgcagcgggaatgtggaccccaatgcc
gtgggcaactgtgaccccctgtctggccactgcctgcgctgcctgcacaacaccgcgggt
gaccactgtgagcgctgtcgtgaaggtttctacgggagcgccctggtccctcggcccgca
gacaaatgcaagccctgcagctgccacccacagggctcggtcagtgagcagatgccctgc
gacccagtgacaggccagtgctcctgcctgcctcacgtgactggacgggactgtagccgc
tgctaccctggcttcttcgacctccagcctgggaggggctgccggagctgcaagtgtcac
ccactgggctcccaggaggaccagtgccatcccaagactggacagtgcacctgccgccca
ggtgtcacaggccaggcctgtgacagatgccagctgggtttctttggcttctccatcaag
ggctgccgggcctgcaggtgctccccactgggcgctgcctcagcccagtgccacgagaac
ggcacctgcatatgcaggcctggcttcgagggctacaaatgtgaccgctgccacgacaac
ttcttcctcacggcagatggcacacactgccaggaatgcccgtcctgctatgccctggtg
aaggaggaggcagccaagctgaaggccaaactgactttgatggaggggtggctgcaaggg
tccaactgtggcagtccctggggaccactagacattctgctgggagaggccccgaggggg
gacatctaccagggccaccacctgcttccaggggttcgggaagccttcctggagcagatg
atgggccttgagggtgctgtgaaggcctcccgggagcagctgcagaggctgagcaagggt
gcccactgtgcccaggccggagcccagaagacctgtacccagctggcagacctggaggca
gtgctggagtcctcggaagaggagattctgcacgcagctgccattctcgcgtctctggag
attccccaggatggtcccggccaacccaccaaatggagccacctggccacagaggcccgt
gccctcgccgggagccacagagacaccgccaccaagattgcagccactgctcagagggcc
ctgctcgcctccaacaccagctacgcgcttctctggagtctgctggagggaaggatggcc
ctggagacccagcgggacctggaggacaggtaccaggaggtccaggcggcccagaaagca
ctggggacggctatggcagaggtgctgcctgaagcggaaagcgtgttggccacagtgcag
caagttggcgcagatacagccccgtacctggctttgctggcttccccgggagctctgcct
cagaagtcctgggctgaagacctgagcctgaaggcgaagatcctggagaagacagttgcg
tcacaacagcacatggccaccaaggctgcccgaaccctccatgctgctgcccaggcagcg
ctacggcaaacagagcccctcacacagctacaccaggaggccagaaccgccctgacccag
gcttcctcgtctgtccaggctgccacagtgaccgtcatgggagccaggactctgctggct
gacctggaaggaatgaagctgcagtttcctcggcccaaggaccaggcggcactgcagagg
aaggcagactccgtcagggacagactccttgcagacatgagaaagaagaccaagcaggcg
gagaggatgctaggaaacgcggcccctctttcctccagtgccaagaagaaaggcagagaa
gcagaagtgttggccaaggacagtgtcaagcttgccaaggccttgctgagggaggggaac
caggtgcaccgccgtgccagccggctcaccagccagacgcaagccacgctccaacaggcg
tcccggcaggtgctggattctgaagcacgccgacaagagctggaggaagctgagcaggtg
ggtgctgggctgagcgagatggagcagcagatccgggaatcacgcatctccctggagaag
gacattgaggccttgtcagagctgcttgccaggctggggtcgctggacacccatcaagcc
ccagcccaggccctgaacgagactcagtgggcactagaacgcctgaggctgcagctggac
tccccgggggccttgcagaggaaactcaggctgctggagcaggaatctgagcagcaggag
ctgcagatccaaggcttcgagagtgacctcgccgagatccgcgctgacaaacggaacctg
gaggccattctgcacagcctgcctgagaactgtgccagctggcagtga

KEGG   Macaca mulatta (rhesus monkey): 722472
Entry
722472            CDS       T01028                                 

Gene name
LAMA1
Definition
(RefSeq) LOW QUALITY PROTEIN: laminin subunit alpha-1
  KO
K05637  laminin, alpha 1/2
Organism
mcc  Macaca mulatta (rhesus monkey)
Pathway
mcc04151  PI3K-Akt signaling pathway
mcc04510  Focal adhesion
mcc04512  ECM-receptor interaction
mcc05145  Toxoplasmosis
mcc05146  Amoebiasis
mcc05165  Human papillomavirus infection
mcc05200  Pathways in cancer
mcc05222  Small cell lung cancer
mcc05410  Hypertrophic cardiomyopathy
mcc05412  Arrhythmogenic right ventricular cardiomyopathy
mcc05414  Dilated cardiomyopathy
mcc05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:mcc00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    722472 (LAMA1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    722472 (LAMA1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    722472 (LAMA1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    722472 (LAMA1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    722472 (LAMA1)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    722472 (LAMA1)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    722472 (LAMA1)
   05414 Dilated cardiomyopathy
    722472 (LAMA1)
   05416 Viral myocarditis
    722472 (LAMA1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    722472 (LAMA1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    722472 (LAMA1)
   05145 Toxoplasmosis
    722472 (LAMA1)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_B Laminin_N Laminin_II Laminin_G_3 DUF1080
Other DBs
NCBI-GeneID: 722472
NCBI-ProteinID: XP_001118617
Ensembl: ENSMMUG00000011408
LinkDB
Position
18
AA seq 3079 aa
MRGGGAWGVLLASLLWVAARCQQRGLFPAILNLASNADISTNATCGEKGPEMFCKLVEHV
PGRPVRNPQCRICDGNSVNPRERHPISHAIDGTNNWWQSPSIQNGREYHWVTITLDLRQV
FQVAYVIIKAANAPRPGNWILERSLDGTTFSPWQYYAVSDSECLSRYNITPRRGPPTYRA
DDEVICTSYYSRLVPLEHGEIHTSLINGRPSADDLSPKLLEFTSARYIRLRLQRIRTLNA
DLMTLSHREPKDLDPIVTRRYYYSIKDISVGGMCICYGHASSCPWDETTKKLQCQCEHNT
CGESCNRCCPGYHQRPWRPGTVSSGNTCEACNCHNKAKDCYYDESVAKQKKSLNTAGQFR
GGGVCTNCLQNTMGINCETCIDGYYRPHKVSPYEDEPCRPCDCDPVGSLSSVCVKDDLHS
DLRNGKGPGQCPCKEGYAGEKCDRCQLGYKDYPACVSCGCNPVGSASDEPCTGPCVCKEN
VEGKACDRCKPGFYNLKEKNPQGCSECFCFGVSDVCSSLSWPVGQVSNMSGWLVTDLISP
RRIPSQQDALGGRHQVSINNTAVMQRLAPKYYWAAPEAYLGNKLTAFGGFLKYTVSYDIP
VETVDSNLMSHADVIIKGNRLTLSTQAEGLSLQPYEEYLNVVRLVPENFRDFHSKRQIDR
DQLMTVLANVTHLLIRANYNSAKMALYRLESVSLDVASSNAIGLAVAADVEHCECPQGYI
GTSCELCLSGYYRVDGILFGGICQPCECRGHAAECDVHGICIACAHNTTGDHCERCLPGF
YGEPSRGTPGDCQPCACPLTIASNNFSPTCHLNDGDEVVCDQCAPGYSGAWCERCADGYY
GNPTVPGESCVPCDCSGNVDPSEAGHCDSVTGECLKCLGNTDGAHCERCADGFYGDAVTA
KNCRACRCHAKGSHSAVCHLETGLCDCKPNVTGQQCDQCLHGYYGLDSGHGCRPCNCSVA
GSVSDDCTDEGQCHCVPGVAGKRCDRCARGFYAYQDGSCTPCDCPHTRNTCDPETGECVC
PPHTQGARCEECEDGHWGYDVEVGCQACNCSLVGSSHHQCDVVSGRCQCNWKFGGRACDQ
CSLGYRDFPDCVPCDCDLRGTSGDACNLEQGLCGCAEETGACPCKENVSGPQCNECREGS
FALRADNALGCSPCFCSGLSHLCSELEGYVRTPVTLGSDQPLLRVVSQSNLRGTTEGVYY
QAPDFLLDAATVRQHVHAEPFYWRLPRQFQGAQLMAYGGKLKYSVAFYSLDGVGTSNFEP
QVLIKGGRTRKQVIYMDTPAPENGVRQEQEVAMREHFWKYFNSVSEKPVTREDFMSVLSN
IEYILIKASYGQGLQQSRISDISMEVGRKAEKPHPEEEVASLLENCVCPPGTVGFSCQDC
APGYHRGKLPAGSGRGPRPLVAPCVPCSCNNHSDTCDPDTGKCLNCGDNTAGDHCDVCAF
GYYGKVTGSASDCALCVCPHSPPASFSPTCVLEGDHNFRCDACLLGYEGQHCERCSLGYY
GNPQTLGGSCQKCDCNLHGSVHGDCDRTSGQCVCRLGASGLRCDECEPRHILMETDCVSC
DDECVGVLLNDLDEIGGAILSLNLTGIIPVPYGILSNLENTTKYLQESLLRENMQKDLGR
IKLEGVAEETDNLQKKLTRMLASTQKVNRATERIFKESRDLAIAIERLRMNITEIIEKTT
LNQTLDEDFLLLNSTLQSMQQNGTSLLEIMQRRDFTQLHQNATLELKAAEDLLSQIQENY
QKPLEELEVLKEATSHLLSRHNSELQAAEVLVREAEAKTEESNHLLFMVNANLREFSDKK
LHVQEEQNLTSELIARGRGLIDTAAVQTDAVQDALEHLEHHQDTLLLWSARIRHHVDDLV
MHMSQRNALDLVYSAEDHAAEFQRLADVLDSGLENVRNVSLNATSAAYVHYNIQNLIEES
EELAKDAHRTVTEMSLLSESLVSNGKAAVQRSSRFLNEGDNLSRKLPVIALELSELRNKT
NRFQENADEITRQTNESVLILRAIPEDTRDKGAKTKELATSASHSAVSTLRDVAGLSQEL
LSTSASLSRVNATLQETHQLLQDSTMATLLAGRKVKDVETQANLLFDRLKPLKMLEENLS
RNLSEIKLLISQARKQAASIKVAVSADRDCIRAYQPQISSTNYNTLTLNVKTQEPDNLLF
YLGSSTSSDFLAVEMRRGKVAFLWDLGSGSTRLEFPDFPIDDSRWHSIHVARFGNIGSLS
VKEMSSNQKPPAKTSKSPGTANVLDVNNSTLMFVGGLGGQIKKSPAVKVTHFKGCLGEAF
LNGKSIGLWNYIEREGKCHGCFGSSQNEDASFHFDGSGYSVVEKSLPATVTQIIMLFNTF
SPNGLLLYLGSYGIRDFLSIELFHGRVKVTTDLGSGPLTLLTDRRYNNGTWYKIAFQRNR
KQGVLAVNDAYNTSNKETKQGETPGASSDLNRLDKDPIYVGGLPRSRVVRRGVTAKSFVG
CIKNLEISRSTFDLLRNSYGVRKGCSLEPIRSVSFLKGGYIELPPKSLSPESEWLVTFAT
KNSSGIILAALGGDAEKQRDREESHVPFFSIMLIGGNIEVHVNPGDGTGLRKALLHAPTG
TCSDGQAHSISLVRNRRIITVQLDENNRVEMKLGPLVESRTINMSNLYVGGIPEGEGTSL
LTTRRSFHGCIKDLIFNLELLDFNSAVGHEQVDLDTCWLSERPKPAPDAEDSELLPEPRA
FPEQCVVDAALEYVPGAHQFGLSQNSHFLLPFNQSAVRKRLSVELSIRTFASSGLIYYMA
HQNQADYAVLQLHGGRLHFMFDLGKGRTKVSHPALLSDGKWHTVKTDYVKRKGLITVDGQ
ESPMVTVVGDGTTLDVEGLFYLGGLPSQYRARKIGNITHSIPACIGDVTVNSKQLDKDSP
VSTFTVNRCYAAAQEGTFFDGSGYAALVKEGYKVQTDVNITLEFRTSSQNGVLLGISTAK
VDAIGLELVDGKVLFHVNNGAGRITATYEPKTATALCDGKWHTLQANKSKHRITLIVDGN
AVGAESPHTRSTSVDTNNPIYVGGYPAGVKQKCLSSQTSFRGCLRKLVLIKRPQVQSFDF
SRAFEWHGVFLHSCPGTES
NT seq 9240 nt   +upstreamnt  +downstreamnt
atgcgcggcggcggcgcgtggggcgtgctcctggcctcgctgctgtgggtcgccgcgcgg
tgccagcagagaggcctgtttcctgccattctcaatcttgccagcaacgctgacatcagt
accaatgccacctgtggcgagaaggggcccgagatgttctgcaaacttgtggagcatgtg
ccgggtcggcccgtccgaaacccacagtgccggatctgtgatggcaacagtgtgaacccc
agagaacgccatccaatatcacatgccatagatggcaccaataactggtggcaaagtccc
agcattcagaatgggagagagtatcactgggtcacaatcactctggacttaagacaggtc
tttcaagttgcatatgtcatcattaaagctgccaatgcccctcgacctggaaactggatt
ttggagcgttctctagacggcaccacgttcagcccctggcagtattatgcagtcagcgac
tcagagtgtttgtctcgttacaatataactccaagacgagggccgcccacctacagggct
gatgatgaagtgatctgcacctcctattattccagattggtgccacttgagcatggagag
attcatacatcactcatcaatggcagaccaagcgctgacgatctttcacccaagttgttg
gaattcacttctgcgcgatatattcgccttcgcttacagcgcattagaacgctcaatgca
gatctcatgacccttagccaccgggaacctaaagacctggatcctattgttaccagacgc
tattattattcaataaaggacatttctgttggaggcatgtgtatctgctatggtcatgct
agtagctgtccatgggatgaaactacaaagaaactgcagtgtcaatgtgagcataatact
tgcggggagagctgtaacaggtgctgtcctgggtaccatcagcggccctggaggcctgga
accgtgtcctctggcaatacatgtgaagcatgtaattgtcacaataaagccaaagactgt
tactatgatgaaagtgttgcaaagcagaagaagagtttgaatactgctggacagttcaga
ggaggaggggtttgcacaaactgcttgcagaacaccatgggaatcaactgtgagacctgt
attgatggatattatagaccacacaaagtgtctccttatgaggatgagccttgccgcccc
tgtgactgtgaccctgtggggtccctcagttctgtctgtgttaaggatgaccttcattct
gacttacgcaatgggaaggggccgggtcagtgcccatgtaaggaaggctatgcaggagaa
aaatgtgatcgctgccaacttggctataaggattacccagcctgtgtctcctgtgggtgc
aacccagtgggcagtgccagtgacgagccctgcacagggccctgtgtttgtaaggaaaat
gttgaggggaaagcctgcgatcgctgcaagccaggattctacaacctgaaggagaaaaac
ccccagggctgctccgagtgcttctgctttggcgtttccgatgtctgcagcagcctctct
tggcctgttggtcaggtaagcaatatgtcggggtggctggtcaccgacttgatcagtccc
aggaggattccgtctcagcaagatgcactgggtgggcgccaccaggtcagcatcaacaac
accgcggtcatgcagaggctggctcccaagtactactgggcggcccccgaggcctacctt
ggaaataagctgactgcatttggtggattcctgaaatacacggtgtcttacgatattcca
gtagagacggtagacagtaacctcatgtcgcacgctgatgtcatcattaagggaaacaga
ctcactttaagcacacaggctgagggtctgtcattgcagccttacgaagagtacctaaat
gtggttagacttgtgcccgaaaacttccgagattttcacagcaaaaggcagatcgatcgt
gaccagctgatgactgtccttgccaatgtgacacatcttttgatcagagccaactacaat
tctgcaaaaatggctctttacaggttggagtccgtctctctggacgtagccagctctaat
gccatcggcctggcagtggccgccgatgtggagcactgtgaatgtccacaaggctacata
gggacctcctgtgagttgtgcctctctggctattaccgcgtggatggaatactcttcgga
ggaatttgtcaaccctgtgaatgccgcggccacgcagctgagtgcgacgttcacggcatt
tgcattgcatgtgcgcacaacaccaccggcgaccactgcgagcggtgcttgcccggcttc
tacggggagccttcccgagggactcccggggactgccagccctgcgcctgccctctcacc
atagcctccaacaatttcagccccacctgccacctcaatgatggggatgaggtggtctgc
gaccagtgtgccccgggatactcaggagcttggtgtgagagatgtgcagatggttactat
ggaaacccaacagtgcctggcgaatcttgtgttccctgtgactgcagcggcaacgtggac
ccctcggaggctggtcactgtgactcggtcaccggggagtgcctgaagtgcctggggaac
acagacggcgcccactgtgaaaggtgtgctgacggcttctacggggacgctgtgacagcc
aagaactgccgcgcctgcagatgccatgcgaaaggctcccattctgccgtgtgccatctt
gagaccggactctgcgactgcaaaccaaacgtgactggacagcagtgtgaccagtgtttg
catggctattatgggctggactcaggccatggctgccggccctgcaactgcagcgtggca
ggctccgtgtcagatgactgcacggatgaaggccagtgtcactgcgtcccaggtgtggca
gggaaaaggtgtgacaggtgtgcccgtggcttctacgcctaccaggatggcagctgtaca
ccctgtgactgcccacacactcggaatacctgcgacccagaaactggagagtgtgtctgc
ccccctcacacacagggtgcgaggtgtgaagaatgtgaggatgggcactggggctatgat
gtggaggtggggtgccaggcctgcaactgcagtctcgtggggtcgtctcatcatcagtgt
gacgtggtcagcggccgttgccagtgcaactggaaatttggtggccgggcctgtgatcag
tgctccttgggttacagagactttcccgactgtgttccctgtgactgtgacctgaggggg
acgtcgggggatgcctgcaacctggagcagggtctctgcggctgtgcagaggaaaccggg
gcctgcccttgcaaggagaatgtgtctggccctcagtgcaacgagtgtcgagagggcagc
tttgctctccgcgcagacaacgccctgggctgcagcccctgcttctgctccgggctgtcc
cacctctgctcagagctggagggctacgtgaggaccccagtaacgctgggctccgatcag
cctcttctgcgtgtggtttctcagagtaacctgaggggcacgaccgagggggtttactac
caggcccccgacttcctgctggatgccgccaccgtccggcagcacgtccacgcagagccg
ttttactggcggctgccgcggcagttccagggagcccagctcatggcctatggtggcaaa
ctgaagtacagcgtggccttctattctttggacggcgtcggcacctccaattttgagcct
caagttctcatcaaaggtggtcggaccagaaagcaagtcatttacatggatacaccagca
ccagagaatggagtgaggcaggaacaagaagtagcaatgagagagcatttttggaaatat
tttaactccgtctcggaaaaacctgtcacgcgagaggattttatgtctgtccttagcaac
attgagtacatcctcatcaaggcatcatatggtcaaggattacagcagagcagaatctca
gacatttcaatggaggttggcagaaaggctgaaaagccgcacccagaagaagaggttgca
tctcttttagagaattgtgtctgtcctcctggcactgtgggattctcgtgtcaggactgt
gcccctgggtaccacagagggaagctcccagcagggagtggcaggggaccgcgccctctg
gttgctccttgtgtgccctgcagttgcaacaaccatagtgacacctgtgaccccgacacc
gggaagtgtctgaactgcggcgataacacagcaggtgaccattgcgatgtgtgtgctttt
ggctactatgggaaggtgactggctcagcaagtgactgtgctctgtgtgtctgtcctcac
agcccccccgccagttttagtcccacttgtgtcttggaaggtgaccacaatttccgttgt
gacgcctgtctcctgggctatgaaggacaacattgtgaaaggtgctccttgggctattat
gggaaccctcaaacactaggtggcagttgccagaagtgtgactgcaacctgcacggctct
gtccatggcgactgtgaccgcacatctgggcagtgcgtttgcaggctgggggcctcgggg
ctccggtgcgatgagtgtgaaccgaggcacattctgatggaaacggattgtgtttcctgt
gatgatgaatgtgtaggtgtgctgctgaatgacttggatgagattggtggtgccattctt
tctctgaacctcactggcattatccctgtcccatatggaattttgtcaaacctggaaaat
acaactaaatatctccaggaatctttattaagagaaaatatgcaaaaggacctgggaaga
attaagcttgaaggtgttgcagaagaaacggacaacctgcaaaagaagctcactagaatg
ttagcaagtacccaaaaggtgaacagggcaactgagagaatcttcaaggagagtcgagac
ctggccatagccattgagaggttgcggatgaacatcacagaaattattgaaaagaccact
ttaaatcagactttggatgaagatttcctgctactcaattctacacttcagagcatgcaa
cagaacgggacatctttgctggaaatcatgcagagaagagacttcacacagttgcaccaa
aatgccactcttgaactcaaggctgctgaagatttattgtcacaaattcaggaaaattac
caaaagccgctggaagaattggaggtattaaaagaagcaacaagccacctcctttcaagg
cacaacagtgaactacaggcggctgaggtgcttgtgagggaagccgaggcaaagaccgag
gaaagcaaccacctgctgttcatggtcaatgctaacctgagagaattcagtgataaaaag
ctacatgttcaagaagaacaaaatctgacctcagagctcattgcccgaggacgaggactg
atagatactgccgctgtgcaaacagatgctgtgcaagatgctctagagcacttagagcat
caccaggatacgctacttttatggtctgccagaatcaggcaccacgtagatgacctggtc
atgcacatgtcccaaaggaacgcgcttgacctggtctacagcgctgaggaccatgccgct
gagttccagagactggcagacgttctggacagtggccttgaaaatgtcagaaatgtgtcc
ctgaatgccaccagtgcagcctatgtccattacaacatccagaacctgatcgaagaatcg
gaggaactggccaaagatgctcacaggactgtgactgagatgagcctgctctcagaatcc
cttgtttctaatgggaaagcggccgtgcagcgcagctccagatttctaaatgaaggcgac
aacctcagcaggaagcttccagttattgcattggaactgagtgaactgagaaataagaca
aacagatttcaagagaatgctgatgaaattaccaggcagaccaatgaatcagtcttgata
ctcagagcaattcctgaagatacaagagacaaaggagccaaaaccaaagagctggccacg
tctgcaagccatagcgcggtgagcacgctgagggacgtggcagggctgagccaggagctg
ctgagcacatctgccagcctgtccagggtcaacgccacattacaagagacacaccagctt
ctgcaggactccaccatggccactctgttggctggaaggaaagtcaaagacgtggaaact
caagccaaccttttgtttgatcggttgaagcctttgaagatgttagaagagaatctgagc
agaaacttatcagaaattaaactgctcatcagccaggcccggaaacaagcagcttctatt
aaagtcgccgtgtctgcagacagagattgcatccgggcctaccagcctcagatttcctct
actaactacaacaccttaacactaaatgttaagacacaggaacccgacaatcttctcttc
tacctcggtagcagcacctcttctgatttccttgcagtggagatgcggcgagggaaagtg
gccttcctgtgggacctgggctccggctccacacgcttggagtttccagactttcccatt
gatgacagcagatggcacagtatccatgtggccagatttggaaacattggttcactgagt
gtaaaggaaatgagctcaaatcaaaagccaccagcaaaaacaagtaaatcccctgggaca
gctaatgttctggatgtaaacaattcaacactcatgtttgttggaggtcttggaggacaa
atcaagaaatctcctgctgtgaaggttactcattttaaaggctgcttgggagaggccttc
ctgaatggaaaatccatcggcctatggaactatatcgaaagggaaggcaagtgccacggg
tgctttggaagctcccagaatgaagacgcttccttccattttgacgggagtgggtactct
gtcgtggagaagtcacttccggctaccgtgacccagataatcatgctttttaataccttt
tcacctaatggacttcttctctacctgggttcatacggcataagagactttttatccatc
gagctgttccatggcagagtgaaggttacgacagacctgggttcaggaccccttaccctt
ttgacagacagacgttataacaatggaacctggtacaaaattgccttccagcgaaaccgg
aagcaaggagtcctagcagttaacgatgcctataacaccagtaataaagaaaccaagcag
ggcgagactccgggagcgtcttccgacctcaaccgtctagacaaggatccgatttatgtg
ggtggattaccaaggtcaagagttgtaaggagaggtgtcaccgccaaaagctttgtgggc
tgcatcaagaacctggaaatatccagatcaacctttgacttactccgaaattcctatgga
gtgagaaaaggctgctcactggagcccatccggagtgttagcttcttgaaaggcggctac
attgaattgccacccaagtctttgtcaccagaatcggaatggctggtaacatttgccacc
aagaacagcagtggcatcatcctggctgccctcggcggggatgcggagaagcagcgcgat
cgtgaggagtcacacgtgcccttcttttccatcatgctgattggaggcaacatcgaggta
catgtcaatcctggggatggcacaggcctgagaaaagctctcctgcacgctcccacgggt
acatgcagcgacggacaagcacattccatctccttggtcaggaatcggagaattatcact
gtccaattggatgagaacaatcgtgtggaaatgaagttgggcccattagtagaaagcagg
acgataaatatgtccaatctgtacgtcgggggaattccagagggagaggggacatcgctg
ctcacaacgagaagatcgttccatggctgtatcaaagacctgatctttaatttggaactt
ttggatttcaacagcgcagttggccatgagcaagtcgacttggacacctgctggctgtca
gaaaggcctaagccggctcccgatgcagaggacagcgagctcttgccagagccccgggct
tttccagaacagtgtgtggtggatgcagctctggagtacgttcccggcgcccaccagttt
ggtctctcacaaaacagccatttcctgttgccttttaaccagtcggctgtcagaaagagg
ctctcggttgagctaagtatccgcacgttcgcctccagcggcctgatttactacatggct
catcagaaccaagcagactacgccgtgctccagctgcacgggggccgcctccacttcatg
tttgacctcggcaaaggcagaacaaaggtctctcaccctgcactgctcagcgatggcaag
tggcacacggtcaagacagactatgttaaaagaaaaggcctcataacggttgacggccaa
gagtcccccatggtgaccgtggtgggagacggaaccacgctggatgtggagggtttgttc
tacctaggaggcctgccctcccagtaccgggccaggaaaattggaaatatcacccacagc
atccctgcctgcattggggatgtgacggttaacagcaaacagctggacaaggacagcccg
gtgtccaccttcacggtgaacaggtgctacgcagcggcccaggagggaacgttctttgat
ggaagcggatacgcagctcttgtcaaagagggctacaaagtccagacagatgtgaacatc
acactggagtttcgaacctcctcgcagaatggcgtcctcctggggatcagcactgccaaa
gtggatgccattggactagagcttgtggacggcaaggtcttgttccatgtcaacaatggt
gctggcaggataacagccacgtatgagcccaaaaccgccactgcgctctgtgatggaaaa
tggcacacgcttcaagctaacaaaagcaaacaccgtatcactctgattgtcgatgggaat
gcagttggcgctgaaagtccacacacccggtctacctcggtggacaccaacaatcccatt
tatgttggtggctatcctgctggcgtgaagcaaaaatgcctgagcagccagacctcgttc
cgcgggtgtttgaggaagctagtgctgattaagaggccgcaggtgcaatcctttgacttc
agcagagcgtttgaatggcatggagttttccttcattcctgtcctgggacggagtcctga

DBGET integrated database retrieval system