KEGG   Pongo abelii (Sumatran orangutan): 100174092
Entry
100174092         CDS       T01416                                 

Gene name
VTN
Definition
(RefSeq) vitronectin precursor
  KO
K06251  vitronectin
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04610  Complement and coagulation cascades
pon05165  Human papillomavirus infection
pon05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100174092 (VTN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100174092 (VTN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100174092 (VTN)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    100174092 (VTN)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    100174092 (VTN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100174092 (VTN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100174092 (VTN)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100174092 (VTN)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100174092 (VTN)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100174092 (VTN)
SSDB
Motif
Pfam: Hemopexin Somatomedin_B
Other DBs
NCBI-GeneID: 100174092
NCBI-ProteinID: NP_001127063
Ensembl: ENSPPYG00000008092
UniProt: Q5NVS5
LinkDB
Position
17
AA seq 478 aa
MAPLRPLLILALLAWVALADQESCKGRCTEGFNADKKCQCDELCSYYQSCCTDYTAECKP
QVTRGDVFTMPEDEYSVYDDGEEKNNATVHEQLGGPSLTPDFQAQPEGNPEQAPVLKPEE
EAPAPEVGTSRPEGMDSRPETLHPGRPQPPAEEELCSGKPFDAFTDLKNGSLFAFRGQYC
YELDEKAVRPGYPKLIRDVWGIEGPIDAAFTRINCQGKTYLFKGSQYWRFEDGVLDPDYP
RNISDGFDGIPDNVDAALALPAHSYSGRERVYFFKGKQYWEYQFQHQPSQEECEGSSLSA
VFEHFAMIQRDSWEDIFELLFWGRTSAGTRQPQFISRDWHGVPGQVDAAMAGRIYISGMA
PRPALAKKQKFRRRNRKGYRSHRGHSRGRNQNSRRPSHAMWLSLFSSEDSNLGANNYDDY
RMDWLVPATCEPIQSVFFFSGDKYYRVNLRTRRVDAVDPPYPRSIAQYWLGCPAPGHL
NT seq 1437 nt   +upstreamnt  +downstreamnt
atggcacccctgagaccccttctcatactggccctgctggcatgggttgctctggctgac
caagagtcatgcaagggccgctgcactgagggcttcaacgcggacaagaaatgccagtgt
gacgagctctgctcttactaccagagctgctgcacagactatacggctgagtgcaagccc
caagtgactcgtggggatgtgttcactatgccggaggatgagtactcggtctatgacgat
ggtgaggagaaaaacaatgccaccgtccacgaacagctggggggcccctccctgacccct
gacttccaggcccagcccgaagggaatcctgagcaggcacctgttctgaaacctgaggaa
gaggcccctgcgcctgaggtgggcacctctaggcctgaggggatggactcaaggcctgag
acccttcatccagggagacctcagcccccagcagaggaggagctgtgcagtgggaagccc
ttcgacgccttcactgacctcaagaacggttccctctttgccttccgagggcagtactgc
tatgaactggatgaaaaggcagtgaggcctgggtaccccaagctcatccgagatgtctgg
ggcatcgagggccccatcgatgccgccttcacccgcatcaactgtcaggggaagacctac
ctcttcaagggtagtcagtactggcgctttgaggatggcgtcctggaccctgattacccc
cgaaatatctctgatggcttcgatggcatcccggacaacgtggatgcagccttggccctc
cctgcccatagctacagtggccgggagcgggtctacttcttcaaggggaaacagtactgg
gagtaccagttccagcaccagcccagtcaggaggagtgtgaaggcagctccctgtcggct
gtgtttgaacactttgccatgatacagcgggacagctgggaggacatcttcgagcttctc
ttctggggcagaacctctgctggtaccagacagccccagttcattagccgggactggcac
ggtgtgccagggcaagtggacgcagccatggctggccgcatctacatctcaggcatggca
ccccgccccgccttggccaagaagcaaaagtttaggcgtcgcaaccgcaaaggctaccgt
tcacaccgaggccacagccgtggccgcaaccagaactcccgccggccatcccacgccatg
tggctgtccttgttctccagtgaggacagcaacttgggagccaacaactatgatgactac
aggatggactggcttgtgcctgccacctgtgaacccatccagagtgtcttcttcttctct
ggagataagtactaccgagtcaaccttcgcacacggcgagtggacgctgtggaccctccc
tacccacgctctatcgctcagtactggctgggctgcccagctcctggccatctgtag

KEGG   Pongo abelii (Sumatran orangutan): 100189638
Entry
100189638         CDS       T01416                                 

Gene name
SPP1
Definition
(RefSeq) osteopontin precursor
  KO
K06250  secreted phosphoprotein 1
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04371  Apelin signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04620  Toll-like receptor signaling pathway
pon04929  GnRH secretion
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04371 Apelin signaling pathway
    100189638 (SPP1)
   04151 PI3K-Akt signaling pathway
    100189638 (SPP1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100189638 (SPP1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100189638 (SPP1)
 09150 Organismal Systems
  09151 Immune system
   04620 Toll-like receptor signaling pathway
    100189638 (SPP1)
  09152 Endocrine system
   04929 GnRH secretion
    100189638 (SPP1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100189638 (SPP1)
SSDB
Motif
Pfam: Osteopontin
Other DBs
NCBI-GeneID: 100189638
NCBI-ProteinID: NP_001126162
UniProt: Q5R8A7
LinkDB
Position
4
AA seq 300 aa
MRIAVICFCLLGITYALPVKQADSGSSEEKQLYNKYPDAVATWLKPDPSQKQNLLAPQTL
PSKSNESHDHMDDVDDEDDDDHVDSQDSIDSNDSDDVDDTDDSHQSDESHHSDESDELVT
DFPTDLPATEVFTPVVPTVDTYDGRGDSVVYGLRSKSKKFRRPDIQYPDATDEDITSHME
SEELNGAYKAIPVAQDLNTPSDWDSHGKDSHEMSQLDDQSAETHSHKKSRLYKRKASDES
NEHSDVIDSQELSKVSHEFHSHEFHSHEDMLVVDPKSKEEDKHLKFRISHELDSASSEVN
NT seq 903 nt   +upstreamnt  +downstreamnt
atgagaattgcagtgatttgcttttgcctcttaggcatcacctatgccctaccagttaaa
caggctgattctggaagttctgaggaaaagcagctttacaacaaatacccagatgctgtg
gccacatggctaaagcctgatccatctcagaagcagaatctcctagccccacagaccctt
ccaagtaagtccaacgaaagccatgaccacatggacgatgtggatgatgaagatgatgat
gaccatgtggacagccaggactccattgactcgaacgactctgatgatgtagatgacact
gatgattctcaccagtctgatgagtctcaccattctgatgaatccgatgaactggtcact
gattttcccacggacctgccagcaaccgaagttttcactccagttgtccccacagtagac
acatatgatggccgaggtgatagtgtggtttatggactgagatcaaaatctaagaagttt
cgcagacctgacatccagtaccctgatgctacagacgaggacatcacctcacacatggaa
agcgaggagttgaatggtgcatacaaggccatccccgttgcccaggacctgaacacgcct
tctgattgggacagccatgggaaggacagtcatgaaatgagtcagctggatgaccaaagt
gctgaaacccacagccacaagaagtccagattatataagcggaaggccagtgatgagagc
aatgagcattccgatgtgattgatagtcaggaactttccaaagtcagccatgaattccac
agccatgaatttcacagccatgaagatatgctggttgtagaccctaaaagtaaagaagaa
gataaacacctgaaatttcgcatttctcatgaattagatagtgcatcttctgaggtcaat
taa

KEGG   Pongo abelii (Sumatran orangutan): 100431604
Entry
100431604         CDS       T01416                                 

Gene name
COL6A5
Definition
(RefSeq) collagen alpha-5(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100431604 (COL6A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100431604 (COL6A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100431604 (COL6A5)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100431604 (COL6A5)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100431604 (COL6A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100431604 (COL6A5)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100431604 (COL6A5)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100431604 (COL6A5)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100431604 (COL6A5)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   100431604 (COL6A5)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 VWA_CoxE Copine vWA-TerF-like AhpC-TSA_2 Integrin_beta
Other DBs
NCBI-GeneID: 100431604
NCBI-ProteinID: XP_024100681
LinkDB
Position
3
AA seq 2629 aa
MKILLIIFVLIIWTETLADQSPGPGPVYADVVFLVDSSDHLGPKSFPFVKTFINKMINSL
PVEANKYRVALAQYSDEFHSEFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRT
YFSAPINGRDRKQFPPILVVLASAESEDEVEEASKALQKDGVKIISVGVQEASEENLKAM
ATSHFHFNLRTIRDLSTFSQNMTQIIKDVTKYKEGAVDADMQVPFPISCQKDSLADLVFL
VDESLGTRGNLRHLQTFLENITSSMDVKENCMRLGLMSYSNSAKTISFLKSSTTQSEFQQ
QIKNLSIQVGKSNTGAAIDQMRRDGFSESYGSRRAQGVPQIAVLVTHRPSDDEVHDAALN
LRLEGVTVFALSIQGANNTQLEEIVSYPPEQTISTLKSYADLETYSTKFLKKLQNEIWSQ
ISTYAEQRNLDKTGCVDTKEADIHFLIDGSSSIQEKQFEQIKRFMLEVTEMFSIGPDKVR
VGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGTKDRM
SKVPCYLIVLTDGMSTDRVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEERVSFGQ
NFDALKSIKNEVVREICTEKGCEDMKADIMFLVDSSWSIGNENFRKMKIFMKNLLTKIQV
GADKTQIGVVQFSDKTKEEFQLSRYFTQQEISDAIDRMSLINEGTLTGKALNFVSQYFTH
SKGARLGAKKFLILITDGVAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDGSL
VFHVENFDHLKALERKLVFRVCALHDCKRITLLDVVFVLDHSGSIKKQYQDHMINLTIHL
VKKADVGRNRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA
NALFTEKHGSRIKQNVKQVLIVITDGKSHDHDQLNDTASELRDKGITIFAVGVGKANQKE
LEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTEAPEVCHLQEADVIFLCDGSDRVSNSD
FVTMTTFLSDLIDNFDIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTLIQNVSKSGG
FPRIDFALKKVSNMFNLHAGGRRNAGVPQTLVVITSGDPRYDVADAVRTLKDLGICVLVL
GIGDVYKEQLLPITGNSEKIITFQGFDKLKNVDVKKRIIREICQSCGKTNCFMDIVVGFD
ISTHVQGQPLFQGHPQLESYLPGILEDISSIRGVSCGAGTEAHVSLAFKVNSDQEFPAKF
QIYQKAVFDSLLQVNVSGPTHLNAQFLQSLWDTFKDKSASRGQVLLIFSDGLQSESNIML
ENQSDRLREAGLDALLVVSLNTTAHHEFSSFEFGKRFDYRTHLTIGTRELGKKLSQYLGN
IAERTCCCTFCKCPGIPGHHGTRGLQAMKGSQGLKGSRGHRGEDGDPGVRGDTGPQGDKG
IAGCPGAWGQKGLKGFSGPKGGHGDDGIDGLDGEEGSHGFPGIKGEKGDPGSQGSPGSRG
APGQYGEKGFPGDVGNPGQNSNIKGQKGSKGEQGRQGRTGQKGMQGSPSSRGSTGREGQR
GLRGVSGEPGNPGPTGTLGAEGLRGPQGSQGNPGRKGEKGSQGHKGPQGSPGLMGAKGST
GRPGLLGKKGKPGLPGDLGPVGQTGQRGRQGDSGIPGYGQMGRKGVKGPRGFPGDVGQKG
DIGDPGIPGGPGPKGFRGLALTVGLKGEEGSRGLPGPPGQRGIKGMAGQPVYSQCDLIQF
LREHSPCWKEKCPAYPTELVFALDNSYDVTEESFNKTRDIITSIVNDLNIRENNCPVGAR
VAMVSYNSGTSYLIRWSDYDRKKQLLQQLSQIKYQDTTEPRDVGNAMRFVTRNVFKRTYA
GANVRRVAVFFSNGQTASRSSIITATMEFSALDISPTVFAFDERVFLEAFGFDNTGTFQV
IPVPPNGENQTLERLRRCTLCYDKCFPNACIQEAFLPEDSYMDVAFLIDNSQNIAKDEFK
AVKALVSSVIDNFNIASDPLISDSGDRIALLSYSPWESSRRKMGTVKTEFDFITYNNQLL
MKNHIQTSFQQLNGEATIGRALLWTTENLFPETPYLRKHKVIFVVSAGENYERKEFVKTM
ALRAKCQGYIIFVISLGSTHKDDMEELASYPLDQHLIQLGRIHKPDLNYIAKFLKPFLYS
VRRGFNQYPPPMLEDACRLINLEGENNQNDGFQFITELQEDFSGENGFIGQELNSGRESP
FVKIEDNGSDYLVYLPSQMFEPQKLMINYEKDKKSAEIASLTSGHENYGRKEEPDHTYEP
GDVSLQEYYMDVAFLIDASQRVGSDEFKEVKAFITSVLDYFHIAPAPLTSTLGDRVAVLS
YSPPGYMLNTEECPVYLEFDLVTYNSIHQMKDHLQDSQQLNGDVFIGHALQWTIDNVFVG
TPNLRKNKVIFVISAGETNPVDKEVLRNVSLRAKCQGYSIFVFSFGPKHNDKELEELASH
PLDHHLVQLGRTHKPDWNYIIKFVKPFVHLIRRAINKYPTEDMKATCVNMTSPNPENGGT
ENTVLLLPGIYEIKTENGELFDEFDSQVQHFLVLGNNHSNGSETATDLMQKLYLLFSTEK
LAMKDKEKAHLEEISALVVDKQQEKEGNLEYHRVEDPQVPLFGFSPGGR
NT seq 7890 nt   +upstreamnt  +downstreamnt
atgaagatcctgctaattatatttgtcctaatcatttggactgaaacattggcagaccag
agcccagggccaggccctgtgtatgcagatgtcgtgtttctggtggacagctccgatcat
ctgggacctaagtcgttcccattcgtgaaaacattcatcaacaaaatgatcaacagtctc
cccgtagaggccaacaaataccgtgtagccctggcccagtacagcgacgagtttcacagt
gaattccatctgagcaccttcaaaggcagaagccccatgctgaaccacctcaagaagaac
tttcagttcattggcgggtccctgcagataggaaaggctcttcaggaggctcacaggacc
tatttctctgcacccataaatgggagagacaggaaacagtttcccccaattttggtggtc
ctggcttcggctgagtctgaggatgaagtggaagaggcttcgaaagccctgcagaaagat
ggggtgaaaattatctccgtgggggtgcaggaagcttctgaggaaaatctgaaggccatg
gccacgtcccatttccatttcaaccttcggacaattagagacctcagcacattttcccaa
aacatgacacagatcatcaaagatgtaaccaagtataaggagggagccgtcgatgctgat
atgcaagttcccttccccatatcctgtcagaaagattcacttgctgacctcgtgttcctg
gtggatgagtcacttgggaccagaggaaatttaaggcatcttcagaccttccttgagaac
attaccagctccatggatgtgaaggaaaattgcatgcgacttggactgatgagttacagc
aatagtgccaagactatttcatttcttaaatcaagcacaacccaatctgaatttcagcag
caaatcaagaatctttccatccaagttgggaaatccaatacaggggctgccattgatcag
atgagaagagacggcttctcagagtcatatggcagcagaagagcacaaggagtgcctcag
attgcagtcctggtcacccacagaccatcagatgatgaggtgcatgatgctgcgctgaac
cttcgactggagggtgtaaccgtgtttgccttgagcatccaaggggctaacaatacccag
ttagaagaaatagtgtcttaccctccagaacagacaatttccacgctgaagtcctatgca
gacttagaaacttacagtacaaagttcctgaaaaagctccagaatgaaatatggtcccaa
atttctacttatgctgaacaaaggaatcttgataaaactggctgtgtggatacaaaagag
gctgatatccatttcctcattgatggctcaagcagcatccaggagaaacagtttgagcaa
atcaagagatttatgttggaagtgacagaaatgtttagcattggcccagacaaagtccga
gttggagttgtgcagtattcagatgacacagaagtggaattttatatcactgactattct
aatgacattgacttaagaaaggctatttttaacattaagcaactaactggtggaacttat
actgggaaagctctagattatatactgcaaataataaaaaatggaacgaaggatagaatg
agcaaggttccctgttacctcattgtgttgactgatgggatgtccacagacagagtcgtg
gaacctgctaagagactaagggctgagcaaatcactgttcatgcagttggcattggggca
gctaataaaatagaactgcaagaaattgctgggaaagaagaaagagttagctttgggcag
aactttgatgctttgaaaagcataaaaaatgaagtcgttcgtgaaatctgcactgaaaaa
ggatgtgaagacatgaaggctgacatcatgtttctggtggacagttcttggagtatagga
aatgaaaattttaggaaaatgaaaatcttcatgaaaaacctgttaactaagattcaagtt
ggtgcagacaaaactcagattggtgttgttcagttcagtgataaaactaaggaagagttc
cagcttagtagatactttacacagcaagaaatttctgatgcaatagatagaatgtctctc
atcaatgaaggcactttaactggaaaggcactaaattttgtaagtcaatacttcacccac
tccaagggggcccgtttgggggccaaaaaatttctcatcctcatcacagatggagtagcg
caggatgatgtgagagatcctgctagaattcttcggggcaaagatgtgaccatcttctct
gtaggagtatacaatgccaatagatctcagctagaagagatcagtggggatggcagccta
gtttttcatgttgagaacttcgatcatctaaaggcactagaaaggaaacttgtctttcgt
gtgtgtgctctccatgattgtaaaaggattacactactagacgttgtgtttgtgctggat
cattcaggtagcataaaaaaacaatatcaagatcacatgattaacctaactatccatttg
gtgaagaaagcagatgttggcaggaaccgagttcagtttggagccctcaaatactctgac
caacctaacatccttttctaccttaatacatactccaacagatcagcaataattgagaat
ctgcggatgcgcagggacactggagggaacacctacactgccaaggctctcaagcatgca
aatgccctgtttacagagaaacatggcagccgcatcaagcaaaatgtgaagcaggtgctg
attgtcatcactgatgggaaatcccatgaccatgatcagctcaatgacacagcatcggaa
ctaagagacaaaggcatcaccatctttgcagtgggtgtaggaaaggccaaccaaaaggaa
cttgaggggatggcagggaataaaaacaatactatctacgtagataattttgacaaactg
aaagatgttttcacacttgttcaagaacgtatgtgtactgaagcaccagaggtctgtcat
cttcaggaagctgacgtgattttcctttgcgatggctctgacagggtatctaattcagat
tttgtaaccatgacaactttcttgtcagacttaattgataattttgacattcagtctcaa
agaatgaaaattggtatggctcaatttggaagcaactaccagagtattattgagttgaaa
aactctctgactaaaacccagtggaagactctaattcagaatgtctccaagagcggtgga
tttccgagaatagactttgcccttaaaaaagtgagcaatatgtttaatctacatgctggt
gggagaagaaatgctggtgtcccccagactttggttgttatcacatctggagatcctcgc
tatgatgtggcagatgcagtaagaaccctgaaggaccttggaatttgtgtcctggttttg
ggcataggagatgtttataaggaacagcttctgccaataacaggcaattctgaaaaaata
atcacttttcaaggctttgataaattaaagaatgtggatgtgaaaaaaagaatcatccgt
gaaatctgccagagctgtgggaaaaccaattgctttatggacatagtggttgggtttgac
atctccactcatgtgcaggggcagcctttgttccagggccacccccagctggaatcctac
ctcccaggcatcttagaagacatcagctctatcaggggggtgagctgtggggctggcaca
gaggcacacgtgagcttggcgtttaaagtgaacagtgaccaagaattccctgccaagttc
caaatctatcagaaagcagtgtttgacagcttgctgcaagtcaacgtcagtgggccaact
catctgaacgcacagttcttgcagtctctttgggacacatttaaggataaatctgcatcc
cggggccaggtgctgcttattttttcagacggtctccagagtgaaagcaacataatgctt
gaaaatcaatcagacaggctcagagaagcaggacttgatgctctgctggtagtgtccctt
aacacaactgctcatcatgagttttctagctttgaatttggaaaaagatttgattacagg
actcatctgactattggaacgagagaactgggcaaaaaactatcacagtacctgggaaac
attgcagagaggacttgctgctgtacattctgcaaatgtccaggaattccaggacatcat
gggacccgaggactacaagccatgaagggttctcaaggtctgaaaggcagcagaggacat
aggggagaggatggagaccctggagtacgaggagacacaggaccccaaggagataaaggg
attgcaggatgtccaggggcgtggggtcagaagggactcaaaggattttctggacctaag
ggaggacatggagacgatgggattgatggacttgatggggaagagggctctcatggattt
cctggaataaaaggagaaaaaggtgatccaggatctcagggcagcccaggttccagaggt
gcccctgggcagtatggagagaagggcttcccaggggatgtgggtaatccaggacaaaac
agtaacatcaaaggacaaaagggctccaaaggagaacaaggaagacaaggtagaactgga
cagaaagggatgcaaggcagtcctagttccagaggaagcacgggaagagaaggtcaaagg
ggactccgaggtgtctcaggagaaccaggaaatcctggacctacaggcacattgggagct
gaaggattacgaggcccacaggggtcacagggaaatcctggcagaaaaggagaaaaagga
agccaggggcacaaaggacctcagggttctcctgggctaatgggagctaaagggagcact
ggaagacctggacttttggggaaaaaaggaaagcctggacttcccggagatctagggcca
gtggggcaaactgggcagcgaggaagacagggagattctggcatcccaggctatggtcag
atgggacgaaaaggagtaaagggcccaagaggattccctggagatgtggggcagaagggt
gatattggtgatcctggaattcctgggggacctggacccaaaggatttaggggactagca
ctcactgtaggcctgaaaggtgaagagggatctcgaggactcccaggccctcctggacag
agaggcattaaaggcatggcaggacagcctgtatattctcaatgtgatctgatccagttt
ttgcgggaacatagtccttgctggaaagaaaaatgtccggcatatccaacagagctagta
tttgctctggacaattcctacgatgtaacagaagagagctttaataaaacacgggacatc
atcacttccattgtcaatgaccttaacatcagggaaaataactgtcctgtgggagcaaga
gttgccatggtttcctataactcaggcaccagctatctcatccgttggtctgactacgat
aggaagaagcaactcctccagcagctttcccaaataaaatatcaagacaccacagagccc
cgagatgttggtaatgcaatgaggtttgtgacccgcaacgtgttcaagcggacgtatgca
ggagccaatgtgaggagagttgctgtgttttttagcaatggtcaaacagccagtaggtca
tccatcatcacggccaccatggagtttagtgccctggatatcagtccaacagtctttgct
tttgatgagagagttttccttgaagcttttgggtttgacaacactggaacatttcaggtg
attccagttcctccaaatggggaaaatcaaacattagaaagacttcggcgctgtacactt
tgctatgataaatgttttccaaatgcttgcattcaagaagctttcttacctgaagattca
tacatggatgtagccttcctcatagacaattctcagaatatagcaaaggatgagtttaag
gctgtgaaagccttggtgagctcagtgattgacaacttcaacattgcttcagacccttta
atctcagactctggtgataggattgctttgttgagctattctccttgggaaagttccagg
agaaagatgggtacagtaaaaacagagtttgatttcatcacttataacaaccaactccta
atgaagaatcacatccagacttccttccaacagctaaatggagaagccacaattggtcgt
gccctactgtggactactgaaaatctctttccagaaacaccgtatctaagaaaacacaag
gtcatctttgtggtctcagctggagaaaattatgagagaaaagaatttgtaaaaacgatg
gctttgagggctaaatgtcaaggctacatcatatttgtgatttctctgggctctacacat
aaggatgacatggaggagttagccagctacccacttgatcaacacctgatacagcttggg
agaatacataaaccagatctgaattatattgcgaagttcttaaagccatttttatactcg
gtcaggcggggattcaatcagtacccaccaccgatgcttgaggatgcctgtagactcatc
aatttagaaggagagaataatcaaaatgatggtttccaatttattactgagctacaagag
gatttttcaggagagaatggcttcattggccaagaattaaattctgggagagaatcacct
tttgtaaagatagaggacaatggaagtgactatttggtttaccttccaagccaaatgttt
gagccacaaaaattaatgatcaattatgaaaaagataaaaaatctgcagaaattgcaagt
ctcacttctggacatgaaaattatggcagaaaagaagaaccagatcatacttatgaacct
ggagatgtctctcttcaagaatattacatggatgtggctttcctcatagatgcttcccaa
agagtaggaagtgatgagtttaaggaagtaaaagcttttataacctcagtgcttgattac
tttcacatcgcccctgctccactgacctccaccttaggagacagggttgctgtcctgagt
tactctcccccaggctatatgcttaacactgaagaatgccctgtctacctggaatttgat
ttggttacttataacagtatacaccaaatgaaagatcatctccaagactctcaacagctc
aatggagatgtttttattggccatgccttgcagtggacaattgacaatgtctttgtagga
acccccaatctgaggaaaaacaaagttatctttgtaatatctgctggcgaaaccaaccct
gtagacaaagaagtcttaaggaatgtgtctctgagagccaagtgtcaaggctactccata
tttgtgttttcctttggccctaaacacaatgacaaagaattagaagaattagccagccac
cctctggatcatcacttggtccaacttggccgaacccacaagccagattggaactatatc
atcaagtttgtcaagccatttgtccatttaatcagacgtgccatcaacaaatatcccacc
gaagatatgaaagccacatgtgttaacatgacctctcccaacccagagaacggtggcaca
gaaaacactgtattacttcttcctgggatatatgaaataaagacagaaaatggtgagctg
tttgatgaatttgattcccaggtgcaacatttccttgtattagggaacaatcatagtaat
ggttctgagacagctactgatttgatgcagaagttatacctgctcttttcaactgagaaa
ctggcaatgaaagacaaggaaaaagcacatttagaagaaatttcagctctagtggttgat
aaacagcaagaaaaagaaggtaatcttgagtaccacagagttgaagaccctcaagtacct
ttgtttggattttcccctggtggaagatga

KEGG   Pongo abelii (Sumatran orangutan): 100431991
Entry
100431991         CDS       T01416                                 

Gene name
COL9A3
Definition
(RefSeq) collagen alpha-3(IX) chain
  KO
K08131  collagen type IX alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100431991 (COL9A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100431991 (COL9A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100431991 (COL9A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100431991 (COL9A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100431991 (COL9A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:pon00535]
    100431991 (COL9A3)
Proteoglycans [BR:pon00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   100431991 (COL9A3)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 100431991
NCBI-ProteinID: XP_024094699
LinkDB
Position
20
AA seq 687 aa
MARPPAFARLLLLLLLLLGELLAAAGAQKVGLPGPPGPPGPPGKPGQDGIDGEAGPPGLP
GPPGPKGAPGKPGKPGEAGLPGLPGVDGLTGQDGPPGPKGAPGERGSLGPPGPPGLGGKG
LPGPPGEAGVSGPPGGIGLRGPPGPSGLPGLPGPPGPPGPPGHPGVLPEGATDLQCPSIC
PPGPPGPPGMPGFKGPTGYKGEQGEVGKDGEKGDPGPPGPAGLPGSVGLQGPRGLRGLPG
PLGPPGDRGPIGFRGPPGIPGAPGKAGDRGERGPEGFRGPKGDLGRPGPKGTPGVAGPSG
EPGMPGKDGQNGVPGLDGQKGEAGRNGAPGEKGPNGLPGLPGRAGSKGEKGELGRAGELG
EAGPSGEPGVPGDAGMPGERGEAGHRGSAGALGPQGPPGAPGVRGFQGRKGSMGDPGLPG
PQGLRGDVGDRGPGGAAGPKGDQGIAGSDGLPGDKGELGPSGLVGPKGESGSRGELGPKG
TQGPNGTSGVQGVPGPPGPLGLQGVPGVPGITGKPGVPGKEASEQRIRELCGGMISEQIA
QLAAHLRKPLAPGSIGRPGPAGPPGPPGPPGSIGHPGARGPPGYRGPTGELGDPGPRGNQ
GDRGDKGAAGAGLDGPEGDQGPQGPQGVPGTSKDGQDGAPGEPGPPGDPGLPGAIGAQGT
PGICDTSACQGAVLRGLGGEKSGSRSS
NT seq 2064 nt   +upstreamnt  +downstreamnt
atggcccggccgcccgcgttcgcccggctcctgctcctgctcctgctcctgctcggggag
ctcctggcggccgccggggcgcagaaagtgggactcccaggcccccccggccccccaggg
ccgcccgggaagcccggccaggacggcattgacggagaagctggtcctccaggtctgcct
gggcccccgggaccaaagggggccccaggaaagccggggaaaccaggagaggccgggctg
ccgggactgccgggtgtggatggtctgactgggcaagatggaccccctggacccaagggt
gcccctggggaacggggaagtctgggacccccggggccgcccgggctggggggcaaaggc
ctccctggaccccccggagaggcaggagtgagcggccccccaggtgggatcggcctccgc
ggccccccgggaccttctggactccccggcctccctggccccccaggacctcccggaccc
cctggacacccaggagtcctccctgaaggcgctactgaccttcagtgcccaagtatctgc
ccgccaggtcccccagggccccctggaatgccagggttcaagggacccactggctacaaa
ggcgagcagggggaagtcggcaaggacggcgagaagggtgaccctggccccccggggccc
gccggcctcccgggcagtgtggggctgcagggcccccggggattacgaggactgccaggg
ccacttgggccccctggggaccggggtcccatcgggttccgagggccacctgggatccca
ggagcacctgggaaagcgggtgaccgaggcgagaggggcccagaagggttccgcggcccc
aagggtgacctcggcagacctggtcccaagggaacccccggagtggccgggccaagcgga
gagccgggcatgccgggcaaggacggccagaatggcgtgccaggactcgatggccagaag
ggagaggctggtcgcaacggtgccccgggagagaagggccccaacgggctgccgggcctc
cccggacgagcgggatccaaaggcgagaagggagaactgggcagagctggggagctgggt
gaggctggcccctcaggagagccaggcgtccctggagatgctggcatgcctggggagcgt
ggtgaggctggccaccggggctcagcgggcgccctcggcccacaaggccctcccggagcc
cctggtgtccgaggcttccagggccggaagggcagcatgggagaccctggccttccagga
ccccagggcctccgaggtgacgtgggtgaccggggtccgggaggtgccgcaggccctaag
ggagaccagggtattgcaggttccgacggtcttcctggggataaaggagaactgggtccc
agcggcctggtcggacccaaaggagagtctggcagtcgaggggagctgggccccaaaggc
acccagggtcccaacggcaccagcggtgttcagggtgtccccgggccccccggtcctctg
ggcctgcagggtgtcccgggtgttcctggcatcacggggaagccgggagttccggggaag
gaggccagcgagcagcgcatcagggagctgtgtggggggatgatcagcgaacaaattgca
cagttagccgcgcacctaaggaagcctttggcacccgggtccattggccggcccggtcca
gctggcccccctggacccccaggacccccaggctccattggtcaccctggcgctcgagga
ccccctggataccgcggtcccactggggagctgggagaccctgggcccagaggaaaccag
ggcgacagaggagacaaaggcgcggcaggagcagggctggacgggcctgaaggagaccag
gggccccaaggaccccaaggtgtgcccggcaccagcaaggacggccaggatggtgctccc
ggcgagcctgggcctcccggagatcctgggcttccaggtgccattggggcccaaggaaca
ccggggatctgcgacacctcagcctgccaaggagccgtgttacgggggttggggggggaa
aaatcaggctctagaagctcataa

KEGG   Pongo abelii (Sumatran orangutan): 100432105
Entry
100432105         CDS       T01416                                 

Gene name
IBSP
Definition
(RefSeq) bone sialoprotein 2
  KO
K06253  integrin binding sialoprotein
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100432105 (IBSP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100432105 (IBSP)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100432105 (IBSP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100432105 (IBSP)
SSDB
Motif
Pfam: BSP_II DUF2052 Mpp10 Mitofilin Utp14 Pox_Ag35 SAPS CDC45
Other DBs
NCBI-GeneID: 100432105
NCBI-ProteinID: XP_002815004
Ensembl: ENSPPYG00000014918
UniProt: H2PDV2
LinkDB
Position
4
AA seq 317 aa
MKTALILLSILGMACAFSMKNLHRRVKIEDSEENGVFKYRPRYYLYKHAYFYPHLKRFPV
QGSSDSSEENGDDSSEEEEEEEETSNEGENNEESNEDEDSEAENTTLSATTLGYGEDATL
ATGYPGLAAIQLPKKAGDITSKATKEEESDEEEEEEEEENENEESEAEVDENEQGINGTS
TNSTEAENGNGSSGGDNGEEGEEESVTGANAEGTTAAGGQGKGSSKTTTSPNGGFEPTTP
PQVYRTTSPPFGKTTTVEYEGEYEYTGANEYDNGYEIYESENGEPRGDNYRAYEDEYSYF
KGQGYDGYDGQNYYHHQ
NT seq 954 nt   +upstreamnt  +downstreamnt
atgaagactgctttaattttgctcagcattttgggaatggcctgtgctttctcaatgaaa
aatttgcatcgaagagtcaaaatagaggattctgaagaaaatggggtctttaagtacagg
ccacgatattatctttataagcatgcctacttttatcctcatttaaaacgatttccagtt
cagggcagtagtgactcatctgaagaaaatggagatgatagttcagaagaggaggaggaa
gaagaggagacttcaaatgaaggagaaaacaatgaagaatcgaatgaagatgaagactct
gaggctgagaataccacactttctgctacaacactgggctatggagaggacgccacgctt
gccacagggtatccagggttagctgcaatccagcttcccaagaaggctggggatataaca
agcaaagctacaaaagaggaggaaagtgatgaagaagaagaggaggaagaggaagaaaat
gaaaacgaagaaagcgaagcagaagtggatgaaaacgaacaaggcataaacggcaccagt
accaacagcacagaggcagaaaacggcaacggtagcagcggaggagacaatggagaagaa
ggggaagaagaaagtgtcactggagccaatgcagaaggcaccacagcggccggagggcag
ggcaagggcagctctaagacaacaacctctccaaacggtgggtttgaacctacaacccca
ccacaagtctatagaaccacttccccaccttttgggaaaaccaccactgttgaatatgag
ggggagtacgaatacacgggcgccaatgaatacgacaatggatatgaaatctatgaaagt
gagaacggggaacctcgtggggacaattaccgagcctacgaagatgagtacagctacttt
aaaggacaaggctacgacggctatgatggtcagaattactaccaccaccagtga

KEGG   Pongo abelii (Sumatran orangutan): 100433193
Entry
100433193         CDS       T01416                                 

Gene name
COL2A1
Definition
(RefSeq) collagen alpha-1(II) chain isoform X2
  KO
K19719  collagen type II alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100433193 (COL2A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100433193 (COL2A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100433193 (COL2A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100433193 (COL2A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100433193 (COL2A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100433193 (COL2A1)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100433193 (COL2A1)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 100433193
NCBI-ProteinID: XP_002823196
Ensembl: ENSPPYG00000004451
UniProt: A0A2J8WD11
LinkDB
Position
12
AA seq 1418 aa
MIRLGAPQTLVLLTLLVAAVLRCQGQDVRQPGPKGQKGEPGDIKDIVGPKGPPGPQGPAG
EQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFDE
KAGGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPG
KPGDDGEAGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKG
ESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPG
APGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAG
APGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEG
KRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPG
RPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKG
ANGEPGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPG
PPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDG
PKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTG
PIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKG
EQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPG
SNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEG
PPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTG
PAGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRG
EAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPGPSG
DQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAGPPGNPGPPG
PPGPPGPGIDMSAFAGLGPREKGPDPLQYMRADQAAGGLRQHDAEVDATLKSLNNQIESI
RSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGQTCVYPNPA
NVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNI
TYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTV
IEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL
NT seq 4257 nt   +upstreamnt  +downstreamnt
atgattcgcctcggggctccccagacgctggtgctgctgacgctgctcgtcgccgctgtc
cttcggtgtcagggccaggatgtccggcaaccaggaccaaagggacagaaaggagaacct
ggagacatcaaggatattgtaggacccaaaggacctcctgggcctcagggacctgcaggg
gaacaaggccccagaggggatcgcggtgacaaaggtgaaaaaggtgcccctggacctcgt
ggcagagatggagaacctgggacccctggaaatcctggcccccctggtcctcccggcccc
cctggtccccctggtcttggtggaaactttgctgcccagatggctggagggtttgatgaa
aaggctggtggcgcccagatgggagtaatgcaaggaccaatgggccccatgggacctcga
ggacctccaggccctgcaggtgctcctgggcctcaaggatttcaaggcaatcctggtgaa
cctggtgaacctggtgtctctggtcccatgggtccccgtggtcctcctggtccccctgga
aagcctggtgatgatggtgaagctggaaaacctggaaaagctggtgaaaggggtccgcct
ggtcctcagggtgctcgtggtttcccaggaaccccaggccttcctggtgtcaaaggtcac
agaggttatccaggcctggatggtgctaagggagaggcgggtgctcctggtgtgaagggt
gagagtggttccccgggtgagaacggatctccgggcccaatgggtcctcgtggcctgcct
ggtgaaagaggacggactggccctgctggcgctgcgggtgcccgaggcaacgatggtcag
ccaggccccgcagggcctccgggtcccgtcggtcctgctggtggtcctggcttccctggt
gctcctggagccaagggtgaagccggccccactggtgcccgtggtcctgaaggtgctcaa
ggtcctcgcggtgaacctggtactcctgggtcccctgggcctgctggtgcctctggtaac
cctggaacagatggaattcctggagccaaaggatctgctggtgctcctggcattgctggt
gctcctggcttccctgggccacggggccctcctggtcctcaaggtgcaactggtcctctg
ggcccgaaaggtcagacgggtgaacctggtattgctggcttcaaaggtgaacaaggcccc
aagggagaacctggccccgctggcccccagggagcccctggacccgctggtgaagaaggc
aagagaggtgcccgtggagagcctggtggcgttgggcccatcggtccccctggagaaaga
ggtgctcccggcaaccgcggtttcccaggtcaagatggtctggcaggtcccaagggagcc
cctggagagcgagggcccagtggtcttgctggccccaagggagctaacggtgaccctggc
cgtcctggagaacctggccttcctggagcccggggtctcactggccgccctggtgatgct
ggtcctcaaggcaaagttggcccttctggagcccctggtgaagatggtcgtcctggacct
ccaggtcctcagggggctcgtgggcagcctggtgtcatgggtttccctggccccaaaggt
gccaatggcgagcctggcaaagctggtgagaagggactgcctggtgctcctggtctgagg
ggtcttcctggcaaagatggtgagacaggtgctgcaggaccccctggccctgctggacct
gctggtgaacgaggcgagcagggtgctcctgggccatctgggttccagggacttcctggc
cctcctggtcccccaggtgaaggtggaaaaccaggtgatcagggtgttcccggtgaagct
ggagcccctggcctcgtgggtcccaggggtgaacgaggtttcccaggtgaacgtggctct
cccggtgcccagggcctccagggtccccgtggcctccccggcactcctggtactgatggt
cccaaaggtgcatctggcccagcaggcccccctggggctcagggccctccaggtcttcag
ggaatgcctggcgagaggggagcagctggtatcgctgggcccaaaggtgacaggggtgat
gttggtgagaaaggccctgagggagcccctggaaaggatggtggacgaggcctgacaggt
cccattggcccccctggcccggctggtgctaatggcgagaagggagaagttggacctccc
ggtcctgcaggaagtgctggtgctcgtggcgctccgggtgaacgtggagagactgggccc
cccggaccagcgggatttgctgggcctcctggtgctgatggccagcctggggccaagggt
gagcaaggagaggctggccagaaaggcgacgctggtgcccctggtcctcagggcccctct
ggagcgcctgggcctcagggtcctactggagtgactggtcctaaaggagcccgaggtgcc
caaggccccccgggagccactggattccctggagctgctggccgcgttggacccccaggc
tccaatggcaaccctggaccccctggtccccctggtccttctggaaaagatggtcccaaa
ggtgctcgaggagacagcggcccccctggccgagctggtgaacccggtctccaaggtcct
gctggaccccctggcgagaagggagagcctggagatgacggtccctctggtgccgaaggt
ccaccaggtccccagggtctggctggtcagagaggcatcgtcggtctgcctgggcagcgt
ggtgagagaggattccctggcttgcccggcccatcgggtgagcccggcaagcagggtgct
cctggagcatctggagacagaggtcctcctggccccgtgggtcctcctggcctgactggt
cctgcaggtgaacctggacgagagggaagccccggtgctgatggccctcctggcagagat
ggcgccgctggagtcaagggtgatcgtggtgagactggtgctgtgggagctcctggagcc
cctgggccccctggctctcctggccccgctggtccaactggcaagcaaggagacagagga
gaagctggtgcacaaggccccatgggaccctcaggaccagctggagcccggggaatccag
ggtcctcaaggcccccgaggtgacaaaggagaggctggagagcctggcgagagaggcctg
aagggacaccgtggcttcactggtctgcagggtctgcccggccctcctggtccttctgga
gaccaaggtgcttctggtcctgctggtccttctggccctagaggtcctcctggccccgtc
ggtccctctggcaaagatggtgctaatggaatccctggccccattgggcctcctggtccc
cgtggacgatcaggcgaaaccggccctgctggtcctcccggaaatcctggaccccctggt
cctccaggtccccctggccctggcatcgacatgtccgcctttgctggcttaggcccgaga
gagaagggccctgaccccctgcagtacatgcgggctgaccaggcagctggtggcctgaga
cagcatgatgccgaggtggatgccacactcaagtccctcaacaaccagattgagagcatc
cgcagccccgagggctcccgcaagaaccctgctcgcacctgcagagacctgaaactctgc
cacccggagtggaagagtggagactactggattgaccccaaccaaggctgcaccttggac
gccatgaaggttttctgcaacatggagactggccagacttgcgtctaccccaacccagca
aacgttcccaagaagaactggtggagcagcaagagcaaggagaagaaacacatctggttt
ggagaaaccatcaatggtggcttccatttcagctatggagatgacaacctggctcccaac
actgccaacgtccagatgaccttcctacgcctgctgtccacggaaggctcccagaacatc
acctaccactgcaagaacagcattgcctacctggacgaagcagctggcaacctcaagaag
gccctgctcatccagggctccaacgacgtggagatccgggcagagggcaatagcaggttc
acgtacactgccctgaaggatggctgcacgaaacacaccggtaagtggggcaagactgtc
attgagtaccggtcacagaagacttcacgcctccccatcattgacattgcacccatggac
ataggagggcccgagcaggaattcggtgtggacatagggccagtctgcttcttgtaa

KEGG   Pongo abelii (Sumatran orangutan): 100434204
Entry
100434204         CDS       T01416                                 

Gene name
LAMB3
Definition
(RefSeq) laminin subunit beta-3
  KO
K06244  laminin, beta 3
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100434204 (LAMB3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100434204 (LAMB3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100434204 (LAMB3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100434204 (LAMB3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100434204 (LAMB3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100434204 (LAMB3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100434204 (LAMB3)
   05145 Toxoplasmosis
    100434204 (LAMB3)
SSDB
Motif
Pfam: Laminin_N Laminin_EGF Laminin_II F5_F8_type_C BNR_6
Other DBs
NCBI-GeneID: 100434204
NCBI-ProteinID: XP_024090484
Ensembl: ENSPPYG00000000258
LinkDB
Position
1
AA seq 1172 aa
MRPFFLLCFALPGLLHAQQACSRGACYPPVGDLLVGRTRFLRASSTCGLTKPETYCTQYG
EWQMKCCKCDSRQPHNYYSHRVENVVSSSGPMRWWQSQNDVNPVSLQLDLDRRFQLQEVM
MEFQGPMPAGMLIERSSDFGKTWRVYQYLAADCTSTFPRVRQGRPQSWQDVRCQSLPQRP
NARLNGGKVQLNLMDLASGIPATQSQKIQELGEITNLRVNFTRLAPVPQRGYHPPSAYYA
VSQLRLQGSCFCHGHADRCAPKPGASAGPSTAVQVHDVCVCQHNTAGPNCERCAPFYNNR
PWRPAEGQDAHECQRCDCNGHSETCHFDPAVFAASQGAYGGVCDNCRDHTKGKNCERCQL
HYFRNRRPGASIQETCIPCECDPDGAVPGAPCDPVTGQCVCKEHVQGERCDLCKPGFTGL
TYANPQGCHRCDCNILGSRRDMPCDEESGRCLCLPNVVGPKCDQCAPYHWKLASGQGCEP
CACDPHNSLSPQCNEFTGQCPCREGFGGLTCSAAAIRQCPDRTYGDAATGCRACDCDFRG
TEGPGCDKASGRCLCRPGLTGPRCDQCQRGYCNRYPVCVACHPCFQTYDADLREQALRLG
SLRNATSSLWSGPGLEDRGLASRILDAKSKIEQIRAVLSSATVTEQEVAQVASAILSLRR
TLQGLQPDLPLEEETLSLLRDLESLDRSFNGLLTMYQRKREQFEKISSADPSGAFRMLTT
AYEQSAQAAQQASDSSLLLDQLRDSRREAERLVQQAGGGGGTGSPKLVALRLEMSSLPDL
TPTFNKLCGNSRQLACTPVSCPGELCPQDNGTACGSHCRGVLPRAGGAFLMAGQVAEQLR
GFNAQLQQTRQMIRAAEESASQIQSNAQHLETQVSASRSQMEEDVRRMRLLIQQVRDFLT
DPDTDAATIQEVSEAVLALWLPTDSATVLQKMNEIQAIAARLPNVDLVLSQTKQDIARAR
RLQAEAEEARSRAHAVEGQVEDVVGNLRQGTVALQEAQDTMQGTSRSLRLIQDRVAEVQQ
VLRPAEKLVTSMIKQLGDFWTRMEELRRQARQQGAEAVQAQQLAEGASEQALSAQEGFER
IKQKYAELKDRLGQSAMLGEQGARIQSVKTEAEELFGETMEMMDRMKDMELELLRGSQAI
MLRSADLTGLEKRVEQIRDHINGRVLYYATCK
NT seq 3519 nt   +upstreamnt  +downstreamnt
atgagaccattcttcctcttgtgttttgccttgcctggcctcctgcatgcccagcaagcc
tgctcccgtggggcctgctatccacctgttggggacctgcttgttgggaggacccggttt
ctccgagcttcatctacctgtggactgacaaagcctgagacctattgcacccagtatggc
gagtggcagatgaaatgctgcaagtgtgactccaggcagcctcacaactactacagtcac
cgagtagagaatgtggtttcatcctccggccccatgcgctggtggcagtcacagaatgat
gtgaaccctgtatctctgcagctggacctggacaggagattccagcttcaggaagtcatg
atggagttccaggggcccatgcctgccggcatgctgattgagcgctcctcagacttcggt
aagacctggcgagtgtaccagtacctggctgccgactgcacctccaccttccctcgggtc
cgccagggtcggcctcagagctggcaggatgttcggtgccagtccctgccccagaggcct
aatgcacgcctaaatggggggaaggtccaacttaaccttatggatttagcatccgggatt
ccagcaactcaaagtcaaaaaattcaagagctgggggagatcacaaacttgagagtcaat
ttcaccaggctggcccctgtgccccaaaggggctaccaccctcccagtgcctactatgcc
gtgtcccagctccgtttgcaggggagctgcttctgtcatggccatgctgatcgctgcgca
cccaagcctggggcctctgcaggcccctccaccgctgtgcaggtccacgatgtctgtgtc
tgccagcacaacactgccggcccaaattgtgaacgctgtgcacccttctacaacaacagg
ccctggagacctgcagagggccaggacgcccatgaatgccaaaggtgcgactgcaatggg
cactcagagacatgtcactttgaccccgctgtgtttgccgccagccaaggggcatatgga
ggtgtgtgtgacaattgccgggaccacaccaaaggcaagaactgtgagcggtgtcagctg
cactatttccggaaccggcgcccgggagcttccattcaggagacctgcatcccctgcgag
tgtgatccagatggggcagtgccaggggctccctgtgacccagtgactgggcagtgtgtg
tgcaaggagcatgtgcagggagagcgctgcgacctatgcaagccgggctttactggactc
acctacgccaacccgcagggctgccaccgctgtgactgtaacatcctggggtcccggagg
gacatgccgtgtgacgaggagagtgggcgctgcctttgtctgcccaacgtggtgggtccc
aaatgtgaccagtgtgctccttaccactggaagctggccagcggccagggctgtgaacca
tgtgcctgcgacccgcacaactccctcagcccacagtgcaacgagttcacagggcagtgc
ccctgtcgggaaggctttggtggcctgacgtgcagcgctgcagccatccgccagtgtcca
gaccggacctacggagacgcggccacaggatgccgagcctgtgactgtgatttccgggga
acagagggcccgggctgcgacaaggcatcgggccgctgcctctgccgccctggcttgacc
gggccccgctgtgaccagtgccagcgaggctactgcaaccgctacccagtgtgcgtggcc
tgtcacccttgcttccagacctatgatgcggacctccgggagcaggccctgcgccttggt
agcctccgcaatgccacctccagtctgtggtcagggcctgggctggaggaccgtggcctg
gcatcccggatcctagatgcaaagagtaagattgagcagatccgagcagttctcagcagc
gccacagtcacagagcaggaggtggctcaggtagccagtgccatcctctccctcaggcga
actctccagggcctgcagccggatctgcccctagaggaggagacgttgtcccttctgaga
gacctggagagtctggacagaagcttcaatggtctcctcactatgtatcagaggaagagg
gagcagtttgaaaaaataagcagtgctgatccttcaggagccttccggatgctgaccaca
gcctacgagcagtcagcccaggcggctcagcaggcctccgacagctcgctccttttggac
cagctcagggacagccggagagaggcagagaggctggtgcagcaggcgggaggaggagga
ggcaccggcagccccaagcttgtggccctgaggctggagatgtcttcgttgcctgacctg
acacccaccttcaacaagctctgtggcaactccaggcagttggcttgtaccccagtatca
tgccctggtgagctgtgtccccaagacaatggcacagcttgtggctcccactgcaggggt
gtccttcccagggccggtggggccttcttgatggcagggcaggtggctgagcagctgcgg
ggtttcaatgcccagctccagcagaccaggcagatgattagggcagccgaggaatctgcc
tcacagattcaatccaatgcccagcacttggagacccaggtgagcgccagccgctcccag
atggaggaagatgtcagacgcatgcggctgctaatccagcaggtccgagatttcctgaca
gaccccgacactgatgcagccactatccaggaggtcagcgaggccgtgctggccctgtgg
ctgcccacagactcagctactgttctgcagaagatgaatgagatccaggccattgcagcc
aggctccccaacgtggacttggtgctgtcccagaccaagcaggacattgcgcgtgcccgc
cggttgcaggctgaggctgaggaagccaggagccgagcccatgcagtggagggccaggtg
gaggatgtggttgggaacctgcggcaggggacagtggcactgcaggaagctcaggacacc
atgcaaggcaccagccgctcccttcggcttatccaggacagggttgctgaggttcagcag
gtactgcggccagcagaaaagctggtgacaagcatgatcaagcagctgggtgacttctgg
acacggatggaggagctccgccgccaagcccggcagcagggggcagaagcagtccaggcc
cagcagcttgcggaaggtgccagcgagcaggcattgagtgcccaagagggatttgagaga
ataaaacaaaagtatgctgagttgaaggaccggttgggtcagagtgctatgctgggtgag
cagggcgcccggatccagagtgtgaagacagaggcagaggagctgtttggggagaccatg
gagatgatggacaggatgaaagacatggagttggagctgctgcggggcagccaggccatc
atgctgcgctcggcggacctgacaggactggagaagcgtgtggagcagatccgtgaccac
atcaatgggcgcgtgctctactatgccacctgcaagtga

KEGG   Pongo abelii (Sumatran orangutan): 100434628
Entry
100434628         CDS       T01416                                 

Gene name
COL6A6
Definition
(RefSeq) collagen alpha-6(VI) chain
  KO
K06238  collagen type VI alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100434628 (COL6A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100434628 (COL6A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100434628 (COL6A6)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100434628 (COL6A6)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100434628 (COL6A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100434628 (COL6A6)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100434628 (COL6A6)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100434628 (COL6A6)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100434628 (COL6A6)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   100434628 (COL6A6)
SSDB
Motif
Pfam: VWA VWA_2 Collagen Radical_SAM Integrin_beta DUF5035
Other DBs
NCBI-GeneID: 100434628
NCBI-ProteinID: XP_024100649
LinkDB
Position
3
AA seq 2264 aa
MKMLLILFLVIICSHISVNQDSGPEYADVVFLVDSSDRLGSKSFPFVKMFITKMISSLPI
EADKYRVALAQYSDKLHSEFHLSTFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYF
SAPANGRDKKQFPPILVVLASSESEDDVEEASKALQKDGVKIISVGVQKASEENLKAMAT
SQFHFNLRTVRDLSMFSQNMTHIIKDVTKYKEGAVDDIFVEACQGPSMADVVFLLDMSIN
GSEENFDYLKGFLEESVSALDIKENCMRVGLVAYSNETKVINSLSMGINKSEVLQHIQNL
SPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREG
VTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFS
ERTETLKSGCVDTEEADIYLLIDGSGSTQATDFHEMKTFLSEVVGMFNIAPHKVRVGAVQ
YADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNKVPC
HLVVLTNGMSKDSILEPANRLREEHIRVYAIGVKEANQTQLREIAGEEKRVYYVHDFDAL
KDIRNQVVQEICTEEACKEMKADIMFLVDSSGSIGPENFSKMKTFMKNLVSKSQIGPDRV
QIGAVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGAR
PNVRKFLILITDGEAQDIVKEPAIALRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVE
NFDILQRVEDDLVFGICSPREECKRIEVLDVVFVIDSSGSIDYDEYNIMKDFMIGLVKKA
DVGKNQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGNTYTAEALGFSDHMF
TEARGSRLNKGVPQVLIVITDGESHDADKLNATAKALRDKGILVLAVGIAGANPVELLAM
AGSSDKYFFVETFGGLKGIFSDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQQNDFKKM
KEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGAFIGEKEISFQIENIKQIFGNTHI
GAALRKVEHYFRPDMGSRINTGTPQVLLVLTDGQSQDEVAQAAEALRHRGIDIYSVGIGD
VDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNICTTGGESNCFVDVVVGFDVSTQEK
GQTLLEGQPWMETYLQDILRAVSSLNGVSCEVGTETQVSVAFQVTNAMEKYSPKFEIYSE
NILNSLKDITVKGPSLLNANLLDSLWDTFQNKSAARGKVVLLFSDGLDDDVEKLEQKSDE
LRKEGLNALITVALDGPADSSDLADLPYIEFGKGFEYRTQLSIGMRDLGSRLSKQLVNVA
ERTCCCLFCKCIGRDGTMGDPGPPGKRGPPGFKGSEGYLGEEGIVGERGAPGPVGEQGTK
GCYGTKGPKGNRGLNGQEGEVGETGIDGLNGEQGDNGLPGRKGEKGDEGSQGSPGKRGTP
GDRGAKGLRGDPGAPGVDNSIEGPTGLKGERGRQGRRGWPGPPGTPGSRRKTAAHGRRGH
TGPQGTAGIPGPDGLEGSPGLKGPQGPRGEAGVKGEKGGVGSKGPQGPPGPGGEAGNQGR
LGSQGNKGEPGDLGEKGAVGFPGPRGLQGNDGSPGYGSVGRKGAKGQEGFPGESGPKGEI
GDPGGPGETGLKGARGKMISAGLPGEMGSPGEPGPPGRKGVKGAKGLASFSTCELIQYVR
DRSPGRHGKPECPVHPTELVFALDHSRDVTEQEFERMKEMMAFLVRDIKVRENSCPVGAR
IAILSYNSHARHLVRFSDAYKKSQLLREIEAIPYERSSASREIGRAMRFISRNVFKRTLP
GAHTRKIATFFSSGQSADAHSVTTAAMEFGALEIIPVVITFSNVPSVRRAFAIDDTGTFQ
VIVVPSGADYIPALQRLQRCTFCYDVCRPDASCDQARPPPVQSYMDAAFLLDGSRNVGSA
EFEDIRAFLGALLDHFEITPEPETSVTGDRVALLSHAPPDFLPNTQKSPVRAEFNLTTYR
SKHLMKRHVQESVKQLNGDAFIGHALQWTLDNVFLSTPNLRRNKVIFVISAGETSHLDGE
ILKKESLRAKCQGYALFVFSLGPNWDDKELEDLASHPLDHHLVQLGRIHKPDHSYGVKFV
KSFINSIRRAINKYPPINLKIKCNRLNSIDPKQPPRPFQSFVPGPLKATLKEDVLQKAKF
FQDKKYLSRVARSGRDGAIQNFIRSTSHTFKNGRRIESAPKQHD
NT seq 6795 nt   +upstreamnt  +downstreamnt
atgaaaatgttgctaattttgttcctcgtgataatttgttcccatatttctgtgaatcaa
gattctggccctgagtatgcagatgtcgtgtttttggtggacagctccgatcgcctggga
tccaagtccttcccatttgtgaaaatgttcatcaccaaaatgatcagcagtctccccata
gaggccgacaaataccgtgtggccctggcccagtacagtgataaacttcacagtgaattc
cacctgagcaccttcaaaggcaggagccccatgctgaaccacctcaggaagaactttgga
ttcattggcgggtccctgcagataggaaaggctcttcaggaggctcacaggacttatttc
tctgcacccgcaaatgggagagacaagaaacagtttcccccaattctagtggtcctggct
tcatctgagtctgaggatgatgtggaagaggcttcgaaggccctgcagaaagacggagtg
aaaatcatctctgtaggggtgcagaaagcttctgaggaaaacctgaaggccatggccacg
tctcagtttcatttcaaccttcggacagtcagagacctcagcatgttttcccaaaacatg
acgcacatcatcaaggatgtaacaaagtacaaggagggagcagttgatgacatctttgta
gaagcttgccaaggcccgtctatggccgatgttgtgttcctattggatatgtcaatcaat
ggaagtgaggagaactttgactatcttaaaggattcttggaagaaagtgtatctgccctt
gacataaaggaaaattgcatgagggttggccttgtggcctatagcaatgagacaaaagtg
ataaattcactgagcatgggcataaataagtcagaggttctccagcatatacagaacctt
tctccccgaactgggaaggcctatactggagctgccatcaaaaagctcaggaaggaagtt
tttagtgcacggaatggcagtcggaagaatcagggggtgccccagattgctgtgctggtg
acccaccgagattcagaagacaacgtgacaaaagcagctgttaacctccgacgggagggt
gtgaccatcttcaccctgggcatagagggcgccagcgacacccagttggaaaagatagcg
tcccaccctgctgagcagtatgtctccaaactgaagaccttcgctgacctggctgctcac
aaccagacatttctgaagaagctgcggaaccaaataacacacacagtctctgtcttttca
gagaggactgaaacgctcaaatctggttgtgtggacactgaggaagcagacatctatctg
cttatcgatggctcagggagcacccaggccacagatttccatgaaatgaagaccttcctg
tcagaggtggtagggatgttcaacattgctccccataaggtgcgggttggggccgttcag
tatgctgacagctgggacttggaatttgagatcaataaatactctaacaagcaggatttg
ggaaaggccattgagaatatcaggcagatgggtgggaatacaaacacaggcgcagcactg
aatttcacactgagtctgttgcaaaaagcaaagaagcagcgaggaaacaaagttccatgc
caccttgttgttctgacaaatggcatgtccaaggatagcatcttggagcctgcaaacaga
ctgagagaagagcacatccgagtttatgctattggtgtcaaggaggccaaccaaacacag
ctgcgagaaattgcaggagaggaaaagagagtgtattacgtgcatgactttgatgcattg
aaagacataagaaaccaagttgttcaagaaatctgtactgaagaagcttgcaaagagatg
aaagctgacatcatgtttctggtggacagttctggaagtataggacctgaaaacttcagc
aaaatgaaaacatttatgaaaaacctggtgagcaagtctcagattggaccagatcgggtg
caaattggtgcagtccagttcagcgacatcaataaggaagagtttcagctcaacagattc
atgtcccaaagcgacatttcaaatgcaatagaccaaatggctcacattggacaaaccacc
ctgactggtagtgccctgagctttgtgtctcagtacttcagccccaccaagggcgcccgg
cccaacgtcagaaagtttctcatcctcatcacggatggtgaagctcaggacatagtaaag
gaaccagcgatagcgcttcggcaagaaggtgtaatcatctattctgtgggagtgtttggc
tccaatgtcacccagcttgaggagatcagtgggaggcccgagatggttttttatgttgag
aattttgatattctgcagcgcgttgaagatgatcttgtttttggaatatgcagcccccgt
gaagaatgcaaacggattgaagttttagatgttgtgtttgtcattgatagctctggcagc
attgactatgatgagtataatatcatgaaggattttatgattggcttagtgaaaaaagct
gatgtgggcaagaatcaggtccggtttggggctctgaagtatgctgatgacccagaggtg
ctgttttatctggatgactttggcacaaaactggaggtaatttcagtgctccagaatgac
caagccatgggtggcaatacttatactgctgaggcactgggcttctcagaccacatgttc
actgaagcccggggcagccgcctgaacaagggggtcccccaagtcctcattgtgatcacc
gatggggaatcccatgatgctgataaactcaatgccacggcaaaggccttgcgggacaaa
ggcattcttgtcctggctgtggggattgctggtgccaatcccgtggagctgttagccatg
gcaggatcaagtgacaagtacttcttcgtggagacttttggaggtctgaagggaatattt
tcagatgtgacagccagtgtctgcaactcttcaaaagtagattgtgaaattgacaaagta
gatcttgttttccttatggatggttcaactagcattcagcaaaatgacttcaagaaaatg
aaggaatttctggcatctgttgttcaagactttgatgtcagcctcaacagagtgcgtata
ggagcggcccagtttagcgatacctatcacccagagtttccactgggagctttcataggt
gaaaaagagatatcgtttcagattgaaaacatcaagcagatctttggaaacacacacatc
ggtgctgcactcaggaaggtggaacattacttcaggccagacatgggcagcaggataaat
acaggtaccccacaggtgctgctggtccttacagatggccagtcccaagacgaggtggcc
caggccgcggaagccctgagacacagaggtatcgacatctactccgtgggcattggggat
gtggatgaccagcagctcattcagatcaccgggactgcagagaaaaaactgacagtgcac
aacttcgatgaactgaagaaggtcaataaaaggatcgttcgcaacatctgtaccacaggg
ggtgaaagcaactgtttcgtggatgttgtggtgggatttgatgtctcaactcaggagaaa
gggcagactttgcttgaaggtcagccttggatggaaacctaccttcaagacatcttacgt
gccgtcagctccctcaatggagtaagctgtgaggtgggcacagagactcaggtcagtgta
gcttttcaagtgaccaatgccatggaaaaatattctcccaagtttgagatctacagtgaa
aacattctgaatagcttgaaggatataacagttaaaggaccatctcttctcaatgcaaac
ctcttggattctctatgggatacatttcagaataaatcggctgctcgaggaaaggtggtc
cttttattttcagatggattggatgatgatgttgagaaacttgaacaaaaatctgatgaa
cttagaaaagaaggcctgaatgccctcataactgttgctctggatggacctgctgattca
agtgacttggctgatcttccctatattgaatttgggaaaggatttgagtacaggacacag
ctctctattggcatgagagatcttggaagccggctgtcaaagcagctggtcaatgttgct
gaaaggacatgctgctgtttgttctgcaagtgcattggaagagatggcacaatgggagat
cctggaccaccagggaaaaggggacctccaggttttaaaggcagtgaaggctacctggga
gaggagggaatcgttggagaaagaggagcccctggaccagtgggagagcaaggtactaag
ggatgctatggcaccaaaggtcctaagggaaacaggggactaaatggacaggagggagaa
gttggggaaactggaattgatggattaaatggagaacagggtgataatggtcttcctgga
agaaaaggagaaaagggtgatgagggatcccagggaagcccagggaagagagggactcct
ggtgaccgtggagccaagggcctgcgaggggatcccggagctcctggagttgataatagc
atagaaggacccacaggcttgaaaggagaacgtggaagacaaggtagaagaggctggcca
ggcccccccgggacaccaggctccagaagaaagacagcagctcatggcagaaggggacat
acaggcccacagggaacagcaggcatcccaggaccagatggacttgaaggctccccggga
cttaagggccctcagggcccaagaggagaggctggtgtgaaaggagaaaagggaggtgtg
ggaagtaaaggtccccaggggcctccaggacccggaggagaggcagggaatcaaggccgt
ttgggaagccaaggaaataaaggagaacctggagatctgggagaaaaaggagctgttggc
tttcctggtcctcgtggcttgcagggcaatgatggcagtccaggttatggtagtgtcgga
cgcaagggagcaaagggacaagaaggattccctggagaaagtggacctaagggtgagatt
ggggaccctggtggtccaggagagactgggctgaagggagctagaggcaaaatgatatct
gctgggcttccaggagagatgggatcccctggggaaccaggacctcctggacgtaagggt
gtgaaaggagccaaaggcttggcttcattttctacatgtgaactcattcagtatgtgcga
gaccgcagtcctggcaggcatggaaaaccggaatgcccagtgcacccaaccgagttggtg
tttgccctggaccactcccgggatgtcactgagcaggaatttgaacggatgaaggagatg
atggctttcctggtgagagacattaaggtccgggagaacagctgccccgtgggagcgcgc
atcgccatcctctcctataactcccacgccaggcacctcgtgcgcttctcagacgcctac
aagaagagtcaacttctcagagaaattgaagctattccttatgagagatcctctgccagc
cgggagattggcagagcaatgcggtttatttccaggaatgtcttcaagcggacgcttccg
ggggcacacacgagaaaaatcgccacatttttcagcagcggtcagtccgcggatgcccat
tccgtcaccacggccgccatggagttcggcgcacttgaaatcattcccgtggtgatcact
ttcagcaacgtgccctcggtcaggcgcgcatttgcgattgacgacactggcacatttcaa
gtaatagtggttccctccggggccgactacataccagcattacagagactccagcggtgc
actttctgctatgatgtatgcaggccagatgcttcttgtgaccaagccagaccaccccct
gtgcagtcttacatggatgctgctttccttctggatggctcccggaatgtgggaagtgct
gaatttgaagacataagagccttccttggagcactattagatcactttgaaatcacccca
gagccggagacttctgtcaccggagaccgggtggccctattgagccatgctccccccgac
ttcctacccaacactcagaagagtccagttagagctgagttcaatcttaccacctacaga
agtaagcacctcatgaagaggcatgtgcaagagtcagttaaacaactaaatggagatgct
tttattggtcatgccttacagtggactctggacaatgtatttttaagtacacccaatctg
agaagaaacaaagtcatatttgtgatatctgctggggaaaccagccacttagatggggaa
atcttaaagaaggaatccttgcgagccaaatgtcagggatatgccctatttgtgttttcc
cttggccctaattgggatgacaaggaactggaggatctcgccagccaccctttggatcac
cacctggtccagcttggacgaattcataaacctgaccacagttatggtgtgaagtttgtg
aagtcctttataaactcaatcaggcgtgcaatcaacaaatatccaccaataaacttaaaa
ataaagtgcaacagacttaactctatagatccaaagcagcccccacgaccgttccaaagc
tttgttcctggaccacttaaagctaccctcaaagaagatgtattacagaaggcaaaattc
tttcaagataaaaaatatctttcaagagtagcaagaagtggcagagatggtgctattcaa
aattttatcagaagcacctcccatacctttaagaatggaagaaggatagaaagtgctccc
aaacaacatgattaa

KEGG   Pongo abelii (Sumatran orangutan): 100434817
Entry
100434817         CDS       T01416                                 

Gene name
VWF
Definition
(RefSeq) von Willebrand factor precursor
  KO
K03900  von Willebrand factor
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04610  Complement and coagulation cascades
pon04611  Platelet activation
pon04613  Neutrophil extracellular trap formation
pon05165  Human papillomavirus infection
pon05171  Coronavirus disease - COVID-19
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100434817 (VWF)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100434817 (VWF)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100434817 (VWF)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    100434817 (VWF)
   04611 Platelet activation
    100434817 (VWF)
   04613 Neutrophil extracellular trap formation
    100434817 (VWF)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease - COVID-19
    100434817 (VWF)
   05165 Human papillomavirus infection
    100434817 (VWF)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03110 Chaperones and folding catalysts [BR:pon03110]
    100434817 (VWF)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100434817 (VWF)
Chaperones and folding catalysts [BR:pon03110]
 Intramolecular chaperones
  Others
   100434817 (VWF)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100434817 (VWF)
SSDB
Motif
Pfam: VWD VWA C8 VWA_N2 TIL VWA_2 VWC Pacifastin_I MCR_beta_N
Other DBs
NCBI-GeneID: 100434817
NCBI-ProteinID: NP_001233206
Ensembl: ENSPPYG00000004180
UniProt: F5XVC0
LinkDB
Position
12
AA seq 2813 aa
MIPARFAGVLLALALILPGTLCAEGTRGRSSMARCSLFGSDFINTFDGSMYSFAGYCSYL
LAGDCQKRSFSIIGDFQNGKRVSLSVYLGEFFDIHLFVNGTVTQGDQRVSMPYASKGLYL
ETEAGYYKLSGEAYGFVARIDGSGNFQVLLSDRYFNKTCGLCGNFNIFAEDDFMTQEGTL
TSDPYDFANSWALSSGEQWCERASPPSSSCNISSGEMQKGLWEQCQLLKSTSVFARCHPL
VDPEPFVALCEKTLCECAGGLECTCPAFLEYARTCAQEGMVLYGWTDHSACSPVCPAGME
YKQCVSPCARTCQSLHINEVCQERCVDGCSCPEGQLLDEGLCMESTKCPCMHSGKRYPPG
ASLSRDCNTCICRNSQWICSNEECPGECLVTGQSHFKSFDNRYFTFSGICQYLLARDCQD
HSFSIVIETVQCADDRDAVCTRSVTVRLPGLQDSLVKLKHGGGVAMDGQDVQLPLLKGDL
RIQHTVTASVRLSYGEDLQMDWDGRGRLLVKLSPVYAGKTCGLCGNYNGNQGDDFLIPSG
LAEPRVEDFGNAWKLHGDCQDLQKQHSDPCALNPRMTRFSEEACAVLTSPTFEACHRAVS
PLPYLRNCRYDVCSCSDGRECLCGALASYAAACAGRGVRVAWREPGRCELNCPKGQVYLQ
CGTPCNLTCRSLSYPDEECNEACLEGCFCPPGLYMDERGDCVPKAQCPCYYDGEIFQPED
IFSDHHTMCYCEDGFMHCTMNGVPGSLLPDAVLSSPLSHRSKRSLSCRPPMVKLVCPADN
PRAEGLECAKTCQNYDLECMSMGCVSGCLCPPGMVRHENRCVALERCPCFHQGKEYAPGE
TVKIGCNTCVCRDRKWNCTDHVCDATCSTIGMAHYLTFDGLKYMFPGECQYVLVQDYCGS
NPGTFRILVGNEGCSHPSVKCEKRVTILVEGGEIELFDGEVNVKRPMKDETHFEVVESGR
YIILLLGKALSVVWDHHLSISVVLKQTYQEKVCGLCGNFDGIQNNDLTSSNLQVEEDPVD
FGNSWKVSSQCADTRKVPLDSSPATCHNNIMKQTMVDSSCRILTSDVFQDCNKLVDPEPY
LDVCIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVTWRTATLCPRSCEERNLRENGY
ECEWRYNSCAPACRVTCQHPEPLACPVQCVEGCHAHCPPGKILDELLQTCVDPEDCPVCE
VAGRRFASGKKVTLNPSDPEHCQICHCDGVNLTCEACQEPGGLVVPPTDAPVSPTTPYVE
DISEPPLHDFYCSRLLDLVFLLDGSSRLSEAEFEVLKAFVVDMMERLRISQKWVRVAVVE
YHDGSHAYIGLKDRKRPSELRRIASQVKYAGSQVASTSEVLKYTLFQIFSKIDRPEASRI
ALLLMASQEPQRMSRNFVRYVQGLKKKKVIVIPVGIGPHANLKQIRLIEKQAPENKAFVL
SGVDELEQQRDEMVSYLCDLAPEARPPTLPPHMAQVTVGPGLLGVSTLGPKRNSMVLDVA
FVLEGSDKIGEADFNRSKEFMEEVIQRMDVGQDSIHVTVLQYSYMVTVEYPFSEAQSKGD
ILQRVREIRYQGGNRTNTGLALQYLSDHSFLVSQGDREQAPNLVYMVTGNPASDEIKRLP
GDIQVVPIGVGPNANVQELERIGWPNAPILIQDFETLPREAPDLVLQRCCSGEGLQIPTL
SPAADCSQPLDVILLLDGSSSFPASYFDEMKSFAKAFISKANIGPHLTQVSVLQYGSITT
IDVPWNVAPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTSEMHGARPGVSKAVVILV
TDISVDSVDAAADAARSNRVAVFPIGIGDRYDAAQLRILAGPAGDSNMVKLQRIEDLPTM
VTLGNSFLHKLCSGFVRICMDEDGNEKRPGDVWTLPDQCHTVTCQPDGQTLLKSHRVNCD
RGPRPSCPNSQSPVKVEETCGCRWTCPCVCTGSSTRHIVTFDGQNFKLTGSCSYVLFQNK
EQDLEVILHNGACSPGARQGCMKSIEVKHSALSVELHSDMEVMVNGRLVSVPYVGGNMEV
NVYGAIMHEVRFNHLGHIFTFTPQNNEFQLQLSPKTFASKTYGLCGICDENGANDFILRD
GTVTTDWKTLVQEWTVQRPGQMCQPILEEQCLVPDSSHCQVLLLPLFAECHKVLAPATFY
AICQQDSCHQEQVCEVIASYAHLCRTNGVCVDWRTPDFCAMSCPPSLVYNHCEHGCPRHC
DGNVSSCGDHPSEGCFCPPNKVMLEGSCVPEEACTQCIGEDGVQHQFLEAWVPDHQPCQI
CTCLSGRKVNCTTQPCPTAKDLTCGLCEVARLRQNADQCCPEYECVCDLVSCDLPPVPHC
EGGLQPTLTNPGECRPNFICACRKEECERVSPPSCPPHRLPTLRKTQCCDEYECACNCVN
STVSCPLGYLASTATNDCGCTTTTCLPDKVCVHRSTIYPVGQFWEEGCDVCTCTDMEDAV
MGLRVAQCSQKPCEDSCRSGFTYVLHEGECCGRCLPSACEVVTGSPRGDSQSSWKSVGSQ
WASPENPCLINECVRVKEEVFIQQRNVSCPQLEVPVCPSGFQLSCKTSACCPSCRCERVE
ACTLNGTIIGPGKSVMIDVCTTCRCVVQVGVISGFKLECRKTTCNPCPLGYKEEDNTGEC
CGRCLPTACTIQLRGGQIMTLKRDETLQDGCDTHFCKVNERGEYFWEKRVTGCPPFDEHK
CLAEGGKIMKIPGTCCDTCEEPECNDITARLQYVKVGSCKSEVEVDIHYCQGKCASKAMY
SIDINDVQDQCSCCSPTRTEPMQVALHCTNGSVVYHEVLNAMECKCSPRKCSK
NT seq 8442 nt   +upstreamnt  +downstreamnt
atgattcctgccagatttgctggggtgctgcttgctctggccctcattttgccagggacc
ctttgtgcagaaggaactcgcggcaggtcatccatggcccgatgcagcctcttcggaagt
gacttcatcaacacctttgatgggagcatgtacagctttgcgggatactgcagttacctc
ctggcaggggactgccagaaacgctccttctcgattattggggacttccagaatggcaag
agagtgagcctctccgtgtatcttggggaattttttgacatccatttgtttgtcaatggt
actgtgacacagggggaccaaagagtctccatgccctatgcctccaaagggctgtatcta
gaaactgaggctgggtactacaagctgtccggtgaggcctatggctttgtggccaggatc
gatggcagcggtaactttcaagtcctgctgtcagacagatacttcaacaagacctgcggg
ctgtgtggcaactttaacatctttgctgaagatgactttatgacccaagaagggaccttg
acctcggacccttatgactttgccaactcatgggctctgagcagtggagaacaatggtgt
gaacgggcatctcctcccagcagctcatgcaacatctcctctggggaaatgcagaagggc
ctgtgggagcagtgccagcttctgaagagcacctcggtgtttgcccgctgccaccctctg
gtggaccccgagcctttcgtggccctgtgtgagaagactttgtgtgagtgtgctgggggg
ctggagtgcacctgccctgccttcctggagtacgcccggacctgtgcccaggagggaatg
gtgctgtacggctggaccgaccacagcgcgtgcagcccagtgtgccctgctggtatggaa
tataagcagtgtgtgtccccttgcgccaggacctgccagagcctgcacatcaatgaagtg
tgtcaggagcgatgcgtggatggctgcagctgccctgagggacagctcctggatgaaggc
ctctgcatggagagcaccaagtgtccctgcatgcattccggaaagcgctaccctcccggc
gcctccctctctcgagactgcaacacctgcatttgccgaaacagccagtggatctgcagc
aatgaagaatgtccaggggagtgcctcgtcacaggtcaatcccacttcaagagctttgac
aacagatacttcaccttcagtgggatctgccagtacctgctggcccgggattgccaggac
cactccttctccattgtcattgagaccgtccagtgtgctgatgaccgcgatgctgtgtgc
acccgctccgtcaccgtccggctgcctggcctgcaggacagccttgtgaaactgaagcat
gggggaggagttgccatggatggccaggacgtccagctccccctcctgaaaggtgacctc
cgcatccagcatacagtgacggcctctgtgcgcctcagctacggggaggacctgcagatg
gactgggatggccgcgggaggctgctggtgaagctgtcccccgtttacgccgggaagacc
tgcggcctgtgtgggaattacaacggcaaccagggggacgacttccttatcccctccggg
ctggcggagccccgggtggaggacttcgggaacgcctggaagctgcacggggactgccag
gacctgcagaagcagcacagcgatccctgcgccctcaacccgcgcatgaccaggttctcc
gaggaggcgtgcgcggtcctgacgtcccccacattcgaggcctgccatcgtgccgtcagc
ccgctgccctacctgcggaactgccgctacgacgtgtgctcctgctcggacggccgcgag
tgcctgtgcggcgccctggccagctatgccgcggcctgcgcggggagaggcgtgcgcgtc
gcgtggcgcgagccaggccgctgtgagctgaactgcccgaaaggccaggtgtacctgcag
tgcgggaccccctgcaacctgacctgccgctctctctcttacccggatgaggaatgcaat
gaggcctgcctggagggctgcttctgtcccccagggctctacatggatgagaggggggac
tgcgtgcccaaggcccagtgcccttgttactatgatggtgagatcttccagccggaagac
atcttctcagaccatcacaccatgtgctactgtgaggatggcttcatgcactgtaccatg
aatggagtccccggaagcttgctgcctgatgctgtcctcagcagtcccctgtctcatcgc
agcaaaaggagcctatcctgtcggccccccatggtcaagctggtgtgtcccgctgacaac
ccgcgggctgaagggctcgagtgtgccaaaacgtgccagaactacgacctggagtgcatg
agcatgggctgtgtctctggctgcctctgccccccgggcatggtccggcatgaaaacaga
tgtgtggccctggaaaggtgtccctgcttccatcagggcaaggagtatgcccccggagaa
acagtgaagattggctgcaacacttgtgtctgtcgggaccggaagtggaactgcacagac
catgtgtgtgatgccacgtgctccacgatcggcatggcccactacctcacctttgacggg
ctcaaatacatgttccctggggaatgccagtacgttctggtgcaggattactgcggcagt
aaccctgggacctttcggatcctagtggggaatgagggatgcagccacccctcagtgaaa
tgcgagaaacgggtcaccatcctggtggagggaggagagattgagctgttcgacggggag
gtgaatgtgaagaggcccatgaaggatgagactcactttgaggtggtggagtctggccgg
tacatcattctgctgctgggcaaagccctctccgtggtctgggatcaccacctgagcatc
tccgtggtcctgaagcagacataccaggagaaagtgtgtggcctgtgtgggaattttgat
ggcatccagaacaatgacctcaccagcagcaacctccaagtggaggaagaccccgtggac
tttgggaactcctggaaagtgagctcgcagtgtgctgacaccagaaaagtgcctctggac
tcatcccctgccacctgccataacaacatcatgaagcagacgatggtggattcctcctgt
agaatccttaccagtgacgtcttccaggactgcaacaagctggtggaccccgagccatac
ctggatgtctgcatttatgacacctgctcctgtgagtccattggggactgcgcctgcttc
tgcgacaccattgctgcctatgcccacgtgtgtgcccagcacggcaaggtggtgacctgg
aggacggccacattgtgcccccggagctgcgaggaaaggaatctccgggagaatgggtat
gagtgtgagtggcgctataacagctgtgcgcctgcctgtcgagtcacgtgccagcacccc
gagccgctggcctgccctgtgcagtgtgtggagggctgccatgcccactgccctccaggg
aaaatcctggatgagcttttgcagacctgcgttgaccccgaagactgtccagtgtgtgag
gtggctggccggcgttttgcctcaggaaagaaagtcaccttgaatcccagtgaccctgag
cactgccagatttgccactgtgatggtgtcaacctcacctgtgaagcctgccaggagccg
ggaggcctggtggtgcctcccacagatgccccggtgagccccaccactccgtatgtggag
gacatctcggaaccccccttgcacgatttctactgcagcaggctactggacctggtcttc
ctgctggatggctcctccaggctgtccgaggctgagtttgaagtgctgaaggcctttgtg
gtggacatgatggagcggctgcgcatctcccagaagtgggtccgcgtggccgtggtggag
taccacgatggctcccacgcctacattgggctcaaggaccggaagcgaccgtcagagctg
cggcgcattgccagccaggtgaagtatgcgggcagccaggtggcctccaccagcgaggtc
ttgaaatacacactgttccaaatctttagcaagatcgaccgccctgaagcctcccgcatc
gccctgctcctgatggccagccaggagccccaacggatgtcccggaactttgtccgctac
gtccagggcctgaagaagaagaaggtcatcgtgatcccggtgggtattgggccccatgcc
aacctcaagcagatccgcctcattgagaagcaggcccctgagaacaaggcctttgtgctg
agcggtgtggatgagctggagcagcaaagggacgagatggttagctacctctgtgacctt
gcccctgaagcccgtcctcctactttgcccccccacatggcacaagtcactgtgggcccg
gggctcttgggggtttcgaccctggggcccaagaggaactccatggttctggatgtggcg
tttgtcctggaaggatcggacaaaattggtgaagccgacttcaacaggagcaaggagttc
atggaggaggtgattcagcggatggatgtgggccaggacagcatccatgtcacggtgctg
cagtattcctacatggtgaccgtggagtaccccttcagcgaggcacagtccaaaggggac
atcctgcagcgggtgcgagagatccgctaccagggcggcaacaggaccaacactgggctg
gccctgcagtacctctccgaccacagcttcttggtcagccagggtgaccgggagcaggcg
cccaacctggtctacatggtcaccggaaatcctgcctctgatgagatcaagaggttgcct
ggagacatccaggtagtgcccattggagtgggccctaatgccaacgtgcaggagctggag
aggatcggctggcccaatgcccccatcctcatccaggactttgagacgctcccccgagag
gctcctgacctggtgctgcagaggtgctgctccggagaggggctgcagatccccaccctc
tcccctgccgctgactgcagccagcccctggacgtgatccttctcctggatggctcctcc
agtttcccagcttcttattttgatgaaatgaagagttttgccaaggctttcatttcaaaa
gccaatatagggcctcatctcactcaggtgtcagtgctccagtatggaagcatcaccacc
attgacgtgccatggaacgtggccccggagaaagcccatttgctgagccttgtggacgtc
atgcagcgggagggaggccccagccaaattggggatgccttgggctttgctgtgcgatac
ttgacttcagaaatgcatggtgccaggccgggagtctcgaaggcggtggtcatcctggtc
acggacatctctgtggattcagtggatgcagcagctgatgccgccaggtccaacagagtg
gcagtgttccctattggaattggagatcgctatgatgcagcccagctacggatcttggca
ggcccagcaggcgactccaacatggtgaagctccagcgaatcgaagacctccctaccatg
gtcaccttgggcaattccttcctccacaaactgtgctctggatttgttaggatttgcatg
gatgaggatgggaatgagaagaggcctggggacgtgtggaccttgccagaccagtgccac
accgtgacttgccagccagatggccagacattgctgaagagtcatcgggtcaactgtgac
cgggggccgaggccttcatgccccaacagccagtcacctgttaaagtggaagagacctgt
ggctgccgctggacctgcccctgcgtgtgcacaggcagctccactcggcacatcgtgacg
tttgatgggcagaatttcaagctgactggcagctgttcttatgtcctatttcaaaacaag
gagcaggacctggaggtgattctccataatggtgcctgcagccctggagcaaggcagggc
tgcatgaaatccatcgaggtgaagcacagtgccctctccgtcgagctgcacagtgacatg
gaggtgatggtgaatgggagactggtctctgttccttacgtgggtgggaacatggaagtc
aacgtttatggtgccatcatgcatgaggtcagattcaatcatcttggtcacatcttcaca
ttcactccacaaaacaatgagttccaactgcagctcagccccaagacttttgcttcaaag
acatatggtctctgtgggatctgtgatgagaacggagccaacgacttcatactgagggat
ggcacagtcaccacagactggaaaacacttgttcaggaatggactgtgcagcggccaggg
cagatgtgccagcccatcctggaggagcagtgtcttgtccccgacagctcccactgccag
gtcctcctcttaccactgtttgctgaatgccacaaggtcctggctccagccacattctat
gccatctgtcagcaggacagttgccaccaggagcaagtgtgtgaggtgatcgcctcttat
gcccacctctgtcggaccaatggggtctgcgttgactggaggacacctgatttctgtgct
atgtcatgcccaccatctctggtctacaaccactgtgagcacggctgtccccggcactgt
gatggcaacgtgagctcctgtggggaccatccctcggaaggctgtttctgccctccaaat
aaagtcatgttggaaggcagctgtgtccctgaagaggcctgcactcagtgcattggtgag
gatggagtccagcaccagttcctggaagcctgggtcccggaccaccagccctgtcagatc
tgcacgtgcctcagtgggcggaaggtcaactgcacaacgcagccctgccccacggccaaa
gatctgacgtgcggcctgtgtgaagtggcccgcctccgccagaatgcagaccagtgctgc
cctgagtatgagtgtgtgtgtgacctggtgagctgcgacctgcccccggtgcctcactgt
gaaggtggcctccagcccacactgaccaaccctggcgagtgcagacccaacttcatctgt
gcctgcaggaaggaggagtgcgaaagagtgtccccaccctcctgccccccgcaccgtttg
cccacccttcggaagacccagtgctgtgatgagtatgagtgtgcctgcaactgtgtcaac
tccacagtgagctgtccccttgggtacttggcctcaaccgccaccaatgactgtggctgc
accacaaccacctgccttcccgacaaggtgtgtgtccaccgaagcaccatctaccctgtg
ggccagttctgggaggagggctgcgacgtgtgcacctgcaccgacatggaggatgctgtg
atgggcctccgcgtggcccagtgctcccagaagccctgtgaggacagctgtcggtcgggc
ttcacttatgttctgcatgaaggcgagtgctgtggaaggtgcctgccatctgcctgtgag
gtggtgaccggctcaccgcggggggactcccagtcttcctggaagagtgtcggctcccag
tgggcctccccggagaacccctgcctcatcaatgagtgtgtccgagtgaaggaggaggtc
tttatacaacaaaggaacgtctcctgcccccagctggaggtccctgtctgcccctcgggc
tttcagctgagctgtaagacctcagcgtgctgcccaagctgtcgctgtgagcgcgtggaa
gcctgcacgctcaatggcactatcattgggcccgggaagagtgtgatgatcgatgtgtgc
acaacctgccgctgcgtggtgcaggtgggggtcatctctggattcaagctggagtgcagg
aagaccacctgcaacccctgccccctgggttacaaggaagaagataacacaggtgaatgt
tgtgggagatgtttgcctaccgcttgcaccattcagctaagaggaggacagatcatgaca
ctgaagcgtgatgagacactccaggatggctgtgatactcacttctgcaaggtcaatgag
agaggagagtacttctgggagaagagggtcacaggctgcccgccctttgatgaacacaag
tgtctggctgagggaggtaaaattatgaaaattccaggcacctgctgtgacacatgtgag
gagcctgagtgcaacgacatcactgccaggctgcagtatgtcaaggtgggaagctgtaag
tccgaagtagaggtggatatccactactgccagggcaaatgtgccagcaaagccatgtac
tccattgacatcaacgatgtgcaggaccagtgctcctgctgctctccgacacggacggag
cccatgcaggtggccctgcactgtaccaatggctctgttgtgtaccacgaggttctcaat
gccatggagtgcaaatgctcccccaggaagtgcagcaagtga

KEGG   Pongo abelii (Sumatran orangutan): 100438152
Entry
100438152         CDS       T01416                                 

Gene name
COL1A1
Definition
(RefSeq) collagen alpha-1(I) chain isoform X1
  KO
K06236  collagen type I alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04611  Platelet activation
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100438152 (COL1A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100438152 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100438152 (COL1A1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    100438152 (COL1A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100438152 (COL1A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100438152 (COL1A1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    100438152 (COL1A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100438152 (COL1A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100438152 (COL1A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100438152 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100438152 (COL1A1)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100438152 (COL1A1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 100438152
NCBI-ProteinID: XP_024090696
LinkDB
Position
17
AA seq 1464 aa
MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDRDVWKPEPCRI
CVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVCPDGSESPTDQETTGVEGPKGDTGPR
GPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQLSYGYDEKSTGGISVPGP
MGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGR
PGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQ
MGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGP
RGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGP
QGPGGPPGPKGNSGEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGL
PGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGL
TGSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGV
PGPPGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGE
QGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGS
QGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGD
KGETGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGP
PGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGP
AGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGV
VGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGA
EGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGP
VGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGP
RGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSGGFDFSF
LPQPPQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCR
DLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKD
KRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQ
TGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPII
DVAPLDVGAPDQEFGFDVGPVCFL
NT seq 4395 nt   +upstreamnt  +downstreamnt
atgttcagctttgtggacctccggctcctgctcctcttagcggccaccgccctcctgacg
cacggccaagaggaaggccaagtcgagggccaagacgaagacatcccaccaatcacctgc
gtacagaacggcctcaggtaccatgaccgagacgtgtggaaacccgagccctgccggatc
tgcgtctgcgacaacggcaaggtgttgtgcgatgacgtgatctgtgacgagaccaagaac
tgccccggcgccgaagtccccgagggcgagtgctgtcccgtctgccccgacggctcagag
tcacccaccgaccaagaaaccaccggcgtcgagggacccaagggagacactggtccccga
ggcccaaggggacccgcaggcccccctggccgagatggcatccctggacagcctggactt
cccggaccccccggaccccccggacctcccggaccccctggcctcggaggaaactttgct
ccccagctgtcttatggctacgatgagaaatcaaccggaggaatttccgtgcctggcccc
atgggtccctctggtcctcgtggtctccctggcccccctggtgcacctggtccccaaggc
ttccaaggtccccctggtgagcctggcgagcctggagcttcaggtcccatgggtccccga
ggtccccctggcccccctggaaagaacggagatgatggggaagctggaaaacctggtcgt
cctggtgagcgtgggcctcctgggcctcagggtgctcgaggattgcccggaacagctggc
ctccctggaatgaagggacacagaggtttcagtggtttggatggtgccaagggagatgct
ggtcctgctggtcccaagggtgagcctggcagccctggtgaaaatggagctcctggtcag
atgggcccccgtggtctgcctggtgagagaggtcgccctggagcccctggccctgctggt
gctcgcggaaatgatggtgctactggtgctgccggaccccctggtcccaccggccccgcc
ggtcctcctggcttccctggtgctgttggtgctaagggtgaagctggtccccaagggccc
cgaggctctgaaggtccccagggtgtgcgtggtgagcctggcccccctggccctgctggt
gctgctggccctgctggaaaccctggtgctgatggacagcctggtgctaaaggtgccaat
ggtgctcctggtattgctggtgctcctggcttccctggtgcccgaggcccctctggaccc
cagggccccggcggccctcctggtcccaagggtaacagcggtgaacctggtgctcccggc
agcaaaggagacactggtgctaagggagagcccggccccgttggtgttcaaggaccccct
ggccctgctggagaggaaggaaagcgaggagctcgaggtgaacccggacctactggcctg
cccggaccccctggcgagcgtggtggacctggtagccgtggtttccctggcgcagatggt
gttgctggtcccaagggtcccgctggtgaacgtggttctcctggccctgctggccccaaa
ggatctcctggtgaagctggtcgtcccggtgaagctggtctgcctggtgccaagggtctg
actggaagccctggcagccctggtcctgatggcaaaactggcccccctggtcccgctggt
caagatggtcgccccggacccccaggcccccctggtgcccgtggtcaggctggtgtgatg
ggatttcctggacctaaaggtgctgctggagagcccggcaaggctggagagcgaggtgtt
cccggaccgcctggcgctgtcggtcctgctggcaaagatggagaggctggagctcaagga
ccccctggccctgctggtcccgctggcgagagaggtgaacaaggccctgctggctccccc
ggattccagggtctccctggtcctgctggtcctcccggtgaagcaggcaaacctggtgaa
cagggtgttcctggagaccttggtgcccccggcccctctggagcaagaggcgagagaggt
ttccctggcgagcgtggtgtgcaaggtccccctggtcctgctggtccccgaggggccaac
ggtgctcccggcaatgatggtgctaagggtgatgctggtgcccctggagctcccggtagc
cagggtgcccctggccttcagggaatgcctggtgaacgtggtgcagctggtcttccaggg
cctaagggtgacagaggtgatgctggtcccaaaggtgctgatggctctcctggcaaagat
ggcgtccgtggtctgactggccccattggtcctcctggccctgctggtgcccctggtgac
aagggtgaaactggtcccagcggccctgctggtcccactggagctcgtggtgcccccgga
gaccgtggtgagcctggtccccctggccctgctggcttcgctggcccccctggtgctgat
ggccaacctggtgctaaaggcgaacctggtgatgctggtgctaaaggcgatgctggtccc
cctggccctgccggacccgctggaccccctggccccattggtaatgttggtgctcctgga
gccaaaggtgctcgcggcagcgctggtccccctggtgctactggtttccctggtgctgct
ggccgagtcggtcctcctggcccctctggaaatgctggaccccctggccctcctggtcct
gctggcaaagaaggcggcaaaggtccccgtggtgagactggccctgctggacgtcctggt
gaagttggtccccctggtccccctggccctgctggcgagaaaggatcccctggtgctgat
ggtcctgctggtgctcctggtactcccgggcctcaaggtattgctggacagcgtggtgtg
gtcggcctgcctggtcagagaggagaaagaggcttccctggtcttcctggcccctctggt
gaacctggcaaacaaggtccctctggagcaagtggtgaacgtggtccccctggtcccatg
ggcccccctggattggctggaccccctggtgaatctggacgtgagggagctcctggcgcc
gaaggttcccctggacgagacggttctcctggtgccaagggtgaccgtggtgagaccggc
cccgctggaccccctggtgctcctggtgctcctggtgcccctggccccgttggccctgct
ggcaagagtggtgatcgtggtgagactggtcctgctggtcctgccggtcctgtcggccct
gttggcgcccgtggccccgctggaccccaaggcccccgtggtgacaagggtgagacaggc
gaacagggcgacagaggcataaagggtcaccgtggcttctctggcctccagggtccccct
ggccctcctggctctcctggtgaacaaggtccctctggagcctctggtcctgctggtccc
cgaggtccccctggctctgctggtgctcctggcaaagatggactcaacggtctccctggt
cccattgggccccctggtcctcgcggtcgcactggtgatgctggtcctgttggtcccccc
ggccctcccggacctcctggtccccctggtcctcccagcggtggtttcgacttcagcttc
ctgccccagccacctcaagagaaggctcacgatggtggccgctactaccgggctgatgac
gccaatgtggttcgtgaccgtgacctcgaggtggacaccaccctcaagagcctgagccaa
cagatcgagaacatccggagccccgagggcagccgcaagaaccccgcccgcacctgccgc
gacctcaagatgtgccactctgactggaagagcggagagtactggattgaccccaaccaa
ggctgcaacctggatgccatcaaagtcttctgcaacatggagactggtgagacctgcgtg
taccccactcagcccagtgtggcccagaagaactggtacatcagcaagaaccccaaggac
aagaggcatgtctggtttggcgagagcatgaccgatggattccagtttgagtatggcggc
cagggctctgaccctgctgacgtggccatccagctgaccttcctgcgcctgatgtccacc
gaggcctcccagaacatcacctaccactgcaagaacagcgtggcctacatggaccagcag
actggcaacctcaagaaggccctgctcctccagggctccaacgagatcgagatccgcgcc
gagggcaacagccgcttcacctacagcgtcactgtcgacggctgcacgagtcacaccgga
gcctggggcaagacagtgatcgaatacaaaaccaccaagacctcccgcctgcccatcatc
gatgtggcccccttggacgttggcgccccagaccaggaattcggcttcgacgttggccca
gtctgcttcctgtaa

KEGG   Pongo abelii (Sumatran orangutan): 100438698
Entry
100438698         CDS       T01416                                 

Gene name
LAMA1
Definition
(RefSeq) laminin subunit alpha-1
  KO
K05637  laminin, alpha 1/2
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
pon05410  Hypertrophic cardiomyopathy
pon05412  Arrhythmogenic right ventricular cardiomyopathy
pon05414  Dilated cardiomyopathy
pon05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100438698 (LAMA1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100438698 (LAMA1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100438698 (LAMA1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100438698 (LAMA1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100438698 (LAMA1)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    100438698 (LAMA1)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    100438698 (LAMA1)
   05414 Dilated cardiomyopathy
    100438698 (LAMA1)
   05416 Viral myocarditis
    100438698 (LAMA1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100438698 (LAMA1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100438698 (LAMA1)
   05145 Toxoplasmosis
    100438698 (LAMA1)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_B Laminin_N Laminin_II Laminin_G_3 DUF1080 Hepar_II_III
Other DBs
NCBI-GeneID: 100438698
NCBI-ProteinID: XP_024091886
LinkDB
Position
18
AA seq 3079 aa
MRWAGAWVLGLALLLCSVARCQQRGLFPAILNLASNAHISTNATCGEKGPEMFCKLVEHV
PGRPVRNPQCRICDGNSANPRERHPISHAIDGTNNWWQSPSIQNGREYHWVTITLDLRQV
FQVAYVIIKAANAPRPGNWILERSLDGTTFSPWQYYAVSDSECLSRYNITPRRGPPTYRA
DDEVICTSYYSRLVPLEHGEIHTSLINGRPSADDLSPKLLEFTSARYIRLRLQRIRTLNA
DLMTLSHREPKELDPIVTRRYYYSIKDISVGGMCICYGHASSCPWDETTKKLQCQCEHNT
CGESCNRCCPGYHQQPWRPGTVSSGNTCEACNCHNKAKDCYYDESVAKQKKSLNTAGQFR
GGGVCINCLQNTMGINCETCIDGYYRPHKVSPYEDEPCRPCNCDPVGSLSSVCIKDDLHS
DLHNGKWPGQCPCKEGYAGEKCDRCQLGYKDYPTCVSCGCNPAGSASDEPCTGPCVCKEN
VEGKACDRCKPGFYNLKEKNPRGCSECFCFGVSDVCSSLSWPVGQVNDMSGWLVTDLISP
RKILSQQDALGRRHQVSINNTAVMQRLALKYYWAAPKAYLGNKLTAFGGFLKYTVSYEIP
VETVDSNLMSHADVIIKGNGLTLSTQAEGLSLQPYEEYTNVVRLVPENFRDFHSKMQIDR
DQLMTVLANVTHLLIRANYNSAKMALYRLESVSLDIASPNAIDLVVAADVEHCECPQGYT
GTSCELCLSGYYRVDGILFGGICQACECHGHAAECDVHGVCIACAHTTTGDHCEQCLPGF
YGEPSRGTPGDCQPCACPLTIASNNFSPTCHLSDGDEVVCDRCAPGYSGAWCERCADGYY
GNPTVPGESCVPCDCSGNVDPSEAGHCDSVTGECLKCLGNTDGAHCERCADGFYGDAVTA
KNCRACECHVKGSHSAVCHPETGLCDCKPNVTGQQCDQCLHGYYGLDSGHGCQPCNCSVA
GSVSDDCTDEGQCHCVPGVAGKRCDRCARGFYTYQDGSCTPCDCPHTQNTCDPETGECVC
PPHTQGVKCEECEDGHWGYDVEVGCQACNCSLVGSTHHQCDVVTSHCQCKSKFGGRACDQ
CSLGYRDFPDCVPCDCDLRGTSGDACNLEQGLCGCAEETGACPCKENVFGPQCNECREGT
FALRADNPLGCSPCFCSGLSHLCSELEGYVRTPVTLGSDQPLLRVVSQSNLRGTTEGVYY
QAPDFLLDAATIRQHIRAEPFYWRLPQQFQGDQLMAYGGKLKYSVAFYSLDGIGTSNFEP
QVLIKGGRIRKQVIYMDAPAPENGVRQEQEVAMRENFWKYFNSVSEKPVTREDFMSVLSD
IEYILIKASYGQGLQQSRISDISMEVGRKAEKLHPEEEVASLLENCVCPPGTVGFSCQDC
APGYHRGKLPAGSDRGLRPLIAPCVPCSCNNHSDTCDPDTGKCLNCGDNTAGDHCDVCAS
GYYGKVTGSASDCAPCACPHSPPASFSPTCVLEGDHNFRCDACLLGYEGKHCERCSSGYY
GNPQTPGGSCQKCDCNPHGSVHGDCDRTSGQCVCRLGASGLRCDECEPRHILMETDCVSC
DDECVGVLLNDLDEIGDAILSLNLTGIIPVPYGILSNLENTTKYLQESLLKENMQKDLGK
IKLEGVAEEMDNLQKKLTRMLASTQKVNRATERIFKESQDLAIAIERLQMSITEIMEKTT
LNQTLDEDFLLPNSTLQNMQQNGTSLLEIMQIRDFTQLHQNATLELKAAEDLLSQIQENY
QKPLEELEVLKEAASHLLSKHNNELKAAEALVREAEAKMQESNHLLLMVNANLREFSDKK
LHVQEEQNLTSELIAQGRGLIDAAAAQTDAVQDALEHLEDHQDKLLLWSAKIRHHIDDLV
MHMSQRNALDLVYRAEDHAAEFQRLADVLYSGLENIRNVSLNATSAAYVHYNIQSLIEES
EELARDAHRTVTETSLLSESLVYNGKAAVQRSSKFLKEGNNLSRRLPGIALELSELRNKT
NRFQENAVEITWQTNESLLILRAIPEGIRDKGAKTKELATSASQSAVSTLRDVAGLSQEL
LNTSASLSRVNTTLQETHQLLQDSTMATLLAGRKVKDVETQANLLFDRLKPLKMLEENLS
RNLSEIKLLISQARKQAASIKVAVSADRDCIRAYQPQISSTNYNTLTLNVKTQEPDNLLF
YLGSSTASDFLAVEMRRGKVAFLWDLGSGSTRLEFPDFPIDDNRWHSIHVARFGNIGSLS
VKEMSSNQKSPTKTSKSPGTANVLDVNNSTLMFVGGLGGQIKKSPAVKVTHFKGCLGEAF
LNGKSIGLWNYIEREGKCRGCFGSSQNEDPSFHFDGSGYSVVEKSLPATVTQIIMLFNTF
SPNGLLLYLGSYGTKDFLSIELFHGRVKVMTDLGSGPLTLLTDRRYNNGTWYKIAFQRNR
KQGVLAVIDAYNTSNKETKQGETPGASSDLNRLDKDPIYVGGLPRSRVVRRGVTTKSFVG
CLKNLEISRSTFDLLRNSYGVRKGCLLEPIRSVSFLKGGYIELPPKSLLPESEWLVTFAT
TNSSGIILAALGGDVEKRGDREEAHVPFFSIMLIGGNIEVHVNPGDGTGLRKALLHAPTG
TCSDGQAHSISLVRNRRIITVQLDENNPVEMKLGPLVESRTINVSDLYVGGIPEGEGTSL
LTMRRSFHGCIKNLIFNLELLDFNSAVGHEQVDLDTCWLSERPKLAPDAEDSKLLPEPRA
FPEQCVVDAALEYVPGAHQFGLTQNSHFILPFNQSAVRKKLSVELSIRTFASSGLIYYMA
HQNQADYAVLQLHGGRLHFMFDLGKGRTKVSHPALLSDGKWHTVKTDYVKRKGFITVDGQ
ESPMVTVVGDGTMLDVEGLFYLGGLPSQYQARKIGNITHSIPACIGDVMVNSKQLDKDSP
VSAFTVNRCYAVAQEGTYFDGSGYAALVKEGYKVQSDVNITLEFRTSSQNGVLLGISTAK
VDAIGLELVDGKVLFHVNNGAGRITAAYEPKTTTALCDGKWHTLQAKKSKHRITLIVDGN
AVGAESPHTQSTSVDTNNPIYVGGYPAGVKQKCLRSQTSFRGCLRKLALIKSPQVQSFDF
SRAFELHGVFLHSCPGTES
NT seq 9240 nt   +upstreamnt  +downstreamnt
atgcgctgggccggcgcgtgggtcctgggcctggccttgctgctgtgtagcgtcgcgcgg
tgccagcagagaggcctgtttcctgccattctcaatcttgccagcaatgctcacatcagc
accaatgccacctgtggcgagaaggggcccgagatgttctgcaaacttgtggagcatgtg
ccaggtcggcccgtccgaaacccacagtgccggatctgtgatggcaacagcgcaaacccc
agagagcgccatccaatatcacatgccatagatggcaccaataactggtggcaaagtccc
agcattcagaatgggagagagtatcactgggtcacaatcactctggacttaagacaggtc
tttcaagttgcatatgtcatcattaaagctgccaatgcccctcgacctggaaactggatt
ttggagcgttctctggacggcaccacattcagcccttggcagtattatgcagtcagcgac
tcagagtgtttgtctcgttacaatataactccaagacgagggccacccacctacagggct
gatgatgaagtgatctgcacctcctattattccagattggtgccacttgagcatggagag
attcatacatcactcatcaatggcagaccaagcgctgacgatctttcacccaagttgttg
gaattcacttctgcacgatatattcgccttcgcttacaacgcattagaacgctgaatgca
gatctcatgacccttagccaccgggaacctaaagaactggatcctattgttaccagacgc
tattattattcaataaaggacatttctgttggaggcatgtgtatctgctatggccatgct
agtagctgcccgtgggatgaaactacaaagaaactgcagtgtcaatgtgagcataatact
tgcggggagagctgtaacaggtgctgtcctgggtaccatcagcagccctggaggccggga
accgtgtcctccggcaatacctgtgaagcatgtaattgtcacaataaagccaaagactgt
tactatgatgaaagtgttgcaaagcagaagaaaagtttgaatactgctggacagttcaga
ggaggaggggtttgcataaattgcttgcagaacaccatgggaatcaactgtgaaacctgt
attgatggatattatagaccacacaaagtgtctccttatgaggatgagccttgccgcccc
tgtaactgtgaccctgtggggtccctcagttctgtctgtattaaggatgacctccattct
gacttacacaatgggaagtggccgggtcagtgcccatgtaaggaaggctatgcaggagaa
aaatgtgatcgctgccaacttggctataaggattacccgacctgtgtctcctgtgggtgc
aacccagcgggtagtgccagtgatgagccctgcacagggccctgtgtttgtaaggaaaac
gttgaggggaaggcctgtgatcgctgcaagccaggattctataacttgaaggaaaaaaac
ccccggggctgctccgagtgcttctgctttggcgtttctgatgtctgcagcagcctctct
tggcctgttggtcaggtaaacgatatgtccgggtggctggtcaccgacttgatcagtccc
aggaagatcctgtctcagcaagatgcactaggcaggcgccatcaggtcagcatcaacaac
accgcggtcatgcagaggctggctctcaagtactactgggcggcccccaaggcctacctt
ggaaataagctgactgcgtttggtggattcctgaaatacacggtgtcttacgaaattcca
gtagagacggtagacagtaacctcatgtcgcatgctgacgtcatcattaagggaaacgga
ctcactttaagcacacaggctgagggtctgtcattgcagccttatgaagagtacacaaat
gtggttagacttgtgcctgaaaacttccgagattttcacagcaaaatgcagattgatcgt
gaccagctgatgactgtccttgccaatgtgacacatcttttgatcagagccaactacaat
tctgcaaaaatggctctttacaggttggagtccgtctctctggacatagccagccctaat
gccatcgacctggtggtggccgctgatgtggagcactgtgaatgtccacaaggctacaca
gggacctcctgtgagttgtgcctctctggctattaccgcgtggatggaatactctttgga
ggaatttgtcaagcctgcgaatgccacggccatgcagctgagtgcgacgttcatggcgtt
tgcattgcatgtgcgcacaccaccaccggcgaccactgtgaacagtgcttgcccggcttc
tacggggagccttcccgagggacacctggggactgccagccctgcgcctgccctctcacc
atagcctccaacaatttcagccccacctgccacctcagtgatggagatgaagtggtctgt
gaccggtgtgccccgggctactcaggagcttggtgtgagagatgtgcagatggttactat
ggaaacccaaccgtgcctggcgaatcttgtgttccctgtgactgcagcggcaatgtggac
ccctcggaggctggtcactgtgactcagtcaccggggagtgcctgaagtgcctggggaac
acagacggcgcccactgtgaaaggtgtgctgacgggttctacggggatgctgtgacagcc
aagaactgccgcgcctgtgaatgccatgtgaaaggctcccattctgccgtgtgccatcct
gagaccgggctctgcgactgcaaaccaaacgtgactggacagcagtgtgaccagtgcttg
catggctattatgggctggactcaggccatggctgccagccctgcaactgcagtgtggca
ggctccgtgtcagatgactgcacggatgaaggccagtgtcactgcgtcccaggtgtggca
gggaaaaggtgtgacaggtgtgcccgtggcttctacacctatcaggatggtagctgtaca
ccctgtgactgcccacacactcagaatacctgcgacccagaaactggagagtgtgtctgc
ccccctcacacacagggtgtgaagtgtgaagaatgtgaggatgggcactggggctacgat
gtggaggtggggtgccaggcctgcaattgcagtctcgtggggtcgactcatcatcagtgc
gatgtggtcaccagccattgccagtgcaagtcaaaatttggtggccgggcctgcgatcag
tgttccttgggttacagagattttcctgactgtgttccctgtgactgtgacctgaggggg
acgtcgggggacgcctgcaacctggagcagggtctctgcggctgtgcggaggaaaccggg
gcctgcccttgtaaggagaatgtctttggccctcagtgcaacgagtgtcgagagggcacc
tttgctctccgcgcagacaaccccctgggctgcagcccctgcttctgctccgggctgtcc
cacctctgctcagagctggagggctacgtgaggaccccagtaacgctgggctccgatcag
cctcttctgcgtgtggtttctcagagtaacttgaggggcacgaccgagggggtttactac
caggcccccgacttcctgctggatgccgccaccatccggcagcacatccgtgcagagccg
ttttactggcggctgccgcagcagttccaaggagaccagctcatggcctatggtggcaaa
ctgaagtacagcgtggccttctattctttggatggcatcggcacctccaattttgagcct
caagttctcatcaaaggtggtcggatcagaaagcaagtcatttacatggatgcaccagcc
cctgagaatggagtgagacaggaacaagaagtagcaatgagagagaatttttggaaatat
tttaactctgtttctgaaaaacctgtcacgcgagaggattttatgtctgtccttagcgat
attgagtacatcctcatcaaggcatcgtatggtcaaggattacagcagagcagaatctca
gacatttcaatggaggttggcagaaaggctgaaaagctgcacccagaagaagaggttgca
tctcttttagagaattgtgtctgtcctcctggcactgtgggattctcatgtcaggactgt
gcccctgggtaccacagagggaagctcccagcagggagtgacaggggactgcgccctctg
attgctccttgtgtaccctgcagttgcaacaaccatagtgacacctgtgaccctgacacc
gggaagtgtctgaactgtggcgataacacagcaggtgaccattgtgatgtgtgtgcttct
ggttactacgggaaggtgactggctcagcaagtgactgtgctccgtgtgcctgtcctcac
agccctcctgccagttttagtcccacttgcgtcttggaaggtgaccacaatttccgttgt
gacgcctgtctcctgggctatgaaggaaaacattgtgaaaggtgctcctctggctattat
gggaaccctcaaacaccaggtggcagttgccagaagtgtgactgcaacccgcacggctct
gtccacggtgactgtgaccgcacatctgggcagtgcgtttgtaggctgggggcctcgggg
ctccggtgcgatgagtgtgaaccgaggcacattctgatggaaacggattgtgtttcctgt
gatgatgagtgtgtaggtgtgctgctgaatgacttggatgagattggtgatgccattctt
tctctgaacctcactggcattatccctgtcccatatggaattttgtcaaacctggaaaat
acaacgaaatatctccaggaatcattattaaaagaaaatatgcaaaaggacctggggaaa
attaagcttgaaggtgttgcagaagaaatggacaacctgcaaaagaagctcactagaatg
ttagcgagtacccaaaaggtgaatagggcaactgagagaatcttcaaggagagtcaagat
ctggccatagccattgagaggctgcagatgagcatcacagaaattatggaaaagacaact
ttaaatcagactttggacgaagatttcctactacccaattctactcttcagaacatgcaa
cagaatggtacatctttgctagaaatcatgcagataagagacttcacacagttgcaccaa
aatgccacccttgaactcaaggctgctgaagatttattgtcacaaattcaggaaaattac
cagaagccgctggaagaattggaggtattgaaagaagcagcaagccacctcctttcaaag
cacaataatgaactaaaggcggctgaggcgctcgtgagggaagctgaggcaaagatgcag
gaaagcaaccacctgctgctcatggtcaatgctaatctgagagaattcagtgataaaaag
ctacatgttcaagaagaacaaaatctgacctcagaactcattgcccaaggaagaggattg
atagatgctgctgctgcacaaacagatgctgtacaagatgctctagagcacttagaggat
caccaggataagctacttttatggtctgccaaaatcaggcaccacatagatgacctggtc
atgcacatgtcccaaaggaacgcactcgacctggtctacagagctgaggaccatgccgct
gagttccagagactagcagatgttctgtacagtggccttgaaaacatcagaaatgtgtcc
ctgaatgccaccagtgcagcctatgtccattacaacatccagagcctgatcgaagaatcg
gaggaactggccagagatgctcacaggactgtgactgagacgagcctgctctcagaatcc
cttgtttataatgggaaagcggccgtgcagcgcagctccaaatttctaaaagaaggcaac
aacctcagcaggaggcttccaggtattgcattggaactgagtgaattgagaaataagaca
aacagatttcaagagaatgctgttgaaattacctggcagaccaatgaatcactcttgata
cttagagcaattcctgaaggtataagagacaagggagccaaaaccaaagagctggccacg
tctgcaagccagagcgcggtgagcacgctgagggacgtggcggggctgagccaggagctg
ctgaacacatctgccagcctgtctagggtcaacaccacattacaagagacgcaccagctt
ctgcaggactccaccatggccactctgttggctggaagaaaagtcaaagatgtggaaact
caagccaaccttttgtttgatcggttgaagcccttgaagatgttagaggagaatctgagc
agaaacctatcagaaattaaactgttgatcagccaggcccgcaaacaagcagcttctatt
aaagtcgctgtgtctgcagacagagattgcatccgggcctaccagcctcagatttcctct
accaactacaacaccttaacactaaatgttaagacacaggaacccgataatcttctcttc
tacctcggtagcagcaccgcttctgatttccttgcagtggagatgcggcgagggaaagtg
gccttcctgtgggacctgggctctgggtccacacgcttggagtttccagactttcccatt
gatgacaacagatggcacagtatccatgtagccagatttggaaacattggttcactgagt
gtaaaggaaatgagctcaaatcagaagtcaccaacaaaaacaagtaaatcccctgggaca
gctaatgttctggatgtaaataattcaacactcatgtttgttggaggtcttggaggacaa
atcaagaaatctcctgctgtgaaggttactcattttaaaggctgcttgggagaggccttc
ctgaatggaaaatccatcggcctatggaactatattgaaagggaaggcaagtgccgtggg
tgctttggaagctcccagaatgaagacccttccttccattttgacgggagtgggtactct
gtcgtggagaagtcacttccggctaccgtgacccagataatcatgctttttaataccttt
tcacctaacggacttcttctctacctgggttcatacggcacaaaagactttttatccatc
gagctgtttcacggcagagtgaaggttatgactgacctgggttcaggaccccttaccctt
ttgacagacagacgttataacaatggaacctggtacaaaattgccttccagcgaaaccgg
aagcaaggagtcctagcagttatcgatgcctataacaccagtaataaagaaaccaagcag
ggtgagactccgggagcatcttctgacctcaaccgcctagacaaggacccgatttatgtg
ggtggattaccaaggtcaagagttgtaaggagaggtgtcaccaccaaaagctttgtgggc
tgcctcaagaacctggaaatatccagatcaacctttgacttactcagaaattcctatgga
gtgagaaaaggctgtttactggagcccatccggagtgttagcttcttgaaaggcggctac
attgaattgccacccaaatctttgttaccagaatcggaatggctggtaacatttgccacc
acgaacagcagtggcatcatcctggctgccctcggcggggatgtggagaagcggggtgat
cgtgaggaagcacatgtgcccttcttttccatcatgctgatcggaggcaacattgaggta
catgtcaatcctggggatgggacaggcctgagaaaagctctcctgcacgctcccacgggt
acatgcagcgatggacaagcgcattccatctccttggtcaggaatcggagaattatcact
gtccaattggatgagaacaatcctgtggaaatgaagttgggcccattagtagaaagcagg
acgataaatgtgtccgatctgtacgtcgggggaattccagagggagaggggacatcgctg
ctcacaatgagaagatcgttccatggctgtatcaaaaacctgatcttcaatttggaactt
ttggatttcaacagtgcagttggccatgagcaagtcgacttggacacctgctggctgtca
gaaaggcctaagctggctcccgatgcagaggacagcaagctcttgccagagccccgggct
tttccagaacagtgtgtggtggatgcagctctggagtacgttcctggcgctcaccagttt
ggtctcacacaaaacagccatttcatcttgccttttaatcagtcggctgtcagaaagaag
ctctcggttgagctaagtatccgcacgttcgcctccagcggcctgatttactacatggct
catcagaaccaagcagactacgccgtgctccagctgcacgggggccgcctccacttcatg
tttgaccttggcaaaggcagaacaaaggtctctcaccctgcactgctcagcgatggcaag
tggcacacggtcaagacagactatgttaaaagaaaaggcttcataactgttgacggccaa
gagtcccccatggtgactgtggtgggagacggaaccatgctggatgtggagggtttgttc
taccttggaggcctgccctcccagtaccaggccaggaaaattggaaatatcacccacagc
atccctgcctgcattggggatgtgatggttaacagcaaacagctggacaaggacagcccg
gtgtctgccttcacggtgaacaggtgctacgcagtggcccaggaaggaacatactttgac
ggaagcggatatgccgctcttgtcaaagagggctacaaagtccagtcagatgtgaacatc
acactggagtttcgaacctcctcgcagaatggcgtcctcctggggatcagcactgccaaa
gtggatgccattggactagagcttgtggacggcaaggtcttgttccatgtcaacaatggt
gctggcaggataacagctgcatatgagcccaaaaccaccactgcgctctgtgatggaaaa
tggcacactcttcaggctaagaaaagcaaacaccgtatcactctgattgtcgacgggaac
gcagttggcgctgaaagtccacacacccagtctacctcggtggacaccaacaatcccatt
tatgttggtggctatcctgctggtgtgaagcaaaaatgcctgcgcagccagacctcgttc
cgcgggtgtttgaggaagctagctctgattaagagcccacaggtgcaatcctttgacttc
agcagagcgttcgaactgcatggagttttccttcattcctgtcctgggaccgagtcctga

KEGG   Pongo abelii (Sumatran orangutan): 100439107
Entry
100439107         CDS       T01416                                 

Gene name
COL1A2
Definition
(RefSeq) collagen alpha-2(I) chain
  KO
K06236  collagen type I alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04611  Platelet activation
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100439107 (COL1A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100439107 (COL1A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100439107 (COL1A2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    100439107 (COL1A2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100439107 (COL1A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    100439107 (COL1A2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    100439107 (COL1A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100439107 (COL1A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100439107 (COL1A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100439107 (COL1A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100439107 (COL1A2)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100439107 (COL1A2)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 100439107
NCBI-ProteinID: XP_024105396
Ensembl: ENSPPYG00000017809
LinkDB
Position
7
AA seq 1366 aa
MLSFVDTRTLLLLAVTSYLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPSGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQ
GGKGEQGPAGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGTKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPVGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK
NT seq 4101 nt   +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcggactttgttgctgcttgcagtaacctcatacctagca
acatgccaatctttacaagaggaaactgtaagaaagggcccagccggagatagaggaccg
cgtggagaaaggggtccaccaggccccccaggcagagatggtgaagatggtcccacaggc
cctcctggtccacctggtcctcctggcccccctggtctcggtgggaactttgctgctcag
tatgatggaaaaggagttggtcttggccctggaccaatgggcttaatgggacctagaggc
ccacctggtgcagctggagccccaggccctcaaggttttcaaggacctgctggtgagcct
ggtgaacctggtcaaactggtcctgcaggtgctcgtggtccagctggccctcctggcaag
gctggtgaagatggtcaccctggaaaacccggacgacctggtgagagaggagttgttgga
ccacagggtgctcgtggtttccctggaactcctggacttcctggcttcaaaggcattagg
ggacacaatggtctggatggattgaagggacagcccggtgctcctggtgtgaagggtgaa
cctggtgcccctggtgaaaatggaactccaggtcaaacaggagcccgtgggcttcctggt
gagagaggacgtgttggtgcccctggcccagctggtgcccgtggcagtgatggaagtgtg
ggtcccgtgggtcctgctggtcccattgggtctgctggccctccaggcttcccaggtgcc
cctggcccaaagggtgaaattggagctgttggtaacgctggtcctgctggtcccgccggt
ccccgtggtgaagtgggtcttccaggcctctccggccctgttggacctcctggtaatcct
ggagcaaatggccttactggtgccaagggtgctgctggccttcccggcgttgctggggct
cccggcctccctggaccccgcggtattcctggccctgttggtgctgccggtgccactggt
gccagaggacttgttggtgaacctggtccagctggctccaaaggagagagcggtaacaag
ggtgagcccggctctgctgggccccaaggtcctcctggtcccagtggtgaagaaggaaag
agaggccctaatggggaagctggatctgccggccctccaggacctcctgggctgagaggt
agtcctggttctcgtggtcttcctggagctgatggcagagctggcgtcatgggtcctcct
ggtagtcgtggtgcaagtggccctgctggagtccgaggacccagtggagatgctggtcgc
cctggggagcctggtctcatgggacccagaggtcttcctggttcccctggaaatatcggc
cctgctggaaaagaaggtcctgtcggcctccctggcatcgacggcaggcctggcccaatt
ggcccagctggagcaagaggagagcctggcaacattggattccctggacccaaaggcccc
actggtgatcctggcaaaaacggtgataaaggtcatgctggtcttgctggtgctcggggt
gctccaggtcccgatggaaacaatggtgctcagggacctcccggaccacagggtgtccaa
ggtggaaaaggcgaacagggtcccgctggtcctccaggcttccagggtctgcctggaccc
tcaggtcccgctggtgaagttggcaaaccaggagaaaggggtctccatggtgagtttggt
ctccctggtcctgctggtccaagaggggaacgcggtcccccaggtgagagtggtgctgcc
ggtcctactggtcctattggaagccgaggtccttctggacccccagggcctgatggaaac
aagggtgaacctggtgtggttggtgctgtgggcactgctggtccatctggtcctagtgga
ctcccaggagagaggggtgctgctggcatacctggaggcaagggagaaaagggtgaacct
ggtctcagaggtgaaattggtaaccctggcagagatggtgctcgtggtgctcctggtgct
gtaggtgcccctggtcctgctggagccacaggtgaccggggcgaagctggtgctgctggt
cctgctggtcctgctggtcctcggggaagccctggtgaacgtggtgaggtcggtcctgct
ggccccaatggatttgctggtcccgctggtgctgctggtcaacctggtgctaaaggagaa
agaggaaccaaagggcctaagggtgaaaacggtgttgttggtcccacaggccccgttgga
gctgctggcccagctggtccaaacggtccccccggtcctgctggaagtcgtggtgatgga
ggcccccctggtatgactggtttccctggtgctgctggacggactggtcccccgggaccc
tctggtatttctggccctcctggtcctcctggtcctgctgggaaagaagggcttcgtggt
cctcgtggtgaccaaggtccagttggccgaactggagaagtaggtgcagttggtccccct
ggcttcgctggtgagaagggtccctctggagaggctggtactgctggacctcctggcact
ccaggtcctcagggtcttcttggtgctcctggtattctgggtctccctggctcaagaggt
gaacgtggtctaccaggtgttgctggtgctgtgggtgaacctggtcctcttggcattgct
ggccctcctggggcccgtggtcctcctggtgctgtgggtagtcctggagtcaatggtgct
cctggtgaagctggtcgtgatggcaaccctgggaatgatgggcccccaggtcgcgatggt
caacccggacacaagggagagcgtggttaccctggcaatattggtcccgttggtgctgca
ggtgcacctggtcctcatggccccgtgggtcctgctggcaaacatggaaaccgtggtgaa
actggtccttctggtcctgttggtcctgttggtgctgttggcccaagaggtcctagtggc
ccacaaggcattcgtggtgataagggagagcccggtgaaaaggggcccagaggtcttcct
ggcttaaagggacacaatggattgcaaggtctgcctggtatcgctggtcaccatggtgat
caaggtgctcctggctccgtgggtcctgctggtcctaggggccctgctggtccttctggc
cctgctggaaaagatggtcgcactggacatcctggtacagttggacctgctggcattcga
ggccctcagggtcaccaaggtcctgctggcccccctggtccccctggccctcctggacct
ccaggtgtaagcggtggtggttatgactttggttacgatggagacttctacagggctgac
cagcctcgctcagcaccttctctcagacccaaggactatgaagttgacgctactctgaag
tctctcaacaaccagattgagacccttcttactcctgaaggctctagaaagaacccagct
cgcacatgccgtgacttgagactcagccacccagagtggagcagtggttactactggatt
gaccctaaccaaggatgcactatggatgctatcaaagtatactgtgatttctctactggc
gaaacctgtatccgggcccaacctgaaaacatcccagccaagaactggtataggagctcc
aaggacaagaaacacgtctggctaggagaaactatcaatgctggcagccagtttgaatat
aatgtagaaggagtgacttcaaaggaaatggctacccaacttgccttcatgcgcctgctg
gccaactatgcctctcagaacatcacctaccactgcaagaacagcattgcatacatggat
gaggagactggcaacctgaaaaaggctgtcattctacagggctctaatgatgttgaactt
gttgctgagggcaacagcaggttcacttacactgttcttgtagatggctgctctaaaaag
acaaatgaatggggaaagacaatcattgaatacaaaacaaataagccatctcgcctgccc
ttccttgatattgcacctttggacatcggtggtgctgaccaggaattctttgtggacatt
ggcccagtctgtttcaaataa

KEGG   Pongo abelii (Sumatran orangutan): 100439539
Entry
100439539         CDS       T01416                                 

Gene name
LAMA3
Definition
(RefSeq) laminin subunit alpha-3
  KO
K06240  laminin, alpha 3/5
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100439539 (LAMA3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100439539 (LAMA3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100439539 (LAMA3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100439539 (LAMA3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100439539 (LAMA3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100439539 (LAMA3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100439539 (LAMA3)
   05145 Toxoplasmosis
    100439539 (LAMA3)
SSDB
Motif
Pfam: Laminin_I Laminin_EGF Laminin_G_2 Laminin_N Laminin_G_1 Laminin_II Laminin_B Laminin_G_3 Spc7 EzrA
Other DBs
NCBI-GeneID: 100439539
NCBI-ProteinID: XP_024091751
LinkDB
Position
18
AA seq 3289 aa
MAAAARPRGRAPGPVLPPTPLLLLVLRVLPAWGATARDSGAAAGLSLHPPYFNLAEAARI
WATATCGERGPGEGRPQPELYCKLVGGPTAPGSGHTIQGQFCDYCNSEDPRKAHPVTNAI
DGSERWWQSPPLSSGTQYNRVNLTLDLGQLFHVAYILIKFANSPRPDLWVLERSVDFGST
YSPWQYFAHSKVDCLKEFGREANMAVTRDDDVLCVTEYSRIVPLENGEVVVSLINGRPGA
KNFTFSQTLREFTKATNIRLRFLRTNTLLGHLISKAQRDPTVTRRYYYSIKDISIGGQCV
CNGHAEVCNVNNPEKLFRCECQHHTCGETCDRCCTGYNQRRWRPAAWEQSHECEACNCHG
HASDCYYDPDVERQQASLTTQGIYAGGGVCINCQHNTAGVNCEQCAKGYYRPYGVPVDAP
DGCIPCSCDPEHADGCEQGSGRCHCKPNFHGDNCEKCAVGYYSFPFCLRIPIFPVSMPSP
EDPVAGDIKGNWCDCNLEGVLPEICDAHGRCLCHPGVEGPRCDTCRSGFYSFPICQGSSS
ACDPAGTINSNLGYCQCKLHVEGPTCSSCKQLYWNLDKENPNGCSECQCHKAGTVSGTGE
CRQGDGDCHCKSHVGGDSCDTCEDGYFALEKRNYFGCQGCQCDIGGASSSVCSGPSGVCQ
CREHVVGKACQQPENNYYFPDLHHMKYEIEDGSTPNGRDLRFGFDPLAFPEFSWRGYAQM
TSVQNDVRITLNVGKSSGSLFRVILRYVNPGTEAVSGHITIYPSWGAAQSKEIIFLPSKE
PAFVTVPGNGFADPFSITPGIWVACIKAEGVLLDYLVLLPKDYYEASVLQLPVTEPCAYA
GPPQENCLLYQHLPVTRFPCTLACEARHFLLDGEPRPLAVRQLTPAHPVMVDLSGREVEL
HLRLRVPQVGHYVVVVEYSTKAAQLFVVDVNVKSPGSVLAGQVNIYSCNYSVLCRSAVID
HMSRIAMYELLADADIQLKGHMARFLLHQVCIIPIEEFSAEYVRPQVHCIASYGQFVNQS
ATCVSLAHETPPTALILDGLSGRPFPHLPQQSSPSVDVLPGVTLKAPQNQVTLRGRVPHL
GRYVFVIHFYQAAHPTFPAQVSVDGGWPRAGSFRASFCPHVLGCRDQVITEGQIEFDISE
PEVAATVKVPEGKSLVLVRVLVVPAENYDYQILHKKSMDKSLEFITNCGKNSFYLDPQTA
SRFCKNSARSLVAFYHKGALPCECHPTGATGPHCSPEGGQCPCWPNVIGRQCTRCATGHY
GFPRCKPCSCGRRLCEEMTGQCHCPPRTVRPQCEVCETHSFSFHPMAGCEGCNCSRRGTI
EAAMPECDRDSGQCRCKPRITGRQCDQCASGFYRFPECVPCNCNRDGTEPGVCDPGTGAC
LCKENVEGTECNVCREGSFHLDPANLKGCTSCFCFGVNNQCHSSHKRRTKFVDMLGWRLE
TADRVDIPVSFNPGSNSMVADLQELPATVSSASWVAPTSYLGDKVSSYGGYLTYQAKSFG
LPGDMVLLEKKPDVQLTGQHMSIIYEETNTPRPDRLHHGRVQVVEGNFRHASSRAPVSRE
ELMTVLSRLADVRIQGLHFTETQRLTLSEVGLEEASDTGSGHIALAVEICACPPAYAGDS
CQGCSPGYYRDHKGLYTGRCVPCNCNGHSNRCQDGSGICVNCQHNTAGEHCERCQEGYYG
NAIHGSCRACPCPHTNSFATGCVVSGGDVRCSCKAGYTGTQCERCAPGYFGNPQKFGGSC
QPCSCNSNGQLGSCDPLTGDCINQEPKDSSPAEECDDCDSCVMTLLNDLATMGEQLRLVK
SQLQGLSASAGLLEQMRHLETQAKDLRNQLLNYRSAISNHGSKIEGLERELTDLNQEFET
LQEKAQVNSRKAQTLYNNVNRATQSAKELDVKIKNVIRNVHILLKQISGTDGEGNNVPSG
DFSREWAEAQRMMRELRNRNFGKHLREAEADKRESQLLLNRIRTWQKTHQGENNGLANSI
RDSLNEYEAKLSDLHARLQEAAARAKQANGLNQENERALGAIQRQVKEINSLQSDFTKYL
TTADSSLLQTNIALQLMEKSQKEYEKLAASLNEARQELSDKVRELSRSAGKTSLVEEAEK
HAQSLQELAKQLEEIKRNASGDELVRCAVDAATAYENILNAIKAAEDAANRAASASESAL
QTVIKEDLPRKAKTLSSSSDKLLNEAKMTQKKLKQEVSPALNNLQKTLNIVTVQKEVIDT
NLTTLRDGLRGIQRGDIDAMISSAKSMVRKANDITDEVLDGLSPIQTDVERIKDTYGSTQ
NEDFKKALTDADNSVNKLTNKLPDLWRKIESINQQLLPLGNISDNMDRIRELIQQARDAA
NKVAVPMRFNGKSGVEVRLPNDLEDLKGYTSLSLFLQRPNSRENGGTENMFVMYLGNKDA
SRDYIGMAVVDGQLTCVYNLGDHETELQADQILTKSETKEAVMDRVKFQRIYQFARLNYT
KGATSSKPETPGVYDMDGRNSNTLLNLDPENVVFYVGGYPPDFKLPSRLSFPPYKGCIEL
DDLNENVLSLYNFKKTFNLNTTEVEPCRRRKEESDKNYFEGTGYAQVPTQPHAPIPTFGQ
TIQTTVDRGLLFFAENRDRFISLNIEDGKLMVRYKLNSEPPKERGVGDAINNGRDHSIQI
KIGKLQKRMWINVDVQNTIIDGEVFDFSTYYLGGIPIAIRERFNISTPAFRGCMKNLKKT
SGVVRLNDTVGVTKKCSEDWKLVRSASFSRGGQLSFTDLGLPPTDHLQASFGFQTFQPSG
ILLDHQTQTSSLQVTLEDGYIELSTRDSSGPIFKSPQTYMDGLLHYVSVIRDNSGLRLLI
DDQPLRNNERLKRISSSQQSLRLGGSNFEGCISNVFVQRLSLSPEVLDLTSNSLKRDVSL
GSCSLNKPPFLMLLKGSTRFNKTKTFRINQLLQDTPVASPRSMKVWQDACSPLPKTQANH
GALQFGDIPTSHLLFKLPQELLKPRSQFAVDMQTTSSRGLVFHTGAKNSFMALYLSKGRL
VFALGTDGKKLRIKSKEKCNDGKWHTVVFGHDGEKGRLVVDGLRAREGSLPGNSTISIRA
PVYLGSPPSGKPKSLPTNSFVGCLKNFQLDSKPLDTPSSSFGVSSCLGGPLEKGIYFSEE
GGHVVLAHSVLLGPEFKLVFSIRPRSLTGILIHIGSQPGKHLCVYLEAGKVTASMDSGAG
GTSTSVTPKQSLCDGQWHSVAVTIKQHILHLELDTDSSYTAGQIPFPPASTQEPLHLGGA
PANLTTLRIPVWKSFFGCLRNIHVNHIPVPVTEALEVQGPVSLNGCPDH
NT seq 9870 nt   +upstreamnt  +downstreamnt
atggcggcggccgcgcggcctcggggtcgggcaccggggccagtcctgccgccgacgccg
ctgctgctgctggtactgcgggtgctgccagcctggggggcgaccgctcgggattccggg
gctgcggccgggctcagccttcacccgccttacttcaacctggccgaggcggcgaggatt
tgggccaccgccacctgcggggagaggggacccggcgaggggaggccccagcccgagctc
tactgcaagttggtcgggggccctaccgccccaggcagcggccacaccatccagggccag
ttctgtgactattgcaattctgaggaccccaggaaagcacatcctgtcaccaatgccatc
gatggatctgaacgttggtggcaaagccctcccctgtcctcaggcacacagtacaacaga
gtcaacctcaccttggatctggggcagctcttccatgtggcctatattttaatcaaattt
gcaaattctcctcgccctgacctttgggtcttggaaagatctgtagactttggaagcacc
tactcaccatggcaatattttgctcattctaaagtagactgtttaaaagaatttgggcgg
gaggcaaatatggctgtcacccgggatgatgatgtactttgtgttactgaatattcccgt
attgtacctttggaaaatggtgaggttgtggtgtccttgataaacggtcgtccaggtgca
aaaaattttactttctctcaaaccctgagggagtttaccaaggcaacaaacatccgcttg
cgttttcttcgaaccaatacgcttcttggacacctcatctccaaagcccagcgagatcca
actgtcactcggcggtattattacagcataaaggacatcagcattggtgggcagtgtgtt
tgcaatggccatgctgaagtgtgcaatgtaaacaatcctgaaaaactgtttcggtgtgaa
tgccagcaccacacctgtggggagacgtgtgatcgctgctgcacagggtacaatcagagg
cgctggcggcccgccgcgtgggagcagagccacgagtgtgaagcatgcaactgccacggc
catgccagcgactgttactatgatccagatgttgagcggcagcaggcaagcttgactacc
cagggcatctatgctggtggaggggtgtgcattaactgtcagcacaacacagctggagta
aactgtgaacagtgtgctaagggctattaccgcccttatggggttccagtggatgcccct
gatggctgcatcccctgcagctgtgaccctgagcatgcggatggctgtgaacagggctca
ggccgctgtcactgcaagccaaatttccatggagacaactgtgagaagtgtgcagttgga
tactacagtttcccattttgcttgagaattcccatttttcctgtttctatgccaagtcca
gaagatccagtagctggagatataaaaggcaactggtgtgactgtaatctggaaggcgtt
ctccccgaaatatgtgatgcccatggacggtgcctgtgtcaccctggggttgagggccct
cgatgtgatacctgccgctctggtttctactcattccctatttgccaaggttccagcagt
gcttgtgacccagctggtaccatcaactccaatttggggtattgccagtgcaagcttcat
gttgaaggtcctacttgtagcagctgcaaacagttatattggaatctggacaaagaaaac
cccaatggatgttcagaatgccagtgccataaggcgggaacagtgagtggaactggagag
tgtaggcagggagatggtgactgtcactgcaagtcccatgtgggtggcgattcctgtgac
acctgtgaagatggatattttgctttggaaaagagaaattactttgggtgtcaagggtgt
caatgtgacattggtggggcatcgtcctctgtgtgcagcgggccctccggagtgtgccag
tgccgagagcatgtcgtgggaaaggcgtgccagcagcctgaaaacaactactatttccca
gatttgcatcatatgaagtatgagattgaagatggcagcacacctaatgggagagacctt
cgatttggatttgatccgctggcatttcctgagtttagctggagaggatatgcccaaatg
acctcagtacagaatgatgtaagaataacattgaatgtagggaagtcaagtggctccttg
tttcgtgttattctgagatacgttaaccctggaactgaagcagtatctggccatataact
atttatccatcctggggtgctgctcaaagcaaagagatcatcttcctgccgagtaaggag
ccagcctttgtcaccgtccctggaaatggttttgcagacccattttcaatcacaccagga
atatgggttgcttgtattaaggcagaaggagtccttctggattacctggtgctgctcccc
aaggactactatgaagcctctgtactgcagctgccagtcacagaaccatgtgcctacgca
ggacctccccaagaaaattgcttgctctaccagcatttgccagtgaccagattcccctgt
accctggcttgtgaggccagacacttcctgcttgatggggagccaagacccttggcagtg
aggcagctcacacctgcacaccctgtcatggtggacctcagtgggagagaggtggaattg
catctgcggctgcgcgtcccacaggttggccactacgtggttgtggtcgagtattccacg
aaggcagcccagctgtttgtggttgatgtgaatgtgaagagccccgggtctgttctggca
ggccaggtgaacatttacagctgcaactacagtgttctctgccggagcgctgtgattgat
cacatgagccgcatcgccatgtatgagctattggcagatgcagacattcagctcaaggga
cacatggcccgattccttctgcatcaagtttgtatcatacccattgaagaattctcagct
gagtatgtgagaccgcaagtccactgcattgccagttacgggcaatttgtcaatcaaagt
gccacctgtgtctccctggcccacgaaactcctccaacagcattaattttggatggtcta
agtggcaggcctttccctcacctgccccagcagtcttcaccttctgttgatgttcttcct
ggggtcaccttgaaggcaccgcagaatcaagtgaccctgagaggacgtgtaccacacctg
ggccgatacgtctttgtcatccatttttaccaagcagcacacccgacgtttcccgcgcag
gtgtcagtggacggcgggtggccacgggcaggctccttccgtgcctctttttgcccccat
gtgcttggctgccgggatcaagtgattaccgaaggccagattgagtttgacatctcagag
cctgaagtggctgcaaccgtgaaggttccagaaggaaagtccttggttttggtccgtgtt
ctagtggtgcctgcagaaaactatgactaccaaatacttcacaaaaaatccatggacaag
tcactcgagtttatcaccaattgtggaaaaaacagcttttaccttgacccccagacagcc
tccagattctgtaagaattccgccaggtccctggtggccttttaccacaagggcgccctg
ccttgtgagtgccaccccactggggccaccggccctcactgcagccctgagggtgggcag
tgcccatgctggcccaacgtcattgggcggcagtgcacccgctgtgcaacaggccactac
ggattcccacgctgcaagccgtgcagctgtggtcggcgcctttgtgaagagatgacgggg
cagtgccactgccctccgcgcacggtcaggccccagtgtgaggtgtgtgagacacactca
ttcagcttccatcccatggccggctgcgaaggctgcaattgttccaggaggggcaccatc
gaggctgccatgccggagtgtgaccgggacagtgggcagtgcagatgcaagcccagaatc
acagggcggcagtgtgaccaatgtgcttccgggttttaccgcttccctgagtgtgttccc
tgcaattgcaacagagatgggactgagccaggagtgtgtgacccagggaccggagcttgc
ctctgcaaggaaaatgtagaaggcacagagtgtaatgtgtgtcgagaaggctcattccat
ttggacccagccaatctcaagggttgtaccagctgtttctgttttggagtaaataatcaa
tgtcacagctcacataagcgaaggactaagtttgtggatatgctgggctggcgcctggag
acagcagacagagtggacatccctgtctctttcaacccgggcagcaacagcatggtggca
gatctccaggagctgcctgcaaccgtcagcagcgcgtcctgggtcgcacccacctcctac
ctgggggacaaggtttcttcgtatggtggttacctcacttaccaagccaagtccttcggc
ttgcctggtgacatggttcttctggaaaagaagccggatgtacagctcactggtcagcac
atgtccatcatctatgaggagacaaacaccccacggccagaccggctgcatcatggacga
gtgcaggtggtcgagggaaacttcagacatgccagcagccgtgccccggtgtctagggag
gagctgatgacagtgctgtctagactggcagatgtgcgcatccaaggcctccacttcaca
gagactcaaaggctcaccctgagtgaggtggggctagaggaagcctctgacacaggaagt
ggacacatagcacttgctgtggaaatctgtgcctgcccccctgcctacgctggtgactct
tgtcagggttgtagccctggatactatcgggatcataaaggcttgtataccggacggtgt
gttccctgcaattgcaacggacattcaaatcgatgccaggatggctcaggcatatgtgtt
aactgtcagcacaacaccgcgggagagcactgtgaacgctgccaggagggctactatggc
aacgccatccacggatcctgcagggcctgcccatgtcctcacaccaacagctttgccacc
ggctgtgtggtgagtgggggagacgtgcggtgctcctgcaaagctgggtacacaggaaca
cagtgtgaaaggtgtgcaccgggatatttcgggaatccccagaaattcggaggtagctgc
caaccatgcagttgtaacagcaatggccagctgggcagctgtgatcccctgactggagac
tgcataaaccaagaacccaaagatagcagccctgcagaagaatgtgatgattgcgacagc
tgtgtgatgaccctcctgaacgacctggccaccatgggcgagcagctccgcctggtcaag
tctcagctgcagggcctgagtgccagcgcggggcttctggagcagatgaggcacctggag
acccaggccaaggacctaaggaatcagttgctcaactaccgttctgccatttcaaatcat
ggatcaaaaatagaaggcctggaaagagaactgactgatttgaatcaggaatttgagact
ttgcaagaaaaggctcaagtaaattccagaaaagcacaaacattatacaacaatgttaat
cgggcaacacaaagcgcaaaagaactggatgtgaagattaaaaatgtcatccggaatgtg
cacattctcttaaagcagatctctgggacagatggagagggaaacaacgtgccttcaggt
gacttttccagagagtgggctgaagcccagcgcatgatgagggaactgcggaacaggaac
tttggaaagcacctgagagaagcagaagctgataaaagggagtcgcagctcttgctgaac
cggataaggacctggcagaaaacccaccagggggagaacaatgggcttgctaacagtata
cgggattctttaaatgaatacgaagccaaactcagtgaccttcatgctcggctgcaggag
gcagctgcccgagccaagcaggcaaatggcttgaaccaagaaaacgagagagctttggga
gccatccagagacaagtgaaagaaataaattccctgcagagtgatttcaccaagtatcta
accactgcagactcatctttgttgcaaaccaacattgcgctgcagctgatggagaaaagc
cagaaggaatatgaaaaattagctgccagtttaaatgaagcaagacaagaactaagtgac
aaagtaagagaactttccagatctgctggcaaaacatcccttgtggaggaggcagaaaag
cacgcgcagtccttacaagagctggcaaagcagctggaagagatcaagagaaacgccagc
ggggatgagctggtgcgctgtgctgtggatgccgccaccgcctacgagaacatcctcaat
gccatcaaagcggctgaggatgcggccaacagggctgccagtgcatctgaatctgccctc
cagacagtgataaaggaagatctgccaagaaaagctaaaaccctgagttccagcagtgat
aaactgttaaatgaagccaagatgacacaaaagaagctgaagcaagaagtcagtccggct
ctcaacaacctacagaaaaccctgaatattgtgacagttcagaaagaagtgatagacacc
aatctcacaactctccgagatggtcttcgtgggatacagagaggtgatattgatgcgatg
atcagtagtgcaaagagcatggtcagaaaggccaacgacatcacagatgaggtgctggat
gggctcagccccattcagacagatgtggaaagaattaaggacacctatgggagcacacag
aacgaagacttcaaaaaggctctgactgatgcagataactcggtgaacaagttaaccaac
aaactacctgatctttggcgcaagattgaaagtatcaaccaacagctgttgcccttggga
aacatctctgacaacatggacagaatacgagaactaattcagcaggccagagatgctgcc
aataaggttgctgtccccatgaggttcaatggtaaatctggagtcgaagtccgactgcca
aatgacctggaagatttgaaaggatatacatctctgtccttgtttctccaaaggcccaac
tcaagagaaaatgggggaactgagaatatgtttgtgatgtaccttggaaataaagatgcc
tcccgggactacattggcatggcagttgtggatggccagctcacctgtgtctacaacctg
ggggaccatgagactgaactccaagcggaccagatcttgaccaagagtgaaactaaggag
gcagttatggatcgggtgaaatttcagagaatttatcagtttgcaaggcttaattacacc
aaaggagccacatccagtaaaccagaaacacccggagtctatgacatggatggtagaaat
agcaatacactccttaatttggatcctgaaaatgttgtattttatgttggaggttaccca
cctgattttaaacttcccagtcgactaagtttccctccatacaaaggttgtattgaatta
gatgacctcaatgaaaatgttctgagcttgtacaacttcaaaaaaacattcaatctcaac
acaactgaagtggagccttgtagaaggaggaaggaagagtcagacaaaaattattttgaa
ggtacgggctacgctcaagttccaactcaaccacatgctcccatcccaacctttggacag
acaattcagaccaccgtggatagaggcttgctgttctttgcagaaaacagggatcgcttc
atatctctgaatatagaagatggcaagctcatggtgagatacaaactgaattcagagcca
ccaaaagagagaggagttggagacgccataaacaacggcagagaccattcgattcagatc
aaaattggaaaactccaaaagcgtatgtggataaatgtggatgttcaaaacactataatt
gatggtgaagtctttgatttcagcacatattatctgggaggaattccaattgcaatcagg
gaaagatttaacatttctacgcctgctttccgaggctgcatgaaaaatttgaagaaaacc
agtggtgtcgttagattgaatgatactgtgggagtaaccaaaaagtgctcggaagactgg
aagcttgtgcgatctgcctcattctccagaggaggacaattgagtttcactgatttgggc
ttaccacctactgaccacctccaggcctcatttggatttcagacctttcaacccagtggc
atattattagatcatcagacacagacaagcagcctgcaggtcactctggaagatggttac
attgaattgagcaccagggatagcagcggcccaatttttaaatctccacagacgtatatg
gatggtttactgcattatgtatctgtaataagggacaactctggactacggcttctcatc
gatgaccagcctctgagaaataacgaaaggctaaaacgcatttcaagttcccagcagtct
ctgcgtctgggcgggagcaattttgagggttgtattagcaatgtttttgtccagaggttg
tcactgagtcccgaagtcctagatttgaccagtaactctctcaagagagatgtgtccctg
ggaagctgcagtttaaacaaaccaccttttctaatgttgcttaaaggttctaccaggttt
aacaagaccaagactttccgtatcaaccagctgttgcaggacacaccggtggcctcccca
aggagcatgaaggtgtggcaagatgcttgctcaccacttcccaagacccaggccaatcat
ggagccctccagtttggggacattcccaccagccacttgctattcaagcttcctcaggag
ttgctgaaacccaggtcacagtttgctgtggacatgcagacaacatcctccagaggactg
gtgtttcacacgggcgctaagaactcctttatggctctttatctttcaaaaggacgtctg
gtctttgcactggggacagatgggaaaaaattgaggatcaaaagcaaggagaaatgcaat
gatgggaaatggcacacggtggtgtttggccatgatggagaaaaggggcgcttggttgtg
gatggactgagggcccgggagggaagtttgcctggaaactccaccatcagcatcagagca
ccagtttacctgggatcacctccatcagggaaaccaaagagcctccccacaaacagcttt
gtgggatgcctgaagaactttcagctggattcaaaacccttggataccccttcttcaagc
ttcggggtgtcttcctgcttgggtggtcctttggagaaaggcatttatttctccgaagaa
ggaggtcatgtcgtcttggctcactctgtattgttggggccagaatttaagcttgttttc
agcatccgcccaagaagtctcactggaatcctaatacacatcggaagtcagcccgggaag
cacttatgtgtttacctggaggcaggaaaggtcacggcctctatggacagtggggcaggt
gggacctcgacgtcagtcacaccaaagcagtctctgtgtgatggacagtggcactcggtg
gcagtcaccataaaacaacacatcctgcacctggaactggacacagacagtagctacaca
gctggacagatccccttcccacctgccagcactcaagagccactacaccttggaggtgct
ccagccaatttgacaacactgaggatccctgtgtggaaatcattctttggctgtctgagg
aatattcatgtcaatcacatccctgtccctgtcactgaagccttggaagtccaggggcct
gtcagtctgaatggctgtcctgaccactaa

KEGG   Pongo abelii (Sumatran orangutan): 100441462
Entry
100441462         CDS       T01416                                 

Gene name
LAMC3
Definition
(RefSeq) laminin subunit gamma-3
  KO
K06247  laminin, gamma 3
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100441462 (LAMC3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100441462 (LAMC3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100441462 (LAMC3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100441462 (LAMC3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100441462 (LAMC3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100441462 (LAMC3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100441462 (LAMC3)
   05145 Toxoplasmosis
    100441462 (LAMC3)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 100441462
NCBI-ProteinID: XP_002820355
LinkDB
Position
9
AA seq 1575 aa
MAAAALLLGLALLALRAAGAGMGACYDGAGRPQRCLPVFENAAFGRLAQASHTCGSPPED
FCPHVGAAGAGTQCQRCDAADPQRHHNASYLTDFHSQDESTWWQSPSMAFGVQYPTSVNI
TLRLGKAYEITYVRLKFHTSRPESFAIYKRSRADSPWEPYQFYSASCQKTYGRPEGQYLR
PGEDERVAFCTSEFSDISPLSGGNVAFSTLEGRPSAYNFEESPGLQEWVTSTELLISLDR
LNTFGDDIFKDPKVLQSYYYAVSDFSVGGRCKCNGHASECGPDAAGQLACRCQHNTTGTD
CEHCLPFFQDRPWARGTAEAAHECLPCNCSGRSEECTFDRELFRSTGHGGRCHHCRDHTT
GPHCESCQENFYHWDPRMPCQPCDCHSAGSLHLQCDDTGTCACKPTVTGWKCDRCLPGFH
SLSEGGCRPCTCNPAGSLDTCDPRSGRCPCKEKVEGNLCDRCRPGSFNLQPHNPAGCSSC
FCYGHSKVCAATAQFQVHHILTDFHQGAEGWWARSVGGPEHPPQWSPNGVLLSPEDEEEL
TAPDKFLGDQRFSYGQPLILTFRAPPGNSPLPVQLRLEGAGLALSLRHSSLSGPQDAGHP
REVELRFHLQETSEDVAPPLTPFHFQRLLANLTSLRLRISPGPSPAGQVFLTEVRLTSAR
PGLSPPASWVETCSCPSGYTGQFCESCAPGYKREMPQGGPYASCVPCTCNQHGTCDPNTG
ICVCSHHTEGPSCERCLPGFYGNPFAGQADDCQPCPCPGQSACTTIPESGEVVCTHCPPG
QRGRRCEVCDDGFFGDPLGLSGHPQPCHQCQCSGNVDPNAVGNCDPLSGHCLRCLHNTTG
DHCEHCQEGFYRSALVPRPADKCMPCSCHPQGSVSEQMPCDPVTGQCSCLPHVTGRDCSL
CYPGFFDLQPGRGCRSCKCHPLGSQEDQCHPKTGQCTCRPGVTGQACDRCQLGFFGFSIK
GCWACRCSPLGAASAQCHENGTCVCRPGFEGYKCDRCHDNFFLTADGTHCQECPSCYALV
KEEAAKLKARLTLMEGWLQGSNCGSPWGPLDILLGEAPRGDVYQGHHLLPGAQEAFLEQI
MGLEGAVKATREQLQRLSKGARCAQARSQKTCTQLADLEAVLESSEEEILHAAAILSTLE
IPQEGPSQPTKWSHLATEARALARSHRDTATKIAATARRALLASNTSYALLWNLLEGRVA
LETQRDLEDRYQEVQAAQKALGTAMAEVLPEAESVLATVQRIGTDTAPYLALLASLGALP
QKSRAEDLGLKAKALEKTVASQQHMATEAARTLQAAAQAALRQTGPLTQLHQEARAALTQ
ASSSVQAATVTVMGARTLLADLEGMKLQFPRPKDQAALQRKADSVSDRLLADTRRKTKQA
ERMLGNAAPLSSSAKKKGREAELLAKDSAKLAKALLREGKQAHRRASRLTSQTQATLQQV
SRQALASEAHRQELEEAERVGAGLSEMEQQIRESRISLEKDIEALSELLARLGSLDTHQA
PAQALNETQWALERLRLQLGSPGSLQRKLSLLEQESQQQELQIQGFESDLAEIRADKQNL
EAILHSLPENCASWQ
NT seq 4728 nt   +upstreamnt  +downstreamnt
atggcggcggctgcgctcctgctggggctggcgctgctggcactgcgggcggccggcgcg
ggcatgggcgcgtgctatgacggcgcggggcgcccgcagcgctgcctgccggtgttcgag
aacgcggcgtttgggcggctcgcccaggcctcgcacacgtgcggcagcccgcccgaggac
ttctgtccccacgtgggcgccgcgggcgcggggactcagtgccagcgctgcgacgccgcc
gacccccagcgccaccacaacgcctcctacctcaccgacttccacagccaggacgagagc
acctggtggcagagcccgtccatggccttcggcgttcagtaccccacctcggtcaacatc
accctccgcctggggaaggcttacgagatcacgtacgtgaggctaaagttccacaccagt
cgccctgagagcttcgccatctacaagcgcagccgcgccgacagcccatgggagccctac
cagttctacagcgcctcctgccagaagacctacggccggcccgagggccagtacctgcgc
cccggcgaggacgagcgcgtggccttctgcacctctgagttcagcgacatctccccgctg
agtgggggcaatgtggccttctccaccctggagggccggcccagcgcctacaactttgag
gagagccctgggctgcaggagtgggtcaccagcaccgaactcctcatctctctagaccgg
ctcaacacgtttggggacgacatcttcaaggaccccaaggtgctccagtcctactactat
gctgtgtccgacttctctgtgggtggcaggtgcaagtgcaacgggcatgccagcgagtgc
ggccccgacgcggcgggccagttggcctgccggtgccagcacaacaccaccggcacggac
tgtgagcactgcctgcccttctttcaggaccgcccttgggcccggggcaccgctgaggct
gcccacgaatgtctgccctgcaactgcagtggccgctccgaggaatgcacatttgatcgg
gaactcttccgcagcacaggccacggcgggcgctgtcaccactgccgtgaccacacaact
gggccacactgtgagagctgtcaggagaatttctatcactgggacccgcggatgccatgc
cagccctgtgactgccactcggcaggctccctacacctccagtgcgatgacacaggcacc
tgcgcctgcaagcccactgtgactggctggaagtgtgaccgctgtctgcccgggttccac
tcgctcagtgagggaggctgcagaccctgcacctgcaatcccgctggcagcctggacacc
tgtgacccccgcagtgggcgctgcccctgcaaagagaaggtggaaggcaacctatgtgac
agatgtcgacccgggtcctttaacctgcagccgcacaatccggccggctgcagcagctgt
ttctgctatggccactccaaggtgtgcgcggccactgcccagttccaggtgcatcacatc
ctcaccgatttccaccagggagccgaaggctggtgggccagaagtgtggggggccccgag
caccccccacaatggagcccaaatggggtcctcctgagcccagaagatgaggaggagctt
acagcaccagataagttcctgggagaccaacggttcagctacgggcagcccctcatcctg
accttccgggcgccccctgggaactccccactccctgtgcagctgaggctggaaggggca
ggcttggccctgtccctgaggcactctagcctgtctggcccccaggatgctgggcatccc
agggaggtagagctcaggttccacctgcaggagacctccgaggacgtggcccctccactc
acccccttccacttccagcggctgctcgccaacctgaccagcctccgcctccgcatcagt
ccgggccccagccctgccggccaagtgttcctgactgaggtccggctcacatccgcccgg
ccagggctttccccgccagcctcctgggtggagacttgttcatgtcccagtggctacacg
ggccagttctgtgaatcctgtgctccgggatacaagagggagatgccacaggggggtccc
tatgccagctgtgtcccctgcacctgtaaccagcatggcacctgtgaccccaacacaggg
atctgcgtctgcagccaccataccgagggcccatcctgtgaacgctgcttgccaggtttc
tatggcaaccctttcgcgggccaagctgacgactgccagccctgtccctgccctggccag
tcggcctgcacgaccatcccagagagcggggaggtggtgtgcacccactgccccccgggc
cagagagggcggcgctgtgaggtctgtgatgatggtttttttggggacccgctggggctc
tctgggcatccccagccctgccaccagtgccagtgcagcgggaacgtggaccccaatgcc
gtgggcaactgtgaccccctgtctggccactgcctgcgctgcctgcacaacaccacgggt
gaccactgtgagcactgtcaggaaggcttctacaggagcgccctggtccctcggcccgca
gacaaatgcatgccttgcagctgtcacccacagggctcggtcagtgagcagatgccctgc
gacccagtgacaggccaatgctcctgcctgcctcacgtgactggacgggactgcagcctc
tgctaccctggcttcttcgacctgcagcctgggaggggctgccggagctgcaagtgtcac
ccactgggctcccaggaggaccagtgccatcctaagactggacagtgcacctgccgccca
ggtgtcacaggccaggcctgtgacagatgccagctgggtttcttcggcttctccatcaag
ggctgctgggcctgcaggtgctccccactgggcgctgcctccgcccagtgccacgagaac
ggcacatgcgtgtgcaggcctggcttcgagggctacaaatgtgaccgctgccacgacaac
ttcttcctcacggcagacggcacacactgccaggaatgtccgtcctgctacgccctggtg
aaggaggaggcagccaagctgaaggccagactgactttgatggaggggtggctgcaaggg
tccaactgtggcagtccctggggaccactagacattctgctgggagaggccccaaggggg
gacgtctaccagggccaccacctgcttccaggggctcaggaagccttcctggagcagatc
atgggcctcgagggtgctgtgaaggccacccgggagcagctgcagaggctgagcaagggt
gcccgctgtgcccaggccagatcccagaagacctgcacccagctggcagacctggaggca
gtgctggagtcctctgaagaggagattctgcacgcagctgccattctctcaactctggag
attcctcaggaaggtcccagtcagccgaccaaatggagccacctggccacagaggcccgt
gccctcgccaggagccacagagacaccgccaccaagatcgcagccactgctcggagggcc
ctgctcgcctccaataccagctacgcgcttctctggaatctgctggagggaagggtggct
ctagagacccagcgggacctggaggacaggtaccaggaggtccaggcggcccagaaagca
ctggggacggctatggcagaggtgctgcctgaagcggaaagcgtgttggccaccgtgcag
cgaattggcacagatacagccccgtacctggccttgctggcttccctgggagctctgcct
cagaagtcccgggctgaagacctgggcctgaaggcgaaggccctggagaagacagttgca
tcacaacagcacatggccactgaggctgcccgaaccctccaggctgctgcccaggcggcg
ctacggcaaacaggacccctcacacagctgcatcaggaggccagagccgccctgacccag
gcttcctcgtctgtccaggctgccacagtgaccgtcatgggagccaggactctgctggct
gacctggaaggaatgaagctgcagtttccccggcccaaggaccaggcggcattgcagaga
aaggcagactccgtcagcgacagactccttgcagacacgagaaggaagaccaagcaggcg
gagaggatgctgggaaatgcggcccctctttcctccagtgccaagaagaagggcagagaa
gcagagctgttggccaaggacagtgccaagcttgccaaggccttgctgagggaggggaaa
caggcgcaccgccgtgccagccggctcaccagccagacgcaagccacgctccaacaggtg
tcccggcaggcgctggcgtctgaagcacacagacaggagctggaggaagctgagcgggtg
ggtgctgggctgagcgagatggagcagcagatccgggaatcgcgtatctcactggagaag
gacatcgaggccttgtcagagctgcttgccaggctggggtcgctggacacccatcaagcc
ccagcccaggccctgaacgagactcagtgggcactagaacgcctgaggctgcagctgggc
tctccggggtccttgcagaggaaactcagtctgctggagcaggaatcccagcagcaggag
ctgcagatccagggcttcgagagtgacctcgccgagatccgcgccgacaaacagaacctg
gaggccattctgcacagcctgccggagaactgtgccagctggcagtga

KEGG   Pongo abelii (Sumatran orangutan): 100441641
Entry
100441641         CDS       T01416                                 

Gene name
LAMB2
Definition
(RefSeq) laminin subunit beta-2 isoform X1
  KO
K06243  laminin, beta 2
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100441641 (LAMB2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100441641 (LAMB2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100441641 (LAMB2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100441641 (LAMB2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100441641 (LAMB2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100441641 (LAMB2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100441641 (LAMB2)
   05145 Toxoplasmosis
    100441641 (LAMB2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N F5_F8_type_C Sec5
Other DBs
NCBI-GeneID: 100441641
NCBI-ProteinID: XP_024100812
Ensembl: ENSPPYG00000013880
LinkDB
Position
3
AA seq 1802 aa
MELTSREGGRGQPLPWELRLGLLLSVLAATLAQAPAPDVPGCSRGSCYPATGDLLVGRAD
RLTASSTCGLNGPQPYCIVSHLQDEKKCFLCDSRRPFSARDNPHSHRIQNVVTSFAPQRR
AAWWQSENGIPAVTIQLDLEAEFHFTHLIMTFKTFRPAAMLVERSADFGRTWHVYRYFSY
DCGADFPGVPLAPPRHWDDVVCESRYSEIEPSTEGEVIYRVLDPAIPVPDPYSSRIQNLL
KITNLRVNLTRLHTLGDNLLDPRREIREKYYYALYELVVRGNCFCYGHASECAPAPGAPA
HAEGMVHGACICKHNTRGLNCEQCQDFYHDLPWHPAEDGYSHACRKCECHGHTHSCHFDM
AVYLASGNVSGGVCDGCQHNTAGRHCELCRPFFYRDPTKDLRDPAVCRSCDCDPMGSQDG
GRCDSHDDPALGLVSGQCRCKEHVVGTRCQQCRDGFFGLSISDPLGCRRCQCNARGTVPG
STPCDPSSGSCYCKRLVTGRGCDRCLPGHWGLSHDLLGCRPCDCDVGGALDPQVPYRCDE
ATGQCHCRQHMVGRRCEQVQPGYFRPFLDHLIWEAEDTRGQVLDVVERLVTLGETPSWTG
SGFVRLQEGQTLEFLVASVPKAMDYDLLLRLEPQVPEQWAELELIVQRPGPVPAHSLCGH
LVAKDDRIQGTLQPHARYMMFPNPVCLEPGISYKLHLKLVRTGGSAQPETPYSGPGLLID
SLVLLPHVLVLEMFSGGDAAALERRATFERYQCHEEGLVPSKTSPSEACAPLLISLSTLI
YNGALPCQCNPQGSLSSECNPHGGQCLCKPGVVGRCCDLCASGYYGFGPTGCQACQCSHE
GALSSLCEKTSGQCPCRTGAFGLRCDRCQRGQWGFPSCRPCVCNGHADECNTHTGACLGC
RDHTGGEHCERCIAGFHGDPRLPYGGQCRPCPCPEGPGSQRHFATSCHRDEYSQQIVCHC
RAGYTGLRCEACAPGHFGDPSRPGGRCQLCECSGNIDPMDPDACDPHTGQCLRCLHHTEG
PHCAHCKPGFHGQAARQSCHRCTCNLLGTNPQQCPSPDQCHCDPSSGQCPCLPNVQGPSC
DRCAPNFWNLTSGHGCQPCACHPSRARGPTCNEFTGQCHCRAGFGGRTCSECQELHWGDP
GLQCRACDCDSRGIDTPQCHRSTGHCSCRPGVSGVRCDQCARGFSGIFPACHPCHACFGD
WDRVVQDLAARTRRLEQRAQELQQTGVLGAFESSFWHMQEKLGIVQGIVGARNTSAASTA
QLVEATEELRREIGEATEHLTQLEAELTDVQDENFNANHALSGLERDRIALNLTLRQLHQ
HLDLLKHSNFLGAYDSIRHAHRQSAEAERRANTSALAVPSPVSNSASARHRTEALMDAQK
EDFNSKHMANQRALGKLSAHTHTLSLTDINELVCGAPGDAPCATSPCGGASCRDEDGQPR
CGGLSCNGAAATADLALGRARHTQAELQRALAEGGSILSRVAETRRQASEAQQRAQAALD
KANASRGQVEQANQELRELIQSVKDFLNQEGADPDSIEMVATRVLELSIPASAEQIQHLA
SAIAERVRSLADVDAILARTVGDVRRAEQLLQDARRARSRAEDEKQKAETVQEALEEAQR
AQGVAQGAIRGAVADTQDTEQTLYQVQERMAGAEQALSSAGERARQLDALLEALKLKRAG
NSLAASTAEETAGSAQGRAQEAEQLLRGPLGDQYQTVKALAERKAQGVLAAQARAEQLRD
EARDLLQAAQDKLQRLQELEGTYEENERALESKAAQLDGLEARMRSVLQAINLQVQIYNT
CQ
NT seq 5409 nt   +upstreamnt  +downstreamnt
atggagctgacctcaagggaaggagggaggggacagcctctgccctgggaacttcgactg
ggcctactgctaagcgtgctggctgccacactggcacaggcccctgccccggatgtgcct
ggctgttccaggggaagctgctaccccgccacgggcgacctgctggtgggccgagctgac
agactgactgcctcatccacctgtggcctgaatggcccccagccctattgcatcgtcagt
cacctgcaggacgaaaagaagtgcttcctttgtgactcccggcgccccttctctgctaga
gacaacccacacagccatcgcatccagaatgtagtcaccagctttgcaccacagcggcgg
gcagcctggtggcagtcagagaatggtatccctgcggtcaccatccagctggacctggag
gctgagtttcatttcacacacctcattatgaccttcaagacatttcgccctgctgccatg
ctggtggaacgctcagcagactttggccgcacctggcatgtgtaccgatatttctcctat
gactgtggggctgacttcccaggagtcccactagcccccccacggcactgggatgatgta
gtctgtgagtcccgctactcagagattgagccatccactgaaggcgaggtcatctatcgt
gtgctggaccctgccatccctgtcccagacccctacagctcacggattcagaacctgttg
aagatcaccaacctacgggtgaacctgactcgtctacacacgttgggagacaacctactc
gacccacggagggagatccgagagaagtactactatgccctctatgagctggttgtacgt
ggcaactgcttctgctacggacacgcctcagagtgtgcacccgccccaggggcaccagcc
catgccgagggcatggttcacggagcttgcatctgcaaacacaacacacgtggcctcaac
tgtgagcagtgtcaggatttctatcatgacctgccctggcatccagctgaggacggctat
agtcatgcctgtaggaagtgtgagtgccatgggcacacccacagctgccacttcgacatg
gccgtatacctggcatctggcaacgtgagtggaggtgtgtgtgatggatgtcagcataac
acagctgggcgccactgtgagctctgtcggcccttcttctaccgtgacccaaccaaggac
ctgcgggatccggctgtgtgccgctcctgtgattgtgaccccatgggttctcaagacggt
ggtcgctgtgattcccatgatgaccctgcactgggactggtctccggccagtgtcgctgc
aaagaacatgtggtgggcactcgctgccagcaatgccgtgatggcttctttgggcttagc
atcagtgaccctctgggctgccggcgatgtcaatgtaatgcacggggcacagtgcctggg
agcactccttgtgaccccagcagtggatcctgttactgcaaacgtctagtgactggacgt
ggatgtgaccgctgcctgcctggccactggggcctgagccacgacctgcttggctgccgc
ccctgtgactgcgacgtgggtggtgccttggatccccaagttccctacaggtgtgatgag
gccacaggtcaatgccactgccgccagcacatggttgggcgacgctgtgagcaggtgcaa
cctggctacttccggcccttcctggaccacctaatttgggaggctgaggacacccgaggg
caggtgctcgatgtggtggagcgcctggtgacccttggggaaactccatcctggactggc
tcaggctttgtgcggctacaggaaggtcagaccctggagttcctggtggcctctgtgccg
aaggccatggactatgacctgctgctgcgcttagagccccaggtccctgagcaatgggca
gagttggaactgattgtgcagcgtccagggcctgtgcctgcccacagcctgtgtgggcat
ttggtggccaaggatgaccgcatccaagggactctgcaaccacacgccaggtacatgatg
tttcctaatcctgtctgccttgagcctggtatctcctacaagctgcatctgaagctggta
cggacagggggaagtgcccagcctgagactccctactctggacctggcctgctcattgac
tcgctggtgctgctgccccatgtcctggtgctagagatgtttagtgggggtgatgctgct
gccctggagcgccgggccacctttgaacgctaccaatgccatgaggagggtctggtgccc
agcaagacttctccctctgaggcctgcgcgcccctcctcatcagcctgtccacccttatc
tacaatggtgccctgccatgtcagtgcaaccctcaaggttcactgagttctgagtgcaac
cctcatggtggtcagtgcctgtgcaagcctggagtggttgggcgctgctgtgacctctgt
gcctctggctactatggctttggccccacaggctgtcaagcctgccagtgcagccatgag
ggggcactcagcagtctctgtgaaaagaccagtgggcaatgtccctgtcgaactggtgcc
tttgggcttcgctgtgaccgctgccagcgtggccagtggggattccctagctgccggcca
tgtgtctgcaatgggcatgcagatgagtgcaacacccacacaggcgcttgcctgggctgc
cgtgatcacacagggggtgagcactgtgaaaggtgcattgctggtttccatggggaccca
cggctgccatatgggggccagtgccggccctgtccctgtcctgaaggccctgggagccaa
cggcactttgctacttcttgccaccgggatgaatattcccagcagattgtgtgccactgc
cgggcaggctacacggggctgcgatgtgaagcttgtgcccctgggcactttggggaccca
tcaaggccaggtggccggtgccaactgtgtgagtgcagtgggaacattgacccaatggat
cctgatgcctgtgacccccacacggggcaatgcctgcgctgtttacaccacacagagggt
ccacactgtgcccactgcaagcctggcttccatgggcaggctgcccgacagagctgtcac
cgctgcacatgcaatctgctgggcacaaatccccagcagtgcccatctcctgaccagtgc
cactgtgatccaagcagtgggcagtgcccatgcctccccaatgtccagggccctagctgt
gaccgctgtgcccccaacttctggaacctcaccagcggccatggttgccagccttgtgcc
tgccacccaagccgggccagaggccccacctgcaatgagttcacagggcagtgccactgc
cgtgcaggctttggagggcggacttgttctgagtgccaagagctccactggggagaccct
gggttgcagtgccgtgcctgtgattgtgactctcgtggaatagatacacctcagtgtcac
cgctccacaggccactgcagctgccgcccaggggtgtctggtgtgcgctgtgaccagtgt
gcccgtggcttctcaggaatctttcctgcctgccatccctgccatgcatgcttcggggat
tgggaccgagtggtgcaggacttggctgcccgtacacggcgcctagagcagcgggcgcag
gagttgcagcagacgggtgtgctgggtgcctttgagagcagcttctggcatatgcaggag
aagctgggcattgtgcagggcatcgtaggtgcccgcaacacctcagccgcctccaccgca
cagcttgtggaggccacagaggagctgcggcgtgaaattggggaggccactgagcacctg
actcagctcgaggcagagctgacagatgtgcaagatgagaacttcaatgccaaccatgca
ctaagcggtctggagcgagacaggattgcacttaatctcacactgcggcagctccaccag
catcttgacttgctcaaacattcaaacttcctgggtgcctatgacagcatccggcatgcc
cataggcagtctgcagaggcagaacgtcgtgccaatacctcagccctggcagtacctagc
cctgtgagcaattcggcaagtgctcggcatcggacagaggcactgatggatgctcagaag
gaggacttcaacagcaaacacatggccaaccagcgggcactcggcaagctctctgcccat
acccacaccctaagcctaacagacataaatgagctggtgtgtggggcaccaggggatgca
ccctgtgctacaagcccttgtgggggtgccagctgtcgagatgaggatgggcagccccgc
tgtgggggcctcagctgcaatggggcagcggctacagcagacctagcactgggccgggcc
cggcacacacaggcagagctgcagcgggcactggcagaaggtggtagcatcctcagcaga
gtggctgagactcgtcggcaggcaagcgaggcacagcagcgggcccaggcagccctggac
aaggctaatgcttccaggggacaggtggaacaggccaaccaggaacttcgagaacttatc
cagagtgtgaaagacttcctcaaccaggagggggctgatcctgatagcattgaaatggtg
gccacacgggtgctagagctctccatcccagcttcagctgagcagatccagcacctggca
agtgcaattgcagagcgagtccggagcctggcggatgtggacgcgatcctggcacgtact
gtaggagatgtgcgtcgtgccgagcagctactgcaggatgcacggcgggcaaggagccgg
gctgaggatgagaaacagaaggcagagacagtacaggaagcactggaggaggcccagcgg
gcacagggtgttgcccagggtgccatccggggggcagtggctgacacacaggacacagag
cagaccctgtaccaggtacaggagaggatggcaggtgcagagcaggcactgagctctgca
ggtgaaagggctcggcagttggatgctctcctggaggctctgaaattgaaacgggcggga
aatagtctggcagcctctacagcagaagaaacagcaggcagtgcccagggtcgtgcccag
gaggctgagcagctgctacggggccctctgggtgatcagtaccagacggtgaaggccctg
gctgagcgcaaggcccagggtgtgctggctgcacaggcaagggcagaacaactgcgggat
gaggctcgggacctgttgcaagccgctcaggacaagctgcagcggctacaggaattggaa
ggcacctatgaggaaaatgagcgggcactggagagtaaggcagcccagctggacgggttg
gaggccaggatgcgcagcgtgcttcaagccatcaacttgcaggtgcagatctacaacacc
tgccagtga

KEGG   Pongo abelii (Sumatran orangutan): 100443331
Entry
100443331         CDS       T01416                                 

Gene name
THBS3
Definition
(RefSeq) thrombospondin-3
  KO
K04659  thrombospondin 2/3/4/5
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04145  Phagosome
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05144  Malaria
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100443331 (THBS3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100443331 (THBS3)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    100443331 (THBS3)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100443331 (THBS3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100443331 (THBS3)
  09174 Infectious disease: parasitic
   05144 Malaria
    100443331 (THBS3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pon04131]
    100443331 (THBS3)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100443331 (THBS3)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100443331 (THBS3)
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100443331 (THBS3)
Membrane trafficking [BR:pon04131]
 Endocytosis
  Phagocytosis
   Opsonins
    100443331 (THBS3)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100443331 (THBS3)
  Exosomal proteins of colorectal cancer cells
   100443331 (THBS3)
  Exosomal proteins of bladder cancer cells
   100443331 (THBS3)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100443331 (THBS3)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   100443331 (THBS3)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA cEGF EGF_3 EGF Laminin_G_2 EGF_MSP1_1
Other DBs
NCBI-GeneID: 100443331
NCBI-ProteinID: XP_024103077
Ensembl: ENSPPYG00000000753
LinkDB
Position
1
AA seq 956 aa
METQELRGALALLLLCSFASASQDLQVIDLLTVGESRQMVAVAEKIRTALLTAGDIYLLS
TFRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQREDGKVHAVNLQQAGLADG
RTHTVLLRLRGPSRPSPALHLYVDCKLGDQHAGLPALAPIPPAEVDGLEIRTGQKAYLRM
QGFVESMKIILGGSMARVGALSECPFQGDESIHSAVTNALHSILGEQTKALVTQLTLFNQ
ILVELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRSHCSPSPCFRGVDCMEVYEYPGYRC
GPCPPGLQGNGTHCSDINECAHADPCFPGSSCINTMPGFHCEACPRGYKGTQVSGVGIDY
ARASKQVCNDIDECNDGNNGGCDPNSICTNTVGSFKCGPCRLGFLGNQSQGCLPARTCHS
PAHSPCHIHAHCLFERNGAVSCQCNVGWAGNGNVCGTDTDIDGYPDQALPCMDNNKHCKQ
DNCLLTPNSGQEDADNDGVGDQCDDDADGDGIKNVEDNCRLFPNKDQQNSDTDSFGDACD
NCPNVPNNDQKDTDGNGEGDACDNDVDGDGIPNGLDNCPKVPNPLQTDRDEDGVGDACDS
CPEMSNPTQTDADSDLVGDVCDTNEDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECD
GDDDNDGIPDYMPPGPDNCRLVPNPNQKDSDGNGVGDVCEDDFDNDAVVDPLDVCPESAE
VTLTDFRAYQTVVLDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGT
FHVNTVTDDDYAGFLFSYQDSGRFYVVMWKQTEQTYWQATPFRAVAQPGLQLKAVTSVSG
PGEHLRNALWHTGHTPDQVRLLWTDPRNVGWRDKTSYRWQLLHRPQVGYIRVKLYEGPQL
VADSGVIIDTSMRGGRLGVFCFSQENIIWSNLQYRCNDTVPEDFEPFRRQLLQGRV
NT seq 2871 nt   +upstreamnt  +downstreamnt
atggagacgcaggaacttcggggggccctggctcttctcctcctttgctctttcgcatct
gccagtcaggatctgcaggtaattgacctgctgactgtgggcgagtctcggcagatggta
gctgtggcagagaagatccggacagccctgctcactgctggggacatctacctcttgtcc
acctttcgcctgccccccaagcagggtggtgtcctctttggcctctattctcgccaagac
aacactcgatggttggaggcctctgttgtaggcaagatcaacaaagtactggtgcgatac
cagcgggaggatggcaaagtccacgccgtgaacctacagcaagcaggcctggctgatggg
cgcacacacacagttctcctgcgactccgaggtccctccagacccagccctgccctacat
ctctacgtggactgcaaactgggtgaccaacatgcaggccttccagcactggcccccatt
ccgccagcggaggtcgatgggctggaaattaggactggacagaaggcgtatttgaggatg
cagggctttgtggaatctatgaaaattattctgggtgggtccatggcccgggtaggagcc
ctgagtgagtgtccattccaaggggacgagtccatccacagtgcagtgaccaatgcactg
cactccattctaggggagcagaccaaggcgctggtcacccaactcaccctcttcaaccag
atcctggtggagctgcgggatgatatacgagaccaggtgaaggaaatgtccttgatccga
aacaccattatggagtgtcaggtgtgcggcttccatgagcagcgttcccactgcagcccc
agtccctgcttccgaggcgtggactgcatggaagtgtacgagtacccaggctaccgctgt
gggccctgcccccccggcctgcagggcaacggcacccactgcagtgacatcaatgagtgt
gctcatgctgacccctgtttcccgggctccagctgcatcaacaccatgcccggcttccac
tgtgaggcctgtcctcgagggtacaagggcacacaggtgtctggtgtgggcattgactat
gcccgggccagcaaacaggtctgcaatgacatcgatgaatgcaacgatggcaacaatggt
ggctgtgacccaaactccatctgcaccaacactgtgggctctttcaagtgtggtccctgc
cgcctaggtttcctgggcaaccagagccagggctgcctcccagcccggacctgccacagc
ccagcccacagcccctgccacatccatgctcactgtctctttgaacgcaatggtgcagtg
tcctgccagtgtaacgtgggctgggctgggaatgggaacgtgtgtgggactgacacagac
atcgatggctacccagaccaagcactgccctgcatggacaacaacaaacactgcaaacag
gacaactgccttttgacacccaactctgggcaggaagatgctgataatgatggtgtgggg
gaccagtgtgatgacgatgctgacggggatgggatcaagaatgttgaggacaactgccgg
ctgttccccaacaaagaccagcagaactcagatacagattcatttggtgatgcctgtgac
aattgtcccaacgttcccaacaatgaccagaaggacacagatggcaatggggaaggagat
gcctgtgacaacgacgtggatggggatggcatccccaatggattggacaattgccctaaa
gtccccaacccactacagacagacagggatgaggatggggtgggagatgcttgcgacagc
tgccctgaaatgagcaatcctacccagacagatgcagacagcgacctggtgggggatgtc
tgtgataccaatgaagacagcgatggggatgggcatcaggacaccaaggacaactgccca
cagctgccaaatagctcccagctggactctgataacgacggacttggagatgagtgtgat
ggggatgatgacaatgatggcatcccagattacatgcctcctggtcctgataactgccgc
ctggtacccaatcccaatcagaaggactcagatggcaatggcgttggtgatgtgtgtgag
gatgactttgacaatgatgctgtggtcgaccccctggatgtgtgtcccgaaagtgcagag
gtaacgcttacggattttcgggcctatcagaccgtcgtcctggatcctgagggtgatgct
cagattgacccaaactgggttgtgctcaaccagggcatggaaatcgttcagaccatgaac
agtgaccctggcttggcagttggatacacggccttcaatggtgtggactttgaaggcacc
ttccatgtgaacacagtgactgatgatgactacgcaggctttctcttcagttatcaagac
agtggccgcttctacgtagtcatgtggaagcagaccgagcagacctactggcaggctaca
cctttccgggcagttgcccagcccgggctgcagctcaaggcagtgacatcagtgtctggc
ccaggtgagcacctccgaaatgccctgtggcatactggccacacccccgatcaggtacga
ctgctgtggacagacccacgaaatgtgggctggcgggacaagacctcctatcgctggcag
cttctgcaccggcctcaagttggctacattcgggtgaagctctatgagggaccccagctt
gtggcggattctggggtaatcattgacacatccatgcgaggggggcgtcttggtgtattc
tgcttctcccaagaaaacataatttggtccaatctccagtatcgatgcaatgacacagtg
cctgaggactttgagccattccggaggcagctgctccagggaagggtgtga

KEGG   Pongo abelii (Sumatran orangutan): 100443502
Entry
100443502         CDS       T01416                                 

Gene name
TNXB
Definition
(RefSeq) LOW QUALITY PROTEIN: tenascin-X
  KO
K06252  tenascin
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05165  Human papillomavirus infection
pon05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100443502 (TNXB)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100443502 (TNXB)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100443502 (TNXB)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    100443502 (TNXB)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100443502 (TNXB)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100443502 (TNXB)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100443502 (TNXB)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_2 EGF_Tenascin DUF2369 DUF4959 hEGF
Other DBs
NCBI-GeneID: 100443502
NCBI-ProteinID: XP_024104015
Ensembl: ENSPPYG00000016478
LinkDB
Position
6
AA seq 4244 aa
MMPAQYALTSSLVLLVLLSTARAGPFPSRSNVTLPAPRPPPQPGGRTVGAGVGSPSSQLY
EHTVEGGEKQVVFTHRINLPPSTGCGCPPGTEPPVPASEVQALRVRLEILEELVKGLKEQ
CTGGCCPASAQAGTGQTDVRTLCSLHGVFDLSRCTCSCEPGWGGPTCSDPTDAEIPPSSP
PSASGSCPDDCNDQGRCVRGRCVCFPGYTGPSCGWPSCPGDCQGRGRCVQGVCVCRAGFS
GPDCSQRSCPRGCSQRGRCEDGRCVCDPGYTGDDCGMKSCPRGCSQRGRCENGRCVCNPG
YTGEDCGVRSCPRGCSQRGRCEDGRCVCDPGYTGEDCGTRSCPWDCGEGGRCVDGRCVCW
PGYTGEDCSTRTCPRDCRGRGRCEDGECICDTGYSGDDCGVRSCPGDCNQRGRCEDGRCV
CWPGYTGTDCGSRACPRDCRGRGRCENGVCVCNAGYSGEDCGVRSCPGDCRGRGRCESGR
CVCWPGYTGRDCGTRACPGDCRGRGRCVDGRCVCNPGFTGEDCGSRRCPGDCRGHGRCED
GVCVCDADHSGEDCSTRSCPGGCRGRGQCLDGRCVCEDGYSGEDCGVRQCPNDCSQHGVC
QDGVCICWEGYVGEDCSIRTCPSNCHGRGRCEEGRCLCDPGYTGPTCATRMCPADCRGRG
RCVQGVCLCHVGYGGEDCGQEEPPASACPGGCGPRELCRAGQCVCVEGFRGPDCAIQTCP
GDCRGRGECHDGSCVCKDGYAGEDCGEEVPTIEGMRMHLLEETTVRTEWTPAPGPVDAYE
IQFIPTTEGASPPFTARVPSSASAYDQRGLAPGQEYQVTVRALRGTSWGLPASKTITTMI
DGPQDLRVVAVTPTTLELGWLRPQAEVDRFVVSYVSAGNQRVRLEVPPEADGTLLTDLMP
GVEYVVTVTAERGRAVSYPASVRANTGSSPSGLLGTTDEPPPSGPSTTQGAQAPLLQQRP
QELGELRVLGRDETGRLRVVWTAQPDTFAHFQLRLRVPEGPGAHEEVLPGDVRRALVPPP
PPGTTYELSLRGVPPGGKPSDPIIYQGVMDKDEEKPGKSSGPPRLGELTVTDRTSDSLLL
RWTVPEGEFDSFVIQYKDRDGQPQVVPVEGPQRSAVITSLDPGRKYKFVLYGFVGKKRHG
PLVAEAKILPQSDPSPGTPPRLGNLWVTDPTPDSLHLSWTVPEGQFDTFMVQYRDRDGRP
QVVPVEGPERSFVVSSLDPDHKYRFTLFGIANKKRYGPLTADGTTAPERKEEPPHPEFLE
QPLLGELTVTGVTPDSLRLSWTVAQGPFDSFMVQYKDAQGQPQAVPVAGDENEVTVPGLD
PDRKYKMNLYGLRGRQRVGPESVVAKTAPQEDVDETPSPTELGTEAPESPEEPLLGELTV
TGSSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQAVRVGGKESEVTVGGLEPGHKYKMHL
YGLHKGQRVGPVSAMGVTAPQQEETPPATESPLEPRLGELTVTDVTPNSVGLSWMVPEGQ
FDSFMVQYKDKDGQPQVVPVAADQREVTVYNLEPERKYKMNMYGLHDGQRMGPLSVVIMT
APLPTAPATEASKPPLEPRLGELTVTDVTPDSVGLSWTVPEGEFDSFVVQYKDRDGQPQV
VPVAADQREVTIPDLEPSRKYKFLLFGIQDGKRRSPVSVEAKTVARGDASPGAPPRLGEL
WVTDPTPDSLRLSWTVPEGQFDSFVVQFKDKDGPQVVPVEGHERSVTVTPLDAGRKYRFL
LYGLLGKKRHGPLTADGTTEARSAMDDTGTKRPLKPRLGEELQVTSVTQNSVGLSWTVPE
GQFDSFVVQYKDRDGQPQVVPVEGSLREVSVPGLDPAHRYKLLLYGLHGSKRVGPISAVA
VTAGREEMETETTAPTSPAPEPHLGELTVEEATPHTLHLSWMVTEGEFDSFEIQYTDRDG
QLQMVRTGGDRNDITLSGLESDHRYLVTLYGFRDGKHVGPVRVEALTVPEEEEPSEPPTA
TTEPPIKPHLGELTVTDATPDSLSLSWTVPEGQFGHFLVQYRNGDGQPKAVRVPGHEDGV
TVSGLEPDHKYKMNLYGFHGGQRVGPVSVIGVTAAEEETPSPTEPSMEAPEPPEEPLLGE
LTVTGSSPDSLSLSWTIPQGRFDSFTVQYKDRDGRPQVVRVRGEESEVTVGGLEPGRKYK
MHLYGLHEGQRMGPVSAVGVTAPEEESPDAALVKPRLGEMTVTDVTSDSLSLSWTVPEGQ
FDHFLVQYKNVDGQPKAVRVPGYEDGVSISGLEPDHKYKMNLYGFHGGQRVGPVSAVGLT
APGKDEEMAPDSTEPPTPEPPIKPRLEELTVTDVTPDSLSLSWKVPEGQFDHFLVQYKNG
DGQPKATRVPGHEDRVTISGLEPDHKYKMNLYGFHGGQRVGPVSVIGVTAAEEETPRPTE
PSTEAPEPPEEPLLGELTVTESSPDSLSLSWTVPQGHFDSFTVQYKDRNGQPQVVRVGGE
ESEVTVGGLEPGRKYKMHLYGLHEGQRVGPVSTVGLTAPREDVDKTPSPTEPGTEAPEPP
AEPLLGELTVTGSSPDSLSLSWTIPQGRFDSFTVQYKDRDGRPQAVRVGGQESEVTVGGL
EPGRKYKMHLYGLHEGQRLGPVSAVGITAPEDEAETTQAVPTTTPESPIKPRLGELTVTD
ATPDSLSLSWTVPEGQFDHFLVQYRNGDGQPKAVRVPGHEDGITVSGLEPDHKYKMNLYG
FHSGQRVGPVSVIGVTAAEEETPSPTEPSTEAPEPPEEPLLGELTVTGSSPDSLSLSWTI
PQGRFDSFTVQYKDRDGRPQVVRVRGEESEVTVGGLEPRRKYKMHLYGLHEGRRVGPVST
VGVTDDEAETTQAVPTTTPEPPIKPRLGELTVTDATPDSLSLTWMVPEGQFDHFLVQYRN
GDGQPKAVRVPGHEDGITISGLEPDQKYKMNLYGFHGGQRMGPVSVIGVTAAEEETPAPT
EPSTEAPEPPEEPLLGELTVTGSSPDSLSLSWTVPQGRFDSFTVQYKDRDGRPQVVRVGG
EESEVTVGGLEQGRKYKMHLYGLHEGQRVGPVSAVGVTAPEDEAETTQAVPTTTPEPPIK
PHLGELTVTDATPDSLSLSWTVPEGQFDHFLVQYRNGDGQPKAVRVPGHEDGVTVSGLEP
DHKYKMNLYGFHGGQRVGPVSVIGVTEEENPSPTEPSTEAPEPPEEPLLGELTVTGSSPD
SLSLSWTVPQGRFDFFTVQYKDRNGQPQAVRVGGEESEVTVRGLEPGRKYKMHLYGLHEG
QRMGPVSTVGVTAPLPTPPPVEPRLGELAVAAVTSDSVGLSWTVAQGPFDSFLVQYRDAQ
GQPQAVPVSGDLRAVTVSGLDPARKYKFLLFGLQNGKRHGPVPVEARTAPDTKPSPHLGE
LTVTDATPDSVGLSWTVPEGEFDSFVLQYKDKDGQLQVVPVAANQREVTVQGLEPSRKYR
FLLYGLSGRKRLGPISAESTTAPLEKEPPPRLGELTVAEETSSSLRLSWTVAQGPFDSFV
VQYRDTDGQPRAVPVAADQRTVTIEDLEPGKKYKFLLYGLLGGKRLGPVSALGMTAPEED
TPAPELXPEAPEPPEEPRLGVLTVTDTTPDSLRLSWSVAQGPFDSFVVQYEDTNGQPQAL
LVDGDQSKILISGLEPSTPYRFLLYGLHEGKRLGPLSAEGTTGLAPAGQTSAESRPRLSQ
LSVTDVTTSSLRLNWEAPPGAFDSFLLRFGVPSPSTLEPHPRPLLQRELMVPGTRHSAVL
RDLRPGTLYSLTLYGLRGPHKADSIQGTARTLSPVLESPRDLQFSEIREASAKVNWMXPP
SRADSFKVSYQLADGGEPQSIQVDGRARTQKLQGXIPGARYEVTVVSVRGFEESEPLTGF
LTTVPDGPHAVRALNLTEGFAVLHWKPPQNPVDTYDVQVTAPGAPPLQAEAPGSAVDYPL
HDLVLHTNYTATVRGLRGPXLTSPASITFTTGLEAPRDLEAKEVTPRTALLTWTEPQFRP
TGYLLSFDTPGGQTQEILLPGGITSHQLLGLFPSTPYNARLQAMWGQSLLPPVSTSFTTG
GLWIPFPRDCGEEMQNGAGASRTSTIFLNGNRERPLNVFCDMETDGGGWLVFQRRMDGQT
DFWRDWEDYAHGFGNISGEFWLGNEALHSLTQAGDYSMRVDLRAGDEAVFAQYDSFRVDS
AAEYYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNNLLISCAVSYRGXWWYRNCHYANL
NGLYGSTVDHQGVSWYYWKGFEFSVPFTEMKLRPRNFRSPAGGG
NT seq 12735 nt   +upstreamnt  +downstreamnt
atgatgccagcccagtatgctctaacctccagcctggttctcctggtgctgctgagcaca
gccagagcaggccccttcccttcacggtccaatgtgacactgccagccccccggccccct
ccccagccagggggccgcacagtgggggccggagtgggaagcccctcttctcagctttac
gagcacacagtggaaggaggggagaagcaggtggtattcacccaccgcattaacctgccc
ccttccactggctgtggctgtcccccaggcactgagcccccagtccctgcttcagaggtg
caggccctgagagtccggctagagatcctggaggagttggtgaaggggctcaaggaacag
tgcactgggggatgttgtcctgcctctgcccaagctggcacaggtcagacagatgtgcgg
accctctgcagtctccatggtgtgtttgatctgagccgctgcacctgttcctgtgagcca
ggctggggtgggcccacctgctcagacccaacagatgctgagatccctccctcctcccca
ccctcagcctcggggtcctgcccagatgactgcaatgatcagggtcgctgtgtccgtggt
cgttgcgtgtgctttcccggctacactggccccagctgtggctggccatcctgtcccggg
gactgccaaggccgtgggcgctgcgtgcagggcgtgtgcgtgtgcagggcaggcttctca
ggcccagactgcagccagcgctcctgccctcggggttgcagccagaggggacgctgtgag
gatgggcgctgcgtgtgtgacccaggctacactggtgacgactgtggcatgaagagctgc
cctcgcggttgcagtcagagggggcgctgtgagaatgggcgctgcgtgtgcaacccgggc
tacactggcgaggactgtggggtgaggagctgccctcggggctgcagccagcggggacgc
tgcgaggacgggcgctgcgtgtgtgaccccggctacactggcgaggactgtggtacgcgg
agctgcccctgggactgtggggagggcgggcgctgcgtggacggccgctgcgtgtgctgg
cccgggtacacaggcgaggactgcagcacgcggacatgtccgagggactgccggggccgc
gggcgctgcgaggacggcgaatgcatttgcgacacgggctacagcggggacgattgcggc
gtgcgcagctgccctggtgactgcaaccaaaggggccgctgcgaggacggccgctgcgtg
tgctggccggggtacactggaaccgattgcggctcgcgcgcctgcccacgcgactgtaga
ggtcgcgggcgctgcgagaacggcgtgtgtgtttgcaacgcgggctacagcggcgaggac
tgcggtgtgcgcagctgtcctggggactgtcgtggccggggccgctgtgagagtggccgc
tgcgtgtgttggccggggtacacaggccgggactgcggcacgcgcgcctgtcctggcgac
tgtcgcgggcgcgggcgctgcgtggatggccgctgcgtgtgcaacccgggcttcaccggt
gaggactgtgggagccgtcgctgtcccggggactgccgtgggcacggccgttgcgaggat
ggcgtgtgcgtgtgtgacgcagaccactcaggggaggactgcagcacgcgcagctgcccc
gggggctgccgaggccgcggccagtgcctagatgggcggtgtgtgtgcgaggacggctac
tctggcgaggattgcggtgtgaggcagtgcccgaatgactgcagccagcacggcgtgtgc
caggacggtgtgtgcatctgttgggaaggctacgtgggcgaggactgtagcatccgcacc
tgcccctccaactgccatgggaggggccgctgtgaggaagggcgctgcctgtgcgaccca
ggctacaccggccccacctgtgccacccgcatgtgcccggctgactgccggggacgtggg
cgctgtgtgcagggagtgtgcctgtgccacgtgggctatggcggtgaggactgcgggcag
gaagagcctccagccagcgcctgccctggaggctgcggaccccgggaactgtgccgggca
ggccagtgtgtgtgtgtagagggcttccgaggccctgactgtgccatccagacatgccca
ggggactgccgtggccgaggagagtgtcacgatggcagctgtgtctgcaaagatgggtat
gctggcgaagactgcggggaagaggtgccaaccattgagggcatgaggatgcatctcttg
gaggagacaacagttcggacagagtggaccccggctcctggccctgtggatgcctatgaa
attcagttcatccccacgacagagggggcgagccccccattcacagcacgggttccaagc
tctgcctcagcctatgaccagagaggactggcccctggacaggagtaccaggtcactgtc
cgagcccttcgagggaccagctggggccttcctgcctccaagaccatcaccaccatgatc
gatgggccccaggacctccgagtggtggctgtgacgccaacaacactggagcttggctgg
ctgcgtccccaggccgaggtggaccgatttgtggtgtcctacgtcagtgccggcaaccag
agggtgcggctggaagtgccccctgaagcagatgggacgctgctgactgacctgatgcca
ggcgtagaatatgtggtgactgtcacagcagagcggggccgggcagtcagctacccagct
tctgtcagggccaacacagggtcctcaccctcgggcctcttggggactaccgatgagcct
cctccctcaggcccctcgacgacgcaaggggcccaggctcctctcctgcagcagcgcccc
caggagctgggagagttgagggtgctgggcagagatgagacagggcgcctccgtgtggtc
tggaccgcccagcccgacacctttgcccacttccaactgcgcctgcgggtgcccgagggg
ccgggggcacatgaggaagtgctgccaggggacgtccgccgggctctggtgcctccaccc
cctcctggaaccacgtatgagctgtcacttcgtggggtccctcctgggggcaagccctct
gaccccatcatctaccaaggcgttatggacaaggatgaggagaagcctgggaagtcctca
ggcccaccacgcctgggtgagctgacagtgacagacaggacctccgactccttgctcctg
cgctggacagtccccgagggcgagtttgactccttcgtgatccagtacaaggacagggac
gggcagccccaggtggtgcccgtggagggaccccagcgctcggctgtcatcacctccctg
gatcctggccgcaagtacaaatttgtcctgtacgggttcgttggcaagaagaggcatggt
ccactggtggctgaagccaagatcttgcctcagagtgacccaagtccagggactccaccc
cgcctgggaaatctgtgggtgacagaccctaccccagattcactgcacctctcctggact
gtccctgagggccagttcgacaccttcatggtccagtacagggacagggatggacggccc
caggtggtacctgtggaagggcccgagcgttcatttgttgtctcctcactggaccctgac
cacaagtacagattcactctgtttggaatcgcgaacaagaagcggtatggccccctcacg
gccgatggcaccactgctccagagaggaaagaggagcccccccaccctgagttcctggag
cagcccctcctgggggaactgacagtgaccggtgtgaccccagactccttgcgcctctca
tggacggtagcccagggccccttcgactcgttcatggtccagtacaaggatgcacagggg
cagccccaggcagtgcctgttgcgggggatgagaatgaggttaccgtccccggcctggat
cccgaccggaagtataagatgaacctctacgggcttcgtggcaggcagcgtgtggggccc
gagtctgtggtggccaagactgctcctcaggaggatgtggacgagacccccagccccaca
gaactgggcacggaggccccagagtcccccgaggagccactcctgggggagctgacagtg
acaggatcctcccctgattcgctgagcctctcctggaccgtcccccagggccgcttcgac
tccttcaccgtgcagtacaaggacagggacgggcggccccaggcggtgcgtgttgggggc
aaggagagtgaggtcaccgtggggggcctagagcccgggcacaagtacaagatgcacctg
tacggcctccacaaggggcagcgcgtgggcccggtgtccgccatgggcgtgacagcccca
caacaagaagagacccctccagccaccgagtccccgctggagccacgcctaggagagctg
acagtgacagacgtgacccccaactctgtgggcctctcctggatggtccccgagggccag
tttgactccttcatggtccagtacaaggacaaggacgggcagccccaggtggtgccggtg
gctgcagaccagcgagaggtcacggtctacaacctggagcctgagagaaaatacaagatg
aacatgtatggactacatgatgggcaacgcatgggccccctgtctgtggtcatcatgacg
gctcccctcccaacagccccagccacagaggcctccaagcctcccctggagccacgccta
ggggagctgacagtgacggatgtaacccctgactctgtgggcctctcgtggacagtccct
gagggtgaattcgactcctttgtggttcagtacaaggacagggacgggcagccccaggtg
gtgcccgtggctgcagaccagcgggaggtcaccatccctgacctggaaccctcccgcaag
tacaagttcctgctctttgggatccaggatgggaaacgacgcagcccagtctctgtggag
gcaaagacggttgcccgaggtgacgccagcccaggggccccaccccgccttggggagctg
tgggtgacagaccccaccccagactcactgcgcctctcctggacggttcctgagggccag
ttcgactcttttgtggtccagttcaaggacaaagatgggccccaggtggtgcccgtggag
ggccatgagcgctctgtcaccgtcacccctctggacgccggccgcaagtacagattcctc
ctctatggcctcctgggcaaaaagcgccatggccccctcactgccgacggcaccacggag
gcccggagtgctatggacgatactggaacaaagcgtcccctaaaaccccgtctgggggag
gagctgcaggtgaccagcgtgacccagaactccgtgggcctctcctggacagtccctgag
ggccagtttgactcctttgtggtccagtacaaggacagggacgggcagccccaggtggtg
cccgtggagggcagcctcagggaggtcagcgtgccgggcctggaccccgcccacaggtac
aagctgctgctctacgggctgcacggcagcaagcgtgtgggccccatctcggccgtcgcc
gtaactgccggcagggaagaaatggaaactgagaccacggccccgacctctccagcgcct
gagccccaccttggggagttgacagtggaggaggccacgccacataccctgcatctctcc
tggatggtgactgagggagaatttgactccttcgaaatccagtacacagatagagacggg
caactccaaatggtccgtacaggaggcgaccggaatgacatcaccctctctggcctggaa
tccgaccacagatacctggtgacactgtatggtttccgtgatgggaagcatgtgggtcct
gtccgtgtcgaggccctgacagtcccggaggaggaggagccttcagaacctcccaccgca
accaccgagccccccatcaaacctcacctgggggagctgaccgtgacagatgccacccct
gactccctcagcctgtcctggacggttcccgagggccagtttggccacttcctggtccag
tacaggaatggggacgggcagcccaaggcggtgcgggtgccagggcacgaggatggggtc
accgtctccggcctggagccagaccacaagtacaagatgaacctgtacggcttccacggt
ggccagcgcgtgggtcccgtgtctgtcatcggggtgacggctgcagaggaagagaccccc
agccccacagaacccagcatggaggccccagagccccctgaggagccgctcctgggggag
ctgacagtgacaggatcctcccctgactcgctgagcctctcctggaccatcccccagggc
cgcttcgactccttcaccgtgcagtacaaggacagggacgggcggccccaggtggtgcgt
gtcaggggcgaggagagcgaggtcactgtggggggcctggagcccgggcgcaaatacaag
atgcacctgtacggcctccacgaggggcagcgcatgggcccagtgtctgctgtgggcgtc
acggcccctgaagaggagtcccctgatgctgcccttgtgaagccgcgcctaggggagatg
acagtgacagatgtcacctccgactccctcagcctctcctggacagtccccgagggccaa
tttgaccatttcttggtccagtacaagaatgtggatgggcagcccaaggcggtgcgagtg
ccgggatatgaggacggggtctccatctcgggcctggagccagaccacaagtacaagatg
aacctgtacggcttccacggtggccagcgtgtgggccccgtgtctgctgttggtttaact
gccccaggaaaggatgaagaaatggccccagactcgacagaacctcccacccctgaaccc
cccatcaagcctcgcctggaggagctgaccgtgacagatgtgacccctgactccctcagc
ctgtcctggaaggttcccgagggccagtttgaccacttcctggtccagtacaagaatggt
gatgggcagcccaaggcaacacgggtgccaggacatgaggacagggtcaccatctccggc
ctggagccagaccacaagtacaagatgaacctgtatggcttccacggtggccagcgcgtg
ggtcccgtgtctgtcatcggggtgacggctgcagaggaagagacccccaggcccacagaa
cccagcactgaggccccggagccccctgaggagccgctcctgggggagctgacagtgaca
gaatcctcccctgactcgctgagcctctcctggaccgtcccccagggccactttgactcc
ttcaccgtgcagtacaaggacaggaatgggcagccccaggtggtgcgtgttgggggcgag
gagagcgaggtcaccgtggggggcctggagcctgggcgcaaatacaagatgcacctgtac
ggcctccacgaggggcagcgcgtgggcccagtgtccaccgtgggcctgactgccccacga
gaggatgtggacaagacccccagccctacagaaccaggcacagaggccccagagcccccc
gcggagccactcctgggggagctgacagtgacaggatcctcccctgactcgctgagcctc
tcctggaccatcccccagggccgcttcgactccttcactgtgcagtacaaggacagggac
gggcggccccaggcggtgcgtgttgggggccaggaaagcgaggtcaccgtggggggcctg
gagcctgggcgcaagtacaagatgcacctgtacggcctccacgaggggcagcgcctgggc
ccggtgtctgccgtgggcatcacagctccagaggatgaagctgagaccacccaagcagtg
cccaccacgacccctgagtcccccatcaagcctcgcctgggggagctgaccgtgacagat
gccacccctgactccctcagcctgtcctggacagttcccgagggccagtttgaccacttc
ctggtccagtacaggaatggggacgggcagcccaaggcggtgcgggtgccagggcacgag
gatgggatcaccgtctccggcctggagccagaccacaagtacaagatgaacctgtatggc
ttccacagtggccagcgcgtgggccctgtgtctgtcattggggtgacggctgcagaggaa
gagacccccagccccacagaacccagcactgaggccccagagccccctgaggagccgctc
ctgggggagctgacagtgacaggatcctcccctgactcgctgagcctctcctggaccatc
ccccagggccgcttcgactccttcaccgtgcagtacaaggacagggacgggcggccccag
gtggtgcgtgtcaggggcgaggagagcgaggtcactgtggggggcctggagcccaggcgc
aaatacaagatgcacctgtacggcctccacgaggggcggcgtgtgggcccggtgtccacc
gtgggtgtgacagatgatgaagccgagaccacccaagcagtgcccaccacgacccctgag
ccccccatcaagcctcgcctgggagagctgaccgtgacagatgccacccctgactccctc
agcctgacctggatggttcccgagggccagtttgaccacttcctagtccagtacaggaat
ggggacgggcagcccaaggcggtgcgggtgccagggcacgaggatgggatcaccatctca
ggcctggagccagaccaaaagtacaagatgaacctgtacggcttccatggtggccagcgc
atgggccctgtgtctgtcatcggggtgacggctgcagaggaagaaactcccgcccccaca
gaacccagcacggaggccccggagccccctgaggagccgctcctgggggagctgacagtg
acaggatcctcccctgactcgctgagcctctcctggaccgtcccccagggccgctttgac
tccttcaccgtgcagtacaaggacagggacgggcggccccaggtggtgcgtgtcgggggc
gaggagagcgaggtcaccgtggggggcctggagcaggggcgcaaatacaagatgcacctg
tacggcctccacgaggggcagcgcgtgggcccggtgtccgctgtgggtgtgacagctcca
gaggatgaagcggagaccacccaagcagtgcctaccacgacccctgagccccccatcaag
cctcacctgggggagctgactgtgacagatgccacccctgactccctcagcctgtcctgg
acagttcccgagggccagtttgaccacttcctggtccagtacaggaatggggacgggcag
cccaaggcggtgcgggtgccagggcatgaggatggggtcaccgtctccggcctggagcca
gaccacaagtacaagatgaacctgtacggcttccacggtggccagcgcgtgggccctgtg
tctgtaatcggggtgacagaggaagagaaccccagccccacggaacccagcactgaggcc
ccggagccccctgaggagccgctcctgggggagctgacagtgacaggatcctcccctgac
tcgctgagcctctcctggaccgtcccccagggtcgcttcgacttctttaccgtgcagtac
aaggacaggaatgggcagccccaggcggtgcgtgttgggggcgaggagagcgaggtcacc
gtgaggggcctggagccggggcgcaaatacaagatgcatctgtacggcctccacgagggg
cagcgcatgggcccagtgtccaccgttggcgtcacggcgcccctgcccacaccaccgccg
gtggagccccgcctgggggagctggccgtggcagccgtgacctcggactcagtgggcctt
tcatggacggtggcccagggcccctttgactccttcctggtgcagtatagggacgctcag
gggcagccccaggcagtgcctgtgagcggagacctccgagcagtcaccgtctcggggctg
gacccggcccgcaagtacaagttcctgctctttggactccagaatgggaaacgccacggc
ccagtccctgtggaggccaggaccgccccagacacgaaaccgtctccccacctgggggag
ctgactgtgacagatgcgacccctgactccgtgggcctctcgtggacggtccctgagggc
gaattcgactccttcgtgctccagtacaaggataaggacggtcagctccaggtggtgcca
gtggcagccaaccagcgggaggtcacagtccagggcctggagcccagtaggaaatacagg
ttcctgctctacggtctgtcaggcaggaagcgactgggccccatctctgctgagagcacc
acagcgccgctggagaaggagccacctccccgcctgggggaactgaccgtggctgaggag
acctccagctctctgcgcctgtcctggacggtggcccagggcccctttgactccttcgtg
gtccagtacagggacacggacgggcagcccagggcagtgcctgtggccgcagaccagcgc
acagtcaccatagaggacctggagcctggcaagaaatacaagtttctgctctacgggctc
ctcggggggaagcgcctgggtccggtctccgccctgggaatgacagccccagaagaggac
acgccagctccagagttangcccagaggcccctgagcctcctgaagagccccgcctagga
gtgctgaccgtgaccgacacaaccccagactccctgcgcctctcgtggagtgtggcccag
ggcccctttgattccttcgtggtccagtatgaggacaccaacgggcagccccaggccttg
ctcgtggacggcgaccagagcaagatcctcatctcaggcctggagcccagcaccccctac
aggttcctcctgtatggcctccatgaagggaagcgcctggggcccctctcagctgagggc
accacagggctggcccctgctggtcagacctcagcggagtcaaggccccgcctgtcccag
ctgtctgtgactgatgtgaccaccagttcactgaggctcaactgggaggccccacctggg
gccttcgactccttcctgctccgctttggggttccatcaccaagcaccctggagccgcat
ccgcgtccattgctgcagcgcgagctgatggtgcccggcacgcggcactcggccgtgctc
cgggacctgcgtccggggactctgtacagcctgacactgtatgggctgcgaggaccccac
aaggccgacagcatccagggcaccgcccgcaccctcagcccagttctggagagcccccgt
gacctccaattcagtgaaatcagggaggcctcagccaaggtcaactggatgnccccaccg
tcccgggcagacagcttcaaagtctcctaccagctggcggacggaggggagccgcagagc
atacaggtggacggccgggcccggacccagaaactccagggcntgatcccaggcgctcgc
tatgaggtgaccgtggtctccgtccgaggctttgaggagagtgagcctctcacaggcttc
ctcaccacggttcctgacgggccccacgcagtgcgtgcactgaacttgaccgagggattc
gccgtgctgcactggaagcccccccagaatcctgtggacacctatgacgtccaggtcaca
gcccctggggccccgcctctgcaggctgaggccccaggcagcgcggtggactaccccctg
cacgacctcgtcctccacaccaactacaccgccactgtgcgcggcctgcggggccccnac
ctcacttccccagccagcatcaccttcaccacagggctagaggcccctcgggacttggag
gccaaggaagtgaccccccgcacggccctgctcacttggactgagccccaattccggccc
acaggctacctgctcagcttcgacacccctggtggacagacccaggagatcctgctccca
ggagggatcacatctcaccagctcctcggcctctttccctccaccccctacaatgcacgg
ctccaggccatgtggggccagagcctcctgccgcccgtgtccacctctttcaccacgggt
gggctgtggatccccttccccagggactgcggggaggagatgcagaacggagccggtgcc
tccaggaccagcaccatcttcctcaacggcaaccgcgagcggcccctgaacgtgttttgc
gacatggagactgatgggggcggctggctggtgttccagcgccgcatggatggacagaca
gacttctggagggactgggaggactatgcccatggttttgggaacatctctggagagttc
tggctgggcaatgaggccctgcacagcctgacgcaggcaggtgactactccatgcgcgtg
gacctgcgggctggggacgaggctgtgttcgcccagtacgactccttccgcgtagactcg
gctgcggagtactaccgcctccacttggagggctaccacggcaccgcaggggactccatg
agctaccacagcggcagtgtcttctctgcccgtgatcgggaccccaacaacttgctcatc
tcctgcgctgtctcctaccgagggncctggtggtacaggaactgccactacgccaacctc
aacgggctctacgggagcaccgtggaccatcagggagtgagctggtactactggaagggc
ttcgagttctcggtgcccttcacggaaatgaagctgagaccaagaaactttcgctcccca
gcggggggaggctga

KEGG   Pongo abelii (Sumatran orangutan): 100443710
Entry
100443710         CDS       T01416                                 

Gene name
COL4A4
Definition
(RefSeq) collagen alpha-4(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100443710 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100443710 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100443710 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100443710 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    100443710 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100443710 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100443710 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100443710 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100443710 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100443710 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100443710 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100443710 (COL4A4)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100443710 (COL4A4)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100443710 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100443710
NCBI-ProteinID: XP_024098889
Ensembl: ENSPPYG00000013236
LinkDB
Position
2B
AA seq 1684 aa
MWSLHIVLMRYSFGLTKSLATGPWSLILILFSVQYVYGSGKKYVGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
DGIPGHPGPPGPRGKPGMSGHNGSRGDPGFLGGRGALGPGGPPGHPGEKGEKGNSVFILG
AIKGIQGDRGDPGLPGLPGFWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGIKG
QMGDSGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMIGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT
PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRPDSAPGKPGKPGSPGLPGEPGLQGLPGSSATYCSVGNPGPQGIKGKVGPPGGR
GSKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGIPGWLGTEGDPGSPGAEGPPGL
PGKHGASGPPGNKGAKGDMVVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGLPGERGHPGVPGRPGVRGPDGLKG
QKGDTISCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGSPGRRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPGPPGPKG
PQGLPGFPGFPGERGKPGAEGCPGTKGEPGEKGMSGFPGDRGLRGAKGAIGPPGDEGEMA
IISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGQPGEKGQPGPP
GPPGPPGSMGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASHFGPPGR
KGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDQGMPGLRGQPGE
MGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLHDVGPPGPVGIP
GLKGERGDPGSPGISPPGPYGEKGPPGPPGRSGPPGPAGATGRAPKDIPDPGPPGDQGPP
GPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGP
MGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADVDDCPRIPGLPGAPG
MRGPEGAMGLPGMRGPPGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGYPGGPGP
PGPTGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQD
LGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIRPYVSRC
VVCESPAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDF
RAAPLLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKISRCQVC
VKYS
NT seq 5055 nt   +upstreamnt  +downstreamnt
atgtggtctctgcacatagtactaatgaggtactccttcggattgaccaagtccttggcc
acaggtccctggtcacttatactcattctcttttctgtacaatatgtatatgggagtgga
aagaaatacgttggtccttgcggaggaagagattgctctgtttgccactgtgttcctgaa
aaggggtctcggggtccaccaggaccaccagggccacagggtccaattggacccctggga
gccccaggacccattgggctttcaggagagaaaggaatgagaggggaccgcggccctcct
ggagcagcaggggacaaaggagataagggtccaactggtgttcctggatttccaggttta
gatggcatacctgggcacccagggcctcctggacccagaggcaaacctggcatgagtggc
cacaatggctcaagaggtgacccagggtttctaggaggaagaggagctcttggcccagga
ggccccccaggccatcctggggaaaagggagaaaaaggaaattcagtgttcattttaggt
gccattaaaggtattcagggagacagaggggacccaggactgcctggcttaccaggattt
tggggtgcaggaggaccagcgggccccacaggatatcctggagagccagggttagtggga
cctccgggccaaccagggcgtccaggtttgaagggaaatcccggtgtgggaataaagggg
caaatgggagactcgggtgaggttggtcagcaaggttctcctggacccaccctgttggta
gagccacctgacttttgtctctataaaggagaaaagggtataaaaggaattcccggaatg
attggactgccaggaccaccaggacgcaagggagaatccggtattggggcaaaaggagaa
aaaggtattcccggatttccagggcctcggggggatcctggttcctatggatctccaggt
tttccaggattaaagggagaactaggactggttggagatcctgggctatttggattaatt
ggcccaaagggggatcctggaaatcgagggcacccaggaccaccaggtgttttggtgact
ccacctcttccactcaaaggcccaccaggggacccagggttccctggccgctatggagaa
acaggggatgttggaccacctggtcccccaggtctcttgggcagaccaggggaagcctgt
gcaggcatgataggaccccctgggccacaaggatttcctggtcttcctgggcttccagga
gaagctggtattcctgggagacctgattctgctccaggaaaaccagggaagccaggatca
cctggcttgcctggagaaccaggcctgcagggcctcccaggatcaagtgcgacatactgc
agtgttgggaaccctggaccacaaggaataaaaggcaaagtgggtcccccaggaggaaga
ggctcaaaaggagaaaaaggaaatgaaggactctgtgcctgtgagcctggtcccatgggc
ccccctgggcctccaggacttcctgggaggcaggggagtaagggagacttggggatccct
ggctggcttggaacagaaggtgacccgggatctcctggtgctgaaggacctccagggcta
ccaggaaagcatggtgcctccggaccacctggcaacaaaggggcaaagggtgacatggtt
gtatcaagagttaaagggcacaaaggagaaagaggtcctgatgggcccccaggatttcca
gggcagccaggatcacatggtcgggatggacatgctggagaaaaaggggatccaggaccc
ccaggggatcatgaagatgcgaccccaggtggtaaaggatttcctggacctctgggcccc
ccgggcaaagcaggacctgtggggcccccaggactgggatttcctggtctaccaggagag
cgaggccacccaggagttccaggccgcccaggtgtgaggggccctgatggcttgaagggt
cagaaaggtgacacaatttcttgcaacgtaacctaccctgggaggcaaggccctccaggt
tttgatggacctccaggtccaaagggatttccaggtccccaaggtgcccccgggctgagt
ggttcagatggacataaaggcagacctggcacaccaggaacatcggaaataccaggtcca
cctggttttcgtggtgacatgggagatccgggttttggaggtgaaaaggggtcctcccct
gttgggcccccaggccctcccgggtcaccaggagtgaatggtcagaaaggaatcccggga
gaccctgcatttggtcacctgggatccccaggaaggaggggtctttcaggagtgccaggg
ataaaaggacccagaggtgatccgggatgtccaggggctgaagggccagctggcattcct
ggattcccaggtctcaaaggtcccaaaggcagagagggacatgctgggtttccaggtgtc
ccaggtccgcctggccattcctgtgaaagaggtgctccagggataccagggcaaccggga
ctccctgggtatccaggtgggaaaggacagccgggagatgtggggcctcccgggccagct
ggaatgaaaggtctccccggactcccaggacggcctggggcacatggtcccccaggcctc
ccaggaatcccaggtccttttggggatgatgggctacccggtcctccaggtccaaaggga
ccccaggggctgcctggtttcccaggttttcccggagaaagaggaaagcctggtgcagag
ggatgtcctggcacaaagggagaacctggagagaagggcatgtctggctttcccggagac
cggggactgagaggagccaaaggagccataggacctcccggagatgaaggagaaatggct
atcatttcccaaaagggaacacctggggaacctggacctcctggagatgatggattccca
ggagaaagaggtgataaaggaactcccgggatgcaagggagaagaggagagccgggaaga
tacggaccacctggatttcacagagggcaacctggcgagaaaggtcagccagggcctcct
ggacccccaggccctccaggctcaatgggtctaagagggttcattggttttccaggactt
ccaggtgaccagggtgagccaggttctccaggtccccctggattttcaggaattgatgga
gcaagaggacctaaaggaaacaaaggtgaccctgccagtcactttggtccacctggtcga
aagggtgagccaggtagccctggatgtccagggcattttggagcatccggagagcagggc
ttgcctggcattcaagggcccagaggatcacccggaaggccagggccacctggctcctct
ggaccaccagggtgcccaggtgatcaggggatgcctgggctgaggggacagccaggagaa
atgggagaccctgggccaagaggcctccagggggatccagggataccaggtcctccggga
ataaaaggtccctccggatcacctggtctaaacggcttgcatggattgaagggtcagaaa
ggaaccaaaggtgcttcaggtttgcatgatgtggggccacctggtccagtgggaatacct
gggctaaaaggggagagaggagatcctgggagcccaggaatctctcctccaggtccttat
ggagaaaaaggtcccccaggtcccccagggagatcaggaccacctggtcctgcaggtgcc
acaggaagagctcctaaggacattcctgacccgggtccacctggagatcagggacctcct
ggtcctgatggcccaagaggagcacctgggcctccaggcctccctgggagtgttgacctt
ctgagaggggagccaggtgactgtggtctaccagggccaccaggtccccctggcccacca
ggccctccaggatacaaaggctttccaggatgcgatggaaaagatggccagaaaggacca
atgggattcccggggccgcagggaccacatggatttcctgggccacctggagagaagggt
ttacctggacctccagggagaaaagggcccactggtcttccaggtcccagaggtgaacca
gggccacctgcagatgtggatgactgtccccgaatcccaggccttcctggggcaccaggc
atgagaggaccagaaggagccatggggctccctggaatgagaggccccccaggaccaggg
tgcaaaggagagcctgggctggatggcaggaggggtgtggatggcgtccctgggtctcct
gggcctcctggacgtaaaggtgacacaggagaagacggctaccctggaggaccagggcct
cctggtcccactggggatcctgggcccaaagggtttggccctggatacctcggtggcttc
ctcctggttctccacagtcagacggaccaggagcccacctgccccctgggcatgcccagg
ctctggactgggtatagtctgttatacctggaagggcaagagaaagctcacaatcaagac
cttggtctggcaggttcttgccttcccgtgtttagcacactgccctttgcctactgcaac
atccaccaggtgtgccactatgcccagagaaacgacagatcctactggctggccagtgct
gcgcccctccccatgatgccactctctgaagaggcgatccgcccctatgtcagccgctgt
gtggtatgcgagtccccggcccaggcggtggcggtgcacagccaggaccagtccatcccc
ccatgtccgcagacctggaggagcctctggatcgggtattcattcctgatgcacacagga
gctggggaccaaggaggagggcaggccctcatgtcacctggcagctgcctggaagatttc
agagcagcaccattgcttgaatgccaaggccggcagggaacttgccacttttttgcaaat
aagtatagcttctggctcacaacagtgaaagcagacttgcagttttcctctgctccagca
ccagacaccttaaaagaaagccaggcccaacgccagaaaatcagccggtgccaggtctgt
gtgaagtatagctag

KEGG   Pongo abelii (Sumatran orangutan): 100444077
Entry
100444077         CDS       T01416                                 

Gene name
COL4A3
Definition
(RefSeq) collagen alpha-3(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100444077 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100444077 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100444077 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100444077 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    100444077 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100444077 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100444077 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100444077 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100444077 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100444077 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100444077 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100444077 (COL4A3)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100444077 (COL4A3)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100444077 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100444077
NCBI-ProteinID: XP_024098890
Ensembl: ENSPPYG00000013237
LinkDB
Position
2B
AA seq 1670 aa
MSPRTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSP
GQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGITGLPGFSGSPGLPGTPGNTGPYGLV
GVPGCNGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKGEDIELDAKGDPGLPGAPG
PQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVIGHKGERGVKGLTGPPGP
PGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGPQGKPG
KDGVPGFPGSEGVKGNRGFPGITGEDGIKGQKGDIGLPGFRGPTEYYDTHQEKGDEGTPG
PPGPRGARGPQGPSGPPGVPGSPGSSRPGLRGAPGWPGLKGSKGERGLPGKDAMGTPGSP
GCPGSPGLPGSPGPPGPPGGIVFRKGPPGDRGLPGYLGSPGIPGVDGPKGEPGLLCTQCP
YIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEK
GETLQPEGQVGAPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGD
PGSPGSPGPTGPAGPPSYGPQGEPGPQGTQGAPGAPGPPGEAGPRGELSVSTPVPGLPGP
PGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSL
GCPGKMGEPGLPGKPGLPGTKGEPAVAMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPG
TPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRIGKTGSKGD
PGIPGLDRSGFPGETGSPGMPGHQGEMGPPGQKGYPGNPGILGPPGEDGVIGMMGFPGAI
GPPGPPGNPGMPGQRGSPGIPGVKGHRGTPGAKGEQGDKGNPGPSEISHVIGDKGEPGLK
GFTGNPGEKGNRGIPGMPGLKGLKGLPGPAGPPGPRGDLGSTGNPGELGLRGIPGSMGNR
GMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSEGTRPGPPGPTGDPGLPGDMG
KKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPHGDLGFKGIKGFLGPPGIRGPPG
LPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLGAPPGPRGNPGAQGAKGDRGAPGFP
GLPGRKGVMGDAGPRGPTGIEGFPGPPGLPGAMIPGQKGNHGPPGSRGSPGEPGPPGPPG
SHVIGIKGDKGSMGHPGPKGPPGTVGDMGPPGHLGAPGTPGLPGLRGDPGFQGFPGVTGE
KGNPGFLGSIGPPGPIGPKGPPGIRGDPGTLKIISLPGSPGPPGTPGEPGMQGEPGPPGP
PGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPPGSDGLPGLKGKRGDSGSPAT
WTTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTM
PFLFCNVNDVCNFASRNDYSYWLSTPALMPMDMAPITGRALEPYISRCTVCEGPAIAIAV
HSQSTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMKKRH
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgagcccccggacggcccccaggccgcaggtgctcctactgccgctcctgctggtgctc
ctggcggcggcgcccgcagccagcaagggctgtgtctgtaaagacaaaggccagtgcttc
tgtgatggggccaaaggggagaagggggagaaaggctttcctggaccccccggttctcct
ggccagaaaggattcacaggtcctgaaggcttacctggaccacagggacccaagggcttt
ccaggacttccaggactcacgggttccaaaggtgtaaggggaataactggattgccagga
ttttccggttctcctggacttccaggcaccccaggcaatactgggccttatggacttgtc
ggtgtaccaggatgcaatggttctaagggtgagcaggggtttccaggactcccagggaca
ctgggctacccagggatcccgggtgctgctggtttgaaaggacaaaagggtgctcctgct
aaaggagaagatatagaacttgatgcaaaaggcgaccccgggttgccaggggctccagga
ccccagggtttgccaggccctccaggttttcctgggcctgttggcccacctggtcctccg
ggattctttggctttccaggagccatgggacctagaggacctaagggtcacatgggtgaa
agagtgataggacataaaggagagcggggtgtgaaagggttaacaggacccccgggacca
ccaggaacagttattgtgaccctaactggcccagataacagaacggacctcaagggggaa
aagggagacaagggagcaatgggcgagcctggacctcctggaccctcaggactgcctgga
gaatcatatggatctgaaaagggtgctcctggagaccctggcccacagggaaaacccgga
aaagatggtgttcctggcttccctggaagtgagggagtcaagggcaacaggggtttccct
gggataacgggtgaagatggcattaagggacagaaaggggacattggccttccaggattt
cgtggtccaacagaatattatgacacacaccaggaaaagggagatgaaggcactccaggc
ccaccagggcccagaggagctcgtggcccacaaggtcccagtggtccccccggagttcct
ggaagtcctggatcttcaaggcctggcctcagaggagcccctggatggccaggcttgaaa
ggaagtaaaggggaacgaggcctcccaggaaaggatgccatggggactcctgggtcccca
ggttgtcctggttcaccaggccttccaggatcaccgggacctccaggaccaccaggtggt
attgtttttcgcaagggtccacctggagatcgtggactgccaggctatctagggtctcca
ggaatcccaggagttgacgggcccaaaggagaaccaggcctcctgtgtacacagtgccct
tatatcccagggcctcccggtctcccaggattgccagggttacatggtgtaaaaggaatc
ccaggaagacaaggtgcagctggcttgaaaggaagcccagggtccccaggaaatacaggt
cttccaggatttccaggtttcccaggtgcccagggtgacccaggacttaaaggagaaaaa
ggtgaaacacttcagcctgaggggcaagtgggtgccccaggtgacccggggctcagaggc
caacctgggagaaagggcttggatggaattcctggaactccgggagtgaaaggattacca
ggacctaaaggcgaactggctctgagtggtgagaagggggaccaaggtcctccaggggat
cctggctcccctgggtccccaggacctacaggaccagctggaccacctagctacggaccc
caaggagaacctggtcctcagggcacgcaaggagctcctggagcccctggaccacccgga
gaagccggccctaggggagagctcagtgtttcaacaccagttccaggcctaccaggacct
ccagggccccctggccatcctggcccccaaggtccacctggtatccctggatccctgggg
aaatgtggagatcctggtcttccagggcctgatggtgaaccaggaattccaggaattgga
tttcctgggcctcctggacctaaaggagaccaaggttttccgggtacaaaaggatcactg
ggttgtcctggaaaaatgggagagcctgggttacctggaaagccaggcctcccaggaacc
aagggagaaccagcagtagccatgcctggaggaccaggaacaccaggttttccaggagaa
agaggcaattctggggaacatggagaaattggactccctggacttccaggtctccctgga
actccaggaaatgaagggcttgatggaccacgaggagatccagggcagcctggaccacct
ggagaacaaggacccccaggaaggtgcatagagggtcccaggggagcccaaggacttcca
ggcttaaatggattgaaagggcaacaaggcagaataggtaaaacagggtcaaagggagac
ccaggaattccaggcttggatagatcaggatttcctggagaaactggatcaccaggaatg
ccaggtcatcaaggcgagatgggaccaccgggtcaaaaaggatatccaggaaatccggga
attttagggccaccaggtgaagatggagtgattgggatgatgggctttcctggagccatt
ggtcctccggggccccctgggaacccaggcatgccagggcagagggggagccctggaatt
ccaggagtaaaaggccatagaggaaccccaggagccaagggggaacaaggagataaagga
aatcccgggccttctgagatatcccacgtaataggggacaaaggagaaccaggtctcaaa
ggattcacaggaaatccaggtgagaaaggaaacagaggcattccagggatgccaggttta
aaaggcctcaaaggactacctggaccagcaggaccaccaggccccagaggagatttgggc
agcactgggaatcctggagaactaggactgcgtggtataccaggaagcatggggaacagg
ggcatgccaggttctaaaggaaaaaggggaactctgggattcccaggtcgagcaggaaga
ccaggcctcccaggtattcacggtctccagggagataagggagagccaggttattcagaa
ggtacaaggccaggaccaccgggaccaacgggggatccaggactgccgggtgatatggga
aagaaaggagaaatggggcaacctggcccacctggacatttggggcctgctggacctgag
ggagcccctggaagtcctggaagtcctggcctcccaggaaagccaggtcctcatggtgat
ttgggttttaaaggaatcaaaggcttcctgggccctccaggaatcagaggccctccaggt
cttccaggattcccaggatctcctggaccaatgggtataagaggtgaccaaggacgtgat
ggaattcctggtccagccggagaaaagggagaaacgggtttattgggggcccctccaggc
ccaagagggaaccctggtgctcaaggagccaaaggagacaggggagccccaggttttcct
ggcctcccaggcagaaaaggggtcatgggagatgctgggcctcgagggcccacaggcata
gaaggattcccagggccaccaggtctgcccggtgcaatgatccctggccagaaaggaaat
catggtccaccaggctcaagaggaagcccaggtgagcctggtccccctggacctccaggg
agtcacgtaataggcataaaaggagacaaagggtctatgggccaccctggcccgaaaggt
ccacctggaactgtaggagacatgggaccaccaggtcatctgggagcaccaggtactcca
ggtcttccaggacttagaggtgatcctggattccaggggtttccaggcgtgacaggagaa
aagggtaatcctggatttctaggatccattggacctccaggaccaattgggccaaaagga
ccacctggtatacgtggagaccctggcacacttaagattatctcccttccaggaagccca
gggccacctggcacacctggagaaccagggatgcagggagaacctgggccaccagggcca
cctggaaacctaggaccctgtgggccaagaggtaagccaggcaaggatggaaaaccagga
actcctggaccagctggagaaaaaggcaacaaaggttctaaaggagagccaggaccacct
ggatcagatggattgccaggtttgaaaggaaaacgtggagacagtggatcacctgcaacc
tggacaacgagaggctttgtcttcacccgacacagtcaaaccacagcaattccttcatgt
ccagaggggacagtgccactctacagcgggttttcttttctttttgtacaaggaaatcaa
cgagcccacggacaagaccttggaactcttggcagctgcctgcagcgatttaccacaatg
ccattcttattctgcaatgtcaatgatgtatgtaatttcgcatctcgaaatgattattca
tactggctgtcaacaccagctctgatgccaatggacatggctcccattactggcagggcc
cttgagccttacataagcagatgcactgtttgtgaaggtcctgcgatcgccatagccgtt
cacagccaaagcactgacattcctccatgtcctcacggctggatttctctctggaaagga
ttttcattcatcatgttcacaagtgcaggttctgagggcaccgggcaagcactggcctcc
cctggctcctgcctggaagaattccgagccagcccatttctagaatgtcatggaagagga
acgtgcaactactattcaaattcctacagtttctggctggcttcattaaacccagaaaga
atgttcagaaagcctattccatcaactgtgaaagctggggaattagaaaaaataataagt
cgctgtcaggtgtgcatgaagaaaagacactga

KEGG   Pongo abelii (Sumatran orangutan): 100444160
Entry
100444160         CDS       T01416                                 

Gene name
COL4A1
Definition
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100444160 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100444160 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100444160 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100444160 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100444160 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100444160 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100444160 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100444160 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100444160 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100444160 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100444160 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100444160 (COL4A1)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100444160 (COL4A1)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100444160 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100444160
NCBI-ProteinID: XP_024086740
Ensembl: ENSPPYG00000005496
LinkDB
Position
13
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGNPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGG
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGLIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPTGPIGDKGQAGFPGGPGSPGLPGPKGEPGKVVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFDG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGLQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggcttctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaagaggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccgcagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggaattcctggccaagacggcccgccaggcccccca
ggtattccaggatgcaatggcacaaagggagagagagggccactcgggcctcctggcttg
cctggtttcgctggaaatcccggaccaccagggttaccgggaatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggtgaaagaggatttcccggaatc
ccagggaatccaggcccaccaggactgccagggcttcaaggtcctgttgggcctccagga
tttacgggaccaccaggtccccccggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccgaaaggtgacaagggtgaccaaggggtcagtgggcctccg
ggagtaccaggacaagctcaagttcaagaaaaaggagacttcgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaaccc
ggaaaaccaggacccagaggaaaacccggaaaagatggtgacaaaggggaaaaagggggt
cccggttttcctggtgaacccgggtacccaggactcataggccgccagggccctcaggga
gaaaagggtgaagcaggtcctcccggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccagggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggacctccaggcctcccggtacctgggcag
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
acatctctgccaggaccaagtggaagagatgggctcccaggtcctcctggttcccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggattgataggcgaaattggagagaaa
ggtcaaaaaggagagagttgcctcatctgtgatatagacggatatcgggggcctcccggg
ccacagggacccccaggagaaataggtttcccaggacagccaggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcgggagtgccagggcctcaaggtacaccagggctg
ataggccagccaggagccaagggggagcctggtgagatttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggcttcccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcgccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggattcccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctactggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
gttgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccagggctgccaggagag
aagggtgctgtgggccagccaggaattggatttccagggccccctggccccaaaggtgtt
gatggcttacctggagacatggggcctccggggactccaggtcgcccgggatttgatggc
ttacctgggaacccaggtgtgcagggccagaagggagagcctggagttggtctgccggga
ctcaaaggtttgccaggtcttcccggcattcctggcacacccggggagaaggggagcatt
ggggtaccaggcgttcctggagaacacggagcgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccctggaggacagggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggacttcctggcataacgggacagtcagggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccctgggcagccgggctcaccaggaccagtgggtgctccggga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggctttaaaggtgataagggggatgtcggtctccctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaaaggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaaggatctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcccggcatccctggcctgcaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcgtggtgaaaagggagatcaagggatagcgggtttcccaggaagccctggtgag
aagggagaaaaaggaagcatcgggatcccaggaatgccagggtccccaggccttaaaggg
tctcccgggagtgttggctatccaggaagccctgggctgcctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggtgtcaaaggagaagcaggtcttcctgggact
cctggccccacaggcccagctggccagaaaggggagccaggcagtgacggaatcccgggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagctgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcctccagggccccagggacag
ccggggttaccgggatccccaggccatgccacggaagggcccaaaggagaccgtggacct
cagggccagcctggcctgccaggacttccgggacctatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtattggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcagggtgtcccgggagctaaa
ggtctcccgggtcctcctggacccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggacttccaggcccgaaa
ggccagcaaggtgttacaggattggtgggtatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgctgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcacagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccatgggtactctttgctctacgtgcaaggcaatgaa
cgggcccatggccaggacctgggcacggccggcagctgcctgcgcaagttcagcacaatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgaaatgactattcg
tactggctgtccacccccgagcccatgcccatgtcaatggcacccatcacgggggacaac
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcccgccatggtgatggccgtg
cacagtcagaccattcagatcccaccgtgccccagcgggtggtcctcgctgtggatcggc
tactcttttgtgatgcacaccagtgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgtctggaggagtttagaagtgcgccattcatcgagtgtcacggccgtggg
acctgtaattactacgcaaacgcttacagcttttggctcgccaccatagagaggagcgag
atgttcaagaagcccacgccgtccaccttgaaggcaggggagctgcgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

KEGG   Pongo abelii (Sumatran orangutan): 100444523
Entry
100444523         CDS       T01416                                 

Gene name
COL4A2
Definition
(RefSeq) collagen alpha-2(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100444523 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100444523 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100444523 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100444523 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    100444523 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100444523 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100444523 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100444523 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100444523 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100444523 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100444523 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100444523 (COL4A2)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100444523 (COL4A2)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100444523 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100444523
NCBI-ProteinID: XP_024086739
Ensembl: ENSPPYG00000005497
LinkDB
Position
13
AA seq 1712 aa
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGITGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGYVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTRVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGPRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGPPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAEGGVGFPGLPGSPGARGPKGWKGDA
GDCRCTEGDEAIRGLPGLPGPKGFAGINGEPGTKGDKGDPGQHGLPGFPGLKGVPGNVGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAIGGDRQEVIQPGCIGGPKGLPGLPGPPGPTGTKGLRGIPGFSGADGGPGPKG
LPGDPGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGSPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGERGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGAPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGPTGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGESGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGADIHGDPGFPGPPGERGEPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGVPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDAGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGHPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAVAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
NT seq 5139 nt   +upstreamnt  +downstreamnt
atggggagagaccagcgcgcggtggccggccctgccctacggcggtggctgctgctgggg
acagtgaccgtggggttcctcgcccagagcgtcttggcgggtgtgaagaagtttgatgtg
ccgtgcggaggaagagattgcagtgggggctgccagtgctaccctgagaaaggtggacgt
ggtcaacctgggccagtgggcccccaggggtacaacgggccaccaggattacaaggattc
ccgggactgcagggacgcaaaggagacaagggtgaacggggagcccccggaataacagga
cccaagggcgacgtgggagcaagaggcgtttctggattccctggtgctgatggaattcct
ggacacccggggcaaggtgggcccaggggaaggccgggctacgatggctgcaacggaacc
cagggagactcaggtccacaggggcctcccggctctgaggggttcaccgggcctcccggg
ccccaaggaccaaaagggcagaaaggtgagccttatgcactgcctaaagaggaacgcgac
agatatcggggtgaacctggagagcctggattggtcggtttccagggacctcccggccgc
cctgggtatgtgggacagatgggtccagttggagctccagggagaccaggaccacccgga
ccccctggaccaaaaggacagcaaggcaacagaggacttggtttctacggagttaagggt
gaaaagggtgatgtagggcagccaggacccaatgggattccatcagacaccctccacccc
atcatcgcgcccacgagagtcaccttccacccagatcagtacaagggtgaaaaaggcagt
gagggggaaccaggaataagaggcatttccttgaagggagaagaaggaatcatgggcttt
cctggaccgaggggttaccctggcttgagtggtgaaaaaggatcaccaggacagaaggga
agccgaggcctggatggctatcaaggccctgatggaccccggggacccaagggagaagcc
ggagacccagggccccctggaccacctgcctactcccctcacccttccctagcaaaaggt
gccagaggtgacccgggattcccaggggcccaaggggagccaggaagccagggtgagcca
ggagacccgggcctcccaggcccccctggcctctccatcggagatggagatcagaggaga
ggcctgccgggtgagatgggacccaagggcttcatcggagaccccggcatccctgcgctc
tacgggggcccacctggacctgatggaaagcgagggcctccaggaccccccgggctccct
ggaccacctggacctgatggcttcctgtttgggctgaaaggagcagaaggaggagtgggc
ttccctgggcttcccggctcccctggagcccgcggaccaaagggatggaaaggtgatgct
ggggactgcagatgtacagaaggtgatgaagctatcagaggtcttccgggactgccagga
cccaagggcttcgcaggcatcaacggggagccgggaacgaaaggggacaaaggagacccc
ggccaacacggcctccccgggttcccagggctcaagggagtacctggcaacgttggagct
cccggacccaaaggagcaaaaggagattccagaacaatcacgaccaaaggtgagcgggga
cagcccggcgtcccaggtgtgcccgggatgaaaggtgacgatggcagcccaggccgcgat
gggctcgatggattccccggcctcccaggccctcccggcgatggcatcaagggccctcca
ggggatccaggctatccaggaatacctggaacaaagggtactccaggagaaatgggcccc
ccaggactgggccttcccggcctcaaaggccaacgtggtttccctggagacgccggctta
cctggaccaccaggcttcctgggccctcctggccccgcagggaccccaggacaaatagat
tgtgacacagatgtgaaaagggccattggaggtgacagacaggaggtcatccagccaggt
tgcataggagggcccaagggattgccaggcctgccaggacccccaggccccacaggtacc
aaaggcctccgaggaatcccaggcttctcaggagctgatggaggaccagggcccaagggc
ttgccaggagacccaggtcgtgaagggttcccaggacccccagggttcataggaccccga
ggatccaaaggtgcagtgggcctccctggcccagatggatccccaggtcccatcggcctg
ccagggccagacgggccccctggggaaaggggccttcctggagaagtcctgggagcccag
cccgggccacggggagatgctggtgtgcctggacagcctgggcttaaaggccttcctgga
gacagaggctcccctggattcagaggaagccaagggatgcctgggatgccagggctgaag
gggcagccaggcctcccaggaccttccggccagccaggcctatacgggcctccaggactg
catggattcccaggagctcctggccaagaggggcccttggggctgccaggaatcccaggc
cgtgaaggtctgcctggtgatagaggggaccctggggacacaggcgctcctggccctgtg
ggaatgaaaggtctctctggtgacagaggagatgctggcttcacaggggagcgaggccat
ccaggaagccctggatttaaaggaattgatggaatgcctgggacccccgggctaaaagga
gatagaggctcacctgggatggatggtttccaaggcatgcctggactcaaagggagaccc
gggtttccagggagcaaaggcgaggctggatttttcggaatacccggtctgaagggtctg
gctggtgagccaggttttaaaggcagccgaggggaccctgggcccccaggaccacctcct
gtcatcctgccaggaatgaaagacattaaaggagagaaaggagacgaagggcctatgggg
ctgaaaggatacctgggcgcaaaaggtatccaaggaatgccaggcatcccggggctgtcg
ggaatccctgggctgcctgggaggcccggccacatcaaaggagtcaagggagacatcgga
gcccccggcatccccggtttgccaggattccctggggtggctggcccccctggaattaca
ggattcccaggattcataggaagccggggtgacaaaggtgccccagggagagcaggcctg
tatggcgagatcggcccgactggtgatttcggtgacatcggggacactataaatttacca
ggaagaccaggcctgaagggggagcggggcaccactggaataccaggtctgaagggattc
tttggagagaagggaacagaaggtgacatcggcttccctgggataacaggcgtgactgga
gtccaaggccctcctggacttaaaggacaaacaggctttccagggctgactgggcctcca
gggtcgcagggagagtcggggcggattggactgcctggtggcaaaggagatgatggttgg
ccgggagctccgggcttaccaggttttccgggactccgtgggatccgcggcttacacggc
ttgccaggcaccaaaggctttccaggatccccaggtgctgacatccacggagacccaggc
ttcccaggccctcctggggaaagaggtgaaccaggagaggccaacacccttccaggccct
gtgggagtcccaggacagaaaggagaccaaggagctccaggggaacgaggcccacctggg
agtccaggacttcaggggttccctggcatcacgcccccttccaacatctctggggtacct
ggtgacaaaggggcgccagggatatttggcctgaaaggttatcggggcccaccagggcca
ccagggtctgctgctcttcctggaagcaaaggtgacacagggaacccaggagctccagga
accccagggaccaaaggatgggccggggacgccgggccccagggcaggcctggtgtgttc
ggtctcccgggagaaaaagggcccaggggtgaacaaggcttcatggggaacactggaccc
accggggcggtgggcgacagaggccccaagggacccaagggagacccaggattccctggt
gcccccgggactgtgggagcccccgggattgcaggaatcccccagaagattgccgtccaa
ccagggacagtgggtccccaggggaggcgaggccaccctggggcaccgggggagatgggg
ccccagggcccccccggagaaccaggtttccgtggggctccagggaaagctgggccccaa
ggaagaggtggtgtgtctgctgttcccgggttccggggagatgaaggacccataggccac
caggggccgattggccaagaaggtgcaccaggccgtccagggagcccaggcctgccgggt
atgccaggccgcagcgtcagcatcggctacctcctggtgaagcacagccagacggaccag
gagcccatgtgcccggtgggcatgaacaagctctggagtgggtacagcctgctgtacttc
gagggccaggagaaggcgcacaaccaggacctggggctggcgggctcctgcctggcgcgg
ttcagcaccatgcccttcctgtactgcaaccctggtgatgtctgctactatgccagccgg
aacgacaagtcctactggctctctaccactgccccgctgcccatgatgcctgtggccgag
gacgagatcaagccctacatcagccgctgctctgtgtgtgaggccccagccgtcgccatc
gcggtccacagtcaggatgtctccattccacactgcccagctgggtggcggagtttgtgg
atcggatattccttcctcatgcacacggcggcgggagacgaaggcggtggccagtcactg
gtgtcaccaggcagctgtctggaggacttccgtgccacaccattcatcgagtgcaatgga
ggccgcggcacctgccactactacgccaacaagtacagcttctggctgaccaccattccc
gagcagagcttccagggctcgccctctgccgacacgctcaaggccggcctcatccgcaca
cacatcagccgctgccaggtgtgcatgaagaacctgtga

KEGG   Pongo abelii (Sumatran orangutan): 100445025
Entry
100445025         CDS       T01416                                 

Gene name
COL9A2
Definition
(RefSeq) collagen alpha-2(IX) chain isoform X1
  KO
K08131  collagen type IX alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100445025 (COL9A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100445025 (COL9A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100445025 (COL9A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100445025 (COL9A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100445025 (COL9A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:pon00535]
    100445025 (COL9A2)
Proteoglycans [BR:pon00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   100445025 (COL9A2)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 100445025
NCBI-ProteinID: XP_024107288
LinkDB
Position
1
AA seq 689 aa
MAAAAAAPRSLLVLLQVLVLALAQIRGPPGERGPPGPPGPPGVPGSDGVDGDKGPPGKAG
PPGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGVKGQPGLPGPPGLPGPGFAGP
PGPPGPVGLPGEIGIRGPKGDPGPDGPSGPPGPPGKPGRPGTIQGLEGSADFLCPTDCPP
GMKGPPGLQGVKGHAGKRGILGDPGRQGKPGPKGDVGASGEQGIPGPPGPQGIRGYPGMA
GPKGETGPHGYKGMVGAVGATGPPGEEGPRGPPGRAGEKGDLGSPGIRGPQGITGPKGAT
GPPGINGKDGTPGTPGMKGSAGQAGQPGSPGHQGLAGVPGQPGTKGGPGDQGEPGPQGLP
GFSGPPGKEGEPGPRGEIGPQGIMGQKGDQGERGPVGQPGPQGRQGPKGEQGPPGIPGPQ
GLPGVKGDKGSPGKTGPRGGVGDPGVAGLPGEKGEKGESGEPGPKGQQGVRGEPGYPGPS
GDAGAPGVQGYPGPPGPRGLAGNRGVPGQPGRQGVEGRDATDQHIVDVALKMLQEQLAEV
AVSAKREALGAVGMMGPPGPPGPPGYPGKQGPHGHPGPRGIPGIVGAVGQIGNTGPKGKR
GEKGDPGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDRGSPGAPGEAGRPGLPGPVGL
PGFCEPAACLGASAYASARLAEPGSIKGP
NT seq 2070 nt   +upstreamnt  +downstreamnt
atggccgccgcggcggccgccccccgcagcctccttgttctcctccaggtgctagtgctc
gctctggcgcagatcagaggtccaccgggagagcggggccccccggggcccccgggaccg
ccgggagtgcctggatccgacggcgtcgacggtgacaaggggccccctggaaaagctggc
cctccgggacccaagggcgagcctggcaaagctgggccagatgggccagacgggaagccc
gggattgatggtttaactggagccaagggggagcctggccccatggggatccctggagtc
aagggccagcccgggcttcctggtcctcctggccttccgggccctggttttgctggacct
cctgggccgcctggacctgttggcctccctggtgagattggaatccgaggccccaagggg
gaccctggaccagatggaccatcggggcccccaggaccccctgggaaacctggtcgcccg
ggaaccatccagggtctggaaggcagtgcggatttcctgtgtccaaccgactgtccaccc
gggatgaaaggtcccccagggctgcagggagtgaaggggcatgcgggcaaacgcgggatt
ctgggtgatcctggccgccaggggaagccgggtcccaagggagatgtgggtgcctctgga
gagcaaggcatccctggaccaccgggtccccagggtatcaggggctacccgggcatggca
ggtcccaagggagagacgggccctcatggatataaaggcatggtgggcgccgtcggtgcc
actgggccaccgggtgaggaaggtcctaggggaccaccaggccgagctggggagaagggt
gacttgggcagcccaggtattcgtggaccccaggggatcacaggcccgaaaggagcaacg
ggccccccaggcatcaatggcaaggacgggaccccaggcacacctggcatgaagggcagt
gcaggacaggcgggacagccaggaagtccaggccaccagggcctagcgggtgtgccaggt
cagcctgggacaaaaggaggccctggagaccagggtgagccgggcccgcagggccttcct
ggattctctggtccccctgggaaagagggagagccagggcctcgaggagaaattggtccc
cagggcatcatgggacagaagggtgaccaaggcgagaggggtccagtggggcagccaggc
cctcagggaaggcagggccctaagggggagcagggcccccctggaattccagggccccaa
ggcttgccaggtgtcaaaggagacaagggctccccagggaagaccgggccccgcggcgga
gtgggtgacccgggggtggccggcctccccggagagaaaggcgagaagggcgagtccggc
gagccggggcccaagggacagcaaggtgtacgtggagaacccggctaccctggccccagc
ggggatgcgggcgccccaggggttcagggctaccctggtccccccggacctcgaggactg
gctgggaaccgaggcgtgccaggacagcccgggagacagggcgtggagggccgggatgcc
actgaccagcacatcgtggatgtggcgctgaagatgctgcaagagcaactggcagaggtt
gccgtgagtgccaagcgggaagccctgggtgcggtgggcatgatgggtcccccaggacct
cctgggccccctgggtacccaggcaagcagggaccccatgggcaccctggccctcggggc
attcctggcatcgtgggagccgtgggtcagatcggcaacacggggcccaagggaaaacgt
ggagagaagggtgatccaggagaagtgggacgggggcaccccgggatgcctgggccccca
gggatcccaggactccctggccggcctggccaggcaatcaacggcaaggatggagatcga
gggtccccaggggctccaggagaggcaggtcgacctggcctgccaggccccgtggggctg
ccaggcttctgtgaacctgcagcctgccttggagcttcggcctatgcctctgcccgcctt
gcagagcctggatccatcaaggggccttga

KEGG   Pongo abelii (Sumatran orangutan): 100446634
Entry
100446634         CDS       T01416                                 

Gene name
LAMC2
Definition
(RefSeq) laminin subunit gamma-2
  KO
K06246  laminin, gamma 2
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100446634 (LAMC2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100446634 (LAMC2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100446634 (LAMC2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100446634 (LAMC2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100446634 (LAMC2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100446634 (LAMC2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100446634 (LAMC2)
   05145 Toxoplasmosis
    100446634 (LAMC2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_B
Other DBs
NCBI-GeneID: 100446634
NCBI-ProteinID: XP_024099499
Ensembl: ENSPPYG00000000429
LinkDB
Position
1
AA seq 1219 aa
MLVTLRVSLEHLQDGLPRYLRKFILHGTPGFGRRHMPRQSRPLLALTGTGGMVCDCNGKS
RQCIFDQELHRQTGNGFRCLNCNDNTDGIHCERCKDGFYRHRERDRCLPCNCNSKGSLSA
RCDNSGWCSCKPGVTGARCDRCLPGFHMLTDAGCTQDQRLLDSKCDCDPAGIAGPCDAGR
CVCKPAVTGERCDRCRSGYYNLDGGNPEGCTQCFCYGHSASCRSSADYSVHKITSTFHQD
VDGWKAVQRNGSPAKLQWSQRHQDVFSSAQRLDPVYFVAPAKFLGNQQVSYGQSLSFDYR
VDRGGRHPSAHDVILEGAGLRITAPLMPIGKTLPCGLTKTYTFRLNEHPSSNWSPQLSYF
EYRRLLRNLTALRIRATYGEYSTGYIDNVTLISARPVSGAPAPWVEQCICPVGYKGQFCQ
DCASGYKRDSARLGPFGTCIPCNCQGGGACDPDTGDCYSGDENPDIECADCPIGFYNDPH
DPRSCKPCPCHNGFSCSVMPETEEVVCNNCPPGVTGARCELCADGYFGDPFGEHGPVRPC
QPCQCNNNVDPSASGNCDRLTGRCLKCIHNTAGIYCDQCKAGYFGDPLAPNPADKCQACN
CNPMGSEPVECRSDGTCVCKPGFGGPNCEHGAFSCPACYNQVKIQMDQFMQQLQRMEALI
SKAQGGDGAVPDKELEGRMQQAEQALQDILRDAQISEGASRSLGLQLAKVRSQENSYRSR
LDDLKMTVERIRALGSQYQNRVRDTHRLITQMQLSLAESEASLRNTNIPASDHYVGPNGF
KSLAQEATRLAESHVESASNVEQLTRETEDYSKQALSLVRKALHEGVGSGSGSPNGAVVQ
GLVEKLEKTKSLAQQLTREATQAEIEADRSYQHSLRLLDSVSQLQGVNDQSFQVEEAKKI
KQKADSLSSLVTRHMDEFKRTQKNLGNWKEETQQLLQNGKSGREKSDQLLSRANLAKSRA
QEALSMGNATFYEVESILKNLREFDLQVDNRKAEAEEAMKRLSYISQKVSDASDKTQQAE
RALGSAAADAQRAKNGAREALEISSEIEQEIGSLNLEANVTADGALAMEKGLASLKSEMR
EVEGELERKELEFDTNMDAVQMVITEAQRVDTRAKNAGVTIQDTLNTLDGLLHLMDQPVS
VDEEGLILLEQKLSRAKTQINSQLRPMMSELEERARRQRGHLHLLETSIDGILADVKNLE
NIRDNLPPGCYNTQALEQQ
NT seq 3660 nt   +upstreamnt  +downstreamnt
atgttagtcactttgcgagtctctttggaacatctgcaggatgggcttcctagatacttg
agaaagttcattcttcatgggacacctggctttgggaggagacacatgcccagacagtca
cgtccgctcctcgccttgacgggcaccggtgggatggtctgtgattgcaatggaaagtcc
aggcagtgtatctttgatcaggaacttcacagacaaactggtaatggattccgctgcctc
aactgcaatgacaacactgatggcattcactgcgagaggtgcaaggacggcttttaccgg
cacagagaaagggaccgctgtttgccctgcaattgtaactccaaaggttctcttagtgct
cgatgtgacaactccggatggtgcagctgtaaaccaggtgtgacaggagccagatgcgac
cgatgtctgccaggcttccacatgctcacagatgcggggtgcacccaagaccagagactg
ctagactccaagtgtgactgtgacccagctggcatcgcagggccctgtgacgcgggccgc
tgtgtctgcaagccagctgtcactggagaacgctgtgataggtgtcgatcaggttactat
aatctggatggggggaaccctgagggctgtacccagtgtttctgctacggacattcagcc
agctgccgcagctctgcagactacagtgtccataagatcacctctacctttcatcaagat
gttgatggctggaaggctgtccaacgaaatgggtctcctgcaaagctccaatggtcgcag
cgccatcaagatgtgtttagctcagcccaacgactagaccctgtctattttgtggctcct
gccaaatttcttgggaatcaacaggtgagctatgggcaaagcctgtcctttgactaccgt
gtggacagaggaggcagacacccatctgcccatgatgtgatcctggaaggtgctggtcta
cggatcacagctcccttgatgccaattggcaagacactgccttgtgggctcaccaagact
tacacattcaggttaaatgagcatccaagcagtaattggagcccccagctgagttacttt
gagtatcgaaggttactgcggaatctcacagccctccgcatccgagctacatatggagaa
tacagtactgggtacattgacaatgtgaccctgatttcagcccgccctgtctctggagcc
ccagcaccctgggttgaacagtgtatatgtcctgttgggtacaaggggcaattctgccag
gattgtgcttctggctacaagagagattcagcgagactggggccttttggcacctgtatt
ccttgtaactgtcaagggggaggggcctgtgatccagacacaggagattgttattcaggg
gatgagaatcctgacattgagtgtgctgactgcccaattggtttctacaacgatccgcac
gacccccgcagctgcaagccatgtccctgtcataacgggttcagctgctcagtgatgcca
gagacagaggaggtggtgtgcaataactgccctcccggggtcaccggtgcccgctgtgag
ctctgtgctgatggctactttggggacccctttggtgaacatggcccagtgaggccttgt
cagccctgtcaatgcaacaacaatgtggaccccagtgcctctgggaactgtgaccggctg
acaggcaggtgtttgaagtgtatccacaacacagccggcatctactgcgaccagtgcaaa
gcaggctacttcggggacccgttggctcccaacccagcagacaagtgtcaagcttgcaac
tgtaaccccatgggctcagagcctgtagaatgtcgaagtgatggcacctgtgtttgcaag
ccaggatttggtggccccaactgtgagcatggagcattcagctgtccagcttgctataat
caagtgaaaattcagatggatcagtttatgcagcagcttcagagaatggaggccctgatt
tcaaaggctcagggtggtgatggagcagtacctgataaagagctggaaggcaggatgcag
caggctgagcaggcccttcaggacattctgagagatgcccagatttcagaaggtgctagc
agatcccttggtctccagttggccaaggtgaggagccaagagaacagctaccggagccgc
ctggatgacctcaagatgactgtggaaagaattcgggctctgggaagtcagtaccagaac
cgagttcgggatactcacaggctcatcactcagatgcagctgagcctggcagaaagtgaa
gcttccttgagaaacactaacattcctgcctcagaccactacgtggggccaaatggcttt
aaaagtctggctcaggaggccacgagattagcagaaagccacgttgagtcggccagtaac
gtggagcaactgacaagggaaactgaggactattccaaacaagccctctcactggtgcgc
aaggccttgcatgaaggagttggaagcggaagcggtagcccgaacggtgctgtggtgcaa
gggcttgtggaaaaattggagaaaaccaagtccctggcccagcagttgacaagggaggcc
actcaagcggaaattgaagcagataggtcttatcagcacagtctccgcctcctggattca
gtgtctcagcttcagggagtcaatgatcagtcctttcaggtggaagaagcaaagaagatc
aaacaaaaagctgattcactctcaagcctggtaaccaggcatatggatgagttcaagcgt
acacaaaagaatctgggaaactggaaagaagaaacacagcagctcttacagaatggaaaa
agtgggagagagaaatcagatcagctgctttcccgtgccaatcttgctaaaagcagagca
caagaagcactgagtatgggcaatgccactttttatgaagttgagagcatccttaaaaac
ctcagagagtttgacctgcaggtggacaacagaaaagcagaagctgaagaagccatgaag
agactctcctacatcagccagaaggtttcagacgccagtgacaagacccagcaagcagaa
agagccctggggagcgctgctgctgacgcgcagagggcaaagaatggggccagggaggcc
ctggaaatctccagtgagattgaacaggagattgggagtctgaacttggaagccaatgtg
acagcagatggagccttggccatggaaaagggactggcctctctgaagagtgagatgagg
gaagtggaaggagagctggaaaggaaggagctggagtttgacacgaatatggatgcagta
cagatggtgattacagaagcccagagggttgataccagagccaagaacgctggggttaca
atccaagacacactcaacacattggatggcctcctgcatctgatggaccagcctgtcagt
gtagatgaagaggggctgatcttattggagcagaagctttcccgagccaagacccagatc
aacagccaactgcggcccatgatgtcagagctggaagagagggcacgtcggcagaggggt
cacctccatttgctggagacaagcatagatgggattctggctgatgtgaagaacttggag
aacattagggacaacctgcccccaggctgctacaatacccaggctcttgagcaacagtga

KEGG   Pongo abelii (Sumatran orangutan): 100446946
Entry
100446946         CDS       T01416                                 

Gene name
COL9A1
Definition
(RefSeq) collagen alpha-1(IX) chain
  KO
K08131  collagen type IX alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100446946 (COL9A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100446946 (COL9A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100446946 (COL9A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100446946 (COL9A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100446946 (COL9A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:pon00535]
    100446946 (COL9A1)
Proteoglycans [BR:pon00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   100446946 (COL9A1)
SSDB
Motif
Pfam: Collagen Laminin_G_3 Laminin_G_2
Other DBs
NCBI-GeneID: 100446946
NCBI-ProteinID: XP_002817092
Ensembl: ENSPPYG00000016759
UniProt: A0A2J8XWU2
LinkDB
Position
6
AA seq 921 aa
MKTCWKIPVFFFVCSFLEPWASAAVKRRPRFPVNSNSNGGNELCPKIRIGQDDLPGFDLI
SQFQVDKAASRRAIQRVVGSATLQVAYKLGNNVDFRIPTRNLYPNGLPEEYSFLTTFRMT
GSTLKKNWNVWQIQDSSGKEQVGIKINGQTQSVAFSYKGLDGSLQTAAFSNLPSLFDSRW
HKIMIGVERSSATLFVDCNRIESLPIKPRGPIDIDGFAVLGKLADNPQVSVPFELQWMLI
HCDPLRPRRETCHELPARITPSQTTDERGPPGEQGPPGPPGPPGVPGIDGIDGDRGPKGP
PGPPGPAGEPGKPGAPGKPGTPGADGLTGPDGSPGSVGPKGQKGEPGVPGSRGFPGRGIP
GPPGPPGTAGLPGELGRVGPIGDPGRRGPPGPPGPPGPRGTIGFHDGDPLCPNACPPGRS
GYPGLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGELGEVGAQGPPGAQGLRGITGIVGDK
GEKGARGLDGEPGPQGLPGAPGDQGQRGPPGEAGPKGDRGAEGARGIPGLPGPKGDTGLP
GVDGRDGIPGMPGTKGEPGKPGPPGDAGLQGLPGVPGIPGAKGVAGEKGSTGAPGKPGQM
GNSGKPGQQGPPGEVGPRGPQGLPGSRGELGPVGSPGLPGKPGSLGSPGLPGLPGPPGLP
GTKGDRGVVGEPGPKGEQGASGEEGEAGERGELGDIGLPGPKGSAGNPGEPGLRGPEGSR
GLPGVEGPRGPPGPRGVQGEQGATGLPGVQGPPGRAPTDQHIKQVCMRVIQEHFAEMAAS
LKRPDSGATGLPGRPGPPGPPGPPGENGFPGQMGIRGLPGIKGPPGALGLRGPKGDLGEK
GERGPPGRGPKGLPGAIGLPGDPGPASYGRNGRDGERGPPGVAGIPGVPGPPGPPGLPGF
CEPASCTMQAGQRAFSKGPDP
NT seq 2766 nt   +upstreamnt  +downstreamnt
atgaagacctgctggaaaattccggttttcttctttgtgtgcagtttcctggaaccctgg
gcatctgcagctgtcaagcgtcgccccagattccctgtcaattccaattctaatggtgga
aatgaactctgtccaaagatcaggattggccaagatgacttaccagggtttgatctgatc
tctcagttccaggtagataaagcagcatctagaagagctatccagagagtagtgggatca
gctacattgcaggtggcttacaagttgggaaataatgtagacttcaggattccaactagg
aatttatatcccaatggactgcctgaagaatactccttcttgacaacatttcgaatgact
ggaagcacactcaaaaagaactggaacgtttggcagattcaggattcctctgggaaggag
caagttggcataaagattaatggccaaacacaatctgttgcattttcatacaagggactg
gacggaagtctccaaacggcagccttttcgaatttgccctccttgtttgattcccggtgg
cataagatcatgattggcgtggagaggagtagtgctactctttttgttgactgcaacagg
attgaatctttacctataaagccaagaggcccaattgacattgatggctttgctgtgctg
ggaaaacttgcagataatcctcaagtttctgttccatttgaacttcaatggatgctgatc
cactgtgaccccctgcggcccaggagagaaacttgccatgagctgccagccagaataacg
cccagccagaccaccgacgagagaggtcccccgggcgagcagggtcctcccgggcctcca
ggcccacctggagttccaggcatcgatggcatcgacggtgaccgaggtcctaagggcccc
ccgggccccccgggtcctgcaggtgaaccgggaaagccaggagctccaggcaagcctggc
acacctggcgctgatggattaacaggacctgatggatcccctggctccgttgggccaaag
ggacaaaaaggagaacctggtgtgcctggatcgcgtggatttccaggccgtggtattcct
ggaccccctggtcctcctgggacagcaggactccctggagagcttggccgtgtaggacct
attggtgaccctgggagaagaggaccacctggcccccctggccccccaggacccagagga
acaattggctttcatgatggagatccattgtgtcccaatgcctgtccaccaggtcgctca
ggatatccaggcctaccaggcatgaggggtcataaaggggctaaaggagaaattggtgaa
ccaggaagacaaggacacaagggtgaagaaggtgaccagggagaactgggagaagttgga
gctcaaggacctccaggagcccagggtttgcgaggcatcactggcatagttggggacaaa
ggggaaaaaggtgctcggggcttagatggtgaacctgggcctcagggtcttcctggtgca
cctggtgatcaaggacagcgaggacctccaggagaagcaggtcccaaaggagatagaggc
gctgaaggtgctagaggaattcctggtctccctgggcccaaaggagacacgggtttgcca
ggtgtggatggccgtgatgggatccctggaatgcctggaacaaagggtgaaccaggaaaa
cctgggcctcctggtgatgcaggattgcaggggttaccaggtgtacctggaattcctggt
gcaaagggtgttgctggtgaaaagggtagcactggtgctccagggaagcctggtcagatg
ggaaattcaggcaaaccgggccaacaggggcctccaggagaggtgggaccccgaggaccc
caggggcttcctggcagtagaggagaattaggaccagtgggatccccaggcctaccaggt
aaaccgggttctctgggtagccctggcctccctggcttgcctgggccccctggacttcct
ggaacgaaaggtgacaggggtgtcgtcggtgaaccgggtccaaagggtgaacagggtgcc
tctggtgaagaaggtgaagcaggagaaaggggggaacttggagatataggattacctggc
ccaaagggatctgcaggtaatcctggggaacctggcttgagagggcctgagggaagtcgg
gggcttcctggagtggaaggaccaagaggaccacctggaccccggggcgtgcagggagaa
cagggtgccaccggcctgcctggtgtccagggccctccgggtagagcaccgacagatcag
cacattaagcaggtttgcatgagagtcatacaagagcattttgctgagatggctgctagt
cttaagcgtccagactcgggtgccactgggcttcctggaaggcctggtcctcctggtccc
cctggccctcctggagagaatggtttcccaggccagatgggaattcgtggccttccgggc
attaaggggccccctggtgctcttggtttgagaggacctaaaggtgacttgggagaaaag
ggggagcgtggccctccaggaagaggtcccaaaggtttgcctggagctataggtctccca
ggtgacccaggccctgccagctatggcagaaatggccgagacggtgagcgaggcccccca
ggggtggcaggaattcctggagtgcctggacccccgggacctcctgggcttcccggtttc
tgtgagccagcctcctgcaccatgcaggctggtcagcgagcatttagcaaagggcctgac
ccttga

KEGG   Pongo abelii (Sumatran orangutan): 100446999
Entry
100446999         CDS       T01416                                 

Gene name
LAMC1
Definition
(RefSeq) laminin subunit gamma-1
  KO
K05635  laminin, gamma 1
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05020  Prion disease
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100446999 (LAMC1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100446999 (LAMC1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100446999 (LAMC1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100446999 (LAMC1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100446999 (LAMC1)
  09164 Neurodegenerative disease
   05020 Prion disease
    100446999 (LAMC1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100446999 (LAMC1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100446999 (LAMC1)
   05145 Toxoplasmosis
    100446999 (LAMC1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B PurA
Other DBs
NCBI-GeneID: 100446999
NCBI-ProteinID: XP_009238479
LinkDB
Position
1
AA seq 1608 aa
MRGSHRAAPAVRPRGRLWPVLAVLAAAAAGCAQAAMDECTDEGGRPQRCMPEFVNAAFNV
TVVATNTCGTPPEEYCVQTGVTGVTKSCHLCDAGQPHLQHGAAFLTDYNNQADTTWWQSQ
TMLAGVQYPSSINLTLHLGKAFDITYVRLKFHTSRPESFAIYKRTREDGPWIPYQYYSGS
CENTYSKANRGFIRTGGDEQQALCTDEFSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQ
EWVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYAISDFAVGGRCKCNGHASECMKNEFD
KLVCNCKHNTYGVDCEKCLPFFNDRPWRRATAESASECLPCDCNGRSQECYFDPELYRST
GHGGHCTNCQDNTDGAHCERCRENFFRLGNNEACSPCHCSPVGSLSTQCDSYGRCSCKPG
VMGDKCDRCQPGFHSLTEAGCRPCSCDPSGSIDECNVETGRCVCKDNVEGFNCERCKPGF
FNLESSNPRGCTPCFCFGHSSVCTNAVGYSVYSISSTFQIDEDGWRVEQRDGSEASLEWS
SERQDIAVISDSYFPRYFIAPAKFLGKQLLSYGQNLSFSFRVDRRDTRLSAEDLVLEGAG
LRVSVPLIAQGNSYPSETTVKYVFRLHEAADYPWRPALTPFEFQKLLNNLTSIKIRGTYS
ERSAGYLDDVTLASARPGPGVPATWVESCTCPVGYGGQFCEMCLSGYRRETPNLGPYSPC
VLCACNGHSETCDPETGVCNCRDNTAGLHCEKCSDGYYGDSTAGTSSDCQPCPCPGGSSC
AVVPKTKEVVCTNCPTGTTGKRCELCDDGYFGDPLGRNGPVRLCRLCQCNDNIDPNAVGN
CNRLTGECLKCIYNTAGFYCDRCKDGFFGNPLAPNPADKCKACNCNPYGTVKQQSSCNPV
TGQCECLPHVTGRDCGACDPGFYNLQSGQGCERCDCHALGSTNGQCDIRTGQCECQPGIT
GQHCERCEVNHFGFGPEGCKPCDCHPEGSLSLQCKDDGRCECREGFVGNRCDECEENYFY
NRSWPGCQECPACYRLVKDKVADHRVKLQELESLIANLGTGDEMVTDQAFEDRLKEAERE
VMDLLREAQDVKDVDQNLMDRLQRVNNTLSSQISRLQNIRNTIEETGNLAEQARAHVENT
ERLIEIASRELEKAKVAAANVSVTQPESTGDPNNMTLLAEEARKLAERHKQEADDIVRVA
KTANDTSTEAYNLLLRTLAGENQTAFEIEELNRKYEQVKNISQDLEKQAARVHEEAKRAG
DKAVEIYASVAQLSPLDSETLENEANNIKMEAENLEHLIDQKLKDYEDLKEDMRGKELEV
KNLLEKGKTEQQTADQLLARADAAKALAEEAAKKGRDTLQEANDILNNLKDFDRRVNDNK
TAAEEALRKIPAINQTIIEANEKTREAQQALGNAAADATEAKNKAHEAERIASAVQKNAT
STKAEAERTFAEVTDLDNEVNSMLKQLQEAEKELKRKQDDADQDMMMAGMASQAAQEAEI
NARKAKNSVTSLLSIINDLLEQLGQLDTVDLNKLNEIEGTLNKAKDEMKVSDLDRKVSDL
ENEAKKQEAAIMDYNRDIEEIMKDIRNLEDIRKTLPSGCFNTPSIEKP
NT seq 4827 nt   +upstreamnt  +downstreamnt
atgagggggagccatcgggccgcgccggccgtgcgaccccgggggcggctctggcccgtg
ctggccgtgctggcggccgccgcggcgggctgtgcccaggcagccatggacgagtgcacg
gacgagggcgggcggccgcagcgctgcatgcccgagttcgtcaacgccgccttcaacgtg
actgtggtggccaccaacacgtgtgggactccgcccgaggaatactgtgtgcagaccggg
gtgaccggggtcaccaagtcctgtcacctgtgcgacgccgggcagccccacctgcagcac
ggggcagccttcctgaccgactacaacaaccaggccgacaccacctggtggcaaagccag
accatgctggccggggtgcagtaccccagctccatcaacctcacgctgcacctgggaaaa
gcttttgacatcacctatgtgcgtctcaagttccacaccagccgcccagagagctttgcc
atttacaagcgcacaagggaagatgggccctggattccttaccagtactacagtggttcc
tgcgagaacacctactccaaggcaaaccgcggcttcatcaggacaggaggggacgagcag
caggccttgtgtactgatgaattcagtgacatttctcccctcactgggggcaatgtggcc
ttttctaccctggaaggaaggcccagcgcctataactttgacaatagccctgtgctgcag
gaatgggtaactgccactgacatcagagtaactcttaatcgcctgaacacttttggagat
gaagtgtttaacgatcccaaagttctcaagtcctattattatgcgatctctgattttgct
gtgggtggcagatgtaaatgtaatggacatgcaagcgagtgtatgaagaacgaatttgat
aagctggtgtgtaattgcaaacataacacatatggagtagactgtgaaaagtgtcttcct
ttcttcaatgaccggccgtggaggagggcaactgcagaaagtgccagtgaatgcctgccc
tgtgactgcaatggtcgatcccaggaatgctacttcgatcctgaactctatcgttccact
ggccatgggggccactgtaccaactgccaggataacacagatggcgcccactgtgagagg
tgccgggagaacttcttccgccttggaaacaatgaagcctgctctccatgccactgtagt
cctgtgggctctctaagcacacagtgtgatagttatggcagatgcagctgtaagccagga
gtgatgggggacaaatgtgaccgttgccagcctggattccattctctcactgaagcagga
tgcaggccatgctcttgtgatccctctggcagcatagatgaatgtaatgttgaaacagga
agatgtgtttgcaaagacaatgtcgaaggcttcaattgtgaaagatgcaaacctggattt
tttaatctggaatcatctaatcctcggggttgcacaccctgcttctgctttgggcattct
tctgtctgtacaaatgctgttggctacagtgtttattctatctcctctacctttcagatt
gatgaggatgggtggcgtgtggaacagagagatggctctgaagcatctctcgagtggtcc
tctgagaggcaagatatcgctgtgatctcagacagctactttcctcggtacttcattgct
cctgcaaagttcttgggcaagcagttgttgagttatggtcagaacctctccttctccttt
cgagtggacaggcgcgatactcgcctctctgcagaagaccttgtgcttgagggagctggc
ttaagagtatcggtacccttgatcgctcagggcaattcctatccaagtgagaccactgtg
aagtatgtcttcaggctccatgaagcagcagattacccttggaggcctgctcttacccct
tttgaatttcagaagctcctaaacaacttgacctctatcaagatccgtgggacatacagt
gagagaagtgctggatatttggatgatgtcaccctggcaagtgctcgtcctgggcctgga
gttcctgcaacttgggtggagtcctgcacctgtcctgtgggatatggagggcagttttgt
gagatgtgtctctcaggttacagaagagaaactcctaatcttggaccatacagtccatgt
gtgctttgcgcctgcaatggacacagcgagacctgtgatcctgagacaggtgtttgtaac
tgcagagacaatacagctggcctgcactgtgagaagtgcagtgatgggtactatggagat
tcaactgcaggcacctcctccgattgccaaccctgtccgtgtcctggaggttcaagttgt
gctgttgtccccaagacaaaggaggtggtgtgcaccaactgtcctactggcaccactggt
aagagatgtgagctctgtgatgatggctactttggagaccccctgggtagaaacggccct
gtgagactttgccgtctgtgccagtgcaatgacaacatcgatcccaacgcagttggaaat
tgcaatcgcttgacaggagaatgcctgaagtgcatctataacactgctggcttctattgt
gaccggtgcaaagacggattttttggaaatcctctggctcccaatccagcagacaaatgc
aaagcctgcaattgcaatccatatgggacagtgaagcagcagagcagctgtaaccccgtg
acggggcaatgtgaatgtttgcctcacgtgactggccgggactgtggtgcctgtgaccct
ggattctacaatctgcagagtgggcaaggctgtgaaaggtgtgactgccatgccttgggc
tccaccaatgggcagtgtgacatccgcaccggccagtgtgagtgccagcccggcatcacc
ggtcagcactgtgagcgctgtgaggtcaaccactttgggtttggacctgaaggctgcaaa
ccctgtgactgtcatcccgagggatctctttcacttcagtgcaaagatgatggtcgctgt
gaatgcagagaaggctttgtgggaaatcgctgtgacgagtgtgaagaaaactatttctac
aatcggtcttggcctggctgccaggaatgtccagcttgttaccggctggtaaaggataag
gttgctgatcatagagtgaagctccaggaattagagagtcttatagcaaaccttggaact
ggggatgagatggtgacagatcaagccttcgaggatagactaaaggaagcagagagggaa
gttatggacctccttcgtgaggcccaggatgtcaaagatgttgaccagaatttgatggat
cgcctacagagagtgaataacactctgtccagccaaattagccgtttacagaatatccgg
aataccattgaagagactggaaacttggctgaacaagcacgtgcccatgtagagaacaca
gagcggttgattgaaatcgcatccagagaacttgagaaagcaaaagtcgctgctgccaat
gtgtcagtcactcagccagaatctaccggggacccaaacaacatgactcttttggcagaa
gaggctcgaaagcttgctgaacgtcataaacaggaagctgatgacattgttcgagtggca
aagacagccaatgatacatcaactgaggcatacaacctgcttttgaggacactggcagga
gaaaatcaaacagcatttgagattgaagagcttaataggaagtatgaacaagtgaagaac
atctcacaggatctggaaaaacaagctgcccgagtacatgaggaggccaaaagggctggt
gacaaagctgtggagatctatgccagcgtggctcagctgagccctttggactctgagaca
ctggagaatgaagcaaataacataaagatggaagctgagaatctggaacatctgattgat
cagaaattaaaagattatgaggacctcaaagaagatatgagagggaaggaacttgaagtc
aagaaccttctggagaaaggcaagactgaacagcagaccgcagaccaactcctagcccga
gctgatgctgccaaggccctcgctgaagaagctgcaaagaagggacgggataccttacaa
gaagctaatgacattctcaacaacctgaaagattttgataggcgcgtgaacgataacaag
acggccgcagaggaggcgctaaggaagattcctgccatcaaccagaccatcattgaagct
aatgaaaagaccagggaagcccagcaggccctgggcaatgctgcagcggatgccacagag
gccaaaaacaaggcccatgaggcggagaggatcgcgagcgctgtccaaaagaatgccacc
agcaccaaggcagaagctgaaagaacttttgcagaagttacagatctggataatgaggtg
aacagtatgttgaagcaactacaggaagcagaaaaagagctaaagagaaaacaagatgac
gctgaccaggacatgatgatggcagggatggcttcacaggctgctcaagaagccgagatc
aatgccagaaaagccaaaaactctgttactagcctcctcagcattattaatgacctcttg
gagcagctggggcagctggatacagtggacctaaataagctaaacgagattgaaggcacc
ctaaacaaagccaaagatgaaatgaaggtcagcgatcttgataggaaagtgtctgacctg
gagaatgaagccaagaagcaggaggctgccatcatggactacaaccgagatatcgaggag
atcatgaaggacattcgcaatctggaggacatcaggaagaccttaccatctggctgcttc
aacaccccgtccattgaaaagccctag

KEGG   Pongo abelii (Sumatran orangutan): 100447159
Entry
100447159         CDS       T01416                                 

Gene name
COMP
Definition
(RefSeq) LOW QUALITY PROTEIN: cartilage oligomeric matrix protein
  KO
K04659  thrombospondin 2/3/4/5
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04145  Phagosome
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05144  Malaria
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100447159 (COMP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100447159 (COMP)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    100447159 (COMP)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100447159 (COMP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100447159 (COMP)
  09174 Infectious disease: parasitic
   05144 Malaria
    100447159 (COMP)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pon04131]
    100447159 (COMP)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100447159 (COMP)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100447159 (COMP)
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100447159 (COMP)
Membrane trafficking [BR:pon04131]
 Endocytosis
  Phagocytosis
   Opsonins
    100447159 (COMP)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100447159 (COMP)
  Exosomal proteins of colorectal cancer cells
   100447159 (COMP)
  Exosomal proteins of bladder cancer cells
   100447159 (COMP)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100447159 (COMP)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   100447159 (COMP)
SSDB
Motif
Pfam: TSP_C TSP_3 EGF_CA COMP cEGF EGF_3 EGF_MSP1_1 EGF hEGF
Other DBs
NCBI-GeneID: 100447159
NCBI-ProteinID: XP_024092426
LinkDB
Position
19
AA seq 757 aa
MAPDTACVLLLTLAALGASGQGQSPLGSDVGPQMLRELQETNAALQDVRELLRQQVREIT
FLKNTVMECDACGMQQSARTGLPSVRPLLHCAPGFCFPGVACIQTESGARCGPCPAGFTG
NGSHCTDVNECNAHXCFPRVRCINTSPGFRCEACPPGYSGPTHEGVGLAFAKANKQVCTD
INECETGQHNCVPNSVCINTRGSFQCGPCQPGFVGDQESGCQRRAQRFCPDGSPSECHEH
ADCVLERDGSRSCVCAVGWAGNGILCGRDTDLDGFPDEKLRCPERQCRKDNCVTVPNSGQ
EDVDRDGIGDACDPDADGDGVPNEKDNCPLVRNPDQRNTDEDKWGDACDNCRSQKNDDQK
DTDQDGRGDACDDDIDGDRIRNQADNCPRVPNSDQKDSDGDGIGDACDNCPQKNNPDQAD
VDHDFVGDACDSDQDQDGDGHQDSRDNCPTVPNSAQEDSDHDGQGDACDDDDDNDGVPDS
RDNCRLVPNPGQEDVDRDGVGDVCQDDFDADKVVDKIDVCPENAEVTLTDFRAFQTVVLD
PEGDAQIDPNWVVLNQGREIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFI
FGYQDSSSFYVVMWKQMEQTYWQANPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDT
ESQVRLLWKDPRNVGWKDKKSYRWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGG
RLGVFCFSQENIIWANLRYRCNDTIPEDYETHQLRRA
NT seq 2274 nt   +upstreamnt  +downstreamnt
atggcccccgacaccgcctgcgttcttctgctcaccctggctgccctcggcgcgtccgga
cagggccagagcccgttgggctcagacgtgggcccgcagatgcttcgggaactgcaggaa
accaacgcggcgctgcaggacgtgcgggagctgctgcggcagcaggtcagggagatcacg
ttcctgaaaaacacggtgatggagtgtgacgcgtgcgggatgcagcagtcagcacgcacc
ggcctacccagcgtgcggcccctgctccactgcgcgcccggcttctgcttccccggcgtg
gcctgcatccagacggagagcggcgcgcgctgcggcccctgccccgcgggcttcacgggc
aacggctcgcactgcaccgacgtcaacgagtgcaacgcccacncctgcttcccccgagtc
cgctgtatcaacaccagcccggggttccgctgcgaggcttgcccgccggggtacagcggc
cccacccacgagggcgtggggctggctttcgccaaggccaacaagcaggtttgcacggac
atcaacgagtgtgagaccgggcaacataactgcgtccccaactccgtgtgcatcaacacc
cggggctccttccagtgcggcccgtgccagcccggcttcgtgggcgaccaggagtccggc
tgccagcggcgcgcacagcgcttctgccccgacggctcgcccagcgagtgccacgagcat
gcagactgcgtcctggagcgcgatggctcgcggtcgtgcgtgtgtgccgtcggctgggcc
ggcaacgggatcctctgtggtcgcgacactgacctggacggcttcccggacgagaagctg
cgctgcccggagcgccagtgccgtaaggacaactgcgtgacggtgcccaactcagggcag
gaggatgtggaccgcgatggcatcggagacgcctgcgatccggatgccgacggggacggg
gtccccaatgaaaaggacaactgcccgctggtgcggaacccagaccagcgcaacacggac
gaggacaagtggggcgatgcgtgcgacaactgccggtcccagaagaacgacgaccaaaag
gacacagaccaggacggccggggcgatgcgtgcgacgacgacatcgacggcgaccggatc
cgcaaccaggccgacaactgccctagggtacccaactcagaccagaaggacagtgatggc
gatggtataggggatgcctgtgacaactgtccccagaagaacaacccggatcaggcggat
gtggaccacgactttgtgggagatgcttgcgacagcgatcaagaccaggatggagacgga
catcaggactctcgggacaactgtcccacggtgcctaacagcgcccaggaggactcagac
cacgacggccagggtgatgcctgcgacgacgacgacgacaatgacggagtccctgacagt
cgggacaactgccgcctggtgcctaaccccggccaggaggacgtggacagggacggcgtg
ggcgacgtgtgccaggacgactttgatgcagacaaggtggtagacaagatcgacgtgtgt
ccggagaacgctgaagtcaccctcaccgacttccgggccttccagacagtcgtgctggac
ccggagggtgacgcgcagattgaccccaactgggtggtgctcaaccagggaagggagatc
gtgcagacaatgaacagcgacccaggcctggctgtgggttacactgccttcaatggcgtg
gacttcgagggcacgttccatgtgaacacggtcacggatgacgactatgcgggcttcatc
tttggctaccaggacagctccagcttctacgtggtcatgtggaagcagatggagcaaacg
tactggcaggcgaaccccttccgtgctgtggccgagcccggcatccaactcaaggctgtg
aagtcttctacaggccccggggaacagctgcggaacgctctgtggcatacaggagacaca
gagtcccaggtgcggctgctgtggaaggacccgcgaaacgtgggttggaaggacaagaag
tcctatcgttggttcctgcagcaccggccccaagtgggctacatcagggtgcgattctat
gagggccctgagctggtggccgacagcaacgtggtcttggacacaaccatgcgtggtggc
cgcctgggggtcttctgcttctcccaggagaacatcatctgggccaacctgcgttaccgc
tgcaatgacaccatcccagaggactatgagacccatcagctgcggcgagcctag

KEGG   Pongo abelii (Sumatran orangutan): 100447813
Entry
100447813         CDS       T01416                                 

Gene name
RELN
Definition
(RefSeq) reelin isoform X1
  KO
K06249  reelin [EC:3.4.21.-]
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05017  Spinocerebellar ataxia
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100447813 (RELN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100447813 (RELN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100447813 (RELN)
 09160 Human Diseases
  09164 Neurodegenerative disease
   05017 Spinocerebellar ataxia
    100447813 (RELN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100447813 (RELN)
Enzymes [BR:pon01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.21  Serine endopeptidases
    3.4.21.-  
     100447813 (RELN)
SSDB
Motif
Pfam: BNR EGF_2 Reeler EGF_Tenascin EB
Other DBs
NCBI-GeneID: 100447813
NCBI-ProteinID: XP_024106125
Ensembl: ENSPPYG00000017885
LinkDB
Position
7
AA seq 3460 aa
MERGGWARQTFLLALLLGATLRARAAAGYYPRFSPFFFLCTHHGELEGDGEQGEVLISLH
IAGNPTYYVPGQEYHVTISTSTFFDGLLVTGLYTSTSVQASQSIGGSSAFGFGIMSDHQF
GNQFMCSVVASHVSHLPTTNLSFIWIAPPAGTGCVNFMATATHRGQVIFKDALAQQLCEQ
GAPTDATVHPHLAEIHSDSIILRDDFDSYHQLKLNPNIWVECNNCETGEQCGAIMHGNAV
TFCEPYGPRELITTGLNTTTASVLQFSIGSGSCRFSYSDPSIIVLYAKNNSADWIQLEKI
RAPSNVSTIIHILYLPEDAKGENVQFQWKQENLRVGEAYEACWALDNILIINSAHRQVVL
EDSLDPVDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSTEDIQEQW
SEEFESQPTGWDVLGAVIGTECGTIESGLSMVFLKDGERKLCTPSMDTTGYGNLRFYFVM
GGICDPGNSHENDIILYAKIEGRKEHIALDTLSYSSYKVPSLVSVVINPELQTPATKFCL
RQKNHEGHNRNVWAVDFFHVLPVLPSTMSHMIQFSINLGCGTHQPGNSVSLEFSTNHGRS
WSLLHTECLPEICAGPHLPHSTVYSSENYSGWNRITIPLPNAALTRNTRIRWRQTGPILG
NMWAIDNVYIGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSSRL
SSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGSKS
VLSTCRAPDQPGEGVLLHYSYDNGITWKLLEHYSYLSYHEPRIISVELPDDARQFGIQFR
WWQPYHSSQGEDVWAIDEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPYCGHDWTL
CFTGDSKLASSMRYVETQSMQIGASYMIQFSLVMGCGQKYTPHMDNQVKLEYSTNHGLTW
HLVQEECLPSMPSCQEFTSASIYHASEFTQWRRVIVLLPQKTWSSATRFRWSQSYYTAQD
EWALDSIYIGQQCPNMCSGHGSCDHGICRCDQGYQGTECHPEAALPSTIMSDFENQNGWE
SDWQEVIGGEIVKPEQGCGVISSGSSLYFSKAGKRQLVSWDLDTSWVDFVQFYIQIGGES
ASCNKPDSREESVLLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPCTRFRWW
QPVFSGEDYDQWAVDDIIILSEKQKQIIPVINPTLPQNFYEKPAFDYPMNQMSVWLMLAN
EGMVKNETFCAATPSAMIFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCANQFSSTAPVL
LQYSHDAGMSWFLVKEGCYPASAGKGCEGNSRELSEPTMYHTGDFEEWTRITIVIPRSLA
SSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCISGVCFCDLGYTAAQGTC
VSNVPNHNEMFDRFEGKLSPLWYKITGAQVGTGCGTLNDGKSLYFNGPGKREARTVPLDT
RNIRLVQFYIQIGSKTSGITCIKPRTRNEGLIVQYSNDNGILWHLLRELDFMSFLEPQII
SIDLPQEAKTPATAFRWWQPQHGKHSAQWALDDVLIGMNDSSQTGFQDKFDGSIDLQANW
YRIQGGQVDIDCLSMDTALIFTENIGKPRYAETWDFHVSASTFLQFEMSMGCSKPFSNSH
SVQLQYSLNNGKDWHLVTEECVPPTIGCLHYTESSIYTSERFQNWKRITVYLPLSTISPR
TRFRWIQANYTVGADSWAIDNVVLASGCPWMCSGRGICDAGRCVCDRGFGGPYCVPVVPL
PSILKDDFNGNLHPDLWPEVYGAERGNLNGETIKSGTSLMFKGEGLRMLISRDLDCTNTM
YVQFSLRFIAKSTPERSHSILLQFSINGGITWHLMDEFYFPQTTNILFINVPLPYTAQTN
ATRFRLWQPYNNGKKEEIWIVDDFIIDGNNLNNPVMLLDTFDFGPREDNWFFYPGGNIGL
YCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLNVNENTIIQFEINVGCSTDSSSADPVK
LEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHHPSSTYYAGTMQGWRREVVHFGKLHLC
GSVRFRWYQGFYPAGSQPVTWAIDNVYIGPQCEEMCNGQGSCINGTKCICDPGYSGPTCK
ISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLMTRDLDL
SHARFVQFFMRLGCGKGIPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIALEI
PLKARSGSTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFTTLDSRKWLLHPG
GTKMPVCGSTGDALVFIEKASTRYVVSTDVAVNEDSFLQIDFAASCSVTDSCYAIELEYS
VDLGLSWHPLVRDCLPTNVECSRYHLQRILVSDTFNKWTRITLPLPPYTRSQATRFRWHQ
PAPFDKQQTWAIDNVYIGDGCVDMCSGHGRCIQGNCVCDEQWGGLYCDDPETSLPTQLKD
NFNRVPSSQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNAEFIQFYFM
YGCLITPNNRNQGVLLEYSVNGGITWNLLMEIFYDQYSKPGFVNILLPPDAKEIATRFRW
WQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPADAGPVGRIAFDMF
MEDKTSVNEHWLFHDDCTVERFCDSPDGVMLCGSHDGREVYAVTHDLTPTEGWIMQFKIS
VGCKVSEKIAQNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGSVSQPSVFFPTKGWKRIT
YPLPESLVGNPVRFRFYQKYSDMQWAIDNFYLGPGCLDNCRGHGDCLREQCICDPGYSGP
NCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTVRQAITQ
DLDLRGAKFLQYWGRIGSENNMTSCHRPICRKEGVLLDYSTDGGITWTLLHEMDYQKYIS
VRHDYILLPEDALTNTTRLRWWQPFVISNGIVVSGVERAQWALDNILIGGAEINPSQLVD
TFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTHNALSSRELIIQPGYMM
QFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQFHEATIY
NSVNSSSWKRITIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPKLCSGHGY
CTTGAICICDESFQGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIGSGCGQLAP
YAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTSQTDSCNSDLSGPHAVDKAVL
LQYSVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGTGHDQWAL
DHVEVVLVSTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP
NT seq 10383 nt   +upstreamnt  +downstreamnt
atggagcgcggtggctgggcccggcagactttcctcctagcgctgttgctgggggcgacg
ctgagggcgcgcgcggcggcgggctattacccccgcttttcgcccttctttttcctgtgc
acccaccacggggagctggaaggggatggggagcagggcgaggtgctcatttccctgcac
attgcgggcaaccccacctactacgttccgggacaagaataccatgtgacaatttcaaca
agcaccttctttgatggcttgctggtgacgggactatacacatctacgagtgttcaggca
tcacagagcattggaggttccagtgcttttggatttgggatcatgtctgaccaccagttt
ggtaaccagtttatgtgcagtgtggtagcctctcacgtgagtcacctgcccacaaccaac
ctcagtttcatctggattgctccacctgcgggcacaggctgtgtgaatttcatggctaca
gcaacacaccggggccaggttattttcaaagatgctttagcccagcagttgtgtgaacaa
ggagctccaacagatgccactgtgcacccacatctagccgaaatacatagtgacagcatt
atcctgcgagatgactttgactcctatcaccaactgaaattaaatccaaatatatgggtt
gaatgtaacaactgtgagactggagaacagtgtggcgcgattatgcatggcaatgccgtc
accttctgtgaaccatatggcccacgagaactgattaccacaggccttaatacaacaaca
gcttctgtcctccaattttccattgggtcaggttcatgtcgctttagttattcagacccc
agcatcatcgtgttatatgccaagaataactctgcggactggattcagctagagaaaatt
agagccccttccaatgtcagcacaatcatccatatcctctaccttcctgaggacgccaaa
ggggagaatgtccagtttcagtggaagcaggaaaatcttcgtgtaggtgaagcgtatgaa
gcctgctgggccttagataacatcttgatcatcaattcagctcacagacaagtcgtttta
gaagatagtctcgacccagtggacacaggcaactggcttttcttcccaggagctacagtt
aagcatagctgtcagtcagatgggaattccatttatttccatggaaatgaaggcagcgag
ttcaattttgccaccacccgggatgtagatctttccacagaagatattcaagagcaatgg
tcagaagaatttgagagccagcctacaggatgggatgtcttgggagctgtcattggtacg
gaatgtggaacaatagaatcaggcttatcaatggtcttcctcaaagatggagagaggaaa
ttatgcactccatccatggacactaccggttatgggaacctgaggttttactttgtgatg
ggaggaatttgtgaccctggaaattctcatgaaaatgacataatcctgtatgcaaaaatt
gaaggaagaaaagagcacatagcactggataccctttcctattcctcatataaggttccg
tctttggtttctgtggtcatcaatcctgaacttcagactcctgctaccaaattttgtctc
aggcaaaagaaccatgaaggacataataggaatgtctgggctgtagactttttccatgtc
ttgcctgttctaccttctacaatgtctcacatgatacagttttccatcaatctgggatgt
ggaacacatcagcctggtaacagtgtcagcttagagttttctaccaaccatgggcgttcc
tggtccctccttcacactgaatgcttacctgagatctgtgctggaccccacctcccccac
agcactgtctactcctctgaaaactacagtgggtggaaccgaataacaattccgcttcct
aacgcagcactaacccggaacaccaggattcgctggagacaaacaggaccaatccttgga
aacatgtgggcaattgataatgtttatattggcccgtcatgtctcaaattctgttctggc
agaggacagtgcactagacatggttgcaagtgtgaccctggattttctggcccagcttgt
gagatggcatcccagacattcccaatgtttatttctgaaagctttggcagttccaggctc
tcctcttaccataacttttactctatccgtggtgctgaagtcagctttggttgtggtgtc
ttggccagcggtaaggccctggttttcaacaaagatgggcggcgtcagctaattacatcc
ttccttgacagctcccaatccaggtttctccagttcacgctgagactggggagcaaatct
gttctgagcacatgcagagcccctgaccagcctggtgaaggagttttgttgcattattct
tatgataatgggataacttggaaactcctggagcattattcatatctcagctatcatgag
cccagaataatctccgtagaactaccagatgatgcaaggcagtttggaattcagttcaga
tggtggcaaccatatcattcttcccagggagaagatgtatgggctattgatgagattatc
atgacgtctgtacttttcaacagcattagtcttgactttaccaatcttgtggaggttact
cagtctctgggattctaccttggaaatgttcagccatactgtggccacgactggaccctt
tgttttacaggagattctaaacttgcctcaagtatgcgctatgtggaaacacaatcaatg
cagataggagcatcctatatgattcagttcagtttggtgatgggatgtggccagaaatac
accccacacatggacaaccaggtgaagctggagtactcgaccaaccatggccttacctgg
cacctcgtccaagaagaatgccttccaagtatgccaagttgtcaggaatttacatcagca
agtatttaccatgccagtgagtttacacagtggagaagagtcatagtgcttcttccccag
aaaacttggtccagtgctacccgcttccgttggagccagagctattacactgctcaagac
gaatgggctttggacagcatttacattgggcagcagtgccccaacatgtgcagtgggcat
ggctcatgcgatcatggcatatgcaggtgtgaccaggggtaccaaggcactgaatgccac
ccggaagctgcccttccgtccacaattatgtcagattttgagaaccagaatggctgggag
tctgactggcaagaagttattgggggagaaattgtaaaaccagaacaagggtgtggtgtc
atctcttctggatcatctctgtacttcagcaaggctgggaaaagacaactggtgagttgg
gacctggatacttcttgggtggattttgtccagttctacatccagataggcggagagagt
gcttcatgcaacaagcctgacagcagagaggagagcgtcctccttcagtacagcaacaat
gggggcatccagtggcacctgctagcagagatgtacttttcagacttcagcaaacccaga
tttgtctatctggagcttccagctgctgccaagaccccttgcaccaggttccgctggtgg
cagcccgtgttctcaggggaggactatgaccagtgggcagtcgatgacatcatcattctg
tccgagaagcagaagcagatcatcccagttatcaatccaactttacctcagaacttttat
gagaagccagcttttgattaccctatgaatcagatgagtgtgtggttgatgttggctaat
gaaggaatggttaaaaatgaaaccttctgtgctgccacaccatcagcaatgatatttgga
aaatcagatggagatcgatttgcagtaactcgagatttgaccctgaaacctggatatgtg
ctacagttcaagctaaacataggttgtgccaatcaattcagcagtactgctccagttctt
cttcagtactctcatgatgctggaatgtcctggtttctggtgaaagaaggctgttacccg
gcttctgcaggcaaaggatgtgaaggaaactccagagaactaagtgagcccaccatgtat
cacacaggggactttgaagaatggacaagaatcaccattgttattccaaggtctcttgca
tccagcaagaccagattccgatggatccaggagagcagctcacagaaaaacgtgcctcca
tttggtttagatggagtgtacatatccgagccttgtcccagttactgcagtggccatggg
gactgcatttcaggagtgtgtttctgtgacctggggtatactgctgcacaaggaacctgt
gtgtcaaatgtccccaatcacaatgagatgttcgataggtttgaggggaagctcagccct
ctgtggtacaagataacaggcgcccaggttggaactggctgtggaacacttaatgatggc
aaatctctctacttcaatggccctgggaaaagggaagccaggacggtccctctggacacc
aggaatatcagacttgttcaattttatatacaaattggaagcaaaacttcaggcattacc
tgcatcaaaccaagaactagaaatgaagggcttattgttcagtattcaaatgacaatggg
atactctggcatttgcttcgagagttggacttcatgtccttcctggaaccacagatcatt
tccattgacctgccacaggaggcgaagacacctgcaacagcatttcgatggtggcaaccg
caacatgggaagcattcagcccagtgggctttggatgatgttcttataggaatgaatgac
agctcccaaactggatttcaagacaaatttgatggctctatagatttgcaagccaactgg
tatcgaatccaaggaggtcaagttgatattgactgtctctctatggatactgctctgata
ttcactgaaaacataggaaaacctcgttatgctgagacctgggattttcatgtgtcagca
tctacctttttgcaatttgaaatgagcatgggctgtagcaagcccttcagcaactcccac
agtgtacagctccagtattctctgaacaatggcaaggactggcatcttgtcactgaagaa
tgtgttcctccaaccattggctgtctgcactacacggaaagttcaatttacacctcggaa
agattccagaattggaagcggatcactgtctaccttccactctccaccatttctcccagg
acccggttcagatggattcaggccaactacactgtgggggctgattcctgggcgattgat
aatgttgtactggcctcagggtgcccttggatgtgctcaggacgagggatttgtgatgct
ggacgctgtgtgtgtgaccggggctttggtggaccctattgtgttcctgttgttcctctg
ccctcgattcttaaagatgatttcaatgggaacttacatcctgacctttggcctgaagtg
tatggtgcagagagggggaatctgaatggtgaaaccatcaaatctggaacatctctaatg
tttaaaggggaaggactaaggatgcttatttcaagagatctagattgtacaaatacaatg
tatgtccagttttcacttagatttatagcaaaaagtaccccagagagatctcactctatt
ctattacaattctccatcaatggaggaatcacttggcacctgatggatgaattttacttt
cctcaaacaacgaatatacttttcatcaatgttcccttaccatacactgcccaaaccaat
gctacaagattcagactctggcaaccttataataacggtaagaaagaagaaatctggatt
gttgatgacttcattatcgatggaaataatttaaacaaccctgtgatgctcttggataca
tttgactttgggcccagagaagacaattggtttttctatcctggtggtaacattggtctt
tattgtccatattcttcaaagggggcacctgaagaagattcagctatggtgtttgtttca
aatgaagttggtgagcattccattaccactcgtgacctaaatgtgaatgagaacaccatc
atacaatttgagatcaacgttggctgttcgactgatagctcatccgcggatccagtgaaa
ctggaattttcaagggacttcggggcgacctggcaccttctgctgcccctctgctaccac
agcagcagccacgtcagctctttatgctccaccgagcaccaccccagcagcacctactac
gcgggaaccatgcagggctggaggagggaggtcgtgcacttcgggaagctgcacctttgt
ggatctgtccgtttcagatggtaccagggattttaccctgccggctctcagccagtgaca
tgggccattgataatgtctacatcggtccccagtgtgaggagatgtgtaatggacagggg
agctgtatcaatggaaccaaatgtatatgtgaccctggctactcaggtccaacctgtaaa
ataagcaccaaaaatcctgattttctcaaagatgatttcgaaggtcagctagaatctgat
agattcttattaatgagtggtgggaaaccatctcgaaagtgtggaatcctttctagtgga
aacaacctctttttcaatgaagatggcttgcgcatgttgatgacacgagacctggattta
tcacatgctagatttgtgcagttcttcatgagactgggatgtggtaaaggcattcctgac
cccaggagccaacccgtactcctacagtattctctcaacggcggcctctcgtggagtctt
cttcaggagttccttttcagcaattccagcaatgtgggcaggtacattgccctggagata
cccttgaaagcccgttctggttctactcgccttcgctggtggcaaccgtctgagaatggg
cacttctacagcccctgggttatcgatcagattcttattggaggaaatatttctggtaat
acggtcttggaagatgatttcacaacccttgatagtaggaaatggctgcttcaccctgga
ggcaccaagatgcccgtgtgtggctctactggtgatgccctggtcttcattgaaaaggcc
agcacccgttacgtggtcagcacagacgttgccgtgaatgaggattcctttctacagata
gacttcgctgcctcctgctcggtcacagactcttgttatgcgattgaattggaatactca
gtagatcttggattgtcatggcacccattggtaagggactgtctgccgaccaacgtggaa
tgcagtcgctatcacctgcagcggatcctggtgtcagacactttcaacaagtggaccaga
atcactctgcctctccctccttatacccggtcccaagccactcgtttccgttggcatcaa
ccagctccttttgacaagcagcagacatgggcaatagataatgtctatatcggggatggc
tgcgtagacatgtgcagtggccatgggagatgcatccagggaaactgcgtctgtgatgaa
cagtggggtggcctgtactgtgatgaccccgagacctctcttccaacccaactcaaagac
aacttcaatcgagttccatccagtcagaactggctgactgtgaacggagggaaattgagt
acagtgtgtggagccgtggcgtcaggaatggctctccatttcagtgggggttgtagtcga
ttattagtcactgtggatctaaacctcactaatgctgagttcatccaattttacttcatg
tatgggtgcctgattacaccaaacaaccgtaaccaaggtgttctcttggaatattctgtc
aatggaggcattacctggaacctgctcatggagattttctatgaccagtacagtaagccc
ggatttgtgaatatccttctccctcctgatgctaaagagattgccactcgcttccgctgg
tggcagccaagacatgacggcctggatcagaacgactgggccattgacaatgtcctcatc
tcaggctctgctgaccaaaggaccgttatgctggacaccttcagcagcgccccagtaccc
cagcacgagcgctcccctgcagatgccggccctgtcgggaggattgcctttgacatgttt
atggaagacaaaacttcagtgaatgagcactggctattccatgatgattgtacagtagaa
agattctgtgactcccctgacggtgtgatgctctgtggcagtcatgatggacgggaggtg
tatgcagtgacccacgacctgactcccactgaaggctggattatgcaattcaagatctcg
gttggatgtaaggtgtctgaaaaaattgcccagaatcaaattcatgtgcagtattctact
gacttcggtgtgagctggaattatctggtccctcagtgcttgcctgccgacccaaaatgc
tctggaagtgtttctcagccatctgtattctttccaactaaagggtggaaaaggatcacc
tacccacttcctgaaagcttagtgggaaatccggtaaggtttaggttctatcagaagtac
tcagacatgcagtgggcaattgataatttctacctgggacctggatgcttggacaactgc
aggggccatggagattgcttaagggaacagtgcatctgtgatccgggatactcagggcca
aactgctacttgacccacactctgaagactttcctgaaggaacgctttgacagtgaagaa
atcaaacctgacttatggatgtccttagaaggtggaagtacttgcactgagtgtggaatt
cttgccgaggacactgcactctattttgggggatccactgtgagacaagcgattacacaa
gatttggatcttcgaggtgcaaagttcctgcaatactgggggcgcatcggtagtgagaac
aacatgacctcttgccatcgtcccatctgccggaaggaaggcgtgctgttggactactct
accgatggaggaattacctggactttgctccatgagatggattaccagaaatacatttct
gttagacacgactacatacttcttcctgaagatgccctcaccaacacaactcgacttcgc
tggtggcagccttttgtgatcagcaatggaattgtggtctctggggtggagcgtgctcag
tgggcactggacaacattttgattggtggagcagaaatcaatcccagccaattggtggac
acttttgatgatgaaggcacttcccatgaagaaaactggagtttttaccctaatgctgta
aggacagcaggattttgtggcaatccatccttccacctctattggccaaataaaaagaag
gacaagactcacaatgctctctcctcccgagaactcattatacagccaggatacatgatg
cagtttaaaattgtggtgggttgtgaagccacttcttgtggtgaccttcattccgtaatg
ttggaatacactaaggatgcaagatcggattcctggcagctcgtacagacccagtgcctt
ccttcttcttctaacagcattggctgctcccctttccagttccatgaagccaccatctac
aactctgtcaacagctcaagctggaaaagaatcaccatccagctgcctgaccatgtctcc
tctagtgcgacacagttccgctggatccagaagggagaagaaactgagaagcaaagctgg
gcaattgaccacgtgtacattggagaggcttgccccaagctctgcagcgggcatggatac
tgcacgaccggtgccatctgcatctgtgacgagagcttccaaggtgatgactgctctgtt
ttcagtcacgaccttcccagttatattaaagataattttgagtctgcaagagtcaccgag
gcaaactgggagaccattcaaggtggagtcataggaagtggctgtgggcagctggccccc
tacgcccatggagactcgctatactttaatggctgtcagatcaggcaagcagctaccaag
cctctggatctcactcgagcaagcaaaatcatgtttgttttgcaaattgggagcacgtcg
cagacggacagctgcaacagtgacctgagtggcccccacgctgtggacaaggcggtgctg
ttgcagtacagtgtcaacaacgggatcacctggcatgtcatcgcccagcaccagccaaag
gacttcacacaagctcagagagtgtcttacaatgtccccctggaggcacggatgaaagga
gtcttactgcgctggtggcaaccacgccacaatggaacaggtcacgatcaatgggctttg
gaccatgtggaggtcgtcctagtaagcactcgcaaacaaaattacatgatgaatttttca
cgacaacatgggctcaggcacttctacaacagaagacgaaggtcacttaggcgataccca
tga

KEGG   Pongo abelii (Sumatran orangutan): 100449704
Entry
100449704         CDS       T01416                                 

Gene name
FN1
Definition
(RefSeq) fibronectin isoform X1
  KO
K05717  fibronectin 1
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04810  Regulation of actin cytoskeleton
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon05100  Bacterial invasion of epithelial cells
pon05135  Yersinia infection
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05205  Proteoglycans in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100449704 (FN1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100449704 (FN1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100449704 (FN1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    100449704 (FN1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100449704 (FN1)
   05205 Proteoglycans in cancer
    100449704 (FN1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100449704 (FN1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100449704 (FN1)
  09171 Infectious disease: bacterial
   05135 Yersinia infection
    100449704 (FN1)
   05100 Bacterial invasion of epithelial cells
    100449704 (FN1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100449704 (FN1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100449704 (FN1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pon04131]
    100449704 (FN1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100449704 (FN1)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100449704 (FN1)
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100449704 (FN1)
Membrane trafficking [BR:pon04131]
 Endoplasmic reticulum (ER) - Golgi transport
  Forward pathways
   ER-Golgi intermediate compartment (ERGIC) proteins
    100449704 (FN1)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of bladder cancer cells
   100449704 (FN1)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100449704 (FN1)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Fibronectin (FN) domain-containing proteins
  Fibronectin type I domain-containing proteins
   100449704 (FN1)
SSDB
Motif
Pfam: fn3 fn1 Pur_ac_phosph_N fn2 DUF2369
Other DBs
NCBI-GeneID: 100449704
NCBI-ProteinID: XP_009236347
Ensembl: ENSPPYG00000013148
LinkDB
Position
2B
AA seq 2477 aa
MLRGPGPGLLLLAVLCLGTAVPSTAASKSKRQAQQMVQPQSPVAVSQSKPGCYDNGKHYQ
INQQWERTYLGNALVCTCYGGSRGFNCESKPEAEETCFDKYTGNTYRVGDTYERPKDSMI
WDCTCIGAGRGRISCTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCK
PIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTCLGEGSGRITCTSRNRCNDQDTRTSY
RIGDTWSKKDNRGNLLQCICTGNGRGEWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHP
QPPPYGHCVTDSGVVYSVGMQWLKTQGNKQMLCMCLGNGVSCQETAVTQTYGGNSNGEPC
VLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTRGGNSNGALC
HFPFLYNNHNYTDCTSEGRRDNMKWCGTTQNYDADQKFGFCPMAAHEEICTTNEGVMYRI
GDQWDKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVNDTFHKRHEEGHM
LNCTCFGQGRGRWKCDPVDQCQDSETGTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQ
PLQTYPSSSGPVEVFITETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIP
GHLNSYTIKGLKPGVVYEGQLISIQQYGHREVTRFDFTTTSTSTPVTSNTVTGETTPFSP
VVATSESVTEITASSFVVSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTATSVNIPDL
LPGRKYIVNVYQISEDGEQSLILSTSQTTAPDAPPDPTVDQVDDTSIVVRWSRPQAPITG
YRIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVVIQQETTG
TPRSDTVPSPRDLQFVEVTDVKVTIMWTPPESAVTGYRVDVIPVNLPGEHGQRLPISRNT
FAEVTGLSPGVTYYFKVFAVNHGRESKPLTAQQTTKLDAPTNLQFVNETDSTVLVRWTPP
RAQITGYRLTVGLTRRGQPRQYNVGPSVSKYPLRNLQPGSEYTVSLVAIKGNQESPKATG
IFTTLQPGSSIPPYNTEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIVV
SGLTPGVEYVYTIQVLRDGQERDAPIVNKVVTPLSPPTNLHLEANPDTGVLTVSWERSTT
PDITGYRITTTPTNGQQGYSLEEVVHADQSSCTFDNLSPGLEYNVSVYTVKDDKESVPIS
DTIIPEVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSV
GYYTVTGLEPGIDYDISVITLINGGESAPTTLTQQTAVPPPTDLRFTNIGPDTMRVTWAP
PPSIDLTNFLVRYSPVKNEEDVAELSISPSDNAVVLTNLLPGTEYVVSVSSVYEQHESTP
LRGRQKTGLDSPTGIDFSDITANSFTVHWIAPRATITGYRIRHHPEHLSGRPREDRVPPS
RNSITLTNLTPGTEYVVSIVALNGREESPLLIGQQSTVSDVPRDLEVVAATPTSLLISWD
APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGVDYTITVYAVTGRGDSPA
SSKPISINYRTEIDKPSQMQVTDVQDNSISVKWLPSSSPVTGYRVTTTPKNGPGPTKTKT
AGPDQTEMTIEGLQPTVEYVVSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDVDVDSIK
IAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTAELQGLRPGSEYTVSVVALHDDM
ESQPLIGTQSTAIPAPTDLKFTQVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEIN
LAPDSSSVVVSGLMVATKYEVSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETT
ITISWRTKTETITGFQVDAVPSNGQTPIQRTIKPDVRSYTITGLQPGTDYKIYLYTLNDN
ARSSPVVIDASTAIDAPSNLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSSPREVVP
RPRPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELPQLVTLPHPNLHGPE
ILDVPSTVQKTPFVTHPGYDTGNGIQLPGTSGQQPSVGQQMIFEEHGFRRTTPPTTATPI
RHRPRPYPPNVGEEIQIGHIPREDVDYHLYPHGPGLNPNASTGQEALSQTTISWAPFQDT
SEYIISCHPVGTDEEPLQFRVPGTSTSATLTGLTRGATYNIIVEALKDQQRHKVREEVVT
VGNSVNEGLNQPTDDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCDSSR
WCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDPHEATCYDDGKTYHVGEQWQ
KEYLGAICSCTCFGGQRGWRCDNCRRPGGEPSPEGTTGQSYNQYSQRYHQRTNTNVNCPI
ECFMPLDVQADREDSRE
NT seq 7434 nt   +upstreamnt  +downstreamnt
atgctcaggggtccggggcccgggctgctgctgctggccgtcctgtgcctggggacagcg
gtgccctccactgcagcctcgaagagcaagaggcaggctcagcaaatggttcagccccag
tccccggtggctgtcagtcaaagcaagcccggttgttatgacaatggaaaacactatcag
ataaatcaacagtgggagcggacctacctgggcaatgccttggtttgtacttgttatgga
ggaagccgaggttttaactgcgagagtaaacctgaagctgaagagacttgctttgacaag
tacactgggaacacttaccgagtgggtgacacttatgagcgtcctaaagactccatgatc
tgggactgtacctgcatcggggctgggcgagggagaataagctgtaccattgcaaaccgc
tgccatgaagggggtcagtcctacaagattggggacacctggaggagaccacatgagact
ggtggttacatgttagagtgtgtgtgtcttggtaatggaaaaggagaatggacctgcaag
cccatagctgagaagtgttttgatcatgctgctgggacttcctatgtggttggagaaacc
tgggaaaagccctaccaaggctggatgatggtggattgtacttgtttgggagaaggcagc
ggacgcatcacttgcacctctagaaatagatgcaacgatcaggacacaaggacatcctat
agaattggagacacctggagcaagaaggataatcgaggaaacctgctccagtgcatctgc
acaggcaacggccgaggagagtggaagtgtgaaaggcacacctctgtgcagaccacatca
agcggatctggccccttcactgatgttcgtgcagctgtttaccaaccgcagcctcacccc
cagcctcctccctatggtcactgtgtcacagacagtggtgtggtctactctgtggggatg
cagtggctgaagacacaaggaaataagcaaatgctttgcatgtgcctgggcaacggagtc
agctgccaagagacagctgtaacccagacttatggtggcaactcaaatggagagccatgt
gtcttaccattcacctacaatggcaggacgttctactcctgcaccacagaagggcgacag
gatggacatctttggtgcagcacaacttcgaattatgagcaggaccagaaatattctttc
tgcacagaccacactgttttggttcagactcgaggaggaaattccaatggtgccttgtgt
cacttccccttcctatacaacaaccacaattacactgattgcacttctgagggcagaaga
gacaacatgaagtggtgtgggaccacacagaactatgatgccgaccagaagtttgggttc
tgccccatggctgcccatgaggaaatctgcacaaccaatgaaggggtcatgtaccgcatt
ggagatcagtgggataagcagcatgacatgggtcacatgatgaggtgcacgtgtgttggg
aacggtcgtggggaatggacatgcattgcctactcgcagcttcgagatcagtgcattgtt
gatgacatcacttacaatgtgaatgacacattccacaagcgtcatgaagaggggcacatg
ctgaactgtacatgcttcggtcagggtcggggaagatggaagtgtgatcccgtcgaccaa
tgccaggattcagagactgggacgttttatcaaattggagattcatgggagaagtatgtg
catggtgtcagataccagtgctactgctatggccgtggcattggggagtggcattgccaa
cctttacagacctatccaagctcaagtggtcctgtcgaagtatttatcactgaaactccg
agtcagcccaactcccaccccatccagtggaatgcaccacagccatctcacatttccaag
tacattctcaggtggagacctaaaaattctgtaggccgttggaaggaagctaccatacca
ggccacttaaactcctacaccatcaaaggcctgaagcctggtgtggtatacgagggccag
ctcatcagcatccagcagtacggccaccgagaagtgactcgctttgacttcaccaccacc
agcaccagcacacctgtgaccagcaacaccgtgacaggagagacaactcccttttctccc
gttgtggccacttctgaatctgtgaccgaaatcacagccagcagctttgtggtctcctgg
gtctcagcttctgacaccgtgtcgggattccgggtggaatatgagctgagtgaggaggga
gatgagccacagtacctggatcttccaagcacagccacttccgtgaacatccctgacctg
cttcctggccgaaaatacattgtaaatgtctatcagatatctgaggatggggagcagagt
ttgatcctgtctacttcacaaacaacagcgcccgatgcccctcctgacccgactgtggac
caagttgatgacacctcaattgttgttcgctggagcagaccccaggctcccatcacaggg
tacagaatagtctattcaccatcagtagaaggtagcagcacagaactcaaccttcctgaa
actgcaaactccgtcaccctcagtgatttgcaacctggtgttcagtataacatcactatc
tatgctgtggaagaaaatcaagaaagtacacctgttgtcattcaacaagaaaccactggc
accccacgctcagatacagtgccctctcccagggacctgcagtttgtggaagtgacagat
gtgaaggtcaccatcatgtggacaccgcctgagagtgcagtgaccggctaccgcgtggat
gtcatccctgtcaacctgcctggcgagcacgggcagaggctgcccatcagcaggaacacc
tttgcagaagtcaccgggctgtcccctggggtcacctattacttcaaagtctttgcagtg
aaccacgggagggagagcaagcctctgactgctcaacagacaaccaaactggatgctccc
actaacctccagtttgtcaatgaaactgattctactgtcctggtgagatggactccacct
cgggcccagataacaggataccgactgaccgtgggcctgacccgaagaggccagcccagg
cagtacaatgtgggtccctctgtctccaagtacccactgaggaatctgcagcctggatct
gagtacaccgtatccctcgtggccataaagggcaaccaagagagccccaaagccactgga
atctttaccacactgcagcctgggagctctattccaccttacaacaccgaggtgactgag
accaccattgtgatcacatggacgcctgctccaagaattggttttaagctgggtgtacga
ccaagccagggaggagaggcaccaagagaagtgacttcagactcagggagcatcgttgtg
tctggcttgactccaggagtggaatacgtctacaccatccaagtcctgagagacggacag
gaaagagatgcgccaattgtaaacaaagtggtaacaccattgtctccaccaacaaacttg
catctggaggcaaaccctgacactggagtgctcacagtctcctgggagaggagcaccacc
ccagacattactggttatagaattaccacaacccctacaaacggccagcagggatattct
ttggaagaagtggtccatgctgatcagagctcctgcacttttgataacctgagtcccggc
ctggagtacaatgtcagtgtttacactgtcaaggatgacaaggaaagtgtccctatctct
gataccatcatcccagaggtgccccaactcactgacctaagctttgttgatataaccgat
tcaagcatcggcctgaggtggaccccgctaaactcttccaccattattgggtaccgcatc
acagtagttgcggcaggagaaggtatccctatttttgaagattttgtggactcctcagta
ggatactacacagtcacagggctggagccgggcattgactatgatatcagcgttatcact
ctcattaatggcggcgagagtgcccctactacactgacacaacaaacggctgttcctcct
cccactgacctgcgattcaccaacattggtccagacaccatgcgtgtcacctgggctcca
cccccatccattgatttgaccaacttcctggtgcgttactcacctgtgaaaaatgaggaa
gatgttgcagagttgtcaatttctccttcagacaatgcagtggtcttaacaaatctcctg
cctggtacagaatatgtagtgagtgtctccagtgtctatgaacaacacgagagcacacct
cttagaggaagacagaaaacaggtcttgattccccaactggtattgacttttctgatatt
accgccaactcttttactgtgcactggattgctcctcgagccaccatcactggttacagg
atccgccatcatcccgagcaccttagtgggagacctcgagaagatcgggtgcccccctct
cggaattccatcaccctcaccaacctcactccgggcacagagtatgtggtcagcatcgtt
gctcttaatggcagagaggaaagtcccttattgattggccaacaatcaacagtttctgat
gttccaagggacctggaagttgttgctgcgacccccaccagcctactgatcagctgggat
gctcctgctgtcacagtgagatattacaggatcacttatggagaaacaggaggaaatagc
cctgtccaggagttcactgtgcctgggagcaagtctacagctaccatcagtggccttaaa
cctggagttgattataccatcactgtgtatgctgtcactggccgtggagacagccccgcg
agcagcaagccaatttccattaattaccgaacagaaattgacaaaccatcccagatgcaa
gtgaccgatgttcaggacaacagcattagtgtcaagtggctgccttcaagttcccctgtt
actggttacagagtaaccaccactcccaaaaatggaccaggaccaacaaaaactaaaact
gcaggtccagatcaaacagaaatgactattgaaggcttgcagcccacagtggagtatgtg
gttagtgtctatgctcagaatccaagcggagagagtcagcctctggttcagactgcagta
accaacattgatcgccctaaaggactggcattcactgatgtggatgtcgattccatcaaa
attgcttgggaaagcccacaggggcaagtttccaggtacagggtgacctactcgagccct
gaggatggaatccatgagctattccctgcacctgatggtgaagaagacactgcagagctg
caaggcctcagaccgggttctgagtacacagtcagtgtggttgccttgcacgatgatatg
gagagccagcccctgattggaacccagtccacagctattcctgcaccaactgacctgaag
ttcactcaagtcacacccacgagcctgagcgcccagtggacaccacccaatgttcagctc
actggatatagagtgcgggtgacccccaaggagaagaccggaccaatgaaagaaatcaac
cttgctcctgacagctcatccgtggttgtatcaggacttatggtggccaccaaatatgaa
gtgagtgtctatgctcttaaggacactttgacaagcagaccagctcagggagttgtcacc
actctggagaatgtcagcccaccaagaagggctcgtgtgacagatgctactgagaccacc
atcaccattagctggagaaccaagaccgagacgatcactggcttccaagttgatgctgtt
ccatccaatggccagactcctatccagagaaccatcaagccagatgtcagaagctacacc
atcacaggtttacaaccaggcactgactacaagatctacctgtacaccttgaacgacaat
gcccggagctcccctgtggtcatcgacgcctccactgccattgatgcgccatccaacctg
cgtttcctggccaccacacccaattccttgctggtatcatggcagccgccacgtgccagg
attaccggctacatcatcaagtatgagaagcctgggtcttctcccagagaagtggtccct
cggccccgccctggtgtcacagaggctactattactggcctggaaccgggaaccgaatat
acaatttatgtcattgccctgaagaataatcagaagagcgagcccctgattggaaggaaa
aagacagacgagcttccccaactggtaacccttccacaccccaatcttcatggaccagag
atcttggatgttccttccacagttcaaaagacccctttcgtcacccaccctgggtatgac
actggaaatggtattcagcttcctggcacttctggtcagcaacccagtgttgggcaacaa
atgatctttgaggaacatggttttaggcggaccacaccgcccacaacggccacccccata
aggcataggccaagaccatacccgccgaatgtaggtgaggagatccaaattggtcacatc
cccagggaagatgtagactatcacctctacccacatggtccgggactcaatccaaatgcc
tctacaggacaagaagctctctctcagacaaccatctcatgggccccattccaggacact
tctgagtacatcatttcatgtcatcctgttggcaccgatgaagaacccttacagttcagg
gttcctggaacttctaccagtgccactctgacaggcctcaccagaggtgccacctacaac
atcatagtggaggcactgaaagaccagcagaggcataaggttcgggaagaggttgttacc
gtgggcaactctgtcaacgaaggcttgaaccaacctacggatgactcgtgctttgacccc
tacacagtttcccattatgccgttggagatgagtgggaacgaatgtctgaatcaggcttt
aaactgttgtgccagtgcttaggctttggaagtggtcatttcagatgtgattcatctaga
tggtgccatgacaatggtgtgaactacaagattggagagaagtgggaccgtcagggagaa
aatggccagatgatgagctgcacgtgtcttgggaacggaaaaggagaattcaagtgtgac
cctcatgaggcaacgtgttatgatgatgggaagacgtaccacgtaggagaacagtggcag
aaggaatatctcggtgccatttgctcctgcacatgctttggaggccagcggggctggcgc
tgtgacaactgccgcagacctggcggtgaacccagtcccgaaggcactactggccagtcc
tacaaccagtattctcagagataccatcagagaacaaacactaatgttaattgcccaatt
gaatgcttcatgcctttagatgtacaggctgacagagaagattcccgagagtaa

KEGG   Pongo abelii (Sumatran orangutan): 100451868
Entry
100451868         CDS       T01416                                 

Gene name
TNC
Definition
(RefSeq) tenascin isoform X4
  KO
K06252  tenascin
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05165  Human papillomavirus infection
pon05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100451868 (TNC)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100451868 (TNC)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100451868 (TNC)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    100451868 (TNC)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100451868 (TNC)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100451868 (TNC)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100451868 (TNC)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_Tenascin EGF_2 EGF Pur_ac_phosph_N DUF4998 DUF2369 hEGF fn3_2 Polyketide_cyc2
Other DBs
NCBI-GeneID: 100451868
NCBI-ProteinID: XP_002820188
Ensembl: ENSPPYG00000019537
UniProt: H2PT66
LinkDB
Position
9
AA seq 2201 aa
MRAMTQLLAGVFLAFLALTTEGGVLRKVIRHKRQSGVNATLPEENQPVVFNHVYNIKLPV
GSQCSVDLESVSGEKDLAPPSEPSESFQEHTVDGENQIVFTHRINIPRRACGCAAAPDVK
ELLSRLEELENLVSSLREQCTAGAGCCLQPATGRLDTRPFCSGRGNFSTEGCGCVCEPGW
KGPNCSEPECPGNCHLRGQCIDGQCICDEGFTGEDCSQLACPSDCNDQGKCVNGVCICFE
GYAGADCNREICPVPCSEEHGACVDGLCVCHDGFAGDDCNKPLCLNNCYNRGRCVENECV
CDEGFTGEDCSELICPNDCFDRGRCINGTCYCEEGFTGEDCGKLTCPHACHSQGRCEEGQ
CVCDEGFAGVDCSEKRCPADCHNRGRCVDGQCECDDGFTGADCGELKCPNGCSGHGRCVN
GQCVCDEGYTGEDCSQLRCPNDCHSRGRCVEGKCVCEQGFKGYDCSDMSCPNDCHQHGRC
VNGMCVCDDGYTGEDCRDRQCPRDCSNRGRCVDGQCVCEDGFAGPDCAELSCPNDCHGRG
RCVNGQCMCHEGFMGKDCKEQRCPSDCHGQGRCVDGQCICHEGFTGLDCGQRSCPSDCNN
LGQCVSGRCICNEGYTGEDCLEVSPPKDLVVTEVTEETVNLAWDNEMRVTEYLVVYTPTH
EGGLEMQFRVPGDQTSTIIRELEPGVEYFIRVFAILENKKSIPVSARVATYLPAPEGLKF
KSIKETTVEVEWDPLDIAFETWEIIFRNMNKEDEGEITKSLRRPETSYRQTGLAPGQEYE
ISLHIVKNNTRGPGLKRVTTTRLDAPSQIEVKDVTDTTALITWFKPLAEIDGIELTYGIK
DVPGDRTTIDLTEDENQYSIGNLKPDTEYEVSLISRRGDMSSNPAKETFTTGLDAPRNLR
RVSQTDNSITLEWRNGKAAIDSYRIKYAPISGGDHAEVDVPKSQQATTKTTLTGLRPGTE
YGIGVSAVKEDKESNPATINAATELDMPKDLQVSETAETSLTLLWKTPLAKFDRYRLNYS
LPTGQWVGVQLPRNTTSYVLRDLEPGQEYNVLLTAEKGRHKSKPARVKASTERAPELENL
LVTEVGWDGLRLNWTAADQAYEHFIIQVQEANKVEAAQNLTVPGSLRAVDIPGLKAATPY
TVSIYGVIQGYRTPVLSAEASTGETPNLGEVVVAKVGWDALKLNWTAPEGAYEYFFIQVQ
EADTVEAAQNLTVPGGLRSTDLPGLKAATHYTITIRGVTQDFSTTPLSVEVLTEEVPDMG
NLTVTEVSWDALRLNWTTPDGTYDQFTIQVQEADQVEEAHNLTVPGSLRSVEIAGLRAGT
PYTITLRGEVRGHSTQPLAVEVITEELPQLGDLVVSEVGWDGLRLNWTAADNAYEHFVIQ
VQEVNKVEAAQNLTLPGSLRAVDIPGLEAATPYRVSIYGVIRGYRTPVLSAEASTAKEPE
IGNLNVSDITPESFNLSWTATDGIFETFTIEIIDSNRLRETVEYNISGAERTAHISGLPP
STDFIVYLSGLAPSIRTKTISATATTEALPLLENLTISDINPYGFTVSWMASENAFDSFL
VTVVDSGKLLDPQEFTLSGTQRKLELRGLITGIGYEVMVSGFTQGHQTKPLRAEIVTEAE
LEVDNLLVSDATPDGFRLSWTADEGVFDNFVLKIRDTKKQSEPLEITLLAPERTRDITGL
REATEYEIELYGISKGRRSQPVSAIATTAMGSPKEVIFSDITENSATVSWRAPTAQVESF
RITYVPITGGTPSMVTVDGTKTQTRLVKLIPGVEYLVSIIAMKGFEESEPVSGSFTTALD
GPSGLVTANITDSEALARWQPAIATVDSYVISYTGEKVPEITRTVSGNTVEYALTDLEPA
TEYTLRIFAEKGPQKSATITAKFTTDLDSPRDLTATEVQSETALLTWRPARASVTGYLLV
YESVDGTVKEVIVGPDTTSHSLADLSPSTHYTAKIQALNGPLRSNMIQTIFTTIGLLYPF
PKDCSQAMLNGDTTSGLYTIYLNGDKAQALEVFCDMTSDGGGWIVFLRRKNGRENFYQNW
KAYAAGFGDRKEEFWLGLDNLNKITAQGQYELRVDLRDHGETAFAVYDKFSVGDAKTRYK
LKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAITNCALSYKGAFWYRNCHRVNLMGRYGD
NNHSQGVNWFHWKGHEHSIQFAEMKLRPSNFRNLEGRRKRA
NT seq 6606 nt   +upstreamnt  +downstreamnt
atgagggctatgactcagctgttggcaggtgtctttcttgctttccttgccctcactacc
gaaggtggggtcctcaggaaagtcatccggcacaagcgacagagtggggtgaatgccacc
ctgccagaggagaaccagccagtggtgtttaaccatgtttacaacatcaagctgccggtg
ggatcccagtgttcggtggatctggagtcagtcagtggggagaaagacctggcaccacct
tcagagcccagtgaaagcttccaggagcacacagtggatggggaaaaccagattgtcttc
acacatcgcatcaacatcccccgccgggcctgtggctgtgctgcagcccctgatgttaag
gagctgctgagcagactggaggagctggagaacctggtgtcttccctgagagagcaatgt
actgcaggagcaggctgctgtctccagcctgccacaggccgtttggacaccaggcccttc
tgtagcggtcggggcaacttcagcactgaaggatgtggctgtgtctgcgaacctggctgg
aaaggccccaactgctctgagcccgaatgtccaggcaactgtcaccttcgaggccagtgc
attgatgggcagtgcatttgtgacgagggcttcacgggcgaggactgcagccagctggct
tgccccagcgactgcaatgaccagggcaagtgcgtgaatggagtctgcatctgttttgaa
ggctacgccggggctgactgcaaccgtgaaatctgcccggtgccctgcagtgaggagcac
ggcgcatgtgtagatggcttgtgtgtgtgccatgatggctttgcaggcgatgactgcaac
aagcctctgtgtctcaacaattgctacaaccgtgggcgatgcgtggagaatgagtgcgtg
tgtgatgaaggtttcacgggtgaagactgcagtgagctcatctgccccaatgactgcttc
gaccggggccgctgcatcaatggcacctgctactgcgaagaaggcttcacaggtgaagac
tgcgggaaactcacctgcccgcatgcctgccacagccagggccggtgcgaggaggggcag
tgtgtatgtgatgagggctttgccggtgtggactgcagcgagaagaggtgtcctgctgac
tgtcacaatcgtggccgctgtgtagatgggcagtgtgagtgtgatgatggtttcactgga
gctgactgtggggagctcaagtgtcccaatggctgcagtggccatggccgctgtgtcaat
gggcagtgtgtgtgtgatgagggctatactggggaggactgcagccagctacggtgcccc
aatgactgtcacagtcggggccgctgtgtcgagggcaaatgtgtatgtgagcaaggcttc
aagggctatgactgcagtgacatgagctgtcccaatgactgtcaccagcacggccgctgt
gtaaatggcatgtgtgtttgtgatgacggctacacaggggaagactgccgggatcgccaa
tgccccagggactgcagcaacaggggccgctgtgtggatggacagtgtgtctgtgaggac
ggcttcgccggccctgactgtgcagaactctcctgtccaaatgactgccatggccggggt
cgctgtgtgaacgggcagtgcatgtgccatgaaggattcatgggcaaagattgcaaggag
caaagatgtcccagtgactgtcatggccagggccgctgtgtggacggccagtgcatctgc
cacgagggcttcacaggcctggactgtggccagcgctcctgccccagtgactgcaacaac
ttaggacaatgcgtctcaggccgctgcatctgcaacgagggctacaccggagaagactgc
ttagaggtgtctcctcccaaagacctcgtcgtgacagaagtgacggaagagacggtcaac
ctggcctgggacaatgagatgcgggtcacagagtaccttgtcgtgtacacgcccacccac
gagggcggtctggaaatgcagttccgtgtgcctggggaccagacgtccaccatcatccgg
gagctggagcctggtgtggagtactttatccgtgtatttgccatcctggagaacaagaaa
agcattcctgtcagcgccagggtggccacgtacttgcctgcacctgaaggcctgaaattc
aagtccatcaaggagacaactgtggaagtggagtgggatcctctagacattgcttttgaa
acctgggagatcatcttccggaatatgaataaagaagatgagggagagatcaccaaaagc
ctgaggaggccagagacctcttaccggcaaactggtctagctcctgggcaagagtatgag
atatctctgcacatagtgaaaaacaatacccggggccctggcctgaagagggtgaccacc
acacgcttggatgcccccagccagatcgaggtgaaagatgtcacagacaccactgccttg
atcacctggttcaagcccctggctgagatcgacggcattgagctgacctatggcatcaaa
gacgtgccaggagaccgtaccaccatcgatctcacagaggacgagaaccagtactccatc
gggaacctgaagcctgacactgagtacgaggtgtccctcatctcccgcagaggtgacatg
tcaagcaacccagccaaagagaccttcacaacaggcctcgatgctcccaggaatcttcga
cgtgtttcccagacagataacagcatcaccctggaatggaggaatggcaaggcagctatt
gacagttacagaattaagtatgcacccatctctggaggggaccacgccgaggttgatgtt
ccaaagagccaacaagccacaaccaaaaccacactcacaggtctgaggccgggaactgaa
tatgggattggagtttctgctgtgaaggaagacaaggagagcaatccagcgaccatcaat
gcagccacagagttggacatgcccaaggaccttcaggtttctgaaactgcagagaccagc
ctgaccctgctctggaagacaccgttggccaagtttgaccgctaccgcctcaattacagt
ctccccacaggccagtgggtgggagtgcagcttccaagaaacaccacttcctatgtcctg
agagacctggaaccaggacaggagtacaatgtcctcctgacagccgagaaaggcagacat
aagagcaagcccgcacgtgtgaaggcatccactgaacgagcccctgagctggaaaacctc
ttggtgactgaggttggctgggatggcctcagactcaactggaccgcagctgaccaggcc
tatgagcactttatcattcaggtgcaggaggccaacaaggtagaggcagctcagaacctc
accgtgcctggcagccttcgggctgtcgacatcccgggcctcaaggctgccacgccttat
acagtctccatctatggggtgatccagggctatagaacaccagtgctctctgctgaggcc
tccacaggggaaactcccaatttgggagaggtcgtggtggccaaggtgggctgggatgcc
ctcaaactcaactggactgctccagaaggggcctatgagtactttttcattcaggtgcag
gaggctgacacagtagaggcagcccagaacctcacggtcccaggaggactgaggtccacg
gacctgcctgggctcaaagcagccactcattataccatcaccatccgcggggtcactcag
gacttcagcacaacccctctctctgttgaagtcttgacagaggaggttccagatatggga
aacctcacagtgaccgaggttagctgggatgctctcagactgaactggaccacgccagat
ggaacctatgaccagtttaccattcaggtccaggaggctgaccaggtggaagaagctcac
aatctcaccgttcctggcagcctgcgttccgtggaaatcgcaggcctcagggctggcact
ccttacacaatcaccctgcgcggcgaggtcaggggccacagcactcaaccccttgctgta
gaggtcatcacagaggagctcccacagctgggagatttagtggtatctgaggttggctgg
gatggcctcagactcaactggaccgcagctgacaatgcctatgagcactttgtcattcag
gtgcaggaggtcaacaaagtggaggcagcccagaacctcacgttgcctggcagcctcagg
gctgtggacatcccgggcctcgaggctgccacgccttatagagtctccatctatggggtg
atccggggctatagaacaccagtactctctgctgaggcctccacagccaaagaacctgaa
attggaaatttaaatgtttctgacataactcctgagagcttcaatctctcctggacagct
accgatgggatcttcgagacctttaccattgaaattattgattccaataggttgcgggag
actgtggaatataatatctctggtgctgaacgaactgcccatatctcagggctaccccct
agtactgattttattgtctacctctctggacttgctcccagcatccggaccaaaaccatc
agtgccacagccacgacagaggccctgccccttctggaaaacctaaccatttccgacatt
aatccctacgggttcacagtttcctggatggcatcggagaatgcctttgacagctttcta
gtaacggtggtggattctgggaagctgctggacccccaggaattcacactttcaggaacc
cagaggaagctggagcttagaggcctcataactggcattggctatgaggttatggtctct
ggcttcacccaagggcaccaaaccaagcccttgagggctgagattgttacagaagctgaa
ctggaagttgacaaccttctggtttcagatgccaccccagacggtttccgtctgtcctgg
acagctgatgaaggggtcttcgacaattttgttctcaaaatcagagataccaaaaagcag
tctgagccactggaaataaccctacttgcccccgaacgtaccagggacataacaggtctc
agagaggctaccgaatacgaaattgaactctatggaataagcaaaggaaggcgatcccag
ccagtcagtgctatagcaacaacagccatgggctccccaaaggaagtcattttctcagac
atcactgaaaactcagctactgtcagctggagggcacccactgcccaagtggagagcttc
cggattacctatgtgcccattacaggaggtacaccctccatggtaactgtggatggaacc
aagactcagaccaggctggtgaaactcatacctggggtagagtaccttgtcagcatcatc
gccatgaagggctttgaggaaagtgaacctgtctcagggtcattcaccacagctctggat
ggcccatctggcctggtgacagccaacatcactgactcggaagccttggccaggtggcag
ccagccattgccactgtggacagttatgtcatctcctacacaggggagaaagtgccagaa
attacacgcacggtgtccgggaacacagtggagtatgctctgaccgacctcgagcctgcc
acggaatacacactgagaatctttgcagagaaagggccccagaagagcgcaaccatcact
gccaagttcacaacagacctcgattctccaagagacttgactgctactgaggttcagtca
gaaactgccctccttacctggcgacccgcccgggcatcagtcaccggttacctgctggtc
tatgaatcggtggatggcacagtcaaggaagtcattgtgggtccagataccacctcccac
agcctggcggacctgagcccatccacccactacacggccaagatccaggcactcaatggg
cccctgaggagcaatatgatccagaccatcttcaccacaattggactcctgtaccccttc
cccaaggactgctcccaagcaatgctgaatggagacacgacctctggcctctacaccatt
tatctgaatggtgataaggctcaggcactggaagtcttctgtgacatgacctctgatggg
ggtggatggattgtgttcctgagacgcaaaaacggacgcgagaacttctaccaaaactgg
aaggcatatgctgctggatttggggaccgcaaagaagaattctggcttggtctggacaac
ctgaacaaaatcacagcccaagggcagtacgagctccgggtggacctgcgggaccatgga
gagacagcctttgctgtctatgacaagttcagtgtgggagatgccaagactcgctacaag
ctgaaggtggaggggtacagtgggacagcaggtgactccatggcctaccacaatggcaga
tccttctccacctttgacaaggacacagattcagccatcaccaactgtgctctgtcctac
aaaggggctttctggtacaggaactgtcaccgtgtcaacctgatggggagatatggggac
aataaccacagtcagggcgttaactggttccactggaagggccacgaacactcaatccag
tttgctgagatgaagctgagaccaagcaacttcagaaatcttgaaggcaggcgcaaacgg
gcataa

KEGG   Pongo abelii (Sumatran orangutan): 100452638
Entry
100452638         CDS       T01416                                 

Gene name
TNR
Definition
(RefSeq) tenascin-R isoform X1
  KO
K06252  tenascin
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05165  Human papillomavirus infection
pon05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100452638 (TNR)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100452638 (TNR)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100452638 (TNR)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    100452638 (TNR)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100452638 (TNR)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100452638 (TNR)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100452638 (TNR)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_Tenascin EGF_2 Pur_ac_phosph_N Uroplakin_II EGF
Other DBs
NCBI-GeneID: 100452638
NCBI-ProteinID: XP_024106938
Ensembl: ENSPPYG00000000482
LinkDB
Position
1
AA seq 1358 aa
MGADGETVVLKNMLIGVNLILLGSMIKSSECQLEVTTERVQRQSAEEEGGIANYNTSRKE
QPVVFNHVYNINVPLDSLCSSGLEASAEQEVSAEDETLAEYTGQTSDHESQVTFTHRINF
PKKACPCAGSAQVLQELLSRIEMLEREVSVLRDQCNANCCQESAATGQLDYIPHCSGHGN
FSFESCGCICNEGWFGKNCSEPYCPLGCSSRGVCVDGQCICDSEYSGDDCSELRCPTDCS
SRGLCVDGECVCEEPYTGEDCRELRCPGDCSGKGRCANGTCLCEEGYVGEDCGQRQCLNA
CSGRGQCEEGLCVCEEGYQGPDCSAVAPPEDLRVAGISDRSIELEWDGPMAVTEYVISYQ
PTALGGLQLQQRVPGDWSGVTITELEPGLTYNISVYAVISNILSLPITAKVATHLSTPQG
LQFKTITETTVEVQWEPFSFSFDGWEISFIPKNNEGGVIAQVPSDVTSFNQTGLKPGEEY
IVNVVALKEQARSPPTSASVSTVIDGPTQILVRDVSDTVAFVEWIPPRAKVDFILLKYGL
VGGEGGRTAFRLQPPLSQYSVQALRPGSRYEVSVSAVRGTNESDSATTQFTTEIDAPKNL
RVGSRTATSLDLEWDNSEAEVQEYKIVYSTLAGEQYHEVLVPKGIGPTTRATLTDLVPGT
EYGVGISAVMNSQQSVPATMNARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITF
TPSSGIASEVTVPKDRTSYTLTDLEPGAEYIISITAERGRQQSLESTVDAFTGFRPISHL
HFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVSLDATKRHAVLMGLQPATEY
IVNLVAVHGTVTSEPIVGAITTGIDPPKDIAISNVTKDSVTVSWSPPVASFDYYRVSYRP
TQVGRLDSSVVPNTVTEFTITRLYPATEYEISLNSVRGREESERICTLVHTAMDNPVNLI
ATNITPTEALLQWKAPVGEVENYVIVLTHFAVAGETILVDGVSEEFRLVDLLPSTHYTAT
MYATNGPLTSGTISTNFSTLLDPPANLTASEVTRQSALISWQPPRAEIENYVLTYKSTDG
SRKELIVDAEDTWIRLEGLLENTDYTVLLQAAQDTAWSSITSTAFTTGGRVFPHPQDCAQ
HLMNGDTLSGVYLIFLNGELSQKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVG
FGNLEDEFWLGLDNIHRITSQGRYELRVDMRDGQEAAFASYDRFSVEDSRNLYKLRIGSY
NGTAGDSLSYHQGRPFSTEDRDNDVAVTNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQG
INWYHWKGHEFSIPFVEMKMRPYNHRLMAGRKRQSLQF
NT seq 4077 nt   +upstreamnt  +downstreamnt
atgggggcagatggggaaacagtggttctgaagaacatgctcattggcgtcaacctgatc
cttctgggctccatgatcaagtcttcagagtgtcagctggaggtcaccacagaaagggtc
cagagacagtcagcggaggaggagggaggcattgccaactacaacacatcccgcaaagag
cagcctgtggtcttcaaccacgtgtacaacattaacgtgcccttggacagcctctgctcc
tcagggctagaggcctctgctgagcaggaggtgagtgcagaagatgagactctggcagag
tacacaggccagacctcagaccacgagagccaggtcacctttacacacaggatcaacttc
cccaaaaaggcctgcccatgtgccggttcagcccaggtgctgcaggagctgctgagccgg
atcgagatgctggagagggaggtgtcggtgctgagagaccagtgcaacgccaactgctgc
caagaaagtgctgccacaggacaactggactatatccctcactgcagtggccacggcaac
tttagctttgagtcctgtggctgcatctgcaacgaaggctggtttggcaagaattgctcg
gagccctactgcccgctgggttgctccagccggggggtgtgtgtggatggccagtgcatc
tgtgacagcgagtacagcggggatgactgttccgaactccggtgcccaacagactgcagc
tcccgggggctctgcgtggacggggagtgtgtctgtgaagagccctacactggcgaggac
tgcagggaactgaggtgccccggggactgttcggggaaggggagatgtgccaacggtacc
tgtttatgtgaggagggctacgttggtgaggactgcggccagcggcagtgtctgaatgcc
tgcagtgggcgaggacaatgtgaggaggggctctgcgtctgtgaagagggctaccagggc
cctgactgctcagcagttgcccctccagaggacttgcgagtggctggtatcagcgacagg
tccattgagctggaatgggacgggccgatggcagtgacggaatatgtgatctcttaccag
ccgacggccctggggggcctccagctccagcagcgggtgcctggagattggagtggtgtc
accatcacggagctggagccaggtctcacctacaacatcagcgtctacgctgtcattagc
aacatcctcagccttcccatcactgccaaggtggccacccatctctccactcctcaaggg
ctacaatttaagacgatcacagagaccaccgtggaggtgcagtgggagcccttctcattt
tccttcgatgggtgggaaatcagcttcattccaaagaacaatgaagggggagtgattgct
caggtccccagcgatgttacgtcctttaaccagacgggactaaagcctggggaggaatac
attgtcaatgtggtggctctgaaagaacaggcccgcagcccccccacctcggccagcgtc
tccacagtcattgacggccccacgcagatcctggttcgagatgtctctgacaccgtggct
tttgtggagtggattccccctcgagccaaagtcgatttcattctcttgaaatatggcctg
gtgggcggggaaggtgggaggaccgccttccgcctgcagcctcccctgagccaatactca
gtgcaggccctgcggcctggctcccgctacgaggtgtcagtcagtgccgtcagagggacc
aacgagagcgattctgccaccactcagttcacaacagagatcgatgcccccaagaacttg
cgagttggttctcgcacagcaaccagccttgacctcgagtgggataacagtgaggccgaa
gttcaggagtacaagattgtgtacagcaccctggcgggtgagcaatatcacgaggtactg
gtccccaagggcattggtccaaccaccagggccaccctgacagatctggtacccggcact
gagtatggagttggaatatctgccgtcatgaactcacagcaaagcgtgccagccaccatg
aatgccaggactgaacttgacagtccccgagacctcatggtgacagcctcctcggagacc
tccatctctctcatctggaccaaggccagtggccccattgaccactaccgaattaccttt
accccatcctctgggattgcctcagaagtcactgtacccaaggacaggacctcgtacaca
ctaacagatctagagcctggggcagagtacatcatttccatcactgctgagaggggtcgg
cagcagagcttggagtccactgtggatgctttcacaggcttccgtcccatctctcatctg
cacttttctcacgtgacctcctccagtgtgaacatcacttggagtgatccatctccccca
gcagacagactcattctgaactacagccccagggatgaggaggaagagatgatggaggtc
tccctggatgccaccaagaggcatgctgtcctgatgggcctgcaaccagccacagagtac
attgtgaaccttgtggctgtccatggcacagtgacctctgagcccattgtgggcgccatc
accacaggaattgatcccccaaaagacatcgcaattagcaatgtgaccaaggactcagtg
acggtctcctggagccctcctgttgcatctttcgattactaccgagtatcatatcgaccc
acccaagtgggacgactagacagctcagtggtgcccaacactgtgacagaattcaccatc
accagactgtacccagctaccgaatatgaaatcagtctcaacagcgtgcggggcagggag
gaaagcgagcgcatctgtactctcgtgcacacagccatggacaaccctgtgaatctgatt
gctaccaatatcactccaacagaagccttgctgcagtggaaggcaccagtgggtgaggtg
gagaactacgtcattgttcttacacactttgcagttgctggagagaccatcctggttgac
ggagtcagtgaggaatttcggcttgttgacctgcttcctagcacccactatactgccacc
atgtatgccaccaatgggcctctcaccagtggcaccatcagcaccaacttttctactctc
ctggaccctccggcaaacctgacagccagtgaagtcaccagacaaagtgccctgatctcc
tggcagcctcccagggcagagattgaaaattacgtcttgacctacaaatccaccgacgga
agccgcaaggagctgattgtggatgcagaagacacctggattcgactggagggcctgttg
gagaacacagactacacggtgctcctgcaggcagcacaggacactgcgtggagcagcatc
acctccaccgccttcaccacagggggccgggtgttccctcatccccaagactgtgcccag
catttgatgaatggagacactttgagcggggtttacctcatcttcctcaatggggagctg
agccagaaattacaagtgtactgtgatatgaccaccgacgggggcggctggattgtattc
cagaggcggcagaatggccaaactgattttttccggaaatgggctgattaccgtgttggc
ttcgggaacctggaggatgagttctggctggggctggacaacatacacaggatcacatcc
cagggccgctatgagctgcgtgtggacatgcgagatggccaggaggccgcctttgcctcc
tacgacaggttctctgtcgaggacagcagaaacttgtacaaactccgcataggaagctac
aacggcactgcaggggactccctcagctatcatcaaggacgccctttctccacagaggat
agagacaatgatgttgcagtgactaactgtgccatgtcgtacaagggagcgtggtggtat
aagaactgccaccggaccaacctcaatgggaagtacggagagtccaggcacagtcagggc
atcaactggtaccattggaaaggccacgagttctccatcccctttgtggaaatgaagatg
cgcccgtacaaccaccgtctcatggcagggagaaaacggcagtccttacagttctga

KEGG   Pongo abelii (Sumatran orangutan): 100453010
Entry
100453010         CDS       T01416                                 

Gene name
TNN
Definition
(RefSeq) tenascin-N isoform X1
  KO
K06252  tenascin
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05165  Human papillomavirus infection
pon05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100453010 (TNN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100453010 (TNN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100453010 (TNN)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    100453010 (TNN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100453010 (TNN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100453010 (TNN)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100453010 (TNN)
SSDB
Motif
Pfam: fn3 Fibrinogen_C DUF4998 Pur_ac_phosph_N EGF_2 EGF_Tenascin
Other DBs
NCBI-GeneID: 100453010
NCBI-ProteinID: XP_024086930
Ensembl: ENSPPYG00000000484
LinkDB
Position
1
AA seq 1299 aa
MSLQEMFRFPLGLLLGSVLLAASAPATLEPPGCSNKEQQVTVSHTYKIDVPKSALVQVDA
DPQPLSDDGASLLALGEAGEEQNIIFRHNIRLQTPQKDCELAGSVQDLLARVKKLEEEMV
EMKEQCSAKRCCQGVTDLSRHCSGHGTFSLETCSCHCEEGREGPACERLACPGACSGHGR
CVDGRCLCHEPYVGADCGYPACPENCSGHGECVRGVCQCHEDFMSEDCSEKRCPGDCSGH
GFCDTGECYCEEGFTGLDCAQVVAPQGLQLLKNTEDSLLVSWEPSSQVDHYLLSYYPLGK
ELSGKQIQVPKEQHSYEILGLLPGTKYIVTLRNVKKEISSSPQHLLATTDLAVLGTAWVT
DETENSLDVEWENPSTEVDYYKLRYGPMTGQEVAEVTVPKSSDPKSRYDITGLHPGTEYK
ITVVPMRGELEGKPILLNGRTEIDSPTNVVTDRVTEDTATVSWDPVQAVIDKYVVRYTSA
DGDTKETAVHKDESSTVLTGLKPGEAYKVYVWAERGNQGSKKADTNALTEIDSPANLVTD
RVTENTATISWDPVQATIDKYVVRYTSADDQETREVLVGKEQSSTVLTGLRPGVEYTVHV
WAQKGDRESKKADTNTSTDIDSPKNLVTDRVTENMATVSWDPVQAAIDKYVVCYTSADGE
TRQVPVGKEQSSTVLTGLRPGMEYMVHVWAQKGDQESKKVDTKAQTDIDSPQNLVTDRVT
ENMATVSWDPVQATIDRYVVRYTSAKDGETREVPVGKEQSSTVLTGLRPGVEYTVHVWAQ
KGAQESKKADTKAQTDIDSPQNLVTDRVTENTATVSWDPVQATIDRYVVRYTSANGETRE
VPVGKEQSSTVLTGLRPGVEYTVHVWAQKGNQESKKADTKAQTEIDSPKNLVTGWVTENM
ATVSWDPVQATIDKYVVRYTSADGETREVPVGKEHSSTVLTGLRPGMEYMVHVWAQKGAQ
ESKKSDTKAQTELDPPRNLRPSAVTQSGGVLTWTPPSAEIHGYILTYQFPDGTVKEMQLG
REDQRFELQGLEQGATYPVSLVAFKGGRRSRNVSTTLSTVGARFPHPSDCSQVQQNSNAA
SGLYTIYLHGDASRPLQVYCDMDTDGGGWTVFQRRNTGQLDFFKRWRSYVEGFGDPMKEF
WLGLDKLHNLTTGTPMRYEVRVDLQTANESAYAIYDFFQVASSKERYKLTVGKYRGTAGD
ALTYHNGWKFTTFDRDNDIALSNCALTHHGGWWYKNCHLANPNGRYGETKHSEGVNWEPW
KGHEFSIPYVELKIRPHGYSREPVLGRKKRTLGGRLRTF
NT seq 3900 nt   +upstreamnt  +downstreamnt
atgagtctccaggagatgttccgcttccctttggggctcctgcttggctctgtgctcctg
gcggcttcggccccagccactctggagcctcccggctgcagcaacaaggagcaacaggtc
actgtcagccacacctacaagatcgatgtgcccaagtctgccttggttcaggttgacgct
gaccctcagcccctcagtgacgatggggcttcgctcttggccctgggggaggccggggag
gaacagaacatcatcttcaggcacaacatccgccttcagacgccacagaaggactgtgag
ttggcaggcagtgtccaggacctcctggcccgggtgaagaagctggaggaagagatggtg
gagatgaaggaacagtgtagtgccaagcgctgctgccagggagtcactgatctaagccgc
cactgcagcggccacgggaccttctccctggagacctgcagctgccactgcgaggagggc
agggagggccccgcctgcgagcggctggcctgccccggggcgtgcagcggccacgggcgt
tgcgtggacgggcgctgcctgtgccatgagccctacgtgggtgccgactgcggctacccg
gcctgccctgagaactgcagcgggcacggcgagtgcgtgcgcggcgtatgccagtgccac
gaggacttcatgtcggaggactgcagcgagaagcgctgtcccggcgactgcagcggccac
ggcttctgcgacacgggcgagtgctactgcgaggagggcttcacaggcctggactgtgcc
caggtggttgccccacagggcctgcagctgctcaagaacacggaggattctctgctggtg
agctgggagccctccagccaggtggatcactacctccttagctactaccccctggggaag
gagctctctgggaagcagatccaagtgcccaaggagcagcacagctacgagattcttggt
ttgctgcctggaaccaagtatatagtcaccctgcgtaacgtcaagaaagaaatttctagc
agcccacagcatctacttgccaccacagaccttgctgtgcttggcactgcctgggtgaca
gatgagactgagaactccctcgacgtggagtgggaaaacccctcgactgaggtggactac
tacaagctgcgatatggtcccatgacagggcaggaggtagccgaggtcactgtgcccaag
agcagtgaccccaagagccgatatgacatcactggtctgcacccggggaccgagtataag
atcacggtggtgcccatgagaggagagctggagggcaagccgatcctcctgaatggcagg
acagaaattgacagtccaaccaatgttgtcaccgatcgagtgactgaagacacagcaact
gtctcctgggacccagtgcaggctgtcatagacaagtatgtggtgcgctacacctctgct
gatggggacaccaaggaaacggcagtgcacaaggacgagagcagcactgtcctgacgggc
ctgaagccaggagaggcatacaaggtttacgtgtgggctgaaaggggcaaccaggggagc
aagaaagctgacaccaatgcccttacagaaattgacagcccagcaaacctggtgactgac
cgggtgactgagaataccgccaccatctcctgggacccggtgcaggccaccattgacaag
tatgtggtgcgctacacctctgctgatgaccaagagaccagagaggttctggtggggaag
gagcagagcagcactgtcctgacaggcctgaggccaggtgtggagtacacagtgcatgtc
tgggcccagaagggggaccgagagagcaagaaggctgacaccaacacctcgacagacatt
gacagccccaaaaacctggtgactgaccgggtgacagagaatatggccactgtctcctgg
gacccggtgcaggctgccattgacaagtacgtggtatgctacacctctgccgatggagag
accaggcaggttccggtggggaaggagcagagcagcactgtcctgacaggcctgagaccg
ggcatggagtacatggtgcatgtgtgggcccagaagggggaccaggagagcaagaaggtc
gacaccaaggcccagacagacattgacagcccccaaaacctggtgactgaccgggtgaca
gagaatatggccactgtctcctgggacccggtgcaggccaccattgacaggtatgtggtg
cgctacacctctgccaaggacggagagaccagggaggttccggtggggaaggagcagagt
agcactgtcctgacgggcctgaggccgggtgtggagtacacagtgcacgtgtgggcccag
aagggggcccaggagagcaagaaggctgacaccaaggcccagacagacattgacagcccc
caaaacctggttactgaccgggtgacagagaatacagccactgtctcctgggacccggtg
caggccaccattgacaggtatgtggtgcgctacacgtcagccaacggagagaccagggag
gttccagtggggaaggagcagagcagcactgtcctgacgggcctgaggccaggcgtggag
tacacggtgcacgtgtgggcccagaaggggaaccaggagagcaagaaggctgacaccaag
gcccagacagaaattgacagccccaaaaacctggtgactggctgggtgacagagaatatg
gccactgtctcctgggacccggtgcaggccaccattgacaagtatgtggtgcgctacacc
tctgccgacggggagaccagggaggttccggtggggaaggagcacagcagcaccgtcctg
acgggcctgagaccaggcatggagtacatggtgcacgtgtgggcccagaagggggcccag
gagagcaagaagtctgacaccaaggcccagacagaactcgaccctcccagaaaccttcgt
ccatctgctgtaacgcagtctggtggcgtattgacctggacgcccccctctgctgagatc
catggctacattctgacctaccagttcccagacggcacagttaaggagatgcagctggga
cgagaagaccagaggtttgagttgcaaggccttgagcaaggcgccacctaccctgtctcc
cttgttgcctttaagggtggtcgccggagcagaaatgtatccaccaccctctccacagtt
ggtgcccgtttcccacacccttcggactgcagtcaggttcagcagaacagcaatgccgcc
agtggtctgtacaccatctacctgcacggcgatgccagccggcccctgcaggtgtactgt
gacatggacacggatggaggtggctggactgtcttccaaaggcggaacactgggcagctg
gatttcttcaagcgatggaggagctatgtggaaggctttggggaccccatgaaggagttc
tggcttggacttgacaagctacacaacctcaccaccggcactccaatgcggtatgaagtg
agagtggatttacagactgccaatgaatctgcctatgctatatatgatttcttccaagtg
gcctccagcaaggagcggtataagctgacagttgggaaatacagaggcacggcaggggat
gctcttacttaccacaatggatggaagtttacaactttcgacagagacaacgatattgca
ctcagcaactgtgccctgacgcatcatggtggctggtggtataagaactgccacttggcc
aaccccaatggcagatatggggagaccaagcacagtgagggggtgaactgggagccttgg
aaaggacatgaattctccattccttacgtggagttgaaaatccgccctcatggctacagc
agggagcctgtcctgggcagaaagaagcggacgctgggaggaaggctgcgaacgttctga

KEGG   Pongo abelii (Sumatran orangutan): 100453386
Entry
100453386         CDS       T01416                                 

Gene name
THBS1
Definition
(RefSeq) thrombospondin-1
  KO
K16857  thrombospondin 1
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04015  Rap1 signaling pathway
pon04115  p53 signaling pathway
pon04145  Phagosome
pon04151  PI3K-Akt signaling pathway
pon04350  TGF-beta signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05144  Malaria
pon05165  Human papillomavirus infection
pon05205  Proteoglycans in cancer
pon05206  MicroRNAs in cancer
pon05219  Bladder cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04015 Rap1 signaling pathway
    100453386 (THBS1)
   04350 TGF-beta signaling pathway
    100453386 (THBS1)
   04151 PI3K-Akt signaling pathway
    100453386 (THBS1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100453386 (THBS1)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    100453386 (THBS1)
  09143 Cell growth and death
   04115 p53 signaling pathway
    100453386 (THBS1)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100453386 (THBS1)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    100453386 (THBS1)
   05205 Proteoglycans in cancer
    100453386 (THBS1)
  09162 Cancer: specific types
   05219 Bladder cancer
    100453386 (THBS1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100453386 (THBS1)
  09174 Infectious disease: parasitic
   05144 Malaria
    100453386 (THBS1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pon04131]
    100453386 (THBS1)
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100453386 (THBS1)
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100453386 (THBS1)
Membrane trafficking [BR:pon04131]
 Endocytosis
  Phagocytosis
   Opsonins
    100453386 (THBS1)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100453386 (THBS1)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   100453386 (THBS1)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC EGF_3 TSP1_ADAMTS EGF_CA Laminin_G_3 TSP1_CCN cEGF Laminin_G_2 EGF Laminin_G_1
Other DBs
NCBI-GeneID: 100453386
NCBI-ProteinID: XP_002825327
Ensembl: ENSPPYG00000006338
UniProt: H2NMT2 A0A663D9A9
LinkDB
Position
15
AA seq 1170 aa
MGLAWGLGILFLMHVCGTNRIPESGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFR
IEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVV
SNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELD
VPIQSVFTRDLASIARLRIAKGGVNDNFQGVLQNVRFVFGTTPEDILRNKGCSSSTSVLL
TLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRK
VTEENKELANELRRPPLCYHNGVQYRNNEEWTVDSCTECHCQNSVTICKKVSCPIMPCSN
ATVPDGECCPRCWPSDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQT
RTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARE
TKACKKDACPINGGWGPWSPWDICSVTCGGGVQKRSRLCNNPTPQFGGKDCVGDVTENQI
CNKQDCPIDGCLSNPCFAGVKCTSYPDGSWKCGACPAGYSGNGIQCTDVDECKEVPDACF
NHNGEHRCKNTDPGYNCLPCPPRFTGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKN
AKCNYLGHYSEPMYRCECKPGYAGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPN
LPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYN
HNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEH
NPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDN
DGIPDDRDNCRLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRF
QMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDD
DYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNAL
WHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYD
KTYAGGRLGLFVFSQEMVFFSDLKYECRDP
NT seq 3513 nt   +upstreamnt  +downstreamnt
atggggctggcctggggactaggcatcctgttcctgatgcatgtgtgtggcaccaaccgc
attccagagtctgggggagacaacagcgtgtttgacatctttgaactcaccggggccgcc
cgcaaggggtctgggcgccgactggtgaagggccctgacccttccagcccagctttccgc
atcgaggatgccaacctgatcccccctgtgcctgatgacaagttccaagacctggtggat
gctgtgcgggcagaaaagggtttcctccttctggcctccctgaggcagatgaagaagacc
cggggcacgctgctggccctggagcggaaagaccactctggccaggtcttcagcgtggtg
tccaatggcaaggcgggcaccctggacctcagcctgaccgtccaaggaaagcagcacgtg
gtgtctgtggaagaagctctcctggcaaccggccagtggaagagcatcaccctgtttgtg
caggaagacagggcccagctgtacatcgactgtgaaaagatggagaatgctgagttggac
gtccccatccaaagcgtcttcaccagagacctggccagcatcgccagactccgcatcgca
aaggggggcgtcaatgacaatttccagggggtgctgcagaatgtgaggtttgtctttgga
accacaccagaagacatcctcaggaacaaaggctgctccagctctaccagtgtcctcctc
acccttgacaacaacgtggtgaatggttccagccctgccatccgcactaactacattggc
cacaagacaaaggacttgcaagccatctgcggcatctcctgtgatgagctgtccagcatg
gtcctggaactcaggggcctgcgcaccattgtcaccacgctgcaggacagcatccgcaaa
gtgactgaagagaacaaagagttggccaatgagctgaggcggcctcccctgtgctatcac
aacggagttcagtacagaaacaacgaggaatggactgttgatagctgcactgagtgtcac
tgtcagaactcagttaccatctgcaaaaaggtgtcctgccccatcatgccctgctccaat
gccacagttcctgatggagaatgctgccctcgctgttggcccagcgactctgcggacgat
ggctggtctccatggtctgagtggacctcctgttctacgagctgtggcaatggaattcag
cagcgcggccgctcctgcgatagcctcaacaaccgatgtgagggctcctcggtccagaca
cggacctgccacattcaggagtgtgacaagagatttaaacaggatggtggctggagccac
tggtccccgtggtcatcttgttctgtgacatgtggtgatggtgtgatcacaaggatccgg
ctctgcaactctcccagcccccagatgaacgggaaaccctgtgaaggcgaagcgcgggag
accaaagcctgcaagaaagatgcctgccccatcaatggaggctggggtccttggtcacca
tgggacatctgttctgtcacctgtggaggaggggtacagaaacgtagccgtctctgcaac
aaccccacaccccagtttggaggcaaggactgcgttggtgatgtgacagaaaaccagatc
tgcaacaagcaggactgtccaattgatggatgcctgtccaatccctgctttgccggcgtg
aaatgtactagctaccctgatggcagctggaaatgtggtgcttgtcccgctggttacagt
ggaaatggcatccagtgtacagatgttgatgagtgcaaagaagtgcctgatgcctgcttc
aaccacaatggagagcacaggtgtaagaacacggaccccggctacaactgcctgccctgc
ccgccacgcttcaccggctcacagcccttcggccagggtgtcgaacatgccacggccaac
aaacaggtgtgcaagccccgtaacccctgcacggatgggacccacgactgcaacaagaac
gccaagtgcaactacctgggccactatagcgagcccatgtaccgctgcgagtgcaagcct
ggctacgctggcaatggcatcatctgcggggaggacacagacctggatggctggcccaat
gagaacctggtgtgcgtggccaatgcgacttaccactgcaaaaaggataattgccccaac
ctccccaactcaggacaggaagactatgacaaggatggaattggtgatgcctgtgatgat
gacgatgacaatgataaaattccagatgacagggacaactgtccattccattacaaccca
gctcagtatgactatgacagagatgacgtcggagaccgctgtgacaactgtccctacaac
cacaacccagatcaggcagacacagacaacaatggggaaggagacgcctgtgctgcagac
attgatggagacggtatcctcaatgaacgggacaactgccagtacgtctacaatgtggac
cagagagacactgatatggatggggttggagatcagtgtgacaactgccccttggaacac
aatccggatcagctggactctgactcagaccgcattggagatacctgtgacaacaatcag
gatattgatgaagatggccaccagaacaatctggacaactgtccctatgtgcccaatgcc
aaccaggctgaccatgacaaagatggcaagggagatgcctgtgaccacgatgatgacaac
gatggcattcctgatgacagggacaactgcagactcgtgcccaatcctgaccagaaggac
tctgacggcgatggtcgaggtgatgcttgcaaagatgattttgaccatgacagtgttcca
gacatcgatgacatctgtcctgagaatgtggacatcagtgagaccgatttccgccgattc
cagatgattcctctggatcccaaagggacatcccaaaatgaccctaactgggttgtacgc
catcagggtaaagaactcgtccagactgtcaactgtgatcctggactcgctgtaggttat
gacgagtttaatgctgtggacttcagtggcaccttcttcatcaacaccgaaagagacgat
gactatgctggatttgtctttggctaccagtccagcagccgcttttatgttgtaatgtgg
aagcaagtcacccagtcctactgggacaccaaccccacgagggctcagggatactcgggc
ctttctgtgaaagttgtaaactccaccacagggcctggcgagcacctgcggaatgccctg
tggcacacaggaaacacccctggccaggtgcgcaccctgtggcatgaccctcgtcacata
ggctggaaagatttcaccgcctacagatggcgtctcagccacaggccaaagacgggtttc
attagagtggtgatgtatgaagggaagaaaatcatggctgactcaggacccatctatgat
aaaacctatgctggtggtagactagggttgtttgtcttctctcaagaaatggtgttcttc
tccgacctgaaatacgaatgtagagatccctaa

KEGG   Pongo abelii (Sumatran orangutan): 100455895
Entry
100455895         CDS       T01416                                 

Gene name
THBS2
Definition
(RefSeq) thrombospondin-2
  KO
K04659  thrombospondin 2/3/4/5
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04145  Phagosome
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05144  Malaria
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100455895 (THBS2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100455895 (THBS2)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    100455895 (THBS2)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100455895 (THBS2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100455895 (THBS2)
  09174 Infectious disease: parasitic
   05144 Malaria
    100455895 (THBS2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pon04131]
    100455895 (THBS2)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100455895 (THBS2)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100455895 (THBS2)
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100455895 (THBS2)
Membrane trafficking [BR:pon04131]
 Endocytosis
  Phagocytosis
   Opsonins
    100455895 (THBS2)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100455895 (THBS2)
  Exosomal proteins of colorectal cancer cells
   100455895 (THBS2)
  Exosomal proteins of bladder cancer cells
   100455895 (THBS2)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100455895 (THBS2)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   100455895 (THBS2)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC EGF_3 EGF_CA TSP1_ADAMTS EGF cEGF TSP1_CCN Laminin_G_2
Other DBs
NCBI-GeneID: 100455895
NCBI-ProteinID: XP_024104045
LinkDB
Position
6
AA seq 1172 aa
MVWRLVLLALWVWPSMEAGHQDKDTTFDLFSISNINRKTIGAKQFRGPDPGMPAYRFVRF
DYIPPVNADDLSKITKIMRQKEGFFLTAQLKQDGKSRGTLLALEGPGLSQRQFEIVSNGP
ADTLDLTYWIDGTRHMVSLEDVGLADSQWKNVTVQVAGETYSLHVGCDLIDSFALDEPFY
EHLQAEKSRMYVAKGSARESHFRGLLQNVHLVFENSVEDILSKKGCQQGQGAEINTISEN
TETLRLGPHVTTEYVGPSSERRPEVCERSCEELGNMVQELSGLHVLVNQLSENLKRVSND
NQFLWELIGGPPKTRNMSACWQDGRFFAENETWVVDSCTTCTCKKFKTICHQITCPPATC
ASPSFVEGECCPSCLHSVDGEEGWSPWAEWTECSVTCGSGTQQRGRSCDVTSNTCLGPSI
QTRACSLSKCDTRIRQDGGWSHWSPWSSCSVTCGVGNITRIRLCNSPVPQLGGKNCKGSG
RETKACQGAPCPIDGRWSPWSPWSACTVTCAGGIRERTRVCNSPEPQYGGKTCVGDVQER
QMCNKRSCPVDGCLSNPCFPGAQCSSFPDGSWSCGSCPVGFLGNGTHCEDLDECALVPDI
CFSTSKVPRCVNTQPGFHCLPCPPRYRGSQPVGVGLEAAKMEKQVCEPENPCKDKTHNCH
KHAECIYLGHFSDPMYKCECQTGYAGDGLICGEDSDLDGWPNLNLVCATNATYHCIKDNC
PHLPNSGQEDFDKDGIGDACDDDDDNDGVTDEKDNCQLLFNPRQADYDKDEVGDRCDNCP
YVHNPAQIDTDNNGEGDACSVDIDGDDVFNERDNCPYVYNTDQRDMDGDGVGDHCDNCPL
VHNPDQTDVDNDLVGDQCDNNEDIDDDGHQNNQDNCPYISNANQADHDRDGQGDACDPDD
DNDGVPDDRDNCRLVFNPDQEDLDGDGRGDICKDDFDNDNVPDIDDVCPENNAISETDFR
NFQMVPLDPKGTTQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDR
DDDYAGFVFGYQSSSRFYVVMWKQVTQTYWEDQPTRAYGYSGVSLKVVNSTTGTGEHLRN
ALWHTGNTPGQVRTLWHDPRNIGWKDYTAYRWHLTHRPKTGYIRVLVHEGKQVMADSGPI
YDQTYAGGRLGLFVFSQEMVYFSDLKYECRDI
NT seq 3519 nt   +upstreamnt  +downstreamnt
atggtctggaggctggtcctgctggctctgtgggtgtggcccagcatggaagctggtcac
caggacaaagacacgaccttcgaccttttcagtatcagcaacatcaaccgcaagaccatt
ggcgccaagcagttccgcgggcccgaccccggcatgccggcttaccgcttcgtgcgcttt
gactacatcccacccgtgaacgcagatgacctcagcaagatcaccaagatcatgcggcag
aaggagggcttcttcctcacggcccagctcaagcaggacggcaagtccaggggcacgctg
ctggctctggagggccccggtctctcccagaggcagttcgagatcgtctccaatggcccc
gcggacacgctggacctcacctactggatcgacggcacccggcacatggtctccctggag
gacgtcggcctggctgactcgcagtggaagaacgtcaccgtgcaggtggctggtgagacc
tacagcttgcacgtgggctgcgacctcatagacagcttcgctctggacgagcccttctac
gagcacctgcaggcggaaaagagccggatgtacgtggccaaaggttctgccagagagagt
cacttcaggggtttgcttcagaacgtccacctagtgtttgaaaactctgtagaagatatt
ctaagcaagaagggttgccagcaaggccagggagctgagatcaacaccatcagtgagaac
acggagacgctgcgcctgggtccgcatgtcaccaccgagtacgtgggccccagctcagag
aggaggcccgaggtgtgcgaacgctcctgcgaggagctgggaaacatggtccaggagctc
tcggggctccacgtcctcgtgaaccagctcagtgagaacctcaagagagtgtcgaatgat
aaccagtttctctgggagctcattggtggccctcctaagacaaggaacatgtcagcttgc
tggcaggatggccggttctttgcggaaaatgaaacgtgggtggtggacagctgcaccacg
tgtacctgcaagaaatttaaaaccatttgccaccaaatcacctgcccgccggcaacctgc
gccagcccgtcctttgtggaaggcgaatgctgcccttcctgcctccactcggtggacggg
gaggagggctggtctccgtgggcagagtggaccgagtgctctgtgacgtgtggctctggg
acccagcagagaggccggtcctgtgatgtcaccagcaacacctgcttggggccctccatc
cagacacgagcttgcagcctgagcaagtgtgacacccgcatccggcaggacggcggctgg
agccactggtcaccttggtcttcatgctctgtgacctgtggagttggcaatatcacacgc
atccgtctctgcaactccccagtgccccagttggggggcaagaattgcaaagggagtggc
cgggagaccaaagcctgccagggcgccccatgcccaatcgatggccgctggagcccctgg
tccccgtggtcggcctgcactgtcacctgtgccggcgggatccgggagcgcacccgggtc
tgcaacagccctgagcctcagtacggagggaagacctgtgtgggggatgtgcaggagcgc
cagatgtgcaacaagaggagctgccccgtggatggctgtttatccaacccctgcttcccg
ggagcccagtgcagcagcttccccgatgggtcctggtcatgcggctcctgccctgtgggc
ttcttgggcaatggcacccactgtgaggacctggacgagtgtgccctggtccccgacatc
tgcttctccaccagcaaggtgcctcgctgtgtcaacactcagcccggcttccactgcctg
ccctgcccgccccgttacagagggagccagcccgtcggggtcggcctggaagcagccaag
atggaaaagcaagtgtgtgagcccgaaaacccatgcaaggacaagacacacaactgccac
aagcacgcggagtgcatctacctgggccacttcagcgaccccatgtacaagtgcgagtgc
cagacaggctacgcgggcgacgggctcatctgcggggaggactcggacctggacggctgg
cccaacctcaacctggtctgcgccaccaacgccacctaccactgcatcaaggataactgc
ccccatctgccaaattctggacaggaagactttgacaaggacgggattggcgatgcctgt
gatgatgacgacgacaatgacggtgtgaccgatgagaaggacaactgccagctcctcttc
aatccccgccaggctgactatgacaaggatgaggttggggaccgctgtgacaactgccct
tatgtgcacaaccctgcccagatagacacagacaacaacggagagggtgacgcctgctcc
gtggacattgacggggacgatgtcttcaatgaacgagacaattgtccctatgtctacaac
accgaccagagggacatggacggtgacggtgtgggggatcactgtgacaactgccccctg
gtgcacaaccctgatcagactgacgtggacaatgaccttgtcggggaccagtgtgacaac
aacgaggacatagatgatgacggccaccagaacaaccaggacaactgcccctacatctcc
aacgccaaccaggctgaccacgacagagatggccagggcgacgcctgtgaccccgatgat
gacaacgatggcgtccccgatgacagggacaactgccggcttgtgttcaacccagaccag
gaggacctggacggtgatggacggggtgatatttgtaaagatgattttgacaatgacaac
gtcccagatattgatgatgtgtgtcctgaaaacaatgccatcagtgagacagacttcagg
aacttccagatggtccccttggatcccaaagggaccacccaaattgatcccaactgggtc
attcgccatcaaggcaaggagctggttcagacagccaactcggaccccggcatcgctgta
ggttttgatgagtttgggtccgtggacttcagtggcacattctacgtaaacactgaccgg
gacgacgactatgctggcttcgtctttggctaccagtcaagcagccgcttctacgtggtg
atgtggaagcaggtgacacagacctactgggaggaccagcccacgcgggcctatggctac
tccggtgtgtccctcaaggtggtgaactccaccacggggacgggcgagcacctgaggaat
gcgctgtggcacacggggaacacaccagggcaggtgcgaaccttatggcacgaccccagg
aacattggttggaaggactacacggcctacaggtggcatctgactcacaggcccaagacc
ggctacatcagagtcttagtgcatgaaggaaaacaggtcatggcagactcaggacctatc
tatgaccaaacctacgctggcgggcggctgggtctatttgtcttctctcaagaaatggtc
tatttctcagacctcaagtacgaatgcagagatatttaa

KEGG   Pongo abelii (Sumatran orangutan): 100456207
Entry
100456207         CDS       T01416                                 

Gene name
CHAD
Definition
(RefSeq) chondroadherin
  KO
K06248  chondroadherin
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100456207 (CHAD)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100456207 (CHAD)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100456207 (CHAD)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100456207 (CHAD)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:pon00535]
    100456207 (CHAD)
Proteoglycans [BR:pon00535]
 Extracellular matrix (ECM) proteoglycans
  Small leucine-rich proteoglycan (SLRP) family
   class IV
    100456207 (CHAD)
SSDB
Motif
Pfam: LRR_8 LRR_4 LRR_5 LRR_1 LRRNT LRR_9 LRRCT
Other DBs
NCBI-GeneID: 100456207
NCBI-ProteinID: XP_024090380
Ensembl: ENSPPYG00000008283
LinkDB
Position
17
AA seq 359 aa
MVRPMLLLILGLLAGLLPALAACPQNCHCHGDLQHVICDKVGLQKIPKVSEKTKLLNLQR
NNFPVLAANSFRAMPNLVSLHLQHCQIREVAAGAFRGLKQLIYLYLSHNDIRVLRAGAFD
DLTELTYLYLDHNKVTELPRGLLSPLVNLFILQLNNNKIRELRAGAFQGAKDLRWLYLSE
NALSSLQPGALDDVENLAKFHVDRNQLSSYPSAALSKLRVVEELKLSHNPLKSIPDNAFQ
SFGRYLETLWLDNTNLEKFSDGAFLGVTTLKHVHLENNRLNQLPSNFPFDSLETLTLTNN
PWKCTCQLWGLRRWLEAKASRPDATCASPAKFKGQHIRDTDAFRSCKFPTKRSKKAGRH
NT seq 1080 nt   +upstreamnt  +downstreamnt
atggtccgcccaatgctcttgctcatcctcggcctcctggctggtctgctgccggcgctg
gccgcctgcccccagaactgccactgccacggcgacctgcagcacgtcatctgcgacaag
gtggggctgcagaagatccccaaggtgtcagagaagaccaagctgctcaacctacagcgc
aacaacttcccagtgctggctgccaattcgtttcgggccatgccgaacctcgtgtcgttg
cacctgcagcactgccagatccgcgaggtggccgccggtgccttccgcggcctcaagcag
cttatctacttgtacctgtcccataacgacatccgcgtgctgcgcgcaggcgccttcgac
gacctgaccgagctgacctacctctacctggaccacaacaaggtcactgagctgccccgg
gggttgctgtccccgctggtcaacctcttcatcttgcagctcaacaacaacaagatccgt
gagctgcgcgcaggcgccttccagggagccaaggacctgcgctggctctacctgtcggaa
aacgcgctgagctccctgcaacccggggccctggacgacgtggagaacctcgccaaattc
cacgtggacaggaaccagctgtccagctacccctcggctgccctgagcaagctacgggtg
gtggaggagctgaagctgtcccacaaccccctgaaaagcatcccggacaatgccttccag
tcctttggcagatacctggagaccctctggctagacaacaccaacctggagaagttctca
gatggtgccttcctgggtgtaaccacgctgaaacacgtccatttggagaacaaccgcctg
aaccagctaccctccaacttccccttcgacagcctggagaccctcacccttaccaacaac
ccctggaagtgtacctgccagctctggggccttcggcggtggctggaagccaaggcctcc
cgcccagatgccacctgtgcctcacctgccaagttcaagggccagcacatccgtgacacg
gacgccttccgcagctgcaagttccccaccaagaggtccaagaaagctggtcgccattaa

KEGG   Pongo abelii (Sumatran orangutan): 100456580
Entry
100456580         CDS       T01416                                 

Definition
(RefSeq) LOW QUALITY PROTEIN: laminin subunit alpha-5-like
  KO
K06240  laminin, alpha 3/5
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100456580
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100456580
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100456580
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100456580
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100456580
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100456580
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100456580
   05145 Toxoplasmosis
    100456580
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B Laminin_I
Other DBs
NCBI-GeneID: 100456580
NCBI-ProteinID: XP_024097300
LinkDB
Position
20
AA seq 2289 aa
MAKRLCXGSALCVRGPRGPAPLLLVGLALLGAARAREEAGGGFSLHPPYFNLAEGARIAA
SATCGEEAPARGSPRPTEDLYCKLVGGPVAGGDPNQTIQGQYCDICTAANSNKAHPASNA
IDGTERWWQSPPLSRGLEYNEVNVTLDLGQVFHVAYVLIKFANSPRPDLWVLERSMDFGR
TYQPWQFFASSKRDCLERFGPQTLERITRDDAAICTTEYSRIVPLENGEIVVSLVNGRPG
AMNFSYSPLLREFTKATNVRLRFLRTNTLLGHLMGKALRDPTVTRRYYYSIKDISIGGRC
VCHGHADACDAKDPTDPFRLQCTCQHNTCGGTCDRCCPGFNQQPWKPATANSANECQSCN
CHGHATDCYYDPEVDRRRASQSLDGTYQGGGVCIDCQHHTTGVNCERCLPGFYRSPNHPL
DSPHVCRRCNCESDFTDGTCEDLTGRCYCRPNFSGERCDVCAEGFTGFPSCYPTPSSSND
TREQVLPAGQIVNCDCSAAGTQGNACRKDPRVGRCVCKPNFQGTHCELCAPGFYGPGCQP
CQCSSPGVADNRCDPDTGQCRCRAGFEGATCDRCAPGYFHFPLCQLCGCSPAGTLPEGCD
EAGRCLCQPEFAGPHCDQCRPGYHGFPNCQACTCDPRGALDQLCGAGGLCRCRPGYTGTA
CQECSPGFHGFPSCVPCHCSAEGSLHAACDSQSGQCSCRPRVTGLRCDTCVPGAYNFPYC
EAGSCHPAGLASVDPALPEAQVPCMCRAHVEGPSCDRCKPGFWGLSPSNPEGCTRCSCDP
RGTLGGVAQCQPGTGQCFCKPHVCGQACASCKDGFFGLDQADYFGCRSCQCDIGGALGQS
CEPRTGACRCRPNTQGPTCSKPARDHYLPDLHHLRLELEEAATPEGHAVRFGFNPLEFEN
FSWRGYAQMAPVQPRIVARLNLTSPDLFWLVFRYVNRGAMSVSGRVSVREEGRSAACANC
TAQSQPVAFPPSTEPAFITVPQRGFGEPFVLNPGTWALLVEAEGVLLDYVVLLPSAYYEA
ALLQLRVTEACTYRPSAQQSGDNCLLYTHLPLDGFPSAAGLEALCRQDNSLPRPCPTEQL
SPSHPPLITCTGSDVDVQLQVAVPQPGRYALVVEYANEDARQEVGMAVHTPQRAPQQGLL
SLHPCLYSTLCRGTARDTHDRLAVFHLDSEASVRLTAEQARFFLHGVTLVPIEEFSPEFL
EPRVRCISSHGAFGPGSAACLPSRFPKPPQPLILRDCQVIPLPPGLPLTHAQDLTPATSP
TGPQPRPPTAVDPDAEPTLLREPQATVVFTTHVPMLGRYAFLLHGYQPAHPTFPVEVLIN
AGRVWQGHANASFCPHGYGCRTLVVCEGQALLDVTHSELTVTVRVPEGRWLWLDYVLVVP
ENVYSFGYLREEPLDKSYDFISHCAAQGYHISPSSSSLFCRNAAASLSLFYNNGARPCGC
HEVGATGPTCEPFGGQCPCRAHVIGRDCSRCATGYWGFPNCRPCDCGARLCDELTGQCIC
PPRTVPPDCLLCQPQTFGCHPLVGCEECNCSGPGVQELTDPTCDTDSGQCKCRPNVTGRR
CDTCSPGFHGYPRCRPCDCHEAGTAPGVCDPLTGQCYCKENVQGPRCDQCSLGTFSLDAA
NPKGCTRCFCFGATERCRSSSYTRQEFVDMEGWVLLSTDRQVVPHERRPGTEMLRADLRH
VPEAVPEAFPELYWQAPPSYLGDRVSSYGGTLRYELHSETQRGDVFVPMESRPDVALQGN
QMSITFLEPAYPTPGHVHRGQLQLVEGNFRHTETRNTVSREELMMVLASLEQLQIRALFS
QISSAVSLRRVALEVASPAGQGALASNVELCLCPASYRGDSCQECAPGFYRDVKGLFLGR
CVPCQCHGHSDRCLPGSGICVDCQHNTEGAHCERCQAGFVSSGEDPSAPCVSCPCPLSVP
SNNFAEGCVLRGGRTQCLCKPGYAGASCERCAPGFFGNPLVLGSSCQPCDCSGNGDPNLL
FSDCDPLTGACRGCLRHTTGPRCEICAPGFYGNALLPGNCTRCNCAPCGTEACDPHSGHC
LCKAGVTGQRCDHCQEGHFGFDGCGGCRPCACGPAAEGSECHPQSGQCHCRPGTMGPQCR
ECAPGYWGLPEQGCRRCQCPGGRCDPHTGHCTCPPGLSGERCDTCSQQHQVPVPGGTVGH
SIHCEVCDHCVVLLLDDLERAGALLPAIREQLRGINASSVAWARLHRLNASIADLQSQLR
SPLGPRHETAQQLEVLEQQSTSLGQDARRLGGQAGAPRPPRALGGFHLCSPRFGSLVLIQ
QMLLHASYM
NT seq 6870 nt   +upstreamnt  +downstreamnt
atggcgaagcggctctgcngcgggagcgcactgtgtgttcgcggcccccggggccccgca
ccgctgctgctggtcggactggcgctgctgggcgcggcgcgggcgcgggaggaggcgggc
ggcggcttcagcctgcacccgccctacttcaacctggccgagggcgcccgcatcgccgcc
tccgcgacctgcggagaggaggccccggcgcgcggctccccgcgccccaccgaagacctt
tactgcaagctggtggggggccccgtggccggcggcgaccccaaccagaccatccagggc
cagtactgcgacatctgcacggctgcgaacagcaacaaggcacaccccgcgagcaatgcc
atcgacggcacggagcgctggtggcagagcccaccgctgtcccgcggcctggagtacaac
gaggtcaatgtcaccctggacctgggccaggtcttccatgtggcctacgtcctcatcaag
tttgccaactcaccccggccggacctctgggtgctggagcggtccatggacttcggccgc
acctaccagccctggcagttctttgcctcctcaaagagggactgtctggagcggttcggg
ccacagacgctggagcgcatcacacgggacgacgcggccatctgcaccaccgagtactca
cgcatcgtgcccctggagaacggagagatcgtggtgtccctggtgaacggacgtccgggc
gccatgaatttctcctactcgccgctgctacgtgagttcaccaaggccaccaacgtccgc
ctgcgcttcctgcgcaccaacacgctgctggggcacctcatggggaaggcgctgcgggac
cccacggtcacccgccggtattattacagcatcaaggatatcagcattggaggccgctgt
gtctgccatggccacgcagatgcctgtgatgccaaagaccccacggacccgttcaggctg
cagtgcacctgccagcacaacacctgtgggggcacctgcgaccgctgctgccccggcttc
aaccagcagccgtggaagcctgcgactgccaacagtgccaacgagtgccagtcctgtaac
tgccacggccatgccaccgactgttactacgaccctgaggtggaccggcgccgcgccagc
cagagcctggatggcacctatcagggtgggggtgtctgtatcgactgccagcaccacacc
actggcgtcaactgtgagcgctgcctgcccggcttctaccgctctcccaaccacccactc
gactcgccccacgtctgccgccgctgcaactgcgagtccgacttcacagatggcacctgc
gaggacctgacgggtcgctgctactgccggcccaacttctctggggagcggtgtgacgtg
tgtgccgagggcttcacaggcttcccaagctgctacccgacgccctcgtcgtccaatgac
accagggagcaggtgctgccagctggccagattgtgaattgtgactgcagcgcggcaggg
acccagggcaacgcctgccggaaggacccgagggtgggacgctgtgtgtgcaaacccaac
ttccaaggcacccattgtgagctctgcgcgccagggttctacggccccggctgccagccc
tgccagtgttccagccctggagtggccgacaaccgctgtgaccctgacacaggccagtgc
aggtgccgagcgggcttcgagggggccacgtgtgatcgctgtgcccccggctactttcac
ttccctctctgccagttgtgtggctgcagccctgcaggaaccctgcccgagggctgcgat
gaggccggccgctgcctgtgccagcctgagtttgccggacctcattgtgaccagtgccgc
cctggctaccacggtttccctaactgccaagcatgcacctgtgaccctcggggagccctg
gaccagctctgtggggcgggaggtttgtgccgctgccgccccggctacacaggcaccgcc
tgccaggaatgcagccccggcttccacggctttcccagctgtgtcccctgccactgctct
gccgaaggctccctgcacgcagcctgtgactcccagagtgggcagtgcagctgtcggccc
cgcgtgacggggctgcggtgtgacacgtgtgtgcccggtgcctacaacttcccctactgc
gaagctggctcttgccaccctgctggtctggcctcagtggatcctgcccttcctgaggca
caggttccctgtatgtgccgggctcacgtggaggggccgagctgcgaccgctgcaaacct
gggttctggggactgagccccagcaaccccgagggctgtacccgctgcagctgcgacccc
aggggcacgctgggtggagttgctcagtgccagccgggcaccggccagtgcttctgcaag
ccccacgtgtgtggccaggcctgcgcatcctgcaaggatggtttctttggactggatcag
gctgactatttcggctgccgcagctgccagtgtgacattggcggtgcgctgggccagagc
tgtgaaccgaggacgggcgcctgccggtgccgccccaacacccaaggccccacctgcagc
aagcctgcgagggaccactacctcccagacctgcaccacctgcgcctggaactggaggag
gccgccacacctgagggtcacgccgtgcgctttggcttcaaccccctcgagttcgagaac
ttcagctggaggggctatgcgcagatggcacctgtccagcccaggatcgtggccaggctg
aacctgacctcccccgaccttttctggcttgtcttccgatacgtcaaccggggggccatg
agtgtgagcgggcgggtctctgtgcgagaggagggcaggtcggctgcctgcgccaactgc
acagcgcagagccagcctgtggccttcccacccagcacggaacctgccttcatcactgtg
ccccagaggggcttcggagagccctttgtactgaatcctggcacctgggccctgcttgtg
gaggctgaaggggtgctcctggactacgtggttctgctgcctagcgcatactacgaggcg
gcgctcctgcagctgcgggtgaccgaggcctgcacataccgcccctctgcccagcagtct
ggcgacaactgcctcctctacacacacctccccctggatggcttcccctcggccgccggg
ctggaggccctgtgtcgccaggacaacagcctgccccggccctgccccacggaacagctc
agcccatcgcacccgccactgatcacctgcacgggcagtgatgtggacgtccagcttcaa
gtggcagtgccgcagccgggccgctacgccctagtggtggagtacgccaatgaggatgcc
cgccaggaggtgggcatggccgtgcacaccccacagcgggccccccagcaggggctgctc
tccctgcacccctgcctgtacagcaccctgtgccggggcaccgcccgggatacccatgac
cgcctggctgtcttccacctggactcagaggccagcgtgaggctcacagctgagcaggca
cgcttcttcctgcatggggtcactctggtgcccattgaggagttcagcccggagttcttg
gagccccgggtccgctgcatcagcagccacggcgcctttggccccggcagtgctgcctgt
ctgccctcgcgcttcccaaagccgccccagcccctcatcctcagggactgccaggtgatc
ccactgccgcccggcctcccgctgacccacgcacaggatctcactccagccacgtcccca
accggaccccaacctcggccccccaccgccgtggaccctgacgcggagcccaccctgctg
cgtgagccccaggccaccgtggtcttcaccacccatgtgcccatgttgggccgctacgcc
ttcctgctgcacggctaccagccagcccaccccaccttccccgtggaagtcctcatcaac
gccggccgcgtgtggcagggccacgccaacgccagcttctgtccacatggctatggctgc
cgcaccctggtggtgtgtgagggccaggccctgctggacgtgacccacagcgagctcact
gtgaccgtgcgtgtgcccgagggccggtggctctggctggattatgtactcgtggtccct
gagaacgtctacagctttggctacctccgggaggagcccctggacaaatcctatgacttc
atcagccactgcgcagcccagggctaccacatcagccccagcagctcatccctgttctgc
cgaaacgctgctgcttccctctccctcttctataacaacggagcccgtccatgtggctgc
cacgaagtaggtgctacaggccccacgtgtgagcccttcgggggccagtgtccctgccgt
gcccacgtcattggtcgtgactgctcccgctgtgccaccggatactggggcttccccaat
tgcaggccctgtgactgcggcgcccgcctctgtgacgagctcacgggccagtgcatctgc
ccaccacgcaccgtcccgcccgactgcctgctgtgccagccccagacctttggctgccac
cccctggtcggctgtgaggagtgtaactgctcagggcccggcgtccaggagctcacagac
cctacctgtgacacagacagcggccagtgcaagtgcagacccaacgtgactgggcgccgc
tgtgatacctgctctccgggcttccatggctacccccgctgccgcccctgtgactgtcac
gaggcgggcactgcgcctggcgtgtgtgatcccctcacagggcagtgctactgtaaggag
aacgtgcagggccccagatgtgaccagtgcagccttgggaccttctccctggatgctgcc
aaccccaaaggttgcacccgctgcttctgctttggggccacggagcgctgccggagctcg
tcctacacccgccaggagttcgtggacatggagggctgggtgctgctgagcactgaccgg
caggtggtgccccacgagcggcggccagggacggagatgctccgtgcagacctgcggcat
gtgcctgaggctgtgcccgaggctttccccgagctgtactggcaggccccgccctcctac
ctgggggaccgggtgtcatcctacggtgggaccctccgttatgaactgcactcagagacc
cagcggggagatgtcttcgtccccatggagagcaggccggatgtggcgctgcagggcaac
cagatgagcatcacattcctggagccggcgtaccccacgcctggccacgttcaccgtggg
cagctgcaactggtggaggggaacttccggcacacggagactcgcaacactgtgtcccgc
gaggagctcatgatggtgctggccagcctggagcagctgcagatccgtgccctcttctca
cagatctcctcggctgtctccctgcgcagggtggcactggaggtggccagcccagcaggc
cagggggccctggccagcaatgtggagctgtgcctgtgccccgccagctaccggggggac
tcatgccaggaatgtgcccccggcttctatcgggacgtcaaaggtctcttcctgggccga
tgtgtcccttgtcagtgccatggacactcagaccgctgcctccctggctctggcatctgt
gtggactgccagcacaacaccgaaggggcccactgtgagcgctgccaggctggcttcgtg
agcagcggggaggaccccagcgccccctgtgtcagctgcccctgtcccctctccgtgcct
tccaacaacttcgccgagggctgtgtcctgcgaggcggccgcacccagtgcctctgcaaa
cctggttatgcaggtgcctcctgtgagcggtgtgcgcccggattcttcgggaacccactg
gtgctgggcagctcctgccagccttgcgactgcagcggcaacggtgaccctaacttgctc
ttcagcgactgcgaccccctgacgggtgcctgccgtggctgcctgcgccacaccaccggg
cctcgctgcgaaatctgtgcccccggcttctacggcaatgccctgctgcccggcaactgc
acccgatgcaactgtgccccatgtgggacagaggcctgcgacccccacagcgggcactgc
ctgtgcaaggcaggtgtgactgggcagcgctgtgaccattgtcaggagggacattttggt
ttcgacggctgcgggggctgccgcccgtgtgcttgtggaccggccgccgagggctccgag
tgccacccccagagcggacagtgccactgccgaccagggaccatgggaccccagtgccgc
gagtgtgcccctggctactggggactccctgagcagggctgcaggcgctgccagtgccct
gggggccgctgtgaccctcacacgggccactgcacctgccccccggggctcagcggggag
cgctgcgacacctgcagccagcagcatcaggtgcccgttcccggcgggactgtgggccac
agcatccactgtgaagtgtgtgaccactgtgtggtcctgcttctggatgacctggaacgg
gccggcgccctcctccccgccattcgtgagcaactgcgtggcatcaatgccagctccgtg
gcctgggcccgtctgcacaggctgaatgcctccatcgctgacctgcagagccagcttcgg
agccccctgggcccccgccatgagacagcacagcagctggaggtgctggagcagcagagc
acaagcctcgggcaggatgcacggcggctgggcggccaggcaggagccccaagacccccc
agggccctgggaggctttcacctgtgttccccgcggttcggttcgcttgtcctcattcag
cagatgttacttcatgcctcatatatgtga

KEGG   Pongo abelii (Sumatran orangutan): 100457450
Entry
100457450         CDS       T01416                                 

Gene name
COL4A6
Definition
(RefSeq) collagen alpha-6(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100457450 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100457450 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100457450 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100457450 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    100457450 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100457450 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100457450 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100457450 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100457450 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100457450 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100457450 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100457450 (COL4A6)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100457450 (COL4A6)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100457450 (COL4A6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100457450
NCBI-ProteinID: XP_009233428
Ensembl: ENSPPYG00000020620
LinkDB
Position
X
AA seq 1690 aa
MHPGLWLLLVTLCLTEELAGAGEKSYGKPCGGQDCRRSCQCFPEKGARGRPGPIGIQGPT
GPQGFTGSTGLSGLKGERGSPGLLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGPP
GLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPGLPGLDG
ITGPQGALGSPGAVGPAGPPGLQGPPGPPGPLGPDGNMGLGFQGEKGVKGDVGLPGPAGP
PPSTGELEFMGFPKGQKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGEKGIPGLPGP
RGPMGSEGVHGPPGQQGKKGTLGFPGLNGFQGMEGQKGDIGLPGPDVFIDIDGAVISGNP
GDPGVPGLPGLKGDEGIQGLRGPSGAPGLPALSGVPGALGPQGFPGLKGDQGNPGRTTIG
AAGLPGRDGLPGPPGPPGPPSPEFETETLHNKEPGFPGLRGEQGPKGNPGLKGIKGDSGF
CACDGGVPNTGPPGEPGPPGPRGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSGP
KGKKGEPILSTIQGMPGDRGDSGSQGFHGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGEK
GLPGLPGEKGHPGPPGLPGIGLPGLPGPPGLPGDKGNDGLPGQQGLPGSKGITLPCIIPG
SYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGTKGEPGSPGLVHLPELPGFPGPRGEKG
LPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAESGAPGEQGLQGLTGDKGFLGDSGL
PGLKGVHGKPGLLGPKGEQGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISGH
PGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGSKGEKGS
VGFVGFPGIPGLPGIPGTRGLKGIPGSAGKMGPSGHAGTPGEKGDRGNPGPVGIPSPRRP
MSNLWLKGDKGSQGSAGSGGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPPG
FMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGSP
GPKGQPGESGLKGTKGRDGLIGNIGFPGNKGEDGKIGVSGDVGLPGAPGFPGVAGMRGEP
GLQGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPAG
LPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGDP
GRPGLDGERGRPGPPGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIPGFSGLPGEL
GLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGPQGAPGQTPTAEAVQVPPGP
LGLPGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQGP
PGFEGAPGQQGPFGMPGMPGQSVRVGYTLVKHSQSEQVPLCPIGMSQLWVGYSLLFVEGQ
EKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQI
PQYISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVSP
GSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETLKAGQLHTRV
SRCQVCMKSL
NT seq 5073 nt   +upstreamnt  +downstreamnt
atgcaccctgggttgtggctgctcctggttacgttgtgcctgaccgaggaactggcagga
gcgggagagaagtcttatggaaagccatgtgggggccaagactgcagacggagctgtcag
tgttttcctgagaaaggagcgagagggcgacctggaccaattggaattcaaggcccaaca
ggtcctcaaggattcactggctctactggtttatcaggattgaaaggagaaaggggttcc
ccaggccttctcggaccttatggaccaaaaggagataagggtcccatgggagttcctggc
tttcttggcatcaatgggattccgggccatcctggacagccaggccccagaggcccacct
ggtctggatggctgtaatggaactcaaggagctgttggatttccaggccctgatggctat
cctgggcttctcggaccacccgggcttcctggtcagaaaggatcaaaaggtgaccctgtc
cttgctccaggtagtttcaaaggaatgaagggggatcctgggctgcctggactggatgga
atcactggcccacaaggagcactcggatctcctggagctgtaggacctgcaggaccacca
ggattacaaggtcctccagggcctcctggtcctcttggtcctgatgggaatatggggcta
ggttttcaaggagagaaaggagtcaagggggatgttggcctccctggcccagcaggacct
ccaccatctactggagagctggaattcatgggattccccaaagggcagaaaggatccaag
ggtgaaccagggcctaagggttttccaggcataagtggccctccaggcttcccgggcctt
ggaactactggagaaaagggagaaaagggagaaaagggaatccctggtttgccaggacct
aggggtcccatgggttcagaaggagtccatggccctccagggcaacagggcaagaaaggg
accctgggatttcctgggcttaatggattccaaggaatggagggtcaaaagggtgacatt
ggcctgccaggcccagatgttttcatcgatatagatggtgctgtgatctcaggtaatcct
ggagatcctggtgtacctggcctcccaggccttaaaggagatgaaggcatccaaggccta
cgtggcccttctggtgcccctggcttgccagcattatcaggtgtcccaggagccctaggg
cctcagggatttccagggctgaagggggaccaaggaaacccaggccgtaccacaattgga
gcagctggcctccctggcagagatggtttgccaggcccaccaggtccaccaggcccacct
agtccagaatttgagaccgaaactctacacaacaaagagccagggttccctggtctccga
ggagaacaaggtccaaaaggaaacccaggcctcaaaggaataaaaggagactcaggtttc
tgtgcttgtgacggtggtgttcccaacactggaccacccggggaaccaggcccacctggt
ccacggggtctcataggccttccaggccttaaaggagccagaggagatcgaggctctggg
ggtgcacagggcccagcaggggctccaggcttagttgggcctctgggtccttcaggaccc
aaaggaaagaagggggaaccaattctcagtacaattcaaggaatgccaggggatcggggt
gattctggctcccagggcttccatggtgtaataggagaaccaggcaaggacggagtacca
ggtttaccaggtctgccaggccttccgggtgatggtggacagggcttcccaggtgaaaag
gggttacctggacttcctggtgaaaaaggccatcctggtccacctggcctcccaggaatt
gggttaccaggacttcctggaccccctgggcttcctggagataaaggcaatgatggatta
ccaggacaacaaggccttcccggatctaagggaatcactctgccttgtattattcccggg
tcatatggtccatcaggatttccaggcactcccggattcccaggccctaaagggtcccga
ggcctccctgggaccccaggccagcctgggtcaagcggaactaaaggagagccagggagt
ccaggattggttcatcttcctgaattaccaggatttcctggacctcgtggggagaagggc
ttgcctgggtttcctgggctccctggaaaagatggcttgcctgggatgattggcagtcca
ggcttacctggttccaagggagccactggtgacatctttggtgctgaaagtggtgctccg
ggggaacaaggcctacaaggattaacaggggacaaaggatttcttggagactctggcctt
ccgggactcaagggtgtgcacgggaagcctggcttactaggccccaaaggtgagcagggc
agccctgggacaccaggacaggtgggacagccaggcaccccaggatctagtggtccatat
ggcatcaagggcaaatctgggctcccaggagcaccgggcttcccaggcatctcaggacat
cctggaaagaaaggaacaagaggcaagaaaggtcctcctggatcaattgtaaagaaaggg
ctgccagggttaaaaggccttcctggaaatccaggcctagtaggactgaaaggaagccca
ggctctccaggggtcgctgggttgccagccctctctggatccaagggagagaaggggtct
gttggattcgtaggttttccaggaataccaggtctgcctggtattcctgggacaagagga
ttaaagggaattccgggatcagctggaaaaatgggaccatctggacacgctggtactcct
ggtgaaaagggagacagaggcaatccggggccagtcggaatacctagtccaagacgtcca
atgtcaaacctttggctcaaaggagacaaaggctctcaaggctcagccggatccggtgga
tttcctgggccaagaggtgacaaaggagaggctggtcgacctgggccaccaggcctgcct
ggagctcctggcctcccaggcattatcaaaggagttagtggaaagccagggccccctggc
ttcatgggaatccggggcttacctggcctgaaggggtcctctgggatcacaggtttccca
ggaatgccaggagaaagtggttcacaaggtatcagagggtcgcctggactcccaggagca
tctggtctcccaggcctgaaaggagacaacggccagacagttgaaatttccggtagccca
ggacccaagggacagcctggtgaatctggtcttaaaggtacaaaaggaagagatggacta
ataggcaatataggcttccctggaaacaaaggtgaagatggaaaaattggtgtttccgga
gatgttggccttcctggagctccaggatttccaggagttgccggcatgagaggagaacca
ggacttcaaggttcttctggtcatcaaggggcaattgggcccctaggatcccccggatta
ataggacccaaaggcttccctggatttcctggtttacatggactgaatgggcttccgggc
accaagggtacccatggcactccaggacctagtatcaccggtgtgcccgggcctgctggt
ctccctggacccaaaggagaaaaaggatatccaggaattggcatcggagctccagggaag
ccgggcctgagagggcaaaaaggtgatcgaggtttcccaggtctccagggccctgctggt
ctccccggtgccccaggcatctccttgccttcactcatagcaggacagcctggtgacccc
gggcgaccaggcctagatggagaacgaggccgcccaggcccccctggacccccaggtccc
cctgggccatcctcgaatcaaggcgacaccggagaccctggcttccctggaattcctgga
cctaaagggcctaagggagaccaaggaattccaggtttttctggcctccctggagagcta
ggactgaaaggcatgagaggtgagcctggcttcatggggactccaggcaaggttgggcca
cctggagacccaggatttcccggaatgaaggggaaggcagggccaagaggctcttctggc
ccccaaggtgctcctggacaaacaccaactgcagaagctgtccaggttcctcctggaccc
ttgggtctaccagggatcgatggcatccctggcctcactggggaccctggggctcaaggc
cctgtaggcctacaaggctccaaaggtttacctggcatccccggtaaagatggccccagt
gggctcccgggcccacctggggctcttggtgatcctggtctgcctggactgcaaggccct
ccaggatttgaaggagctccaggacagcaaggccccttcgggatgcctggaatgcctggc
cagagcgtgagagtgggctacacgttggtaaagcacagccagtcggaacaggtgcccctg
tgtcccatcgggatgagccagctgtgggtggggtacagcttactgtttgtggagggacaa
gagaaagcccacaaccaggacctgggctttgctggctcctgtctgccccgcttcagcacc
atgccgttcatctactgcaacatcaacgaggtgtgccactatgccaggcgcaatgataaa
tcttactggctctccactaccgcccctatccccatgatgcccgtcagccagacccagatt
ccccagtacatcagccgctgctctgtgtgtgaggcaccctcgcaagccattgctgtgcac
agccaggacatcaccatcccacagtgccccctgggctggcgcagcctctggattgggtac
tccttcctcatgcacactgccgctggtgctgagggtggaggccagtccctggtctcacct
ggctcctgcctagaggactttcgggccactcctttcatcgagtgcagtggcgcccgaggc
acctgccactactttgcaaacaagtacagtttctggttgaccacagtggaggagaggcag
cagtttggggagttgcctgtgtctgaaacgctgaaagctgggcagctccacactcgagtc
agtcgctgccaggtgtgtatgaaaagcctgtag

KEGG   Pongo abelii (Sumatran orangutan): 100457807
Entry
100457807         CDS       T01416                                 

Gene name
COL4A5
Definition
(RefSeq) collagen alpha-5(IV) chain isoform X3
  KO
K06237  collagen type IV alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04926  Relaxin signaling pathway
pon04933  AGE-RAGE signaling pathway in diabetic complications
pon04974  Protein digestion and absorption
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100457807 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100457807 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100457807 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100457807 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    100457807 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100457807 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100457807 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100457807 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100457807 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100457807 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100457807 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100457807 (COL4A5)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100457807 (COL4A5)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100457807 (COL4A5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100457807
NCBI-ProteinID: XP_002832026
Ensembl: ENSPPYG00000020622
LinkDB
Position
X
AA seq 1685 aa
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTGITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPPGPHIPPSDEICEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDILTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQTIT
QPGKPGLPGNPGRDGEVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGTLGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGERGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGPLGIPGRSGVPGLKGDDGLQGQPGLPGPAGEKGSKGEPGLPGPPGPMDPNLL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPGLPGQPGLT
GPPGLKGAIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISSIGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTPGPPGPKGISGPPG
NPGLPGEPGPVGGGGRPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGLPGPEGSPGLPGNGAIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMK
GDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDSGPPGI
PGQPGLKGLPGLQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGP
PGPDGLQGPPGPPGTSSIAHGFLITRHSQSTDAPQCPQGTLQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPF
ISRCAVCEAPAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMKRT
NT seq 5058 nt   +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcctgcagaggctgcggcttgctatgggtgttctccaggatcaaagtgtgactgc
agtggcataaaaggggaaaagggagagagagggtttccaggtttggaaggacatccagga
ttgcctggatttccaggtccagaagggcctccagggcctcggggacaaaagggtgatgat
ggaattccagggccaccaggaccaaaaggaatcagaggtcctcctgggcttcctggattt
ccagggacaccaggtcttcctggaatgccaggccatgatggggccccaggacctcaaggt
attcccggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggaccccctggtatcccaggtatgaagggtgaaccaggtagt
ataattatgtcatcactgccaggaccaaagggtaatccaggatatccaggtcctcctgga
atacaaggcctacctggtcccactggtataccagggccaattggtcccccaggaccacca
ggtttgatgggccctcctggtccaccaggacttccaggacctaaggggaatatgggctta
aatttccagggacccaaaggtgaaaaaggtgagcaaggtcttcagggcccacctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtccc
ccaggtggtgagaaaggtgagaagggtgagcaaggagagccaggcaaaagaggtaaacca
ggcaaagatggagaaaatggccaaccaggaattcctggtttgcctggtgatcctggttac
cctggtgaacctggaagggatggtgaaaagggccaaaaaggtgacactggcccacctgga
cctcctggacttgtaattcctagacctgggactggtataactataggagaaaagggaaac
attgggttgcctggcttgcctggagaaaaaggagagcgaggatttcctggaatacagggt
ccacctggccttcctggacctccaggggctgcagttatgggtcctcctggccctcctgga
tttcctggagaaaggggtcagaaaggtgatgaaggaccacctggaatttccattcctgga
cctcctggacttgatggacagcctggggctcctgggcttccagggcctcctggccctcct
ggccctcacatccctcctagtgatgagatatgtgaaccaggccctccaggccccccagga
tctccaggtgataaaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacact
tgcttcaactgcattggaactggtatttcagggcctccaggtcaacctggtttgccaggt
ctcccaggtcctccaggatctcttggtttccctggacagaaaggtgaaaaaggacaagct
ggtgcaactggtcccaaaggattacccggcattccaggagctccaggtgctccaggcttt
cctggatctaaaggtgaacctggtgatatcctcacttttccaggaatgaagggtgacaaa
ggagagttgggttcccctggagctccagggcttcctggtttacctggtactcctggacag
gatggattgccagggcttcctggcccgaaaggagagcctggtggaattacttttaagggt
gaaagaggtccccctgggaacccaggtttaccaggcctcccagggaatatagggcctatg
ggtccccctggtttcggccctccaggcccagtaggtgaaaaaggcatacaaggtgtggca
ggaaatccaggccagccaggaataccaggtcctaaaggggatccaggtcagactataacc
cagccagggaagcctggcttgcctggtaacccaggcagagatggtgaagtaggtcttcca
ggtgaccctggactcccagggcaaccaggcttgccagggatacctggtagcaaaggagaa
ccaggtatccctggaattgggcttcctggaccacctggtcccaaaggctttcctggaatt
ccaggacctccaggagcacctgggacacctggaagaattggtctagaaggccctcctggg
ccacccggctttccaggaccaaagggtgaaccaggatttgcattacctgggccacctggg
ccaccaggacttccaggtttcaaaggaacacttggtccaaaaggtgatcgtggtttccca
ggacctccaggtcctccaggacgcactggcttagatgggctccctggaccaaaaggtgat
gttggaccaaatggacaacctggaccaatgggacctcctgggctgccaggaataggtgtt
cagggaccaccaggaccaccagggattcctgggccaataggtcaacctggtttacatgga
ataccaggagagaggggggatccaggacctcctggacttgatgttccaggtcccccaggt
gaaagaggcagtccagggatccccggagcacctggtcctataggacctccaggatcacca
gggcttccaggaaaagcaggtgcctctggatttccaggtaccaaaggtgaaatgggtatg
atgggacctccaggcccaccaggacctttgggaattcctggcaggagtggtgtacctggt
cttaaaggtgatgatggcttgcagggtcagccaggacttcctggccctgcaggagaaaaa
ggtagtaaaggagagcctggccttccaggccctcctggaccaatggatccaaatcttctg
ggctcaaaaggagagaagggggaacctggcttaccaggtatacctggagtttcagggccg
aaaggttatcagggtttgcctggagacccagggcaacctggactgagtggacaacctgga
ttaccaggaccaccaggtcccaaaggtaaccctggtctccctggacagccaggtcttaca
ggacctcctggacttaaaggagccatcggtgatatgggttttccagggcctcagggtgtg
gaagggcctcctggaccttctggagttcctggacaacctggctccccaggattacctgga
cagaaaggcgacaaaggtgatcctggtatttcaagcattggtcttccaggtcttcctggt
ccaaagggtgagcctggtctgcctggatacccagggaaccctggtatcaaaggttctgtg
ggagatcctggtttgcccggattaccaggaacccctggagcaaaaggacaaccaggcctt
cctggattcccaggaaccccaggccctcctggaccaaaaggtattagtggccctcctggg
aaccctggccttccaggagaacctggtcctgtaggtggtgggggtcgtcctgggcaacca
gggcctccaggcgaaaaaggcaaacctggtcaagatggtattcctggaccagctggacag
aagggtgaaccaggtcaaccaggctttggaaacccaggaccccctggacttccaggactt
tctggccaaaagggtgatggaggattacctgggattccaggaaatcctggccttccaggt
ccaaagggcgaaccaggctttcacggtttccctggtgtgcagggtcccccgggccctcct
ggttctccgggtccagctctggaaggacctaaaggcaaccctgggccccaaggtcctcct
gggagaccaggtctaccaggtccagaaggttctccgggtctccctggaaatggagctatt
aaaggagagaagggaaatccaggccaacctgggctacctggcttgcctggtttgaaagga
gatcaaggaccaccaggactccagggtaatcctggccggccaggtctcaatggaatgaaa
ggagatcctggtctccctggtgttccaggattcccaggcatgaaaggacccagtggagta
cctggatcagctggccctgagggggaaccgggacttattggtcctccaggtcctcctgga
ttacctggtccttcaggacagagtatcataatcaaaggagattctggtcctccaggaatc
cctggccagcctgggttaaaaggtctaccaggactccaaggacctcaaggtttaccaggt
ccaactggccctccaggagatcctggacgcaatggactccctggctttgatggtgcagga
gggcgcaaaggagacccaggtctgccaggacagccaggtacccgtggtttggatggtccc
cccgggccagatggattgcaaggtcccccaggtccccctggaacctcctctattgcacat
ggatttcttattacacgccacagccagtcaacggatgcaccacaatgcccgcagggaaca
cttcaggtctatgaaggcttttctctcctgtatgtacaaggaaataaaagagcccacggt
caagacttggggacggctggcagctgccttcgtcgctttagtaccatgcctttcatgttc
tgcaacatcaataatgtttgcaactttgcttcaagaaatgactattcttactggctctct
accccagagcccatgccaatgagcatgcaacccctaaagggccagagcatccagccattc
attagtcgatgtgcagtatgtgaagctccagctgtggtgatcgcagttcacagtcagacg
atccagattccccattgtcctcagggatgggattctctgtggattggttattccttcatg
atgcatacaagtgcaggggcagaaggctcaggtcaagccctagcctcccctggttcctgc
ttggaagagtttcgttcagctcccttcatcgaatgtcatgggaggggtacctgtaactac
tatgccaactcctacagcttttggctggcaactgtagatgtgtcagacatgttcagtaaa
cctcagtcagaaacgctgaaagcaggagacttgaggacacgaattagccgatgtcaagtg
tgcatgaagaggacataa

KEGG   Pongo abelii (Sumatran orangutan): 100458670
Entry
100458670         CDS       T01416                                 

Gene name
LAMB1
Definition
(RefSeq) laminin subunit beta-1
  KO
K05636  laminin, beta 1
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100458670 (LAMB1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100458670 (LAMB1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100458670 (LAMB1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100458670 (LAMB1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100458670 (LAMB1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100458670 (LAMB1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100458670 (LAMB1)
   05145 Toxoplasmosis
    100458670 (LAMB1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N CorA F5_F8_type_C
Other DBs
NCBI-GeneID: 100458670
NCBI-ProteinID: XP_009241417
Ensembl: ENSPPYG00000017907
UniProt: H2PN73
LinkDB
Position
7
AA seq 1786 aa
MGLLQVLAFSFLALCRARVRAQEPEFSYGCAEGSCYPATGDLLIGRAQKLSVTSTCGLHK
PEPYCIVSHLQEDKKCFICNSQDPYHETLNPDSHLIENVVTTFAPNRLKIWWQSENGVEN
VTIQLDLEAEFHFTHLIMTFKTFRPAAMLIERSSDFGKTWGVYRYFAYDCEASFPGTSTG
PMKKVDDIICDSRYSDIEPSTEGEVIFRALDPAFKIEDPYSPRIQNLLKITNLRIKFVKL
HTLGDNLLDSRMEIREKYYYAVYDMVVRGNCFCYGHASECAPVDGLNEEVEGMVHGHCMC
RHNTKGLNCELCMDFYHDLPWRPAEGRNSNACKKCNCNEHSSSCHFDMAVYLATGNISGG
VCDDCQHNTMGRNCEQCKPFYYQHPERDIRDPNFCERCTCDPAGSQNEGTCDSYTDFSTG
LIAGQCRCKLNVEGEHCDVCKEGFYDLSSEDPFGCKSCACNPLGTIPGGNPCDSETGHCY
CKRLVTGQHCDQCLPEHWGLSNDLDGCRPCDCDLGGALNNSCFAESGQCSCRPHMIGRQC
NEVEPGYYFATLDHYLYEAEEANLGPGVSIVERQYIQDRIPSWTGAGFVRVPEGAYLEFF
IDNIPYSMEYDILIRYEPQLPDHWEKAVITVQRPGRIPTSSRCGNTIPDDDNQVVSLSPG
SRYVVLPRPVCFEKGTNYTVRLELPQYTSSDSDVESPYTLIDSLVLMPYCKSLDIFTVGG
SGDVVVTNSAWETFQRYRCLENSRSVVKTPMTDVCRNIIFSISAVLHQTGLACECDPQGS
LSSVCDPNGGQCQCRPNVVGRTCNRCAPGTFGFGPSGCKPCECHLQGSVNAFCNPVTGQC
HCFQGVYARQCDRCLPGHWGFPSCQPCQCNGHADDCDPVTGECLNCQDYTMGHNCERCLA
GYYGDPIIGSGDHCRPCPCPDGPDSGRQFARSCYQDPVTLQLACVCDPGYIGSRCDDCAS
GYFGNPSEVGGSCQPCQCHNNIDATDPEACDKETGRCLKCLYHTEGEHCQFCRFGYYGDA
LRQDCRKCVCNYLGTVQEHCNGSDCQCDKATGQCLCLPNVIGQNCDRCAPNTWQLASGTG
CDPCNCNAAHSFGPSCNEFTGQCQCMPGFGGRTCSECQELFWGDPDVECRACDCDPRGIE
TPQCDQSTGQCVCVEGVEGPRCDKCTRGYSGVFPDCTPCHQCFALWDVIIAELTNRTHRF
LEKAKALKISGVIGPYRETVDSVERKVSEIKDILAQSPAAEPLKNIGNLFEEAEKLIKDV
TEMMAQVEVKLSDTTSQSNGTAKELDSLQTEAESLDNTVKELAEQLEFIKNSDIRGALDS
ITKYFQMSLEAEERVNASTTEPNSTVEQSALMRDRVEDVMMERESQFKEKQEEQARLLDE
LAGKLQSLDLSAAAEMTCGTPPGASCSETECGGPNCRTDEGEKKCGGPGCGGLVTVAHNA
WQKAMDLDQDVLSALAEVEQLSKMVSEAKLRADEAKQSAEDILLKTNATKEKMDKSNEEL
RNLIKQIRNFLTQDSADLDSIEAVANEVLKMEMPSTPQQLQNLTEDIRERVESLSQVEII
LQQSAADIARAEMLLEEAKKASKSATDVKVTADMVKEALEEAEKAQVAAEKAIKQADEDI
QGTQNLLTSIESETAASEETLFNASQRISELERNVEELKRKAAQNSGEAEYIEKVVYTVK
QSADDVKKTLHGELDEKYKKVENLIAKKTEESADARRKAEMLQNEAKTLLAQANSKLQLL
KDLERKYEGNQKYLEDKAQELARLEGEVRSLLKDISQKVAVYSTCL
NT seq 5361 nt   +upstreamnt  +downstreamnt
atgggacttctccaggtgttagctttcagtttcttagccctgtgccgagcccgagtgcgc
gctcaggaacccgagttcagctacggctgcgctgaaggcagctgctatcccgccacgggc
gaccttctcatcggccgagcacagaagctttcggtgacctcgacgtgcgggctgcacaag
cccgaaccctactgtatcgtcagccacttgcaggaggacaaaaaatgcttcatatgcaat
tcccaagatccttatcatgagaccctgaatcctgacagccatctcattgaaaatgtggtc
actacatttgctccaaaccgccttaagatttggtggcaatctgaaaatggtgtggaaaat
gtaactatccaactggatttggaagcagaattccattttactcatctcataatgactttc
aagacattccgtccagctgctatgctgatagaacgatcgtccgactttgggaaaacctgg
ggtgtgtatagatacttcgcctatgactgtgaggcctcgtttccaggcacttcaactggc
cccatgaaaaaagtcgatgacataatttgtgattcccgatattctgacattgaaccctca
actgaaggagaggtgatatttcgtgctttagatcctgctttcaaaatagaagatccttat
agcccaaggatacagaatttattaaaaattaccaacttgagaatcaagtttgtgaaactg
catactttgggagataaccttctggattccagaatggaaatcagagaaaagtattattat
gcagtttatgatatggtggttcgaggaaattgcttctgctatggtcatgccagcgaatgt
gcccctgtggatggactcaatgaagaagtggaaggaatggttcacggacactgcatgtgc
aggcataacaccaaaggcttaaactgtgaactctgcatggatttctaccatgatttacct
tggagacctgctgaaggccgaaacagcaacgcctgtaaaaaatgtaactgcaatgaacat
tccagctcatgtcactttgacatggctgtttacctggccacggggaacatcagcggaggc
gtgtgtgatgactgtcagcacaacaccatggggcgcaactgtgagcagtgcaagccgttt
tactaccagcacccagagagggacatccgagatcctaatttctgtgaacgatgtacatgt
gacccagctggctctcaaaatgagggaacttgtgacagctatactgatttttctactggt
ctcattgctggccagtgtcggtgtaaattaaatgtggaaggagaacattgtgatgtttgc
aaagaaggcttctatgatttaagcagtgaagatccatttggttgtaaatcttgtgcttgc
aatcctctgggaacgattcctggagggaatccttgtgattctgagacaggtcattgctac
tgcaagcgtctggtgacaggacagcattgtgaccagtgtctgccagagcactggggctta
agcaatgatttggatggatgtcgaccatgtgactgtgaccttgggggagcgttaaacaac
agttgctttgcagagtcaggccagtgctcatgtcggcctcacatgattgggcgtcagtgc
aacgaagtggaacctggttactactttgccaccctggatcactacctctatgaagcagag
gaagccaacttggggcctggggttagcatagtggagcggcaatatatccaggaccggatt
ccctcctggactggagccggcttcgtccgagtgcctgaaggggcttatttggagtttttc
attgacaacataccatattccatggagtatgacatcctaattcgctacgagccacagcta
cccgaccactgggaaaaagctgtcatcacagtgcagcgacctggaaggattccaaccagc
agccgatgtggtaataccatcccagatgacgacaaccaggtggtgtcattatcaccaggc
tcaagatatgtcgtccttcctcggccagtgtgctttgagaagggaacaaactacacggtg
aggttggagctacctcagtacacctcctctgatagcgacgtggagagcccatacacgctg
atcgattctcttgttctcatgccatactgtaaatcactggacatcttcaccgtgggaggt
tcaggagatgtggtggtcaccaacagcgcctgggaaacctttcagagataccgatgtcta
gagaacagcagaagcgttgtgaaaacgccgatgacagatgtttgcagaaacatcatcttt
agcatttctgctgtgttacaccagacaggcctggcttgtgaatgcgaccctcagggttcg
ttaagttccgtgtgtgatcccaacggaggccagtgccagtgccggcccaacgtggttgga
agaacctgcaacagatgtgcacctggaacttttggctttggccccagtggatgcaaacct
tgtgagtgccatctgcaaggatctgtcaatgccttctgcaatcccgtcactggccagtgc
cactgtttccagggagtgtatgctcggcagtgtgatcggtgcttacctgggcactggggc
tttccaagttgccagccctgccagtgcaatggccacgccgatgactgcgacccagtgacg
ggggagtgcttgaactgccaggactataccatgggtcataactgtgaaaggtgcttggct
ggttactatggtgaccccatcattgggtcaggagatcactgccgcccttgcccttgccca
gatggtcctgacagtggacgccagtttgccaggagctgctaccaagatcctgttacttta
cagcttgcctgtgtttgtgatcctggatacattggttccagatgtgacgactgtgcctca
ggatactttggtaatccatcagaagttggggggtcatgtcagccttgccagtgtcacaac
aacattgacgcgacagacccagaagcctgtgacaaggagactgggaggtgtctcaagtgc
ctgtaccacacggaaggggaacactgtcagttctgccggtttggatactatggtgatgcc
ctccggcaggactgtcgaaagtgtgtctgtaattacctgggcactgtgcaagagcactgt
aatggctctgactgccagtgcgacaaagccactggtcagtgcttgtgtcttcctaatgtg
atcgggcagaactgtgaccgctgtgcgcccaatacctggcagctggccagtggcactggc
tgtgacccatgcaactgcaatgctgctcattccttcgggccatcttgcaatgagttcacg
gggcagtgccagtgcatgcctgggtttggaggccgcacctgcagcgagtgccaggaactc
ttctggggagaccccgatgtggagtgccgagcctgtgactgtgaccccaggggcattgag
acaccacagtgtgaccagtccacgggccagtgtgtctgcgttgagggtgttgagggtcca
cgctgtgacaagtgcacgcgagggtactcgggggtcttccctgactgcacaccctgccac
cagtgctttgctctctgggatgtgatcattgccgagctgaccaacaggacacacagattc
ctggagaaagccaaggccttgaagatcagtggtgtgatcgggccttaccgtgagactgtg
gactcggtggagaggaaagtcagcgagataaaagacatcctggcgcagagccccgcagca
gagccactgaaaaacattgggaatctctttgaggaagcagagaaactgattaaagatgtt
acagaaatgatggctcaagtagaagtgaaattatctgacacaacttcccaaagcaacggc
acagccaaagaactggattctctacagacagaagccgaaagcctagacaacactgtgaaa
gaacttgctgaacaactggaatttatcaaaaactcagatattcggggtgccttggatagc
attaccaagtatttccagatgtctcttgaggcagaggagagggtgaatgcctccaccaca
gaacccaacagcactgtggagcagtcagccctcatgagagacagagtagaagacgtgatg
atggagcgagaatcccagttcaaggaaaaacaagaggagcaggctcgcctccttgatgaa
ctggcaggcaagctgcaaagcctagacctttcagctgctgccgaaatgacctgtggaaca
cccccaggggcctcctgttccgagactgaatgtggcgggccaaactgcagaactgacgaa
ggagagaagaagtgtggggggcctggctgtggtggtctggttactgttgcacacaatgcc
tggcagaaagccatggacttggaccaagatgtcctgagtgccctggctgaagtggaacag
ctctccaagatggtctctgaagcaaaactgagggcagatgaggcaaaacagagtgctgaa
gacattctgttgaagacaaatgctaccaaagaaaaaatggacaagagcaatgaggagctg
agaaatctaatcaagcaaatcaggaactttttgacccaggatagtgctgatttggacagc
attgaagcagttgctaatgaagtattgaaaatggagatgcctagcaccccacagcagtta
cagaacttgacagaagatatacgtgaacgagttgaaagcctttctcaagtagagattatt
ctacagcagagtgctgctgacattgccagagctgagatgttgttagaagaagctaaaaaa
gcaagcaaaagtgcaacagatgttaaagtcactgcagatatggtaaaggaagctctggaa
gaagcagaaaaggcccaggtcgcagcagagaaggcaattaaacaagcagatgaagacatt
caaggaacccagaacctgctaacttcgattgagtctgaaacagcagcttctgaggaaacc
ttgttcaacgcgtcccagcgcatcagcgagttagagaggaatgtggaagaacttaagcgg
aaagctgcccaaaactctggggaggcagaatatattgaaaaagtagtatatactgtgaag
caaagtgcagacgatgtcaagaagactttacatggtgaacttgatgaaaagtataaaaaa
gtagaaaatttaattgccaaaaaaactgaagagtcagctgatgccagaaggaaagctgaa
atgctacaaaatgaagcaaaaactcttttagctcaagcaaatagcaagctgcaactgctc
aaagatttagaaagaaaatatgaaggcaatcaaaaatacttagaagataaagctcaagaa
ttagcgagactggaaggagaagtccgttcactcctaaaggatataagccagaaagttgct
gtgtatagcacatgcttgtaa

KEGG   Pongo abelii (Sumatran orangutan): 100458901
Entry
100458901         CDS       T01416                                 

Gene name
LAMA2
Definition
(RefSeq) laminin subunit alpha-2 isoform X1
  KO
K05637  laminin, alpha 1/2
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
pon05410  Hypertrophic cardiomyopathy
pon05412  Arrhythmogenic right ventricular cardiomyopathy
pon05414  Dilated cardiomyopathy
pon05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100458901 (LAMA2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100458901 (LAMA2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100458901 (LAMA2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100458901 (LAMA2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100458901 (LAMA2)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    100458901 (LAMA2)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    100458901 (LAMA2)
   05414 Dilated cardiomyopathy
    100458901 (LAMA2)
   05416 Viral myocarditis
    100458901 (LAMA2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100458901 (LAMA2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100458901 (LAMA2)
   05145 Toxoplasmosis
    100458901 (LAMA2)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_N Laminin_B Laminin_II Laminin_G_3 LXG DUF948
Other DBs
NCBI-GeneID: 100458901
NCBI-ProteinID: XP_024104095
LinkDB
Position
6
AA seq 3122 aa
MPGAAGVLLLLLLSGGLGGVQAQRPQQQRQSQAHQQRGLFPAVLNLASNALITTNATCGE
KGPEMYCKLVEHVPGQPVRNPQCRICNQNSSNPNQRHPITNAIDGKNTWWQSPSIKNGIE
YHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDDVEYKPWQYHAVTDTECLTLY
NIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSARY
IRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPLDP
ATNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDENVA
RRNLSLNIHGKYIGGGVCINCTQNTAGINCETCIDGFFRPKGVSPNYPRPCQPCHCDPIG
SLNEVCVKDEKHARRGLAPGSCHCRTGFGGVSCDRCARGYTGYPDCKACNCSGLGSKNED
PCFGPCICKENVEGGDCSRCKSGFFNLQEDNWKGCDECFCSGVSNRCQSSYWTYGKIQDM
SGWYLTDLSGRIRVAPQQDGLDSPQQISISNAEARQALPHSYYWSAPAPYLGNKLPAVGG
QLTFTISYDVEEEEEDTERVLQLMIILEGNDLRISTAQDKVYLHPSEEHINVLLLEEESF
TIHGTHFPVSRKEFMTVLANLKRVLLQITYSLGMDAIFRLSSVNLESAVSYPTDGSVAAA
VEVCQCPPGYTGSSCESCWPRHRRVNSTIFGGICEPCQCFGHAESCDDVTGECLNCKDHT
GGPYCDKCLPGFYGDPTKGTSEDCQPCACPLNIPSNNFSPMCHLDRSLGLICDGCAVGYT
GPRCERCAEGYFGQPSVPGGSCQPCQCNDNLDFSIPGSCDSLSGSCLICKAGTTGRYCEL
CADGYFGDAVDAKNCRPCRCNASGSFSEVCHSQTGQCECRANVQGQRCDKCKAGTFGLQS
ARGCVPCNCNSFGSKSFDCEESGQCWCQPGVTGKKCDRCAHGYFNFQEGGCTACECSHLG
NNCDPKTGRCICPPNTIGEKCSKCAPNTWGHSITTGCKACNCSTVGSLDFQCNVNTGQCN
CHPKFSGAKCTECSRGYWNYPRCNLCDCFLPGTDAATCDSETKKCSCSDQTGQCTCKVNV
EGIHCDRCRPGKFGLEAKNPLGCSSCYCFGTTTQCSEAKGLIRTWVTLKAEQTILPLVDE
ALQHTTTKGIVFQHPEIVAHMDLMREDLHLEPFYWKLPEQFEGKKLMAYGGKLKYAIYFE
AREEIGFSTYNPQVIIRGGTPTHARIIVRHMAAPLIGQLTRHEIEMTEKEWKYYGDDPRV
HTTVTREDFLDILYDIHYILIKATYGNFMRQSRISEISMEVAEQGRRTAVTPPADLIEKC
DCPLGYSGLSCEACLPGFYRLRSQPGGRTPGPTLGTCVPCQCNGHSSLCDPETSICQNCQ
HHTAGDFCERCALGYYGIVKGLPNDCQQCACPLISSSNNFSPSCVAEGLDDYRCTACPRG
YEGQYCERCAPGYTGSPSSPGGSCQECECDPYGSLPVPCDPVTGFCTCRPGATGRKCDGC
KRWHAREGWECVFCGDECTGLLLGDLTRLEQMVMSINLTGPLPAPYKMLYGLENMTQELK
HLLSPQRAPERLIQLAEGNLNTLVTEMNELLTRATKVTADGEQTGQDAERTNTRAKSLGE
FIKELARDAEAVNEKAIKLNETLGTRDEAFERNLEGLQKEIDQMIKELRRKNLETQKEIA
EDELVAAEALLKKVKKLFGESRGENEEMEKNLWEKLADYKNKVDDAWDLLREATDKIREA
NRLFAVNQRNMTALEKKKEAVESGKRQIENTLKEGNDILDEANRLADEINSIIDYVEDIQ
TKLPPMSEELNDKIDDLSQEIKDRKLAEKVSQAESHAAQLNDSSAVLDGILDEAKNISFN
ATAAFKAYSNIKDYIDEAEKVAKEAKDLAHEATKLATGPRGLLKEDAKGSLQKSFRILNE
AKKLANDVKENEDHLNGLKTKIENADARNGDLLRALNDTLGKLSAIPNDTAAKLQAVKDK
ARQANDTAKDVLAQIKELHQNLDGLKKNYNKLADSVAKTNAVVKDPSKNKIIADADATVK
NLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTYKP
EIKKGSYNNIVVNVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYP
DLTIDDSYWYRIVASRTGRNGTISVRALDGPKASIVPSTYHSTSPPGYTILDVDANAMLF
VGGLTGKLKKADAVRVITFTGCMGETYFDNKPIGLWNFREKEGDCKGCTVSPQVEDSEGT
IQFDGEGYALVSRPIRWYPNISTVMFKFRTFSSSALLMYLATRDLRDFMSVELTDGHIKV
SYDLGSGMASVVSNQNHNDGKWKSFTLSRIQKQANISIVDIDTNQEENIATSSSGNNFGL
DLKADDKIYFGGLPTLRNLSMKARPEVNLKKYSGCLKDIEISRTPYNILSSPDYVGVTKG
CSLENVYTVSFPKPGFVELSPVPIDVGTEINLSFSTKNESGIILLGSGGTPAPPRRKRRQ
TGQAYYAILLNRGRLEVHLSTGARTMRKIVVRPEPNLFHDGREHSVHVERTRGIFTVQVD
ENRRYMQNLTVEQPIEVKKLFVGGAPPEFQPSPLRNIPPFEGCVWNLVINSVPMDFARPV
SFKNADIGRCAHQKLREDEDGAAPAEIVIQPEPVPTPAFPTPTAVLTHGPCAAESEPALL
IGSKQFGLSRNSHIAIAFDDTKVKNRLTIELEVRTEAESGLLFYMARINHADFATVQLRN
GLPYFSYDLGSGDTHTMIPTKINDGQWHKIKVMRSKQEGILYVDGASNRTISPKKADILD
VVGMLYVGGLPINYTTRRIGPVTYSIDGCIRNLHMAEAPANLEQPTSSFHVGTCFANAQR
GTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRTTGVLLGISSQKLDGMGIEMIDEKLMFH
VDNGAGRFTAVYDAGVPGHLCDGQWHKVTANKIKHRIELTVDGNQVEAQSPNPASTSADT
NDPVFVGGFPDDLKQFGLTTSIRFRGCIRSLKLTKSTGKPLEVDFAKALELRGVQPVSCP
AN
NT seq 9369 nt   +upstreamnt  +downstreamnt
atgccgggagccgccggggtcctcctcctcctgctgctctccggaggcctcgggggcgta
caggcgcagcggccgcagcagcagcggcagtcacaggcacatcagcaaagaggtttattc
cctgctgtcctcaatcttgcttctaatgctcttatcacgaccaatgcaacatgtggagaa
aaaggacctgaaatgtactgcaaattggtagaacatgtccctgggcagcctgtgaggaac
ccgcagtgtcgaatctgcaatcaaaacagcagcaatccaaaccagagacacccaattaca
aatgctattgatggaaagaacacttggtggcagagtcccagcattaagaatggaatcgaa
taccattatgtgacaattacactggatttacagcaggtgttccagatcgcgtatgtgatt
gtgaaggcagctaactccccccggcctggaaactggattttggaacgctctcttgatgat
gttgaatacaagccctggcagtatcatgctgtgacagacacggagtgcctaaccctttac
aatatttatccccgcactgggccaccgtcatatgccaaagacgatgaggtcatctgcact
tcattttactccaagatacaccccttagaaaatggagagattcacatctctttaatcaat
gggagaccaagtgccgatgatccttctccagaactgctggaatttacctccgctcgctat
attcgcctgagatttcagaggatccgcacactgaatgctgacttgatgatgtttgctcac
aaagacccaagagaaattgaccccattgtcaccagaagatactactactcggtcaaggat
atttcggttggagggatgtgcatctgctatggtcatgccagggcttgtccacttgatcca
gcgacaaataaatctcgctgtgagtgtgagcataacacatgtggcgatagctgtgatcag
tgctgtccaggattccatcagaaaccctggagagctggaacttttctgactaaaactgaa
tgtgaagcatgcaattgtcatggaaaagctgaagaatgctattatgatgaaaatgttgcc
agaagaaatctgagtttgaatatacatggaaagtacatcggagggggtgtctgcattaat
tgtacccaaaacactgctggtataaactgcgagacatgtattgatggcttcttcagaccc
aaaggggtatctccaaattatccaaggccatgccagccatgtcattgcgatccaattggt
tccttaaatgaagtctgtgtcaaggatgagaaacatgctcgacgaggtttggcacctgga
tcctgtcattgcagaactggttttggaggtgtgagctgtgatcggtgtgccaggggctac
actggctacccagactgcaaagcctgtaactgcagtgggttagggagcaaaaatgaggat
ccttgttttggcccctgtatctgcaaggagaatgttgaaggaggagactgtagtcgttgc
aaatccggcttcttcaatttgcaagaggataattggaaagggtgcgatgaatgtttctgt
tcaggggtttcaaacagatgtcagagttcctactggacctatggcaaaatacaagatatg
agtggctggtatctgactgacctttctggccgcattcgagtggctccccagcaggacggc
ttggactcacctcagcagatcagcatcagtaacgcggaggcccggcaagccctgccgcac
agctactactggagcgcgccggctccctatctgggaaacaaactcccagcagtaggagga
cagttgacatttaccatatcatatgacgttgaagaagaggaagaagatacagaacgtgtt
ctccagcttatgattatcttagagggtaatgatttgagaatcagcacagcccaagataag
gtgtacctgcacccatctgaagaacatattaatgtattgttacttgaagaagaatcattt
accatacatggcacacattttccagtcagtagaaaggaattcatgacagtgcttgcaaat
ttgaagagagtcctcctacaaatcacatacagcttggggatggatgccatcttcaggttg
agctctgttaaccttgaatccgctgtctcctatcctactgatggaagcgttgcagcagct
gtagaagtgtgtcagtgtccaccagggtatactggctcctcttgtgaatcttgttggcct
aggcacaggcgagttaacagcactatttttggtggcatctgtgagccatgtcagtgcttt
ggtcatgcagagtcctgtgatgacgtcactggagaatgcctgaactgtaaggatcacaca
ggtggcccatactgtgataaatgtcttcctggtttctatggcgatcctactaaaggaacc
tctgaagactgtcaaccctgtgcctgtccactcaatatcccatccaataactttagccca
atgtgccatttagaccggagtcttggattgatctgtgatggatgcgctgttgggtacaca
ggaccacgctgtgagaggtgtgcagaaggctattttggacaaccctctgtacctggagga
tcatgtcagccatgccaatgcaatgacaaccttgacttctccatccctggaagctgtgac
agcttgtctggctcctgtctgatatgtaaagcaggtacaacaggccggtactgtgagctc
tgtgctgatggatattttggagatgcagttgatgcaaagaactgtcggccctgtcgctgt
aatgccagtggctctttctctgaggtttgccacagtcaaactggacagtgtgagtgcaga
gccaacgtgcagggtcagagatgtgacaaatgcaaggctgggacctttggcctacaatca
gcaaggggctgtgttccctgtaactgcaattcttttgggtctaagtcattcgactgtgaa
gagagtggacaatgttggtgccaacctggagtcacagggaagaaatgtgaccgctgtgcc
cacggctatttcaacttccaagaaggaggctgcacagcttgtgaatgttctcatctgggt
aataattgtgacccaaagactgggcgctgcatttgccctcccaataccattggagagaaa
tgttctaaatgtgcacccaatacctggggccacagcattaccactggttgtaaggcttgt
aactgcagcacagtgggatccttggatttccaatgcaatgtaaatacaggccaatgcaac
tgtcatccaaaattctctggtgcaaaatgtacagagtgcagtcgaggttactggaactac
cctcgctgcaatctgtgtgactgcttcctccctgggacggatgccgcaacctgtgattca
gagactaaaaaatgctcctgtagtgatcaaactggacagtgcacttgtaaggtgaatgtg
gaaggcatccactgtgacagatgccggcctggcaaattcggacttgaggccaagaatcca
cttggctgcagcagctgctattgcttcggcactactacccagtgctctgaagcaaaagga
ctgatccggacgtgggtgactctgaaggctgagcagaccattctacccctggtagatgag
gctctgcagcacacaaccaccaagggcattgtttttcaacatccagagattgttgcccac
atggatctgatgagagaagatctccatttggaacctttttattggaaacttccagaacaa
tttgaaggaaagaagttgatggcctatgggggcaaactcaagtatgcaatctatttcgag
gctcgggaagaaataggtttctctacatataatcctcaagtgatcattcgaggtgggaca
cctactcatgctagaattatcgtcaggcatatggctgctcctctgattggccaattgaca
aggcatgaaattgaaatgacagagaaagaatggaaatattatggggacgatcctcgagtc
catacaactgtgacccgagaagacttcttggatatactatatgatattcattacattctt
atcaaagctacttatggaaatttcatgcgacaaagcaggatttctgaaatctcaatggag
gtagctgaacaaggacgtagaacagcagtgactcctccagctgacttgattgaaaaatgt
gattgtcccctgggctattctggcctgtcctgtgaggcatgcttgccaggattttatcga
ctgcgttctcaaccaggtggccgcacccctggaccaaccctgggcacctgtgttccatgt
caatgtaatggacacagcagcctgtgtgaccctgaaacatcgatatgccagaattgtcaa
catcacactgctggtgacttctgtgaacgatgtgctcttggatactatggaattgtcaag
ggattgccaaatgactgtcagcaatgtgcctgccctctgatttcttccagtaacaatttc
agcccctcttgtgtcgcagaaggccttgatgactaccgctgcacggcttgtccacgggga
tacgaaggccagtactgtgaaaggtgtgcccctggctatactggcagtccaagcagccct
ggaggctcctgccaagaatgtgagtgtgatccctatggctcactgcctgtgccctgtgac
cctgtcacaggattctgcacgtgccgacccggagccacgggaaggaagtgtgacggctgc
aagcgctggcatgcacgcgagggctgggagtgtgttttttgtggagatgagtgcactggc
cttcttctcggtgacttgactcgcctggagcagatggtcatgagcatcaacctcactggt
ccgctgcctgcgccatataaaatgctgtatggtcttgaaaatatgactcaggagctaaag
cacttgctctcacctcagcgggccccagagaggcttattcagctggcagagggcaatctg
aatacactcgtgaccgaaatgaatgagctgctgaccagggctaccaaagtgacagcagat
ggtgagcagaccggacaggatgctgagaggaccaacacaagagcaaagtccctgggagaa
ttcattaaggagcttgcccgggatgcagaagctgtaaatgaaaaagctataaaactaaat
gaaactctaggaactcgagacgaggcctttgagagaaatttggaagggcttcagaaagag
attgaccagatgattaaagaactgaggaggaaaaatctagagacacaaaaggaaattgct
gaagatgagttggtagctgcagaagcccttctgaaaaaagtgaagaagctgtttggagag
tcccggggggaaaatgaagaaatggagaagaatctctgggaaaaactggctgactacaaa
aacaaagttgatgatgcttgggacctgttgagagaagccacagataaaatcagagaagct
aatcgcttatttgcagtaaaccagagaaacatgactgcattggagaaaaagaaggaggct
gttgaaagtggcaaacgacaaattgagaacactttaaaagagggcaatgacatactcgat
gaagccaaccgtcttgcagatgaaatcaactccatcatagactatgttgaagacatccaa
accaaattgccacctatgtctgaggagcttaatgataaaatagatgacctctcccaagaa
ataaaggacaggaagcttgctgagaaggtgtcccaggctgagagtcacgcagctcagttg
aatgactcatctgctgtccttgatggaatccttgatgaggctaaaaacatctccttcaat
gccactgcagccttcaaagcttacagcaatatcaaggactatattgatgaagctgagaaa
gttgccaaagaagccaaagatcttgcacatgaagccacaaaactggcaacaggtcctcgg
ggtttattaaaggaagatgccaaaggctctcttcagaaaagcttcaggattcttaacgaa
gccaagaagttagcaaatgatgtaaaagaaaatgaagaccatctaaatggcttaaaaacc
aagatagaaaatgctgatgctagaaatggggatctcttgagagctttgaatgacactttg
ggaaagttatcagctattccaaatgacacagctgctaaactgcaagctgttaaggacaaa
gccagacaagccaacgacacagctaaagatgtactggcacagattaaagagctccaccag
aacctcgatggcctgaagaagaattacaataaactagcagacagcgtcgccaaaacaaat
gctgtggttaaagatccttccaagaacaaaatcattgccgatgcagatgccactgtcaaa
aatttagaacaggaagctgaccggctaatagataaactcaaacccatcaaggaacttgag
gataacctaaagaaaaacatctctgagataaaggaattgataaaccaagctcggaaacaa
gccaattctatcaaagtatctgtgtcttcaggaggtgactgcattcgaacatacaagcca
gaaatcaagaaaggaagttacaataatattgttgtcaacgtaaagacagctgttgctgat
aacctcctcttttatcttggaagtgccaaatttattgactttctggctatagaaatgcgt
aaaggcaaagtcagcttcctctgggatgttggatctggagttggacgtgtagagtaccca
gatttgactattgatgactcatattggtaccgtatcgtagcatcaagaactgggagaaat
ggaactatttctgtgagagccctggatggacccaaagccagcattgtgcccagcacgtat
cattcgacgtctcctccagggtacactattctagatgtggatgcaaatgcaatgctgttt
gttggtggcctgactgggaaattaaagaaggctgatgctgtacgtgtgattacattcact
ggctgcatgggagaaacatactttgacaacaaacctataggtttgtggaatttccgagaa
aaagaaggtgactgcaaaggatgcactgtcagtcctcaggtggaagatagtgaggggact
attcagtttgatggagaaggttatgcattggtcagccgccccattcgctggtaccccaac
atctccaccgtcatgttcaagttcagaacattttcttcgagtgctcttctgatgtatctt
gccacacgagacctgagagatttcatgagtgtggagctcactgatgggcacataaaagtc
agttatgatctgggctcaggaatggcttccgttgtcagcaatcaaaaccataatgatggg
aaatggaaatcattcaccctgtcaagaattcaaaaacaagccaatatatcaattgtagat
atagatactaatcaggaggagaacatagcaacttcgtcttctggaaacaacttcggtctt
gacttgaaagcagatgacaaaatatattttggtggcctgccaacgctgagaaacttgagt
atgaaagcaaggccagaagtaaatctgaagaaatattccggctgcctcaaagatattgaa
atttcaagaactccatacaatatactcagtagtcccgattatgttggtgttaccaaagga
tgttccctggagaatgtttacacagttagctttcctaagcctggttttgtggagctctcc
cctgtgccaattgatgtaggaacagaaatcaacctgtcattcagcaccaagaatgagtcc
ggcatcattcttttgggaagtggagggacaccagcaccacctaggagaaaacgaaggcag
actggacaggcctattatgcaatactcctcaacaggggccgtctggaagtgcatctctcc
acaggggcacgaacaatgaggaaaattgtcgtcagaccagagccgaatctgtttcatgat
ggaagagaacattccgttcatgtagagcgaactagaggcatctttacagttcaagtcgat
gaaaacagaagatacatgcaaaacctgacagttgaacaacctattgaagttaaaaagctt
ttcgttgggggtgctccgcctgaatttcaaccttccccactcagaaatattcctcctttt
gaaggctgtgtatggaatcttgttattaactctgtccccatggactttgcaaggcctgta
tccttcaaaaacgctgacattggtcgctgtgcccatcagaaactccgtgaagatgaagat
ggagcagctccagctgaaatagttatccagcctgagccagttcccaccccagcctttcct
acgcccaccgcagttctgacacatggtccttgtgctgcagaatcagaaccagctcttttg
atagggagcaagcagttcgggctttcaagaaacagtcacattgcaattgcatttgatgac
accaaagttaaaaaccgtctcacaattgagttggaagtaagaaccgaagctgaatccggc
ttgcttttttacatggctcgcatcaatcatgctgattttgcaacagttcagctgagaaac
ggattgccctacttcagctatgacttggggagtggggacacccacaccatgatccccacc
aaaatcaatgatggccagtggcacaagattaaggtaatgagaagtaagcaagaaggaatt
ctttacgtagatggggcctccaacagaaccatcagtcccaaaaaagctgacatcctggat
gtcgtgggaatgctgtatgttggtgggttacccatcaactacactacccgaagaattggt
ccagtgacctatagcattgatggctgcatcaggaatctccacatggcagaggcccctgcc
aatctggaacagcccacctccagcttccatgttgggacatgttttgcaaatgctcagagg
ggaacatattttgacggaaccggttttgccaaagcagttggtggattcaaagtgggattg
gaccttcttgtagaatttgaattccgcacaactagaacgactggagttcttctggggatc
agtagtcaaaaactggatggaatgggtattgaaatgattgatgaaaagttgatgtttcat
gtggacaatggcgcgggcagattcactgctgtctatgatgctggggttccagggcatttg
tgtgatggacaatggcataaagtcactgccaacaagatcaaacaccgcattgagctcaca
gtcgatgggaaccaggtggaagcccaaagcccaaacccagcatctacatcagctgacaca
aatgaccctgtgtttgttggaggcttcccagatgaccttaagcagtttggcctgacaacc
agtattcggttccgaggttgcatcagatccctgaagctcaccaaaagcacgggcaagcca
ctggaggttgattttgccaaggccctggaactgaggggcgttcaacctgtatcatgccca
gccaactaa

KEGG   Pongo abelii (Sumatran orangutan): 100459849
Entry
100459849         CDS       T01416                                 

Gene name
COL6A3
Definition
(RefSeq) collagen alpha-3(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100459849 (COL6A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100459849 (COL6A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100459849 (COL6A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100459849 (COL6A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100459849 (COL6A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100459849 (COL6A3)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100459849 (COL6A3)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100459849 (COL6A3)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100459849 (COL6A3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   100459849 (COL6A3)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Kunitz_BPTI fn3
Other DBs
NCBI-GeneID: 100459849
NCBI-ProteinID: XP_009236555
Ensembl: ENSPPYG00000013309
UniProt: H2P914
LinkDB
Position
2B
AA seq 3182 aa
MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVRE
FLYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGK
GLEYIMQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDA
DEGALKEIASEPLDMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDS
ADIIFLIDGSNNTGSVNFAVILDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYS
TKAQVLGAVKALGFAGGELANIGLALDFVVENHFTRAGGSRVEEGVPQVLVLISAGPSSD
EIRYGVVALKQASVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQEKLLPYIV
GVAQRHIVLKPPTIVTQVIEVNKRDIVFLVDGSSALGLANFNAIRDFIAKVIQRLEIGQD
LIQVAVAQYADTVRPEFYFNTHPTKREVITAVRKMKPLDGSALYTGSALDFVRNNLFTSS
AGYRAAEGIPKLLVLITGGKSLDEISQSAQELKRSSIMAFAIGNKGADQAELKEIAFDSS
LVFIPAEFRAAPLQGMLPGFLAPLRTLSGTPEVHANKRDIIFLLDGSANVGKTNFPYVRD
FVMNLVNSLDVGNDNIRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTG
SALSYVHANHFTEAGGSRIHEHVPQLLLLLTAGQSEDSYLQAANALTRAGILTFCVGASQ
ANKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILF
LFDGSANLVGQFPVVRDFLYKIIDELDVKPDGTRIAVAQYSDDVKVESRFDEHQSKPEIL
NLVKRMKIKTGKALNLGYALDYAQRYIFVKSAGSRIEDGVLQFLVLLVAGRSSDRVDGPA
SNLKQSGVVPFIFQAKNADPAELEQIVLSPAFILAAESLPKIGDLQPQIVNLLKSVHNGA
PAPVSGEKDVVFLLDGSEGVRSGFPLLKEFVQRVVESLDVGQDRVRVAVVQYSDRTRPEF
YLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEGVPQLLIVLT
ADRSGDDVRNPSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV
ISERVTQLTREELSRLQPVLQPLPSPGVGGKRDVVFLIDGSQSAGPEFQYIRTLIERLVD
YLDVGFDTTRVAVIQFSDDPKVEFLLNAHSSKDEVQNAVQRLRPKGGRQINVGSALEYVS
RNIFKRPLGSRIEEGVPQFLVLISSGKSDDEVDDPAVELKQFGVAPFTIARNADQEELVK
ISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAVESDAADIVFL
IDSSEGVRPDGFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAPVL
DAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSRFA
QVIRSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATP
APPGVDTPPPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGL
VQYNSDPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTRVGLEHLRVNHFVPEAGSRLD
QRVPQIAFVITGGKSVEDAQDVSLALTQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVG
NVQELSELSEQVLETLHDAMHETLCPGVTDAAKACNLDVILGFDGSRDQNVFVAQKGFES
KVDAILNRISQMHRVSCSGGRSPTVRVSVVANTPSGPVEAFDFDEYQPEMLEKFRNMRSQ
HPYVLTEDTLKVYQNKFRQSSPDSVKVVIHFTDGADGDLADLHRASENLRQEGVHALILV
GLERVANLERLMHLEFGRGFMYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQR
GDRGPIGSIGPKGIPGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFP
GEKGEVGEIGLDGLDGEDGDKGLPGSSGEKGNPGRRGDKGPRGEKGERGDVGIRGDPGNP
GQDSQERGPKGETGDLGPMGVPGRDGVPGGPGETGKNGGFGRRGPPGAKGNKGGSGQPGF
EGEQGTRGAQGPAGPAGPPGLIGEQGISGPRGSGGAAGAPGERGRTGPLGRKGEPGEPGP
KGGIGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGNPGEPGLNGTTGPKGIR
GRRGNSGPPGIVGQKGDPGYPGPAGPKGNRGDSIDQCALIQSIKDKCPCCYGPLECPVFP
TELAFALDTSEGVNQDTFGRMRDVVLSIVNDLTIAESNCPRGARVAVVTYNNEVTTEIRF
ADSKRKSVLLDKIKNLQVALTSKQQSLETAMSFVARNTFKRVRNGFLMRKVAVFFSNTPT
RASPQLREAVLKLSDAGITPLFLTSQEDRQLINALQINNTAVGHALVLPAGRDLTDFLEN
VLTCHVCLDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFVLDSAETTTLFQFNEMKK
YIAYLVRQLDMSPDPKASQHFARVAVVQHAPSESMGNASMPPVKVEFSLTDYGSKEKLVD
FLSRGMTQLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVQEQQLEEAQRVIL
QAKCKGYFFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFV
SSENAFYLSPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPAMTTKPVTTM
KPVTTTTKPVTTTTKPVAIVNQPSAKPAAAKPAPVKPAPAKPMAAKPVATKTATVRPPVV
VKPATAAKPVAAKPAAVRPPAAAAAKPVVTKPEAPRPQAAKPAATKPATTKPVVRVSREV
QVFEITENSAKLHWERPEPPSPYFYDLTVTSAHDQSLVLKQNLTVTDRIIGGLLAGQTYH
VAVVCYLRSQVRATYHGSFSTKKSQPPPPQPARSASSSTINLMVSTEPLALTETDICKLP
KDEGTCRDFILKWYYDPNTKSCARFWYGGCGGNENKFGSQKECEKVCAPVLAKPGVISVM
GT
NT seq 9549 nt   +upstreamnt  +downstreamnt
atgaggaaacatcggcacttgcccttagtggccgtcttttgtctctttctctcaggcttt
cccacaactcatgcccagcagcagcaagcagatgtcaaaaatggtgcggctgctgatata
atatttctagtggattcctcttggaccattggagaggaacatttccaacttgttcgagag
tttctatatgatgttgtaaaatccttagctgtgggagaaaatgatttccattttgctctg
gtccagttcaacggaaacccacataccgagttcctgttaaatacgtatcgtactaaacaa
gaagtcctttctcatatttccaacatgtcttatattgggggaacaaatcagactggaaaa
ggattagaatacataatgcaaagccacctcaccaaggctgctggaagccgggccggtgac
ggagtccctcaggttatcgtagtgttaactgatggacactcgaaggatggccttgctctg
ccctcagcggaacttaagtctgctgatgttaacgtgtttgcaattggagttgaggatgca
gatgaaggagcattaaaagaaatagcaagtgaaccgctcgatatgcatatgttcaaccta
gagaattttacctcacttcatgacatagtaggaaacttagtgtcctgtgtgcattcatcc
gtgagtccagaaagggctggagacacggaaacccttaaagacatcacagcacaagactct
gctgacattattttccttattgatggatcaaacaacaccggaagtgtcaatttcgcagtc
attctcgacttccttgtaaatctccttgagaaactcccaattggaactcagcagatccga
gtgggggtggtccagtttagcgatgagcccagaactatgttctccttggacacttactcc
accaaggcccaggttctgggtgcagtgaaagccctggggtttgctggtggggagttggcc
aatatcggcctcgcccttgatttcgtggtggagaaccacttcacccgcgcagggggcagc
cgcgtggaggaaggggttccccaggtgctggtcctcataagtgccgggccttctagtgac
gagattcgctacggggtggtagcactgaagcaggctagcgtgttctcattcggcctcgga
gcccaggccgcctccagggcagagcttcagcacatagctaccgatgacaacttggtgttt
actgtcccggaattccgtagctttggggacctccaggagaaattactgccgtacattgtt
ggcgtggcccaaaggcacattgtcttgaaaccgccaaccattgtcacacaagtcattgaa
gtcaacaagagagacatagtcttcctggtggatggctcatccgcactgggactggccaac
ttcaatgccatccgagacttcattgctaaagtcatccagaggctggaaatcggacaggat
cttatccaggtggcagtggcccagtatgcagacactgtgaggcctgaattttatttcaat
acccatccaacaaaaagggaagtcataactgctgtgcggaaaatgaagcccctggacggc
tcggccctgtacactggctctgctctagactttgttcgtaacaacctatttacgagttca
gccggctaccgggctgccgaggggattcctaagcttttggtgctgatcacaggtggtaag
tccctagatgaaatcagccagtctgcccaggagctgaagagaagcagcataatggccttt
gccatcgggaacaagggtgccgatcaggctgagctgaaagagatcgctttcgactcctcc
ctggtgttcatcccggccgagttccgagccgccccattgcaaggcatgctgcctggcttc
ctggcacctctcaggaccctctctggaacccctgaagttcatgcaaacaaaagggatatc
atctttcttttggatggatcagccaacgttggaaaaaccaatttcccttatgtgcgcgac
tttgtaatgaacctagttaacagccttgatgttggaaatgacaatattcgtgttggttta
gtgcaatttagtgacactcctgtaacggagttctctttaaacacataccaaaccaagtca
gacatccttggtcacctgaggcagctgcagctccagggaggttcgggcctgaacacaggc
tcagccctaagctatgtccatgccaaccacttcacagaagctggcggcagcaggatccat
gaacacgtgccacagctcctgcttctgctcacagctgggcagtctgaggactcctatttg
caagctgccaatgccttgacacgtgcgggcatcctgactttttgtgttggagctagccag
gcgaataaggcagagcttgagcagattgcttttaacccaagcctggtgtatctcatggat
gatttcagctccctgccagctttgcctcagcagctgattcagcccctaaccacatatgtt
agtggaggtgtggaggaagtaccactcgcccagccagagagcaagcgagacattctgttc
ctcttcgacggctcagccaatctcgtgggccagttccctgttgtccgtgactttctctac
aagattatcgatgagctcgatgtgaagccagatgggacccgaattgcggtggctcagtac
agtgatgatgtcaaggtggagtcccgtttcgatgagcaccagagtaagcctgagatcctg
aatcttgtgaagagaatgaagatcaagacaggcaaagccctcaacctgggctacgccctg
gactatgcacagaggtacatttttgtgaagtctgctggcagccggatcgaggatggagtg
cttcagttcctggtgctgctggtggcgggaaggtcatctgaccgtgtggatgggccagca
agtaacctgaagcagagtggggttgtgcctttcatcttccaagccaagaacgcggaccct
gctgagttagagcagatcgtgctgtctccagcatttatcctggctgcggagtcacttccc
aagattggagatcttcaaccacagatcgtgaatctcttaaaatcagtgcacaacggagca
ccagcaccagtttcaggcgaaaaggacgtggtgtttctgcttgatggctctgagggtgtc
aggagcggcttccctctgttgaaagagtttgtccagagagtggtggaaagcctggacgtg
ggccaggaccgggtccgcgtggccgtggtgcagtacagcgaccggaccaggcccgagttc
tacctgaattcatacatgaaccagcaggacgttgtcaacgctgtccgccagctgaccctg
ctgggagggccgacccccaacaccggggccgccctggagtttgtcctgaggaacatcctg
gtcagctctgcgggaagcaggataacagaaggcgtgccccagctgctgatcgtcctcacg
gccgacaggtctggggatgatgtgcggaacccctctgtggtcgtgaaaaggggcggggct
gtgcccattggcattggcattgggaacgctgacatcacagagatgcagaccatctccttc
atcccggactttgctgtggccattcccacctttcgccagctggggaccgtccaacaggtc
atctctgagagggtgacccagctcacccgcgaggagctgagcaggctgcagccggtgttg
cagcctctaccgagcccaggtgttggcggcaagagggacgtggtctttctcatcgatggg
tcccaaagtgccgggcctgagttccagtacattcgcaccctcatagagaggctggttgac
tacctggacgtgggctttgacaccacccgggtggccgtcatccagttcagcgatgacccc
aaggtggagttcctgctgaatgctcattcaagcaaggatgaagtgcagaacgcggtgcag
cggctgaggcccaagggagggcggcagatcaacgtgggcagtgccctggagtacgtgtcc
aggaacatcttcaagaggcccctggggagccgcattgaagagggcgtcccgcagttcctg
gtcctcatctcgtccggaaagtctgacgatgaggtggacgacccggcagtggagctcaag
cagtttggcgtggcccctttcacgatcgccaggaacgcagaccaggaggagctggtgaag
atctcgctgagccccgaatatgtgttctcggtgagcaccttccgggagctgcccagcctg
gagcagaaactgctgacgcccatcacaactctgacctcagagcagatccagaagctctta
gccagcactcgctatccacctccagcagttgagagtgatgctgcagacatcgtctttctg
atcgacagctctgagggagttaggccggatggctttgcacatattcgagattttgttagt
aggattgttcgaagactgaacattggccccagtaaagtgagagttggggtcgtgcagttc
agcaatgatgtcttcccagaattctacctgaagacctacagatcccaggccccggtgctg
gatgccatacggcgcctgaggctcagaggggggtcgccactgaacactggcaaggctctc
gaatttgtggcaagaaacctctttgttaagtctgcggggagtcgcatagaagacggggtg
ccccaacacctggtcctggtcctgggtggaaaatcccaggacgacgtgtctaggtttgcc
caggtgatccgttcctcggggattgtgagtttaggggtaggagaccggaacatcgacaga
acagagctgcagaccatcaccaatgaccccagactggtcttcacagtgcgagagttcaga
gagcttcccaacatagaggaacgaatcatgaactcatttggaccctccgcagccactcct
gcacctccaggggtggacacccctcccccttcacggccagagaagaagaaagcagacatt
gtgttcctgttggatggttccatcaacttcaggagggacagtttccaggaagtgcttcgt
tttgtgtctgaaatcgtggacacagtttatgaagatggcgactccatccaagtggggctt
gtccagtacaactctgaccccactgacgaattcttcctgaaggacttctccaccaagagg
cagattattgacgccatcaacaaagtggtctacaaagggggaagacacgccaacacgagg
gtgggccttgagcacctgcgggtgaaccacttcgtgcctgaggcaggcagccgcctggac
cagcgggtccctcagattgcttttgtgatcacgggaggaaagtcggtggaagatgcacag
gatgtgagcctggccctcacccagaggggggtcaaagtgtttgctgttggagtgaggaat
atcgactcagaggaggtcggaaagatagcgtccaacagcgccacagcgttccgtgtgggc
aacgtccaggagctgtccgaactgagcgagcaagttttggaaactttgcatgatgcgatg
catgaaaccctgtgccctggtgtaactgatgctgccaaagcttgtaatctggatgtgatt
ctggggtttgatggttctagagaccagaatgtttttgtggcccagaagggcttcgagtcc
aaggtggacgccatcttgaacagaatcagccagatgcacagggtcagctgcagcggcggc
cgctcgcccactgtgcgtgtgtcagtggtggccaacacgccctcgggcccggtggaggcc
tttgactttgacgagtaccagccagagatgctcgagaagttccggaacatgcgcagccag
cacccctatgtcctcacagaggacaccctgaaggtctaccagaacaagttcagacagtcc
tcgccggacagtgtgaaggtggtcattcattttactgatggagcagatggagatctggct
gatttacacagagcatctgagaacctccgccaagaaggtgtccatgccttgatcctggtg
ggccttgaacgagtggccaacttggagcggctaatgcatctggagtttgggcgagggttc
atgtatgacaggcccctgaggcttaacttgctggacttggattatgaactagcggagcag
cttgacaacattgccgagaaagcttgctgtggggttccctgcaagtgctctgggcagagg
ggagaccgcgggcccatcggcagcatcgggccaaagggtattcctggagaagacggctac
cgaggctatcctggtgatgagggtggacccggtgagcgtggtccgcctggtgtgaatggc
actcaaggtttccagggctgcccgggccagagaggagtaaagggctctcggggattccca
ggagagaagggtgaagtaggagaaattggactggatggtctggacggtgaagatggagac
aaaggattgcctggttcttctggagagaaagggaatcctggaagaaggggtgataaagga
cctcgaggagagaaaggagaaagaggagatgttgggattcgaggggacccgggtaaccca
ggacaagacagccaggagagaggacccaaaggagaaaccggtgacctcggccccatgggt
gtcccagggagagatggagtacctggaggacctggagaaactgggaagaatggtggcttt
ggccgaaggggaccccctggagctaagggcaacaagggcggttctggccagccaggcttt
gagggagagcaggggaccagaggtgcacagggcccagctggtcctgctggtcctccaggg
ctgataggagaacaaggcatttctggacctcggggaagcggaggtgccgccggtgctcct
ggagaacgaggcagaaccggtcccctaggaagaaagggtgagcccggagagccaggacca
aaaggaggaatcgggaaccggggccctcgtggggagacgggagatgacgggagagacgga
gttggcagtgaaggacgcagaggcaaaaaaggagaaagaggattccctggatacccagga
ccaaagggtaacccaggtgagcctgggctaaatggaacaacaggacccaaaggcatcaga
ggccgaaggggaaattcgggacctccagggatagttggacagaagggagaccctggctac
ccaggaccagctggtcccaagggcaacagaggcgactccatcgatcaatgtgccctcatc
caaagcatcaaagataaatgcccttgctgttatgggcccctggagtgtcccgtcttccca
acagaactagcctttgctttagacacctctgagggggtcaaccaagacactttcggccgg
atgcgagatgtggtcttgagtattgtgaatgacctgaccattgctgagagcaactgccca
cggggggcccgggtggctgtggtcacctacaacaatgaggtgaccacggagatccggttt
gctgactccaagaggaagtcggtcctcctggacaagattaagaaccttcaggtggctctg
acgtccaaacagcagagtctggagaccgccatgtcatttgtggccaggaacacatttaag
cgtgtgaggaacggattcctaatgaggaaagtggctgttttcttcagcaacacgcccacg
agagcatccccacagctcagagaggctgtgctcaagctctcagatgcagggatcaccccc
ttgttccttacaagccaggaagaccggcagctcatcaacgctttgcagatcaataacaca
gcagtggggcatgcgcttgtcctgcctgcagggagagacctcacagacttcctggagaat
gttctcacgtgtcatgtttgcttggacatctgcaacatcgacccatcctgcggatttggc
agttggaggccttccttcagggacaggagagcagcagggagcgatgtggacatcgacatg
gcttttgtcttagacagcgctgagaccaccaccctgttccagttcaatgagatgaagaag
tacatagcgtacctggtcagacaactggacatgagcccagaccccaaggcctcccagcac
ttcgccagagtggccgttgtgcagcacgcgccctctgagtccatgggcaatgccagcatg
ccacctgtgaaggtggaattctccctgactgactatggctccaaggagaagctggtggac
ttcctcagcaggggaatgacacagttgcagggaaccagggccttgggaagtgccattgaa
tacaccatagagaatgtctttgaaagtgccccaaacccacgggacctgaaaattgtggtc
ctgatgctgacgggcgaggtgcaggagcagcagctggaggaggcccagagagtcatcctg
caggccaaatgcaagggctacttcttcgtggtcctgggcattggcaggaaggtgaacatc
aaggaggtatacacctttgccagtgagccaaacgacgtcttcttcaaattagtggacaag
tccaccgagctcaacgaggagcctttgatgcgcttcgggaggctgttgccgtccttcgtc
agcagtgaaaatgctttttacttgtccccagatatcaggaaacagtgtgattggttccag
ggggaccaacccacaaagaaccttgtgaagtttggtcacaaacaagtaaatgttccgaat
aatgttacttcaagtcctacatccaacccagcgatgacaacgaagccggtgactacgatg
aagccggtgaccactacaacaaagcctgtaaccaccacaacaaagcctgtagctattgta
aatcagccatctgcgaagccagccgctgcaaagccggcccctgtgaaaccagcccctgca
aaacctatggctgccaagcctgtggccacgaagacggccactgttagacccccagtggtg
gtgaagccagcaacagcagcgaagcctgtagcagcaaagccagcagctgtaagacccccc
gctgctgctgctgcaaaaccagtggtgaccaagcctgaggcccctaggccccaggcagcc
aaaccagctgccaccaagccagccaccactaagcccgtggttagggtgtcccgtgaagtc
caggtgtttgagataacagagaacagcgccaaactccactgggagaggcctgagcccccc
agtccttatttttatgacctcaccgtcacctcagcccatgatcagtccctggttctgaag
cagaaccttacagtcacggaccgcatcatcggaggcctgcttgctgggcagacataccat
gtggctgtggtctgctacctgaggtctcaggtcagagccacctaccacggaagtttcagt
acaaagaaatctcagcccccacctccacagccagcaaggtcagcttctagttcaaccatc
aatctaatggtgagcacagaaccgttggctctcactgaaacagatatatgcaaattgccg
aaagacgaaggaacttgcagggatttcatattaaaatggtactatgatccaaacaccaaa
agctgtgcaagattctggtatggaggttgtggtggaaacgaaaacaaatttggatcacag
aaagaatgtgaaaaggtttgcgctcctgtgctcgccaaacccggagtcatcagtgtgatg
ggaacctaa

KEGG   Pongo abelii (Sumatran orangutan): 100460129
Entry
100460129         CDS       T01416                                 

Gene name
COL6A1
Definition
(RefSeq) collagen alpha-1(VI) chain
  KO
K06238  collagen type VI alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100460129 (COL6A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100460129 (COL6A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100460129 (COL6A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100460129 (COL6A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100460129 (COL6A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100460129 (COL6A1)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100460129 (COL6A1)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100460129 (COL6A1)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100460129 (COL6A1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   100460129 (COL6A1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3
Other DBs
NCBI-GeneID: 100460129
NCBI-ProteinID: XP_024095135
LinkDB
Position
21
AA seq 1028 aa
MRAARALLPLLLQACWTAAQDEPETPRAVAFQDCPVDLFFVLDTSESVALRLKPYGALVD
KVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDALKSSVDAV
KYFGKGTYTDCAIKKGLEQLLVGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAK
HLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGQSRDAEEAISQTIDTIVDMIK
NNVEQVCCSFECQPARGPPGLRGDPGFEGERGKPGLPGEKGEAGDPGRPGDLGPVGYQGM
KGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVKGEMGYPGLPGCKGSPGFDGIQGPPGP
KGDPGAFGLKGEKGEPGADGEAGRPGSSGPPGDEGQPGEPGPPGEKGEAGDEGNPGPDGA
PGERGGPGEGGPQGTPGTRGPRGDPGEAGPQGDQGREGPVGVPGDPGEAGPIGPKGYRGD
EGPPGSEGARGAPGPAGPPGDPGLMGERGEDGPPGNGTEGFPGFPGYPGNRGPPGINGTK
GYPGLKGDEGEAGDPGDDNNDIAPRGVKGAKGYRGPEGPQGPPGHPGPPGPDECEILDII
MKMCSCCECKCGPIDLLFVLDSSESIGLQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA
GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLLPPSPN
NRIALVITDGRSDTQRDTTPLNVLCSPGIQVVSVGIKDVFDFVPGSDQLNVISCQGLAPS
QGRPGLSLVKENYAELLEDAFLKNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASV
GSHNFDTTKRFAKRLAERFLTAGRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTAL
ASAVDAMGFINDATDVNDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATPAAIEKAVQ
EAQRAGIEIFVVVVGHQVNEPHIRVLVTGKTAEYDVAYGERHLFRVPSYQALLRGVFHQT
VSRKVALG
NT seq 3087 nt   +upstreamnt  +downstreamnt
atgagggcggcccgtgctctgctgcccctgctgctgcaggcctgctggacggccgcgcag
gatgagccggagaccccgagggcggtggccttccaggactgccccgtggacctgttcttt
gtgctggacacctctgagagtgtggccctgaggctgaagccctacggggccctcgtggac
aaagtcaagtccttcaccaagcgcttcatcgacaacctgagggacaggtactaccgctgt
gaccgaaacctggtgtggaacgcaggtgcgctgcactacagtgacgaggtggagatcatc
caaggcctcacgcgcatgcctggcggccgcgacgccctcaaaagcagtgtggacgcagtc
aagtattttgggaagggcacctacaccgactgcgccatcaagaaggggctggagcagctc
cttgtggggggctcccacctgaaggagaataagtacctgattgtggtgaccgacgggcac
cccctggagggctacaaggagccctgtggggggctggaggatgctgtgaacgaggctaag
cacctgggcgtcaaagtcttctcggtggccatcacacccgaccacctggagccgcgtctg
agcatcatcgccacggaccacacataccggcgcaacttcacggcggctgactggggccag
agccgcgacgcagaggaggctatcagccagaccatcgacaccatcgtggacatgatcaaa
aataacgtggagcaagtgtgctgctccttcgaatgccagcctgcaagaggacctccgggg
ctccggggcgaccccggcttcgagggagaacgaggcaagccggggctcccaggagagaag
ggagaagctggagatcctggaagacccggggacctcggacctgttgggtaccaggggatg
aagggagaaaaagggagtcgtggggagaagggctccaggggacccaagggctacaaggga
gagaagggcaagcgtggcatcgacggggtggacggcgtgaagggggagatggggtaccca
ggcctgccaggctgcaagggctcgcccgggtttgatggcattcaaggaccccccggcccc
aagggagaccctggcgcctttggactgaaaggagaaaagggcgaacctggagctgacggg
gaggcagggagaccagggagctcaggaccacctggagacgagggccagccgggagagcct
gggccccccggagagaaaggagaggcgggcgacgaggggaacccaggacctgacggtgcc
cccggggagcggggcggccctggagagggaggaccacaggggaccccaggcacgcgggga
ccaagaggagaccctggtgaagctggcccgcagggtgatcagggaagagaaggccctgtt
ggtgtccctggagacccgggcgaggctggccctattggacctaaaggctaccgaggcgat
gagggtcccccagggtccgagggtgccagaggagccccaggacctgccggaccccctgga
gaccccgggctgatgggtgaaaggggagaagacggcccccctggaaatggcaccgagggc
ttccctggcttccctgggtatccgggcaacaggggccctcccgggataaacggcacgaag
ggctaccccggcctcaagggggacgagggggaagccggggaccccggagatgataacaat
gacattgcaccccgaggagtcaaaggagcaaaggggtaccggggtcccgagggcccccag
ggacccccaggacacccaggaccgcctgggccggatgaatgcgagattttggacatcatc
atgaaaatgtgctcttgctgtgaatgcaagtgcggccccatagatctcctgttcgtgctg
gacagctcagagagcatcggcctgcagaacttcgagattgccaaggacttcgtcgtcaaa
gtcatcgaccggctgagccgggatgagctggtcaagttcgagccagggcagtcatacgcc
ggtgtggtgcagtacagccacagccagatgcaggagcacgtgagcctgcgcagccccagc
atcaggaacgtgcaggagctcaaggaagccatcaagagcctacagtggatggctggcggc
accttcaccggcgaggccctgcagtacacgcgggaccagctgctgccgcccagcccaaac
aaccgcatcgccctggtcatcactgacgggcgctcagacactcagagggacaccacacca
ctcaacgtgctctgcagccccggcatccaggtggtctcagtgggcatcaaagacgtgttt
gacttcgtcccaggctccgaccagctcaatgtcatttcttgccaaggcctggcaccgtcc
cagggccggcccggcctctcgctggtcaaggagaactacgcagagctgctggaggacgcc
ttcctgaagaatgtcaccgcccagatctgcatagacaagaagtgtccagattacacctgc
cccatcacgttctcctccccggctgacatcaccatcctgctggacggctccgccagcgtg
ggcagccacaattttgacaccaccaagcgcttcgccaagcgcctggccgagcgcttcctc
acagcgggcaggacggaccccgcccacgatgtgcgggtggcagtggtacagtacagcggc
acgggccagcagcggccagagcgggcgtcgctgcagttcctgcagaactacacggccctg
gccagtgccgtcgatgccatgggctttatcaacgacgccaccgacgtcaacgatgccctg
ggctatgtgacccgcttctaccgcgaggcctcatccggcgctgctaagaagaggctgctg
ctcttctcagatggcaactcacagggcgccacgcccgctgccattgagaaggccgtgcag
gaggcccagcgggcgggcatcgagatcttcgtggtggttgtgggccaccaggtgaatgag
ccccacatccgcgtcctggtcaccggcaagacggccgagtatgacgtggcctacggcgag
cgccacctgttccgtgtccccagctaccaggccctgctccgcggtgtcttccaccagaca
gtctccaggaaggtggcgctgggctag

KEGG   Pongo abelii (Sumatran orangutan): 100461602
Entry
100461602         CDS       T01416                                 

Gene name
COL6A2
Definition
(RefSeq) collagen alpha-2(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100461602 (COL6A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100461602 (COL6A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100461602 (COL6A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100461602 (COL6A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100461602 (COL6A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100461602 (COL6A2)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100461602 (COL6A2)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100461602 (COL6A2)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100461602 (COL6A2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   100461602 (COL6A2)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3 Ins134_P3_kin_N ParE-like_toxin
Other DBs
NCBI-GeneID: 100461602
NCBI-ProteinID: XP_024095114
LinkDB
Position
21
AA seq 1019 aa
MLQGPCSVLLLWGILGAIQAQQQEVISPDTTERNNNCPEKTDCPIHVYFVLDTSESVTMQ
SPTDILLFHMKQFVPQFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFIK
NLQGISSFRRGTFTDCALANMTEQIRQDRSKGTVHFAVVITDGHVTGSPCGGIKLQAERA
REEGIRLFAVAPNQNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRIIKVMKH
EAYGECYKVSCLEIPGPPGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEGPIGFPGPK
GVPGFKGEKGEFGADGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDPGNRGPDGYPGEA
GSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNNGAPGSPGVKGAKGGPGPRGPK
GEPGRRGDPGTKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPGPRGPQGAL
GEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPGQKGEPGPRGPEGGRG
DFGLKGEPGRKGEKGEPADPGPPGEPGPRGPRGVPGPEGEPGPPGDPGLTECDVMTYVRE
TCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRV
GVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLIKESRR
QKTRVFAVVITDGRHDPRDDDLNLRALCDRDVTVTAIGIGDMFHEKHESENLYSIACDKP
QQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPDLPCQTELSVAQCTQRPVDVVFLLDG
SERLGEQNFYKARRFVEQVARRLTLARRDDDPLNARVALLQFGGPGEQQVAFPLSHNLTA
IHEALEAAQYLNSFSHVGAGVVHAINTIVRSPRGGARRHAELSFVFLTDGVTGNDSLHES
AHSMRKQNVVPTVVALGSDVDMDVLTTLSLGDRAAVFREKDYDSLAQPGFFDRFIRWIC
NT seq 3060 nt   +upstreamnt  +downstreamnt
atgctccagggcccctgctccgtgctcctgctctggggaatcctgggggccatccaggcc
cagcagcaggaggtcatctcgccggacactaccgagagaaacaacaactgcccagagaag
accgactgccccatccatgtgtacttcgtgctggacacctcggagagcgtcaccatgcag
tcccccacggacatcctgctgttccacatgaagcagttcgtgccgcagttcatcagccag
ctgcagaacgagttctacctggaccaggtggcactgagctggcgctacggcggcctgcac
ttctccgaccaggtggaggtgttcagcccaccgggcagcgaccgcgcctccttcatcaag
aacctgcagggcatcagctccttccgccgtggcaccttcaccgactgcgcgctggccaac
atgacagagcagatccggcaggaccgcagcaagggcaccgtccacttcgccgtggtcatc
accgacggccacgtcaccggcagcccctgcggtggcatcaagctgcaggccgagcgggcc
cgcgaggagggcatccggctcttcgccgtggcccccaaccagaacctgaaggagcagggc
ctgcgggacatcgccagcacgccgcacgagctctaccgcaacgactacgccaccatgctg
cccgactccaccgagatcgaccaggacaccatcaaccgcatcatcaaggtcatgaaacac
gaagcctacggagagtgctacaaggtgagctgcctggaaatccctgggccccctggtccc
aagggctaccgtggacagaagggtgccaagggcaacatgggtgagccgggagagcctggc
cagaagggaagacagggagacccgggcatcgaaggccccattggattcccaggacccaag
ggcgttcctggcttcaaaggagagaagggtgaatttggagccgacggtcgcaagggggcc
cctggcctggctggcaagaacgggaccgatggacagaagggcaagctggggcgcatcgga
cctcctggctgcaagggagaccctggaaaccggggccccgacggttacccgggggaagca
gggagtccgggggagagaggagaccaaggcggcaagggggaccctggccgcccaggacgc
agagggcccccgggagaaatcggggccaagggaagcaaggggtatcaaggcaacaacgga
gccccaggaagtcctggcgtgaaaggagccaagggcgggcccgggccccgcggacccaaa
ggcgagccggggcgcaggggagaccccggcaccaagggcagcccaggcagcgacggcccc
aagggggagaagggggatcctggccctgaggggccccgcggcctggctggagaggttggc
aacaaaggagccaagggagaccgaggtttgcctggacccagaggcccccagggagctctt
ggggagcccggaaagcagggatctcggggagaccccggtgatgcaggaccccgtggagac
tcaggacagccaggccccaagggagaccccggcaggcctggattcagctacccaggaccc
cgaggagcacccggacaaaaaggcgagcccggcccacgcggccccgagggaggccgaggc
gactttggcttgaaaggagaacctgggaggaaaggagagaaaggagagcctgcggatcct
ggtccccctggtgagccaggccctcgggggccaagaggagtcccaggacccgagggtgag
cccggcccccctggagaccccggcctcacagagtgtgacgtcatgacctacgtgagggag
acctgcgggtgctgcgactgcgagaagcgctgtggcgccctggacgtggtcttcgtcatc
gacagctccgagagcattgggtacaccaacttcacgctggagaagaacttcgtcatcaac
gtggtcaacaggctgggtgccattgctaaggaccccaagtccgagacagggacgcgcgtg
ggcgtggtacagtacagccacgagggcaccttcgaggccatccagctggacgacgaacgt
atcgactccctgtcgagcttcaaggaggctgtcaagaacctcgagtggattgcgggcggc
acctggacaccctcggccctcaaatttgcttacgaccgcctcatcaaggagagccggcgc
cagaagacacgcgtgtttgcggtggtcatcacggacgggcgccacgaccctcgggacgat
gacctcaacctgcgggcgctgtgcgaccgcgacgtcacagtgacagccatcggcatcggg
gacatgttccacgagaagcacgagagcgagaacctctactccatcgcctgcgacaagccg
cagcaggtgcgcaacatgacactgttctccgacctggtcgccgagaagttcatcgatgat
atggaggacgtcctctgcccggaccctcagatcgtgtgcccagaccttccctgccaaaca
gagctgtccgtggcacagtgcacgcagcggcccgtggacgtcgtcttcctgctggacggc
tccgagcggctaggtgagcagaacttctacaaggcccggcgcttcgtggagcaggtggcg
cggcggctgacgctggcccggagagacgacgaccctctcaacgcgcgcgtggcgctgctg
cagtttggcggccccggcgagcagcaggtggccttcccgctgagccacaacctcacggcc
atccacgaggctctggaggctgcgcaatacctgaactccttctcacacgtgggcgcgggc
gtggtgcacgccatcaacaccatcgtgcgcagcccgcgtggcggggcccggaggcacgca
gagctgtctttcgtgttcctcacggatggcgtcacgggcaacgacagcctgcacgagtcg
gcgcactccatgcgcaagcagaacgtggtgcccaccgtggtggccttaggcagcgacgtg
gacatggacgtgctcaccacgctcagcctgggtgaccgcgccgccgtgttccgcgagaag
gactatgacagcctggcacaacccggcttcttcgaccgcttcatccgctggatctgctag

KEGG   Pongo abelii (Sumatran orangutan): 100461836
Entry
100461836         CDS       T01416                                 

Gene name
THBS4
Definition
(RefSeq) thrombospondin-4
  KO
K04659  thrombospondin 2/3/4/5
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04145  Phagosome
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05144  Malaria
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100461836 (THBS4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100461836 (THBS4)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    100461836 (THBS4)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100461836 (THBS4)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100461836 (THBS4)
  09174 Infectious disease: parasitic
   05144 Malaria
    100461836 (THBS4)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pon04131]
    100461836 (THBS4)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100461836 (THBS4)
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100461836 (THBS4)
   04990 Domain-containing proteins not elsewhere classified [BR:pon04990]
    100461836 (THBS4)
Membrane trafficking [BR:pon04131]
 Endocytosis
  Phagocytosis
   Opsonins
    100461836 (THBS4)
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100461836 (THBS4)
  Exosomal proteins of colorectal cancer cells
   100461836 (THBS4)
  Exosomal proteins of bladder cancer cells
   100461836 (THBS4)
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100461836 (THBS4)
Domain-containing proteins not elsewhere classified [BR:pon04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   100461836 (THBS4)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA cEGF EGF_3 EGF
Other DBs
NCBI-GeneID: 100461836
NCBI-ProteinID: XP_024103393
Ensembl: ENSPPYG00000015593
LinkDB
Position
5
AA seq 1002 aa
MGHLRRRPGPRPRFNARPGDLQAGPTPPSPPASRGAGRANMLAPRGAAVLLLHLVLQPWL
AAGAQATPQVFDLLPSSSQRLNPGALLPVLTDPALNDLYVISTFKLQTKSSATIFGLYSS
TDNTKYFEFTVMGRLNKAILRYLKNDGKVHLVVFNNLQLADGRRHRILLRLSNLQRGAGS
LELYLDCIQVDSIHSLPRAFAGPSQKPETIELRTFQRKPQDFLEELKLVVRGSLFQVASL
QDCFLQQSEPLPATGTGDFNRQFLGQMTQLNQLLGEVKDLLRQQVKETSFLRNTIAECQA
CGPLKFQSPTPSSTVVPPAPPAPPTRPPRRCDSNPCFRGVQCTDSRDGFQCGPCPEGYTG
NGITCIDVDECKYHPCYPGVRCINLSPGFRCDACPVGFTGPMVQGVGISFAKSNKQVCTD
IDECRNGACVPNSICVNTLGSYRCGPCKPGYTGDQIRGCKAERNCRNPELNPCSVNAQCI
EERQGDVTCVCGVGWAGDGYICGKDVDIDSYPDEELPCSARNCKKDNCKYVPNSGQEDAD
RDGIGDACDEDADGDGILNEQDNCVLIHNVDQRNSDKDIFGDACDNCLSVLNNDQKDTDG
DGRGDACDDDMDGDGIKNILDNCPKFPNRDQRDKDGDGVGDACDSCPDVSNPNQSDVDND
LVGDSCDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKDGIGDECDDDDDNDGIPDLVPPG
PDNCRLVPNPAQEDSNSDGVGDICESDFDQDQVIDRIDVCPENAEVTLTDFRAYQTVVLD
PEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTQTDDDYAGFI
FGYQDSSSFYVVMWKQTEQTYWQATPFRAVAEPGIQLKAVKSKTGPGEHLRNSLWHTGDT
SDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVRFYEGSELVADSGVTIDTTMRGG
RLGVFCFSQENIIWSNLKYRCNDTIPEDFQEFQTQNFDRFDN
NT seq 3009 nt   +upstreamnt  +downstreamnt
atgggtcacctgcgccgccggcccgggccccgaccgaggttcaacgcacggcccggggac
ctccaggcggggccgacgccgccgtcgcccccggcctcgcggggagcaggaagagccaac
atgctggccccgcgcggagccgccgtcctcctgctgcacctggtcctgcagccgtggcta
gcggccggcgcccaggccaccccccaggtttttgaccttctcccatcttccagtcagagg
ctaaacccaggcgctctgctgccggtcctgacagaccccgccctgaatgatctctatgtg
atttccaccttcaagctgcagactaaaagttcagccaccatcttcggtctttactcttca
actgacaacactaaatattttgaatttactgtgatgggacgcttaaacaaagccatcctc
cgttacctgaagaacgacgggaaggtgcatttggtggttttcaacaacctgcagctggca
gacggaaggcggcacaggatcctcctgaggctgagcaatttgcagcgaggggccggctcc
ctagagctctacctggactgcatccaggtggattccattcacagtctccccagggccttt
gctggcccctcccagaaacctgagaccattgaattgaggactttccagaggaagccacag
gacttcttggaagagctgaagctggtggtgagaggctcactgttccaggtggccagcctg
caagactgcttcctgcagcagagtgagccgctgcctgccacaggcacaggggactttaac
cggcagttcttgggtcaaatgacacaattaaaccaactcctgggagaggtgaaggacctt
ctgagacagcaggttaaggaaacatcatttttgcgaaacaccatagctgaatgccaggct
tgcggtcctctcaagtttcagtctccaaccccaagcagcacagtggtgcccccggcaccc
cctgcaccgccaacacgcccacctcgtcggtgtgactccaacccatgtttccgaggtgtc
caatgtaccgacagcagagatggcttccagtgtgggccctgccccgagggctacacagga
aacgggatcacctgtattgatgttgatgagtgcaaataccatccctgctacccgggtgtg
cgctgcataaatttgtctcctggcttcagatgtgacgcctgcccagtgggcttcacaggg
cccatggtgcagggtgttgggatcagttttgccaagtcaaacaagcaggtctgcactgac
attgatgagtgtcgaaatggagcgtgcgttcccaactcgatctgcgttaatactttggga
tcttaccgctgtgggccttgtaagccagggtatactggtgatcagataaggggatgcaaa
gcggaaagaaactgcagaaacccagagctgaacccttgcagtgtgaatgcccagtgcatt
gaagagaggcagggggatgtgacatgtgtgtgtggagttggttgggctggcgatggctat
atctgtggaaaggatgtggacatcgacagttaccccgacgaagaactgccgtgctctgcc
aggaactgtaaaaaggacaactgcaagtatgtgccaaattctggccaagaagatgcagac
agagatggtattggcgacgcttgtgacgaggatgctgacggagatgggatcctgaatgag
caggataactgtgtcctgattcacaatgtggaccaaaggaacagcgataaagatatcttt
ggggatgcctgtgataactgcctgagtgtcttaaataacgaccagaaagacacggatggg
gatggaagaggagatgcctgtgatgatgacatggatggagatggaataaaaaacattctg
gacaactgcccaaaatttcccaatcgtgaccaacgggacaaggatggtgatggtgtgggg
gatgcctgtgacagttgtcctgatgtcagcaaccctaaccagtccgatgtggataatgat
ctggttggggactcctgtgacaccaatcaggacagtgatggagatggtcaccaggacagc
acagacaactgccccaccgtcattaacagtgcccagctggacaccgataaggatggaatt
ggtgacgagtgtgatgatgatgatgacaatgatggtatcccagacctggtgccccctgga
ccagacaactgccggttggtccccaacccagcccaggaggatagcaacagcgatggagtg
ggagacatctgtgagtctgactttgaccaggaccaggtcatcgatcggatcgacgtctgc
ccagagaacgcagaggtcaccctgaccgacttcagggcttaccagaccgtggtcctggat
cctgaaggggacgcccagatcgatcccaactgggtggtcctgaaccagggcatggagatt
gtgcagaccatgaacagtgatcctggcctggcagtggggtacacggcttttaatggagtt
gacttcgaagggaccttccatgtaaatacccagacagatgatgactatgcaggctttatc
tttggctaccaagacagctccagcttctacgtggtcatgtggaagcagacggagcagaca
tattggcaagccaccccattccgagcggttgcagaacctggcattcagctgaaggctgtg
aagtctaagacaggtccaggggagcatctccggaactccctgtggcacacgggggacacc
agtgaccaggtcaggctgctgtggaaggactccaggaatgtgggctggaaggacaaggtg
tcctaccgctggttcctacagcacaggccccaggtgggctacatcagggtacgattttat
gaaggctctgagttggtggctgactctggagtcaccatagacacgacaatgcgtggaggc
cgacttggcgttttctgcttctctcaagaaaacatcatctggtccaacctcaagtatcgc
tgcaatgacaccatccctgaggacttccaagagtttcaaacccagaatttcgaccgcttc
gataattaa

KEGG   Pongo abelii (Sumatran orangutan): 100461955
Entry
100461955         CDS       T01416                                 

Definition
(RefSeq) collagen alpha-4(VI) chain-like
  KO
K06238  collagen type VI alpha
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon04974  Protein digestion and absorption
pon05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100461955
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100461955
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100461955
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    100461955
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100461955
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pon04147]
    100461955
   00536 Glycosaminoglycan binding proteins [BR:pon00536]
    100461955
Exosome [BR:pon04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   100461955
Glycosaminoglycan binding proteins [BR:pon00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100461955
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   100461955
SSDB
Motif
Pfam: VWA VWA_2 Collagen Kunitz_BPTI VWA_CoxE SRPRB IncFII_repA
Other DBs
NCBI-GeneID: 100461955
NCBI-ProteinID: XP_024100431
LinkDB
Position
3
AA seq 2146 aa
METWKIFWGIILLEAGFGFIKSQRIVCREASLGDVVFLVDTNINSQHARSVRNFLYILVN
SFNVSSETIRVGLAQYSDVPHSEFLLSTYHRKGDVLRHIRQFQFKPGGKKMGLALKFILD
HHFQEASGSRASQGVPQIAMVISSGPVEDNVHGPAEALRRAGILLYAVGVRDAVWAELRE
IASSPQENFTSFVPNFSGLSNLAQKLRQELCDTLAKAAPHVDHVSPACREAALADIVFLV
DSSTSIGPQNFQKVKNFLYSVILGLDISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMVLE
QIQNLPYRTGGTNTGSALEFIRTNYLTEESGSRAKDRVPQIVILVTDGESNDEVQEVADR
LKEDGVVVYVVGVNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAVEGK
IKACATAIPADLVFLIEEFSRVRQPNFQQVVNFLKTIVSSLSIRPDTVRFGLVFYSEEPR
LEFSLDAFQNPAKILEHLDKLTYRERKGRTKTGAALDFLRNEVFIQEKGSRSNHGVQQIA
VVITEGFSQDSVSRPASRLRRAGITIYAVGTHNVSESKDLEKIASYPPWKYSVPLESFLQ
LSIVGSKLTNQLCSEMVDSKVSISGTSYPLHEGCVHIEKADIYFLIDGSGSIYPEDFLEM
KVFMNEVIKMFQIGPNRVQFGVIQYSDKIKSKFILSQYPSVAELKVAIDNIQQGGGGTTT
GEALNNMTQVFADTGRINVARYLIVITDGKSSDPVAEAAEGLRANGVNIYAIGIREANID
ELKEIAKDKIFFVYEFDLLKDIQKEVVQDICSSEACKNSKADIIFLIDGSESISPKDFEK
MKRFVESMVNQSNIGTDGIQIGLLQFSSSPQEEFRLNQYYSKADIYSAIFDVQQMRDGTR
TGKALNFTLPFFDSSKGGRPSVQQYLIVITDGVAQDNVITPAKALRDKNIIIFAIGVGEA
KKSQLVEITNDEDKVYHDVNFEALQNLEKEILSKVCDPQGCNLDLSVGIDISTPSKKVQQ
KLQRLLPELMKQLALLSNISCGISDHINPRFRYLVPGSNGQLIFDSGFEKYSDEIIQKFL
VHQAARNNNMDVNFLQSLGDNAIHLSSAKVKVLLVFTDGLDDDLKRLKKTSEFLHSRGFS
GLLIIGLEGVHKLEELQELEFGRGFAYKQPLSITLRSLPSVLLKQLDTIVERTCCNIYAK
CFREDGYRGDYGSSGRKGEKGFDGLPGHPGEEGRYGERGPRGLPGLRGEEGCPGMRGLKG
ARGFSGETGNPGEEGVDGLDGEQGNRGIPGSSGEKGNRGNRGLTGPPGQPGECGKPGLRG
DPGDPGTNSYIQGPKGEKGRRGSQGSSNVDGPQGETGNVGPWGTIGRRGLPGLKGVPGES
GEQGYQGELGYPGSQGPRGRQGPPGTSGQKGLVGAQGNPGPPGTNGSKGKAGPRGVKGEI
GVAGQRGPQGRQGPRGQPGLPGSDGYGHPGRKGTKGEPGFPGYPGAQGEDGDPGHQGEKG
AKGIRGKRGNAGFPGFVGTSGDQGPPGQMGIKGPKVLVDMTPCKIVDFTRENCLCSRGTS
KCPAFPTELAFALDMSNDVSPLDFERMRDILLSLLMKMEISDSNCPTGAHVAIVSYNAKT
NYLVRFSDYKRKVALLEAVRNITLEQSSGRRKLGATMRFMARHVFKRVRSGLLMRKVAVF
FQAGWAYDAASINTATLELAAVGIIPVVITFTEGHNLPGTLLMDGTNRFYLFIWETERQQ
DVEHMARCILCYDKCRPDPGCPAGAPGPQEVDVDLVFVLDSSYGVDADVYRGSLSLVDAA
LEDLEVAEQPSASHRGARVALVTHTTPNFWPGVGRLPVREGFHLTTYGNRTQMQRHVREA
SARPLQGTPALGHALEWTLENVLLAAPRPRKAQVLFAIVASETSSWDREKLWTLSLEAKC
KGITLFVLALGPGVGTHELAELAELVSAPSEQHLLRLQGVSEPEVNYARGFTRAFLNLLK
SGTNQYPPPELTEECGGLHRGDTVLQLVTPVNRLPRHQFGMSGLADDLEALEATGVFLEE
KRKDITTSITQQEALENYENNKYDIEENEQETPAKQKGTRKEINADTTFGPCSMDPMEGE
CQDYTLKWYYNKEEQVCQQFWYGSCGGNANRFETKEECEAQCVPIQ
NT seq 6441 nt   +upstreamnt  +downstreamnt
atggagacttggaagatattttgggggatcatcctccttgaagctggttttggcttcatc
aagtcacagaggattgtctgcagggaggcctctctgggagatgttgtgtttctggtggac
accaacatcaactcccaacatgcacgcagtgtgcggaacttcttgtacattctggtaaac
agcttcaatgtcagcagtgagaccatccgtgtgggtctggcccagtatagtgatgtgccc
cattcagagttcttgctttccacctaccaccgcaaaggtgatgtgttgagacacattcgg
cagtttcaatttaagcctgggggcaagaagatgggcctggccctgaagttcattctagac
caccacttccaggaagcatcagggagccgggcaagccaaggggtacctcagatcgccatg
gtgatcagcagtgggccagttgaggacaatgtccatggacctgcggaggcattgaggagg
gcaggcatcctgctttatgctgttggtgtcagagatgcagtttgggcagagctcagggaa
attgccagtagtcctcaggagaattttacctcctttgttcctaacttctctggtctgagc
aatcttgcccagaagctgcggcaggaactctgtgatacgttggcaaaggcagctccacat
gttgaccacgtctctccagcttgcagagaagcagccctggcagacattgtgtttctagta
gacagttcaactagcatcggaccccaaaacttccagaaagtcaagaacttcctttactcc
gtcatcttggggcttgacatcagcagtgaccgtgtccgagtgggacttgcccagtataat
gacaatatctacccagccttccagctgaaccagcaccctctgaagagcatggtcttggag
cagatccagaatctgccataccgcacaggaggcacaaacacagggagcgccctggagttt
atcaggaccaactatttgactgaggagtccggcagtcgggccaaggacagagttcctcag
atagttatcctggtgactgacggggagtcaaatgatgaggtccaggaggtggctgaccga
ttgaaagaagatggagttgttgtgtatgtggtaggggtcaatgtccaggatgtccaagag
ttgcaaaaaatagccagtgagccatttgagaagtttctcttcaacaccgaaaacttcaat
atcctgcaggatttttcgggaagtattcttcagactctgtgctcagcagtggagggtaag
ataaaagcatgtgcaactgctatcccggctgacttagtattcctcattgaggaatttagc
agggttaggcaacccaatttccaacaagttgtcaatttcctaaagaccattgtcagttct
ctaagcattcgtcctgatactgtgagatttggcttggtcttctacagtgaggaaccacga
cttgaattttcactggatgcgtttcagaatccggccaaaatcttggagcatttggacaaa
ttaacctaccgggaaagaaaaggaaggacgaagactggtgctgcgttagatttcctgagg
aacgaggttttcattcaggagaagggtagccggtccaaccacggtgtgcagcagatagct
gtagtcatcacggaaggcttctcccaagacagcgtatctagacccgcttctcgcctccga
agggcaggcatcaccatctatgcagtgggcacccacaatgtctcagagagtaaggacctg
gagaagatagcatcgtaccctccttggaagtattcagtccccctggaatcctttttgcag
ctctccatagtaggaagcaagcttacaaaccagctttgctctgagatggtggacagtaaa
gtttccattagtgggacgagctatcccctacacgaaggttgtgtgcacatcgaaaaggca
gatatttacttccttattgatgggtctggcagcatctatccagaagattttcttgagatg
aaggtgttcatgaatgaagtgataaagatgttccagatcgggcccaacagagtacagttt
ggagtcattcagtactcagacaaaattaaaagtaaatttatcctcagccagtatcccagt
gtggcagagctgaaggtagccattgataacatccagcaggggggaggtggcaccacaacc
ggtgaggccttgaacaacatgactcaggtttttgcagacaccggccgaatcaatgttgct
cgataccttatagtcatcactgatggtaaatcttcagacccggtggctgaggctgcagag
ggattgagggcaaatggagttaacatttatgccattggaataagagaagctaacattgat
gagcttaaggaaatagctaaagacaagatattttttgtgtatgagtttgatttactgaag
gacatccaaaaagaagtggtacaggacatctgctcctcagaggcctgtaagaatagtaaa
gctgacattatcttcctgatagatggttcagaatctatctccccaaaagactttgaaaag
atgaaaagatttgtggagagcatggtgaaccaatctaatattggtactgatggaattcaa
attggccttttgcagttcagctcaagcccccaggaagaattcaggctcaaccaatactac
tcaaaggcagacatttacagtgccatctttgatgttcagcagatgagggatggcacccgc
actgggaaagctctgaatttcactctgcctttttttgacagttcaaaaggagggagaccc
agtgttcaacaatatttgattgtgatcaccgatggggtcgcccaagataatgtaatcaca
ccagccaaagccctcagggacaaaaacataattatttttgccattggggtgggagaagcc
aaaaaatcgcagcttgtggagatcactaatgatgaggacaaagtgtaccatgatgtaaat
tttgaggccctgcagaacctggagaaggaaattctttctaaggtctgtgatccacaagga
tgcaacctggatttgtctgtaggaattgatatctcaactccttcaaaaaaagttcagcag
aagcttcaaaggttactgccagagctgatgaaacagttggctttactttctaacatcagc
tgtggcatttctgatcacatcaacccaagattccgctacttggttcctggctcaaatggc
cagcttatctttgactcaggctttgaaaaatacagtgatgagatcatccagaagttcttg
gttcatcaggctgccaggaataacaatatggatgtgaactttttgcagtccttgggagat
aatgctattcatctctcttctgctaaagtaaaggtccttttagtgtttacagatggactg
gatgatgatttaaagagactgaaaaaaacatcggagtttctccacagcagaggattctct
ggactcctaatcattggcctggaaggtgtccataaattagaagagctccaggagctggaa
tttggcagagggtttgcatataagcaacctctgagcatcacactgcgatccctcccaagc
gtcttgctgaagcaacttgatacaattgtggaaagaacatgctgcaatatatatgcaaag
tgttttagagaagatgggtacagaggtgattatgggagttctgggaggaagggagagaag
ggttttgatgggttacccggtcatcctggtgaggaaggcagatatggagaacgaggtccc
cgaggtcttcctggactccgaggtgaggaaggatgccctggtatgaggggacttaaggga
gcaagaggattttcaggagagacgggcaatcctggtgaggaaggagttgatggcttggat
ggagaacagggtaatcgcggaatcccagggtcatctggagaaaaaggaaataggggaaat
cggggcttgacgggaccgccaggacaacctggagagtgtggaaagcctgggttaagggga
gatcctggggatcctggaactaatagctatatccaaggccctaagggagaaaaaggaagg
cgtgggagtcagggaagttctaatgttgatggacctcagggagaaactggaaatgtcggc
ccttgggggacaataggaaggcgaggtctgccaggattaaagggtgtgcctggagaatcc
ggtgaacagggttaccaaggagagcttgggtatccaggctcacagggaccaagaggaagg
caaggaccaccaggaacttctggacaaaaaggcttagtgggtgctcaggggaatcctggg
cctccagggacaaatggttcaaaaggaaaagctggaccaagaggagtgaagggagagatt
ggtgttgcaggacaaagaggcccacagggtcgacaaggaccaagagggcaacctggtctt
cctggttctgatggatatggacatcctggaagaaaaggaacaaagggtgaacctggattc
cctggctatcctggtgcacaaggagaagatggtgaccctggccatcaaggagaaaagggg
gcaaagggaataagagggaagaggggtaatgctggctttcctggatttgttggaacttcg
ggtgaccaaggcccaccaggacaaatgggtatcaagggccccaaagttttggtagatatg
acgccttgcaaaattgttgatttcacacgtgaaaactgcctttgttcaagaggtacttcc
aaatgcccagcattcccaaccgaattggcctttgccttggacatgtcaaatgatgtctcc
ccattggattttgagaggatgagagacattttattgtctctgttgatgaagatggaaata
agtgacagtaactgcccaacaggtgcccatgtggccattgtttcctacaatgccaaaacc
aattacttggttcgtttctcagactacaagaggaaggttgcactcttggaggctgtcagg
aacatcaccctggagcagtcatccggccgcaggaaactcggggccaccatgaggtttatg
gcgagacatgtgttcaaacgtgtacgctcgggtcttctcatgaggaaggtggctgtgttc
ttccaggcgggctgggcctatgatgcagcttccatcaatacagccacactggagcttgct
gcagtgggcatcatccccgtggtcatcaccttcacagaggggcacaacctcccaggcacc
ctgctgatggatggaaccaacagattttacttgttcatctgggagacagagcgccagcag
gatgtggagcacatggcccgctgcattctctgctacgacaaatgcagaccagacccagga
tgcccggctggcgcgccggggccccaggaggtggacgtggacttggtgtttgtgttggac
agctcctatggagtggatgccgacgtgtaccgcgggtctttgagtctagtggacgctgcg
ctagaagacctggaggtggctgagcagccgagcgcgtcccaccgtggggcgcgtgtggcc
ctggtgacgcacacgacacccaacttctggccgggtgtgggtcgcctccctgtgcgcgag
ggcttccacttgaccacctatggcaaccggacgcagatgcagagacatgtgcgcgaggct
tcagcccgccccttacagggaacccccgccctgggccacgccctggagtggacgctggag
aatgtgctcctggcagcccctcggccgcggaaggcacaagtcctcttcgccatcgtggcc
agcgagacaagtagctgggacagggagaaattatggactctgtccctggaggccaaatgc
aagggcatcaccctctttgtgctggccttgggtccgggtgtggggacccatgagctagcc
gagctagccgagctggtcagtgctccctctgagcagcatctactgcgcctacaaggggtc
tcggagccagaggttaactacgctcggggattcactcgggccttcctgaacctcctaaaa
agtgggacaaaccagtacccacctccagagctcactgaagaatgtgggggcctacaccgt
ggggacactgtgctgcaattagtcacacctgtcaacaggttgcccaggcaccagtttggt
atgtctggcttggctgatgatttggaagcacttgaagcaacaggcgtttttctagaagag
aagagaaaagacatcacaacatctataactcagcaagaagcacttgaaaattatgaaaat
aacaaatatgacattgaagaaaatgaacaagaaacaccagcaaaacaaaaaggaactaga
aaagaaataaatgcagacactacctttggtccttgttccatggatccaatggaaggcgag
tgtcaggattacaccctgaagtggtattacaacaaggaggaacaggtttgccagcagttc
tggtatggcagctgtgggggcaatgccaaccggtttgaaaccaaggaagaatgtgaggct
cagtgtgtcccaatacagtaa

KEGG   Pongo abelii (Sumatran orangutan): 100461994
Entry
100461994         CDS       T01416                                 

Gene name
LAMB4
Definition
(RefSeq) LOW QUALITY PROTEIN: laminin subunit beta-4
  KO
K06245  laminin, beta 4
Organism
pon  Pongo abelii (Sumatran orangutan)
Pathway
pon04151  PI3K-Akt signaling pathway
pon04510  Focal adhesion
pon04512  ECM-receptor interaction
pon05145  Toxoplasmosis
pon05146  Amoebiasis
pon05165  Human papillomavirus infection
pon05200  Pathways in cancer
pon05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pon00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100461994 (LAMB4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100461994 (LAMB4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100461994 (LAMB4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100461994 (LAMB4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100461994 (LAMB4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100461994 (LAMB4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100461994 (LAMB4)
   05145 Toxoplasmosis
    100461994 (LAMB4)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N
Other DBs
NCBI-GeneID: 100461994
NCBI-ProteinID: XP_024105656
Ensembl: ENSPPYG00000017908
LinkDB
Position
7
AA seq 1761 aa
MQFQLTLFLHLGWLSYSKAQDDCNRGACHPTTGDLLVGRNMQLTASSTCGLSRAQKYCIL
SYLEGEQKCFICDSRFPYDPYAQPNSHTIENVIVSFEPDREKKCWQSENGLDHVSIRLDL
EALFRFSHLILTFKTFRPAAMLVERSTDYGHTWKVFKYFAKDCATSFPNITSGQAQGVGD
IVCDSKYSDIEPSTGGEVVLKVLDPSFETEXPYSPYIQDLVTLTNLRINFTKLHTLGDTL
LGRRQHDSLDKYYYALYEMIVRGSCFCNGHASECRPMQKMRGDVFSPPGMVHGQCVCQHN
TDGPNCERCKDFFQDVPWRPAADLQDNACRSCSCNSHSSRCHFDMTTYLASGGLSGGVCE
DCQHNTEGQHCDRCRPLFYRDPLKTISDPYACIPCECDPDGTISGGICVSHSDPALRSVA
GQCLCKENVEGAKCDQCKPNHYGLSATDPLGCQPCDCNPLGSLPFLTCDVDTGQCLCLSY
VTRAHCEECTVGYWGLGNHLHGCSPCDCDIGGAYSNVCSPKNGQCECRPHVTGRSCSEPA
PGYFFAPLNFYLYEAEEATPLQGLAPLGSETFGQSPAVHVVLGEPVPGNPVTWTGPGFAR
VLPGAGLRFAVNNIPFPVDFTIAIRYETQSAADWTVQIVVNPPGGSEHCIPKSPQSKPQS
FALPVATRIMLLPTPICLEPDVQYSIDVYFSQPLEGESHAHSHVLVDSLGLIPQINSLEN
FCSKQDLDEYQLHNCVEIASAMGPQVLLGACERLIISMSAKLHDGAVACKCHPQGSVGSS
CSRLGGQCQCKPPVVGRCCDRCSTGSYDLGHHGCHPCHCHPQGSKDTVCDQVTGQCPCHG
EVSGRRCDRCLAGYFGFPSCRPCPCNGFAELCDPETGSCFNCGGFTTGRNCERCIDGYYG
NPSSGQPCRPCLCPDDPSSNQYFAHSCYQNLWSSDVICNCLQGYTGTQCGECSTGFYGNP
RISGAPCRPCACNNNIDVTDPESCSRVTGECLRCLHNTQGANCQLCKPGHYGSALNQTCR
RCSCHASGVSPMECPPGGGACLCDPVTGACPCLPNVTGLACDRCADGYWNLVPGRGCQSC
DCDPRTSQSSHCDQLTGQCPCKLGYGGKRCSECQENYYGDPPGRCIPCDCNRAGTQKPIC
DPDTGMCHCREGVSGQRCDRCARGHSQEFPTCLQCHLCFDQWDHTISSLSKAVQGLMRLA
ANMEDKRETLPVCEADFKGLRGNVSEIERILKHPVFPSGKFLKVKDYHDSVRKQIMQLNE
QLKAVYEFQDLKDTIERAKNEADLLLEDLQEEIDLQSSVLNASIVDSSENIKKYYHISSS
AEKQINETSSTINTSANTRNDLLTILDTLTSKGNLSLERLKQIKIPDIQILNEKVCGDPG
NVPCVPLPCGGALCTGWKGHRKCGGPGCHGSLTLSTNALQKAQEAKSVIHNLDKQVRGLK
NQIKNISKLAEVSKNNALQLMEKLGNIRNQSDSEEEKINLFIKKVKNFLLEENVPPEDIE
KVANGVLDIHLPIPSQNLTDELDKIQKFMQLCEDYRTDENRLNEEADGAQKLLVKAKVAE
KAANVLLNLDKTLNQLQQAQITQGRADSTITQLTANITKIKKNVLQAENQTREMKSELEL
AKQRSGLEDGLSLLQTKLQRHQDHAVDVKVQADSAQHQAGSLEKEFVELKKQYAILQRKT
STTGLTKETLGKVKQLKDAAEKLAGDTEDKIRRITDLERKIQDLNLSRQAKADQLRILED
QVVAIKNEIVEREKKYATCYS
NT seq 5286 nt   +upstreamnt  +downstreamnt
atgcaatttcaactgaccctttttttgcaccttgggtggctcagttactcaaaagctcaa
gatgactgcaacaggggtgcctgtcatcccaccactggcgatctcctggtgggcaggaac
atgcagcttacggcttcttctacctgtgggctgagcagagcccagaaatactgcatcctc
agttacctagagggggaacaaaaatgcttcatctgtgactctagatttccatatgatccg
tacgcccaacccaacagccacaccattgagaatgtcattgtaagttttgaaccagacaga
gaaaagaaatgctggcaatctgaaaatggtcttgatcatgtcagcatcagactggactta
gaggcattatttcggttcagccaccttatcttgacctttaagacttttcggcctgctgca
atgttagttgaacgttccacagactatggacacacctggaaagtgttcaaatattttgca
aaagactgtgccacttcctttcctaacatcacatctggccaggcccagggagtgggagac
attgtttgtgactccaaatactcggatattgaaccctcaacaggtggagaggttgtttta
aaagttttggatcccagttttgaaactgaanaaccttatagcccctacatccaagacctt
gtgacattgacaaacctcaggataaactttaccaagctccacacccttggggatactttg
cttggaaggaggcaacatgattcccttgataaatactactatgctctgtatgagatgatt
gttcggggaagctgcttttgcaatggccatgctagcgaatgtcgccctatgcagaagatg
cggggagatgttttcagccctcctggaatggttcacggtcagtgcgtctgtcagcacaat
acagatggtccgaactgtgagagatgcaaggacttcttccaggacgttccttggaggcca
gctgcagacctccaggacaacgcttgcagatcgtgcagctgtaacagccactccagccgc
tgtcactttgacatgactacgtacctggcaagcggtggcctcagcgggggcgtgtgtgaa
gactgccagcacaacactgaggggcagcactgcgaccgctgcaggcccctcttctacagg
gatccgctcaagaccatctcagatccctacgcgtgcattccttgtgaatgtgaccctgat
gggaccatatctggtggcatttgtgtgagccactctgatcctgccttacggtctgtggcc
ggccagtgcctttgtaaggagaacgtggaaggagccaaatgcgaccagtgcaagcccaac
cactacggactgagcgccactgaccccctgggctgccagccctgcgactgtaaccccctt
gggagtctgccattcttgacctgtgatgtggatacaggccaatgcttgtgcctgtcatat
gtcaccagagcacactgcgaagaatgcactgttggatactggggcctgggaaatcatctc
catgggtgttctccctgtgactgtgatattggaggtgcttattctaacgtgtgctcaccc
aagaatgggcaatgtgaatgccgcccacatgtcactggccgtagctgctctgaaccagcc
cctggctacttctttgctcctttgaatttctatctctatgaggcagaggaagccacacca
ctccaaggactggcacctttgggctcggagacttttggccagagtcctgctgttcacgtt
gttttaggagagccagttcctgggaacccggttacatggactggacctggatttgccagg
gttctccctggggctggcttgagatttgctgtcaacaacattccctttcctgtggacttc
accattgccattcgctatgaaacccagtctgcagctgactggactgtccagatcgtggtg
aacccccctggagggagtgagcactgcatacccaagagtccacagtcaaagcctcagtct
tttgccttaccagtggctaccagaatcatgctgcttcccacacccatctgtttagaacca
gatgtacaatattccatagatgtctatttttctcagcctttggaaggagagtcccacgct
cattcacacgtcctggtggactctcttggccttattccccaaatcaattcattggagaat
ttctgcagcaagcaggacttagatgagtatcagcttcacaactgtgttgaaattgcctca
gcaatgggacctcaagtgctcctgggtgcctgtgaaaggctgatcatcagcatgtctgcc
aagctgcatgatggggctgtggcctgcaagtgtcacccccagggctcagtcggatccagc
tgcagccgacttggaggccagtgccagtgtaaacctcctgtggtcgggcgctgctgtgac
aggtgctcaactggaagctatgatttggggcatcatggctgtcacccatgtcactgccat
cctcaaggatcaaaggacactgtatgtgaccaagtaacaggacagtgcccctgccatgga
gaggtgtctggccgccgctgtgatcgctgcctggcaggctactttggatttcccagctgc
cgcccttgcccttgtaatgggtttgctgaactttgtgatcctgagacagggtcatgcttc
aattgtggaggctttacaactggcagaaactgtgaaaggtgtattgatggttactatgga
aatccttcttcaggacagccctgtcgtccttgcctgtgtccagatgatccctcaagcaat
cagtattttgcccattcctgttatcagaatctgtggagctcagatgtaatctgcaattgt
cttcaaggttatacgggtactcagtgtggagaatgctctactggtttctatggaaatcca
agaatttcaggagcaccttgccgaccatgcgcctgcaacaacaacatagatgtaaccgat
ccagagtcctgcagccgggtaacaggggagtgccttcgatgtttgcacaacactcagggc
gcaaactgccagctctgcaaaccaggtcactatggatcagccctcaatcagacctgcaga
agatgctcctgccatgcttccggcgtgagtcccatggagtgtccccctggtgggggagct
tgcctctgtgaccctgtcactggtgcttgtccttgtctgccgaatgtcacaggactggcc
tgtgaccgttgtgctgatggatactggaatctagtccctggcagaggatgtcagtcatgt
gactgtgaccctaggacctctcaaagtagccactgtgaccagcttacaggccagtgtccg
tgtaaattaggctacggcgggaaacgttgcagtgagtgccaggaaaattattatggtgat
ccacctgggcgatgcattccatgtgattgtaacagggcaggtacccagaagcccatctgt
gatccagacacaggcatgtgccactgccgggagggtgtcagcggccagagatgtgatcgc
tgtgcccggggtcacagccaggaattccctacttgtcttcaatgtcacttgtgctttgat
caatgggaccacaccatttcttccctctccaaagcggtgcaagggttaatgagactggct
gctaacatggaagataaaagagagaccctgcctgtctgtgaggcagacttcaaaggcctc
agagggaacgtgtctgaaatagaaaggattttgaaacatcctgttttcccatctgggaaa
ttcttaaaagtcaaggattatcatgactctgttagaaaacaaatcatgcagctaaatgaa
caattgaaagcagtgtatgaatttcaagatctgaaagatacaatagaaagagcaaagaat
gaagcagacctcttacttgaagaccttcaggaagaaattgatttgcaatccagtgtcctt
aatgcaagcattgtggactcctcagaaaacatcaagaaatattatcacatatcatcatct
gctgaaaagcaaattaatgaaactagttccaccattaatacctctgcaaatacaaggaat
gacttacttaccatcttagatacactaacctcaaaaggaaacttgtcattggaaagatta
aagcagattaagataccagatatccaaatattgaatgaaaaggtgtgtggagatccagga
aatgtgccatgtgtgcccttgccctgtggcggtgctctctgcacgggctggaaggggcac
aggaagtgtgggggtcccggctgtcacggctccctgaccctctcaacgaatgccctccaa
aaagcccaggaagcaaaatccgttattcataatttggacaaacaggttcgtgggttgaag
aatcagataaaaaatataagtaaactggcagaagtctccaaaaacaatgccttacagcta
atggaaaaactgggaaatataagaaaccaaagtgactctgaagaagaaaaaatcaatctt
ttcatcaaaaaagtgaaaaactttttgttagaggaaaacgtgcctccagaagacatcgag
aaggttgcgaatggtgtgcttgatattcacctaccaattccatcccaaaatctaaccgat
gaacttgacaaaatacagaaatttatgcaactctgtgaggattacaggacagatgaaaac
aggttaaatgaagaagcagatggagcccaaaagcttttggtgaaggccaaagtagctgag
aaagcagcaaatgttctattaaatcttgacaaaacattgaaccagttacaacaagctcaa
atcactcaaggacgggcagactctaccattacacagctgactgccaatataacaaaaata
aaaaagaatgtgctgcaggctgaaaatcaaaccagggaaatgaagagtgagctggagtta
gcaaagcagcgatcagggctggaggatggactttccctgctgcagaccaagttgcaaagg
catcaagaccacgctgtcgatgtgaaagttcaggctgactctgcccaacaccaggctggg
agtcttgagaaggaatttgttgagctgaaaaaacaatatgctattctccaacgtaagaca
agcactacaggactaacaaaggagacattaggaaaagttaaacagctaaaagatgctgca
gaaaaattggctggagatacagaggacaagataagaagaataacagatttagaaaggaaa
atccaagatttgaatctaagtagacaagcaaaagctgatcaactgagaatattggaagat
caagttgttgccattaaaaatgaaattgttgaacgagaaaaaaaatatgctacttgctat
agctag

DBGET integrated database retrieval system