KEGG   Nomascus leucogenys (northern white-cheeked gibbon): 100603791
Entry
100603791         CDS       T03265                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
nle  Nomascus leucogenys (northern white-cheeked gibbon)
Pathway
nle04151  PI3K-Akt signaling pathway
nle04382  Cornified envelope formation
nle04510  Focal adhesion
nle04512  ECM-receptor interaction
nle04518  Integrin signaling
nle04820  Cytoskeleton in muscle cells
nle04926  Relaxin signaling pathway
nle04933  AGE-RAGE signaling pathway in diabetic complications
nle04974  Protein digestion and absorption
nle05146  Amoebiasis
nle05165  Human papillomavirus infection
nle05200  Pathways in cancer
nle05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:nle00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100603791 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100603791 (COL4A1)
   04518 Integrin signaling
    100603791 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100603791 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100603791 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100603791 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100603791 (COL4A1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    100603791 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100603791 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100603791 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100603791 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100603791 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100603791 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:nle04147]
    100603791 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:nle00536]
    100603791 (COL4A1)
Exosome [BR:nle04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100603791 (COL4A1)
Glycosaminoglycan binding proteins [BR:nle00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100603791 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100603791
NCBI-ProteinID: XP_030669544
Ensembl: ENSNLEG00000007914
LinkDB
Position
5:complement(136428524..136586812)
AA seq 1669 aa
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERGFPGI
PGNPGPPGLPGLQGPIGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKLGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLVGRQGPQGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGAPGLPVPGLAGAPGFPGERGEKGDRGFPGVPLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLLCDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEIYFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDT
GPPGPPGYGPTGPIGDKGQAGFPGGPGSPGLPGPKGEPGKVVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGVIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLPGEKGDQGIVGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATVERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctcagcgtctggctgctgctgctgcccgccgcccttctgctccacgag
gagcacagccgggccgctgcgaagggtggctgtgctggctctggctgtggcaaatgtgac
tgccatggagtgaagggacaaaagggtgaaaggggcctcccggggttacaaggtgtcatt
gggtttcctggaatgcaaggacctgaggggccgcagggaccaccaggacaaaagggtgat
actggagaaccaggactacctggaacaaaagggacaagaggacctccgggagcatctggc
taccctggaaacccaggacttcccggcattcctggccaagacggcccaccaggcccccca
ggtattccaggatgcaatggcacaaagggagagagagggccgctcgggcctcctggcttg
cctggtttcgctggaaatcccgggccaccagggttaccaggaatgaagggtgatccaggt
gagatacttggccatgtgcccgggatgctgttgaaaggcgaaagaggatttcccggaatc
ccagggaatccaggcccaccaggactgccagggcttcaaggtcctattgggcctccagga
tttaccggaccaccaggtcccccaggccctcccggccctccaggtgaaaagggacaaatg
ggcttaagttttcaaggaccgaaaggtgacaagggtgaccaaggggtcagtgggcctccg
ggagtaccaggacaagctcaagttcaagaaaaaggagactttgccaccaagggagaaaag
ggccaaaaaggtgaacctggatttcaggggatgccaggggtcggagagaaaggtgaacct
ggaaaactaggacccagaggaaaacccggaaaagatggtgacaaaggggaaaaagggagt
cccggttttcctggtgaacccgggtacccaggactcgtaggccgccagggcccgcaggga
gaaaagggtgaagcaggtcctcccggcccacctggaattgttataggcacaggacctttg
ggagaaaaaggagagaggggctaccctggaactccggggccaagaggagagccaggccca
aaaggtttcccaggactaccaggccaacccggagctccaggcctccctgtacctgggctg
gctggtgcccctggcttccctggtgaaagaggagaaaaaggtgaccgaggatttcctggt
gtacctctgccaggaccaagtggaagagacgggctcccgggtcctcctggttcccccggg
ccccctgggcagcctggctacacaaatggaattgtggaatgtcagcccggacctccaggt
gaccagggtcctcctggaattccagggcagccaggatttataggcgaaattggagagaaa
ggtcaaaaaggagagagttgccttctctgtgatatagatggatatcgggggcctcccggg
ccacaaggacccccaggagaaataggtttcccaggacagccgggggccaagggcgacaga
ggtttgcctggcagagatggtgttgcgggagtgccagggcctcaaggtacaccagggctg
ataggccagccgggagccaagggggagcctggtgagatttatttcgacttgcggctcaaa
ggtgacaaaggagacccaggctttccaggacagcccggcatgccagggagagcgggttct
cctggaagagatggccatccgggtcttcctggccccaagggctcaccgggttctgtagga
ttgaaaggagagcgtggcccccctggaggagttggatttccaggcagtcgtggtgacacc
ggcccccctgggcctccaggatatggtcctactggtcccattggtgacaaaggacaagca
ggctttcctggaggccctggatccccaggcctgccaggtccaaagggtgaaccaggaaaa
gttgttcctttaccaggcccccctggagcagaaggactgccggggtccccaggcttccca
ggtccccaaggagaccgaggctttcccggaaccccaggaaggccaggcctgccaggagag
aagggtgctgtgggccagccgggaattggatttccagggccccccggccccaaaggtgtt
gacggcttacctggagacatggggcctccggggactccaggtcgcccgggatttaatggc
ttacctggaaacccaggtgtgcagggccagaagggagagcctggagttggtctgccggga
ctcaaaggtttgccaggtcttccaggcattcctggcacacctggggagaaggggagcatt
ggggtaccaggcgttcctggagaacacggagtgatcggaccccctgggcttcaggggatc
agaggtgaaccgggacctcctggattgccaggctccgtggggtctccaggagttccagga
ataggcccccctggagctaggggtccccccggaggacaaggaccaccggggttgtcaggc
cctcctggaataaaaggagagaagggtttccccggattccctggactggacatgccgggc
cctaaaggagataaaggggctcaaggacttcctggcataacgggacagtcagggctccct
ggccttcctggacagcagggggctcctgggattcctgggtttccaggttccaagggagaa
atgggcgtcatggggacccccgggcagccgggctcaccaggaccagtgggtgctccggga
ttaccgggtgaaaaaggggaccatggctttccgggctcctcaggacccaggggagaccct
ggcttgaaaggtgataagggggatgtcggtcttcctggcaagcctggctccatggataag
gtggacatgggcagcatgaagggccagaagggagaccaaggagagaaaggacaaattgga
ccaattggtgagaagggatcccgaggagaccctgggaccccaggagtgcctggaaaggac
gggcaggcaggacagcctgggcagccaggacctaaaggtgatccaggtataagtggaacc
ccaggtgctccaggacttccgggaccaaaagggtctgttggtggaatgggcttgccagga
acacctggagagaaaggtgtgcctggcatccctggcccacaaggttcacctggcttacct
ggagacaaaggtgcaaaaggagagaaagggcaggcaggcccacctggcataggcatccca
gggctgcctggtgaaaagggagatcaagggatagtgggtttcccaggaagccctggagag
aagggagaaaaaggaagcattgggatcccaggaatgccagggtctccaggccttaaaggg
tctcccgggagtgttggttatccaggaagccctgggctgcctggagaaaaaggtgacaaa
ggcctcccaggattggatggcatccctggcgtcaaaggagaagcaggtcttcctgggacg
cctggccccacaggcccagctggccagaaaggggagccaggcagtgacggaatcccaggg
tcagcaggagagaagggtgaaccaggtctaccaggaagaggattcccagggtttccaggg
gccaaaggagacaaaggttcaaagggtgaggtgggtttcccaggattagccgggagccca
ggaattcctggatccaaaggagagcaaggattcatgggtcccccggggccccaaggtcag
ccagggttaccgggatccccaggccatgccacggaggggcccaaaggagaccgcggacct
cagggccagcctggcctgccaggacttccgggacccatggggcctccagggcttcctggg
attgatggagttaaaggtgacaaaggaaatccaggctggccaggagcacccggtgtccca
gggcccaagggagaccctggattccagggcatgcctggtatcggtggctctccaggaatc
acaggctctaagggtgatatggggcctccaggagttccaggatttcaaggtccaaaaggt
cttcctggcctccagggaattaaaggtgatcaaggcgatcaaggtgtcccgggagctaaa
ggtctcccgggtcctcctggccccccaggtccttacgacatcatcaaaggggagcccggg
ctccctggtcctgagggccccccagggctgaaagggcttcagggactgccaggcccgaaa
ggccagcaaggtgttacaggattggtgggcatacctggacctccaggtattcctgggttt
gacggtgcccctggccagaaaggagagatgggacctgccgggcctactggtccaagagga
tttccaggtccaccaggccccgatgggttgccaggatccatggggcccccaggcacccca
tctgttgatcacggcttccttgtgaccaggcacagtcaaacaatagatgacccacagtgt
ccttctgggaccaaaattctttaccatgggtactctttgctctacgtgcaaggcaatgag
cgggcccatggccaggacttgggcacggccggcagctgcctgcgcaagttcagcacgatg
cccttcctgttctgcaatattaacaacgtgtgcaactttgcatcacgcaatgactactcg
tactggctgtccacccccgagcccatgcccatgtcaatggcacccatcacgggggacaat
ataagaccatttattagtaggtgtgctgtgtgtgaggcgcctgccatggtaatggccgtg
cacagtcagaccattcagatcccaccgtgccccagtgggtggtcctcgctgtggattggc
tactcttttgtgatgcacaccagcgctggtgcagaaggctctggccaagccctggcgtcc
cccggctcctgtctggaggagtttagaagtgcgccattcatcgagtgtcatggccgtggg
acctgtaattactacgcaaacgcttacagcttttggctcgccaccgtagagaggagcgag
atgttcaagaagcccacgccgtccaccttgaaggcaggggagctacgcacgcacgtcagc
cgctgccaagtctgtatgagaagaacataa

DBGET integrated database retrieval system