KEGG   Neomonachus schauinslandi (Hawaiian monk seal): 110585841
Entry
110585841         CDS       T08474                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
nsu  Neomonachus schauinslandi (Hawaiian monk seal)
Pathway
nsu04151  PI3K-Akt signaling pathway
nsu04510  Focal adhesion
nsu04512  ECM-receptor interaction
nsu04820  Cytoskeleton in muscle cells
nsu04926  Relaxin signaling pathway
nsu04933  AGE-RAGE signaling pathway in diabetic complications
nsu04974  Protein digestion and absorption
nsu05146  Amoebiasis
nsu05165  Human papillomavirus infection
nsu05200  Pathways in cancer
nsu05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:nsu00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    110585841 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    110585841 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    110585841 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    110585841 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    110585841 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    110585841 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    110585841 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    110585841 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    110585841 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    110585841 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    110585841 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:nsu04147]
    110585841 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:nsu00536]
    110585841 (COL4A1)
Exosome [BR:nsu04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   110585841 (COL4A1)
Glycosaminoglycan binding proteins [BR:nsu00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   110585841 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 110585841
NCBI-ProteinID: XP_021551670
EnsemblRapid: ENSNSCG00000008564
UniProt: A0A2Y9HQF1
LinkDB
Position
3:2843129..2979315
AA seq 1669 aa
MGPRLGVWLLLLPAALLLHEERSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGEPGPLGPPGLPGFAGNPGPPGLPGMKGDPGEIIGHVPGTLLKGERGFPGP
PGTPGSPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGLGEKGEPGKPGPRGKPGKDGEKGEKGS
PGFPGDAGYPGLPGREGFRGDKGEAGPPGPPGIAIGPGPSGEKGERGYPGAPGLRGEPGP
KGFPGLQGQPGPPGFPVPGQAGAPGFPGERGEKGDQGFPGTSLPGPSGRDGQPGPPGLPG
PPGQPGHTNGIVECQPGPPGDQGPPGIPGQPGLTGEVGEKGQKGDSCLICDTEGLRGPPG
PQGPPGEIGFPGQPGPKGDRGLPGRDGLEGLPGPQGTPGLMGQPGAKGEPGEIYFDARLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDI
GPPGPPGFGPIGPVGDKGQAGFPGTPGSPGQPGPKGEAGKVVPLPGPPGAQGLPGPPGFS
GPQGDRGFPGTPGRPGLPGEKGTVGQPGIGFPGPPGPKGVDGLPGDVGPPGSPGRPGFNG
LPGNPGVPGQKGEPGIGLPGLKGLPGLPGIPGTPGEKGNIGGAGVPGEHGAIGPPGLQGI
RGDPGPPGLQGPKGAPGAPGIGPPGAMGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPG
PKGDKGSQGLPGLTGQSGLPGLPGQQGTPGVPGFPGPKGEMGVMGTPGQPGSPGPAGLPG
LPGEKGDHGFPGSSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQTG
PTGDKGSRGDPGTPGVPGKDGQAGHPGQPGPKGDPGTGGTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGVPGLPGEKGAKGEKGQVGLPGIGIPGRPGDKGDQGVAGFPGSPGE
KGEKGSAGIPGVPGSPGPKGSPGSVGYPGSPGLPGEKGDKGLPGSDGIPGIKGEAGLPGS
PGPTGPAGQKGEPGSDGIPGSAGEKGEAGLPGRGFPGFPGAKGEKGSKGDVGFPGQAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGAPGHAVEGPKGDRGPQGQPGLPGLPGPVGPPGLPG
LDGLKGDKGNPGWPGTPGAPGPKGEPGFQGLPGIGGSPGITGSKGDMGPPGVPGFQGQKG
LPGLQGAKGDQGDQGFPGSKGLPGPPGPPGPYDVIKGEPGLPGPEGPAGLKGLPGPPGPK
GQQGVTGSVGLPGPPGVPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLPGSMGPPGTP
SVDHGFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPIAGDNIRPFISRCAVCEAPAMVMAV
HSQTIQIPQCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5010 nt   +upstreamnt  +downstreamnt
atggggccccggctgggcgtctggctgcttctgctgcccgccgccctcctgctccacgag
gagcgcagccgggccgcggcgaagggtggttgtgctggctctggctgtgggaagtgtgac
tgccatggagtgaagggacagaagggcgaaagaggcctcccagggttgcaaggtgtcatc
gggtttcccgggatgcaaggacccgaggggccgcaggggccaccaggacaaaagggtgac
actggagaacccgggctgcctggaacgaaagggacaagaggaccccctggagcatcgggt
taccctggaaacccaggactgcctggtattcctggccaagacggtccccctggtcctcca
ggtatccccggatgcaatggaacaaagggcgagccagggcctctggggcccccgggtttg
cctggattcgctggaaatcctggacctccaggattaccaggaatgaagggggatccaggt
gaaatcattggccatgtgcctgggaccctgttgaaaggtgaaagaggatttcctggaccc
ccaggaacaccaggctcgccaggactgccaggcctgcaaggtcctgttggccctccagga
tttaccgggccaccaggtcctccaggccctcctggccctccaggtgaaaaggggcaaatg
ggcttaagttttcaagggccaaaaggcgacaagggtgaccagggagtcagtgggcctccg
ggagtgccaggacaagctcaagttcaagaaaaaggagacttcgccactaaaggagagaag
ggtcaaaaaggtgaacctggatttcagggaatgccagggcttggagagaaaggggagcct
ggaaaaccagggccccgaggaaaacctgggaaagatggcgaaaaaggagaaaaagggagt
ccaggcttcccaggcgacgcggggtacccgggactcccaggccgcgaaggtttcagggga
gacaaaggtgaagcaggtcctccaggcccacctggaattgccatcggcccaggaccctcc
ggagagaaaggagagcgggggtacccgggcgccccagggttgagaggagagccaggcccc
aaagggttcccaggattacaaggccagccaggtcctccaggcttcccggtaccagggcag
gcgggtgctcctggctttcctggtgaaagaggcgagaaaggtgaccaagggtttccaggc
acctctttgccgggaccaagtggaagagatgggcaaccgggcccccccgggcttcccggg
ccccctggacagccaggccacacgaatggaattgtggaatgtcagcctggaccgcctggg
gatcagggtccccctggaattccggggcagccagggctcacgggcgaggttggagaaaaa
ggtcagaaaggagacagctgcctcatctgtgacacagaaggacttcgtgggccccctggg
ccgcagggccccccaggagaaataggtttcccaggacagccagggccgaagggcgacaga
ggcttacccggcagagatggtctggaaggattgcctggcccgcaaggcacaccagggctg
atgggccagccgggagccaagggagagcctggcgagatttacttcgacgcacggctcaaa
ggagacaaaggagacccaggctttccaggacagcctgggatgccaggcagagcagggtct
cctgggagagatggccatccgggtctgcccggccccaaaggctccccgggttcagtagga
ttgaaaggagagcgtggaccccctgggggagttggattccccggcagtcgaggtgacatc
ggccctcctgggcctccagggtttggccctattggccctgttggtgacaaaggacaagcg
ggttttccggggacccctggatccccaggccagccaggtcccaagggtgaagcaggaaaa
gttgtgcccctacctggtccccctggagcacaaggacttccgggacccccaggcttctca
gggccacaaggtgaccgaggttttcctggaaccccaggaaggccaggcctcccaggagag
aagggcactgtcggccagcccggaatcggatttccagggccccccggccccaaaggtgtt
gatggcttacctggagacgtgggacctcctgggagtccgggtcgcccgggatttaacggc
ttacctggcaacccgggtgtgcccggccaaaagggagagccgggcattggtctaccagga
ctcaaaggattgccgggtcttcccggcattcctggcacacctggagagaaggggaacatc
gggggagcaggcgttcctggagagcatggtgctatcggccccccaggccttcagggaatc
agaggtgacccaggaccgcctggattacaaggtcccaaaggagctccgggagctcctgga
attggcccccccggagcgatgggcccccccggaggacagggaccaccaggatcctcaggc
cctcctggagtgaaaggagagaagggctttcccggatttccaggcctggacatgccaggt
cccaaaggagataaagggtcccaagggctgcctggcctgacgggacagtcggggctccct
ggtctccctggacagcagggcacacctggggttcctgggtttccaggtcccaagggagag
atgggcgtcatggggaccccggggcagcctggctcaccgggaccagcgggcctgccagga
ttgccaggagagaaaggggaccacggcttcccgggctcctcggggcccaggggagaccct
ggcttcaagggggacaaaggagacgtgggtctccccggcaagcctggctccatggataag
gtggacatgggcagcatgaagggtcagaagggagaccaaggagagaaaggacagaccgga
ccaactggtgataaaggatcccggggagaccctggaaccccgggagtacctggaaaggac
gggcaggcggggcatcctgggcagccaggacctaaaggtgacccaggcacaggtggaacc
ccgggtgccccaggactccctggacccaaaggatcggttggtggcatgggcctgccagga
acaccgggagaaaaaggtgtgcctggaatccccggcccgcagggcgtccctggcttaccc
ggggagaaaggagccaaaggagagaaagggcaggtgggcctacctggcattggaattccg
ggccgtcctggggacaagggagatcaaggggtcgcaggctttccggggagtcctggagag
aaaggagaaaaaggcagtgctggtatcccaggggtccctggctccccaggccccaaaggc
tcgccagggagtgtcggctatccaggaagccccggattgcctggagaaaaaggtgacaaa
ggcctcccgggatcggatggcattcctggcatcaaaggagaagcaggtcttcctgggagt
cctggccccaccggcccagctggccagaaaggggagcccggcagtgacggaatcccgggg
tcagcgggagagaagggtgaagcaggtctgcccgggagaggattcccagggtttccagga
gccaaaggagagaaaggttcaaagggcgacgtgggtttcccaggacaagccgggagtcca
ggcatccccggatccaaaggagagcaaggattcatgggtcccccggggccgcaaggacag
ccgggcttacctggagctccgggccacgctgtggaggggcccaaaggagaccggggccca
cagggtcaacctggcctgccagggcttccaggacctgtggggcctccggggctccctggg
cttgatggactcaaaggtgacaaaggaaacccgggctggccggggactcccggagctcca
gggcccaagggagagccaggattccaaggcctgcctgggattggtggctcgccagggatc
acaggttctaagggtgacatggggcctccaggagtgccaggatttcaaggtcagaaaggc
ctccctggcctgcagggagcgaagggggatcaaggtgaccagggcttccccggaagtaaa
ggccttcccggccccccgggtcccccgggaccctacgatgtcatcaaaggggagccaggg
cttcctggtcctgagggtcccgcaggtctgaaggggctcccggggcctccaggccccaaa
ggacagcaaggtgtgacaggatccgtgggcttacctggaccgcccggtgttcctggtttt
gacggcgcccctggccagaaaggagagacaggacccttcggccctcctggtccgcgaggg
tttccgggcccgccaggccctgatgggttgccaggatccatgggtcccccaggcaccccg
tctgttgatcacggcttccttgtgaccaggcacagtcagacaacggatgacccacagtgt
cctcctgggaccaaaattctttaccacgggtactccttgctctatgtgcaaggcaacgaa
agggcccatggccaggacttgggcacggctggaagctgcctgcgcaagttcagtacgatg
cccttccttttctgcaacatcaacaatgtgtgtaacttcgcctcccgaaatgactactcc
tactggttgtccacgcctgagcccatgcccatgtccatggcgcccatcgccggggataat
atcagaccgtttattagcaggtgcgcagtgtgtgaggcgccagccatggtgatggccgtg
cacagccagaccattcagatcccgcagtgccccagcggctggtcctccctctggattggc
tattccttcgtgatgcacaccagtgctggtgctgaaggttcgggccaagccctcgcgtcc
cccgggtcttgtctggaagagtttaggagcgcgccattcatcgagtgccacggccgtggg
acctgcaactactacgcgaacgcttacagcttttggcttgccaccatcgagagaagcgag
atgttcaagaagcccacgccatccaccttgaaggccggggagctgcgcacacacgtcagt
cgctgccaagtctgtatgagaagaacgtaa

DBGET integrated database retrieval system