KEGG   Suncus etruscus (white-toothed pygmy shrew): 126001121
Entry
126001121         CDS       T09902                                 
Symbol
COL4A3
Name
(RefSeq) collagen alpha-3(IV) chain
KO
K06237  collagen type IV alpha
Organism
setr  Suncus etruscus (white-toothed pygmy shrew)
Pathway
setr04151  PI3K-Akt signaling pathway
setr04382  Cornified envelope formation
setr04510  Focal adhesion
setr04512  ECM-receptor interaction
setr04820  Cytoskeleton in muscle cells
setr04926  Relaxin signaling pathway
setr04933  AGE-RAGE signaling pathway in diabetic complications
setr04974  Protein digestion and absorption
setr05146  Amoebiasis
setr05165  Human papillomavirus infection
setr05200  Pathways in cancer
setr05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:setr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    126001121 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    126001121 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    126001121 (COL4A3)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    126001121 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    126001121 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    126001121 (COL4A3)
  09158 Development and regeneration
   04382 Cornified envelope formation
    126001121 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    126001121 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    126001121 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    126001121 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    126001121 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    126001121 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:setr04147]
    126001121 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:setr00536]
    126001121 (COL4A3)
Exosome [BR:setr04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   126001121 (COL4A3)
Glycosaminoglycan binding proteins [BR:setr00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   126001121 (COL4A3)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 126001121
NCBI-ProteinID: XP_049624258
Position
2:complement(23182770..23333935)
AA seq 1672 aa
MCPGPSPRLPAVLLLPLLLLLQAQEPSEGKGCICKGKGQCFCGGIKGQKGEQGLPGPGGL
PGQKGFPGPEGLPGLQGPKGSAGLPGLKGPKGTRGITGLPGFSGPPGLPGTPGHPGPYGL
PGLPGCNGSKGEQGFPGFPGLPGYPGMQGAVGLKGEKGEPAESKGVEGGGKGDPGLPGIP
GSQGLPGLPGFPGPVGPPGPPGLFGFPGAMGPPGPKGYMGDNVIGQKGEKGVKGLMGPPG
PPGTVVVTLTGTDNRTDLKGEKGDRGAAGEPGPPGPSGPRGDSYGSEKGERGESGAQGKP
GKDGAPGFPGTKGAKGDRGFPGLVGKDGIKGLKGDMGPPGFPGPTEYYDTYHAKGDKGIP
GPPGPKGARGPQGPSGAPGAPGSPGLSLPGLRGTIGIPGMKGRKGEQGPPGRDAVGLPGA
PGCTGSPGPPGRPGPPGPPGDVVYCKGPLGDQGHPGHIGAQGIPGVEGPKGEPGLPCIQC
PCVQGPPGLPGLPGLDGEKGLPGGEGAAGGKGSQGPPGNPGLPGFPGLPGAQGDAGQKGE
KGEKPKPEGEAGAPGDSGLRGPPGRKGFDGIPGIPGAKGIRGLKGEPALHGVKGDRGPPG
NPGLPGFPGPPGPAGPSDFGPQGETGPKGDQGAPGAPGPPGEAGPRGEDGISIPIPGIPG
PPGPPGCPGPPGPSGVPGSQGKCNDPGLPGPDGEPGIPGIGFPGPPGPKGDRGLPGSRGP
PGCPGDMGKPGFPGTPGATGAKGEPGLSMVGEPGKPGFPGQRGSPGEKGDTGFPGLPGHP
GNAGTRGLDGPQGDPGHLGPPGEKGPPGKCEEGPQGPPGHPGLKGLKGQQGRRGVPGPKG
DPGIPGLDRSGFPGEPGPPGIPGQRGEMGPPGQKGCPGTSGVLGPPGEKGMIGMMGYPGN
IGPPGLAGDPGIPGERGSLGIPGTKGKRGPPGDRGDEGEKGAAGPSQTTHLAGEKGEPGL
KGMAGMPGEKGNRGLPGLPGFQGLPGPPGSPGSPGPRGDPGSFGDPGEPGPRGDPGSMGF
MGVPGLKGKKGALGPPGLTGRPGIPGVRGPQGNKGEPGYSEGTRPGPPGPQGDPGLPGDK
GKKGERGLPGAPGYWGPAGPDGIPGSPGSPGEPGMPGPKGDLGHKGIKGYAGFPGVKGLP
GPPGFPGHPGQVGMKGEQGCDGIPGPAGEKGESGLLGTLPGPRGKPGLPGAKGNRGAPGF
PGLPGRKGAVGDVGPPGPTGMAGPQGPPGSPGVIIPGLKGNQGPPGLRGNPGEPGPPGLP
GSPAKGIKGDKGCIGQPGPSGPPGAVGDMGPRGYPGIPGPQGLPGHRGYPGFHGFPGMKG
EKGDPGFLGPKGPPGRIGPKGKPGARGDFGTVKIISLPGSPGPPGAIGPPGMQGEPGLPG
PPGNPGPCGPRGKPGQDGRPGRPGPDGVKGHKGCKGQQGLPGFNGPPGLKGKPGEPGVPR
TGPIRRGFLLTRHSQTTAIPSCPIGTEPLYTGFSLLSVQGNERAHGQDLGSLGSCLQRFT
PMPFLFCNINDVCNFASRNDYSYWLSTPALMPKDMAPITGRALEPYISRCIVCEGSTVAI
AVHSQTTNVPPCPQGWVSLWKGFSFIMFTSAGSEGSGQALASPGSCMEEFRASPFIECHG
RGTCNYYSNSYSFWLASLNSKRMFRKPIPSTVKAGELENVISRCQVCMKRRN
NT seq 5019 nt
atgtgccccgggccttcgccgaggttgcccgcggtgctattgctgccactcctgctgctg
ctccaggcccaggagccctccgagggcaagggctgcatatgcaaaggcaaaggccagtgt
ttctgtggaggcatcaaaggccaaaagggagaacaaggtttgcctggacctggtggttta
cctggccagaaaggatttccaggtcctgaaggtctgcctggactccaggggcccaagggc
tctgctggacttcctggactcaagggtcccaaaggaacaaggggtataactggattgcca
ggattttctggtcctcctggacttcctggcaccccaggtcatcctggaccttatggactt
cctggtttgccagggtgcaatggttccaagggtgagcaaggctttcccggatttccagga
ctaccaggctacccagggatgcagggtgctgttggtctaaaaggagaaaagggtgaacct
gctgaaagcaaaggtgtagaaggaggtggaaagggtgatcccggattaccgggaatacca
ggatctcagggtttgccaggcctgccaggctttcctggacctgtgggcccaccaggtcct
ccgggattgtttggctttccaggagccatgggacctccaggacctaagggttacatgggt
gataacgtgataggacagaaaggagagaagggtgttaaaggattgatgggacccccagga
cccccaggaacagttgttgtgacactgactggcacagataacagaacggacctgaagggg
gagaagggagatcggggtgcagcaggtgaaccaggccctccaggaccctcaggtccacgt
ggagattcttatgggtctgaaaagggtgaacgtggggagtccggagcacagggaaaacct
ggaaaagatggcgcccctggcttccctggaaccaagggagccaaaggtgaccggggcttt
ccagggttagtcggtaaagatggcatcaaggggttgaagggagacatgggccctccggga
tttcctggtccaacagaatattatgatacataccatgcaaaaggggataaaggaattcca
ggcccaccagggcccaaaggagctcgtggcccacaaggtcccagtggtgcacctggagct
ccaggaagtcctgggttatctctgcctggcctcagaggaactattgggattccaggcatg
aaaggaaggaaaggagagcaaggaccccctggaagggatgcagtgggattgcctggggcc
ccaggttgtactggctcaccaggccctccagggagacccggacctccaggacctccaggt
gatgttgtttattgtaaaggccctcttggagatcagggacatccaggacatataggagcc
caaggaatcccaggagttgagggacccaaaggggaaccaggccttccatgtatacaatgt
ccttgtgtccaagggcctccgggtctcccaggattgccgggactggatggtgaaaaagga
ctcccaggaggagaaggggcagctgggggaaaaggaagccaggggcctccagggaatcct
ggtcttcctgggtttccggggctcccgggtgctcagggtgatgctggacagaaaggagag
aaaggtgagaagccaaaaccagaaggagaagctggagctccaggtgattctggactcaga
ggacctcctggaagaaagggctttgatggaatccctggaattccaggagcaaaaggaatt
cgaggactgaaaggcgagccggcactgcatggtgtgaagggggaccggggtcctccaggg
aatcctggactccctgggtttccaggacctccaggaccagctggaccatcagactttgga
ccacagggagagactggtccaaagggtgaccaaggagctcctggagcccctgggccacct
ggagaagccggtccaaggggagaagatggcatttcaatcccaataccaggcatcccaggg
cctccgggacctccaggctgccctggccccccaggtccatctggtgttcctggatcccaa
ggaaaatgcaatgatccgggccttcctggaccagatggtgaaccaggaattccaggaatc
ggatttcctgggcctcctggacccaagggagaccgtggactcccaggatcaagaggaccc
cctggttgtcctggagacatgggaaaaccagggtttcctgggacgccaggagccacagga
gccaagggagaacccggattgagcatggttggagaaccaggaaaaccagggtttccaggg
caaagaggtagccctggggaaaaaggagacactggattcccaggacttcctggtcaccct
ggaaatgcaggcaccagaggactggatgggccacaaggggatccagggcatcttggacca
cctggagaaaaaggccctccaggaaagtgcgaagaagggccacagggacctccaggacat
ccaggcttaaaaggattgaaagggcagcaaggcagaagaggtgtgcctgggccaaaggga
gatccaggcattccaggtttggatagatctggatttcctggagaacctggacccccagga
ataccaggtcaacgaggtgagatggggccgcctggtcagaaaggatgtccaggaacatca
ggagttctaggaccaccaggagaaaaaggaatgattgggatgatgggctatcctggaaac
ataggtcctccaggacttgctggggacccaggcataccaggagagagaggaagccttgga
attccaggaacaaagggtaaaagaggacccccaggagacagaggggatgaaggagagaaa
ggcgctgctgggccatctcagacaacccacttagcaggggaaaagggggaacccggtctc
aaagggatggcaggaatgccaggtgagaagggaaacagaggcctcccaggactgccagga
ttccaaggcctgcctggaccacctggttctccaggatcaccaggccccagaggagatcca
ggcagctttggggatcctggggaaccaggcccacgtggagatccaggaagcatggggttc
atgggagtgccaggtcttaagggaaaaaagggagctttgggcccccctggtttaactgga
agaccaggaatcccaggtgttcgtggtccccaagggaataagggagagcccggttattca
gaaggtacaaggccaggaccaccaggaccacagggagatccaggattaccaggtgataag
ggaaagaaaggagaaagagggctacctggcgcacctggatattgggggcctgctggacca
gatggcatccctggaagtcctggaagtcctggtgagccaggaatgccaggtcccaaaggt
gatttgggtcataaaggaatcaaaggttatgcaggctttccaggagttaaaggccttcca
ggtcctccagggttcccaggtcatcctgggcaagtaggtatgaaaggtgaacaaggatgt
gatggaattcctggtccagcaggggaaaagggagaatcaggtttactgggaacacttcca
ggcccaagaggaaaacctggccttccaggagccaaaggaaacaggggagcccctggattt
ccaggtctccctggccggaaaggagcagtgggagatgttgggccaccaggacccacaggc
atggcaggacctcaaggacccccaggttcacctggtgtgatcatcccaggcctcaaagga
aatcaaggtcccccaggcctaagaggaaatccaggtgagcctggtccccctggacttcca
ggaagccctgccaaaggaataaaaggtgacaaagggtgtattggccagcctggtccaagt
ggtccacctggagctgtaggagacatggggccacgtggctatccgggtataccaggtccc
cagggtcttccagggcacagaggttatcctgggttccatggatttccaggcatgaaaggc
gaaaagggtgatccaggatttttgggaccaaaaggacctccaggacgaattggaccaaaa
ggaaaacctggtgcacgtggagactttgggacagtgaaaatcatctcccttccaggaagc
ccagggccacctggtgctataggaccaccaggaatgcaaggagaaccaggcctaccaggg
ccgccaggaaacccaggaccttgtgggccaagaggtaaaccaggtcaggatggaagacca
ggaagacctggcccagatggagttaaaggccacaaaggttgcaaaggacagcaaggtctg
cctggattcaatggaccaccaggcttaaaggggaagcctggagagcctggagtacccaga
acgggacccatcaggagaggttttcttctcactcgtcacagtcagaccacagcaatccct
tcctgtcccataggcacagaaccactctatactgggttttctcttctttccgtgcaagga
aacgagcgagctcatggacaagacctggggagccttggcagttgcctgcagcgattcacc
ccaatgcctttcttattctgcaacatcaatgatgtgtgtaatttcgcatctcgaaatgat
tattcatactggctttctacaccagcactgatgccaaaggacatggctccaatcacgggc
agggccctggaaccttatattagcagatgcatcgtctgtgagggctctacagtggccatt
gctgtccacagccagacaactaatgtccccccatgtccacagggctgggtttctctttgg
aaaggattttctttcatcatgttcacaagtgcaggatctgagggtagtggacaagcattg
gcatccccaggatcctgcatggaagaattccgagccagtccattcatagaatgccatgga
agagggacatgcaactactattctaactcctacagcttctggttggcttcgttaaattct
aaaagaatgttcagaaagcctattccatcaactgtgaaagctggagaactagaaaatgta
atcagtcgctgtcaggtgtgcatgaagagaagaaactga
