KEGG   Sarcophilus harrisii (Tasmanian devil): 100917586
Entry
100917586         CDS       T02286
Symbol
COL4A4
Name
(RefSeq) collagen alpha-4(IV) chain
KO
K06237  collagen type IV alpha
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr04151  PI3K-Akt signaling pathway
shr04382  Cornified envelope formation
shr04510  Focal adhesion
shr04512  ECM-receptor interaction
shr04820  Cytoskeleton in muscle cells
shr04926  Relaxin signaling pathway
shr04933  AGE-RAGE signaling pathway in diabetic complications
shr04974  Protein digestion and absorption
shr05146  Amoebiasis
shr05165  Human papillomavirus infection
shr05200  Pathways in cancer
shr05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:shr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100917586 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100917586 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100917586 (COL4A4)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100917586 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100917586 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    100917586 (COL4A4)
  09158 Development and regeneration
   04382 Cornified envelope formation
    100917586 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100917586 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100917586 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100917586 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100917586 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100917586 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:shr04147]
    100917586 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:shr00536]
    100917586 (COL4A4)
Exosome [BR:shr04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100917586 (COL4A4)
Glycosaminoglycan binding proteins [BR:shr00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100917586 (COL4A4)
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100917586
NCBI-ProteinID: XP_031813802
Ensembl: ENSSHAG00000013055
UniProt: G3WIR0
Position
3:54245836..54387063
AA seq 1687 aa
MALIKCSFRWLKQFVMGPWSLVIILFSIQQVYGGGKKYSGPCGGRDCSVCQCFPEKGSRG
QPGHLGPQGPIGDLGPPGPIGLPGEKGMKGDSGFPGPVGNQGDKGPTGVPGFPGLDGIPG
YPGPPGPRGKPGIDGRNGSRGDPGFPGERGVPGPEGPAGLQGPAGEKGNSVYISGAIKGL
PGDRGDPGSPGLPGPRGANGSMGEIGFPGIPGSRGPPGYPGRPGDQGNPGVGVKGQIGDP
GEVGLQGFPGPTLLIEPPGSCLIKGEKGMKGMPGVIGPSGPYGVKGEPGIGVKGEKGTSG
FPGPQGGPGSHGLPGFPGVKGELGASGVRGQLGVFGTKGDPGERGKPGPPGVLTTPPVPC
KGPPGDPGFPGLYGEKGSVGLPGPAGPPGRPGEVRIGMTGPPGPPGLPGLQGLQGEAGNP
GRGDSPAGKPGTPGHPGIPGVPGQQGQPGSDVIHCSTGPPGPQGVKGQSGPPGRRGTKGE
KGNEGLCSCCPGPLGLPGAPGPHGLQGRKGEVGLPGRVGEKGDPGSPGAQGSPGPPGIPG
TSGLTGPKGEKGDMVIARIKGRKGEKGHDGPPGYPGQEGVPGQNGLYGKRGEPGLPGSRG
GAIPGAEGSPGAPGSPGRMGPMGPPGLGFPGPEGVRGGPGKTGHPGVKGASGSKGQKGDT
YPCPSPYPGSPGAPGIPGAPGLKGFPGPPGEAGQPGFDGQKGQQGKSGISGILGPPGFRG
DPGESGPEGVEGLAPIGPPGPPGLPGINGQKGMPGDSAYGPPGLPGKMGPPGLPGLKGLR
GDPGFPGATGATGVPGFPGIKGIQGRKGNAGPPGNPGLPGHGCRGVPGLPGQPGLPGSPG
IPGFDGLKGHQGDRGPPGPAGMKGLPGIPGLPGSPGPPGPQGDPGYYGDNGSPGAPGPKG
AQGLPGIPGFPGERGRKGQIGLPGRKGDPGEEGLCGFPGEPGEPGSKGEKGYPGDQGDIS
IVSRKGEKGDLGPPGRYGFPGEKGDKGSPGIPGIKGFPGRDGLPGLQRGEPGKSGHPGLP
GPPGLPGPPGLRGIIGFPGIPGDKGELGLPGLPGLPGPHGTRGSKGNQGDDAASLFGPPG
PKGDPGIPGCPGYFGEQGDPGFPGPPGSSGLPGREGSAGYPGPPGFPGAPGIPGLKGYPG
EAGDPGPPGVRGDPGPPGPKGIKGSPGSPGLDGLPGLKGQKGLKGSSGLNEVGPPGQMGL
PGLKGKKGEPGRPGASCPGFPGDRGPLGPPGIPGLPGLPGPVGKTPEDEILPPGPLGDQG
PPGPDGPKGPPGPPGFPCSYDILKGDPGDDGQPGLPGPQGPPGPPGAKGFPGCEGKDGQK
GPMGYPGPPGPPGFPGPPGDKGLQGPPGRPGPTGPPGCQGEPGPPADTSSCPKIPGLPGV
PGPRGPEGKMGHPGQRGLPGSPGHKGERGVDGQRGLDGGPGPPGPPGSKGDPGEDGCGGA
PGPPGPTGDPGPKGSGTRYLNGFLLVLHSQTDKEPSCPEGMSKLWTGYSLLYLEGQEKAH
NQDLGLAGSCLPMFSTMPFAYCNINQVCHYARRNDKSYWLSSAAPLPMMPLSEDEIQAYI
SRCVVCEAPGQAMAVHSQDQAIPPCPPTWRSLWIGYSFLMHTGAGDQGGGQSLTSPGSCL
EDFRAAPFIECQGRQGTCHFFANEYSFWLTTVPPDSLLSSAPVPDTLKQERAQRQKISRC
QVCMKYS
NT seq 5064 nt
atggcattaataaagtgttctttcagatggcttaaacaatttgtaatgggtccttggtcg
cttgtaattattcttttttctatacaacaagtttacgggggtggaaaaaagtactctggt
ccttgtggaggacgagattgttcagtctgtcagtgttttcctgaaaaaggatctcggggc
caacctggtcatctagggccccaaggccccattggagatttaggacctcctgggcccatt
gggttaccaggagagaaaggaatgaaaggagacagtggatttcctggaccagtgggaaat
caaggagataagggcccaacaggagtacctggtttcccaggcttggatggcatacctggg
tatccaggacctccaggaccaagaggcaaacctggcattgatggccgcaatggttcaaga
ggagatccagggtttccaggagagagaggagtaccaggaccagaaggtccagctggtctt
caggggccagcaggagaaaaagggaattcagtgtatatttcaggtgctattaaaggcctc
ccgggagatcgaggtgatccaggatctccaggtttacctggacctcgaggggcaaatgga
tcaatgggtgaaataggatttccagggataccaggttctagaggacctccaggctatcct
gggaggccaggtgatcagggcaatccaggtgttggagtaaaaggccaaattggagatccg
ggtgaagttggtcttcaaggttttccaggaccaaccttactgattgaacctcctggttca
tgtctcattaaaggagaaaaggggatgaaaggaatgcccggtgtcattggtccctctgga
ccatatggagtcaagggtgaacctggtattggagtgaagggtgaaaaaggtacttctgga
ttcccaggacctcagggtggccctggttcccatggacttccaggttttccaggagtaaag
ggagaacttggagcaagtggtgtccgtgggcagcttggagtttttggcactaagggagat
cctggagagcgtggaaagccaggaccaccaggtgttttgacaaccccacctgttccttgc
aaaggacctccaggggacccaggattcccaggactctatggagaaaaggggtcagtgggg
ttgcctggtccagcaggaccccctggcagaccaggggaagtcagaataggaatgacagga
ccccctgggcctccagggttgccaggtttgcaaggtcttcaaggtgaagctggaaatcct
ggaagaggtgattcacctgcaggaaaaccagggactccaggacatccaggcataccagga
gtcccagggcaacaaggtcaacctggatcagatgtcatacattgtagcactggccctcct
ggaccacaaggagtgaaaggtcaatcaggacccccaggaagaagaggcacaaaaggagaa
aaaggaaatgaagggctgtgttcatgttgcccaggtcccctgggccttcctggtgctcca
gggccacatggtcttcagggtaggaaaggagaagtgggtctacctgggagagtaggagaa
aaaggtgacccaggctctcctggtgctcaaggatccccaggacctccaggaatacctggt
acctcggggctaactggtcctaaaggagaaaagggtgatatggttatagcaagaatcaaa
ggaagaaaaggagaaaagggtcatgatgggcctcctggatatccagggcaggaaggcgtt
cctggtcagaatggactgtatggaaaaagaggcgagccaggtctcccaggcagtcgtggc
ggagcaattccaggtgcagaaggatctcctggagcaccaggatctcctgggagaatggga
ccaatggggcctccaggactgggatttcctggcccagaaggggtgagaggtggaccaggg
aaaactggtcaccctggtgtgaagggagcatcaggttccaagggtcagaaaggtgacacc
tatccttgtccttccccctatccagggagccctggagctccagggattccaggtgctcca
ggtctgaagggatttccggggcctccaggtgaagctgggcaaccaggatttgatggacag
aagggccaacaaggcaaatcaggaatatcaggaatactgggaccacctggttttcgaggt
gaccctggagagtcaggtcctgaaggtgtagaggggttggcccctattggtcctccaggc
cctccagggttaccaggaatcaatggccaaaaaggaatgccaggtgattctgcttatgga
ccaccggggcttccaggaaagatgggccctccaggattgcctgggcttaaagggctcaga
ggggatccaggatttcctggagctacaggggcaactggtgttccaggattcccaggcatt
aaaggaattcaaggcagaaaagggaatgctggacctcctggtaacccaggtttgccaggc
catggatgccgaggtgttccagggttgccagggcaacctggactccctggatcaccagga
attccagggtttgatggcttgaaaggtcatcaaggagatagaggtcctcctggaccagct
ggaatgaaaggccttccaggaatcccaggacttccaggctcacctggacccccaggacca
caaggggatccaggctactatggagataatggctcaccgggtgccccaggtccaaaaggg
gcccaaggcctgccaggtatcccaggctttccaggagagcgaggaaggaaaggacaaatc
ggacttccaggtagaaagggtgacccaggagaagagggtctttgtggctttccaggagaa
ccaggagagccaggctctaaaggagaaaaaggctatccaggagatcaaggggatatttcc
atagtttcaagaaagggagaaaaaggtgatcttgggccacctggtagatatggattccca
ggagaaaaaggagataaagggagtcctggaatcccaggaataaaaggatttccaggaaga
gatgggctacctggattacaaagaggagaacctggcaagagtggacatccaggacttcca
ggccccccaggcctcccaggacctccagggctcagaggaataattggtttcccaggtatt
ccaggggacaagggtgagctgggcttaccaggcttgcctggactcccaggacctcatggg
acaagaggatctaaaggaaatcaaggtgatgatgcagcaagtctgtttgggccacctgga
ccaaagggagatccgggtatccctggatgtccaggatattttggagaacaaggagatcca
ggttttccaggccctccaggatctagtggactaccaggaagggaaggatctgctggatac
ccaggacctccaggatttccaggtgctccagggattcccgggctgaaagggtatcctggg
gaagcaggtgacccagggcctccaggggtcagaggcgatccaggaccaccagggcccaag
ggaataaaaggttcccctggatcaccaggattggatggcttgcctggactcaagggtcag
aaaggattaaagggtagttcaggcttaaatgaagttggtccaccaggccaaatgggactg
cctggacttaaagggaagaaaggggagcctggaagacctggagcttcttgtccaggtttt
cctggagacagaggtcctctaggtcctccaggcataccaggattgcctggacttcctggt
cctgtgggaaaaactcctgaagatgaaattcttccccctggtcctctgggagatcaaggt
ccccctggccctgatggaccaaaaggtccacctggaccaccaggttttccatgcagttat
gatatcctgaaaggagatccaggtgatgatggtcaaccaggtctacctggaccacaaggt
ccacccggacctccaggagccaaaggctttccaggatgtgaagggaaagatggccagaaa
ggtccgatggggtaccctgggcctccaggcccacctggatttcctggcccacctggagat
aaaggtttgcaaggaccccctgggagacctggccccactggtcctccaggttgtcaaggt
gaacctggcccacctgcagatacctcttcatgccccaaaatcccaggacttccgggggtt
ccaggcccaagaggaccagaagggaagatgggacacccagggcaacgaggtcttccaggc
tcaccaggacacaaaggcgagcgtggtgtggatggccaaaggggtttggatggtggtcct
ggtcctccaggaccacctggaagtaaaggagacccaggagaagacggctgtggaggggcg
ccaggtccccctggcccaactggggatccaggacccaaaggttctggaactagatacctt
aatggattcctactggttctccacagtcagacagacaaagaaccctcttgtcctgaggga
atgtccaagctgtggactggatatagcctgttgtatctggaaggccaggagaaagcacat
aaccaagacctaggtctggctggatcctgccttcccatgttcagcaccatgccatttgcc
tattgcaacatcaaccaagtctgccattacgccagaaggaatgacaaatcctattggctg
tcgagtgctgctcctctacccatgatgcccctctctgaagacgaaatacaagcttacatc
agccggtgtgtggtgtgtgaggctccaggtcaggcgatggctgtgcacagtcaggatcag
gccatccctccatgcccacccacctggagaagcctttggatagggtattcattcctaatg
cacacaggggcgggcgatcaaggcgggggccagtccctcacctcacctggaagctgcctg
gaagacttcagagccgcgccattcatcgagtgccaaggacggcaggggacttgtcacttt
ttcgctaatgaatacagcttctggctaaccacggtgccaccagactcgctgctctcctca
gctcctgtgccagacaccttaaaacaggaacgggcccagcgccagaagatcagcagatgc
caagtctgcatgaagtacagctaa
