KEGG   Sarcophilus harrisii (Tasmanian devil): 100923506
Entry
100923506         CDS       T02286                                 
Symbol
COL4A1
Name
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr04151  PI3K-Akt signaling pathway
shr04382  Cornified envelope formation
shr04510  Focal adhesion
shr04512  ECM-receptor interaction
shr04820  Cytoskeleton in muscle cells
shr04926  Relaxin signaling pathway
shr04933  AGE-RAGE signaling pathway in diabetic complications
shr04974  Protein digestion and absorption
shr05146  Amoebiasis
shr05165  Human papillomavirus infection
shr05200  Pathways in cancer
shr05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:shr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100923506 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100923506 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100923506 (COL4A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100923506 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100923506 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100923506 (COL4A1)
  09158 Development and regeneration
   04382 Cornified envelope formation
    100923506 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100923506 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100923506 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100923506 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100923506 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100923506 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:shr04147]
    100923506 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:shr00536]
    100923506 (COL4A1)
Exosome [BR:shr04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100923506 (COL4A1)
Glycosaminoglycan binding proteins [BR:shr00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100923506 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100923506
NCBI-ProteinID: XP_003765783
Ensembl: ENSSHAG00000011617
LinkDB
Position
3:complement(161755929..161948666)
AA seq 1668 aa
MSPRPVVWLLLLLAALLLNEEQSKAAAKGCSGSGCGKCDCNGVKGQKGERGLPGLQGVIG
FPGMQGPEGPHGPPGNKGDAGEPGLPGTKGTRGPPGSSGYPGNPGLPGIPGQDGPPGPPG
IPGCNGTKGDRGPVGPPGLPGFSGIAGPPGLPGMKGDPGEILSLIPGMPLKGERGFPGAP
GPMGPRGNPGLPGMDGPPGYSGPPGPPGPPGLPGEKGQMGLSFQGPKGDKGEQGVSGPPG
EPGQAQVGKKGDLITQGPKGQKGEPGAPGMPGEREKGEPGKPGPRGKPGKDGEKGEKGSP
GLKGDAGFPGIPGQTGAKGERGFPGPQGLPGTVISTQPGGEKGEKGYPGAPGLKGEPGQK
GYPGIPGQPGPPGFPVPGQIGAPGFPGERGEKGDPGPPGQSLPGTSGRDGFPGLPGPPGP
PGLPGHTNGIVECQPGPPGDQGPPGSAGLPGLTGEMGQKGEKGQGCLTCDLDGVPGLPGP
QGPPGEIGFPGQPGAKGDRGLPGHNGLEGMPGPPGLPGLRGEPGAKGEPGEIYLDAKLKG
DKGEPGFPGQPGAPGRAGTPGRDGHPGLQGPKGSPGSIGLKGDRGPPGGVGFPGSRGDIG
PPGPPGFGLPGPVGDKGQAGFPGSPGSPGLPGPKGEAGKVVPLPGPPGAEGVPGPPGFPG
PQGDRGFPGSPGRQGPPGEKGSVGQPGIGFPGPPGPKGVDGLPGAIGQPGNPGRPGFDGL
PGNPGVPGQKGEPGVGLPGPKGSAGLPGIPGRPGEKGNVGGPGIPGEHGLIGPPGLQGLR
GDPGLQGIQGPKGAPGAPGIGPPGVMGPPGGQGPPGSAGPPGIKGEKGYPGFPGSDMPGP
KGDKGSPGLPGLTGQIGLPGPPGTQGTPGLPGFPGPKGEMGVMGTPGQPGPPGPVGSPGF
QGLKGDQGFPGTTGPRGDPGFKGDKGDAGLPGQPGSMDKVDMDSMKGQKGDQGEKGQTGP
IGDKGARGDPGTPGVPGKDGQAGTPGQPGPKGDQGLGGSPGAPGLPGPKGSVGGMGLPGS
PGEKGAQGIPGSQGIPGFPGLKGEKGVKGEAGRPGIGTPGLPGEKGDTGPSGFPGRPGEK
GEKGSPGMPGMPGPPGPGGLPGNVGYPGSPGMPGEKGDKGLPGLDGVPGVKGDAGQPGLP
GPAGPGGQKGEPGGDGIPGSAGEKGEPGLPGRGLPGFPGSKGEKGSKGELGFPGSAGSPG
IPGLKGEPGFMGPPGPPGQQGLPGVPGRAVEGPKGDRGPQGQPGIPGVPGPMGPPGLPGL
DGAKGEKGNPGWPGSPGAPGAKGDPGFQGMPGIGGTPGITGSKGDMGPPGVPGFQGAKGS
PGLQGLKGDQGDQGLPGSKGLPGPPGSPGPYSLIKGEPGIPGPEGPVGLKGIPGPPGPKG
QQGVTGSVGIPGPPGNPGFDGAPGQKGEPGPFGPPGPKGHPGPPGPDGLPGSMGPPGTPS
VDHGFLVTRHSQTIDDPQCPPGTKIIYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTMP
FLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPISGENIRPFISRCAVCEAPAMVMAVH
SQTIQIPQCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGT
CNYYANAYSFWLATIERNEMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5007 nt   +upstreamnt  +downstreamnt
atgagtccacggccggttgtttggctgctgctgttgctggctgccctcctgctcaacgag
gagcagagcaaagcagctgcaaagggttgttctggttctggctgtggaaaatgtgactgc
aatggagtaaaaggacaaaaaggtgaacgaggtcttccaggattacaaggtgttatcggg
tttcctggaatgcaagggcctgaaggtccacatgggcctccgggtaataagggggatgct
ggagaaccaggactaccaggaacaaaaggaacgagagggccaccaggctcatctggttat
cctggaaaccctggacttcctggtattccaggccaagatggtccaccaggtcctcctgga
atcccaggatgcaatggtacaaagggtgatagaggtccagtgggtcctcctggtttgcct
ggattttctgggatagcaggcccccctggattgccaggaatgaagggagacccaggtgaa
atccttagtctaatcccaggcatgccattgaaaggtgaaaggggatttcctggagcccct
ggaccaatgggtccaagaggaaacccagggttaccaggcatggatggtccaccaggatat
tcgggaccacctggtccaccaggtcctccaggacttccaggtgaaaagggtcaaatgggt
ttgagttttcaaggacccaaaggtgacaagggtgaacaaggtgttagtggacctccaggt
gaaccaggacaagctcaagtaggaaaaaagggagatcttattactcagggaccaaaaggt
caaaaaggagaacctggagctcctggaatgccaggtgaaagagagaaaggtgaacctggt
aaaccagggcctcgagggaaacctggaaaggatggtgaaaaaggagaaaagggaagtcca
ggtctaaaaggtgatgcaggattcccaggaatcccaggccaaacgggtgctaagggagag
agaggattcccaggccctcaaggattaccaggcactgtgataagcacacaaccagggggt
gaaaaaggagagaagggatatccaggtgctccaggattaaaaggcgaaccaggtcaaaaa
ggttatccaggtattccaggccaaccaggtcctccaggcttcccagtccccggacaaatt
ggtgctcctggctttcctggtgaaaggggagaaaagggtgatccaggtcctccaggacaa
tcattgcccggaacaagtggaagagatgggttcccaggacttcccgggcctcctggacct
cctggcttaccaggccatacaaatggaatagtggaatgccagcctggaccacctggagat
caaggaccaccgggaagtgctgggctaccaggattgacaggggagatgggacagaaaggt
gagaaaggtcaaggctgcctaacctgtgatctggatggagttcctggacttccaggtcca
caaggaccaccaggggaaataggttttccaggacagccaggtgctaagggagacagaggt
ttacctggacataatggtcttgaagggatgcctgggcctccaggtttgccaggactgagg
ggtgagccaggagcaaagggagagcctggagagatttatttggatgcaaagctgaagggt
gacaaaggagagccaggttttccaggacaacctggtgcccctggaagagcaggaacacct
ggaagagatggccatcctggtcttcagggccccaaaggatcaccgggttcaataggacta
aaaggagaccgtggtccccctggaggagttggttttcctggaagtcgtggtgacataggc
cctcctggcccaccaggatttggcctacctggtccagttggtgacaaagggcaagcaggt
tttccaggaagccctgggtccccaggtctcccaggtcctaagggtgaagcaggaaaagtg
gttcctctacctggtcccccgggagcagaaggagttccaggacccccaggtttcccagga
ccccaaggtgacagaggttttcctggcagcccaggaagacaaggacctccaggagaaaaa
gggtctgttggccaaccaggaatagggtttccaggacccccaggaccaaaaggtgttgat
ggcttgcctggagctataggtcagcctggaaatccaggtcgtccaggatttgatggttta
ccaggaaatccaggtgttccaggccaaaaaggtgagcctggagttggtctcccaggtccc
aaaggatcagcaggtcttcctggcattcctggacgccctggtgaaaaggggaatgttggt
ggaccagggatcccaggagaacatggacttatagggccacctgggctacaaggactcaga
ggtgacccagggctgcaaggaatccaaggcccaaaaggtgcccctggtgctccaggaata
ggccctcccggagttatgggcccccctggaggacaggggccaccaggatcagcaggtcct
cctggaatcaaaggagaaaagggttaccctggattcccaggatcagatatgccaggaccc
aaaggagataagggaagtccaggtcttcctggcttaacaggtcaaataggattacctggg
cctcctggcacacaagggactcctggacttccaggttttccaggtcccaagggagagatg
ggagtaatgggaacccctggtcagccaggtcctcctggaccagtaggttctccaggtttc
caaggactaaaaggtgaccaaggtttcccaggtacaacaggacccaggggtgatccaggt
ttcaaaggtgataagggtgatgctggtctccctggacagccaggatcaatggataaagtg
gacatggacagtatgaagggccagaaaggagatcaaggagaaaagggacaaactgggcca
attggagataaaggagccagaggagatcctggcactccaggagtacctgggaaggacgga
caagcaggaactcccggccagccaggtcctaaaggtgatcaaggtcttgggggatctcca
ggtgctccaggactaccaggaccaaagggttctgttggtggaatgggtttgccagggtca
cctggagaaaaaggagcacaaggaatcccaggttctcaaggtatcccaggcttcccagga
ttaaaaggagaaaagggagtgaaaggagaagcaggtcgcccaggcattgggactcctgga
cttcctggtgaaaagggagacactggaccttcaggtttcccaggaaggcctggagaaaag
ggagaaaaaggaagcccagggatgccaggaatgcctggtcctccaggtcctggaggatta
cctgggaatgtaggctatccagggagccctggaatgcctggagagaagggtgacaaagga
cttcctggtttggatggtgttcctggagttaaaggagatgcaggtcagcctggtcttcca
ggtccagcaggtccaggtggtcaaaaaggtgaaccaggtggtgatggaattccaggatca
gctggtgagaagggtgaaccaggtcttccaggaagaggacttccaggatttccaggttcc
aaaggagaaaaaggttccaagggagagttgggtttcccaggatcagcaggaagtccagga
attcctggactcaaaggagaaccaggatttatgggtcctccgggtcccccaggacaacaa
ggattacctggagttcctggtcgagcggtagaaggtcctaaaggagacaggggtccacaa
ggtcaacctggcatacctggtgttcctggacccatgggccctcctggccttccaggactt
gatggagcaaaaggagaaaaaggaaatccaggttggccaggatccccaggtgctccagga
gctaagggagatccaggatttcagggaatgcctggtattggtggtacaccaggaatcact
ggttctaagggtgatatgggacctccaggtgttccaggatttcaaggtgccaaaggttct
cctggcctccaggggctcaaaggagatcaaggggatcagggactaccaggttcaaaaggt
cttccaggaccacctggttccccaggtccttatagcctaatcaaaggagagccaggtatc
cctggacctgaagggccagtcgggcttaaaggtattccaggccctccaggccctaaagga
caacaaggtgtaacgggctctgtgggtattcctggaccacctggtaacccaggatttgat
ggtgcacctggacaaaaaggagaaccaggcccttttggtcctcctggtccaaaaggacat
cctggtccacccggtccagatgggcttccagggtcaatgggtccaccaggtaccccctct
gtagatcatggcttccttgttactaggcacagtcaaaccatagatgatccacaatgccct
cctgggaccaaaataatttaccatggctattccttgctctacgtacaggggaatgaacgt
gcacatggtcaggacttgggtacggctggaagctgtttgcgaaaattcagtactatgcca
tttctcttctgcaatattaacaatgtttgcaactttgcatcaagaaatgactactcttat
tggttgtctactcctgagcccatgccaatgtcaatggcacctatctctggtgaaaacatt
aggccttttattagtaggtgtgctgtatgtgaggctcctgccatggtgatggcagtccat
agtcagacaatccaaataccacagtgtcccagtggatggtcatcactttggattggctat
tcttttgttatgcacaccagcgctggtgccgaaggttctggccaggcactagcatcccct
ggttcctgtctggaagaatttagaagtgcacccttcattgaatgccatggtcgtgggact
tgtaattattatgcaaatgcttatagcttctggcttgctactatagaaagaaatgagatg
ttcaagaagcctactccatcaactttaaaagcaggagaactgcgcacacacgttagccgc
tgccaagtctgtatgagaagaacataa

DBGET integrated database retrieval system