KEGG   Sarcophilus harrisii (Tasmanian devil): 100923249
Entry
100923249         CDS       T02286                                 
Symbol
COL4A2
Name
(RefSeq) collagen alpha-2(IV) chain
  KO
K06237  collagen type IV alpha
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr04151  PI3K-Akt signaling pathway
shr04382  Cornified envelope formation
shr04510  Focal adhesion
shr04512  ECM-receptor interaction
shr04820  Cytoskeleton in muscle cells
shr04926  Relaxin signaling pathway
shr04933  AGE-RAGE signaling pathway in diabetic complications
shr04974  Protein digestion and absorption
shr05146  Amoebiasis
shr05165  Human papillomavirus infection
shr05200  Pathways in cancer
shr05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:shr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100923249 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100923249 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100923249 (COL4A2)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100923249 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100923249 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    100923249 (COL4A2)
  09158 Development and regeneration
   04382 Cornified envelope formation
    100923249 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    100923249 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    100923249 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100923249 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100923249 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100923249 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:shr04147]
    100923249 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:shr00536]
    100923249 (COL4A2)
Exosome [BR:shr04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   100923249 (COL4A2)
Glycosaminoglycan binding proteins [BR:shr00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100923249 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 100923249
NCBI-ProteinID: XP_003765782
Ensembl: ENSSHAG00000009660
LinkDB
Position
3:161948857..162220341
AA seq 1704 aa
MGSRMYPTSRLARQRCLLLFVLITVLTERAHAGVKKYDVPCGGRDCSGGCQCYPEKGARG
QPGQVGPQGYDGPPGLQGIPGLQGRKGDKGERGAPGITGPKGDVGQRGVSGFPGADGIPG
HPGQGGPRGRPGSDGCNGTTGDAGTQGPPGSGGLPGLPGPQGPKGQKGEPYAFSPAERDK
YRGDSGEPGFIGLPGPVGYPGSPGKIGPVGAPGRPGPPGPPGPKGQPGNRGLGFYGEKGE
KGDVGRPGPTGLAPDSDLATSFLENVVDHLDQYKGEKGNWGEPGWKGISEKGEEGIVGFP
GQRGLPGNNGREGDQGEKGIRGDVGFPGIDGFKGLPGDIGDRGPPGPPSYSPHPSVIKGI
RGDPGDTGAYGPPGNPGAQGDPGQPGPPGTTIGDGDERRGLPGEMGAKGFIGDPGIPALY
PGPPGIDGKPGLPGKPGSPGPPGPDGFLFGLKGAEGNVGYPGPSGIPGTRGQKGWKGDPG
DCKCDIPSGLPGPPGPIGRPGANGEPGRKGEIGDPGAHGIPGFPGFKGNPGIPGLPGFKG
QKGDSKTITTKGSRGDQGNPGMPGLQGEDGFPGRDGLDGFPGLPGLPGDGIKGLPGDSGF
PGVPGLKGIPGESGPPGLGLPGPKGERGFPGDPGLSGSPGFPGPPGPPGTPGQTDCDGGV
KEPIGQEIIRPGCVIPPKGSQGLPGLPGSPGAKGQRGLPGLPGVDGFIGLKGFQGDAGRE
GFPGPPGFVGPRGSKGATGLPGLDGSPGSSGLPGSVGPAGQKGLPGEVLGAQTGLPGDPG
LPGIPGLKGEQGNRGPQGFRGSPGMPGMPGLKGQPGFPGPSGQPGLPGPPGTHGFPGPQG
QAGLPGLPGPSGFAGLPGDRGDRGDIGVPGPVGMKGLDGDRGDTGFIGEQGRRGEPGFQG
IPGLPGIPGTKGFKGSPGMEGFKGMLGIKGRSGIIGYKGEIGDFGIPGLKGLTGEPGLKG
SRGEPGPPGAPPKFLPGMKDFKGEKGDEGPVGGKGYWGIKGGQGMPGIPGQAGIPGLPGR
PGHIKGAKGEIGVPGVPGLQGFPGVIGSPGIRGFPGFVGVRGDKGMPGQEGLYGETGSIG
DSGDEGDTINLPGRPGLKGEVGTSGLAGEKGFTGEKGTLGDHGFTGLMGIKGSQGLPGLK
GQTGFPGLTGLPGPQGDPGRAGPPGEKGDQGWPGSPGLSGFPGIRGISGLHGLPGTKGFP
GSPGTDGYGNPGFPGLFGDKGERGEPNTIPGPLGAPGQKGERGTIGEQGPSGSPGPQGLP
GFTSPSNISGLPGDKGSPGLFGPQGYRGPPGAPGPTSFPGIKGEVGSPGFTGERGPKGWP
GDPGPQGRPGVYGLPGEKGPKGQQGLMGYHGIPGSIGDRGPKGPKGDRGFPGIPGAMGSP
GIPGVPLRIATEPGVAGPHGKRGPPGMQGEMGPQGPPGDPGLRGLPGKPGPQGRGGMSAL
PGFRGDEGPMGHQGPVGQEGEPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQEPMCPVGM
NKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDICHYASRNDKSYWLS
TTAPLPMMPVAEDEIKPYISRCSVCEAPAVAIAVHSQDVSIPHCPAGWRSLWIGYSFLMH
TAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYFANKYSFWLTTIPEQNFQASP
SADTLKAGLIRTHISRCQVCMKNL
NT seq 5115 nt   +upstreamnt  +downstreamnt
atgggcagcagaatgtatccaacttccagacttgcccgacagcgatgtctgctgctgttc
gtgctgatcacggtgctaactgagagggcacatgctggtgtgaaaaaatatgatgtgcct
tgtggtggaagagactgcagtggaggttgtcaatgctatcctgagaaaggagcccggggc
caaccaggacaagttgggccacaagggtatgatggaccaccagggctacaaggtattcca
ggattacaaggacgcaaaggtgataaaggtgaacggggagcaccaggaataacaggaccc
aaaggagatgtgggacaaagaggtgtttctggatttcctggagctgatggaattcctggt
catccaggtcaaggtggacctcggggaagaccaggcagtgatggttgtaatggaactact
ggagatgctggtacacagggacccccaggttctgggggtttacctgggctccctgggcca
caaggacccaaaggtcagaaaggcgaaccttatgcattttctccggctgaacgagataaa
tataggggtgattcaggagaacctggttttattggtttaccgggacctgtaggttatcct
ggcagtccgggaaaaataggtcctgttggagcccctgggcgaccaggacctcctggacct
ccaggaccaaaaggacaaccaggcaacagaggtcttggtttttatggtgaaaagggtgaa
aagggtgatgttggacgaccaggacccactggacttgcaccagatagtgatcttgctact
tcctttcttgaaaacgtggtggaccatctagatcaatataagggtgaaaaaggaaactgg
ggagaaccaggatggaaaggaatttcagaaaaaggtgaagaaggaatagtgggctttcca
ggacaacggggtctgccaggcaacaacggtagagaaggagatcaaggagagaaaggaatc
agaggagatgttggctttccaggaattgacggctttaaaggactaccgggagacataggt
gaccgaggtccccctggaccaccttcatattcaccccatccttctgtcataaaaggtata
agaggtgatccaggagatacaggggcttatggtccccctggaaatccaggtgcacaagga
gatcctggccaaccaggtcccccagggacaacaataggagacggagatgaacgaagaggc
cttccaggtgaaatgggagccaagggttttataggtgatccaggaattccagccttgtac
ccaggacctccaggtatagatggaaaaccaggacttccagggaaacctggatctcctggc
ccaccaggaccagatggatttctttttggacttaagggagcagagggaaatgtgggttac
ccaggcccttcaggaatcccaggaacacgaggacaaaaaggatggaaaggtgatcctgga
gattgtaaatgtgatattcctagtggtcttcctggtccaccaggacccatcggtcgtcct
ggtgcaaatggagaacctgggaggaagggagaaattggtgatccaggtgcacatggtatt
cctggtttcccaggtttcaagggtaaccctggcattcctggattacctgggtttaaagga
cagaaaggagactctaaaacaattacaacaaaaggtagcagaggtgatcaaggcaatcct
ggcatgcctggtttgcaaggtgaagatggctttccaggacgagatggactagatggtttt
ccaggccttccaggtcttcctggtgatggcatcaaaggccttccaggagattcaggtttc
ccaggagtacctggtttaaaaggcattccaggagaaagtggtcctccaggcctaggtcta
cctggcccaaaaggtgaacgaggctttcctggtgaccctggattaagtggaagtcctggc
ttcccaggccctccaggccccccaggcactccaggccaaacagattgtgatggaggtgtg
aaagagccaattggccaggagataatccgacccggttgtgtaataccacctaaaggatct
caagggttgcctggacttccaggatctccaggtgctaaaggtcaaagaggacttccagga
ttaccaggcgtagatggatttattggacttaaaggattccaaggagatgcaggtcgtgaa
ggatttccaggaccaccaggatttgttggacccagaggatccaaaggtgcaacagggctc
cctggactggatggaagcccaggctcttcaggattaccaggttctgtaggaccagcagga
caaaaaggattacctggagaagtattaggtgcacagactggtctcccaggagatcctgga
ttgcctggaatcccagggttgaaaggagaacaaggaaataggggacctcaaggatttaga
ggaagcccaggtatgcctggaatgcctgggttgaaaggtcagcctggattcccaggtcct
tcaggtcaacctggattaccagggcctccaggaacacatggctttccaggacctcaaggc
caagcaggactaccaggtcttccaggaccttcaggatttgcaggtctacctggtgataga
ggtgatcgaggtgacataggagtccctggccctgtggggatgaaaggtctggatggtgat
agaggtgatactggttttataggagagcaaggcagacgaggagaacctggatttcaaggg
atacctggattgcctggcatcccaggaacaaaaggtttcaaaggatctcctggtatggaa
ggtttcaaaggcatgctgggcatcaaaggaagatcaggaatcattggatataaaggagaa
attggcgattttggaattcctggtttgaaaggtctaactggtgaaccaggcttgaaaggt
agccgtggagaaccaggacctccaggagcacctccaaaatttctgccaggaatgaaagac
ttcaaaggagaaaaaggagatgaaggacctgttggagggaagggatattggggcataaaa
ggtggacaaggaatgccaggcattcctggacaggcaggaattcctgggttacctggaaga
cctggtcacatcaaaggggccaaaggtgaaatcggtgttcctggagtgcctggtttgcaa
gggttccctggtgtaataggttctcctggtatcagaggttttcctggatttgtaggtgtt
aggggtgacaagggtatgcctggacaagaaggtctttatggtgagactggaagtatagga
gattctggtgatgagggagacactataaatttaccaggaaggccagggttaaagggagaa
gtgggcacatctggacttgcaggtgaaaagggattcactggagaaaaaggcacactagga
gatcatggtttcacaggactgatgggcataaaaggatcccaaggcctccctggactgaaa
ggacagacaggttttccaggactgacaggactcccaggaccccaaggagacccaggtcgt
gctggacctcctggtgaaaagggagatcaaggatggccagggagtccaggcttatcaggt
ttccccggcatcagaggaatcagtggattacatggcttaccaggcaccaaaggcttccct
ggatctccaggtacagatggctatgggaatcctggctttcctggtctttttggtgacaaa
ggagaaagaggagagcccaacacaattcctggccccctgggagcaccaggacagaaggga
gaaaggggaaccataggggaacaaggcccatctggtagcccaggacctcaaggacttcca
ggattcacttcaccttccaatatctctgggttaccaggtgacaaaggatccccaggttta
tttggaccacagggttaccgaggcccccctggagcacctggaccaacatctttccctgga
atcaaaggagaagtaggaagtccaggatttacaggagaaaggggacccaaaggatggcca
ggagatcccggaccccagggcagacctggagtatatggactcccaggagaaaaaggaccc
aaaggacaacaaggacttatgggataccatgggataccaggaagtattggtgacagaggc
cccaagggacctaaaggagatagaggattccctggcattcctggtgctatgggatctcct
gggattccaggagtccccctgaggattgctacagaacctggggtggcaggtccccatgga
aagagagggccccctggaatgcaaggagagatgggtccccaaggtcctccaggtgatcca
ggtttgcgaggactgcctggaaaaccaggtccccaaggaagaggtggtatgtcagccctt
ccaggattcagaggtgatgaaggtcccatgggtcatcaaggaccagtaggccaagaaggt
gaaccaggccgaccaggaagtcctgggctacctggaatgccaggccgtagtgttagcatt
ggttatctcctggtgaaacacagtcagactgatcaagagccaatgtgtccagttggcatg
aataaactctggagtggatatagtctactatactttgaaggccaggaaaaagcacacaac
caagatcttggactggctggctcgtgcttggcccgtttcagcaccatgcctttcctgtac
tgcaaccctggagacatctgccattatgccagccggaacgacaaatcctactggctctca
acaactgcaccactgcctatgatgccggtggcagaagacgaaattaaaccttatatcagc
cgttgctctgtatgcgaagcccctgccgtggccattgccgtccacagccaagatgtctcc
atcccccactgtcctgcagggtggcgcagcttatggattggctactcctttctcatgcac
acagccgcgggggacgagggcggcggacagtcgctggtgtctcccggaagctgtctggag
gacttccgggccacgccttttatcgaatgcaacggcggccgaggcacctgccactacttc
gccaacaagtacagcttttggcttaccaccattcccgagcagaacttccaggcctcccca
tcggcagacaccctgaaggccgggctcatccgaactcacatcagccgctgccaggtctgc
atgaagaacctttaa

DBGET integrated database retrieval system