Sarcophilus harrisii (Tasmanian devil): 100923249
Help
Entry
100923249 CDS
T02286
Symbol
COL4A2
Name
(RefSeq) collagen alpha-2(IV) chain
KO
K06237
collagen type IV alpha
Organism
shr
Sarcophilus harrisii (Tasmanian devil)
Pathway
shr04151
PI3K-Akt signaling pathway
shr04382
Cornified envelope formation
shr04510
Focal adhesion
shr04512
ECM-receptor interaction
shr04820
Cytoskeleton in muscle cells
shr04926
Relaxin signaling pathway
shr04933
AGE-RAGE signaling pathway in diabetic complications
shr04974
Protein digestion and absorption
shr05146
Amoebiasis
shr05165
Human papillomavirus infection
shr05200
Pathways in cancer
shr05222
Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:
shr00001
]
09130 Environmental Information Processing
09132 Signal transduction
04151 PI3K-Akt signaling pathway
100923249 (COL4A2)
09133 Signaling molecules and interaction
04512 ECM-receptor interaction
100923249 (COL4A2)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
100923249 (COL4A2)
09142 Cell motility
04820 Cytoskeleton in muscle cells
100923249 (COL4A2)
09150 Organismal Systems
09152 Endocrine system
04926 Relaxin signaling pathway
100923249 (COL4A2)
09154 Digestive system
04974 Protein digestion and absorption
100923249 (COL4A2)
09158 Development and regeneration
04382 Cornified envelope formation
100923249 (COL4A2)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
100923249 (COL4A2)
09162 Cancer: specific types
05222 Small cell lung cancer
100923249 (COL4A2)
09172 Infectious disease: viral
05165 Human papillomavirus infection
100923249 (COL4A2)
09174 Infectious disease: parasitic
05146 Amoebiasis
100923249 (COL4A2)
09167 Endocrine and metabolic disease
04933 AGE-RAGE signaling pathway in diabetic complications
100923249 (COL4A2)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
04147 Exosome [BR:
shr04147
]
100923249 (COL4A2)
00536 Glycosaminoglycan binding proteins [BR:
shr00536
]
100923249 (COL4A2)
Exosome [BR:
shr04147
]
Exosomal proteins
Exosomal proteins of other cancer cells
100923249 (COL4A2)
Glycosaminoglycan binding proteins [BR:
shr00536
]
Heparan sulfate / Heparin
Extracellular matrix molecules
100923249 (COL4A2)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
Collagen
C4
Motif
Other DBs
NCBI-GeneID:
100923249
NCBI-ProteinID:
XP_003765782
Ensembl:
ENSSHAG00000009660
LinkDB
All DBs
Position
3:161948857..162220341
Genome browser
AA seq
1704 aa
AA seq
DB search
MGSRMYPTSRLARQRCLLLFVLITVLTERAHAGVKKYDVPCGGRDCSGGCQCYPEKGARG
QPGQVGPQGYDGPPGLQGIPGLQGRKGDKGERGAPGITGPKGDVGQRGVSGFPGADGIPG
HPGQGGPRGRPGSDGCNGTTGDAGTQGPPGSGGLPGLPGPQGPKGQKGEPYAFSPAERDK
YRGDSGEPGFIGLPGPVGYPGSPGKIGPVGAPGRPGPPGPPGPKGQPGNRGLGFYGEKGE
KGDVGRPGPTGLAPDSDLATSFLENVVDHLDQYKGEKGNWGEPGWKGISEKGEEGIVGFP
GQRGLPGNNGREGDQGEKGIRGDVGFPGIDGFKGLPGDIGDRGPPGPPSYSPHPSVIKGI
RGDPGDTGAYGPPGNPGAQGDPGQPGPPGTTIGDGDERRGLPGEMGAKGFIGDPGIPALY
PGPPGIDGKPGLPGKPGSPGPPGPDGFLFGLKGAEGNVGYPGPSGIPGTRGQKGWKGDPG
DCKCDIPSGLPGPPGPIGRPGANGEPGRKGEIGDPGAHGIPGFPGFKGNPGIPGLPGFKG
QKGDSKTITTKGSRGDQGNPGMPGLQGEDGFPGRDGLDGFPGLPGLPGDGIKGLPGDSGF
PGVPGLKGIPGESGPPGLGLPGPKGERGFPGDPGLSGSPGFPGPPGPPGTPGQTDCDGGV
KEPIGQEIIRPGCVIPPKGSQGLPGLPGSPGAKGQRGLPGLPGVDGFIGLKGFQGDAGRE
GFPGPPGFVGPRGSKGATGLPGLDGSPGSSGLPGSVGPAGQKGLPGEVLGAQTGLPGDPG
LPGIPGLKGEQGNRGPQGFRGSPGMPGMPGLKGQPGFPGPSGQPGLPGPPGTHGFPGPQG
QAGLPGLPGPSGFAGLPGDRGDRGDIGVPGPVGMKGLDGDRGDTGFIGEQGRRGEPGFQG
IPGLPGIPGTKGFKGSPGMEGFKGMLGIKGRSGIIGYKGEIGDFGIPGLKGLTGEPGLKG
SRGEPGPPGAPPKFLPGMKDFKGEKGDEGPVGGKGYWGIKGGQGMPGIPGQAGIPGLPGR
PGHIKGAKGEIGVPGVPGLQGFPGVIGSPGIRGFPGFVGVRGDKGMPGQEGLYGETGSIG
DSGDEGDTINLPGRPGLKGEVGTSGLAGEKGFTGEKGTLGDHGFTGLMGIKGSQGLPGLK
GQTGFPGLTGLPGPQGDPGRAGPPGEKGDQGWPGSPGLSGFPGIRGISGLHGLPGTKGFP
GSPGTDGYGNPGFPGLFGDKGERGEPNTIPGPLGAPGQKGERGTIGEQGPSGSPGPQGLP
GFTSPSNISGLPGDKGSPGLFGPQGYRGPPGAPGPTSFPGIKGEVGSPGFTGERGPKGWP
GDPGPQGRPGVYGLPGEKGPKGQQGLMGYHGIPGSIGDRGPKGPKGDRGFPGIPGAMGSP
GIPGVPLRIATEPGVAGPHGKRGPPGMQGEMGPQGPPGDPGLRGLPGKPGPQGRGGMSAL
PGFRGDEGPMGHQGPVGQEGEPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQEPMCPVGM
NKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDICHYASRNDKSYWLS
TTAPLPMMPVAEDEIKPYISRCSVCEAPAVAIAVHSQDVSIPHCPAGWRSLWIGYSFLMH
TAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYFANKYSFWLTTIPEQNFQASP
SADTLKAGLIRTHISRCQVCMKNL
NT seq
5115 nt
NT seq
+upstream
nt +downstream
nt
atgggcagcagaatgtatccaacttccagacttgcccgacagcgatgtctgctgctgttc
gtgctgatcacggtgctaactgagagggcacatgctggtgtgaaaaaatatgatgtgcct
tgtggtggaagagactgcagtggaggttgtcaatgctatcctgagaaaggagcccggggc
caaccaggacaagttgggccacaagggtatgatggaccaccagggctacaaggtattcca
ggattacaaggacgcaaaggtgataaaggtgaacggggagcaccaggaataacaggaccc
aaaggagatgtgggacaaagaggtgtttctggatttcctggagctgatggaattcctggt
catccaggtcaaggtggacctcggggaagaccaggcagtgatggttgtaatggaactact
ggagatgctggtacacagggacccccaggttctgggggtttacctgggctccctgggcca
caaggacccaaaggtcagaaaggcgaaccttatgcattttctccggctgaacgagataaa
tataggggtgattcaggagaacctggttttattggtttaccgggacctgtaggttatcct
ggcagtccgggaaaaataggtcctgttggagcccctgggcgaccaggacctcctggacct
ccaggaccaaaaggacaaccaggcaacagaggtcttggtttttatggtgaaaagggtgaa
aagggtgatgttggacgaccaggacccactggacttgcaccagatagtgatcttgctact
tcctttcttgaaaacgtggtggaccatctagatcaatataagggtgaaaaaggaaactgg
ggagaaccaggatggaaaggaatttcagaaaaaggtgaagaaggaatagtgggctttcca
ggacaacggggtctgccaggcaacaacggtagagaaggagatcaaggagagaaaggaatc
agaggagatgttggctttccaggaattgacggctttaaaggactaccgggagacataggt
gaccgaggtccccctggaccaccttcatattcaccccatccttctgtcataaaaggtata
agaggtgatccaggagatacaggggcttatggtccccctggaaatccaggtgcacaagga
gatcctggccaaccaggtcccccagggacaacaataggagacggagatgaacgaagaggc
cttccaggtgaaatgggagccaagggttttataggtgatccaggaattccagccttgtac
ccaggacctccaggtatagatggaaaaccaggacttccagggaaacctggatctcctggc
ccaccaggaccagatggatttctttttggacttaagggagcagagggaaatgtgggttac
ccaggcccttcaggaatcccaggaacacgaggacaaaaaggatggaaaggtgatcctgga
gattgtaaatgtgatattcctagtggtcttcctggtccaccaggacccatcggtcgtcct
ggtgcaaatggagaacctgggaggaagggagaaattggtgatccaggtgcacatggtatt
cctggtttcccaggtttcaagggtaaccctggcattcctggattacctgggtttaaagga
cagaaaggagactctaaaacaattacaacaaaaggtagcagaggtgatcaaggcaatcct
ggcatgcctggtttgcaaggtgaagatggctttccaggacgagatggactagatggtttt
ccaggccttccaggtcttcctggtgatggcatcaaaggccttccaggagattcaggtttc
ccaggagtacctggtttaaaaggcattccaggagaaagtggtcctccaggcctaggtcta
cctggcccaaaaggtgaacgaggctttcctggtgaccctggattaagtggaagtcctggc
ttcccaggccctccaggccccccaggcactccaggccaaacagattgtgatggaggtgtg
aaagagccaattggccaggagataatccgacccggttgtgtaataccacctaaaggatct
caagggttgcctggacttccaggatctccaggtgctaaaggtcaaagaggacttccagga
ttaccaggcgtagatggatttattggacttaaaggattccaaggagatgcaggtcgtgaa
ggatttccaggaccaccaggatttgttggacccagaggatccaaaggtgcaacagggctc
cctggactggatggaagcccaggctcttcaggattaccaggttctgtaggaccagcagga
caaaaaggattacctggagaagtattaggtgcacagactggtctcccaggagatcctgga
ttgcctggaatcccagggttgaaaggagaacaaggaaataggggacctcaaggatttaga
ggaagcccaggtatgcctggaatgcctgggttgaaaggtcagcctggattcccaggtcct
tcaggtcaacctggattaccagggcctccaggaacacatggctttccaggacctcaaggc
caagcaggactaccaggtcttccaggaccttcaggatttgcaggtctacctggtgataga
ggtgatcgaggtgacataggagtccctggccctgtggggatgaaaggtctggatggtgat
agaggtgatactggttttataggagagcaaggcagacgaggagaacctggatttcaaggg
atacctggattgcctggcatcccaggaacaaaaggtttcaaaggatctcctggtatggaa
ggtttcaaaggcatgctgggcatcaaaggaagatcaggaatcattggatataaaggagaa
attggcgattttggaattcctggtttgaaaggtctaactggtgaaccaggcttgaaaggt
agccgtggagaaccaggacctccaggagcacctccaaaatttctgccaggaatgaaagac
ttcaaaggagaaaaaggagatgaaggacctgttggagggaagggatattggggcataaaa
ggtggacaaggaatgccaggcattcctggacaggcaggaattcctgggttacctggaaga
cctggtcacatcaaaggggccaaaggtgaaatcggtgttcctggagtgcctggtttgcaa
gggttccctggtgtaataggttctcctggtatcagaggttttcctggatttgtaggtgtt
aggggtgacaagggtatgcctggacaagaaggtctttatggtgagactggaagtatagga
gattctggtgatgagggagacactataaatttaccaggaaggccagggttaaagggagaa
gtgggcacatctggacttgcaggtgaaaagggattcactggagaaaaaggcacactagga
gatcatggtttcacaggactgatgggcataaaaggatcccaaggcctccctggactgaaa
ggacagacaggttttccaggactgacaggactcccaggaccccaaggagacccaggtcgt
gctggacctcctggtgaaaagggagatcaaggatggccagggagtccaggcttatcaggt
ttccccggcatcagaggaatcagtggattacatggcttaccaggcaccaaaggcttccct
ggatctccaggtacagatggctatgggaatcctggctttcctggtctttttggtgacaaa
ggagaaagaggagagcccaacacaattcctggccccctgggagcaccaggacagaaggga
gaaaggggaaccataggggaacaaggcccatctggtagcccaggacctcaaggacttcca
ggattcacttcaccttccaatatctctgggttaccaggtgacaaaggatccccaggttta
tttggaccacagggttaccgaggcccccctggagcacctggaccaacatctttccctgga
atcaaaggagaagtaggaagtccaggatttacaggagaaaggggacccaaaggatggcca
ggagatcccggaccccagggcagacctggagtatatggactcccaggagaaaaaggaccc
aaaggacaacaaggacttatgggataccatgggataccaggaagtattggtgacagaggc
cccaagggacctaaaggagatagaggattccctggcattcctggtgctatgggatctcct
gggattccaggagtccccctgaggattgctacagaacctggggtggcaggtccccatgga
aagagagggccccctggaatgcaaggagagatgggtccccaaggtcctccaggtgatcca
ggtttgcgaggactgcctggaaaaccaggtccccaaggaagaggtggtatgtcagccctt
ccaggattcagaggtgatgaaggtcccatgggtcatcaaggaccagtaggccaagaaggt
gaaccaggccgaccaggaagtcctgggctacctggaatgccaggccgtagtgttagcatt
ggttatctcctggtgaaacacagtcagactgatcaagagccaatgtgtccagttggcatg
aataaactctggagtggatatagtctactatactttgaaggccaggaaaaagcacacaac
caagatcttggactggctggctcgtgcttggcccgtttcagcaccatgcctttcctgtac
tgcaaccctggagacatctgccattatgccagccggaacgacaaatcctactggctctca
acaactgcaccactgcctatgatgccggtggcagaagacgaaattaaaccttatatcagc
cgttgctctgtatgcgaagcccctgccgtggccattgccgtccacagccaagatgtctcc
atcccccactgtcctgcagggtggcgcagcttatggattggctactcctttctcatgcac
acagccgcgggggacgagggcggcggacagtcgctggtgtctcccggaagctgtctggag
gacttccgggccacgccttttatcgaatgcaacggcggccgaggcacctgccactacttc
gccaacaagtacagcttttggcttaccaccattcccgagcagaacttccaggcctcccca
tcggcagacaccctgaaggccgggctcatccgaactcacatcagccgctgccaggtctgc
atgaagaacctttaa
DBGET
integrated database retrieval system