Sarcophilus harrisii (Tasmanian devil): 100916041
Help
Entry
100916041 CDS
T02286
Symbol
COL1A2
Name
(RefSeq) collagen alpha-2(I) chain
KO
K06236
collagen type I alpha
Organism
shr
Sarcophilus harrisii (Tasmanian devil)
Pathway
shr04151
PI3K-Akt signaling pathway
shr04510
Focal adhesion
shr04512
ECM-receptor interaction
shr04611
Platelet activation
shr04820
Cytoskeleton in muscle cells
shr04926
Relaxin signaling pathway
shr04933
AGE-RAGE signaling pathway in diabetic complications
shr04974
Protein digestion and absorption
shr05146
Amoebiasis
shr05165
Human papillomavirus infection
shr05205
Proteoglycans in cancer
shr05415
Diabetic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:
shr00001
]
09130 Environmental Information Processing
09132 Signal transduction
04151 PI3K-Akt signaling pathway
100916041 (COL1A2)
09133 Signaling molecules and interaction
04512 ECM-receptor interaction
100916041 (COL1A2)
09140 Cellular Processes
09144 Cellular community - eukaryotes
04510 Focal adhesion
100916041 (COL1A2)
09142 Cell motility
04820 Cytoskeleton in muscle cells
100916041 (COL1A2)
09150 Organismal Systems
09151 Immune system
04611 Platelet activation
100916041 (COL1A2)
09152 Endocrine system
04926 Relaxin signaling pathway
100916041 (COL1A2)
09154 Digestive system
04974 Protein digestion and absorption
100916041 (COL1A2)
09160 Human Diseases
09161 Cancer: overview
05205 Proteoglycans in cancer
100916041 (COL1A2)
09172 Infectious disease: viral
05165 Human papillomavirus infection
100916041 (COL1A2)
09174 Infectious disease: parasitic
05146 Amoebiasis
100916041 (COL1A2)
09166 Cardiovascular disease
05415 Diabetic cardiomyopathy
100916041 (COL1A2)
09167 Endocrine and metabolic disease
04933 AGE-RAGE signaling pathway in diabetic complications
100916041 (COL1A2)
09180 Brite Hierarchies
09183 Protein families: signaling and cellular processes
00536 Glycosaminoglycan binding proteins [BR:
shr00536
]
100916041 (COL1A2)
Glycosaminoglycan binding proteins [BR:
shr00536
]
Heparan sulfate / Heparin
Extracellular matrix molecules
100916041 (COL1A2)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
Collagen
COLFI
Motif
Other DBs
NCBI-GeneID:
100916041
NCBI-ProteinID:
XP_031796637
Ensembl:
ENSSHAG00000005410
UniProt:
G3VSR0
LinkDB
All DBs
Position
5:complement(287644346..287672396)
Genome browser
AA seq
1367 aa
AA seq
DB search
MLSFADARLWLLLAATWCLAAAQAVQEPPRGRKGPQGDRGPRGQRGPPGPPGRDGEDGPP
GPPGPPGPPGLGGNFAAQYDASKGLDMGPGPMGLMGPRGPPGASGPPGAQGFQGPAGEPG
EPGQTGPAGARGPPGPPGKSGEDGHPGKPGRPGERGIVGPQGARGFPGTPGLPGFKGIRG
HNGLDGLKGQAGAPGVKGEPGAPGENGTPGQAGARGLPGERGRIGGPGPAGARGSDGSVG
PVGPAGPLGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGPRGELGLPGMTGPVGPAGNPG
ANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPAGAAGASGPRGLAGEPGPAGSKGESGNKG
EPGSAGPQGPPGPNGEEGKRGPNGEPGSTGPMGPPGLRGVPGSRGLPGADGRAGGMGPPG
NRGPSGPAGARGPNGDAGRPGEPGLMGPRGLPGSPGNVGPTGKEGPAGLPGIDGRPGPTG
PAGNRGEPGNIGFPGPKGPNGDPGKAGEKGHAGLAGARGAPGPDGNNGAQGPPGPTGVQG
GKGEQGPAGPPGFQGLPGPSGPAGEGGKVGERGLPGEFGLPGPAGPRGERGPPGESGAVG
PTGSIGSRGPSGPPGPDGNKGEPGVVGAPGNAGPAGSGGVPGERGAAGVPGGKGEKGETG
PRGEFGNPGRDGARGAPGAMGAPGPAGATGERGEAGPAGPVGPTGNRGAPGDRGEAGPAG
PNGFAGPPGAAGQAGAKGERGTKGPKGENGIVGPTGPVGAAGPAGPNGPPGPVGGRGDGG
PPGMTGFPGAAGRTGAPGPAGITGPPGPPGAAGKEGPRGPRGDQGPLGRAGETGAVGPPG
FAGEKGPPGEAGASGPPGSSGPQGLLGAPGILGLPGSRGERGLPGVSGSLGEPGPLGISG
PPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGLAGHKGERGYPGNPGAVGNAG
APGPHGTVGPAGKPGNRGEPGPVGSVGPVGPFGARGPSGPQGPRGDKGEVGDKGPRGMNG
FKGHNGFQGLPGISGQHGDQGAPGSTGPAGPRGPAGPSGPPGKDGRPGHAGAVGPAGLRG
SQGSQGPAGPPGPPGLPGPPGPSGGGYDFGYEGDFYRADQPRTQASLRPKDYEVDATLKS
LNNQIETILSPEGSKKNPARTCRDLRLSHPDWNSGFYWVDPNQGCTMDAIKVYCDFSTGE
TCIQAQPENIPAKAWHNPRSPQGKKHVWFGETLNGGTQFEYNMEGVTSKDMATQLAFMRL
LANHASQNITYHCKNSVAYMDAESGSLKKAVILQGSNDVELTAEGNSRFTYTVLEDGCSR
KNNEWGKTVIEYKTNKPSRLPFLDIAPLDIGGTDQELRLHIGPVCFK
NT seq
4104 nt
NT seq
+upstream
nt +downstream
nt
atgctcagctttgcggacgcgcgcctctggctgctgctcgcggccacttggtgcctggcc
gcagctcaggctgtacaggagccacccagaggaagaaagggaccacaaggagatagaggt
ccacgcggtcaaaggggtcccccaggccctccaggcagagatggtgaagatggtccccca
ggtcctcccggaccccctggtcctcctggtcttggtgggaacttcgcagcccaatacgat
gctagcaaaggattggacatgggcccaggaccaatgggtttgatgggacccagaggcccc
cctggtgcaagtggacctcctggtgcccaaggattccaaggacccgccggtgagccggga
gagcctggtcagactggccccgcaggtgcccgaggaccccctggacctcctggaaagagt
ggtgaagatggtcaccctggaaagccaggaaggcctggtgagagaggaattgtcggaccc
cagggcgctcgaggcttccccgggactcccggtctgcctggattcaagggaatccggggg
cacaatggtcttgatggactgaaaggacaagctggggctcctggagtcaagggtgaaccc
ggcgcccccggagaaaatgggacccctggccaagcgggagcccgaggcctcccaggagag
aggggacgcatcggagggcccggccccgcgggagctcgcggaagcgatggcagcgtcggc
cccgtcggtcccgccggtccccttggctcggcgggtccccctggcttcccaggcgcccct
ggccctaagggagaactcggccctgtcggcaaccctggccccgccggccccgccggccct
cgtggagagctgggtcttcctggcatgactggtcctgttggtcctgcggggaatcccggt
gccaatggtctgacgggagccaagggggcagcgggtctccctggtgtggctggtgccccc
ggtctgcctgggccccgtggcatccctggtcctgccggcgctgctggggcctccggtccg
agaggacttgcgggcgagcctggtcctgctggttccaaaggagagagtggaaacaaggga
gaaccgggctctgccgggccccaaggccctcccggccccaacggtgaagaaggcaagaga
ggacccaatggagagccaggatccacgggccccatgggcccccccggcctcagaggcgtc
cctggatcccggggtcttcctggagccgacggccgagctggaggcatgggtccccccggt
aaccgcggcccctccggacctgctggagcccgaggccccaatggagatgccggccggcct
ggcgagcccgggctcatgggcccccgtggtctccctggctcccctggaaacgtcggcccc
actgggaaggaaggccctgccggtctccctgggatcgatggccgacccggccccacaggt
cctgctgggaacagaggcgagcccgggaacatcggcttcccaggacccaaaggccctaat
ggcgatcctggcaaagccggagagaaaggtcacgctggtctcgctggtgctcggggcgcc
cctggccccgatgggaacaacggtgctcagggaccccccggccccacgggtgtacaaggt
gggaaaggcgaacagggtcccgctggtcctccaggcttccagggtctacctggcccctcg
ggccctgcaggtgaaggcggcaaagtaggagaaaggggtctccctggtgaatttggcctc
cctggccctgccggcccaagaggtgaacgtggtccccccggagaaagcggagctgtcggt
cctacaggttccatcggaagccgcggtccatctggtcccccgggccccgatggcaacaag
ggagagcctggtgtggttggtgcccctggaaacgcaggcccagctggctccggtggagtg
ccaggagagcgtggtgcagccggtgtccctggaggcaaaggagaaaagggtgaaactggt
cccagaggagaattcggaaacccaggcagagacggcgctcggggagctcctggtgccatg
ggtgccccgggtcccgccggagccactggggagaggggtgaagctggtcccgccggtcct
gttggtcccaccgggaaccgaggagctcctggtgaccgtggagaggccgggcctgcaggc
cccaacggatttgctggccctcctggcgctgctggccaagctggagcaaagggagaacgc
ggaaccaaggggcctaagggagagaacggcatcgtgggccccaccggccccgtgggcgca
gctggccccgcgggtcccaatggcccccctggtcctgttggaggtcgtggcgatggcggt
ccccctggtatgactggcttccctggtgctgctggaagaactggagcccccgggcctgct
ggtatcactggccctcccggtccccctggtgctgctggaaaagaagggccccggggtccc
cgtggtgaccagggtcccctcggccgtgctggagagacaggcgccgtcggcccccctggc
tttgctggagagaagggtccccccggagaggccggtgccagtggtccccccggctcctca
ggccctcaaggtcttctcggtgctcctggcattctgggtctccccggctcaagaggcgag
cgtggtcttcccggggtctcgggctctctgggtgaacccggtcctcttggcatttctggt
cctcccggagctcgaggcccccccggggccgtgggtagccctggcgtcaatggtgctcct
ggcgaagctgggcgtgatgggaaccccggcaacgatggtcctcccggccgagacggtctc
gctggtcacaagggagagcgaggctaccccggcaatcctggtgctgttggcaacgctgga
gctcctgggcctcatgggactgtgggtcctgctggcaagcctggaaaccgcggtgagccg
ggtcctgttggctcagtgggtcctgttggtcccttcggtgcaagaggtcccagtggtccc
caaggtccccgaggtgacaaaggagaagttggtgacaagggaccgagaggcatgaatggt
ttcaaaggacacaatggattccaaggcctccccggcatctctggccaacatggcgatcaa
ggtgctcccggctcaactggccctgcaggccccaggggccctgctggtccttctggtcct
cccggtaaagatggtcgccctggacatgcaggtgccgttgggcccgctggtctgcgtggt
tctcagggcagtcaaggccccgcgggtcctcccggcccccccggcctgcccggtcccccc
ggcccaagtggcggcggctacgacttcggttatgagggcgatttctaccgggctgaccag
cctcgtactcaggcctccctcagacccaaggactacgaagtggacgctactctgaaatcc
ctcaacaaccagatcgagaccatcctcagccccgaaggctccaagaagaaccctgcccgc
acctgccgagacctgaggctcagccacccagactggaacagcggtttctactgggtcgac
cccaaccagggctgcaccatggacgccatcaaagtgtactgtgacttctccactggggag
acctgcatccaagcccagcctgagaacatccccgccaaggcctggcacaatcccaggagc
ccccagggcaagaagcatgtgtggtttggggagacgctcaatggtggcacccagtttgaa
tacaacatggaaggggtcacctccaaggacatggccacccagctggccttcatgcgcctg
ctggccaaccacgcctcccagaatatcacctaccactgcaagaacagcgtggcttacatg
gacgccgagagcggctctctgaagaaggccgtcatcctgcagggctccaacgatgtggag
ctcaccgctgagggcaacagccgcttcacctacaccgtcctggaggacggctgctccaga
aagaacaatgagtggggaaagaccgtcatcgaatacaagaccaacaagccttcccgcttg
cccttcctggacatcgcgcctttggacattgggggcacagaccaagaactccgtttgcac
attggcccagtctgtttcaaataa
DBGET
integrated database retrieval system