KEGG   Sarcophilus harrisii (Tasmanian devil): 100924667
Entry
100924667         CDS       T02286                                 
Symbol
COL1A1
Name
(RefSeq) collagen alpha-1(I) chain isoform X1
KO
K06236  collagen type I alpha
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr04151  PI3K-Akt signaling pathway
shr04510  Focal adhesion
shr04512  ECM-receptor interaction
shr04611  Platelet activation
shr04820  Cytoskeleton in muscle cells
shr04926  Relaxin signaling pathway
shr04933  AGE-RAGE signaling pathway in diabetic complications
shr04974  Protein digestion and absorption
shr05146  Amoebiasis
shr05165  Human papillomavirus infection
shr05205  Proteoglycans in cancer
shr05415  Diabetic cardiomyopathy
Brite
KEGG Orthology (KO) [BR:shr00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    100924667 (COL1A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    100924667 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    100924667 (COL1A1)
  09142 Cell motility
   04820 Cytoskeleton in muscle cells
    100924667 (COL1A1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    100924667 (COL1A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    100924667 (COL1A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    100924667 (COL1A1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    100924667 (COL1A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    100924667 (COL1A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    100924667 (COL1A1)
  09166 Cardiovascular disease
   05415 Diabetic cardiomyopathy
    100924667 (COL1A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    100924667 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:shr00536]
    100924667 (COL1A1)
Glycosaminoglycan binding proteins [BR:shr00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   100924667 (COL1A1)
Motif
Pfam: Collagen COLFI VWC VWC2L_2nd
Other DBs
NCBI-GeneID: 100924667
NCBI-ProteinID: XP_003768423
Ensembl: ENSSHAG00000013444
UniProt: G3WK23
Position
4:329722710..329748259
AA seq 1454 aa
MFSFVDPRLLLLLAVTAVLTHGQDEEDIPEGTCVQNGLKYHNGAVWKPETCQICVCDNGS
ILCDEIICEDVSNCPNVEYKDDECCPTCLGADAVSPSSPGETGVEGPKGDTGPRGERGLP
GPPGRDGIPGQPGLPGPPGPPGPPGLGGNFAPQMSYGYDEKSGGGMSVPGPMGPSGPRGL
PGPPGSPGPQGFQGPPGEPGEPGASGPMGPRGPAGPPGKNGDDGEAGKPGRPGERGPPGP
QGARGLPGTAGLPGMKGHRGFSGLDGAKGDSGPAGPKGEPGSPGENGAPGQMGPRGLPGE
RGRPGPPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGSRGSEGPQGV
RGEPGPPGPAGSPGPSGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPSGAPGP
KGNSGEPGTPGNKGDPGAKGEPGPVGVQGPPGPAGEEGKRGSRGEPGPAGLPGPAGERGG
PGSRGFPGADGVAGPKGAPGERGAPGPAGPKGSPGESGRPGEAGLPGAKGLTGSPGSPGP
DGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGP
AGKDGEAGAQGAPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDAGA
PGPSGARGERGFPGERGVQGPPGPQGPRGANGAPGNDGAKGDAGAPGAPGGQGPPGLQGM
PGERGAAGLPGAKGDRGDAGPKGADGAPGKDGVRGLTGPIGPPGPAGPSGDKGESGPSGP
AGPTGARGAPGERGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPTGA
PGPAGNVGAPGPKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGP
RGETGPIGRPGEVGPPGPPGPSGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGE
RGFPGLPGPSGEPGKQGPSGVSGERGPPGPAGPPGLAGPPGESGREGSPGAEGSPGRDGA
PGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKAGDRGETGPSGPAGPAGPTGARGPAGP
QGPRGDKGETGEQGDRGMKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGA
AGKDGLNGLPGPIGPPGPRGRTGDAGPAGPPGPPGPPGPPGPPSGGFDFSFLPQPPQEKA
HDSGRYYRADDANVRDRDLEVDTTLKSLTQQIENIRSPEGTRKNPARTCRDLRMCHSDWK
SGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYINKNPKEKKHVWFGESM
TDGFQFEYGGEGSDPADVAIQMTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKALLL
QGSNEIEIRAEGNSRFTYGVIEDGCTSHTGNWGKTVIEYKTTKTSRLPIIDVAPMDVGAP
NQEFGFDISPVCFL
NT seq 4365 nt
atgttcagctttgtggatccccggcttctgctcctcttagcagttactgccgtcctcacc
catggccaagacgaagaagacattccagaaggcacctgtgtacagaatggcttaaagtac
cataacggagcagtgtggaaacccgagacctgtcaaatctgcgtctgcgacaatgggagc
atcctgtgtgatgaaataatctgtgaagacgtttccaactgccccaacgtggagtacaag
gacgatgagtgctgccccacctgccttggtgccgatgctgtttcaccatcctctcccgga
gaaacaggagtcgagggtcccaaaggagatactgggccccgaggtgaaaggggactccca
ggcccacctggaagagatggcatcccaggacagcctggtctccccggcccccccggcccc
cctggtccccctggccttggaggaaactttgctccccaaatgtcttatggctatgatgag
aaatcaggtggcggcatgtctgtgcctggacccatgggtccctctggtcctcggggtctc
ccaggtccccccggctcacctggtccccaaggtttccaaggcccccctggtgaacccggc
gagcctggagcttcaggtcccatgggtccccgaggtcctgcaggtccccctggcaagaat
ggagatgatggtgaagctggcaaacctggtcgccccggtgaacgtggtcctcctggtcct
cagggtgctcgaggtctgcccggaactgctggtctccctggcatgaagggacaccggggt
ttcagtggtttggatggtgccaagggagacagtggtcctgctggtcctaagggtgagcct
ggcagccctggtgaaaatggagctcctggacaaatgggccctcgtggtctgcctggtgag
agaggccgccctggaccccctggcccagctggtgctcgtggtaacgatggtgcaacaggt
gctgctggacccccaggtcctactggtcctgctggtccccctggtttccctggtgctgtt
ggtgctaagggtgaagctggtcctcaaggatcccgtggctctgaaggtccccaaggtgtc
cgtggagagcccggtccccctggccctgctggttcccccggcccctctggtaaccctggt
gctgatggacaacctggtgccaaaggtgccaatggtgctcctggaattgctggtgctcct
ggctttcccggtgcccgtggtccttctggacctcagggtcccagtggtgctcccggtccc
aagggtaacagtggtgaacctggtacccctggtaacaaaggtgaccccggtgccaaagga
gaacccggccccgttggtgttcaaggacccccaggccctgcaggtgaggaaggcaagaga
ggatcccgtggtgaacctggacctgctggtctgcctggcccagctggcgaacgaggcggt
cccggaagccgtggtttccctggtgctgatggtgttgctggtcctaagggtgctcctggt
gaacgtggtgctcctggccctgctggtcccaaaggatctcctggtgaatctggtcgtcct
ggggaggctggtctacctggtgccaagggtctgactggaagccctggaagccctggtcct
gatggcaaaactggtcctcccggccctgctggccaagatggtcgtcccggacctccaggc
ccccctggtgcccgtggtcaagctggtgtaatgggattccctggacccaagggtgctgct
ggagaacctggcaaggctggagagagaggtgttcctggaccccctggtgctgtgggcccc
gctggaaaagatggagaagctggtgctcagggtgcccctgggcctgctggtcccgctggt
gagagaggtgaacagggtcctgctggctctcctggattccagggtctccctggccccgct
ggtcctcctggtgaagctggcaaacctggtgaacagggtgttcctggagatgctggtgcc
cccggcccctctggtgcaagaggtgagagaggtttccccggagaacgcggtgtccaaggt
cctcctggccctcagggtccccgtggtgccaatggtgctcctggaaatgatggtgctaag
ggtgatgctggtgctcctggtgctcccggtggccaagggcctcctggtctgcagggaatg
cccggtgaacgtggtgcagctggtcttcccggtgccaagggtgacagaggtgatgctggt
cccaaaggtgctgatggtgctcctggcaaagatggtgtccgtggtctgactggtcctatt
ggtccccctggccctgctggtccttctggtgacaagggtgaatctggtcccagtggccct
gctggtcccaccggagctcgtggtgcccctggagaacgtggtgagcctggcccccctggt
cctgctggctttgctggtcctcctggtgctgatggccaacctggtgctaaaggtgaacct
ggtgatgctggtgctaaaggtgatgctggtccccctggccctgctggacctactggtgct
cctggccctgctggtaatgttggtgctcccggacccaaaggtgcccgtggcagtgctggc
ccccctggtgctactggtttccctggtgctgctggaagagttggtccccctggcccctct
ggtaatgctggaccccctggacctcctggccctgctggcaaagaaggcggcaaaggcccc
cgtggtgagactggccctattggacgtcctggtgaagtcggacctcctggtccccctggc
ccaagtggagagaagggatctcctggtgctgatggtcctgctggtgctcctggtactcct
ggacctcaaggtattgctggacagcgtggtgtggttggtctgcctggacagagaggagaa
agaggtttccctggtcttcctggtccctctggcgaacctggaaaacaaggtccttctgga
gtaagtggagaacgtggcccccctggccctgctggaccccctggattagctggaccacct
ggtgaatctggacgtgagggctctcctggtgctgaaggttccccaggtcgtgatggagct
ccaggccctaagggtgatcgtggtgaaactggtcctgccggcccccctggtgctcctggt
gcccctggcgctcctggccctgtgggtcctgctggaaaggctggagatcgtggtgagact
ggtccctctggtcctgctggtcccgctggtcccactggtgcccgtggtcctgctggacct
caaggcccccgaggtgacaagggtgagactggcgaacagggtgacagaggcatgaaaggt
cacagaggtttctctggtctccagggtccccctggtcctcctggttctcctggtgaacaa
ggtccttctggagcttctggtcctgctggtccccgaggtccccctggctctgctggtgct
gctggaaaagatggactcaatggtctccctggccccattggcccccctggtccccgtggt
cgtactggtgatgctggtcctgctggtccccctggacctcctggcccccctggtcctcca
ggcccccccagcggtggcttcgacttcagcttcctgccccagccaccccaggagaaggcc
cacgactctggccgctattaccgagctgatgacgccaatgtacgtgaccgtgaccttgag
gtagacaccaccctcaagagcctgactcagcagattgagaacatccgcagtcccgagggt
acccgcaagaaccctgcccgcacctgcagagacctcaggatgtgccattcagactggaag
agcggagaatactggattgatcccaaccaaggctgcaacctggatgccatcaaggtcttc
tgcaacatggaaactggagagacctgtgtgtaccccactcaacccagtgtggcccagaag
aactggtacattaacaagaaccccaaggagaagaagcacgtctggttcggcgagagcatg
accgatgggttccagttcgagtatggtggagagggctccgaccctgctgatgtggctatc
cagatgaccttccttcgcctgatgtccactgaggcctcccagaacatcacctaccactgc
aagaacagcgtggcttatatggaccagcagactggcaacctcaagaaggccctgcttctc
cagggctccaatgaaatcgagatccgggctgaaggcaacagccgattcacctacggagtc
attgaggatggctgcacgagtcacactggaaactggggcaagacagtcatcgaatacaag
accaccaaaacctcccgcctgcccatcatcgatgtggcccccatggacgttggcgctccc
aaccaggaatttggcttcgacattagccccgtctgcttcctgtaa
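
The AA seq and NT seq lengths stated in this entry can be cross-checked against each other. The following is a minimal sketch, assuming the NT seq is a complete coding sequence that includes its terminal stop codon (the record ends in "taa"); the function name `cds_lengths_consistent` is hypothetical and not part of any KEGG tool.

```python
def cds_lengths_consistent(aa_len: int, nt_len: int, has_stop_codon: bool = True) -> bool:
    """Check that a CDS length in nucleotides matches a protein length in residues.

    Each residue corresponds to one 3-nt codon; a complete CDS usually also
    carries one stop codon that is not translated (assumption, see above).
    """
    codons = aa_len + (1 if has_stop_codon else 0)
    return nt_len == 3 * codons


# Lengths as stated in the AA seq and NT seq headers of this entry.
AA_LEN = 1454
NT_LEN = 4365

print(cds_lengths_consistent(AA_LEN, NT_LEN))
```

With the values from this entry, 3 x (1454 + 1) = 4365, so the two stated lengths agree under that assumption.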