KEGG   Physeter catodon (sperm whale): 102974155
Entry
102974155         CDS       T06011                                 

Gene name
LAMA5
Definition
(RefSeq) laminin subunit alpha-5
  KO
K06240  laminin, alpha 3/5
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102974155 (LAMA5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102974155 (LAMA5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102974155 (LAMA5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102974155 (LAMA5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102974155 (LAMA5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102974155 (LAMA5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102974155 (LAMA5)
   05145 Toxoplasmosis
    102974155 (LAMA5)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_I Laminin_G_2 Laminin_G_1 Laminin_II Laminin_B Laminin_G_3 DUF948
Other DBs
NCBI-GeneID: 102974155
NCBI-ProteinID: XP_023984133
UniProt: A0A2Y9T1A3
LinkDB
Position
14
AA seq 3688 aa
MAKPGARPRGSRGPAPLLLAGLALLGAVRARAAAGDGFSLHPPYFNLAEGARIAASATCG
EEAPARGAPRPTEDLYCKLVGGPVAGGDPNQTIQGQYCDICTAANSNRAHPVSNAIDGTE
RWWQSPPLSRGLQYNEVNVTLDLGQVFHVAYVLIKFANSPRPDLWVLERSTDFGHTYQPW
QYFASSKRDCLERFGPRTLERILGDDDVICSTEYSRIVPLENGEIVVSLVNGRPGAMNFS
YSPLLRDFTKATNIRLRFLRTNTLLGHLMGKALRDPTVTRRYYYSIKDISIGGRCVCHGH
ADVCDAKDPADPFRLQCACQHNTCGSSCDRCCPGFNQQPWKPATTDSANECQSCNCHGHA
HDCYYDPEVDRSNASQNQDRVYRGGGVCVDCQHHTTGINCEQCLPGFYRSPDHPLDSPHA
CRRCDCESDFTDGTCEDLTGRCYCRPNFTGARCDACAEGFSGFPRCHPVPSFSHNDTGEQ
VLPAGQIVNCDCSAAGTQGNACRKDPRVGRCVCKPSFQGTHCELCAPGFYGPGCQPCQCS
SPGVVDGTCDRDSGQCTCRAGFEGAACDRCAPGYFHFPLCQLCGCSPTGTLPEGCDDAGR
CPCRPEFDGPRCDRCRPGHHSYPECHACACDPRGALDQLCGAGGVCRCHPNYTGATCQEC
SPGFHGFPDCAPCHCSPEGSLHTACDPHSGQCSCRPRVTGLRCDACAPGAYNFPYCEAGS
CHPAGLAPADHDLPEVHIPCRCRAHVEGPSCDRCKPGFWGLSPSNPEGCTRCSCDPRGTL
GGAAECQPGNGQCFCKPHVCGHTCTACKDGFFGLDQADYFGCRGCRCDVGGALGQGCEPR
TGACLCRPSTQGPTCSKPAQDHYLPDLHQLRLELEEAATPEGHAVRFGFNPLEFESFSWR
GYAQMAPIQPRIVARLSVASPDLFRLVFRYVNRGPASVSGRVSVRDQGKFATCANCTEQS
QPVAFPPSTEPAFVTVPQRGFGEPFVLNPGAWALLVEAEGVLLDYVALLPSAYYEAALLQ
LRVTEPCTFRPDARRSGDHCLLYTHLPLDGFPSAAGPEALCRHGNSLPRPCPAEQLSPAH
PLLVACQGADVDVQLQVAVTRPGPYALVVEYASEDARQEVDVAVHTPQRAPQQGALTLHP
CPYSTLCRGAALDAQRHLAVLHLDTEASVRLTAEQARFFLHSVTLVPAETFSSEFVEPRV
RCISSHGAFSPSSAACLPSRFPKPPQPVVLRDCQVLPLPPGLPLTHSRALTPGAPPPGPR
PRPPTAADPDVEPTLLRHPQGTVVFSTHVPALGRYAFLLHGYQPAHPAFAVEVLISGGRV
WQGHANASFCPHAYGCRTLVVCEGRAVLDVTDSDLTVTVRVPEGRWLWLEYVLVVPEDAY
SPSYFREEPLDKSYDFISQCATHGYHVSPSSSSSFCRSAATSLSLFYNNGARPCGCHEVG
AAGPTCEPFGGQCPCRAHVIGRDCSRCATGYWGFPSCRPCDCSGRLCDELTGQCICPPRT
VPPDCIVCQPQTFGCHPLVGCEECNCSGPGVQALTGPTCDADSGQCKCRPNVAGRRCDTC
APGFHGFPSCRPCDCHEAGSAPGTCDPLTGQCSCKENVQGPRCDQCRLGTFSLDAANPKG
CTRCFCFGATDRCRSSAYARREFVDMEGWTLLSGDRQVVPHERQAEATVLRADLRRGPEA
FPELYWQAPPSYLGDRVSSYGGTLHYELHSETQRGDVFVPTESRPDVVLQGNQMSITFLE
PVYPAPGRVHRGELQLVEGNFRHVETRSAVSREELVMVLAGLEQLHIRALFSQTSSAISL
RRVALEVASEVGGGPPASNVELCMCPASYRGDSCQECAPGYYRDVKGLFLGRCIPCQCHG
HSDRCLPGSGVCVGCQHNTEGDRCERCQAGFVRGGSEDPAAPCISCPCPLAVPSNNFATG
CVLRGGRTQCLCKPGYAGASCERCAPGFFGNPLVLGSSCQPCDCSGNGDPNMLFSDCDPL
TGACRGCLRHTTGPRCESCAPGFYGDALLPGNCTRCDCSPCGTEACDPHSGHCLCKAGVT
GQRCDHCQEGHFGFEGCRGCRPCACGPAAEGSECHPQSGQCHCRPGTGGRQCRECAPGHW
GLPEQGCRRCQCQGGHCDVHTGRCTCPPGLSGERCDTCSHQHQVPMPGGPGGHGVHCEVC
DHCVVLLLDGLERAGALLPAVREQLRGVNASSVIWARLHGLNASIANLQSQLRKPPGPRH
ETAQQLEALERQTSSLGQDVQQLDGQATGARAQASQLLDGTEATLGRAQTLLAAVRAVDR
ALSELESQTDRLSPANASALSGEQLRRTLAEVERLLREMRARDLGVPRAAAEAELGEAQR
LLARVQEQLTRRWERNQALVARTRDQLAQHEAGLMDLREALNRAVGTTREAEELNSRNLE
RLEEALQRKQELLRDNATLGAALQAARDTLVRLSELLRGMDQAKEEYEHLAASLDGARTP
LLEKMRAFSPASGKVNLVEAAEAHAWQLDQLALNLSSIIQGVNQDRFIQRAVEAANAYSS
ILQAVRAAEGAAGQAQQQASRTWAVVVRRGLAPRARELLADSSALEEAVLGEQRRLGLAW
ATLQGTGTQLRDARARKEQLAARVREVQAMLAMDTDETSKKIAHAKAVVTEAQDTAARVQ
SRLRDMQRNVERWQGRYKGLQSQGLGRVVLDAGRSVTTLEKTLPQLLAKLSLLQNRGTHN
ASLALSASIGRVRELIAQARSAANKVKVPMKFSGSSGVQLRTPRDLANLVSYTALKFYLQ
SPEPAPGQLAGDQFVLYMGSRQATGDYMGVALRGQKVHWVYRLGGAGPTALSIDEDIGEQ
FATVSIDRTLQFGHMSVTVENQMVQETKGDTVAPGAEGLLGVQPDDFVFYVGGYPDNFTP
PEPLRFPGYRGCIELDTLNEEVVSLYNFEKTFQLDTAVDKPCARSKSTGDPWLTDGSYLD
GSGFARISVESQLSNTKRFDQELRLVSYSGILFFLRHQDQFLCLAVREGSLVFLYDFGAG
LKEADPLQRPPPLTTTSKAIQVFLLGGSRKRVLVRVERTTVFSVEQDSTLELADAYYLGG
VPPSQLPPSLRRLFPSGGSVRGCIKGIKALGKYVDLKRLNTTGISSGCTADLLVGRAMTF
HGHGYLRLALPSNAVPLTGNVYSGFGFRSSQDSALLFHRESPTGPCEVSLQRGHVTLRLA
RTEVKTRRGFADGAPHYVTFYSNATGVWLYVDDQLQEMKPHRGPWPQPQPEGSSQLFLGG
LPKSDALRNFSGCISNVFVLRLLGPQRVLDLQQNLEGINVSSGCDPAPRTQAPEQSPQGL
RATVSRKASRRSRQPAQDASCSPPWPLRAIRDAYQFGGPLSSHLEFAHVPPPTSNRSRLD
MLIRPHAPQGLLLLAAPLTASSPSLALFLSHGRFVAQTEGPGPQLRVQSRQRSRTGRWHT
VSVRWEKTRIQLVMDGVWAQSREGPGRRHQGEEGPRPHTLFVGGLPAGVHSPQLPVAISR
SGFSGCVKRLRLDGQLLGAPTRVVGATPCFSGPLEKGLFFAGSGGAITLDTLGATLPDLR
LELEVRPQTATGLVFHLGRGPAPPYLQLRVLGRQVLLRADEGSGAFSTLVTLPEALCDGQ
WHRLAVTKGGNRLRLEVDLQSNHTLGPAPAASADGPVSLYLGGLPEPWTRPPAYRGCMRN
LVVNQALVSWPRAAGVQGAVGAGGCPAT
NT seq 11067 nt   +upstreamnt  +downstreamnt
atggcgaagccgggcgcgcggccccgcggttcccggggccccgcgccgctgctgctggcc
gggctggcgctgctgggcgcggtgcgggcgcgggcggcagcgggtgacggcttcagcctg
cacccgccctacttcaacctggcggagggcgcccgcatcgccgcctccgccacctgcggc
gaggaggccccggcgcgcggcgccccgcgccccaccgaggacctctactgcaagttggtg
gggggccccgtggccggcggggaccccaaccagaccatccagggccagtactgcgacatc
tgcacggctgcaaacagcaacagggcacaccctgtgagcaacgccatcgacggcacggaa
cgctggtggcagagcccaccactgtcgcgtggcctgcagtacaatgaggtcaacgtcacc
ctggacctgggccaggttttccacgtggcctacgtgctcatcaagtttgccaactcccct
cggccagacctttgggtgctagaacggtccacggacttcggccacacctaccagccatgg
cagtatttcgcctcctccaagagggactgcctggagcggttcgggccgcggacactggag
cgcatcttgggggacgacgacgtcatctgcagcaccgagtattcgcggatcgtgcccctg
gagaatggcgagatcgtggtgtccctggtgaacgggcgcccaggggccatgaacttctcc
tactcgccgctgctgcgtgatttcaccaaagccaccaacatccgcctgcgcttcctgcgc
accaacacgctgctgggccacctcatgggcaaggcgctgcgggaccccacggtcacccgc
aggtactattacagcatcaaggacatcagcatcggcggccgctgcgtctgccacgggcac
gcggatgtctgtgatgccaaagaccccgcagaccccttcaggctgcagtgcgcctgccag
cacaacacgtgtgggagctcctgtgaccgctgctgccccggcttcaaccagcagccgtgg
aagccggcaaccacggacagcgccaacgagtgccagtcctgcaactgccacggccacgcc
cacgactgctactacgaccccgaggtggaccggagcaacgccagccagaaccaggaccgc
gtctaccggggcgggggcgtgtgcgtcgactgccagcatcacaccaccggcatcaactgt
gagcagtgcctgcccggcttctaccgctccccagaccacccgctcgactcgccccacgcc
tgccgccgctgtgactgcgagtcggacttcacggacgggacgtgtgaggacctgacgggc
cgctgctactgccggccgaacttcacgggggcgcgatgcgatgcgtgtgccgaaggcttc
agcggcttcccgcgctgccacccggtgccctccttctctcacaatgacacgggagagcag
gtgctgccggccggacagattgtgaattgtgactgcagcgccgccgggacccagggcaac
gcctgccggaaggacccacgggtggggcgctgcgtgtgcaaacccagcttccagggcacg
cactgtgagctctgtgccccgggcttctatggcccaggctgccagccctgccagtgctcc
agccccggcgtggtggacggaacctgtgaccgcgactcgggccagtgcacgtgccgggcg
ggattcgagggggcggcgtgcgaccgctgtgcccctggctacttccacttccccctctgc
cagctgtgcggctgcagccccacggggaccctgcccgagggctgcgatgacgctggccgc
tgcccgtgccgacccgagtttgacggcccccgctgtgaccgctgccgcccgggccaccac
agctaccccgagtgccacgcctgcgcctgcgacccccggggtgccctggaccagctgtgt
ggggcgggcggggtgtgccgctgccaccccaactacacgggtgccacctgccaggagtgc
agcccaggcttccacggcttccccgactgtgccccctgccactgctcccccgaaggctcc
ctgcacacagcctgcgacccccacagcgggcagtgcagctgccgaccccgagtgacgggg
ctgcgctgcgacgcgtgtgcgcccggggcctacaacttcccctattgcgaggctggctcc
tgccaccctgctggcctggccccagccgatcatgatcttcctgaggtgcacatcccctgt
aggtgccgggctcacgtagaggggccgagctgtgatcgctgtaaacccgggttctggggg
ctgagtcctagcaaccctgagggctgcacccgctgcagctgtgaccccaggggcacactg
ggtggagctgctgagtgccagccgggcaacggccagtgcttctgcaagccccatgtgtgc
ggccacacctgcacggcctgcaaggacggcttcttcgggctggaccaggccgactacttc
ggctgccgtggctgccggtgcgacgttggcggagctctgggacagggctgtgagccgagg
acaggcgcctgcctgtgccgccccagcacccagggccccacctgcagcaagccggcgcag
gaccactaccttcccgacctgcaccagctgcgcctggagctggaggaggcggccacgccc
gagggccacgccgtgcgcttcggcttcaaccctctcgagttcgagagcttcagctggagg
ggctacgcgcagatggcacccatccagcccaggatcgtggcaaggctgagcgtggcctcc
cctgacctcttccgactggtcttccgatacgtcaaccgcgggcctgcgagcgtgagcggg
cgggtctccgtgcgcgaccagggcaagtttgccacctgcgccaactgcacagagcagagc
cagcccgtggccttcccgcccagcacggagcctgccttcgtcaccgtgccccagaggggc
ttcggggagcccttcgtgctgaaccctggcgcctgggccctgctcgtggaggccgagggg
gtgctcctggactacgtggctctgctgcccagcgcctactacgaggcggccctcctgcag
ctgcgggtgacggagccctgcacattccggcccgacgcgcggcgctccggggaccactgc
ctcctctacacccacctgccgctggacggcttcccctccgctgctgggcctgaggccctg
tgtcgccacggcaacagcttaccccggccctgccccgcggagcagctcagccccgcccac
ccgctgctggtcgcctgccagggcgcagatgtggacgtccagcttcaggtggcagtgacg
cggccgggcccctacgccctggtggtggagtacgccagtgaggacgcccgccaggaggtg
gacgtggccgtgcacaccccccagcgggccccccagcagggggcgctcaccctccacccc
tgcccatacagcaccctgtgccggggcgccgccctggatgcccagcgccatctggcagtc
ttgcacctggacacggaggccagcgtccggctcacggctgagcaggcacgcttcttcctg
cacagtgtcaccctggtgcccgcggagaccttcagctcagagttcgtggagccccgcgtc
cgctgcatcagcagtcatggcgccttcagccccagcagcgccgcctgcctgccctcccgc
ttcccaaagccgccgcagcctgtggtgctcagggactgccaggtgctgccgctgcccccc
ggcctcccgctgacccactctcgggcgctcacgccgggcgcgcccccgcccgggccccgg
ccccggccccccaccgccgcggaccccgacgtggagcctacgctgctgcgccacccccag
ggcaccgtggtcttcagcacccacgtgcccgccctgggccgctacgccttcctgctgcac
ggctaccagccggcccaccctgccttcgctgtggaggtcctcatcagcgggggtcgtgtc
tggcagggccacgccaacgccagcttctgcccgcacgcctacggctgccgcaccctggtg
gtgtgtgagggccgggccgtcctggacgtgaccgacagtgacctcactgtgaccgtgcgc
gtgcccgagggccgctggctctggctggagtacgtgctggtggtccccgaagacgcttac
agccccagctacttccgggaggagcccctggacaaatcctacgacttcatcagccagtgt
gccacccacggttaccacgtcagcccctccagctcgtcgtccttctgccgcagcgccgcc
acctccctctccctcttctataacaacggggctcggccgtgcggctgccatgaggtgggc
gccgcgggccccacgtgtgagcccttcgggggccagtgtccctgccgcgcccacgtcata
ggccgcgactgctcccgctgtgccactggctactggggcttccccagctgcaggccctgt
gactgcagcggccgcctctgtgacgagctcacgggccagtgcatctgcccgccgcgcacc
gtcccgcccgactgcatcgtctgccagccccagacctttggctgccaccccctggtcggt
tgtgaagaatgtaactgctcggggcccggcgtgcaggcgctcacgggccccacctgtgac
gcggacagtggccagtgcaagtgcagacccaacgtggccggacgccgctgtgacacctgc
gctcccggcttccacggcttccccagctgccgcccctgcgactgccacgaagcaggctct
gcacctggaacgtgcgaccccctcacaggccagtgctcttgcaaggagaacgtgcagggc
ccgcggtgtgaccagtgccgcctggggacattctccctggacgctgccaaccccaaaggc
tgcacccgctgcttctgcttcggggccacggatcgctgccggagctcggcctacgcccgc
cgggagttcgtggacatggagggctggacgctgctgagtggtgaccggcaggtggtgccc
cacgagcggcaggcagaggcaacggtgcttcgcgcggacctgcggcgcgggcctgaggcc
ttccccgagctgtactggcaggccccgccctcctacctgggggaccgggtgtcgtcctac
ggcgggaccctccactacgaactgcactcagagacccagcggggagacgtgttcgtcccc
acggagagcaggccagacgtggtgctgcagggcaaccagatgagtatcacgttcctggag
ccagtgtacccggcaccgggccgtgttcaccgcggggagctgcagctggtggagggcaat
ttccggcacgtggagacccgcagcgccgtgtcccgcgaggagctcgtgatggtgctggcc
ggcctggagcagctgcacatccgcgccctcttctcacagacctcctcggccatctccctg
cgcagggtggcgctggaggtggccagcgaggtgggcgggggacctccggccagcaacgtg
gagctgtgcatgtgccccgccagctaccgcggggactcatgccaggaatgtgcccctggc
tactaccgggacgtcaaaggtctctttttgggtcgctgcatcccctgtcagtgccatggc
cactcagaccgctgcctccctggctcgggcgtctgcgtgggctgccagcacaacaccgag
ggtgaccgctgcgagcgctgccaggccggcttcgtgcgcggtggatccgaggaccctgcg
gccccctgcatcagctgcccttgccccctggctgtgccttccaacaacttcgccacgggc
tgcgtcctgcgaggtggccgcacccagtgtctctgcaaacccggctatgctggggcctcc
tgcgagcggtgcgcacctggcttctttgggaaccccctggtgctgggcagctcgtgccag
ccctgcgactgcagcggcaacggcgaccccaacatgctcttcagcgactgcgaccccctg
acgggcgcctgccgcggctgcctgcgccacaccactgggccccgctgcgagagctgcgcc
cccggcttctacggcgacgcgctgctgcccggcaactgcacccggtgtgactgctccccg
tgtgggacggaggcctgtgacccccacagcgggcattgcctgtgcaaagcgggtgtgacc
ggtcagcgctgcgaccactgtcaggaaggacatttcggctttgagggctgccggggctgc
cgcccatgcgcctgtggaccagcggccgagggctccgagtgccacccccagagtgggcag
tgccactgccggccagggactgggggtcgccagtgccgcgagtgtgcccccggccactgg
ggactccctgagcagggctgcaggcgctgccagtgccagggcggccactgtgacgtgcac
acgggccgctgcacgtgcccccccgggctcagtggggagcgctgtgacacctgcagccac
cagcaccaggtgcccatgccaggagggcctgggggccacggcgtgcactgcgaagtgtgt
gaccactgtgtggtcctgctcctggacggcctggagcgggccggcgctctcctccccgcc
gtccgggagcagctgcgtggcgtcaatgccagctccgtgatctgggcccggctgcacggg
ctgaacgcctccatcgccaacctgcagagccagctccggaaaccccctggcccccgtcat
gagaccgcacagcagctggaagcactggaacgacagacctcaagcctcgggcaggacgtg
cagcagctggacggccaggccacaggggcccgagcccaggccagccagcttctggacggc
accgaggccacgctgggccgggcacagacgctgctggcagccgtccgggctgtggaccgc
gccctgagcgagctcgagtcccagacggaccgcctgtcaccagccaacgcctcagcccta
tcgggtgaacagctgcgccggacactggccgaggtggaacggctgctccgggagatgcgg
gcccgtgacctgggtgttccgcgagcagccgctgaggccgagctgggggaggcccagaga
ctgctggcccgtgtgcaggagcagctgaccaggcgctgggagaggaaccaggcgctggta
gcacgcacccgagaccagctggcccagcatgaggccggactcatggacctgcgggaagcc
ctgaacagggcggtgggcactacgcgggaggcggaggagctcaacagccgcaacctggag
cggctggaagaagccctacaacggaaacaggagctgttgagggacaacgccactctgggg
gccgctctacaggccgccagagacaccctggtccggctctctgagcttctgcgtggcatg
gaccaggccaaggaggagtatgaacaccttgctgccagcctggatggggcccggacgcca
ctgctcgagaagatgcgggccttctccccggctagcggcaaggtgaacctggtggaggcc
gccgaggcccacgcgtggcagctggaccagctggcgctcaacctttccagcatcatccag
ggagtcaaccaggaccgcttcatccagcgggccgtcgaggccgccaacgcctacagcagc
attctgcaagccgtgcgggccgccgagggtgccgccgggcaggcacagcagcaggcgagc
cgtacatgggcggtggttgtgcggcggggcctggcgccccgagcccgggagctgctggcg
gacagcagtgccctggaggaggccgtcctcggggagcagcggaggctgggcctcgcgtgg
gccaccctccagggcaccgggacccagctccgcgatgcccgggccaggaaggaacagctg
gcggcccgagtccgggaggtgcaggccatgctggctatggacacagacgagacaagcaag
aagatagcccacgccaaggccgtggtcaccgaagcccaggacacggctgcccgcgtgcag
tcccggcttcgagacatgcagagaaacgtggagcggtggcagggccgctacaaggggctg
cagagccagggcctgggccgggtggtgctcgacgcaggccgctcagtgaccaccctggag
aagacgctgccgcagctgctggccaagctgagcctcctgcagaaccgtgggacgcacaac
gccagcctggccctgtcggccagcatcggccgcgtgcgggagctcatcgcccaggcccgg
agtgctgccaataaggtcaaggtgcccatgaagttcagcgggagctcaggggtgcagctg
cgcacccctcgggacctcgccaaccttgtctcttacaccgccctcaagttctacctgcag
agcccagagccggcgcccggccagctcgccggggaccagtttgtgctgtacatgggcagc
cgccaggccaccggcgactacatgggcgtggctctgcgtggccagaaggtgcactgggtg
taccgcctcgggggagcgggccccacggccctcagcatcgacgaggacatcggggaacag
ttcgcgaccgtcagcattgacaggaccctccagtttggccatatgtctgtcacagtggag
aatcagatggtccaggagaccaagggtgatacggtggcccctggagccgaagggctgctc
ggcgtgcagcctgatgactttgtcttctacgtgggaggctaccctgacaacttcacgccc
cccgaacccctgcgctttcccggctaccggggctgcatcgagctggacacgctgaacgaa
gaggtggtcagtctctacaacttcgagaagaccttccagctggacacagccgtggataag
ccttgtgcgcgctccaagtcgaccggggacccatggctcacggacggctcctacctggac
ggctccggcttcgcccgcatcagcgtggagagccagctgagcaacaccaaacgcttcgac
caggagctgcggcttgtgtcctacagcgggatcctcttcttcctgcggcaccaggaccag
ttcctgtgcctggccgtgcgggaaggcagcctcgttttcctctacgacttcggtgcgggc
ctgaaggaggccgacccgctgcagcgcccgccgcctctgaccacgaccagcaaggcgatc
caggtgttcttgctggggggcagccggaagcgcgtgctggtgcgcgtggagaggaccacg
gtgttcagcgtggagcaggacagcacgctggagctggccgacgcctactacctggggggc
gtgccgcccagccagctgccccctagcctgcggcggctcttcccctccggaggctcggtg
cgcggctgcatcaagggcatcaaagctctgggcaagtacgtggatctcaagaggctgaac
acgacgggcatcagctccggctgcaccgccgacctgctggtgggacgggccatgactttc
cacggccacggctatctgcgcctggcgctccccagcaacgccgtgcccctcacgggcaac
gtctactctggcttcggcttccgcagcagccaggacagcgctctgctcttccaccgagag
tccccgaccgggccatgcgaggtgtccctgcagcgaggccacgtgaccctccggcttgca
aggaccgaggtgaaaacgcgacggggctttgccgatggggccccccattatgtcactttc
tacagcaacgccacgggggtctggctctatgtggacgaccagcttcaggagatgaagccc
caccgggggccgtggccccagccccagcctgaggggtcgagtcagctcttcctggggggc
ttgcccaagtccgacgccctccgtaacttcagcggctgcatcagcaacgttttcgtgctg
cggctcctggggccgcagcgcgtgcttgacctgcagcagaacctggagggcatcaacgtg
agctcaggctgcgacccggcccctcgcacccaggccccagagcagagcccgcaaggactg
cgggccacggtgtctcggaaggcctcccgccgcagccggcagcccgcccaggacgcttcc
tgctcgccaccctggcccctcagggccatacgagacgcctaccagtttgggggtcccctg
tccagtcacctggagtttgcacacgtcccgccccccaccagtaaccggtcgcggcttgac
atgctcatccgcccgcatgccccccaggggctcctgctcctcgccgcccccctgacggcc
agcagcccttccctggccctcttcctgagccacggacgctttgtcgctcagacggaaggc
cctgggccccagctccgagtccagagccgccagcgttcacggactggccggtggcataca
gtgtctgtgcgttgggagaagactcggatccagctggtgatggatggggtctgggcccag
agccgggaggggccgggccggcggcaccagggggaggaaggcccccggccacacactctc
tttgtggggggcctccctgctggtgtccacagcccgcagctcccggtggccatcagccgc
tccgggttcagcggttgtgtgaagagactgaggttggacgggcagctcctgggggctccc
acccgggtggtgggggccacaccctgcttctccggtcccctggagaagggcctgttcttt
gcaggcagcgggggagccatcactctagacaccctgggggccacactgcctgacctgcgc
ctggagctggaggtgcggccgcagacagccaccggcctcgtcttccacttgggtcggggc
ccggcgcccccctacctgcagctgcgcgtgctggggaggcaggtcctgctgcgggcagac
gagggttcaggcgcgttctccacgctggtgacgctccccgaggcactgtgtgacgggcag
tggcaccgcctggcagtgaccaaaggcgggaacaggctccggctggaggtagacctgcag
agcaaccacaccctgggccccgcgccggccgcctcggctgacggcccagtgtcgctgtac
ctcgggggcctgcctgaaccctggacgcggcctcccgcctaccgcggctgcatgcggaac
ctggtggtgaaccaggccctggtctcctggcctcgcgctgcgggcgtgcagggggcagtg
ggggccggcggctgcccagccacgtag

KEGG   Physeter catodon (sperm whale): 102974331
Entry
102974331         CDS       T06011                                 

Gene name
COL9A1
Definition
(RefSeq) collagen alpha-1(IX) chain
  KO
K08131  collagen type IX alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102974331 (COL9A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102974331 (COL9A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102974331 (COL9A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102974331 (COL9A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102974331 (COL9A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:pcad00535]
    102974331 (COL9A1)
Proteoglycans [BR:pcad00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   102974331 (COL9A1)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 102974331
NCBI-ProteinID: XP_028350530
UniProt: A0A455BNG8
LinkDB
Position
10
AA seq 684 aa
MAWTARDRGALGLGLLLSGFCLCAAQRGPPGEQGPPGPPGPPGVPGIDGIDGDRGPKGPP
GPPGPPGEPGKPGAPGKPGTPGADGLTGPNGSPGSVGPRGQKGEPGVPGPRGFPGRGIPG
PPGPPGGAGLPGELGRVGPIGDPGRRGPPGPPGPPGPSGTIGFHDGDPLCPNSCPPGRSG
YPGLPGMRGHKGAKGEIGEPGRQGHKGEEGDQGGLGEVGAQGPPGAQGLRGITGIAGAKG
EKGARGLDGEPGPQGIPGAPGDQGRQGPLGETGPKGDRGAQGSRGIPGLPGPKGNTGLPG
VDGRDGIPGMPGTKGELGKPGPPGEAGLQGLPGIPGIPGAKGVAGEKGNTGAPGKPGQLG
NSGKPGRQGPPGEVGPRGPRGFPGSRGEGGPVGPPGLPGKLGSLGSPGLPGLPGPPGLPG
MKGDRGVVGEPGPKGEQGASGEEGELGERGELGDIGLPGPKGSAGNPGEPGLRGPEGSRG
LPGEEGPQGPPGPRGVQGERGATGLPGIQGPPGRAPTDQHIKQVCMRVIQEHFAEMAASL
KRPDSGASGLPGRPGPPGAPGPPGENGFPGQMGLRGLPGMKGPPGALGLRGAKGDMGERG
DRGPPGRGPKGLPGTSGLPGDPGPASYGRNGRDGGRGPPGVAGIPGVPGPPGPPGPPGFC
EPASCTLQAGQRAFNSKGPDQSKA
NT seq 2055 nt   +upstreamnt  +downstreamnt
atggcctggacggcgcgggaccgcggggccctcgggctgggactgctgctgtcggggttc
tgcctgtgcgcggctcaaaggggccccccgggtgagcagggtcctccggggccgccgggc
ccccctggagtcccgggcatcgacggcatcgacggtgaccgaggtccgaagggtccaccg
ggccccccgggccccccgggagagccgggaaagccgggagctccgggcaagcctggcaca
ccgggcgcagacggcttaacgggacctaacggatctcctggctctgttggaccgagagga
cagaaaggagaacctggcgtgccgggacctcgtggatttccaggccgtggtattcctgga
cctcctggtcctcctgggggagcaggactccccggagagcttggccgtgttggaccaatt
ggtgaccctgggagaagaggaccaccgggaccccctggtcccccgggacccagcggaaca
attggctttcatgatggagatccattgtgtcccaattcctgtccaccgggtcgttctgga
tatccaggcctgccaggcatgaggggtcataaaggggcgaaaggagagattggtgaacct
ggaagacaaggtcacaagggtgaagaaggagaccagggcggactaggagaggttggagct
caaggacctccgggagctcaaggtttgagaggtatcactggcatagctggggccaaaggg
gaaaaaggcgctcggggcttagatggagaacccgggccgcaaggtatccctggtgcacct
ggtgatcaaggacggcaaggacctctaggagaaacaggtcccaagggagacagaggggct
caaggttctcgaggaattcctggtctccctgggcccaagggaaacacgggcttgccaggt
gtggatggccgggacgggatacctggaatgccgggaacaaagggtgaactagggaaacct
ggacctcctggtgaagcagggttgcaggggttacctggtatacctggaattcctggcgca
aagggtgttgccggtgaaaagggtaacacaggtgctccagggaagcccggtcaactgggg
aattcaggcaaaccgggccgccaggggccaccaggagaggtgggaccccgaggacctcgg
ggcttccctggcagtagaggtgaaggaggaccagtaggacccccaggactaccaggtaaa
ctgggttctctcggcagccctggcctccctggcttgcctggcccccctggacttcctgga
atgaaaggtgacaggggtgtagttggtgaacctggtccaaagggtgaacagggtgcctct
ggtgaagagggagaactgggagagaggggtgaactcggagatataggattacctggccca
aagggatctgcaggtaaccccggggagcctggcttgaggggacccgaaggaagccggggg
cttcctggggaggaagggccgcaaggaccacctggcccgcgaggcgtgcagggagaacgg
ggcgccaccggcctgcctgggatccagggcccccccggcagagcaccaaccgatcagcac
attaagcaggtttgcatgagagtcatacaagagcattttgctgagatggctgcaagtctc
aagcgtccagactccggagcctcgggcctccccgggcggcctggcccccctggtgcccca
ggcccccctggagagaacggtttcccaggccagatgggacttcgaggcctcccaggcatg
aagggccccccaggggctcttggtttgaggggagcgaaaggtgacatgggggaaagaggt
gatcgtggccccccaggaagaggtcccaaaggtttgcccgggacttcaggtctcccaggt
gacccaggtcctgccagctacgggaggaacggccgggatggtgggcgaggccccccaggg
gtggcaggcatccctggtgtacccggccccccgggccctcctggccctcctggtttctgc
gagccagcctcctgcaccttacaggctgggcagcgggcatttaacagcaaaggtccagat
caatcaaaggcttag

KEGG   Physeter catodon (sperm whale): 102974557
Entry
102974557         CDS       T06011                                 

Gene name
COL1A2
Definition
(RefSeq) collagen alpha-2(I) chain
  KO
K06236  collagen type I alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04611  Platelet activation
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102974557 (COL1A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102974557 (COL1A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102974557 (COL1A2)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    102974557 (COL1A2)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102974557 (COL1A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    102974557 (COL1A2)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    102974557 (COL1A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102974557 (COL1A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102974557 (COL1A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102974557 (COL1A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102974557 (COL1A2)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102974557 (COL1A2)
SSDB
Motif
Pfam: Collagen COLFI
Other DBs
NCBI-GeneID: 102974557
NCBI-ProteinID: XP_007105361
UniProt: A0A2Y9EMH8
LinkDB
Position
5
AA seq 1366 aa
MLSFVDTRTLLLLAVTSCLATCQSLQEASARKGPTGDRGPRGERGPPGPPGRDGDDGIPG
PPGPPGPPGPPGLGGNFAAQYDGKGVGIGPGPMGLMGPRGPPGASGAPGPQGFQGPPGEP
GEPGQTGPAGARGPPGPPGKAGEDGHPGKSGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGTPGVKGEPGAPGENGIPGQVGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPLGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGSRGEVGLPGVSGPVGPPGNP
GVNGLPGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGDSGNK
GEPGAVGPTGPPGPSGEEGKRGTTGEIGSAGPPGPPGLRGNPGSRGLPGADGRAGVMGLH
GSRGGTGPAGVRGPSGDSGRPGEPGLMGPRGFPGSPGNAGPAGKEGPTGLPGIDGRPGPI
GPAGTRGEPGNIGFPGPKGPTGDPGKNGEKGHAGLAGPRGAPGPDGNNGAQGPPGLQGVS
GGKGEQGPAGPPGFQGLPGPAGTAGEAGKAGERGIPGEFGLPGPAGPRGERGPPGESGAA
GPAGPIGSRGPSGPAGPDGNKGEPGVVGAPGTAGPSGPSGLPGERGAAGIPGGKGEKGET
GLRGDVGSHGRDGARGAPGAVGAPGPAGANGDRGEAGPAGAAGPAGSRGSPGERGEVGPA
GPNGFAGSAGAAGQPGAKGERGTKGPKGENGPVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGATGFPGAAGRTGPPGPSGITGPPGPPGPAGKEGLRGPRGDQGPVGRTGETGASGPP
GFVGEKGPSGEPGTAGSPGTPGPQGLLGSPGFLGLPGSRGERGLPGVAGSLGEPGPLGIA
GPTGARGPPGAVGNPGVNGAPGEAGRDGNPGSDGPPGRDGQAGHKGERGYPGNAGPTGTA
GAPGPQGPVGPTGKHGNRGESGPSGPVGLAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLP
GLKGHNGLQGLPGLAGHHGDQGAPGTVGPAGPRGPAGPSGPSGKDGRTGHPGAVGPAGIR
GSQGSQGPAGPPGPPGPPGPAGPSGGGYDFGFEGDFYRADQPRSPPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPVKNWYRSSKVKKHVWVGETINGGTQFEYNVEGVTTKEMATQLAFMRLL
ANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
INEWRKTIIEYKTNKPSRLPILDIAPLDIGGADQEIRLNIGPVCFK
NT seq 4101 nt   +upstreamnt  +downstreamnt
atgctcagctttgtggatacgcggactctgttgctgcttgcagtaacttcgtgcctagca
acatgccaatctttacaagaggcaagtgcaagaaagggcccaactggagatagaggacca
cgtggagaaaggggtccaccaggcccaccaggcagagatggtgatgatggtatcccaggc
cctcctggtccacctggtcctcctggtccccctggtcttggtgggaactttgctgctcag
tatgatggaaaaggagttggaattggccctggaccaatgggtttgatgggacctagaggc
cctcctggtgcatctggagcccctggccctcaaggtttccaaggacctcctggtgagcct
ggtgagcctggtcaaactggtcctgcaggtgctcgtggtccacctggccctcctggcaag
gctggtgaggatggtcaccctggaaaatctggacgacctggtgagagaggagttgttggg
ccacagggtgctcgtggtttccctggaactcctggactccctggcttcaagggcattagg
ggtcacaatggtctggatgggttgaagggacagcctggtactccaggtgtgaagggtgaa
cctggtgcccctggtgaaaatggaattccaggtcaagtaggagctcgtgggcttcctggt
gagagaggacgtgtcggtgcccctggcccagctggtgcccgtggaagtgatggaagtgtg
ggtcctgtgggtcctgctggtcctcttgggtctgctggccctccaggcttcccaggtgct
cctggccccaagggtgaacttggacctgtcggtaaccctggtcctgctggtcccgcgggt
tcccgtggtgaagtgggtcttccaggtgtttctggccctgttggacctcctggcaaccct
ggagtcaatggccttcctggtgctaagggtgctgccggccttcccggtgttgctggggct
cctggcctccctggaccccggggtattcctggtcctgttggtgctgctggtgctactggt
gccagaggacttgttggtgagcctggtccagctggttccaaaggagacagtggcaacaag
ggtgagcccggtgctgttgggcccacaggtcctcctggtcccagtggcgaagaaggaaag
agaggcaccactggtgaaattggatccgctggccccccaggacctcctgggctgagggga
aatcctggttctcgtggtcttcctggagccgatggcagagctggtgtcatgggccttcat
ggtagtcgtggtggaactggccctgctggtgtgcgaggtcccagtggagattctggtcgc
cctggagagcctggcctcatgggaccccgaggttttcctggttcccctggaaatgctggc
ccagccggtaaagaaggtccgacgggcctccctggtattgatggcaggcctggaccaatt
ggcccagctggaacaagaggagagcctggcaacatcggattccctggacccaaaggcccc
actggtgatcctggcaaaaatggtgaaaaaggtcatgctggtcttgccggtcctcggggt
gctccaggtcctgatggaaacaatggtgctcagggacctcctggactacagggtgtctca
ggtggaaaaggtgaacagggtcccgctggtcctccaggcttccagggtctgcctggccct
gcaggtacagctggtgaagccggcaaagcaggagaaaggggtatccctggtgaatttggt
ctcccaggtcctgctggtccaagaggggagcgtggtcccccaggtgaaagtggtgctgct
ggtcctgctggtcctattggaagccgaggtccttctggacctgcaggccctgatgggaac
aagggcgaacctggtgtggttggtgctccaggcactgctggtccatctggtcctagtgga
ctcccaggagaaaggggtgctgctggcatacctggaggcaagggagaaaagggtgaaact
ggtctcagaggtgatgttggtagccacggcagagatggtgctcgtggtgctcctggtgct
gtaggtgcccctggtcctgctggagccaatggggaccggggtgaagctggtcctgctggt
gccgctggtcctgctggttctcgtggtagccctggtgaacgtggtgaggttggtcccgct
ggccccaatggatttgctggttctgctggcgctgctggtcaacctggtgctaaaggagag
agaggaaccaaaggacccaagggtgaaaatggccctgttggtcccacaggccccgttgga
gctgctggcccagctggtccaaatggtccccctggtcctgctggaagtcgtggtgatgga
ggcccccctggtgctactggtttccctggtgctgctggacggactggtcctcctggaccc
tctggtatcactggcccccctggtccccctggtcctgctggtaaagaaggacttcgtggg
cctcgtggtgaccaaggtccagttggtcgaactggagaaacaggtgcttctggccctcct
ggctttgttggtgagaagggtccctctggagagcctggtactgctggatctcctggcacc
ccaggtcctcaaggtcttcttggttctcctggttttctgggtctcccaggctctagaggt
gaacgtggtctaccaggtgttgctggatctctgggtgaacctggccctctcggcattgca
ggcccaactggggcccgtggtccccctggtgctgtgggtaatcctggcgtcaatggtgct
cctggtgaagctggtcgtgatggcaaccctgggagtgatggtcccccaggccgcgatggt
caagctggacacaagggggagcgtggttaccctggtaacgctggtcccactggtactgcg
ggtgcacctggtcctcaaggccctgtgggtcccactggcaaacatggaaaccgtggtgaa
tctggtccttctggtcccgttggtctggctggtgctgttggtccaagaggtcctagtggc
ccacaaggtattcgaggtgataagggagagcctggtgataaggggcccagaggtcttcct
ggcttaaagggacacaatggattgcagggtcttcctggtcttgctggtcatcatggcgat
caaggtgctcctggcactgtgggtcctgctggtcctaggggccctgctggtccttctggc
ccttctggcaaggatggtcgcactggacatcctggtgcagtcggacctgctggcattcgt
ggctctcagggtagccaaggtcctgctggccctcctggtcctcctggccctcctggccct
gctggcccaagtggtggtggttatgacttcggttttgaaggagacttctacagggctgac
cagcctcgctcaccaccatctctcagacccaaggattatgaagttgatgctactctgaaa
tctctcaacaaccagattgagactcttcttactccagaaggctctaggaagaacccagct
cgcacatgccgtgacttgagactcagccacccagaatggagcagtggttactactggatt
gaccctaaccaaggatgtactatggatgctatcaaagtatactgtgatttctctactggc
gagacctgcatccgggctcagcctgaaaacatcccagtcaagaactggtacagaagttcc
aaggtcaagaagcacgtctgggtaggagaaactatcaatggtggtacccagtttgaatat
aatgttgaaggagtaaccaccaaggaaatggctacccaacttgccttcatgcgcctgctg
gccaaccatgcctctcaaaacatcacctaccattgcaagaacagcattgcatacatggat
gaggagactggcaacctgaaaaaggctgtcattctgcaaggatccaatgatgttgaactt
gttgccgagggcaacagcagattcacttacactgttcttgtagatggctgctctaaaaag
ataaatgaatggagaaagacaatcattgaatataaaacaaataagccatctcgcctgccc
atccttgatattgcacctttggacatcggtggcgctgaccaagaaatcagattgaacatt
ggcccagtctgtttcaaataa

KEGG   Physeter catodon (sperm whale): 102975311
Entry
102975311         CDS       T06011                                 

Gene name
SPP1
Definition
(RefSeq) osteopontin
  KO
K06250  secreted phosphoprotein 1
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04371  Apelin signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04620  Toll-like receptor signaling pathway
pcad04929  GnRH secretion
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04371 Apelin signaling pathway
    102975311 (SPP1)
   04151 PI3K-Akt signaling pathway
    102975311 (SPP1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102975311 (SPP1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102975311 (SPP1)
 09150 Organismal Systems
  09151 Immune system
   04620 Toll-like receptor signaling pathway
    102975311 (SPP1)
  09152 Endocrine system
   04929 GnRH secretion
    102975311 (SPP1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102975311 (SPP1)
SSDB
Motif
Pfam: Osteopontin
Other DBs
NCBI-GeneID: 102975311
NCBI-ProteinID: XP_007102588
UniProt: A0A2Y9EIW7
LinkDB
Position
7
AA seq 295 aa
MRIAVICLCLLGIASALPVKQTDSGSSEEKQLYNKYPDAIATWLKPDPSQKQTFLAPQNT
VSSEETADNKQETLPRKSSESPEHTDTVDDEEDGDGQDADANDDDHSDESHDSDESDEGV
TDFPTDVPATSVFTPPVPTGDTYEGRGDSVAYGLRSKSKKFHRSEVQYPDAREEDISSHL
ESEEVGDAPKAILVAQRLHRASDWDSRGKDSQETSQPEERRMETHSREHSKEFKLKAEVE
SNERFDVTDSQDNSKVSQEFHSHEDKVVPDLKSEEDKHLKFRVSHELDDASSEVN
NT seq 888 nt   +upstreamnt  +downstreamnt
atgagaatcgcagtgatttgcctctgcctcttgggcatcgcctccgcccttccagttaaa
cagactgattctggcagctctgaggaaaagcagctttacaacaaatacccagatgctata
gccacctggctaaagcctgacccgtctcagaagcagactttcctagcaccacagaacact
gtgtcctctgaggagactgctgacaacaagcaagagaccctcccgcgtaagtccagtgaa
agccccgagcacacagacactgtggatgacgaagaggatggggacggccaggacgctgac
gcaaatgacgatgaccattccgacgagtctcacgattctgatgaatctgatgaaggggtc
actgattttcccactgacgttccagcaacctccgttttcactccaccggtcccgaccgga
gacacatatgaaggccgaggtgacagtgtggcttatggactgaggtcgaaatctaagaag
ttccacagatctgaagtccagtatccagatgccagagaggaggacatctcgtcacacctg
gagagcgaggaggtgggtgatgcacccaaggccatcctggttgcccagcgcctccacagg
gcttctgactgggacagccgtgggaaggacagtcaggagacgagtcagccggaggagcgc
aggatggaaacccacagccgcgagcattccaaagaattcaagctgaaggccgaggttgag
agcaatgagcgtttcgatgtgactgacagtcaggataattccaaagtcagccaagaattc
catagccacgaagacaaggtagtcccagaccttaagagtgaagaagacaaacacctgaaa
tttcgcgtttctcatgaattggatgatgcctcttctgaggtcaattaa

KEGG   Physeter catodon (sperm whale): 102975365
Entry
102975365         CDS       T06011                                 

Gene name
COL4A5
Definition
(RefSeq) collagen alpha-5(IV) chain isoform X2
  KO
K06237  collagen type IV alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102975365 (COL4A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102975365 (COL4A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102975365 (COL4A5)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102975365 (COL4A5)
  09154 Digestive system
   04974 Protein digestion and absorption
    102975365 (COL4A5)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102975365 (COL4A5)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102975365 (COL4A5)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102975365 (COL4A5)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102975365 (COL4A5)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102975365 (COL4A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102975365 (COL4A5)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102975365 (COL4A5)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102975365 (COL4A5)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102975365 (COL4A5)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102975365
NCBI-ProteinID: XP_023983993
UniProt: A0A2Y9T0U1
LinkDB
Position
21
AA seq 1685 aa
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGSDGIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMSSLPGPKGNPGFPGPPG
IQGPAGPTGIPGPIGPLGPPGLMGPPGPPGLPGPKGNMGLNFQGPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGVKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDIGPTGPPGLVIPRPGTGVTVGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGLAVTGPPGPPGFPGERGQKGDEGPPGISIPG
SPGLDGQPGAPGLPGPPGPPGPHIPPSDEVCEAGPPGPPGSPGDRGLQGEQGVKGDKGDT
CFNCIGTGVSGPPGEPGLPGLPGPPGSLGFPGQKGEKGHAGATGPKGLTGIPGAPGAPGF
PGPKGEPGDILTFPGMKGDKGDLGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGIAFKG
ERGPPGNPGFPGLPGNRGPMGPLGFGPPGPPGEKGIQGVAGNPGQPGIPGPKGDPGQTIA
QPGKPGLPGNPGRDGEVGLPGEPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPSGPPGFPGLKGEPGFGLPGPPGPPGLPGFKGILGPKGDRGFP
GPQGPPGQAGLDGLPGPKGDIGPNGQPGTMGPPGLPGTGVQGPPGPPGIPGPIGPPGLHG
IPGEKGDPGPPGFDVPGPPGERGSPGIPGAPGPMGPPGSPGLPGKAGASGFPGAKGEMGM
MGPPGPTGPLGIPGRSGVPGLKGDDGLQGQPGLPGPAGEKGSKGEPGLPGLPGPMDPDLL
GSKGEKGDPGLPGIPGVAGPKGYQGLPGDPGQPGLSGQPGLPGPSGPKGNPGLPGKPGLT
GPPGLKGNIGDMGFPGPQGAKGSPGPPGVPGQPGSPGLPGQKGEKGDPGISGIGLPGLPG
PKGEPGLPGYPGNPGIKGAMGDTGLPGLPGTPGAKGQPGLPGFPGTPGLPGPKGINGPPG
NPGLPGEPGPVGGGGRPGPPGPPGEKGKPGQDGIPGPAGQKGEPGQPGFGIPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGLQGPPGPPGSPGPALEGPKGNPGPQGPP
GRPGLPGPEGPRGLPGIGGIKGERGNPGQPGQPGLSGLKGDQGPPGLQGNPGRPGLNGMK
GDPGLPGVPGFPGMKGPSGEPGSTGPEGDPGLIGPPGPPGLPGPSGQSIVIKGDAGPPGV
PGQPGLKGLPGLPGPQGLPGPIGPPGDPGRNGLPGFDGAGGHKGDPGLPGQPGIRGLDGP
PGPDGLQGPPGPPGTSSVAHGFLITRHSQTTDAPQCPQGTIQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMEPLKGQSIQPF
ISRCAVCEAPAVVIAVHSQTIQIPRCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMKRT
NT seq 5058 nt   +upstreamnt  +downstreamnt
atgaaactgcgtggagtcagcctggctgccggcttgttcttactggccctgagtctttgg
gggcagcccgcagaggctgcggcttgctatgggtgttctccaggatcaaagtgtgactgt
agtggtataaaaggagaaaagggggagagaggatttccaggtttggaaggccatccaggt
ttgcctggatttccaggtccagaagggccgccagggcctcggggacaaaagggtagtgat
ggaattccagggccaccaggaccaaaagggatcagaggtcctcctggacttcctggattt
ccagggacaccaggtcttcctggaatgccaggccatgatggggccccaggacctcaaggt
atccccggatgcaatggaaccaagggagaacgtggatttccaggcagtcccggttttcct
ggtttacagggtcctccaggacctcctgggatcccaggtatgaagggagaaccaggtagt
ataattatgtcatcactgccagggccaaagggtaatccaggatttcccggtcctcctgga
atacagggcccagctggtcccactggtataccagggccaattggtccccttggaccacca
ggtttgatgggccctcctggcccaccaggacttccaggaccaaaggggaatatgggctta
aatttccagggacccaagggtgaaaagggtgagcaaggtcttcagggcccccctgggcca
cctgggcagatcagtgaacagaaaagaccaattgatgtagagtttcagaaaggagatcag
ggacttcctggtgaccgagggcctcctggacctccagggatacgtggtcctccaggtcct
ccaggtggtgtgaaaggtgagaagggtgagcaaggagagcctggcaaaagaggtaaaccg
ggcaaagatggagagaatggccaaccaggaattccaggtttgcctggtgatcctggttac
cctggtgaaccaggaagggatggagaaaagggccaaaaaggtgacattggcccaactggg
cctcctggacttgtaattcctaggcctggaactggtgtaactgtaggagaaaaaggaaac
attgggttacctggcttgcctggagaaaaaggagagcgaggatttcctggaatacagggt
ccacctgggcttcctggacctccaggacttgcagttacaggtcctcctgggccccctggc
tttcctggggaaaggggccagaaaggtgatgaaggtccacctggaatttccattcctgga
tctcctggacttgatggacagcctggggctcctggccttccagggcctcctggccctcct
ggccctcacatccctcctagtgatgaggtatgtgaagcaggccctcctggccctccagga
tctccaggtgatagaggactccaaggagaacaaggagtgaaaggtgacaaaggtgacacc
tgcttcaactgtattggaactggtgtttcagggcctccaggtgaacctggtttgccaggt
cttccaggtcccccaggatctcttggtttccctggacagaagggtgaaaaaggacacgct
ggtgcaaccggtcccaaaggattaacaggcataccaggagctccaggcgctccaggcttt
cccggacctaaaggtgaacctggtgacatcctcacttttccaggaatgaagggtgacaaa
ggagatttgggttctcctggagctccaggccttcctggtttacctggcacccctggacag
gatggattgccagggcttcctggccccaaaggagaacctggtggaattgcttttaagggt
gaaagaggtccccctgggaacccaggttttccaggtctcccaggaaatagggggcctatg
ggccctcttggttttggccctccaggcccaccaggtgaaaaaggcatacaaggtgtagca
ggaaatccaggccagccaggaataccaggtcctaaaggtgatccaggtcagaccatagca
cagccagggaagcctggcttgcctggtaacccaggcagagatggtgaagtaggtcttcca
ggtgaacctggactcccaggtcagccaggcttgccaggtatacctggcagcaaaggagaa
ccaggtatccctggaatcgggcttcctggaccacctggtcccaaaggttttcctggaatt
ccaggacctccaggagcacctgggacacctggaagaattggtctagaaggtccttctggg
ccaccaggctttccaggactaaagggagaaccaggatttgggctacctgggccacctggg
cctccaggactcccaggtttcaaaggaatacttggtccgaaaggggatcgtggtttccca
ggacctcaaggtcctccaggacaagctggcttggatgggctacctggaccaaaaggtgac
attggaccaaatggacaacctgggacaatgggacctcctgggctgccaggaacaggtgtt
cagggaccaccagggccaccaggaattcctggaccaataggtccacctggcttgcatgga
ataccaggagagaagggggatccaggacctcctggatttgatgttccaggaccccctggt
gagagaggcagtccagggatccctggagcaccaggtcctatgggccccccaggatcacca
gggctcccaggaaaagcaggtgcctctggctttccaggtgccaaaggtgaaatgggtatg
atgggacctccaggcccaacaggacctttgggaattcctggcaggagtggtgttcctggt
ctcaaaggtgatgatggtttgcagggtcaaccaggacttcctggccctgcaggagagaaa
ggtagtaaaggagagcctggccttccaggccttcctggaccaatggatccagatctgctg
ggctcaaaaggagagaaaggggaccctggcctaccaggtattcctggagttgcagggcca
aaaggttatcagggtttgcctggagacccagggcaacctggactgagtggacagcctgga
ttaccaggaccatcgggtcccaagggtaaccctggtctcccggggaagccaggacttaca
ggacctcctggacttaaaggaaacataggcgatatgggttttccaggcccccagggtgca
aaagggtctcctggacctcctggagttcctggacaacctggctccccaggattacctgga
cagaaaggagaaaaaggtgatcctggtatttcaggcattggtcttccaggtcttcctggc
ccaaagggtgaacctggtttgccggggtacccagggaaccctggtatcaaaggtgctatg
ggagatactggtttgcctggattaccagggacccctggagcaaaaggacaaccaggcctt
cctggattccctggaaccccaggccttcctggaccaaaaggtattaatggtcctcctggg
aaccccggccttccaggagaacctggtcctgtaggtggtggaggtcgtcctgggccacca
gggcctccaggtgaaaaaggcaaaccaggtcaagatggtattcctggaccagctggacag
aagggtgaaccaggtcaaccaggctttggaatcccaggaccccctggacttccaggactt
tctggtcaaaagggtgatggaggattacctggcattccaggaaaccctggccttccaggt
ccaaagggtgaaccaggctttcatggtttccctggtctgcagggtcccccaggccctcct
ggttctccaggtccagctctggaaggccctaaaggcaaccctgggccccaaggtcctccg
gggagaccaggtctaccaggtccagaaggtccccggggtctccctggaattgggggcatt
aaaggagagagaggaaacccaggccaacctggacaacctggtttgtctggtttgaaagga
gatcaaggaccaccaggactccagggtaatcctggccggccaggtctcaatggaatgaaa
ggagatcctggtctccctggtgttccaggatttccaggcatgaaaggacccagcggagaa
cctggttcaactggccctgagggggatccaggacttattggccccccaggtccccctgga
ttacctggtccttcaggacagagtattgtaatcaaaggagatgctggtccaccaggggtt
ccaggccagcctggattaaaaggtctaccaggactaccaggacctcagggtttaccaggt
ccaattggccctccaggagatcctggacgcaatggactccctggctttgatggtgcagga
gggcacaaaggagacccaggtttaccaggccagccaggtatccgtggtttggatggtccc
cccgggccagatggattgcaaggtcccccaggtccccctggaacctcctctgttgcccat
ggatttctcatcacacgtcacagccagacaacagatgcaccacaatgcccacagggaact
atccaggtttatgaaggcttttctctcctgtatgtacaaggaaataaaagagctcatggt
caagacttgggaacggccggcagctgccttcgtcgcttcagtaccatgcctttcatgttc
tgcaacatcaataatgtttgcaactttgcttcaagaaatgactattcttactggctgtcc
accccagagcccatgccaatgagcatggagcccctgaagggccagagcatccagccattc
attagtcgatgtgcagtatgtgaagctccggccgtggtgattgcagttcacagtcagacc
atccagattccccgttgtcctcagggatgggattctctctggattggttattccttcatg
atgcacacaagtgcaggggcagagggctcaggtcaagccctggcctcccctggttcgtgc
ttggaagagtttcgttcagctcccttcattgagtgtcatgggcgggggacctgcaactat
tatgccaactcctacagcttttggctggcaactgtagatgtgtcagacatgttcagcaaa
cctcagtccgaaacgctgaaagcaggagacttgaggacacgtattagtcgatgtcaagtg
tgcatgaagaggacataa

KEGG   Physeter catodon (sperm whale): 102975576
Entry
102975576         CDS       T06011                                 

Gene name
COMP
Definition
(RefSeq) cartilage oligomeric matrix protein
  KO
K04659  thrombospondin 2/3/4/5
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04145  Phagosome
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05144  Malaria
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102975576 (COMP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102975576 (COMP)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    102975576 (COMP)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102975576 (COMP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102975576 (COMP)
  09174 Infectious disease: parasitic
   05144 Malaria
    102975576 (COMP)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pcad04131]
    102975576 (COMP)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102975576 (COMP)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102975576 (COMP)
   04990 Domain-containing proteins not elsewhere classified [BR:pcad04990]
    102975576 (COMP)
Membrane trafficking [BR:pcad04131]
 Endocytosis
  Phagocytosis
   Opsonins
    102975576 (COMP)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102975576 (COMP)
  Exosomal proteins of colorectal cancer cells
   102975576 (COMP)
  Exosomal proteins of bladder cancer cells
   102975576 (COMP)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102975576 (COMP)
Domain-containing proteins not elsewhere classified [BR:pcad04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   102975576 (COMP)
SSDB
Motif
Pfam: TSP_C TSP_3 EGF_CA COMP EGF_3 cEGF EGF_MSP1_1
Other DBs
NCBI-GeneID: 102975576
NCBI-ProteinID: XP_007117001
UniProt: A0A2Y9F800
LinkDB
Position
2
AA seq 756 aa
MVLTAARVLLLTLAALGASGQGQIPLGADLGPQMLRELQETNAALQDVRELLRQQVKEIT
FLKNTVMECDACGMQPALTPRLSVRPLSQCAPGFCFPGVACTETASGARCGPCPAGFTGN
GSHCADVNECNAHPCFPHVRCINTSPGFRCEACPPGYSGPTHEGVGLAFAKANKQVCTDI
NECETMQHNCIPNSVCVNTLGSFQCGPCQPGFVGDQASGCRRRAQRFCPDGTPSPCHEKA
DCVLERDGSRSCVCGVGWAGNGLFCGSDTDLDGFPDEKLSCSERQCRKDNCVTVPNSGQE
DVDRDGIGDACDPDADGDGVLNEKDNCPLVRNPDQRNADGDKWGDACDNCRSQKNDDQRD
TDNDGRGDACDDDIDGDRVRNTVDNCPTVPNSDQKDSDGDGVGDVCDNCPQKSNADQRDV
DHDFVGDACDSDQDQDGDGHQDSRDNCPTVPNSAQQDSDHDGQGDACDDDDDNDGVPDSR
DNCRLVPNPGQEDVDRDGVGDACQGDFDADKVVDKIDVCPENAEVTLTDFRAFQTVVLDP
EGDAQIDPNWVVLNQGMEIVQTMNSDPGLGVGYTAFNGVDFEGTFHVNTVTDDDYAGFIF
GYQDSSSFYVVMWKQMEQTYWQANPFRAVAEPGIQLKAVKSSTGPGEQLRNALWHTGDTK
SQVRLLWKDPRNVGWKDKTSYRWFLQHRPQVGYIRVRFYEGPELVADSNVVLDTTMRGGR
LGVFCFSQENIIWANLRYRCNDTIPEDYEVQRLVQA
NT seq 2271 nt   +upstreamnt  +downstreamnt
atggtcctcaccgccgcccgcgttctcttgctcaccctggctgccctcggcgcatcgggc
caggggcagatcccgctgggtgcagacctgggtccgcagatgctacgcgaactacaggag
accaacgcggcgctgcaggatgtgcgggaactgctgcggcagcaggtcaaggagatcacg
tttctgaaaaacacggtgatggagtgtgacgcgtgcgggatgcaacccgcgctcaccccc
agactgagcgtgaggcccctaagccagtgcgcgcccggcttctgcttccccggcgtggct
tgtaccgagacagcaagcggcgcgcgctgcggaccctgccccgcaggcttcaccggcaac
ggctcgcactgcgccgatgtcaacgagtgcaacgcccatccctgcttcccccacgtccgc
tgcatcaacaccagtccgggcttccgctgcgaggcttgcccaccggggtacagcggcccc
acccacgagggcgtggggctggccttcgccaaggccaacaagcaggtttgcacggacatt
aacgagtgtgagaccatgcagcataactgcatccccaactccgtatgcgtcaataccctg
ggctccttccagtgcggcccgtgccagcccggcttcgtgggagaccaggcatcaggctgc
cgtcggcgcgcacagcgcttctgccccgacggcacgcccagcccgtgccacgagaaggcc
gactgcgtcctggagcgtgatgggtcgcgatcctgcgtgtgcggcgtcggctgggctggc
aacgggctcttctgcggcagcgacacagacttggacggctttcccgacgagaagctgagc
tgctcagagcgccagtgccgaaaggacaactgcgtgacggtgcccaactcagggcaggag
gacgtggatcgcgacggcatcggagacgcctgcgacccggatgccgacggggacggcgtc
ctcaacgagaaggacaactgcccactggtgcggaacccagaccagcgcaatgcggacggc
gacaagtggggcgatgcatgcgacaactgccggtcccagaagaacgacgatcagagggac
acagataacgacggccgaggcgacgcctgcgacgacgacatagacggcgaccgggtccgc
aacacggtggacaactgccccacggtgcccaactcagaccagaaagacagtgatggcgat
ggtgtaggggatgtctgtgacaactgtcctcagaagagcaacgcagaccagagggatgtg
gaccacgactttgtgggagatgcttgtgacagcgaccaagaccaggatggggatgggcac
caggactcgcgggacaactgccccacagtgcccaacagcgcccagcaggactcagaccac
gatggccagggtgacgcctgcgacgacgatgatgacaacgacggggtccccgacagtcgg
gacaactgccgcctggtgcccaacccgggccaggaagacgtggaccgggacggcgtgggc
gacgcgtgccagggggactttgatgcagacaaggtggtggacaagatcgatgtgtgtccg
gagaacgctgaggtcaccctcaccgacttccgggccttccagacggtcgtgctggacccc
gaaggcgacgcccagatagaccctaactgggtggtgctcaaccaggggatggagatcgtg
cagacgatgaacagcgaccctggcctgggtgtgggttacacggccttcaatggcgtggac
ttcgaaggcacgttccacgtgaacacggtcacggatgacgactacgcgggtttcatcttt
ggctaccaggacagctccagcttctatgtggtcatgtggaagcaaatggagcagacatac
tggcaggcaaaccccttccgtgcagtggcggagcctggcatccagctcaaggccgtgaag
tcctccacaggccctggggagcagcttcggaatgcactgtggcacacgggggacacaaag
tcacaggtgcggctgctgtggaaggacccccgcaacgtgggctggaaggacaagacgtcc
taccgctggttcctgcagcaccggccccaagtgggctacatcagagtgaggttctacgag
ggcccggagctggttgcggacagcaatgtggtcctggacacgaccatgcggggaggccgc
ttgggagtcttctgcttctcccaggagaacatcatctgggccaacctgcgctaccgctgc
aatgacaccatccccgaggactacgaggtccagaggctggtgcaggcctag

KEGG   Physeter catodon (sperm whale): 102975888
Entry
102975888         CDS       T06011                                 

Gene name
COL9A3
Definition
(RefSeq) collagen alpha-3(IX) chain
  KO
K08131  collagen type IX alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102975888 (COL9A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102975888 (COL9A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102975888 (COL9A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102975888 (COL9A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102975888 (COL9A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:pcad00535]
    102975888 (COL9A3)
Proteoglycans [BR:pcad00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   102975888 (COL9A3)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 102975888
NCBI-ProteinID: XP_023984361
UniProt: A0A2Y9T0D0
LinkDB
Position
14
AA seq 684 aa
MAGAPALALLLLGQLLAATVTVAEAQKVGPRGPPGPRGPPGKPGKDGIDGEVGPPGLPGP
LGPKGAPGKPGKPGEAGMPGLPGVDGLTGQDGPPGPKGAPGERGSLGPPGPPGLGGKGLP
GSPGEAGVSGVPGGIGLRGPPGPSGLPGLPGPPGPPGPAGRPGVLPEGAADLQCPAICPP
GPPGPPGMPGFKGPTGYKGEQGEVGKDGEKGDPGPPGPPGIPGTVGLQGPRGLQGFPGLL
GPPGDRGPIGFRGPPGIPGAPGKAGDRGRRGPEGLRGPKGDLGRPGPKGVPGMSGLVGEP
GMPGKDGRDGVLGLDGEKGEAGRSGAPGEKGPNGLPGLPGRAGSKGEKGELGRAGELGEA
GPSGEPGVPGGAGVSGERGEAGHRGSAGALGPQGPPGAPGVRGFQGRKGSMGDPGLPGPQ
GLRGGVGDRGPGGAAGPKGDQGIAGADGLPGDKGELGPSGPVGPKGESGSRGELGPKGIQ
GPNGTSGIDGVPGHPGPMGLQGVRGVPGIAGKPGVPGKEASEQRIRELCVGMVSEQIAQL
AAHLRKPLAPGSIGRPGPAGPPGPPGPPGSIGHPGARGPPGYRGPTGELGDPGPRGNQGD
RGDKGSAGEGLDGPDGDQGLQGPQGVPGVGKDGRDGAHGEPGLPGDPGLPGAAGVQGTPG
ICDTSACQGAVMGGGGEKSGPRSS
NT seq 2055 nt   +upstreamnt  +downstreamnt
atggccggagcccccgccctggccctgctcctgcttgggcagctgctggccgccaccgtg
accgtggccgaggcgcagaaagtgggacctcgaggcccccccggcccccgaggaccacct
gggaagccaggcaaggacggcattgatggagaagttggtcctccagggctgcctgggccc
ttgggaccgaaaggggccccggggaagccagggaaaccaggagaggccgggatgccggga
ctgcctggcgtggatggtctgaccgggcaggatggaccccctggacccaagggcgccccc
ggagaacggggaagtctgggaccgccagggccacctgggctggggggcaaaggcctccct
ggatcccccggagaggcaggggtgagcggcgttccaggcggaattggcctccggggcccc
ccgggaccctctggactcccaggcctccctggccccccgggacctcccggccccgctggt
cgcccaggggtccttcccgaaggcgccgctgaccttcagtgccctgccatctgcccgcca
ggccccccggggcccccaggaatgccagggttcaagggacccaccggctacaagggggag
caaggagaagtcggcaaggacggcgagaagggtgaccccggcccccctgggccccctggc
atcccaggcaccgtggggctgcagggtccacggggcctacaaggattcccagggctactc
ggaccccccggggaccggggtcccattggattccgagggccccccgggatcccaggagcc
cctgggaaagcgggtgacagaggccggaggggcccagaggggttgcgcggccccaagggc
gacctcggcaggcctggtcctaaaggcgtccctggcatgtccgggctggtcggagagccg
ggcatgccgggcaaggatggccgggacggtgtgctggggctcgatggcgagaagggagaa
gctggtcgcagtggtgccccaggagagaagggtcccaatgggctgccgggtctccctgga
cgagcagggtccaagggtgagaaaggagaactgggcagagctggagagctgggcgaggcc
ggcccctcaggagagcccggtgtccccggaggcgctggtgtgtccggggagcgtggcgag
gctggccacaggggctcagcgggggctctgggcccacaaggcccccccggagcccctggt
gtccgcggtttccagggccggaagggcagcatgggagaccccggcctgcccggcccccaa
ggcctccgaggcggcgtgggcgaccggggcccgggaggagctgcaggccccaagggcgac
cagggcatcgcgggcgccgacggccttcctggggataaaggagaactgggtcccagtggc
cctgtcggacccaaaggagagtccggcagtcgaggggagctgggtcccaagggcatccag
ggccccaacggcaccagcggcatagacggcgtcccaggccaccctggccccatgggcctg
cagggcgtgcgaggcgtgcctggcatcgccgggaaacccggagtcccggggaaagaggcc
agcgagcagcgcatcagggagctgtgtgtgggcatggtcagcgagcaaatcgcacagcta
gccgcacacttgaggaagcccttagcgccagggtccatcggtcggccgggtcccgctggc
ccccccggccccccggggcccccaggctccattggtcaccccggggctcgagggccccct
ggataccgtggtcccaccggagagctgggagaccccggacccagaggaaaccagggggac
agaggagacaaaggctcagctggcgagggcctggatgggcctgacggagaccaagggctc
caaggaccgcaaggcgtgcccggtgtcggcaaagacggccgcgacggggcccacggagag
ccagggctccctggcgatcctggccttcccggtgctgctggcgttcaggggacccccggc
atctgcgacacctccgcctgccaaggagccgtgatgggagggggcggggaaaaatcaggt
cccaggagctcctaa

KEGG   Physeter catodon (sperm whale): 102976519
Entry
102976519         CDS       T06011                                 

Gene name
COL6A1
Definition
(RefSeq) collagen alpha-1(VI) chain
  KO
K06238  collagen type VI alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102976519 (COL6A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102976519 (COL6A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102976519 (COL6A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102976519 (COL6A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102976519 (COL6A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102976519 (COL6A1)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102976519 (COL6A1)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102976519 (COL6A1)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102976519 (COL6A1)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   102976519 (COL6A1)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3 YcaO
Other DBs
NCBI-GeneID: 102976519
NCBI-ProteinID: XP_007109796
UniProt: A0A2Y9EVD7
LinkDB
Position
8
AA seq 1027 aa
MSLPRVLLPLLLQACWATAQDDVAASRAIAFQDCPVDLFFVLDTSESVALRLKPYGALVD
KVKAFTKRFIDNLKDRYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPGGRDELKSSVDAV
KYFGKGTYTDCAIKKGLEELLVGGSHLKENKYLIVVTDGHPLEGYKEPCGGLEDAVNEAK
HLGIRVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGQSRDAEEVISQTIDTITDMIK
NNVEQVCCSFECQPARGPPGLRGDPGYEGERGKPGLPGEKGDAGDPGRPGDLGPIGYQGM
KGEKGSRGEKGSRGPKGYKGEKGKRGVDGVDGMKGQTGYPGLPGCKGSPGFDGVQGPPGP
KGDAGAFGLKGEKGEPGADGEPGRPGNTGSPGDEGEPGEPGPLGEKGEAGDEGNAGPDGP
PGERGGPGERGPRGTAGARGPRGDLGEAGPQGDQGREGPVGIPGDPGEAGPAGPKGYRGD
EGPPGTEGPRGASGPPGPPGDPGLMGERGEDGPPGNGTEGFPGFPGYPGNRGSPGINGTK
GYPGLKGDEGEAGDPGEDNNDISPRGAKGAKGYRGPEGPQGPPGHMGPPGPDECEILDII
MKMCSCCECKCGPIDILFVLDSSESIGLQNFEISKDFIVKVIDRLSRDELVKFEPGESHA
GVVQYSHNQMQEHVGLRDPNIRNAQDLKEAIKKLQWMGGGTFTGEALQYTRSQLLPLAQN
NRIALVITDGRSDTQRDTTPLSVLCGPDIQVVSVGIKDVFGLAAGSDQLNVISCQGLAPQ
GRPGISLVKENYAELLDDGFLKNITTQICIDKKCPDYTCPITFSSPTDITILLDGSASVG
SHNFDTSKRFAKRLAERFLTAGRTDPGPEARVAVVQYSGTGQQRPERAALQFLQNYTVLA
SALDTMDFFNDATDVTDALGYVTRFYREASSGAAKKRLLLFSDGNSQGATAATIEKAVQE
AQRAGIEIFMVVVGPQVNEPHVRVLVTGKAAEYDVAFGERHLFRVPSYQALLRGVFYQTV
SRKVALG
NT seq 3084 nt   +upstreamnt  +downstreamnt
atgagtctgccccgtgttctgctccccctgctgctgcaggcctgctgggccaccgcgcag
gacgatgtggcagcctcgagggccatcgccttccaagactgccccgtggacctgttcttc
gtgttggacacctccgagagcgtggccctgaggctgaagccctacggggctctggtggac
aaggtcaaggccttcaccaagcgcttcatcgacaacctgaaggacaggtactaccgctgt
gaccgcaacctggtgtggaacgcgggcgcgctgcactacagcgacgaggtggagatcatc
cgcgggctcacgcgcatgcccggcggccgggacgagctcaagagcagcgtggacgcggtc
aagtacttcggcaaaggcacctacacggactgtgccatcaagaaggggctggaggagctg
ctcgtggggggctcccatctgaaggagaacaagtacctgatcgtggtgactgacgggcac
cccctggagggctacaaggagccctgcgggggcctggaggacgccgtgaacgaggccaag
cacctgggtatcagagtcttctctgtggccatcacgcccgaccacctggagccacgtctg
agcattatcgccacggaccacacgtaccggcgcaacttcacggcggccgactgggggcag
agccgcgacgcggaggaggtcatcagccagaccatcgacaccatcacggacatgatcaaa
aacaacgtggagcaagtgtgctgctcgtttgagtgccagcccgccagaggacctcccggg
ctgcggggcgaccccgggtatgagggagaaagagggaagccagggctcccgggagagaaa
ggagatgccggagaccccggaaggcctggggacctcggacccatcggctaccagggcatg
aagggagaaaaagggagccgaggggagaagggctccaggggacccaagggctacaagggc
gagaaggggaagcgtggcgtggacggcgtggacggcatgaaggggcagacggggtaccct
ggcctgccaggctgcaagggctcgcccggattcgatggcgtccaaggaccccctgggccc
aagggtgacgcgggtgccttcggactgaaaggagagaagggtgagcctggagcagacggg
gagcctgggaggcccgggaacacggggtcccccggagacgagggtgagcccggagagcct
ggtcccctgggagagaagggagaagccggcgacgagggaaacgcgggaccagacggaccc
ccgggagagaggggcggccctggggaaagaggaccacgggggaccgcaggcgcgcggggc
ccgagaggagacctgggcgaagctggaccccaaggtgaccagggacgagaaggccctgtt
ggcatccccggagacccgggtgaggctggccccgctgggccgaaaggataccgcggcgac
gaggggcccccagggaccgagggccccagaggagcctcagggcccccaggaccccccgga
gaccccgggctgatgggcgaaaggggtgaagacggcccccccggaaatggcactgagggc
ttccccggcttccctggctatccgggcaacaggggttcccccgggataaacggcaccaaa
ggctaccctggcctcaagggagacgagggagaagctggggaccccggagaggataacaat
gacatttctccccgaggcgccaaaggagcaaagggctaccgaggccccgaaggcccccag
ggacccccaggacacatgggaccaccagggccagacgaatgcgagattttggacatcatc
atgaaaatgtgctcttgctgtgagtgcaagtgcggccccatcgacatcctcttcgtgctg
gacagctccgagagcatcggcctgcagaacttcgagatctccaaggacttcatcgtcaag
gtcatcgaccggctgagcagggacgagctggtcaagttcgagcctggagagtcgcacgcg
ggcgtggtgcagtacagccacaaccagatgcaggagcacgtgggcctgagggaccccaac
atcagaaacgcccaggacctcaaggaagccatcaagaagctgcagtggatgggtggaggc
accttcacaggcgaggctctgcagtacacccggagccagctgctgccactcgcccagaac
aaccgcatcgccctggtcatcacggacggccgctcggacacccagagggacaccacccca
ctcagcgtgctctgcggccctgacatccaggtggtctccgtgggcatcaaggatgtgttc
ggcttagccgcgggctccgaccaactcaacgtcatctcctgccaaggcctggcgccccag
ggacggccgggcatctcgctggtcaaggagaactacgcagaactgctggacgacggcttc
ctgaagaacatcaccacccagatctgcatagacaagaaatgtccagattacacctgccca
atcaccttctcctccccgaccgacatcaccatcctgctggacggctcggccagcgtgggc
agccacaacttcgacacctccaagcgctttgccaagcggctggccgagcgcttcctgacg
gcgggccggacggacccgggcccggaggcgcgcgtagcggtggtgcagtacagcggcacc
gggcagcagcggccggagcgcgcggcactgcagttcctgcagaactacactgtgctggcc
agcgccctggacaccatggacttcttcaacgatgccaccgacgtcaccgacgccctgggc
tacgtgacccgcttctaccgcgaggcctcgtccggcgccgccaagaagaggctgctgctc
ttctccgacggcaactcgcagggcgccacggcggccaccatcgagaaggccgtgcaggag
gcccagcgggcaggcatcgagatcttcatggtggtggtgggcccccaggtgaacgaaccc
cacgtgcgcgtcctggtcaccggcaaggcggccgagtacgacgtggccttcggtgagcgc
cacctgttccgcgtgcccagctaccaggcgctgctgcgcggtgtcttctaccagacggtg
tccaggaaggtggcgctgggctag

KEGG   Physeter catodon (sperm whale): 102977227
Entry
102977227         CDS       T06011                                 

Gene name
THBS4
Definition
(RefSeq) thrombospondin-4
  KO
K04659  thrombospondin 2/3/4/5
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04145  Phagosome
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05144  Malaria
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102977227 (THBS4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102977227 (THBS4)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    102977227 (THBS4)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102977227 (THBS4)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102977227 (THBS4)
  09174 Infectious disease: parasitic
   05144 Malaria
    102977227 (THBS4)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pcad04131]
    102977227 (THBS4)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102977227 (THBS4)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102977227 (THBS4)
   04990 Domain-containing proteins not elsewhere classified [BR:pcad04990]
    102977227 (THBS4)
Membrane trafficking [BR:pcad04131]
 Endocytosis
  Phagocytosis
   Opsonins
    102977227 (THBS4)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102977227 (THBS4)
  Exosomal proteins of colorectal cancer cells
   102977227 (THBS4)
  Exosomal proteins of bladder cancer cells
   102977227 (THBS4)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102977227 (THBS4)
Domain-containing proteins not elsewhere classified [BR:pcad04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   102977227 (THBS4)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA EGF_3 cEGF EGF
Other DBs
NCBI-GeneID: 102977227
NCBI-ProteinID: XP_023979835
UniProt: A0A2Y9STL8
LinkDB
Position
8
AA seq 961 aa
MLAPRGAALLLLHLALQPWLGAGAQASPQVFDLLPSASQRLNPGVLKPVLTDPTLNEFYV
IATFKLQTKSSATIFGLYSSADNSKYFEFTVMGRLNKAILRYLKNDGKIHLVVFNNVQLA
DGRRHRVLLRLNNLHRGAGSVELYLDCSQVDSIHNLPRAFSGLAQSPEAVELRTFQRKAQ
DSLEELKLAVRGSLFQVASLQDCFLQQSEPLATTSTGDFNRQFLGQMSQLNQLLGEVKDL
LRQQVKETSFLRNTIAECQACGPLSFQSPTSNTVVPPVLPASPTPPVRRCDSNPCFRGVR
CTNTRDGFHCGPCPEGYTGNGITCSDVDECKYHPCYPGVRCVNLAPGFRCDACPVGFTGP
TVQGVGIHFAKTNKQVCTDIDECRNGACVLNSICVNTLGSYRCGPCKPGYFGDQTRGCKM
ESSCRNPELNPCSVNAQCIEEKQGGVTCVCRVGWAGDGYICGKDVDIDSYPDEELLCSAR
NCKKDNCKYVPNSGQEDADGDGIGDACDEDADGDGILNEQDNCVLTHNVDQRNGDRDIFG
DACDNCRNVLNNDQKDTDGDGKGDACDDDMDGDGIKNILDNCQKVPNRDQQDKDGDGVGD
ACDSCPEVSNPNQSDVDNDLVGDSCDTNQDSDGDGHQDSTDNCPTVINSAQLDTDKDGIG
DECDDDDDNDGIPDLVPPGPDNCRLVPNPAQEDSNNDGVGDICETDFDHDQVIDQIDVCP
ENAEITLTDFRAYQTVVLDPEGDAQIDPNWVVLNRGMEIVQTMNSDPGLAVGYTAFNGVD
FEGTFHVNTQTDDDYAGFIFGYQDSSSFYVVMWKQTEQTYWQASPFRAVAEPGIQLKAVK
SRTGPGEHLRNSLWHTGDTSDQVRLLWKDSRNVGWKDKVSYRWFLQHRPQVGYIRVQFYE
GSELVADSGVTIDTTMRGGRLGVFCFSQENIIWSNLKYRCNDTIPEDFQEFQTQNFDRLD
N
NT seq 2886 nt   +upstreamnt  +downstreamnt
atgctggccccgcgcggagccgccttgctcctgctgcacctggccctgcagccgtggctg
ggggccggcgcccaggccagcccccaggtctttgaccttctcccatccgccagccagagg
ctgaacccaggtgtcctgaagccagtcctgacagaccccaccctgaacgagttctatgtg
atcgccaccttcaagctgcagacgaaaagttcagccaccatctttggactttactcttca
gctgacaacagcaagtattttgagtttactgtgatgggacgcttgaacaaagccatcctc
cgttacctgaagaatgatgggaagattcatttggtggttttcaacaacgtgcagctggcc
gacggcaggcggcacagggtcctcctgagactgaacaatctgcaccggggagctggctct
gtagagctctacctggactgctcgcaagtggattccattcacaatctccccagagccttt
tctggcctcgcccagagtcctgaggctgttgaattgaggactttccagaggaaggcacag
gactccttggaagagctgaagttggcggtgagaggctcgctgttccaggtggccagcttg
caagactgcttcctgcagcagagtgagccgctggccaccacaagcacaggagacttcaat
cggcagttcttggggcagatgtcacaactaaaccagctactgggagaggtgaaggatctt
ctgagacagcaggtcaaggaaacatcatttttgcgaaacaccatagctgaatgccaggct
tgtggtcctctcagctttcagtctccaacctccaacaccgtggtaccccctgtgctccca
gcatccccgacacccccagtgcgccggtgcgattccaacccgtgtttccgtggcgtccga
tgtaccaacaccagagatggctttcattgtgggccctgccctgagggctacacaggaaat
ggaatcacctgttctgacgttgacgagtgcaaataccatccctgctacccaggcgtgcgc
tgtgtgaacttggctcctggcttcagatgtgacgcctgcccggtgggcttcacagggccc
acggtgcagggtgttgggatccattttgccaagacaaacaagcaggtctgcactgacatt
gatgagtgtcgaaatggagcatgtgttctcaattctatctgtgttaacactttgggatct
tatcgctgtgggccttgtaaaccagggtactttggtgatcagacgaggggatgcaaaatg
gaaagcagctgcaggaatccagaactgaacccttgcagcgtgaacgcacagtgcattgaa
gagaagcaggggggtgtgacatgcgtgtgtagggtcggctgggctggtgacggctacatc
tgcggaaaggacgtggacattgacagttaccctgacgaagaattgctgtgctctgccagg
aactgcaaaaaggacaactgcaagtatgtgccaaattccggccaagaggatgcagacgga
gatggcattggagatgcttgtgacgaggacgctgatggagatgggatcctgaatgagcag
gataactgtgtcctgactcacaatgtggaccagaggaacggtgacagagatatctttggg
gatgcctgtgataactgccggaacgtcctaaataatgaccagaaggacactgacggggat
ggaaaaggagatgcctgtgatgacgacatggatggagatggaataaaaaacatcctggac
aactgccagaaagtccccaaccgtgatcaacaggacaaagatggtgatggcgtgggggat
gcctgtgacagctgtcctgaggtcagcaaccctaaccagtctgacgtggataacgatctg
gttggggactcctgtgataccaatcaggacagtgatggagacgggcaccaggacagcaca
gacaactgccccactgtcattaacagtgcccagctggacactgataaggatggaattggg
gacgagtgtgatgatgacgacgacaatgacggcatcccagacctggtgccccctggacca
gacaactgccggctggtccccaaccctgcccaggaggacagcaacaacgacggcgtgggg
gacatctgcgagacagacttcgaccacgaccaggtcatcgatcagatcgacgtgtgccca
gagaacgcagagatcaccctgaccgacttcagggcctatcagacggtggtcctggaccct
gaaggggatgcccagatcgatcccaattgggtggtcctgaaccggggcatggagatcgtg
cagaccatgaacagtgatccaggcctggcagtggggtacacggcttttaatggagtggac
ttcgaagggaccttccacgtcaacacccagacggacgatgactatgctggctttatcttt
ggttaccaagacagctccagcttctatgtggtcatgtggaagcagacagagcagacatac
tggcaggccagcccgttccgagctgttgcagaacctggcatccagctcaaggccgttaag
tctaggacgggtccgggggagcatctccgcaactccctgtggcacaccggggacaccagc
gaccaggtcaggctgctgtggaaggactccaggaacgtgggctggaaggacaaggtgtcc
taccgctggttcctgcagcaccggccccaggtgggctacatcagggtgcagttttatgaa
ggctccgagttggtggctgactcgggggtcaccatagacaccaccatgcgtggaggccgg
ctcggcgtgttctgcttctcccaggaaaacatcatttggtccaacctcaaatatcgctgc
aatgacaccatcccagaggacttccaagaatttcaaacccagaatttcgatcgcctggat
aactaa

KEGG   Physeter catodon (sperm whale): 102978142
Entry
102978142         CDS       T06011                                 

Gene name
FN1
Definition
(RefSeq) fibronectin isoform X1
  KO
K05717  fibronectin 1
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04810  Regulation of actin cytoskeleton
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad05100  Bacterial invasion of epithelial cells
pcad05135  Yersinia infection
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05205  Proteoglycans in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102978142 (FN1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102978142 (FN1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102978142 (FN1)
  09142 Cell motility
   04810 Regulation of actin cytoskeleton
    102978142 (FN1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102978142 (FN1)
   05205 Proteoglycans in cancer
    102978142 (FN1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102978142 (FN1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102978142 (FN1)
  09171 Infectious disease: bacterial
   05135 Yersinia infection
    102978142 (FN1)
   05100 Bacterial invasion of epithelial cells
    102978142 (FN1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102978142 (FN1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102978142 (FN1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pcad04131]
    102978142 (FN1)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102978142 (FN1)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102978142 (FN1)
   04990 Domain-containing proteins not elsewhere classified [BR:pcad04990]
    102978142 (FN1)
Membrane trafficking [BR:pcad04131]
 Endoplasmic reticulum (ER) - Golgi transport
  Forward pathways
   ER-Golgi intermediate compartment (ERGIC) proteins
    102978142 (FN1)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of bladder cancer cells
   102978142 (FN1)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102978142 (FN1)
Domain-containing proteins not elsewhere classified [BR:pcad04990]
 Fibronectin (FN) domain-containing proteins
  Fibronectin type I domain-containing proteins
   102978142 (FN1)
SSDB
Motif
Pfam: fn3 fn1 Pur_ac_phosph_N fn2 DUF2369 DUF1410
Other DBs
NCBI-GeneID: 102978142
NCBI-ProteinID: XP_007100926
UniProt: A0A2Y9EFF6
LinkDB
Position
2
AA seq 2477 aa
MLGGPGPRLLLLTVLSLGTAVPSTGVSKSKRQAQQIIQPQAPLTVSQSKPGCYDNGKHYQ
INQQWERTYLGSALVCTCYGGSRGFNCESKPEPEETCFDKYTGNTYRVGDTYERPKDSMI
WDCTCIGAGRGRISCTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCK
PIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTCLGEGSGRITCTSRNRCNDQDTRTSY
RIGDTWRKKDNRGNLLQCICTGNGRGEWKCERHTTLQTTSAGSGSFTDVRTAIYQPQPHP
QPASYGHCVTDSGVVYSVGMQWLKTQGNKQMLCTCLGNGVSCQETAITQTYGGNSNGEPC
VLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTRGGNSNGALC
HFPFLYNNRNYTDCTSEGRRDNMKWCGTTRNYDADQKFGFCPMAAHEEICTTNEGIMYRI
GDQWDKQHDMGHMMRCTCVGNGRGEWTCVAYSQLRDQCIVDDITYNVNDTFHKRHEEGHM
LNCTCFGQGRGRWKCDPIDQCQDSESRTFYQIGESWEKFLHGVRYQCYCYGRGIGEWGCQ
PIQTYPGSTGPVQVIITETPSQPNSHPIQWNAPEPSHISKYILRWKPKNSPDRWKEATIP
GHLNSYTIKGLRPGVVYEGQLISVQHYGQREMTRFDFTTTSTSPVVTSNTVTGETTPFSP
VVATSESVTEITASSFVVSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTATSVNIPDL
LPGRKYIVNVYQISEEGEQSLILSTSQTTAPDAPPDPTVDQVDDTSIVVRWSRPQAPITG
YRIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVFIQQETTG
VPRSDKVPPPRDLQFVEVTDVKITIMWTPPESAVTGYRVDVIPVNLPGEHGQRLPISRNT
FAEVTGLSPGVTYHFKVFAVNHGRESKPLTAQQTTKLDAPTNLQFINETDTTVIVTWTPP
RARIAGYRLTVGLTRGGQPKQYNVGPSATQYPLRNLQPGSEYSATLVAVKGSQQSPRVTG
VFTTLQPLGSIPPYNAEVTETTIVITWTPAPRIGFRLGVRPSQGGEAPREVTSDSGSIVV
SGLTPGVEYIYTISVLKDGQERDTPIVKKVVTPLSPPKNLHLETNPDTGVLTVSWERSTT
PDITGYRITTTPTNGQQGYSLEEVVHADQSSCTFENLSPGLEYNVSVYTVKDDKESVPIS
DIIMPEVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSV
GYYTVTGLEPGIDYDISVITLINGGESAPTTLTQQTAVPSPTDLRFTNIGPDTMRVTWAA
PPSIELTNLLVRYSPVKNEEDVAELSVSPSDNAVVLTNLLPGTEYLVSVSSVYEQHESVP
LRGRQKTGLDSPTGIDFSDITANSFTVHWIAPRATITGYKIRHHPDHMSARPREDRVPPS
RNSITLTNLNPGTDYVVSIVALNGKEESPSLVGQQTTVSDVPRDLEVIAATPTSLLISWD
APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGADYTITVYAVTGRGDSPA
SSKPVSIDYRTEIDKPSQMQVTDVQDNSISVRWLPSSSPVTGYRVTTTPKNGPGPSKTMT
AGPDQTEMTIEGLQPTVEYVVSVYAQNRNGESQPLVQTAVTNIDRPKGLAFTDVDVDSIK
IAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEETAELQGLRPGSEYTVSVVALHDDM
ESQPLIGTQSTAIPAPTNLKFTQVSPTSLAAQWTAPNVQLTGYRVRVTPKEKTGPMKEIN
LAPDSSSVVVSGLMVATKYEVSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETT
ITISWRTKTETITGFQVDAIPANGQTPVQRTIKPDVRSYTITGLQPGTDYKIYLYTLNDN
ARSSPVVIDASTAIDAPSNLHFLATTPNSLLLSWQPPRARITGYIIKYEKPGSPPREVVP
RPRPGVTEATITGLEPGVEYTIQVIALKNNQKSEPLIGRKKTDELPQLVTLPHPNLHGPE
ILDVPSTVQKTPFITNPGYDTGNGIQLPGTSGQQPSLGQQMIFEEHGFRRTTPPTMASPV
RHRPRPYPPNINEEIQIGPVPRGDVDHHLYPHVLGLNPNASTGQEALSQTTISWTPFQES
SEYIISCHPVGIDEEPLQFRVPGTSASATLTGLTRGATYNIIVEAIKDQKRHKVREQVVT
VGNSVDQGLNQPTDDSCFDPYTVSHYAIGEEWERLSESGFKLSCQCLGFGSGHFRCDSSK
WCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDPHEATCYDDGKTYHVGEQWQ
KEYLGAICSCTCFGGQRGWRCDNCRRPGAEPGHEGSTGHSYNQYSQRYHQRTNTNVNCPI
ECFMPLDVQADREDSRE
NT seq 7434 nt   +upstreamnt  +downstreamnt
atgctcgggggtccggggccccggctgctgctgctgaccgtcctgtccctggggacagcc
gtgccctccactggagtctcgaagagcaagaggcaggctcagcagatcattcagccccag
gccccgctgactgtcagccaaagcaagcctggttgttatgacaatgggaaacactatcag
ataaaccaacaatgggagcgcacctacctgggcagtgccttggtctgtacctgttatgga
gggagccggggctttaactgcgagagcaagcctgaacctgaagagacttgctttgacaag
tacaccgggaacacttaccgggtgggcgacacttacgagcgccctaaagactccatgatc
tgggactgcacctgcattggggctgggcgagggagaataagctgcaccattgcaaaccgc
tgccatgaagggggtcagtcctacaagattggtgacacctggaggagaccacacgagact
ggtggttacatgttggagtgtgtgtgtctcggtaacgggaaaggagaatggacctgcaaa
cccatagctgagaaatgttttgatcacgctgctgggacttcctacgttgtcggggagacc
tgggaaaagccttatcaaggctggatgatggtggattgtacttgtctgggagaaggcagt
ggacgcatcacctgtacctccagaaacaggtgcaacgatcaggacactaggacatcctat
agaattggggatacctggcgcaagaaggacaatcgggggaacctgctccagtgcatctgc
accggcaacggccgtggcgagtggaagtgtgagaggcacacgaccctgcagaccacgtct
gccggatctggctccttcacagacgtccgaacggccatttaccagccacagcctcacccc
cagccggcttcatacggtcactgtgtcacggacagcggtgtggtttactccgtgggcatg
cagtggctgaagacacaaggaaataagcaaatgctttgcacttgcctgggcaatggagtc
agttgccaagagacagctataacccagacttatggaggcaattcaaatggggagccttgc
gtcctaccattcacctacaacggcaggactttctactcctgcaccacagaagggcgacag
gatggacatctctggtgcagcacaacttccaattatgagcaagaccagaaatattctttc
tgtacagaccatactgttttggttcaaactcgaggtggaaattccaatggtgccttgtgc
cactttcccttcctgtacaacaaccgaaactacaccgactgtacatccgagggcaggagg
gacaacatgaaatggtgtggaaccacacggaactatgatgctgaccagaagtttggattc
tgccccatggctgcccacgaggaaatctgtacaaccaatgaagggatcatgtatcgcatt
ggagaccagtgggacaaacagcatgatatgggccacatgatgagatgcacatgtgttggg
aacggtcgtggagaatggacatgtgtcgcctactcccagctccgagatcagtgcatcgtc
gacgacatcacatacaatgtgaatgacacattccacaaacgtcacgaagagggacacatg
ctgaattgtacctgcttcggtcagggcaggggccggtggaagtgcgatcccattgatcaa
tgccaggattcagaatcccgtaccttttatcaaattggagaatcatgggagaagtttttg
catggagtcaggtaccagtgctactgctatggccgtggcattggggagtggggctgccag
cccatacagacctatccaggttcaactggtcctgtccaagtaataatcactgagaccccc
agtcagcccaactctcaccccatccaatggaatgcaccagaaccatctcacatttccaag
tacattctcagatggaaacctaaaaattctccagaccgttggaaggaagccaccattcca
ggccacttaaactcctacaccatcaaaggcctgaggccaggcgtggtatacgagggccag
ctcatcagcgtccagcactacggccaaagagagatgacacgcttcgacttcaccaccacc
agcacaagcccagtggtgaccagcaacaccgtgacgggagagacaacacccttttctccc
gtcgtggccacctccgaatccgtgactgaaatcacagccagcagctttgtggtctcctgg
gtctcggcttccgacactgtgtccggattccgggtggagtatgaactgagtgaagaggga
gatgaaccacagtacctggatcttcccagcacagccacttccgtgaacatccctgacctg
cttcctggccgaaaatacattgtgaacgtctatcagatatccgaagaaggagaacagagt
ttgatcctgtctacctcacagacaacagcacctgatgcccctcctgaccctaccgtggac
caggttgatgacacctcgattgtcgttcgctggagcagaccccaggcgcccatcacagga
tacagaatcgtctattccccgtcagtagagggtagtagcacagaactcaaccttcctgaa
actgccaactctgtcactctcagtgacttgcagcctggtgttcaatataacatcactatc
tacgctgtggaagaaaatcaagagagtactcctgtcttcatccaacaggaaaccactggc
gtcccacgttcagataaagttcccccgcccagggacctgcagtttgtggaggtaacggac
gtgaagatcaccatcatgtggacaccccctgagagcgcggtgaccggttaccgagtggac
gtgatccccgtcaacctgcccggggaacatgggcagaggctgccaatcagcaggaacaca
tttgcagaagtcactgggctgtcccctggggtcacctatcacttcaaggtcttcgccgtg
aaccacgggagggagagcaagcctctgaccgcacaacagacgaccaaattggatgctccc
accaacctccagtttatcaatgagactgacaccaccgtcatagtgacctggactccacct
cgtgcccggatagccgggtaccgactgaccgtgggcctgacccgtggaggccagcccaag
cagtacaacgtgggtccctcggccacgcagtatccactgaggaatctgcagcctggctct
gagtactctgcaacccttgtggcggtgaaaggcagccagcagagccccagagtcactgga
gtcttcaccactctgcagcctctgggctccatcccaccttacaacgcagaagtgactgaa
accaccattgtgattacatggacgcctgctccaaggattggttttaggctgggtgtacga
ccaagccagggaggggaagcaccacgagaagtgacctcagactcaggaagcatcgttgtg
tctggcctgaccccaggcgtggaatacatttacaccatttcagtcctgaaagatgggcaa
gagagagatacaccaattgtaaagaaagtggtgacaccactgtctccaccaaagaacttg
cacctggaaacaaaccccgacactggcgtgcttactgtctcctgggagaggagtaccacc
ccagacattactggctatagaattaccaccacccctacaaatggccagcagggatattct
ttggaagaagtggtccatgccgatcagagttcctgcacttttgaaaacctgagtcctggc
ctggagtacaatgtcagtgtttacactgtcaaagatgacaaggaaagtgtccctatctct
gatatcatcatgccagaggtgccccaactcactgacctaagctttgttgatataaccgat
tcaagcatcggcctgaggtggaccccgctaaactcttccaccattattgggtaccgcatc
acagtagttgcggcaggagaaggtatccctatttttgaagattttgtggactcctcagta
ggatactacacagttacagggctggagcctggcattgactatgacatcagcgttatcact
ctcattaatggcggagagagtgcccctactacactgacacaacaaaccgctgtcccttct
cccactgacctgcgatttaccaacattggtcccgacacgatgcgtgtcacctgggccgca
cctccatccattgaattgaccaacttactggtgcgctactcgcctgtgaaaaatgaggag
gatgttgccgaattgtcagtttctccttcagacaatgcagtggtcttaacaaatctcctg
cccggcacagagtatttagtcagcgtctccagtgtttacgagcagcacgagagcgtacct
cttagaggaagacagaaaactggtctcgattccccaaccggcattgacttttctgatatc
actgccaactctttcactgtccattggattgctcctcgagctaccatcaccggctacaag
atccgccatcatcctgaccacatgagtgcaagacctcgggaggatcgagtgcccccctct
cggaattccatcactctcaccaatctcaatccgggcacagattatgtggtcagcattgtc
gctcttaatggcaaagaggaaagtccttccttggttggccagcaaacaacagtttctgat
gttccaagggacctggaagtcattgctgcaacccccactagcctgttgatcagctgggat
gctcctgccgtcacagtgagatattacaggatcacctatggagaaacaggaggaaatagc
cctgtccaggagttcactgtgcctgggagcaagtctacagctaccatcagtggtcttaaa
cctggagcagactacaccatcaccgtttatgctgtcactggccgtggggacagcccggca
agcagcaagcccgtttccatcgattaccgaacagaaattgacaaaccatcccagatgcaa
gtgactgatgtccaggacaacagcattagtgtcaggtggctgccttcaagttcccctgtt
accggttacagagtgaccactactcctaaaaatggcccaggaccatcgaaaacaatgact
gcaggtccagatcaaacagaaatgaccattgaaggcttgcagcccacagtggagtatgtg
gttagtgtctacgctcagaatcgaaacggagagagtcagcctctggttcagacagcagta
accaacattgatcgccctaaaggactggcattcactgatgtggatgtcgattccatcaaa
attgcttgggaaagcccacaggggcaagtttccaggtacagggtgacctactcgagccct
gaggatggaatccatgagctattccctgcacctgatggtgaagaagagactgcagagctg
caaggcctcaggccgggttctgagtacacagtcagtgtggttgccttgcacgatgatatg
gagagccagcccctgattggaacccagtccacagccattcctgcaccaaccaacctgaag
ttcactcaggtttcaccaaccagcctggccgcccagtggacggcacccaatgttcagctc
actggatatcgagtgcgggtgacccctaaggagaagaccggaccaatgaaagaaatcaac
cttgctcccgatagctcatctgtggttgtatcaggactcatggtggccactaaatatgaa
gtgagcgtctatgctcttaaggacaccttgacaagcagaccagctcagggagtcgtcacg
actctggagaatgtcagccctccaagaagggcccgtgtgacagatgctactgagaccacc
atcaccattagctggagaaccaagactgagacgatcaccggcttccaagtcgatgccatc
ccagccaacggccagactccagttcagagaaccatcaagccagacgtcagaagctacacc
attacgggtttacaacccggcactgactacaagatctacttgtacaccttgaatgacaac
gctcggagctcccctgtggtcatcgacgcctctactgccattgatgcaccatccaacctg
catttcctggccaccacacccaactccctgctgttatcatggcagccacctcgtgccagg
attactggttacatcatcaagtatgagaagcctgggtcccctcccagagaagtggtccct
cggccccgccctggtgtcacagaagctactattactggtctggaaccaggagtcgagtac
acaatccaagtcattgccctcaagaacaatcagaagagtgagcctctgatcgggaggaaa
aagacagatgagcttccccaactggtaacccttccacaccccaatcttcatggaccagag
atcctggatgttccctccacagttcaaaagacccctttcatcaccaaccctgggtatgac
actggaaacggtattcagcttcctggcacttctggtcagcagcccagtcttgggcaacaa
atgatctttgaggagcatggttttaggcgaaccacaccgcccacaatggccagccccgta
aggcataggccaagaccgtatccaccgaatataaatgaggagatccaaattggtcctgtc
cccaggggagacgtagaccatcatctctaccctcacgttctgggactcaatccaaatgct
tctacaggacaagaagctctctctcagacaaccatctcatggaccccattccaggaaagc
tctgagtatatcatttcatgtcatccagttggcattgatgaagaacccttgcagttccga
gttcctggaacttctgctagtgccaccttgacgggtctcaccagaggggccacctacaac
atcatagtggaggcaataaaagaccagaagaggcacaaggttcgggagcaagtagttacc
gtgggcaattctgttgaccaaggcctaaaccaacccacggatgactcgtgcttcgacccc
tacacggtttcccattacgccatcggagaggagtgggagcgattgtctgaatctggcttt
aaactctcatgccagtgcttaggctttggcagtggtcatttcagatgcgattcatctaaa
tggtgccatgataatggtgtgaactacaagattggagagaaatgggatcgtcagggagaa
aatggccagatgatgagctgcacgtgtctcggaaatggaaaaggagaattcaagtgcgat
ccccacgaggcaacgtgttacgacgatgggaagacgtaccacgtgggagaacagtggcag
aaagaatatcttggcgccatttgctcctgcacgtgctttggagggcagcggggctggcgc
tgtgacaactgccgcagaccaggggctgaacccggtcacgaaggctcaactggccactcc
tacaaccagtattcgcagagataccatcagagaacaaacactaatgtcaactgcccaatt
gagtgcttcatgcctttagatgtacaggctgacagagaagattcccgagagtaa

KEGG   Physeter catodon (sperm whale): 102978518
Entry
102978518         CDS       T06011                                 

Gene name
COL4A2
Definition
(RefSeq) collagen alpha-2(IV) chain isoform X1
  KO
K06237  collagen type IV alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102978518 (COL4A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102978518 (COL4A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102978518 (COL4A2)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102978518 (COL4A2)
  09154 Digestive system
   04974 Protein digestion and absorption
    102978518 (COL4A2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102978518 (COL4A2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102978518 (COL4A2)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102978518 (COL4A2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102978518 (COL4A2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102978518 (COL4A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102978518 (COL4A2)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102978518 (COL4A2)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102978518 (COL4A2)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102978518 (COL4A2)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102978518
NCBI-ProteinID: XP_023978888
UniProt: A0A2Y9SR44
LinkDB
Position
13
AA seq 1760 aa
MDRDQRAASRPALRRWLLLGAVTVGLLAQGVLAGVKKFDVPCGGRDCSAGCQCYPEKGGW
GQPGPVGPQGYTGPPGLQGFPGLQGRKGDKGERGAPGITGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGPPGYDGCNGTVGDAGYPGPSGPGGFLGPPGPQGPKGQKGEPYALSSEDRD
KYRGEAGVPGLVGFQGPPGHPGPMGQMGPVGAPGRAGPPGPPGPKGQPGNRGLGFYGEKG
EKGDMGQQGPGGIPPDNSYVERPKNDVILPEQYKGEKGSEGEPGRKGISLKAEEGVMGFS
GARGAPGFDGEKGSPGQKGSRGPDGYQGPDGYPGPKGEMGEPGPPGPPAYSPHPSLAKGA
RGDPGFQGAPGDPGARGEPGDPGLPGLPGTSIGDGDEKRGLPGEMGPKGFTGEPGIPALY
PGPPGADGKPGLRGPPGLPGPPGPDGFLFGLKGAKGRVGYPGPSGFPGARGQKGWKGDAG
DCQCAEGDRFIGGLPGPPGPKGFPGTNGEPGRKGSPGDPGQHGIPGFPGFKGGPGDVGPP
GPKGMKGDSRTITTKGERGQPGVPGVPGLKGDSGIPGRDGLDGFPGLPGSPGDGIKGPSG
DAGHPGPPGTKGIPGERGPPGLGLPGPKGERGFPGDAGLPGPPGFPGPPGAPGTPGQIDC
DTGVKRPIGADGQETVQPGCVGGPKGSPGQPGPPGPPGAKGVRGLPGFSGTDGAPGLKGL
PGDPGQEGFPGPPGFVGPRGVKGAVGRPGLDGLPGPSGLPGPVGPPGDRGLPGEVLGAQP
GPRGDTGLPGRSGLKGPPGERGPPGFRGSQGMPGMPGPKGQQGFPGPSGQPGLPGPPGQH
GFPGAPGREGPLGPPGTPGFGGLPGDRGEPGDTGVPGPVGMKGVSGDRGDPGWQGERGHP
GGPGFKGIAGMPGAPGPKGVKGSPGMDGFQGMLGLKGRSGLPGNKGETGFFGVPGLKGLA
GEPGVKGSRGDPGPPGPPPIILPGMKDIKGEKGDEGPMGLKGYLGLKGLPGMPGIPGLSG
VPGLPGKPGHVKGVKGDIGFPGVPGSPGFPGVPGSPGISGFPGFTGSRGDKGAPGRVGLH
GEIGSIGDFGDIGNTIDLPGSPGLKGEGGTAGAPGLKGFFGEKGTEGDVGFPGITGLAGV
QGPPGLKGQTGFPGLTGLQGPQGEPGRAGAPGTKGDFGWPGSPGLPGLPGLRGIGGLHGL
PGTKGFPGSPGADIHGDAGFPGPAGARGDPGEANTRPGPAGAPGRKGERGVPGERGPVGS
PGLQGFPGITPPSNISGLPGDAGAPGIFGLEGYRGPPGPPGPVALPGIKGDEGSPGAPGN
PGTKGWVGDSGPQGRPGVFGLLGEKGPRGEQGFMGNAGAAGSVGDRGPKGPKGDRGLPGA
PGAVGAPGIAGIPQRIAIQRGPVGPQGRRGPPGPQGEMGPQGRPGEPGRSPAAGPPLVSC
LLCFRFCSPAGKHRLSASAGASHPSRGTHACLGGCAGFRGTPGKAGPQGRGGVSAVPGFR
GDQGPVGLQGPVGYEGEPGRPGSPGLPGMPGRSASIGHLLVKHSQTEQEPMCPVGMNKLW
SGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDICYYASRNDKSYWLSTTAP
LPMMPVAEEDIRPYISRCSVCEAPAVAIAVHSQDVSIPHCPPGWRSLWIGYSFLMHTAAG
DEGGGQSLVSPGSCLEDFRATPFIECNGARGTCHYYANKYSFWLTTIPEQSFQGTPSADT
LKAGLIRTHISRCQVCMKNL
NT seq 5283 nt   +upstreamnt  +downstreamnt
atggaccgagaccagcgcgcggcgtcgcgccctgccctgcggcggtggctactgctggga
gctgtgaccgtggggctcctggcccagggcgtcctggcgggcgtgaagaagttcgatgtg
ccctgtggagggagagactgcagtgcgggctgccagtgctatcccgagaaaggaggatgg
ggccagccagggccagtgggcccacagggatacaccgggccccccgggttacaagggttc
ccaggactgcaaggccgcaaaggtgacaagggtgaaaggggggctcccgggatcacagga
ccaaagggagacgtgggagccagaggcgtttctggatttcctggtgccgacggaattccc
ggacatcctggccaaggcgggcccagaggaccgcctggctacgacggctgcaacgggaca
gtgggtgacgcaggctacccgggacccagcggccctggcggcttcctcggcccccctggg
ccccaaggacccaaagggcagaaaggcgagccctacgctctgtctagtgaagaccgcgac
aaatacaggggggaagctggagtgcccggactggtcggtttccaggggcctcccggccac
cctgggcccatgggacagatgggtccggttggagctccaggaagagcaggaccccctgga
ccccctggaccaaaaggacagccaggcaacagaggacttggtttttacggagaaaaagga
gaaaagggcgacatgggacagcagggacccggtgggattccgccagacaatagttacgtc
gagaggcccaaaaatgatgtgatcctcccagagcaatataagggtgaaaaaggaagtgag
ggggagccaggaagaaaaggcatttccttgaaggccgaagaaggagtcatgggcttttca
ggagcgcggggtgcccctggcttcgacggtgagaaaggttcaccgggacagaaaggaagc
agaggaccggatggttaccaaggccctgatggatacccaggacccaagggagaaatgggt
gaaccaggcccccccggcccaccagcctactccccacatccttccctggccaaaggtgcc
agaggggacccaggattccaaggagcccccggggacccaggagcccggggggagcctgga
gaccccggcttgccaggcctgcctggcacgtccatcggagatggagatgagaagagaggc
ttaccgggcgagatgggacccaaaggcttcacaggagagccgggcatccccgccctgtac
cccggcccgccgggagctgacggaaagccggggctccgaggtcccccagggctcccggga
ccacctggcccagacggtttccttttcggccttaaaggagcaaaaggcagggtgggctac
cctgggccttctggtttccccggggctcgcggacagaaaggatggaaaggcgacgccggc
gactgtcaatgtgcagagggtgaccggttcatcgggggtcttccggggccaccaggaccc
aagggctttcccggcaccaatggggaaccagggaggaaaggcagcccgggagaccccggc
cagcacggcatccccgggttcccaggattcaagggaggccccggtgatgttggccctccc
gggcccaaaggaatgaagggagattctcgaaccatcaccaccaaaggtgagcggggccag
ccaggtgtcccaggcgtgcctggactgaaaggggacagcggcatcccggggcgtgacggg
ctggatggattcccaggcctcccaggctctcctggtgatggcatcaaaggcccctcaggg
gacgcaggccacccaggaccacccggtactaagggcattccaggagaaagaggcccccca
ggactggggctgccaggccccaaaggcgagcgtggcttccccggcgatgctggattacct
ggaccgccaggcttccccgggcctcccggcgcccccggcactccaggacaaatagattgt
gacacgggtgtgaaaaggcccatcggggcagatggccaggagaccgtccagccaggttgt
gttggagggcctaagggatcgccaggccagccgggcccccccggccccccaggtgccaaa
ggcgtcagagggcttccgggcttctcgggaacggacggggcaccagggctcaagggtctc
ccgggagatccaggtcaagaggggttcccgggacccccagggttcgtggggccccgagga
gtcaaaggtgcagtgggccgccctggcctggatggactcccgggtccctccggcctgccg
gggccagtcgggcctccgggagacaggggcctccctggagaagtcttgggggcccagccc
gggccccggggagacaccggcctgcctggacgctctgggctgaagggcccccccggagag
agaggccctcctggattccggggaagccaggggatgccgggaatgccgggcccgaagggt
cagcagggcttcccaggaccttcgggccagccgggactgcccgggccaccaggacagcat
ggcttccctggagctcctggccgggaagggcccttggggccgccgggcaccccaggtttt
ggaggtctgcctggagacagaggggagccaggtgacacaggcgtccctggccccgtgggc
atgaagggcgtctccggggatagaggtgaccctggctggcagggcgagagaggtcatccc
ggaggccccggattcaaaggaatagccgggatgcctggtgccccgggccccaaaggagtt
aagggttctcccgggatggacggcttccaaggcatgctcgggctcaaggggagatccggg
cttccaggaaacaaaggagagactggattttttggagtccccgggctgaagggcctggct
ggcgagcccggtgtgaaaggcagccgaggggaccccggcccccccggaccacctcccatc
atcctgccaggaatgaaagacatcaaaggagagaaaggagatgaaggccccatgggactg
aaaggatacctgggcttgaaaggccttcccggaatgccagggatccccgggctgtcggga
gtccctgggctgccagggaaaccaggccacgtcaaaggagtcaaaggagatatcggattc
cccggcgtgcccgggtcaccgggattccccggcgtgcccggctcccccggaatctcaggg
tttccgggcttcacaggcagtaggggcgacaagggagctccggggagagtaggccttcac
ggcgagatcggctccatcggtgactttggtgatatcggaaacactatagaccttccagga
agtccaggcctgaagggggaaggaggcaccgctggagcaccaggtctaaagggattcttc
ggagagaaaggaacagagggtgatgtcggcttccctgggatcaccggcctggcaggagtc
cagggccctcctggactgaaagggcaaacaggctttccaggactgacagggctgcagggg
cctcagggagagccgggacgggcgggagcgcccggcaccaaaggagacttcggctggccg
gggagcccaggcctaccaggtctgcccggcctccgcggcatcggcggattacacggcttg
ccaggcaccaaaggcttcccaggatccccaggtgcggacatccacggggacgccggcttc
ccaggtcctgccggggccaggggtgacccaggagaggccaacacccgtccaggccccgca
ggagccccgggacgaaaaggggagcgtggagtcccaggggaacgaggcccggtcgggagt
ccaggacttcaggggtttccaggcatcacccccccttccaacatctctgggttaccaggt
gacgcaggtgcgcctgggatatttggcctggaaggttaccgaggtcccccggggccacct
gggcccgtggctcttcctggaatcaaaggagatgaggggagtcccggagcccccggaaac
ccagggaccaaaggatgggtcggggactccgggccccagggccggcctggcgtgttcggg
ctcctgggagaaaaagggcccagaggggagcaaggattcatgggcaacgcaggcgccgcc
gggagtgtgggagaccgaggccccaagggacccaaaggagaccgagggctcccgggtgcc
ccgggcgctgtgggcgccccggggattgcaggaataccccagaggattgccatccagcgg
gggccagtgggtccgcagggaaggagaggccccccggggcctcagggggagatgggaccc
cagggccgcccgggagaaccaggtaggagccctgctgcggggccaccgctcgtttcctgt
ttgctctgctttcgcttttgttctcctgctggaaagcaccggctgagcgccagtgctgga
gcatcgcatccttcccggggcactcacgcgtgtctcggtggctgtgcaggtttccgcggg
actccagggaaggcggggccccagggaagaggcggcgtgtcagctgttcccgggttccga
ggagaccaggggcccgtggggctgcaggggcccgttggctatgaaggggagccaggccgc
cccgggagccccggcctgcctggcatgcccggccgcagcgccagcatcgggcacctcctg
gtgaagcacagccagacggagcaggagcccatgtgccccgtgggcatgaacaagctctgg
agcggatacagcctgctgtacttcgagggccaggagaaggcccacaaccaggacctgggg
ctggcgggctcctgcctggcccggttcagcaccatgcccttcctgtactgcaaccctggc
gatatctgctactacgccagccggaacgacaagtcctactggctctccaccaccgccccg
ctgcccatgatgcctgtggccgaggaggacatcaggccctacatcagccgctgctccgtg
tgcgaggccccggccgttgccatcgccgtccacagccaggacgtctccatcccccactgc
ccgcctggatggcggagcttgtggatcggatattccttcctcatgcacacggccgcgggc
gacgaaggcggcggccagtcgctggtgtcgccgggcagctgcctggaggacttccgagcc
acgccattcattgagtgcaacggcgcccgaggcacctgccactactacgccaacaagtac
agcttctggctcaccaccatccccgagcagagcttccagggcacgccctcggccgacacg
ctcaaggcgggcctcattcgcacccacatcagccgctgccaggtgtgcatgaagaacctg
tga

KEGG   Physeter catodon (sperm whale): 102978798
Entry
102978798         CDS       T06011                                 

Gene name
COL4A1
Definition
(RefSeq) collagen alpha-1(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102978798 (COL4A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102978798 (COL4A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102978798 (COL4A1)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102978798 (COL4A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    102978798 (COL4A1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102978798 (COL4A1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102978798 (COL4A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102978798 (COL4A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102978798 (COL4A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102978798 (COL4A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102978798 (COL4A1)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102978798 (COL4A1)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102978798 (COL4A1)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102978798 (COL4A1)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102978798
NCBI-ProteinID: XP_023978941
UniProt: A0A2Y9SJX4
LinkDB
Position
13
AA seq 1668 aa
MGPRLGVWLLLPAALLLHEESSRAAAKGVCAGSGCGKCDCHGVKGQKGERGLPGLQGVIG
FPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPPG
IPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHIPGTLLKGERGFSGPP
GAPGLPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQGPKGDKGDQGVSGPPG
VPGQAQVREKGEYAAKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGEKGEKGSL
GFPGDSGYPGLPGREGLKGDKGEAGPPGPPGIVIGTGPLGEKGERGYPGAPGLKGEPGPK
GFPGIQGLPGPPGFPIPGLIGAPGFPGERGEKGDQGLPGVSLPGPGGRDGLPGPLGPPGP
PGQPGHTNGIVECQPGPPGDQGPPGSAGQPGLTGEVGEKGQKGESCLICDSTGLRGPPGP
QGPPGEIGFPGQPGAKGDRGLPGRDGLEGLPGPQGVPGLMGQPGAKGEPGEIYFETRLKG
DKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDVG
PPGPPGFGPIGPIGDKGQMGFPGNPGSPGLPGPKGEEGKVTPLPGPPGVSGLPGSPGFQG
PQGDRGFPGTPGRPGLSGEKGSVGQPGIGFPGPPGPKGVDGLPGDTGPPGNPGRQGFNGL
PGNPGLPGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGNIGGPGIPGEHGAIGPAGLQGIR
GDPGPPGLQGPKGASGVPGIGPPGALGPPGGQGPPGSSGPPGVKGEKGFPGFPGLDMPGP
KGDKGPQGLPGLTGQSGLPGIPGQQGTPGQPGFPGPKGEMGIMGTPGQPGSPGPAGVPGL
PGEKGDHGFPGTSGPRGDPGFKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGP
SGDKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGI
PGEKGVPGLPGLQGIPGSPGEKGAKGEKGQEGLPGIGIPGRPGEKGDQGVAGFPGSPGEK
GEKGSSGIPGMPGSPGPKGSPGSAGYPGSPGLPGEKGDKGLPGLDGIPGIKGEAGLPGKP
GTTGPAGQKGEPGSDGFPGSAGEKGEPGLPGRGFPGFPGAKGEKGSKGDVGFPGLAGSPG
IPGSKGEQGFMGPPGPQGQPGLPGTPGHAVEGRKGDRGPQGQPGLPGLPGPMGPPGLPGL
DGLKGDKGNPGWPGTPGAPGPKGDPGFQGMPGVGGSPGATGAKGDMGPPGVPGFQGQKGL
PGLQGVKGDQGDQGFPGTKGLPGPPGPPGPYDIIKGEPGLPGPEGPAGLKGLQGPPGPKG
QQGVTGSVGLPGPPGIPGFDGAPGQKGEAGPFGPPGPRGFPGPPGPDGLPGSMGPPGTPS
VDHGFLVTRHSQTTDEPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTMP
FLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGEGIRPFISRCTVCEAPAMVMAVH
SQTIQIPQCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGT
CNYYANAYSFWLATIDRSQMFKKPTPSTLKAGELRTHVSRCQVCMRRT
NT seq 5007 nt   +upstreamnt  +downstreamnt
atggggccccggctcggcgtctggctgctgctgcccgccgccctcctgctccacgaggag
agcagccgggccgccgcgaagggtgtatgtgctggctctggctgcgggaaatgcgactgc
catggcgtaaagggacaaaagggagaaagaggtctcccagggttgcaaggtgtcatcggc
ttcccgggaatgcaaggacctgaggggccgcagggacccccgggacagaagggggacacc
ggagaaccaggactgccaggaactaaagggacgaggggacccccaggagcatctggttac
cctggaaacccaggacttcctggtattcctggccaagacggtcctccgggtcccccaggt
atcccaggatgcaacgggacgaagggtgagagagggcctctggggcctccgggtttgcct
ggattcgctggaaatcccggaccgccagggttaccgggaatgaagggggatccaggtgaa
atacttggccatataccagggaccctgttgaaaggtgaaagaggattttctggacccccc
ggagcacctggtttgccaggactgccagggctgcaaggtcctgttggccccccgggattc
actggaccaccaggtcccccaggccctcctggccctccaggtgaaaaggggcaaatgggc
ttgagttttcaagggccaaaaggtgacaagggtgatcaaggggtcagcgggccccccgga
gtaccaggacaagctcaagttcgagagaaaggagagtatgctgcaaaaggagagaagggc
caaaaaggtgaacctggatttcaggggatgccaggggttggagagaaaggtgaacccgga
aaaccaggaccccgaggaaaaccaggaaaagacggtgaaaaaggagaaaaagggagtcta
gggtttccgggggattcgggatacccaggactcccaggccgagagggtttaaagggagac
aaaggtgaagcaggccctcctggccctcctggaattgttatcggcacagggcccttggga
gagaagggagagcgggggtacccaggggctccagggttgaaaggggagccgggccccaaa
ggtttcccaggaatacaaggcctgccaggccctccaggcttcccgataccagggctgatt
ggtgcccccggcttccccggtgaaagaggagagaaaggtgaccagggcttgccaggcgtg
tccttgcccggaccaggtggaagggatgggctaccaggcccccttgggccccccggcccc
cctgggcagccaggccacacaaatggaattgtggaatgccagcccgggccgccaggtgac
cagggtcctcccggaagtgcggggcagccggggttgacaggcgaagttggagaaaaaggc
caaaaaggagaaagttgcctcatctgtgactcaacaggacttcgtgggcccccagggcca
cagggaccccccggagaaataggtttcccaggacagccaggggccaagggcgacagaggt
ttacccggcagggatggtctcgaaggattgcctggaccacaaggtgtgccagggctgatg
ggccagccaggagccaagggcgagcctggcgagatttacttcgaaactcgactcaagggc
gacaaaggagacccaggtttcccaggccagcccgggatgccaggcagagcaggctctccc
ggaagagacggccatccgggtctgcccggccccaaaggctccccgggttcagtaggatta
aaaggagaacgtggccccccgggaggagttggattccccggcagccgcggtgacgtcggc
cctcctgggcctccagggtttggccctattggccccattggtgacaaaggacagatgggc
tttccgggaaaccccgggtccccaggcctgccaggtcccaagggtgaagaaggaaaggtc
acgcccttacccggcccccctggagtctcaggcctgccggggtcccccggcttccaaggg
cctcaaggtgaccgaggttttcctggaaccccgggaaggccgggcctctctggagagaag
ggttcagtcggccagcctgggattggctttccagggcctcccggccccaaaggtgttgac
ggtttacctggagacactggacctcctggaaatcccggtcgccaaggttttaatggctta
cccggcaacccaggtctgcctggccaaaagggagagcctggagttggtctgccgggactc
aaaggtctgccaggactccctggcatccccggcacccctggagagaagggaaacatcggg
ggaccaggcattcccggagagcacggcgccatcggccctgcaggccttcagggaatcaga
ggtgacccgggacctcctggattgcaaggtcccaaaggagcttctggagtccccggaata
ggccctcctggagctttgggaccccctggaggacagggacccccagggtcatcaggcccc
cctggagtgaaaggagagaagggcttccccggattcccaggcctggacatgccgggcccc
aaaggagacaaagggccgcaggggctccccggcctgacgggacaatcggggctgcctggt
atccctggacagcagggcacacctggacagcccgggttcccaggtcccaagggagagatg
ggcatcatggggacccccgggcagcccggctcgccaggaccggcgggtgtgccaggattg
ccgggtgaaaaaggggaccacggcttcccgggcacctcgggacccaggggagaccctggc
ttcaagggagataaaggagatgtgggtcttcctggcaagccgggctccatggataaagtg
gacatgggcagcatgaagggccagaagggtgaccaaggagaaaaaggacaaatcggccca
agtggtgataaaggatcccggggagatcctggaaccccaggagtgcctggaaaggacggt
caggcaggacaacctgggcagccaggacctaaaggtgatccgggcataagtgggacccca
ggtgctccgggacttcctggacccaaaggatcggttggtggaatgggcctgccaggaata
cctggagaaaaaggtgtgcctggcctccctggcctgcagggcatccctggctcacctgga
gaaaagggagcaaaaggagagaaagggcaggagggtctgcctggcattggaattccagga
cggcccggggaaaagggagaccaaggggtagcaggttttccaggaagccctggagagaag
ggagagaaaggaagcagtgggatcccagggatgcccgggtctccaggccccaaaggctca
ccagggagtgctggctatccaggaagccctgggttgcctggagaaaaaggtgacaaaggc
ctcccgggattggatggtatccctggcatcaaaggagaagcaggtcttcctgggaagcct
ggtaccacaggcccggccggccagaaaggggagcccggcagtgatggattcccggggtca
gcaggagagaagggtgaaccaggtctacccggaagaggattcccagggtttccaggggcc
aaaggagagaaaggttcaaagggcgacgtgggtttcccaggcttagctgggagcccagga
attcctggatccaaaggagaacaaggattcatgggtcctccggggccacagggacagccg
ggattgccaggcaccccgggccacgcagtggaggggcgcaaaggagaccggggcccacag
ggacagcctggcctgccagggcttccgggacccatggggcctccagggctccctgggctt
gatgggctgaaaggtgacaagggaaacccaggctggccgggcactcctggggctccaggg
cccaagggagacccaggattccagggcatgccgggggtcggtggctctccaggagctaca
ggtgctaagggtgatatgggacctccaggagttccagggtttcaagggcagaaaggcctc
cctggccttcagggagttaaaggtgaccaaggagaccaaggtttccctggaactaaaggt
cttcctggccctccgggccccccgggtccatacgacatcatcaaaggggagccagggctc
cctggtcctgagggccccgcaggtctgaaagggcttcagggacctccaggccccaaagga
caacaaggtgtgacaggatctgtgggcttacctgggccaccaggtattcccgggtttgat
ggggcccctggccagaaaggagaggcaggaccctttggacctcctggtccaagaggcttc
ccgggtccacccggccctgatgggttgccgggatccatgggtcccccaggcaccccatct
gttgatcacggcttcctcgtgaccaggcacagtcagacaacagacgagccccagtgtcct
cccgggaccaaaatcctctaccatgggtactctctgctctacgtacaaggcaatgagcgg
gcacacggccaggacttgggcacggcgggcagctgtctgcgcaagttcagcacgatgccc
ttcctcttctgtaacatcaacaacgtctgcaacttcgcctcccgaaatgattactcgtac
tggctgtccaccccggagcctatgcccatgtccatggctcccatcaccggggagggcatc
aggcccttcatcagcaggtgtactgtgtgtgaggccccggccatggtcatggctgtgcac
agccaaaccatccagatcccgcagtgccccagcggctggtcctcgctatggattggctac
tcctttgtgatgcacaccagcgctggagccgaaggttctggccaagccctggcctctcct
gggtcctgtctggaagagttcagaagcgcccccttcatcgagtgccacggccgcgggact
tgcaattactatgcaaacgcttacagcttttggcttgccacgatagacagaagccagatg
ttcaagaagcccacgccgtccaccctgaaggctggagagctgcgcacccacgtcagtcgc
tgtcaagtgtgcatgcggagaacataa

KEGG   Physeter catodon (sperm whale): 102978896
Entry
102978896         CDS       T06011                                 

Gene name
COL6A5
Definition
(RefSeq) collagen alpha-5(VI) chain
  KO
K06238  collagen type VI alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102978896 (COL6A5)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102978896 (COL6A5)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102978896 (COL6A5)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102978896 (COL6A5)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102978896 (COL6A5)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102978896 (COL6A5)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102978896 (COL6A5)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102978896 (COL6A5)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102978896 (COL6A5)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   102978896 (COL6A5)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 VWA_CoxE Copine Leptin vWA-TerF-like
Other DBs
NCBI-GeneID: 102978896
NCBI-ProteinID: XP_007120272
UniProt: A0A2Y9FDK6
LinkDB
Position
1
AA seq 2521 aa
MKILLIVFVLILWTEALTYQSPGPGPEYADVVFLVDSSNHLGIKSFPFVKMFINKMISSL
PIEAHKYRVGLAQYSDRLHREFQLSTFKSRGPMLNHLKKNFRFLGGSLRIGNALQEVHTT
YFSNGRDRRQFPPILVVLASAESEDAMEAPSKALRSDGVRIISVGMQGTPEETLKAMATG
QFHYSLQTVQDLSVFSQNMTQILKDAAQYKEVAAHSHTVVPFPRACQKDSLADLMFLVDE
SVGTKQNLRNLQNFLKNITSSLDVKDSCLRLGLMSYSDRVQTFSLLSSSTTQSEFQKQIQ
KLSLQAGKSNAGAAIEKMRREGFLASSGSRRAQGVPQIAVLVTDRPSDDKVREAALNLRL
EDVTMFAMGIQGANKTQLEEIVSYPPGQTISMMKSYADLELYSKNFLKKIQNEIWSQIST
HAEQMELDKTGCVDTKEADLYFLIDGSSSIPNKHFEQIKEFMLAVIEMFSIGPDKVRVGA
VQYSHLQEVEFDINDYPNDVTLRKAVSNIKQIKGRTLTGAALDFILPLIRKGRKQRMNEV
PCYLIVLIDGMSMDGVLEPAERLRAEHITIHVVGIGEANRTQLQQIAGAEERVNFGKNFD
SLKSIKNEVVRSICTEKGCEYMKVDIMFLVDGSGSIGSVNFGKMKTFMKNLLAKVQIGPD
KTRIGVIQFSGYNREEFQLDKYYTQKEISDAINRMSPIDKNTLTGSALTFVDPYFTEPKG
GRSMVKKFLILITDGEAQDDVINPAKALRDKGVVIFSVGVYGANRTQLEEISGDSSLVFH
VENFDDLKAIESKLIFRVCALHDCKRIRQLDIVFVLDHSSSIRSQDQESMINVTIHLVKK
SDVGPDRVQFGVLTYSDNPEVLFYLNTYSNRSAIINHLRRLRGTQGNTFTAKALEHANKL
FTEEHGSRLKQNVKQMLIIITDGESHDRNQLSEKASKLRAKGIIIYAVGVGNAEQEELET
MAGNKNYTIHVSNFNKLKDIYLPLQDSMCTNAQEVCNIQEADVIFFCDGSDMVSDSDFVT
MTTFLSDLIDNFDIQSQRMKIGVAQFGSRYQEIIELQNSLTKTEWKSRIQNMTKSNGLPR
MDLALKQVRVMFEQSAGGRRKAGVPQTLVVITSGGPHYDVAEAVKTLREDGICILALGLG
DVYKEELLLITGNSEKIITFKDFDKLKNVDVKKRMVREICQSCGKTNCFLDIVVSFDIST
HLLGQPLFHGHPELESHLPGILEDITSLRGVSCGAGTEVQVSVAFKVNSDQNFPAKFQIY
QQTIFDSLLQVTVNGSTHLNAQFVQSMWDTFKDTSASRGQVLLIFSDGLGGESKIMLEDK
SDRLREAGLDALLVVSLNSTSHDEFSSFEFGKGFDYRSHLTIGMRELGNMLSQYLGNIAE
RTCCCTFCKCSQTLGPQGIRGPQALKGFPGLKGSRGHRGEEGDPGRPGDTGPQGNRGIAG
CPGERGLKGSKGFAGYKGEHGDNGIDGLDGEEGFHGLPGKKGEKGDPGSQGSPGSRGPPG
GYGEKGFPGDPGNPGQSSNIKGQKGSKGKQGRQGRTGPKGTQGSPSSRGSSEREGQRGVQ
GASGEPGNPGLKGASGAEGLQGSQGSNGLPGRKGEKGNEGHKGPQGAPGPVGDKGSVGRP
GLSGKKGEPGIPGDPGPLGQAGRQGKQGDYGIPGYGPMGRKGVKGRRGFLGDMGQKGVVG
DPGIPGGPGPKGFRGLTLTVGLKGEEGSQGPQGPPGRRGPKGMAGQRVYSQCDLIQFMKD
HSPCWKEKCPVYPTELVFALDQSYDITEQRFNETRDIIISIVNDLHIRENNCPVGARVVV
VSFDSGTSYIIRGSDYHSKKQLLHLLSKIKYQNSREVRDIGNTMMFVARNIFKRLLAGAN
VQRVAVFFSNGQSASRSSIITATMEFSALDISPAVFAFNERVSLEEAFEFDNTGTFQVLP
VPPNGEYEPLERLRKCILCYDKCFPNTCKKEIFLPENSYMDVTFLLDNSRTITSDEFKDV
KALVSSMLDNFDIASDPLTSDSGDRIALLSYSPWDGRKEDVVKTEFEFTTYDNQALMKRH
IQTYLQQLNGESTIGHALLWTVEDLFQGTPRLRKHRVIFVVSAGENHERKEFLKKMALRA
KCQGYVIFVISLGSTHKEDMEELASHPLDHHLIQLGRIHKPNLDYVVKFVKPFVYSVRRG
FNQYPPMMLENTCRLINSKEEDNRRINLLFTPEPHEIPSGENSFIDQELSVVRGSSFVLE
DSGSDHLPYIPSQMFMPQKLTTKYEKDQDSEEIASLTSGQESHGRKEEPCLTYELRDASL
QEYYMDVAFLIDASQRIGNDEFKEVKAFITSVLDYFHLAPDPLTSTLGDRVAVLTYSPPG
YTPNTEESPIYLEFDLITYNNIHQMKHHLQDSLQQLNGDVFIGHALQWTIDNVFVGTPNL
RKNKVIFVLSAGETNPLDRGVLRNVSLRAKCQGYSIFVFSFGPLHNDKELEELASHPLDH
HLVQLGRTHKPDLNYIIKFVKPFVHSIRRAINKYPPADMRLICVNITSPNPENGGTENIV
L
NT seq 7566 nt   +upstreamnt  +downstreamnt
atgaagatcctgctaattgtatttgttctcatcctttggactgaagcactgacatatcag
agcccagggccaggccctgagtatgccgatgtggtgtttctggtggatagctccaaccac
ctcggaattaaatccttcccatttgtgaagatgttcatcaacaagatgatcagcagcctc
cccatagaggcccacaagtaccgtgtgggcctggcccagtacagcgacaggctccacaga
gaattccagctcagcaccttcaagagcagaggccccatgttgaaccacctcaagaagaac
ttcaggttcctgggcggttccctgcgaattgggaatgctctccaggaggtgcacacgacc
tacttctcaaatgggagggacaggagacagtttcctcctatcctggtggtcttggcttca
gccgagtctgaggatgccatggaagcgccctccaaggcgctgcgcagcgatggcgtgagg
atcatctcagtggggatgcaggggacgcctgaggagactctgaaggccatggccacagga
cagttccactacagcctccagacggtccaagacctcagcgtgttctcccaaaacatgacg
cagattctcaaggatgcagctcagtacaaggaggtagcagcccactctcatacagtagtc
cccttccccagggcctgtcagaaagattcacttgccgacctcatgttcctggtggatgag
tctgttgggaccaaacagaatctaaggaacctgcagaacttcctgaagaatatcacctcc
tccttggatgtgaaagacagctgcctgcggcttggtcttatgagttacagtgacagagtt
cagactttttctcttctgagctcaagcacaacccaatctgaatttcagaaacaaatccag
aagctttctctgcaggctgggaaatccaatgctggtgctgccattgagaagatgaggaga
gaaggcttcttagcatcaagtggcagcagaagggcacagggagtgccccagattgcagtc
ctggtcactgacagaccatcagatgacaaggtgcgtgaggctgcactgaatctccggctg
gaggatgtaactatgtttgccatgggcatccaaggggccaacaagacccagttagaagaa
atagtgtcttaccctccaggacagaccatctccatgatgaagtcttatgcagacttggaa
ctttacagtaagaacttcttgaaaaagatccagaatgaaatatggtcccaaatttcaact
catgctgaacaaatggaacttgataaaactggctgtgtagatacaaaagaggctgatctc
tatttcctcattgatggctcaagcagcattccgaataaacactttgagcaaatcaaggaa
tttatgttggcggtgatagagatgtttagcatcggcccagacaaagtccgagttggagct
gtgcagtattcccacttgcaggaagtggaatttgatatcaatgactatcctaatgatgtg
acattaagaaaggctgtttctaacatcaagcaaattaaaggcagaactcttactggggca
gctctggatttcattttgccactaataaggaagggaaggaagcagaggatgaacgaggta
ccctgttacctcattgtgttgattgatgggatgtccatggatggtgtcctggaacctgct
gagagactaagggctgaacacatcaccatccacgtggtcggcattggtgaagctaacaga
acacaactgcaacaaattgctggggcagaggaaagggttaactttgggaagaactttgat
tctttaaaaagcatcaaaaatgaagttgttcgcagcatctgcactgagaaaggatgtgaa
tacatgaaggtcgatatcatgtttctggtggatggttctggcagtataggatctgtaaat
tttgggaaaatgaaaaccttcatgaaaaacctgttagctaaggttcagattggcccagac
aaaactcgaattggtgtcattcaattcagtggttataatagggaagaattccagcttgat
aaatactatacacaaaaagaaatttctgatgcaataaacagaatgtctcccatcgacaaa
aacactttgactggaagtgcactaacctttgtggatccatacttcactgagcccaagggg
ggccgttcaatggtcaaaaagtttctcatcctcatcactgatggagaagcccaagatgat
gtgataaaccctgcaaaagctcttcgggacaaaggtgtggtcatcttctctgtgggggtg
tacggggccaatagaactcagctggaggagatcagtggggacagcagcctggtcttccat
gttgagaacttcgatgatctaaaggcaatagagagtaaactcatcttccgtgtctgcgct
cttcatgattgtaaaaggattagacagttagacattgtgtttgtgctggaccattcaagc
agcatccggtcccaagaccaagaaagcatgatcaacgtcactatccatttggtgaagaaa
tcggatgttggtccggacagagttcagtttggagtgctcacatactcagataatcctgag
gttctcttctacctcaatacatactcaaacagatcagccatcatcaaccatctgaggagg
ctcaggggcactcaagggaataccttcactgccaaggctctggagcatgcaaacaagctg
ttcacggaggaacatggcagccgtctcaaacaaaacgtgaagcagatgctgatcatcatc
acagatggggaatcccatgaccgaaatcagctcagtgaaaaggcttcaaaattaagagcc
aaaggcatcatcatctatgcagtgggtgtaggaaacgctgaacaagaagaactggagact
atggcgggaaataaaaactatactatccacgtcagtaatttcaacaaactgaaagatatt
tacctgcctctgcaagacagtatgtgtaccaatgcacaagaggtctgtaacattcaagaa
gccgatgtgattttcttttgtgatggctctgacatggtatctgactcagactttgttacc
atgacaactttcttgtcagacttaattgataattttgacattcagtctcaaaggatgaaa
attggggtagctcaatttgggagccgctaccaagaaattattgagttgcaaaactctctg
actaaaaccgagtggaagtctcgaattcaaaatatgacgaagagcaacgggcttccgcga
atggacttggccctgaaacaagtgagagttatgtttgagcaatccgctggtgggagaagg
aaggctggtgtgcctcagactttggtggttatcacatctgggggtccccactatgatgtg
gcagaggcagtgaagaccctgagagaagatggcatttgcattctggctttgggcctagga
gatgtttataaggaagagctcttgttgataacgggcaattctgaaaaaatcatcactttt
aaagactttgataaactaaagaatgtggatgtgaaaaaaagaatggtccgtgaaatctgc
cagagttgcgggaaaaccaactgctttctggacatagtggtcagcttcgacatttccact
cacctgctggggcagccattgttccatggccatcccgagctggaatcacacctcccaggc
atcttagaggacattacctccctcaggggtgtgagctgcggggcaggaacagaggtgcag
gtgagcgtggcctttaaggtgaacagtgaccagaatttccccgccaagttccaaatctac
cagcaaacaatatttgacagcctactgcaagtcaccgtcaatgggtcaactcacctgaat
gcacagttcgtgcagtccatgtgggacacgtttaaggatacgtctgcatcccgaggacag
gtattactcatcttttcagatggtcttgggggtgaaagcaaaataatgctggaagataaa
tcggacaggctcagagaagcaggactcgatgctttgctggtggtttccctaaactcaacc
tctcatgatgagttttccagctttgaatttggaaaaggatttgattacaggagtcacctg
accattggaatgagagaattgggcaatatgttatcacagtatctgggaaacattgcagag
aggacttgctgctgtacgttttgcaagtgttcacagactctaggacctcaagggatccga
ggaccacaagccttaaagggttttccaggtctgaaaggcagcagaggacacaggggagag
gagggagaccctggaagaccaggagacactggaccccaaggaaatagagggattgcagga
tgtccaggggagcggggtctaaagggatccaaaggatttgctggatataagggagaacat
ggagacaatgggattgatggactcgatggggaagagggctttcatggacttcctgggaaa
aagggagaaaagggtgatccaggatcccagggcagcccaggttccagaggccctcctggg
ggatatggggagaagggcttcccaggggatcctggtaatccaggacaaagcagtaacatc
aaaggacaaaagggctccaaaggaaaacaaggaagacaaggtagaaccggcccgaaaggg
acacaaggcagtcctagttccagagggagcagtgaaagagaaggccagagaggggtccag
ggtgcctcgggggaaccagggaaccctggacttaaaggtgcatcaggagccgaaggatta
caaggctcacaggggtcaaatggacttcccggcaggaaaggagagaagggaaacgagggg
cacaaaggacctcagggtgctcctgggccagtgggagataaagggagtgttggaaggcct
ggactttcggggaaaaagggagaacctggaattcctggagatccagggccactggggcaa
gctggacggcaaggaaagcaaggtgattatggcattccgggctatggtcctatgggacga
aaaggagtaaagggccgaagaggattccttggagatatgggtcaaaagggtgttgttggt
gatcctggaattcctggggggcctggacccaaaggatttaggggactaacactcactgta
ggtttgaaaggtgaagaaggctctcaaggaccccaaggccctcctggacggagaggtcct
aaaggcatggcagggcagcgggtatattctcaatgtgatctaatccagttcatgaaggac
catagtccttgttggaaagaaaagtgtccagtgtatccaacagagctagtgtttgccctg
gaccagtcctatgatatcacagagcagagatttaatgaaacaagggacatcatcatttct
attgtcaatgaccttcacatcagggaaaataattgccctgtgggagcaagagttgtcgtg
gtttccttcgactcaggcaccagctacatcatccgtgggtctgactaccacagcaagaag
cagctccttcatcttctttccaaaattaaatatcaaaactccagggaagtccgagacatt
ggtaacacgatgatgtttgtggctcggaacatcttcaagcggttgttagcaggagccaat
gtgcagagagtcgctgtgttttttagcaatggacaatctgccagtaggtcatccatcatc
acagccaccatggagtttagcgccctggatatcagccctgcagtctttgcttttaatgaa
agggtttcccttgaggaggcttttgagtttgacaacacgggaacatttcaggtgcttcct
gttcctccaaatggggaatatgaaccattagaaagacttcggaaatgtattctttgctat
gataaatgttttccaaatacttgcaagaaagagattttcttacctgaaaattcatacatg
gatgtcaccttcctcttagacaattctaggactataacaagtgatgagtttaaggatgtc
aaagccttggtgagctcaatgcttgacaactttgatattgcctcagaccctctaacctca
gactctggtgataggattgccttgttgagctactctccttgggatgggaggaaggaggat
gtggtaaaaacagagtttgagtttacaacttatgacaatcaagctctaatgaagaggcac
atccagacttacctccaacagctaaatggagaatccaccattggccatgccctactgtgg
actgtggaggatctcttccaaggaacacccagactgagaaaacatagggtcatctttgta
gtctctgctggagaaaatcatgagagaaaggaattcttaaagaagatggctctgagggcc
aagtgtcaaggctatgtcatatttgtgatttctctgggctctacacataaggaggacatg
gaggagctagccagccacccacttgatcaccatctgatacagcttgggagaattcataaa
ccaaacctggattacgttgtgaagtttgtaaagccgtttgtgtactctgttagacgagga
ttcaatcagtacccaccaatgatgcttgagaacacctgcagactcatcaattcaaaggaa
gaggataatcgaagaatcaatctcctatttacccctgagccacatgagattccttcagga
gagaacagcttcattgaccaggaattgagtgtggtcagaggttcatcttttgtgttggaa
gacagtggaagtgaccatctgccttacattccaagccaaatgtttatgccacagaaatta
acaactaaatatgaaaaagatcaggattctgaagaaattgcaagtctcacttctggacaa
gaaagccatggcagaaaagaagaaccatgtcttacttatgaacttagagatgcctctctt
caagaatattacatggatgtggcttttctcatagatgcttcccaaagaataggaaatgat
gagtttaaggaagtgaaagcttttataacctcagtgcttgattactttcacttggcccca
gatccactgacctccaccttgggagacagagtggcagtcctgacctactctcctccgggc
tatacgcccaacacagaagaaagccccatctacctagaatttgatttgattacttacaac
aatatacaccaaatgaaacatcatctccaagactcccttcaacagctcaatggagatgtt
tttattggccatgccctacagtggacaattgacaatgtctttgtaggaacccccaacctg
aggaaaaacaaagttatctttgttttatctgctggtgaaaccaaccctttagacagggga
gtcttaagaaatgtatctctgagagccaagtgtcagggctactccatatttgtgttttcc
tttggtcctctacacaatgacaaggaattagaagaattagccagccacccactggatcat
cacttagtccagcttggccgaacccacaagccagatttgaactatatcatcaaatttgtc
aagccatttgttcattcaatcagacgtgccatcaacaaatatccccctgcagatatgaga
ctcatatgtgttaacatcacatctcccaacccagagaatggtggcacagaaaacattgta
ttgtga

KEGG   Physeter catodon (sperm whale): 102979304
Entry
102979304         CDS       T06011                                 

Definition
(RefSeq) LOW QUALITY PROTEIN: collagen alpha-4(VI) chain-like
  KO
K06238  collagen type VI alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102979304
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102979304
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102979304
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102979304
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102979304
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102979304
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102979304
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102979304
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102979304
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   102979304
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 VWA_CoxE Kunitz_BPTI
Other DBs
NCBI-GeneID: 102979304
NCBI-ProteinID: XP_023970945
UniProt: A0A2Y9S448
LinkDB
Position
1
AA seq 2121 aa
METWKTFWGIIFLAAGFGFKSQRIVCRDDSVGDVVFLVDTSNNPQNTHSARSFLYTVVNG
VSVGGEAVCVGLAWYSDQPQSEFLLSTYHRKGEVLRRIQRFPFTPGGHKMGLALQFLLDH
HFQETAGSRASQGVPQVAVVISSSPAEDHVQEPADDFRRAGILLYAVGVGDAVSADLKEI
SSSPVEKFVAFVPNFSALGSFALKLRQELCDTLAKAAPPVGHVSPACRETVLADIVFLVD
SSTSIGXQNFQKVKNFLHSVALGLDISSDQVRVGLAQYNDNIYPAFWLNQHRLKSVVLEH
IWNLPYHTGGTNTGSALEFIRTNYLTEAAGSRAKDGVPQVVILVTDGESNDEVQEAADRL
KQDGVVVYVVGVNVQNVQELQTIASEPLEKFLFNAENFNILQDFSGSILQTLCLAVEGKI
KDSAQHYADVVFLADTSQNTSWTSFQWMQTFISRVDGMLDVGRDKYQIGMAQYGGQGHTE
FLFNTYQTQDEMMTHIHEHFVLWGGSSRTGKALQYLYQTFFQEAEGSRFLQGIPQYAVVI
TSGKSEDEVHEAAETLREKGVKVMSVGVQDSDKRELQGMGTPSLVYEMQGQDRVRQVMHH
VSGVIQGTGQLETKNEAKMEPIEACLTAIPADLVFLIEEFSRAQQSNFQHVVNFLKTTVS
SLSIHPDVVRIGLVFYSEEPXLEFSLDTFQNSAKILEHLDKLTCRRRGERTKTGAALDFL
RNEVFIQEKGSQFQQGVQQVGVVIVEDFSQDNVSXPASLLRRMGVTVYAVGTQLPLESMD
LEKIASYPPCKHVIPLESFLQLAVVGSKIKNQLCPEITGKRASVSGMSSALQEGCVHIEK
AELYFLIDGSDSIHQDDFLEMKVFMNEVIKMFHIGPDRVQFGVFQYSDEISSQFTLSQHT
SVAGLKVAIDGIQQNGGGTMTGQALGSLRQVFADTALSNVPWYLIIITDGKSMDPVAEVV
EALRGDGVTIYAIGIRDANTIELQEIAEDRMFFVNDSDSLKAIQQEVLQDICSSETCKNR
KADIIFLTDGSESISLKDFEKIKGFMKRMVNESNIGADEIQIAFXTLLQFSSNPQEEFRL
NRRYSSKVDIHGAISDVKQINDDTYTGKALNFTLPFFGSSRGGRPSVHQNLIVITDGVAR
DNVAIPARALRNRNIIIFAIGVGEAKHSQLLEITNDQSKVYYEEKFEFLQNLEKKMLYQV
CIPQGECNVDFSVAIDLSTPTRQVQQRLQGLLPELMQELAMLSNISCGVPGQTNVMLRYL
VPGSKGQLIFDSGFEKYSDETIQKFLIHQAASSNHMDVDFLQSLGHSAIHLSSAKVKVLL
VFTDGLDDDLERLKEISKLLRSKGFSRLLTVGLEGVHKLXELEFGGGFAYTQPLSITQRS
LPSILLKQLDTIVERSCCNMYAKCFGEDGHRGDGGSPGRKGERGPQGLPGPRGEEGCWGM
RGPKGVSGFSREKGNPGEEGPDGLDGEQGYHGVPGSSGEKGNWGNRGEPGFPGYPGAQGE
DGDLGHQGEKGAKEIRGKRGNAGLPGFVGTPGDPGPVGRLGIKGPKGVVDMMPCXIINFA
RENSPCSGVSKCPCFPTEVVFALDVSNDVSQLDFERMGGVLLSLLMKMEISGSNCPTGAR
VAIVSYDSKTDYVVRFSDHKERPALLQEVRGLSLAGSSSSRNLGDSMRFVVRHVFKCVRA
GRLLRKVAVFFQAGWTQDAGSISTATLELSALDITSVVITFMEDHKLPDALLMDGTNKFH
LYVWETENQQDMAHCTLCYDQCSPAPECGLGVPRPLAVGMDVAFVVDSSVGVGTDLYRTA
LTLVDTTLDDLEVAAQASASPHGARMALVTHTTPYFWPGAGWPPVREGFHLTSYAXRTQM
QRHVREALDHPLRGAPALGHALEWTLEKVLLANPLPRKVQVLFTIVASETSSWDREKLRT
PSLDAKCKRITLFVLALGPGMGTHVASAPSEQHMLHLEGLLDAEVAYARGFTRAFLNLLK
SGTNQYPPPELIEECGDPSRGDTFLQPILSVKRLPKHQFGKSGLADDLEALKATGSFLEE
NRKAMMTSFTQQEALENYEKSGYNAEENEQEKPTKPKGMGKERNLGTAFGPCSLDPMEGD
YVLKWSYNEKEQACRQFWYGG
NT seq 6366 nt   +upstreamnt  +downstreamnt
atggagacttggaagacattttgggggatcatcttccttgcagctggttttggcttcaag
tcacagagaattgtgtgcagggacgactctgtgggagatgtcgtgtttctggtggacacc
agcaacaacccacagaacactcacagcgcacggagcttcttgtacactgtggtcaacggc
gtcagtgtcggcggagaggcggtctgcgtggggctggcctggtacagtgatcagccccaa
tccgagttcctcctttccacctaccatcggaaaggcgaagtgctgagacgcatccagagg
tttccatttacgccggggggccacaagatggggctggccctgcagttcctcctagatcac
cacttccaggaaacagcggggagcagggcaagccagggcgtacctcaggtcgccgtggtg
atcagcagcagcccagcagaggaccacgtgcaagagcccgcggatgactttaggagggcg
ggcatcctcctctatgccgtgggcgtcggagatgcggtttcagcagacctcaaggagatt
tcgagcagtcctgtggagaagttcgtcgcctttgttcccaacttctctgctttgggcagt
ttcgccctgaagctacgccaggagctctgtgatacgttggcaaaggcagctccacctgtt
ggccacgtctctccagcttgcagagaaacagtcctggctgacattgtcttcctagtagac
agctcaaccagcataggancccagaacttccagaaagtgaagaacttccttcactctgtc
gccttggggctggacatcagcagtgaccaggtccgagtgggactcgcacagtataatgac
aacatctacccagccttttggctgaaccagcaccgtctgaagagtgtggtcctggagcat
atctggaacctgccctaccatacaggaggcacaaacacagggagtgccctggagttcatc
aggaccaactacttgactgaggcagctggcagccgggctaaggatggggttccccaggta
gttatcctggtgacagatggggagtcaaatgatgaagtccaggaggcagctgacaggttg
aaacaagatggggttgtggtgtacgtggtaggggtcaacgtccagaatgtccaggagttg
caaacaatagccagtgagccattggagaagtttctcttcaacgctgaaaacttcaacatc
ctgcaggatttctcaggaagcattctccagactctgtgcttggccgtggagggcaagatc
aaagactctgcccagcattatgcagatgtggtctttctcgctgacacctcacagaacaca
tcatggaccagtttccagtggatgcagactttcatctccagagtggatggcatgctggat
gttggcagagacaagtaccagattgggatggctcaatatggtggtcaaggtcacactgaa
tttttgttcaacacttaccaaacccaggatgagatgatgactcacatccatgaacacttt
gtactttggggaggctccagtaggacaggcaaagctctacaatacctttatcagaccttc
ttccaggaggcagagggaagccggtttctccagggcattccccagtatgcagtggtcatt
acctcaggcaaatctgaggatgaggtccatgaggctgcagagacattgagggagaaaggc
gtgaaagtcatgtctgtgggtgtccaagactctgacaagagagaactacaggggatgggc
accccatcccttgtatatgagatgcaaggacaagacagggtcagacaggtgatgcaccat
gtgagtggggtgatccaaggaactgggcagcttgagaccaagaatgaagccaagatggag
cctatagaagcgtgtctaactgctatcccggctgacttggtgttcctcattgaggaattt
agcagggctcagcaatcgaatttccaacatgttgtcaatttcttaaagaccactgtcagc
tctctaagtattcatccagatgttgtgcgaattggcttggtcttttacagtgaggaacca
tgacttgagttttctctggatacatttcagaattcagccaagatcttggagcatctggac
aaattaacctgcaggagaagaggagaaaggaccaagactggtgctgctttggatttccta
aggaatgaggttttcattcaggagaagggcagtcagttccagcaaggtgtgcagcaggta
ggcgtggtcatcgtggaagacttctcccaggacaatgtgtcctgaccggcttccctcctc
cgcaggatgggggtgactgtctacgcagtgggtactcagctccccttggagagcatggac
ctggagaagatagcatcatatcctccttgcaagcatgtcatccccctggaatcttttttg
caactcgctgttgtgggaagcaagattaagaaccagctctgtcctgagattacgggcaaa
agggcttctgtgtctgggatgagctctgccctacaagaaggctgtgtgcacatcgagaag
gcagaactttacttccttattgatgggtctgacagcatccaccaagacgattttctcgaa
atgaaggtgttcatgaatgaggtgataaagatgttccacatcgggcctgacagagtacag
tttggagtctttcagtactcagacgaaatcagtagtcaatttaccctcagccagcatacc
agcgtggcggggctgaaggtggccattgatggcatccagcagaatggaggtggcaccatg
acaggtcaggccttgggtagcttgaggcaggtctttgcagacactgctctcagcaacgtg
ccttggtatctcatcatcatcactgatggcaagtctatggacccggtggccgaggttgtg
gaagcattgaggggagacggagtcaccatttatgctattggaatcagagatgctaatact
attgagcttcaggagatagctgaagacaggatgttttttgtgaatgattctgattccttg
aaggccatccaacaagaagtgttacaggacatctgctcctcggagacctgtaagaatagg
aaagctgacattatcttcctgacagatggttcagaatccatctccctgaaagactttgaa
aagataaagggattcatgaagcgaatggtcaacgaatctaatatcggtgctgatgaaatt
cagattgccttctgaactcttctgcagttcagctcaaacccccaggaagaattcaggctc
aaccgaagatactcctcaaaggtggacattcacggggccatctcggacgttaagcaaatc
aacgacgacacctacactgggaaagccctgaacttcactctgcctttttttggtagttca
agaggagggaggcccagtgtccatcagaatttgattgtgatcaccgatggtgttgcccgg
gacaatgtagccataccagccagggcccttaggaacagaaacataattatttttgccatt
ggggtaggagaagccaaacactctcagcttttggagatcactaatgaccagagcaaagtg
tactatgaagaaaaatttgagttcctgcaaaacctggagaagaaaatgctttaccaggtc
tgcattccacagggtgaatgcaacgtagatttttctgtagcaattgatctctccactccc
acgaggcaggttcagcagaggcttcaagggttactgccagagctcatgcaggagctggcc
atgctttccaacatcagctgtggtgtccctggtcagaccaatgtgatgctccgctacctg
gttcccggctcaaagggccagcttatctttgactcaggttttgaaaagtacagtgatgag
accattcagaagttcttgattcatcaggctgccagtagcaaccatatggatgtggatttt
ttgcagtccctgggacacagtgctatccatctctcttctgccaaagtgaaggtcctttta
gtgtttacagatggactggatgatgatttagagagactgaaggaaatatccaagcttctc
cgcagcaaaggattctccagactcctgactgttggcctggagggtgtccataaattanng
gagctagaatttggcggaggatttgcatacactcagcctctgagcatcacacagcgatct
cttccgagcatcttactgaagcaacttgacacaattgtagaaagaagttgctgcaatatg
tatgcaaagtgctttggagaagacggacacagaggtgatggtgggagcccggggaggaag
ggagaaagaggcccccaaggtcttcctggaccccgaggtgaggaaggatgctggggtatg
agaggacccaagggagtgagtggattttcaagagaaaagggcaatcctggtgaggaaggt
cctgatggcttggatggagaacagggttatcatggagtcccagggtcatctggagaaaaa
ggaaattggggaaataggggggaacccggatttcctggctaccctggtgcacaaggagaa
gatggtgacctgggccaccaaggagagaagggggcaaaggaaatcagagggaagaggggt
aatgctggacttcctggattcgttggaactccaggtgacccaggaccagtgggaagactt
ggcatcaagggccccaaaggtgtggtagatatgatgccttgttaaatcatcaattttgct
cgcgaaaattccccctgttcaggtgtttccaaatgcccatgtttcccgaccgaggtggtc
tttgccttggacgtgtcaaatgatgtctcccagttggatttcgagaggatgggaggcgtt
ttactatctctgttgatgaagatggaaatcagtgggagtaactgtcccacgggtgcccgg
gtggccatcgtctcctacgacagcaaaacagattacgtggttcgcttctccgaccataag
gagaggcccgcgctcctgcaggaggtcaggggactctccctggcagggtcctctagcagc
aggaaccttggggactccatgaggtttgtggtaagacatgtattcaagtgcgtgcgcgcg
ggccgcctcctgaggaaggtggctgtgttcttccaggcgggctggacccaggatgcaggt
tccatcagcacagccacgctggagctcagtgcgctggacatcacctctgtggtcatcacc
ttcatggaggaccacaagctcccagacgccctactgatggatggaaccaacaaatttcat
ctgtatgtctgggagacagagaaccagcaggacatggcccactgcactctctgctatgat
cagtgcagcccggccccggagtgcgggctgggcgtgcctaggccactggcggtgggcatg
gacgtggcctttgttgtggacagctccgtcggcgtgggcaccgacctctaccgcacagcc
ttgactctggtggacaccacgctcgacgacctggaggtggctgcgcaggcgagcgcgtcc
ccccacggggcccgcatggccctggtgacacacacgaccccctacttctggccgggtgcg
ggatggccacctgtgcgcgagggcttccacctgacctcgtacgctngccggacgcagatg
cagaggcacgtccgcgaggccttggaccaccccctgcggggagccccggccctgggccat
gccctggagtggacactggagaaggtgctcctggcgaacccgctgcccaggaaagtgcag
gtcctctttaccatcgtggccagtgagaccagcagctgggaccgggagaaactaaggact
ccgtccctggatgccaagtgcaagcgcatcactctgtttgtgctggccttgggcccaggc
atggggacccatgtggccagcgccccctccgaacagcacatgctgcaccttgaggggctc
ttggatgcggaggtggcctatgcccggggcttcacgcgggccttcctgaacctcctaaaa
agtggaacaaaccagtacccaccaccagagctcattgaagagtgtggagacccgagccga
ggggacacctttttgcaacccatcctgtctgtcaagaggttgcccaagcaccagtttggc
aaatctggccttgctgatgatttggaagcactcaaagcaacaggctcttttctagaggag
aatagaaaagccatgatgacatctttcactcagcaagaagcactcgaaaactatgaaaag
agtggctacaatgctgaagaaaatgaacaagaaaagcccacaaaaccaaaaggaatggga
aaagaaagaaatttaggcactgcctttggtccctgttccctggatccaatggaaggtgat
tatgtcctgaaatggtcctataatgaaaaggaacaggcttgccgacagttctggtatggt
ggctga

KEGG   Physeter catodon (sperm whale): 102979827
Entry
102979827         CDS       T06011                                 

Gene name
THBS2
Definition
(RefSeq) thrombospondin-2
  KO
K04659  thrombospondin 2/3/4/5
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04145  Phagosome
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05144  Malaria
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102979827 (THBS2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102979827 (THBS2)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    102979827 (THBS2)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102979827 (THBS2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102979827 (THBS2)
  09174 Infectious disease: parasitic
   05144 Malaria
    102979827 (THBS2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pcad04131]
    102979827 (THBS2)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102979827 (THBS2)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102979827 (THBS2)
   04990 Domain-containing proteins not elsewhere classified [BR:pcad04990]
    102979827 (THBS2)
Membrane trafficking [BR:pcad04131]
 Endocytosis
  Phagocytosis
   Opsonins
    102979827 (THBS2)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102979827 (THBS2)
  Exosomal proteins of colorectal cancer cells
   102979827 (THBS2)
  Exosomal proteins of bladder cancer cells
   102979827 (THBS2)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102979827 (THBS2)
Domain-containing proteins not elsewhere classified [BR:pcad04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   102979827 (THBS2)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC TSP1_ADAMTS EGF_3 EGF_CA EGF Laminin_G_3 cEGF TSP1_CCN Laminin_G_2
Other DBs
NCBI-GeneID: 102979827
NCBI-ProteinID: XP_007121308
UniProt: A0A2Y9FF16
LinkDB
Position
10
AA seq 1170 aa
MLWPLLLLALWPWSSAQAGDQDEDTAFDLFSISNISRKTIGAKRFRGPDPSVPAYRFVRF
DYIPPVSAERLERIAEAMRRKEGFFLTASLKQDRKSRGTLLALEGPGAAHRQFEIVSNGP
ADTLDLTYWVDGTQHVISLEDVGLADSQWKNITVQVTGETYSLYVGCDLMDSFTLDEPFY
EQLNTEKSRMYVAKGPSRESHFRGLLQNVYLVFENSVEDLLSKKGCQQSQGAEANAISEN
TETLHLSPPASMEHVGRSEYRGPEVCRHSCEELGSMIRELAGLHAIVNQLHENLRKVSND
NQFLWELIGGPPKTRNMSACWQDGRFFAENETWVVDSCTKCTCKKFKTVCHQISCPPATC
ADPSFVDGECCPSCFHDGEEGWSPWAEWTECSVTCGPGTQQRGRSCDVTSNTCLGPSIQT
RSCSLGRCDHRIRQDGGWSHWSPWSSCSVTCGVGNVTRIRLCNSPVPQMGGRSCKGSGRE
TRACQGPPCPVDGRWSPWSPWSACTVTCAGGIRERTRICNSPEPQHGGKDCVGDVTERQM
CNRRSCPADGCLSNPCFPGAECRSFPDGSWSCGSCPLGFLGNGTHCEDLDECAVVTDVCF
TTSKARRCVNTSPGFHCLPCPPRYKGSQPFGVGLEAARTEKQLCEPEDPCKDKSHNCHRH
AECIYLGHFSDPMYKCECQTGYAGDGLICGEDSDLDGWPNKNLVCATNATYHCIKDNCPL
LPNSGQEDFDKDGIGDACDDDDDNDGVSDEKDNCQLVFNPRQSDYDKDEVGDRCDNCPYV
HNLAQIDTDSNGEGDACSVDIDGDDVFNERDNCPYVYNTDQRDTDGDGVGDHCDNCPLVH
NPDQTDVDNDLVGDQCDNNEDIDEDGHQNNQDNCPYISNANQADHDHDGRGDACDSDDDN
DGVPDDRDNCRLVANPDQEDLDGDGRGDACKDDFDNDSIPDIDDVCPENHAISETDFRNF
QMVHLDPKGTTQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDRDD
DYAGFVFGYQSSSRFYVVMWKQVTQTYWEDQPTRAYGYSGVSLKVVNSTTGTGEHLRNAL
WHTGNTEGQVRTLWHDPKNIGWKDYTAYRWHLTHRPKTGYIRVLVHEGKQVMADSGPIYD
QTYAGGRLGLFVFSQEMVYFSDLKYECRDV
NT seq 3513 nt   +upstreamnt  +downstreamnt
atgctgtggccgctgctgctgcttgccttatggccctggtccagcgctcaggccggcgac
caggacgaggacaccgcctttgacctcttcagcatcagcaatatcagccgcaagaccatc
ggggccaagcggttccgggggccggaccccagcgtgcccgcctaccgcttcgtgcgcttt
gactacatccctccggtgagcgcggagcgcctggagcggatcgccgaggccatgcggcgg
aaggagggctttttcctgacggccagcctgaagcaggaccgcaagtccaggggcaccctg
ctggcactggagggcccaggcgccgcacacaggcagttcgagatcgtgtccaacggcccc
gccgacaccctggacctcacgtactgggtggacggcacccagcacgtcatctccctggag
gacgtgggcctggctgattcccagtggaagaacatcaccgtgcaggtgaccggagagacc
tacagcctctatgtgggctgcgacctgatggacagcttcacgctggacgagcccttttac
gagcagctgaatacagagaagagcaggatgtacgtggccaagggcccctcccgagagagc
cacttcaggggtttgctgcagaacgtctacttagtgtttgaaaactctgtggaagatctt
ctgagcaagaaaggttgtcagcaaagccagggagccgaagccaacgccatcagcgagaac
acggagacgctgcacctgagccccccggcctccatggagcacgtgggccggagcgagtac
aggggcccggaggtgtgcaggcactcctgcgaggagctgggcagcatgatcagggagctg
gcagggctgcacgccatcgtgaaccagctccacgagaacctgcgcaaagtgtcaaatgac
aaccagtttctctgggagctcattggaggaccccccaagaccaggaacatgtccgcctgc
tggcaggacggcaggttctttgcggagaatgaaacctgggtggtggacagctgcaccaag
tgcacctgcaagaaatttaaaaccgtttgccaccagatcagctgccctcccgccacgtgc
gccgacccgtcgtttgtggacggcgagtgctgcccctcctgcttccacgacggcgaggaa
ggctggtccccatgggcggagtggaccgagtgctccgtcacctgcgggcctggcacccag
cagagggggcggtcctgcgacgtcaccagtaacacctgcctggggccatccatccagacg
cggtcgtgcagcctgggcaggtgcgaccaccgcatccggcaggatggtggctggagccac
tggtcgccttggtcctcctgctccgtgacctgtggggtcggcaacgtcacccgcatccgt
ctctgcaactcaccagtgccgcagatgggtgggcggagctgcaaggggagtggccgcgag
accagggcctgccagggcccgccctgcccggtcgacggtcggtggagcccctggtccccg
tggtcggcctgcacggtcacctgtgccgggggcatccgggagcggacgcgcatctgtaac
agcccggagccccagcacggagggaaggactgcgtcggcgacgtcacggagcgtcaaatg
tgtaacaggaggagctgccccgcagacggctgcttgtccaacccctgcttccccggagcc
gagtgccgcagcttccctgacgggtcctggtcctgcggctcttgcccgctgggcttcctg
ggcaatggcacccactgcgaggacctggacgagtgcgctgtggtcaccgacgtgtgcttc
acgaccagcaaggcgcgccgctgcgtcaacaccagccccggcttccactgcctgccctgc
cccccgcgctacaaggggagccagcccttcggcgtcggcctggaggcggccaggacggag
aagcagctgtgtgagcctgaggacccctgcaaggacaagagccacaactgccacaggcac
gcggagtgcatctacctgggccacttcagcgaccccatgtacaagtgcgagtgccagacg
ggctatgcgggcgacgggctcatctgcggggaggactcggacctggatggctggcccaac
aagaacctggtctgcgccaccaacgccacctaccactgcatcaaggacaactgccccctc
ctgcctaactctgggcaagaagacttcgacaaggacggcatcggggacgcctgtgacgat
gacgacgacaacgacggcgtcagcgatgagaaggacaactgccagcttgtcttcaaccca
cgtcagtctgactacgacaaggacgaggttggagaccgctgtgacaactgcccctacgtg
cacaacctggcgcagatcgacacggacagcaacggcgagggcgacgcctgctccgtggac
atcgacggggacgatgtcttcaatgagcgagacaactgtccctacgtctacaacactgac
cagagggacaccgacggggacggcgtgggcgatcactgcgacaactgccccctggtgcac
aacccagaccagaccgacgtggacaatgacctcgtgggagaccagtgtgacaacaacgaa
gacatcgacgaggacggacaccaaaacaaccaggacaactgcccctacatctccaatgcc
aaccaggctgaccacgaccacgacggcaggggagacgcctgtgactcggacgatgacaac
gacggggtccccgatgacagggacaactgccggctggtggccaaccccgaccaggaggac
ttggacggtgatgggcgtggtgatgcttgcaaagatgactttgacaacgacagcatcccc
gatattgacgacgtgtgtcctgagaaccacgccatcagcgagacggacttccggaacttc
cagatggtccacctggaccccaagggcaccactcagatcgatcccaactgggtcattcgc
catcaaggcaaagagctggtgcagacggccaactccgaccctggcatcgctgttggtttc
gatgagtttgggtcagtcgacttcagcggcacattctacgtcaacacggaccgggatgac
gactatgctggcttcgtctttggctaccagtccagcagccgcttctacgtggtgatgtgg
aagcaggtcactcagacctactgggaagaccagcccacccgggcgtacggctactcaggg
gtgtccctcaaggtggtgaactccaccacggggacaggcgagcacctgaggaacgccctg
tggcacacggggaacacagagggacaggttcgcacgttatggcatgaccccaaaaatatt
ggctggaaggactacactgcctacaggtggcatctgacccacagacctaagacaggttac
ataagagtcttagtgcatgaaggaaaacaggtcatggcggactcaggacccatctatgac
caaacctacgctggtgggcggctgggtctgtttgtcttctctcaagaaatggtctacttc
tcggacctgaaatatgaatgcagagatgtctga

KEGG   Physeter catodon (sperm whale): 102980033
Entry
102980033         CDS       T06011                                 

Gene name
VWF
Definition
(RefSeq) von Willebrand factor isoform X1
  KO
K03900  von Willebrand factor
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04610  Complement and coagulation cascades
pcad04611  Platelet activation
pcad05165  Human papillomavirus infection
pcad05171  Coronavirus disease - COVID-19
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102980033 (VWF)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102980033 (VWF)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102980033 (VWF)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    102980033 (VWF)
   04611 Platelet activation
    102980033 (VWF)
 09160 Human Diseases
  09172 Infectious disease: viral
   05171 Coronavirus disease - COVID-19
    102980033 (VWF)
   05165 Human papillomavirus infection
    102980033 (VWF)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03110 Chaperones and folding catalysts [BR:pcad03110]
    102980033 (VWF)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102980033 (VWF)
Chaperones and folding catalysts [BR:pcad03110]
 Intramolecular chaperones
  Others
   102980033 (VWF)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102980033 (VWF)
SSDB
Motif
Pfam: VWD VWA C8 VWA_N2 TIL VWA_3 VWC Pacifastin_I
Other DBs
NCBI-GeneID: 102980033
NCBI-ProteinID: XP_028346736
UniProt: A0A455BS88
LinkDB
Position
6
AA seq 2321 aa
MFFQTSLESGPGNRIQKMIPARLTRLLLALALTLPGALCGEGTLGRSSMARCSLFGGDSI
NTFDESMYSFEGSCSYLLAGDCQTHSFSLIGDFQDGKRVSLSVYLGEFFDIHVFVNGTVL
QGDQRVSMPYASKGLYLETEAGYYKLSSEAYGFVARIDGSGNFQVLLSDRHFNKTCGLCG
NFNIFAEDDFQTQEGTLTSDPYDFANSWALSSGEQRCQRASPPSRLCNVSSELQKGLWEQ
CQLLKSASAFARCRPLVDPEPFVALCEGLLCTCARGLLCPCPVLLEYARACAQQGVLLYG
WADLSSCRPDCPAGMEYKACVPPCPRSCQSLHIDEVCQRQCVDGCSCPEGQLLDEGRCVD
SAECSCVHSGKRYPPGASLSRDCNTCICRNSQWVCSNEDCPGECLITGQSHFKSFDDRHF
TFSGVCQYLLARDCQDHSFSIVIETVQCADDPDAVCTRSVTVRLPSQPSNVVKLKHGGGV
AIDGQDVQIPLLKGDLRIQHTVMASVHLSYGEDLQIEWDGRGRLLVKLSPVYSGRTCGLC
GNYNGNQGDDFLTPAGLVEPLVEHFGNAWKLRADCEDLRKQPSDPCSLNPRLTRFADESC
AILTSPKFKACHGAVGPQPYLQNCRYDVCSCSNGRDCLCDAVASYAAACARRGVRIGWRE
PSFCALSCPRGQVYLQCGTPCNLTCRSLSYPDEECTEVCLEGCFCPLGLYLDETGACVPK
AQCPCYYDGEIFQPEDVFSDHHTMCYCEDGVMRCSTSGPPGSLLPDSVLSSPLSHRSKRS
LSCRPPMVKLVCPADNPRAEGLECAKTCQNYDLECVSMGCVSGCLCPPGMVRQENRCVAL
ERCPCFHQGREYTPGETVKVDCNTCVCRDRKWNCTDHVCDASCSALGLAHYLTFDGLKYL
FPGECQYVLAQDYCGSNPGTFRILVGNEGCSYPSLKCRKRVTILVEGGEIELFDGEVSVK
RSMKDETDFEVVESGLYITVLLGKGLSVVWDRHLGISVFLKQTFQEQVCGLCGNFDGIQN
NDLTSSGLQVEEDPVDFGNSWKVSPRCADTQKVPLDPSPATCHNNVIKQTMVDSSCSVLT
SAVFKECNRLVDPEPYLDVCIYDTCSCESIGDCACFCDTIAAYAHVCARHGAVVAWRTAT
LCPQNCEERNLRDNGYQCEWRYNSCAPACPVTCQHPEPLACPVQCVEGCHAHCPPGKILD
ELLQTCVNPEDCPVCEVAGRRLAPGKKITLNPGDPEHCQICHCDGVSLTCEACREPGGLE
VPPTEGPVGLTTPYVEDTPEPPLHDFFCSKLLDLVFLLDGSSKLSEADFETLKAFVVGVM
ERLHISQKRIRVAVVEYHDGSHAYVALQDRKRPSELRRIAGQVKYAGSEVASTSEVLKYT
LFQIFSKVDRPEASRVALLLTASQEPPRLARNLVRYVQGLKKKKVVMVLVGIGPHASLRQ
IRLIEKQAPENKAFVLSGVDKLEQQRDEIISYLCDLAPEAPALTQRPLTARVTTAPERPG
TSTLAPGRSSLVLDVAFLLEGSDSVGEANFNRSTEFLEDVIRQMDVGRDGVHVAVLQYAY
VVTVEHSFREAQSKEDVLQRLREIRYRGGNRTNTGLALQYLSEHSFSASQGDRQQAPNLV
YMVTGSPASDEIRRVPGDIQVVPIGVGPRVDVQELERIGWPQAPIFIKDFETLPREAPDL
VLQRCCSREGPHLPTLAPVPDCSQPLDVVLLLDGSSSFPASYFDEMKNFAKAFISKANIG
PQLTQVSVLQYGSIATIDVPWTVPLEKTHLQSLVDLMQQEGGPSQIGDALGFAVRYVTSQ
VHGARPEASKAVVILVMGTSTDSVDAAAAAARSSRVAVFPIGIGDGYDLAQLRILAGPGA
SSNVVQLQRIEDLPTMVSLGNSFLHKLCSGFISVCVDEDGNERRPGDIWTLPDQCHTVTC
LPDGQTLLNSHRVNCEQQLRPSCPNSQSPIRVEEACGCRWTCPCVCTGSSTRHIVTFDGQ
NFKLTGNCSYVLFHNKEQDLEVRLHNDACSAGARQACMKSVEVKHNGLSVELRSDMEVVV
DGRRVSAPYVGEDMEVSVYGAIMYEVRLSHLGHILTFTPQNNEFQLQLSPKTFASKMYGL
CGICDENGANDFMLRDGTVTTDWKTLVQEWTVQQPGQTCLLGPEEPCPVSRVSHCQVLLS
ALFAECHAVLAPATFHAICQQDSCHQEQVCEAIASYAHLCRTKGVCVDWRTPDFCAVPCP
PSLVYNHCERGCPRQCEGNSSSCGDHPSEGCFCPLHQVLLEGSCVPEEACTQCVGDDGIR
HQLLETWVPDHQPCQICTCLSGRKVNCTTQPCPTARGEAPP
NT seq 6966 nt   +upstreamnt  +downstreamnt
atgtttttccaaacttccctggagtctggccctggaaaccggatccaaaagatgatccct
gccaggctcacgaggttgctgcttgctctggccctcaccttgccaggggcgctttgtgga
gaaggcactcttggcaggtcgtcgatggcccgatgcagcctcttcggaggtgactccatc
aacacctttgacgagagcatgtacagctttgagggaagctgtagttacctcctggcgggg
gactgccagacgcactccttctctctcataggggacttccaagatggcaagagagtgagc
ctctccgtgtatcttggggaattttttgatatccacgtgtttgtcaatggtactgtgctg
cagggggaccagagggtctccatgccctatgcctccaaagggctgtacctagaaactgag
gctggttactacaagctgtccagcgaggcctacggctttgtggccaggattgatggcagc
ggcaactttcaagtcctgctgtcagacagacatttcaacaagacctgtgggctgtgtggc
aactttaacatctttgccgaagatgacttccagactcaagaagggaccttgacctcggac
ccctacgactttgccaactcctgggccctgagcagtggggaacaacggtgccagcgggcg
tcccctcccagccgcttgtgcaatgtctcctccgagctgcagaagggcctgtgggagcag
tgccagcttctgaagagcgcctcggcgttcgcccgctgccgccccctggtggaccccgag
cctttcgtggccctgtgtgaggggctgctgtgcacgtgtgcccgggggctgctgtgcccc
tgccctgtgctcctggagtacgcccgggcctgcgcccagcagggagtgctgctgtacggc
tgggcggacctcagctcctgccgaccggactgccccgcgggcatggagtacaaggcatgc
gtgcccccctgccccagaagctgccagagcctgcacatcgacgaagtgtgtcagcggcag
tgcgtggatggctgcagctgccctgagggacagctccttgatgaaggccgctgtgtggac
agtgccgagtgctcctgcgtgcattctgggaagcggtaccctccgggcgcctctctctcg
agagactgcaacacctgcatttgccgaaacagccagtgggtctgcagcaacgaggactgt
ccaggggaatgtctcatcacaggacaatcccacttcaagagctttgacgacaggcacttc
accttcagcggggtctgccagtacctgctggcccgggactgccaggaccactccttctcc
atcgtcatagagactgtccagtgtgccgacgaccccgacgcggtctgcacccgctcggtc
accgtccgcctgcccagccagcccagcaacgtcgtgaagctgaagcacgggggaggcgtt
gccatcgatggtcaggacgtccagatccccctcctgaaaggtgacctccgcatccagcac
accgtgatggcctccgtgcacctcagctacggggaggacctgcagatagagtgggacggc
cgcgggaggctgctggtgaagctgtccccggtctactcggggaggacgtgcggcctgtgc
gggaactacaacggcaaccagggggacgacttcctgacgcccgcgggcctggtggagccc
ctggtggagcacttcggaaacgcctggaagctgcgtgcggactgcgaggacctgcggaag
cagcccagcgacccctgcagcctcaacccgcgcctgaccaggttcgcggacgagtcctgc
gccatcctgacgtcgcccaagttcaaggcctgccacggcgccgtgggcccgcagccctac
ctgcagaactgccgctacgacgtgtgctcctgttccaacggcagagactgcctgtgcgac
gcggtggccagctacgcggcggcctgtgcgcggagaggcgtgcgcatcgggtggcgggag
cccagcttctgcgctctgagctgcccgcgaggccaggtgtacctgcagtgtgggaccccc
tgcaacctcacctgccgctccctctcttacccggatgaggaatgcaccgaggtctgcctc
gagggctgcttctgccccctggggctctacctggatgagaccggggcctgtgtgcccaag
gcccagtgcccctgttactacgacggcgagatcttccagcccgaagacgtcttctcggac
catcacaccatgtgctactgtgaggatggcgtcatgcgctgctccacgagtggacccccg
gggagcctgctgcccgactcggtcctcagcagccccctgtcccaccgcagcaaaagaagc
ctgtcctgtcggccccccatggtcaagctggtgtgtcctgctgacaacccgagggctgaa
gggctggagtgcgccaaaacctgccagaactacgacctggagtgcgtgagcatgggctgc
gtgtccggctgcctctgccccccgggcatggtccggcaggagaacaggtgtgtggccctg
gaaaggtgcccctgcttccaccagggcagagagtacacccccggagagacggtgaaggtg
gactgcaacacttgtgtctgtcgggaccggaagtggaactgcacggaccacgtgtgcgat
gcctcgtgctccgccctgggcctggctcactacctcacctttgacgggctcaagtacctg
ttccccggggagtgccagtacgtcctggcacaggattactgtggcagtaaccctgggacc
ttccggatcctggtggggaacgaggggtgcagctacccctccctgaagtgcaggaagcgc
gtcaccatcctggtggaaggaggcgagattgagctgtttgatggggaggtgagcgtgaag
aggtccatgaaggatgagactgactttgaggtggtggagtccggcctgtacatcaccgtg
ctgctaggcaagggcctctccgtggtctgggaccgccacctgggtatctctgtgttcctg
aagcagactttccaggagcaggtgtgtggcctgtgtgggaattttgacggcatccagaac
aacgacctcaccagcagtggcctccaagtggaggaagaccccgtggacttcgggaattcc
tggaaagtgagcccgcggtgcgccgacacccagaaagtgcccctggacccatcccctgcc
acctgccacaacaacgtcatcaagcagacgatggtggattcctcctgcagtgtcctcacc
agcgccgttttcaaggagtgcaaccggctggtggaccccgagccgtacctggatgtttgc
atctacgacacctgttcctgcgagtccatcggcgactgtgcctgcttctgcgacaccatc
gccgcctacgcccacgtgtgtgcccggcacggcgcggtggtggcctggaggacggccacg
ctgtgcccccagaactgcgaggaacggaacctgcgggacaacgggtaccagtgtgagtgg
cgctacaacagctgcgctcccgcctgtcccgtcacgtgccagcaccccgagccgctggcc
tgccccgtgcagtgtgtggagggctgtcacgcacactgccctccggggaaaatcttggat
gagcttttgcagacctgcgtcaaccctgaagactgccctgtgtgcgaggtggccggccgg
cgcttagcccccgggaagaaaatcaccttgaaccccggggaccccgagcactgccagatt
tgtcactgtgatggtgtcagcctcacctgtgaagcctgcagggaaccgggaggcctggag
gtgccccccacagaaggcccggtcggcctcaccaccccgtacgtggaggacacgccggag
ccgcccttgcacgacttcttctgcagcaaacttctggacctggtcttcctgctggacggc
tcctccaagctgtccgaggccgacttcgagacgctgaaggcgttcgtggtgggcgtgatg
gagcgtctgcacatctcccagaagcgcatccgggtggccgtggtggagtaccacgacggc
tcccacgcctacgtcgcgctgcaggaccggaagcggccgtcggagctgcggcgcatcgct
gggcaggtgaagtacgcgggcagtgaggtggcctccaccagcgaggtcttgaagtacacg
ctcttccagattttcagcaaggtggaccggcctgaggcctcccgcgtggccctgctgctc
acggccagccaggagccccccaggctggcccggaacttggtccgctacgtccagggcctg
aagaagaagaaggtcgtcatggtcctcgtgggcatcgggccccacgccagcctcaggcag
atccgcctcatcgagaagcaggcccccgagaacaaagcctttgtgctcagcggcgtggac
aagctggagcagcagagggacgagatcatcagctacctctgcgacctcgcccctgaagcg
cctgccctcacgcagcggccgctcacggcccgggtcaccacggcgccggagcgcccgggg
acttcgacgctggcacccgggaggagctccttggtcctggatgtggcattcctcctggaa
gggtcggacagtgtgggcgaggccaacttcaacaggagcacagagttcttggaggacgtg
atccggcagatggacgtgggccgggacggtgtccacgtcgcggtgctgcagtacgcatac
gtggtgaccgtggagcactccttcagagaggcgcagtccaaggaggatgtcctgcagcgg
ctgcgggagatccgctaccggggtggcaaccggaccaacaccgggctggccctgcagtac
ctgtcggagcacagcttctcggccagccagggggaccggcagcaggcacccaacctggtc
tacatggtcacgggaagcccggcctcagatgagatcaggcgggtgcccggagacatccag
gtggtgcccatcggggtgggaccccgcgtggacgtgcaggagctggagaggatcggctgg
ccccaggcccccatcttcatcaaggacttcgagacgctgccccgagaggctcctgatctg
gtgctgcagaggtgctgctccagagaggggccgcatctccccaccctcgcccctgtccca
gactgcagccagcccctggatgtggtcctcctcctggacggctcctccagctttccagct
tcttactttgacgaaatgaagaattttgccaaggctttcatctcaaaagccaatataggc
cctcagctcacccaggtgtcggtgctccagtatggaagcattgccaccatcgacgtgcca
tggaccgtgcccctggagaaaacccacttacagagcctcgtggacctcatgcagcaggag
ggaggccccagccaaatcggggacgcgttgggctttgcggtgcgctatgtcacatcccaa
gtccacggtgccaggcctgaagcctcgaaggcagtggtcatcctggtcatgggcacttcc
acggactccgtggacgcggccgccgctgccgccagatccagccgagtggcggtgttcccc
attggaattggagatgggtatgacttggcccagctgaggatcttggcgggcccaggggcc
agctccaacgtggtgcagctccagcggatcgaggacctccccaccatggtctccctgggc
aattccttcctccacaagctgtgctctgggttcatcagtgtttgcgtggatgaggacggg
aatgagaggaggcccggggacatctggaccttgccggatcagtgccacacggtgacctgc
ctgccagatggccagaccttgctgaatagtcaccgggtcaactgtgagcagcagctgcgg
ccttcctgccccaacagccagtccccgatcagggtggaagaggcctgtggctgccgctgg
acctgcccctgtgtctgcactggcagctctactcggcacatcgtgacctttgatggacag
aatttcaagctgactggcaactgctcatacgtcttatttcacaacaaggagcaggacctg
gaggtgcggctccataatgatgcctgcagcgccggggcaaggcaggcctgcatgaagtcc
gtggaggtgaaacataacggtctctcggttgagctccgcagcgacatggaggtggtggtg
gatgggaggcgggtctctgccccttacgtgggtgaggacatggaggtcagcgtctatggt
gccattatgtatgaggtcagactcagccatctgggccacatcctcaccttcaccccacaa
aacaacgagttccagctgcagctgagccccaagacctttgcctcaaagatgtacggtctc
tgtggcatctgtgatgagaatggggccaatgacttcatgctgagggatggcacggtcacc
acagactggaagacgctggtccaggagtggacggtgcagcagccgggccagacatgcttg
ctgggtcccgaggagccatgtcccgtctccagggtctcccactgtcaggtccttctctcg
gcgctgttcgccgagtgccacgcggtccttgccccagccacttttcacgccatctgccag
caggacagttgccaccaggagcaagtgtgtgaggccattgcctcttacgcccacctctgt
cggaccaagggggtgtgtgtcgactggaggacccccgatttctgtgctgtgccgtgcccg
ccatccctggtctacaaccactgtgagcgcggctgcccccgccaatgtgagggcaactca
agctcctgtggggaccatccctcagaaggctgcttctgccccctgcaccaagtgctgctg
gaaggcagctgtgttcccgaggaggcctgcacccagtgcgtcggtgacgatgggatccgg
caccagcttctggaaacgtgggtcccagaccaccagccctgccagatctgcacgtgcctc
agcgggcggaaggtcaactgcacgacacagccctgccctacagccagaggtgaggcccct
ccctaa

KEGG   Physeter catodon (sperm whale): 102981656
Entry
102981656         CDS       T06011                                 

Gene name
LAMA2
Definition
(RefSeq) laminin subunit alpha-2 isoform X5
  KO
K05637  laminin, alpha 1/2
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
pcad05410  Hypertrophic cardiomyopathy
pcad05412  Arrhythmogenic right ventricular cardiomyopathy
pcad05414  Dilated cardiomyopathy
pcad05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102981656 (LAMA2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102981656 (LAMA2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102981656 (LAMA2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102981656 (LAMA2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102981656 (LAMA2)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    102981656 (LAMA2)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    102981656 (LAMA2)
   05414 Dilated cardiomyopathy
    102981656 (LAMA2)
   05416 Viral myocarditis
    102981656 (LAMA2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102981656 (LAMA2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102981656 (LAMA2)
   05145 Toxoplasmosis
    102981656 (LAMA2)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_I Laminin_N Laminin_B Laminin_II Laminin_G_3 LXG DUF948 DUF2630
Other DBs
NCBI-GeneID: 102981656
NCBI-ProteinID: XP_023978307
UniProt: A0A2Y9SLJ5
LinkDB
Position
10
AA seq 3124 aa
MPGGAGVLLVLLLSGGLGGGQAQRPQQQQRRQPPAHQQRGLFPAVLNLASSALITTNATC
GEKGPEMYCKLVEHVPGQPVRNPQCRICHQSSSNPQLRHPITNAIDGKNTWWQSPSIKNG
IEYHYVTITLDLQQVFQIAYVIVKAANSPRPGNWILERSLDDVDYKPWQYHAVTDTECLT
LYNIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDPSPELLEFTSA
RYIRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRYYYSVKDISVGGMCICYGHARACPL
DPVTNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEACNCHGKAEECYYDEN
VARRNLSLNIRGKYIGGGVCINCTQNTAGINCETCTDGFFRPKGISPNYPRPCQPCHCDP
VGSLNEVCVKDEKRAQRGLAPGSCHCKPGFRGVSCDRCARGYTGYPDCKPCNCSGAGSTN
EDPCFGSCNCKENVEGGDCSRCKFGFFNLQEDNHKGCDECFCSGVSDRCQSSYWTYGNVQ
DMSGWYLTDISGHIRVTPQQDELDPPQQISISSVEARRALPQSYYWSAPALYLGNKLTAV
GGQLTFTISYDLKEEEEDTEHILQLMIILEGSDLRISTAQDEVYLQPSEEHINVLSLKED
LFTIHGTNFQVSRKEFMTVLANLKRVLIQITYSLGMDAIFRLSSVGLESAVPYPTDRNIA
AAVEVCQCPSGYTGSSCESCWPRHRRVNGTIFGGICEPCQCFGHAESCDDITGECLNCKD
HTGGLYCNTCLPGFYGDPTKGTSDDCQPCACPLNIPSNNFSPTCHLDRSLGLICDECPVG
YTGPHCERCAEGYFGQPSLPGGSCQPCQCNDNLDFSIPGSCDSLSGSCLICKPGTTGRYC
ELCADGYFGDAVDAKNCQSCHCNANGSFSEVCHAQTGQCECKPNVQGRQCDECKPETFGL
QSARGCIPCNCNSFGSKSFDCEESGQCWCQPGVTGKKCDRCAHGYFNFQEGGCTACDCSH
LGNNCDPKTGRCICPPNTIGEKCSKCVPNTWGHSITTGCKPCNCSSAGSLDFQCNINTGQ
CNCRPKFSGAKCTECNRGHWSYPHCNPCDCFLPGTDVSTCDLETKKCSCTDKTGQCTCKV
NVEGIRCDRCRLGKFGLEAKNPLGCSSCYCFGATTQCSEAKGLIHTWVTLKPDQTILPLV
DEALQHTTTRGIIFQHPEIVAHMDLVRQDLHLEPFYWKLPEQFEGKKLMAYGGKLKYTIY
FEAREETGFSTYNPQVIIRGGTPSHVRMIIRHMAAPLIGQLTRHEIEMTEKKWKYYGDDP
RISRTVTREDFLDVLYDIHYILIKATYGNIMRQSRISEISMEVAEQGRITAMTPPAHLIE
RCDCPPGYSGLSCEACMPGFYRLRSEPGGHTPGPALGTCVPCQCHGHSSLCDPETSICQN
CQHHTAGDFCERCVLGYYGIVKGLPDDCQQCACPLISSSNNFSPSCVTEGLDDYRCTACP
REYEGQYCERCAPGYTGSPSSPGGSCQECECDPHGSLSVPCDPVTGICTCRPGATGQKCD
GCKHWHAREGMECVFCGDECTGLLLGDLALLEQMATSINLTGPLPAPYKILYGLENRTQE
LKHLLSPQRAPERLIQLAEGNLNTLVTEMNELLTRATKVTADGEQTGQDAERTNTRANSL
GEFIKDLAQHAEAVSEKAVKLNETLGIQDKAFERNLQELQNEVDKMMTELRRKTLDTQKE
VAEDELVAAEGLLKKVKKLFGESRGKNEEMEKDLREKLADYKSKVDDAWDLLREATDKIK
EANLLSAENQKNMTALEKKKEAIESGKRQTEDTLKEGTDILDEANRLADEINSVINYVED
IQTKLPPLSEDLKDKIDDLSQEIKDRKLAEKVSQAESHAAQLNESSAVLDRILNEAKNIS
FNATAAFKAYSNIKDYIDEAEKIAKEAEVLAREATELATGPQGSLQEGAKGSLQKSFGFL
NEAKKLANDVKENDDHLNDLTTRLDNASVRNRDLLRALNDTLEKLSAIPNDTAAKLQAVK
DKARQANDTAKDVLAQIKDLHQNLDGLKKNYNQLADSVAKTNAVVKDPSKNKIIADADAT
VKNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCIRTY
KPEIKKGSYNNIIVNVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVE
YPDLTIDDSYWYRIEASRTGRNGTISVRALDGPKASIMPSTYHSASPPGYNILDVDANAM
LFVGGLTGKLKKADAVRVITFTGCMGEAYFDSKPIGLWNFREIEGDCKGCTVSPQVEDSE
GTIQFDGEGYALVSRPIRWYPNISTVMFKFRTFSSNALLMYLATRDLKDFMSVELTDGHI
KVSYDLGSGMASVVSNQNHNDGKWKSFTLSRTQKQANISIVDIDTNQEENIATSSPGNNF
GLDLKADDKIYFGGLPTLRNLSMKARPEVNLKKYSGCLKDIEISRTPYNILSSPDYVGVT
KGCSLENVYTVSFPKPGFVELSPVPVDVGTEINLSFSTKNESGIILLGSGGTPAPPRRKR
RQTGQAYYAIFLSKGRLEVHLSTGARTMRKIAVKPEPSLFHDGREHSVHVERTRGIFTVQ
VDEDRRHVQNLTVEQAIEVKKLFVGGAPPEFQPSPLRNIPPFEGCIWNLVINSVPMDFAQ
PVSFKNADIGRCAHQKPPEDEDGAVPAGTVIQPEPVPTPAVPAPTPVLAHGPCAAESEPA
LLIGSKQFGLSRNSHIAVAFDDTKVKNRLTIEFEVRTEAESGLLFYMARINHADFATVQL
RNGLPYFSYDLGSGDTNTMIPTKINDGHWHKIKIIRIKQEGIIYVDGASNRTISPKKADI
LDVVGMLYIGGLPINYTTRRIGPVTYSIDGCIRNLQMAEAPADLEQPTSSYHVGTCFANA
QKGTYFDGTGFAKAVDGFKVGLDLLVEFEFRTTRTTGVLLGISSQKMDGMGIEMIAEKLM
FHVDNGAGRFTAVYDAGIPGHLCDGQWHKVTANKVKHRIELTVDGNHVEAQSPNGASTSA
DTNDPVFVGGFPEGLNQFGLTTNIRFRGCIRSLRLTKGTGKPLEVNFAKALELRGVQPVS
CPAN
NT seq 9375 nt   +upstreamnt  +downstreamnt
atgcctggaggcgccggggtcctcctggtcctgctgctcagcggaggcctcgggggcggc
caggcgcagcgtccgcagcagcagcagcggcggcagccaccggcgcatcagcaaagaggt
ttattccctgctgtcctgaatcttgcttccagtgccctcatcaccacaaatgcaacatgt
ggagaaaaaggaccggaaatgtattgcaaattggtagagcacgtccctgggcagcccgtg
aggaaccctcaatgtcggatctgccatcaaagcagcagcaatccacagctgagacacccg
attacaaatgctattgatggaaagaacacttggtggcagagtcccagtattaagaatgga
atcgaataccattacgtgacaattacactggatttacagcaggtgttccagattgcgtat
gtgattgtgaaagcagctaattcccctaggcctggaaactggattttggaacgctccctt
gacgatgttgattataagccctggcagtatcatgctgtgacggacacagagtgcctaaca
ctctacaatatttatccccgcactggaccaccatcatacgccaaagacgatgaggtcatc
tgcacttcattttattccaagatacaccccttagaaaacggagagattcacatctctttg
atcaatgggagaccgagtgctgatgatccgtctcctgaactgctagaatttacctccgcc
cgctatattcgcctgagatttcagagaatccgcaccttgaatgccgatttgatgatgttt
gctcacaaagacccaagagaaattgaccccattgtcacacgaagatattactactcagtc
aaggatatttctgttggagggatgtgcatctgctatggtcatgccagggcttgtccactt
gatccagtgacaaataaatcccgctgtgaatgtgagcataacacgtgtggtgatagctgt
gatcagtgctgtcctggatttcatcagaaaccgtggagagcaggcacttttctgaccaaa
actgaatgtgaagcctgcaattgtcatgggaaagctgaagaatgctattatgatgaaaat
gttgccagaagaaatctgagtttaaatatacgtggaaagtacattggagggggtgtgtgc
attaattgtactcaaaacactgctggtataaattgtgagacatgcactgatggtttcttc
agacccaaagggatatctccaaattatccaagaccatgccaaccatgccactgtgaccca
gttggttccttaaatgaagtctgtgtcaaggatgaaaaacgtgctcaacgaggtctggca
cctgggtcctgtcattgcaaacctggcttcagaggggtgagctgtgatcggtgtgccagg
ggctacactggctacccagactgcaaaccctgtaactgcagtggtgcagggagcacaaat
gaggacccttgttttggatcctgtaactgcaaggaaaatgtcgaaggaggagactgtagt
cgctgcaaattcggcttcttcaatttgcaagaggataatcataaaggttgtgatgagtgt
ttctgttcaggggtttcagacagatgtcagagctcctactggacctatggcaatgtacaa
gacatgagtggctggtatctgactgacatctctggccacattcgagtgactccgcagcag
gatgaattagatccacctcagcagatcagcatcagcagcgtggaggcccgacgagccctg
ccccagagctactactggagtgcaccagctctgtatctgggaaacaaactcacagcagtt
ggaggacaattgacatttactatatcatacgacctcaaagaagaggaggaagatacagaa
cacatactccagcttatgattatcttagagggaagtgacttgagaatcagcacagcccaa
gatgaggtgtatctgcaaccatctgaagaacatattaatgtgttgtcacttaaagaagat
ttatttaccatacatggcacaaatttccaagtcagtaggaaagagtttatgaccgtgctt
gcgaatttaaagagagtccttatacaaatcacatacagccttgggatggatgccatcttc
aggctgagttctgttggccttgaatctgctgtcccctatcctactgacagaaacattgca
gcagctgtggaagtttgtcagtgtccatcagggtatactggctcctcctgtgaatcttgt
tggcctaggcacagacgagttaacggcactatttttggtggcatctgtgaaccctgtcag
tgctttggtcacgcggaatcctgtgatgatatcactggagaatgcctgaactgtaaggat
cacactggtggcctatattgcaatacatgtcttcctggtttctatggtgatcctactaaa
ggaacctctgatgactgtcagccctgtgcctgtccactcaatatcccatccaataacttt
agcccaacgtgccatttggaccgaagtcttggcctgatctgtgatgaatgccctgtcggg
tacacaggaccacactgtgagaggtgtgcagaaggctattttggacaaccttctctacct
ggaggatcgtgtcagccatgccagtgcaatgacaaccttgacttctccatccctggcagc
tgtgacagcttgtctggctcctgtctgatatgtaagccaggtacaacaggccggtactgt
gagctttgtgctgatggatattttggagacgcagttgatgcaaagaactgtcagtcttgc
cactgtaatgccaacggctccttctctgaggtttgccatgctcagacgggacagtgtgag
tgcaaacctaacgtgcaggggcggcagtgtgatgaatgtaagcctgaaacctttggcctc
caatcagcaagggggtgtattccctgcaactgcaattcctttgggtctaaatcatttgac
tgcgaagagagtggtcaatgttggtgccagcccggagtaacaggaaagaagtgtgatcgc
tgtgcccacggctatttcaactttcaggaaggaggatgcacagcttgtgactgttcccat
ctgggtaataattgtgacccaaagactggtcgatgcatttgcccacccaataccattgga
gagaaatgttctaaatgtgtacctaatacctggggccacagcattaccactggttgtaag
ccttgtaactgcagctcggcgggatccttggatttccaatgcaatataaacacggggcaa
tgcaactgtcgtcccaaattctctggtgcaaaatgtacagagtgcaatcgaggccactgg
agctaccctcactgcaatccctgtgactgcttcctgccagggactgatgtctcaacctgt
gatttagagactaaaaaatgctcctgtaccgataagactgggcagtgtacttgtaaggtg
aatgtggaaggcatccgctgtgacaggtgccggcttggcaaattcggacttgaggccaag
aacccacttggctgcagcagctgctattgcttcggcgccactactcagtgctctgaagca
aaaggactgatccatacgtgggtgactctgaagcctgatcagaccattctgcccctggtg
gatgaggcgctgcagcacacaactaccagaggcatcatttttcaacatccagagattgtt
gcacacatggatctggtgagacaagatctccatttggaacctttttattggaaacttcca
gaacagtttgaaggaaagaagttgatggcctatggtggcaaactcaaatacacaatctat
tttgaggctcgggaagagacaggtttttctacatataacccacaagttatcattcgaggt
gggactcctagtcatgtgagaatgatcatcaggcatatggctgctcctctaattggccag
ttgacaagacatgaaattgaaatgacagagaaaaaatggaaatattatggtgatgatcct
cgaatcagtagaactgtgacccgtgaagacttcttggatgtactatatgatatccattat
attcttatcaaggctacctatggaaatatcatgagacaaagcaggatttctgaaatctca
atggaggtagctgaacaaggacgcatcacagcaatgactcctccagctcacttgatagaa
agatgtgattgtcctccgggctattctggcttgtcctgtgaggcatgcatgccaggattt
tatcgactgcgttctgagccaggtggccatacccctggaccagccctgggcacctgtgtt
ccatgtcagtgtcatggacacagcagcctgtgtgaccctgaaacttccatatgccagaat
tgtcaacatcacactgctggtgacttctgtgaacgatgtgttcttggatattatggcatt
gtcaagggattgccagatgactgtcagcaatgtgcttgccctctgatttcttccagcaac
aatttcagcccttcttgtgtcacggaaggcctggatgattaccgctgcacagcttgccca
cgagaatacgaaggccagtactgcgaacggtgtgcccctggctataccggcagcccaagc
agccccggaggctcctgccaagaatgtgagtgtgacccccatggctcactgtctgtcccc
tgtgaccctgtcacagggatctgcacgtgccgacccggagccacggggcagaagtgtgac
ggctgcaaacactggcatgcacgcgagggcatggagtgtgttttttgtggagatgaatgc
accggccttcttctcggtgacttggctctcctggagcagatggccacgagcatcaacctc
actggcccactgcccgctccttataaaattctgtatggtcttgaaaataggactcaggaa
ctcaagcatttgctctcacctcagcgggccccagagaggctcattcagttggcagaaggc
aatctgaacacgctggtgacagaaatgaatgaacttctgaccagggctaccaaagtgaca
gcagatggcgagcaaactggacaggatgcagagaggaccaacacgagagcaaactcctta
ggagaattcattaaggacctggcccagcatgcagaagctgtaagtgaaaaagctgtaaaa
ctaaatgaaactctaggaattcaagacaaggcctttgagagaaatttgcaagagcttcag
aatgaagttgataagatgatgacagaactgaggaggaaaactctagacacacaaaaggaa
gttgctgaagatgagttggtagctgcagaaggccttctgaagaaagtgaagaagctgttt
ggagaatcccgagggaaaaatgaagaaatggagaaggatctccgggagaaactggcagac
tacaaaagcaaggttgatgatgcttgggacctgttgagagaagccacagataaaatcaaa
gaagctaatctcttatctgcagaaaaccagaaaaacatgactgctttggagaaaaagaag
gaggctattgaaagtggcaaacgacaaactgaggacactttaaaagaaggcactgacata
cttgatgaagccaaccgtcttgcagatgaaatcaactcagtcataaactatgttgaagac
attcaaactaaattgccacccctgtctgaggatctgaaagataaaatagatgacctctcc
caagaaataaaggacaggaagcttgctgagaaggtgtcccaggctgagagccatgcggct
cagttaaatgagtcatctgctgtccttgatagaatcctcaatgaggccaaaaacatctcc
ttcaatgccactgcagccttcaaagcttacagcaatattaaggactatattgatgaagct
gagaaaattgccaaagaagccgaagttcttgcacgggaagccacagaattggcaacaggt
cctcagggttcattacaggaaggtgccaaaggctctcttcagaaaagcttcgggtttctt
aatgaagccaagaagttagcaaatgatgtaaaagaaaatgatgaccatctgaatgacttg
acaaccagattagacaatgccagtgtcagaaacagggatctcctgagagctttgaatgac
actttggaaaagttatcagccattccaaatgacacagccgctaaactgcaagctgttaag
gacaaagcaagacaagccaacgacacagctaaagatgtactggcacagattaaagatctc
caccagaatcttgatggcctgaagaaaaattataatcaactggcagacagcgtagccaaa
acaaatgccgtggtaaaagatccttcaaagaacaaaatcattgcagatgcagatgccact
gtaaaaaatctagaacaagaagctgatcgactgatagataaactcaaaccgatcaaggaa
cttgaggataacctgaagaaaaacatctctgagataaaagaactgataaaccaagcccgg
aagcaggccaattctatcaaagtatctgtgtcttcgggaggtgactgcattcgaacatac
aagccagaaatcaagaaaggaagctacaataacatcattgtcaacgtaaagacggctgtc
gcagacaacctccttttttatcttggaagtgccaaatttattgactttctggctatagaa
atgcgtaaaggcaaagtaagcttcctctgggatgttggatccggagttggacgtgtcgag
tatccagatttgactattgatgactcatattggtaccgtattgaggcatcaagaactggg
agaaatggaacaatttctgtgagagctctggatggacccaaagccagcattatgcccagt
acataccattcagcgtcacctccagggtacaatattctagatgtggacgcaaacgcaatg
ctgtttgttggtggcttgactgggaaattgaagaaggctgatgctgtacgtgtgattaca
ttcactggctgtatgggagaagcatactttgacagcaaacctatagggttgtggaatttc
cgagaaatagaaggtgactgcaaaggatgtaccgtcagtccgcaggtagaagatagtgag
gggactattcagtttgatggagaaggctatgcattagtcagccgccccattcgttggtac
cccaacatctctactgtcatgttcaagttcaggacattttcttcaaatgctctcctgatg
taccttgccacacgagacctgaaagatttcatgagtgtagagctcactgatgggcacata
aaagtcagctatgatctgggttcaggaatggcttccgttgtcagcaatcaaaaccataat
gatgggaaatggaaatcattcaccttgtcaagaactcaaaaacaagccaacatatcaatt
gtagatatagatactaaccaggaggagaacatagcaacttcatctcctggaaacaacttc
ggtcttgacttgaaagcagatgacaaaatatattttggtggtctgccaacactaagaaat
ttgagtatgaaagcaaggccggaagtaaatttgaagaaatattctggctgcctcaaagat
attgaaatttcaagaactccatataatatacttagcagccctgattatgttggtgttacc
aaaggatgttcactggagaacgtttacacagttagcttcccgaagcctggttttgtagag
ctatcccctgtgcctgttgatgtaggaacagaaatcaacctgtccttcagcaccaagaat
gagtctggtattattctcttgggaagtggagggacaccagcgccacctaggagaaaacga
aggcaaactggacaggcctattatgcgatattcctgagcaagggtcgtctggaagtgcat
ctctccacaggggcacgaacaatgaggaaaattgccgtcaaaccagagccgagtctgttt
cacgacgggagagaacattctgttcatgtagagagaactagaggcatctttactgttcaa
gttgatgaagacagaaggcatgtgcaaaatctgacagtagaacaggcgatagaagttaaa
aagcttttcgtcgggggtgctcctcctgaatttcaaccttccccactcagaaatattcct
ccttttgaaggctgtatatggaaccttgttataaactctgtccctatggactttgcacag
cctgtatccttcaaaaatgcagacatcggtcgctgtgcccatcagaaaccacctgaggat
gaagatggagcagttccagctggaacagttatccagccagagccagttcccaccccagcc
gtacctgcacccaccccagttctggcacatggtccttgtgctgcagaatcggaaccagct
ctcctgatagggagcaaacagtttgggctttcaagaaacagtcacattgcagttgcattt
gatgacaccaaagttaaaaaccgtctcaccattgaatttgaagtgcgaaccgaagctgaa
tcaggcttgctcttttacatggcccggatcaatcatgctgattttgctacagttcagctg
agaaatggactgccctacttcagttatgacttgggaagtggtgacaccaacaccatgatc
cccaccaaaatcaatgacggccactggcacaagattaagattattcgaattaagcaagaa
ggaattatttatgtagacggtgcctccaacagaaccatcagtcccaagaaagctgatatt
ctggatgttgtgggaatgctgtatattggcgggctacccattaactacactacccgaaga
attggtccagtgacctacagcattgatggctgcatcaggaatcttcagatggcagaagcc
cctgctgatcttgaacagccaacctccagctaccatgtcgggacgtgttttgcaaatgct
cagaaaggaacatattttgatggaacgggttttgccaaagcagttgatgggttcaaagta
ggattggaccttcttgtagaatttgaattccgcacaactagaaccactggagttcttctg
ggaatcagcagccagaaaatggatggaatgggtattgaaatgattgctgaaaagcttatg
tttcatgtggacaatggcgccggccgattcactgctgtctatgatgctgggatcccaggg
cacttgtgtgacggacagtggcataaagtcacggccaacaaggtcaagcaccgaatcgag
ctgacagtagatgggaaccacgtggaagcccagagcccaaatggagcatctacatccgct
gatacaaatgacccagtgtttgttggagggttcccagagggcctcaaccagtttggcctg
acaaccaacattcgtttccgaggttgcatccgatccctaaggctcaccaaaggcacggga
aagccattggaggtgaattttgccaaggccctggaactaaggggtgttcaacctgtatca
tgcccagccaactaa

KEGG   Physeter catodon (sperm whale): 102982121
Entry
102982121         CDS       T06011                                 

Gene name
LAMA3
Definition
(RefSeq) laminin subunit alpha-3 isoform X4
  KO
K06240  laminin, alpha 3/5
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102982121 (LAMA3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102982121 (LAMA3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102982121 (LAMA3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102982121 (LAMA3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102982121 (LAMA3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102982121 (LAMA3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102982121 (LAMA3)
   05145 Toxoplasmosis
    102982121 (LAMA3)
SSDB
Motif
Pfam: Laminin_G_2 Laminin_I Laminin_G_1 Laminin_II Laminin_G_3 Laminin_EGF COG5 DUF948 WXG100
Other DBs
NCBI-GeneID: 102982121
NCBI-ProteinID: XP_007127273
UniProt: A0A2Y9FN43
LinkDB
Position
19
AA seq 1726 aa
MPPAVRGSTRSAGWLWIFGAALGQCLGYNSQQQRVVLPQLPGQSQRQASYVEFKPSQGCS
PGYYRDNKGLYTGRCVPCNCNGHSNRCQDGSGICLNCQHNTAGEHCERCKEGYYGNAVHG
SCRVCPCPHSNSFATGCVVTGGNVKCSCKPGYTGTQCERCAPGYFGNPQKFGGSCQPCRC
NSNGHLGSCDPLTGDCINQEPKDGDPGEECDDCDSCVMTLLNDLATMEDELLLVKSQLQG
LSASVGTLEQMRHLETQTKDLRSQLLSYSSTISNHKSKLDGLEKELSNLNREFETLQEKV
QINSRKAQTLYNNVDRTTQSAKELDTKIKNVIRNVHILLKQISETEGEGNNLPLDDFSRK
LAEAERMLRELRNRNFGKNLREAEAEKREAQLLLIRIRSWLENHQLENNGLVKNIRDSLN
EYEAKLSDLRAALQEASDQAKQATGLNRENERALESIKRQVKEMNSLQSDFTKHLATADS
SLLKTNVALQLMSKSHEEYEKLAVTLNEARHELSDKVRELSKSSSKASLVEEAEKHAQSL
QELARQLEEIERNTSGDELVRCAVDAATAYENILNAIKAAEDAADKATSASEAALQTVIK
EDVPRKAKTLSSNSDKLLNEAKITQKKLQQEISPALNNLQQTLKIMTVQKGLVDTNLTTI
SDDLRGIQRDDIAGIIDSAKSMVRNANDITNDVLDGLNPIQTDVDRIKDTYGSTGSEDFN
KALTDADNSVKKLTNKLPDLLSKIESINQQLLPLGNISDNMDRIRELIQQARDAANKVAI
PMRFNGKSGVEVRLPNDLEDLKGYTSLSLFLQRPESTESGGTENMFVMYLGNKDASRDYI
GMAIVDGQLTSVYNLGEREAELQVDQTLTKSETQEAVMDRVKFQRIYQFARLNYTRKATS
SKPETPQFHDMDSGDSNTLLDLDPENVVFYVGGYPSDFRLPSRLRFPPYKGCIELDDLNE
NVLSLYNFKKTFNLNTTKVEPCRRRKEESDKNYFEGTGYARVPTQPNAPFPTFGQTIQTT
VDRGLLFFAENQDRFISLNIEDGSLMVRYKLNSEQPKEKGIRSIINDGRDHLILIKIGKA
QKLMRVNVDSQSIKIEGDILDFSTYYLGGIPISIRERFNISTPAFRGCVKNLKKTTGVVR
LNDTVGVTKKCSEDWKLVRSASFSRGGQLSFTNLDFPLPSHFQASFGFQTFQPSGILLNH
QTGASSLQVTLEDGHIELSPRDSSSPVFTSPQTYTDGLLHYVSVISDNSGLRLLIDDQPL
KNNERLLGFSNSQQSLRLGGSNFEGCISNVFIQRLSESPEVLDLASKSTKRDVSLGGCSL
NESPFLMLLKGSTRFNKAKTFNIHQPLCDTPAASPRSLKVRGDAQSCPPLPEAQASHRAL
RFGDSPTSHLLFMLPQELLKPRSQFAMDLQTTASRGLVFYTGTKNSFMALYLSKGRLVFA
LGAGGKQLKLKSKEKCSDGKWHTVAFGQDGERGRLVVDGLRTREGRLPGNSTVSLRAPVY
LGSSPSGKPKTLPQNSFVGCLRNFQLDLKPLDTASASFGVSPCFDGSLEKGIYFSQEGGH
VILANSVLLGPEFKLVFSIRPRSLTGILIHIGSQPGEHLHVYMEAGKVTASVGSEAGRIL
TSVTPKQSLCDGHWHSVTVTIKQHILHLKLDADYSSTAGRHPVPTARTREHLHIGGVPAN
LKTLKLPAWKSFLGCLKNIQVNHIPVPVTEAAEVRGTVSLNGCPDH
NT seq 5181 nt   +upstreamnt  +downstreamnt
atgcccccggcagtgagggggtccacccggagtgcgggatggctgtggatctttggggca
gccctggggcagtgtctgggatacaattcacagcagcaaagggtagtgcttcctcagctt
cccggtcaaagccaacggcaagcaagttatgtggagtttaagcccagccagggttgtagc
cctggatactatcgggataacaaaggcttatatactggacggtgtgttccctgcaattgc
aacggacattcaaatcgctgccaggatggctcgggaatatgccttaactgccagcataac
actgccggggaacactgtgagcgctgcaaagagggttactatgggaacgctgtccacgga
tcctgtagggtctgcccgtgtcctcattcaaacagttttgccaccggctgtgttgtgact
gggggaaacgtgaagtgctcctgcaaacctggatacacaggaacacaatgcgaaaggtgt
gcaccaggatattttggaaatccccagaaatttggaggcagctgccaaccatgccgctgt
aacagcaacggccatttgggcagttgtgaccccctgacgggagactgcataaaccaagaa
cccaaagatggtgaccctggagaagaatgtgatgattgtgacagctgtgtgatgacgctc
ttgaacgacctggccaccatggaggacgagctcctcctggtcaagtctcagctgcagggc
ctgagtgccagtgtggggaccctggagcagatgaggcacttggagacccagaccaaggac
ctgaggagccagttgctcagctacagttccaccatttcaaatcataaatcaaaactggat
ggcctggaaaaagaactgagtaacttgaatcgcgaatttgaaactttgcaagaaaaggtt
caaataaattccagaaaagctcaaacattatataacaatgttgatcggacaacccaaagc
gccaaagagctggacacgaagattaaaaatgtcatccggaatgtgcacattctcttgaag
cagatctctgagacagaaggagaaggaaacaacttgcctttggatgatttttccagaaag
ctggctgaagcggagcgcatgttaagggaactgaggaaccgcaactttgggaagaacctg
agggaagcagaagctgaaaaaagagaggctcagcttctgctgattcggataaggagctgg
ctggaaaaccaccagttggagaacaatgggcttgttaagaatatccgagattctttaaat
gaatacgaagccaaactcagtgacctccgtgccgctctacaggaggccagtgaccaagca
aagcaggccaccggcctcaaccgagaaaacgagagagctttggaatccatcaagagacaa
gttaaagaaatgaattccctccagagtgacttcaccaagcatctagccactgcagactct
tccttactgaaaactaatgttgccttgcagctgatgagtaaaagccatgaggaatatgaa
aaactagctgtcactttaaatgaagcaagacacgaactaagtgacaaagtgagagaactt
tccaaatcgtccagtaaagcatccctggtggaggaggcagagaagcatgcgcagtcttta
caggagctggcaaggcagctggaggagatcgagaggaacaccagtggggatgagctggta
cgctgtgctgtggacgctgccaccgcctatgagaacatcctcaatgccatcaaagcagcc
gaggacgcagcggacaaggccaccagtgcgtctgaggctgccctccagacagtgataaag
gaagatgttccaagaaaagctaaaaccctgagttccaacagcgataaactgttaaatgaa
gccaagataacccagaagaagctacagcaagaaatcagtccggctctcaacaacctgcag
caaactctgaaaattatgaccgttcagaaagggctggtagacaccaatctcacgaccatc
agcgatgatcttcgtgggatacagagagatgacatcgctggtataatcgatagtgcaaag
agcatggtcagaaatgccaacgacatcacaaatgatgttctagatgggctcaaccccatt
cagacagacgtggacagaattaaggatacctatgggagcaccgggagtgaagacttcaac
aaggctctcactgatgcagataattcagtaaagaaattaaccaacaaactgcctgatctt
ttgagcaagattgaaagtatcaaccaacagctgttgccactgggcaacatctctgacaac
atggaccgcatacgggagctaattcagcaggccagagatgctgcgaataaggtcgccatc
cccatgaggttcaacggtaaatctggagtggaagtgcggctgccaaatgacctggaggat
ttgaaaggatacacatctctttctttgtttctccaaagacctgaatcaacagaaagtggg
ggaaccgagaatatgtttgtgatgtacctcggaaataaagatgcctccagggactacata
ggcatggcaatcgtagatggccagctcacgagcgtctacaacctgggagaacgcgaggct
gaactccaagtggaccagaccttgaccaagagcgaaactcaggaggcagttatggaccgg
gtgaaattccagagaatttatcagtttgcaaggctgaattacacccgaaaagccacatcc
agtaagccagaaacacctcaattccatgacatggatagtggggatagcaacacgctcctc
gatttggatcctgaaaatgttgtattttatgttggaggttacccatctgattttagactt
cctagtagactgagattccctccatacaaaggttgtattgaattagatgacctcaatgaa
aatgttctgagcttgtacaacttcaaaaagactttcaatctcaacacaacaaaagtggag
ccttgtagaaggagaaaggaagaatcagacaaaaattattttgaaggtacaggctatgct
cgagttccaactcaaccaaatgctcccttcccaacctttggacagacaattcagaccacc
gtggatagaggtttgctgttctttgcagaaaaccaggatcgcttcatatctctaaatata
gaagatggcagtctcatggtgcgatacaaactgaattcagagcaaccaaaagaaaaagga
attagaagcatcataaacgacgggagagatcatttgattctgatcaaaattggtaaagcc
caaaaacttatgcgggtaaacgtggattctcaaagcattaaaattgaaggtgatatactt
gatttcagcacatattatctgggaggaattccaatttcaattagggaaaggtttaacatt
tctacacctgctttccgaggctgcgtgaaaaacctgaagaagaccactggtgtcgttaga
ctgaatgatactgtgggcgtaacaaaaaagtgctcggaagactggaagctcgtgcgatct
gcctcattctccaggggaggacagttgagcttcaccaatttggacttcccgttgcccagc
cacttccaggcctcctttgggtttcagacctttcaacccagtggcatcttattaaatcat
cagacaggggcaagcagcctacaggtcaccctggaagatggccacattgaactgagcccc
agggacagcagcagcccagttttcacatctccgcagacgtacacggatggtttactgcat
tacgtgtctgtaataagcgacaactctgggctccggctcctcattgatgaccagcctctg
aaaaataacgaaaggctgctaggcttttcaaattcccagcaatccctgcgtctgggcggg
agtaattttgagggctgcatcagcaacgtcttcatccagagattatcagagagtcccgaa
gtcctagatttggccagtaaatccaccaagagagatgtgtccctgggaggctgcagttta
aacgaatcaccttttctaatgttgcttaaagggtctacaaggtttaacaaagccaagact
ttcaatatccaccagccattgtgtgacacgccagccgcctctccaaggagcttgaaggtg
aggggagatgcccagtcttgcccaccacttccagaggcccaggccagccacagagccctc
aggtttggggacagtcccaccagccacttgctattcatgcttccccaggagttgctgaaa
cccaggtcacagtttgccatggacttgcagacaacggcgtccagagggctcgtgttttac
acaggcactaagaactcctttatggctctttatctttcgaaaggacgtctggtctttgca
ctgggggcaggagggaaacaactgaagctcaaaagcaaagagaagtgcagtgacgggaag
tggcacacggtggcatttgggcaagatggagaaagggggcgcttggttgtggatggtctg
aggacccgggagggcagattgcccggaaattccaccgtcagcctcagagcgccagtttac
ctgggatcgtctccatcagggaaaccaaagaccctcccccaaaacagctttgtgggatgc
ctgaggaactttcagttggatttgaaacccctggacactgcttctgcaagctttggggtg
tctccctgcttcgatggctctttggaaaaaggcatctatttttctcaagaaggaggtcat
gtcatcctagctaactccgtgcttttgggtccagaatttaagcttgttttcagcattcgc
ccgagaagtctcactggaattctaattcacatcggaagtcaacccggggagcacttacat
gtttatatggaggcaggaaaggtcacggcctctgtgggcagtgaggcaggtaggatcctg
acatcggtcacaccaaagcagtctctgtgtgacggacattggcactcagtgacagtcacc
ataaaacagcacatcctgcacctgaaactggacgcagactatagttccaccgctggacgg
caccccgtccccactgcccgcacgcgggagcatctgcacattggaggtgtcccagcaaat
ttgaaaaccctgaagcttcctgcgtggaaatcctttcttggctgcctgaagaatattcaa
gtcaaccacattcctgtccctgtcactgaagccgcagaagttcgggggactgtcagtctg
aacggctgtcctgaccactaa

KEGG   Physeter catodon (sperm whale): 102982443
Entry
102982443         CDS       T06011                                 

Gene name
LAMC2
Definition
(RefSeq) laminin subunit gamma-2
  KO
K06246  laminin, gamma 2
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102982443 (LAMC2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102982443 (LAMC2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102982443 (LAMC2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102982443 (LAMC2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102982443 (LAMC2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102982443 (LAMC2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102982443 (LAMC2)
   05145 Toxoplasmosis
    102982443 (LAMC2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_B DUF5660
Other DBs
NCBI-GeneID: 102982443
NCBI-ProteinID: XP_023989618
UniProt: A0A2Y9THB6
LinkDB
Position
4
AA seq 1191 aa
MPALWLSCSLCFSLLLPAAQANSRTPVCDCSGKSRQCVFDEELHRLTGNGFRCLHCSDDT
GGIHCQRCKEGFYRQRDRDRCLPCNCNSKGSLSARCDNAGRCSCKPGVTGDRCDRCLPGF
HTFTEAGCTQDQRLLDSKCDCDPAGISGPCDTGRCVCKPAVTGGRCDRCRPGYYHLDGGN
PEGCTQCFCYGHSASCGSSGDYSVHKIASTFDQDVDGWKAVQRNGSPAKLQWSQRHRDVF
SSARQSDPVYFIAPAKFLGNQQVSYGQSLTFDYRVDKGGRHPSAHDVILEGAGLWITAPL
IPPGKTLPCGVTKTYTFRLNENPSSNWSPQLSYFEYRRLLRNLTALRIRATYGEYSTGYL
DNVTLISARPISGAPAPWVEQCVCPVGYKGQFCQECASGYKRDSARLGPFGTCIPCNCQG
GGACDPDTGDCYSGDENPDIECADCPIGFYSDPRDPRSCKPCPCPDGLDCSAMPETEEVV
CDNCPHGVTGARCELCADGYFGDPFGEHGPARPCQPCQCNNNVDPSASGNCDHLTGRCLK
CLHGTAGAHCDQCKAGYFGDPLAPNPMDKCRVCNCNPVGSEPMECRSDGSCVCKPGFGGL
NCEHAALTNCPACYNQVKIQMDQFMQQLQSLETLISKAQGGGGAVPNTELEGRMQWVEQT
LRDILREAQISEGAIRSLNLQLAKARSQENSYRNRLDDLKMTVERVRALGSQHQNRVQDT
HRLVTQMRLSLEESEAALRNTYVPPSEDYMGPNSFKSLAQEATRLADSHVESANNMEQLA
RETEDYSKQALSLVRKALSEGGGSGRLDGSVVQGLVGKLEKTKSLAQQLSREATQTDIEA
DRSYQHSLRLLSSASQLQGVNDQSFQVEMKRIRQNADSVSSLVTKHVDEFKHVQSNLGNW
EEETRKLLQNGKNGRQKSDQLLSRANLAKSRAQEALSMGNATFYEVENILKNLREFDLQV
EVRKAEAEEAMKRLSYISQKVADASDKTKRAETALGGAAAGAQRAKTAAGEALEITGNIE
QEMGSLNLEANVTADGALAMEKGLATLKSEMREVEGDLARKEREFGTDTDAVQTVITEAQ
RVDNRARNAGVTIQDTLNTLDGILHLIDQPGSMDEEGLILLEQKLFRAKAQINSQLRPLM
SKLEERASCQRGHLRSLETSIDGILADVKNLENIRDSLPPGCYNTQALEQH
NT seq 3576 nt   +upstreamnt  +downstreamnt
atgcctgcgctctggctcagctgcagcctctgcttctcgctcctcctgcctgcagcccag
gccaactccaggacgccagtctgtgattgcagtgggaagtccaggcaatgtgtgtttgat
gaggaacttcacagactgacaggaaatggattccgctgcctccactgcagtgacgacacg
ggtggcattcactgccagaggtgcaaggaaggattttaccgccagagagacagagaccgc
tgtttaccctgcaactgtaactccaaaggttctcttagcgctcgatgtgacaatgctgga
cgatgcagctgtaagccaggtgtgacaggagacagatgtgaccgctgtctgccaggcttc
cacaccttcactgaggctgggtgcacccaagaccagaggctcctagactccaagtgtgac
tgtgatccagctggcatctcggggccctgtgacacaggccgctgtgtctgcaagccagct
gtcaccggagggcgctgtgataggtgccgaccaggttactatcacctggatgggggaaac
cctgagggctgtacccagtgtttttgctacgggcattcagccagctgcggaagctctgga
gactacagtgtccataaaatcgcctctaccttcgatcaagatgttgatggctggaaggct
gtccaaagaaatgggtctcctgcaaaactccagtggtcacagcgccatcgagatgtgttt
agctcagcacgacagtcagaccctgtctattttatagctcctgccaaatttcttgggaat
caacaggtgagctatgggcaaagcctgacttttgactaccgtgtggacaagggaggcaga
cacccatccgcccatgacgtgatcctggaaggtgctggtctatggatcacagctcccttg
attccacctggcaagacactgccttgtggggtcaccaagacttacacattcagattaaat
gaaaatccaagcagtaattggagcccccagctgagttacttcgagtatcggaggttactg
cggaacctcacagccctgcggatccgagctacctatggagaatacagtactgggtacctt
gacaatgtgaccctgatctcagctcgccccatctctggagccccagcaccatgggttgaa
caatgtgtatgtcctgttgggtacaagggacagttctgccaggaatgtgcttctggctac
aaaagagattcagccagactgggaccttttggcacctgtattccatgtaactgccaagga
ggaggggcttgtgatccagacacaggagattgttattcaggggatgaaaaccctgacatc
gagtgtgccgactgccccattggtttctacagcgacccgcgtgacccccgcagctgcaag
ccctgcccctgtcccgatgggctcgactgctccgcgatgcccgagacagaggaggtggtg
tgcgataactgtccccacggcgtcaccggtgcccgctgcgagctctgtgccgatggctac
tttggggacccctttggggaacacggcccagcgaggccttgtcaaccctgtcagtgcaac
aacaacgtggaccccagcgcctctgggaactgtgaccacctgacgggcaggtgtctgaag
tgcctccacggcacagccggtgcccactgtgaccagtgcaaagcgggctactttggggac
cccttggctcccaacccaatggacaagtgtcgagtttgcaactgcaacccagtgggctcg
gagcccatggagtgtagaagtgatggcagctgtgtttgcaagccaggatttggcggcctc
aactgtgagcatgcagcattaaccaactgtccagcttgctataatcaagtgaagattcag
atggatcagtttatgcagcagctccagagcctggagaccctgatttcaaaggctcagggt
ggcggtggagcagtacccaacacagagctggagggcaggatgcagtgggttgagcagacc
cttcgggacattctgagagaagcccagatttcagaaggtgctatcagatccctcaatctc
cagttggccaaggccaggagccaagagaatagctaccggaaccgcctggatgacctcaag
atgactgtggaaagagttcgggccctgggcagccagcatcagaaccgagttcaggatact
cacaggctcgtcactcagatgcgcctgagcctggaggagagtgaagctgccctgcgaaac
acctacgttcctccctcagaggactatatggggccaaacagctttaaaagtctagctcag
gaggccacgagactggcagacagccacgttgagtcagccaataacatggagcaactggcc
agggaaaccgaggactactccaaacaagctctatccttggtgcgcaaggctctgagtgaa
ggaggcggaagcggcaggctggacggctccgtggtgcaagggcttgtgggaaaattggag
aaaaccaagtctctggcccagcagttgtcaagggaggccactcaaactgacattgaagca
gataggtcttaccagcatagtctccgccttctcagttcagcatctcagcttcagggggtc
aatgatcagtcctttcaggtagaaatgaagaggatcagacaaaacgctgactctgtctca
agcctggtgactaagcatgtggatgagttcaagcatgtgcaaagcaatctgggaaactgg
gaagaagaaacccggaagctcttacagaatgggaagaatgggagacagaaatcagatcaa
ctgctttcccgtgccaaccttgctaaaagcagagcccaagaagcactaagtatgggcaat
gccactttttatgaagttgagaacatcttaaagaacctcagagagtttgacctgcaggtt
gaagtcagaaaagcagaagctgaagaggccatgaagagactctcctacatcagccagaag
gttgcagatgccagcgacaagaccaagcgagcggaaacagccctgggtggcgctgctgct
ggcgcccagagggcaaagaccgcagccggggaagccctggagatcaccggcaacatagaa
caggagatggggagtctgaacttggaggccaacgtgacagcagatggagccttggccatg
gagaagggactggccactctcaagagtgagatgagggaagtggaaggagacctggcaagg
aaggagcgggagttcggcacggatacggatgcggtgcagacggtaatcacagaagcccaa
agagttgataacagagccaggaatgctggagttacgatccaagacacactcaacacattg
gacggcatcctacacctaatagaccagcctggcagtatggatgaagaggggctgatctta
ttggagcagaagcttttccgagccaaggcccagatcaacagtcagctgcggccgttgatg
tcaaagctggaagagagggcaagctgtcagaggggccacctccgttcgctggagacaagc
atagatgggattctggctgatgtgaagaacctggagaacattagggacagcctgccccca
ggctgctacaatacccaggctcttgagcagcactga

KEGG   Physeter catodon (sperm whale): 102982705
Entry
102982705         CDS       T06011                                 

Gene name
LAMC1
Definition
(RefSeq) laminin subunit gamma-1
  KO
K05635  laminin, gamma 1
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05020  Prion disease
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102982705 (LAMC1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102982705 (LAMC1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102982705 (LAMC1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102982705 (LAMC1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102982705 (LAMC1)
  09164 Neurodegenerative disease
   05020 Prion disease
    102982705 (LAMC1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102982705 (LAMC1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102982705 (LAMC1)
   05145 Toxoplasmosis
    102982705 (LAMC1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N Laminin_B
Other DBs
NCBI-GeneID: 102982705
NCBI-ProteinID: XP_023989598
UniProt: A0A2Y9TH99
LinkDB
Position
4
AA seq 1608 aa
MRGSKPAAPALRPRGGLWRVLAVLAAAAAGCARAAMDECTDEGGRSQRCMPEFVNAAFNV
TVVATNTCGTPPEEYCVQTGVTGVTKSCHLCDAAQPHLQHGAAFLTDYNNQADTTWWQSQ
TMLAGVQYPNSINLTLHLGKAFDITYVRLKFHTSRPESFAIYKRTREEGPWVPYQYYSGS
CENTYSKANRGFIRTGGDEQQALCTDEFSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQ
EWVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYAISDFAVGGRCKCNGHASECVKNEFD
KLVCNCKHNTYGVDCEKCLPFFNDRPWRRATAEGASECLPCDCNGRSQECYFDPELYRST
GHGGHCTNCQGNTDGANCERCRENFFRLGNREACSPCLCSPVGSLSTQCDSYGRCSCKPG
VMGDKCDRCQPGYHSLTEAGCRPCSCNPSGSIDECNFETGRCVCKDNVEGFNCERCKPGF
FNLESSNPMGCTPCFCFGHSSVCTNAVGYSVYSITSTFQIDEDGWRVEQRDGSEASFEWS
SERQDIAVISDSYFPRYFIAPAKFLGKQVLSYGQNFSFSFRVDRRDTRLSAEDLVLEGAG
LRVSVPLIAQGNSYPSETTVKYVFRLHEATDYPWRPTLTPFEFQKLLNNLTSIKIRGTYS
ERSAGYLDDVTLASARPGPGVPATWVESCTCPVGYGGQFCEMCLSGYRRETPSLGPYSPC
VLCTCNGHSETCDPETGVCNCRENTAGPHCEKCSDGYYGDSTVGTSSDCQPCPCPGGSSC
AVVPKTQEVVCTNCPTGTTGKRCELCDDGYFGDPLGRNGPVRLCRLCQCNDNIDPNAVGN
CNRLTGECLKCIYNTAGFYCDRCKDGFFGNPLAPNPADKCKACDCNPYGTVKQQSSCNPV
TGQCECLPHVTGRDCGACDPGFYNLQSGQGCERCDCHALGSTNGQCDMHTGQCECQPGIT
GQHCEGCEVNHFGFGPEGCKPCDCHPEGSLSLQCKDDGLCECKEGFVGNRCDQCEENYFY
NRSWPGCQECPACYRLVKDKVADHRAKLQELENLIANLGTGDEMVTDQAFEDRLKEAERE
VTDLLREAQDVKDVDQNLMDRLQRVNNTLFSQISRLQNIRNTIEETGNLAEQACARVEST
EQLIEIASRELEKAKVAIANVSITQPESTGDPNNMTLLADEARKLAERHKQEADDIVRVA
KAANATSTEAYNLLLRTLAGENQTAFEIEELNRKYEQAKNISQDLEKQAARVHEEAKKAG
EKAVEIYASVAQLTPVDSEAMENEANKIKKDAEDLDHLIDQKLKDYEDLRQDMRGKEFEV
KNLLEKGKTEQQTADQLLARADAAKALAEEAAKKGRNTLREANDILSNLKDFDRRVNDNK
TAAEEALRRIPAINQTIIEANEKTREAQLALGNAAADATEAKNKAHEAERIASAVQKNAT
STKAEAERTFAEVTDLDNEVNNMLKQLQEAEKELKKKQDDADQDMMMAGMASQAAQEAEI
NARKAKNSVTSLLNLINDLLEQLGQLDTVDLNKLNEIEGTLNKAKDEMKVSDLDRKVSNL
ENEARRQEAAIMDYNRDIEEILKDIRNLEDIKKTLPSGCFNTPSIEKP
NT seq 4827 nt   +upstreamnt  +downstreamnt
atgaggggcagcaaaccggccgcgccggcgctgcggccgcggggcggcctctggcgggtg
ttggctgtgctggcggccgctgcggccggctgcgcccgggcagccatggacgagtgcacg
gacgagggcgggcggtcgcagcgctgcatgcccgaattcgtcaacgccgccttcaacgtg
accgtggtggccaccaacacgtgcgggactccgcccgaggagtactgtgtgcagaccggg
gtgaccggggtcaccaagtcctgtcacctgtgcgacgccgcgcagccccacctgcagcac
ggggcagccttcctgacggactacaacaaccaggccgacaccacctggtggcagagccag
accatgctggccggggtgcagtaccccaactccatcaacctcacgttgcatctgggaaaa
gcgtttgacatcacgtatgtgcgcctcaagttccacaccagccgcccagagagcttcgcc
atttacaagcgcacgcgggaagaggggccctgggtcccttaccagtactacagcggctcc
tgcgagaacacctactcgaaggcaaaccgaggcttcatcaggacgggaggggacgagcag
caggccttgtgcaccgatgaattcagtgacatttcccctctcaccgggggcaacgtggcc
ttttcaaccctagaaggaaggcccagtgcctacaactttgacaatagccctgtgctgcag
gaatgggtaactgccactgacatcagagtaacactcaatcgcctgaacacttttggggat
gaagtgtttaatgaccccaaagttctcaaatcctattattatgcaatctctgatttcgct
gtgggtggtaggtgtaaatgtaacggacatgcaagcgagtgcgtgaagaatgaatttgac
aagctggtgtgtaattgcaaacataatacttatggcgtagactgtgagaagtgccttcct
ttcttcaatgaccggccgtggaggagggcgaccgccgagggtgccagcgaatgcctgccc
tgtgactgcaatggccgatcgcaggaatgctactttgaccctgaactgtaccgttccacg
ggccatggtggccactgtaccaactgccagggtaacacggacggcgccaactgtgagagg
tgccgggagaacttcttccgcctcgggaacagagaagcctgctctccctgcctctgtagt
cctgttggctctctcagcacacagtgtgatagctatggcaggtgtagctgtaagcctgga
gtgatgggggacaaatgtgaccgttgccagcctgggtaccattctcttactgaggctggg
tgcaggccatgctcttgtaatccttctggcagcatagacgaatgtaattttgaaacagga
agatgtgtttgcaaagacaatgtcgaaggcttcaattgtgagagatgcaaacctggattt
tttaatctggaatcatctaatcctatgggttgcacaccttgcttctgctttgggcattct
tctgtctgtacaaatgctgttggctacagtgtttattctataacctctacctttcagatt
gatgaggatgggtggcgtgtagaacaaagggatggctctgaagcatcatttgagtggtcc
tctgagaggcaagatatcgctgtcatttcagatagctactttcctaggtacttcattgcc
cccgcaaagttcttgggcaagcaggtgttgagttatggtcagaacttctccttctccttt
cgagtggacaggcgagacactcgcctctctgcagaagacctggtgcttgagggagctggc
ctaagagtgtctgtgcccttgatcgctcagggcaattcctatccgagtgagaccactgtg
aaatatgtcttcaggctccatgaagcgacagattacccttggaggcctactcttacccct
ttcgaatttcagaaactcctaaacaatttgacctctatcaagatccgtgggacatatagt
gagagaagtgctggatatttggatgatgtcaccctggcaagcgctcgtcctgggcctggg
gtccctgcaacttgggtggagtcctgcacctgtcctgtgggatacggagggcagttttgt
gagatgtgcctctcaggttacagaagagaaactccgagtcttggaccatacagtccctgt
gtgctttgtacctgcaatgggcacagtgagacctgtgaccctgagacaggtgtttgtaac
tgcagagagaatacagccgggccccactgtgagaagtgtagtgacgggtactatggagat
tcaaccgtgggcacctcctctgattgccaaccctgtccgtgtcctgggggctccagttgt
gctgttgtccctaagacacaggaggtggtgtgcaccaactgtcctactggcaccactggt
aaaagatgtgagctctgtgatgacggctactttggagaccctctgggtagaaacggccct
gtgaggctttgtcgcctgtgccagtgcaatgacaacatcgatcccaatgcagttggaaat
tgcaatcgcttgacgggagaatgcctgaagtgcatctataacacagctggcttctactgt
gaccggtgcaaagacggattttttggaaatcccctggctccgaatccagcagacaagtgc
aaagcctgcgactgcaacccctacgggaccgtgaagcagcagagcagctgtaaccctgtg
accgggcagtgtgagtgtctgcctcatgtgaccggccgggactgtggagcttgtgaccct
ggattctacaacctgcagagcgggcagggctgtgagaggtgtgactgccatgctttgggt
tccaccaatggacagtgtgacatgcacactggccagtgtgagtgccagcctggcatcacc
ggtcagcactgcgagggctgcgaggtcaaccacttcgggtttggacctgaaggctgtaaa
ccctgtgactgtcatcctgaggggtctctttccctccagtgcaaagatgatggtctctgt
gaatgcaaagaaggctttgtggggaatcgctgtgaccagtgtgaagagaactatttctac
aatcggtcttggcctggctgccaggaatgtccagcttgttaccggctggtaaaggataag
gttgctgatcatcgagcaaaactccaggaattagagaatctcattgcaaaccttggaact
ggggatgagatggtgacggatcaagcctttgaggatagactaaaggaagcagagagagag
gtgactgacctccttcgtgaggcccaggatgtcaaagatgtagaccaaaatttgatggat
cgcctccagagagtgaataacaccttattcagccaaattagccgtttacagaatattcgg
aataccattgaagagaccggaaacttggctgaacaagcatgtgcccgggtagaaagcaca
gagcagttgatcgaaatcgcatccagagaacttgagaaagcaaaagttgccattgccaat
gtgtcaatcactcagccagaatctacaggggacccaaacaacatgactcttttggcggat
gaggcacgaaagcttgctgaacgccataaacaagaagctgatgacattgtacgagtggca
aaggcagccaatgctacatcaactgaggcatataatctgcttttgaggacactggcagga
gaaaatcaaacagcatttgagattgaagagcttaatagaaagtatgaacaagcgaagaac
atctcacaggatctagagaagcaagctgcccgagtccatgaggaggccaagaaggctggt
gagaaagctgtggagatctatgccagtgttgcccagctgactcctgtggactcagaagcc
atggagaatgaggcaaataaaataaagaaggacgctgaggatctggatcatctgattgac
cagaagttaaaagattatgaggacctcagacaggacatgagagggaaggaatttgaagta
aagaaccttctggaaaaaggaaagaccgaacagcagactgccgaccaactcctagcccga
gctgatgctgccaaggctcttgcggaagaagctgctaaaaagggacgaaataccttacga
gaagccaatgacattctcagcaacctgaaagattttgataggcgagtgaatgataacaag
acggctgcagaggaggcactaaggagaatccctgccatcaaccagactataattgaagcc
aatgaaaagaccagggaagcacagctggctctgggcaacgctgccgcagatgccaccgag
gccaagaacaaggcccacgaggcggagaggatcgccagcgctgtccaaaagaatgctacc
agcaccaaggcagaagctgagaggacttttgcagaagttacagatctggataacgaggtg
aataatatgttgaaacagctacaggaagcagaaaaggagctgaagaaaaaacaagatgat
gctgaccaggatatgatgatggcagggatggcttcacaggctgctcaggaagctgagatc
aatgccagaaaggccaaaaactctgtcaccagcctcctcaaccttattaatgacctcctg
gagcagctggggcagctggacacagtggacctgaataagctaaatgagattgaaggtacc
ctgaacaaagccaaagatgaaatgaaagtcagtgatctagacaggaaagtgtctaacctg
gagaatgaagccaggaggcaggaggctgccatcatggattataacagagacattgaggag
atcctgaaggacatccgcaacctggaagacatcaagaagaccttaccctccggctgcttt
aacaccccgtccattgaaaagccctag

KEGG   Physeter catodon (sperm whale): 102983001
Entry
102983001         CDS       T06011                                 

Gene name
LAMA4
Definition
(RefSeq) laminin subunit alpha-4 isoform X1
  KO
K06241  laminin, alpha 4
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05143  African trypanosomiasis
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102983001 (LAMA4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102983001 (LAMA4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102983001 (LAMA4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102983001 (LAMA4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102983001 (LAMA4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102983001 (LAMA4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102983001 (LAMA4)
   05145 Toxoplasmosis
    102983001 (LAMA4)
   05143 African trypanosomiasis
    102983001 (LAMA4)
SSDB
Motif
Pfam: Laminin_G_2 Laminin_G_1 Laminin_I Laminin_II Laminin_EGF Laminin_G_3 Dna2
Other DBs
NCBI-GeneID: 102983001
NCBI-ProteinID: XP_023971343
UniProt: A0A2Y9RYG7
LinkDB
Position
10
AA seq 1824 aa
MALSSACYAVLPLWLLWGAACSRAASGDDSAFPFDIEGSSVVGQLDPPETIEARVAPGGL
PPAAKKCDAGFFFAQSGECLPCDCNGNSNECLDGSGLCVHCQRNTTGEHCEKCLNGYIGD
AIRGAPRVCQPCACPLPHVANFAESCYRKNGAVRCICKENYAGPNCERCAPGYYGNPLLI
GSTCKKCDCSGNSDPNLIFEDCDEVTGQCRNCLRNTTGFKCEHCAPGYYGDARRAKNCAV
CSCGGGPCDSVTGECLEEGSEPPTGMECPTISCDKCIWDLTDDLRLAALSIEESKSGLLS
VSSGAAAHRHVNEINSTIYLLKTKLSERENQYVLRKIQINNAENTMKSLLSDLEELAGRE
SQATRKSQLAQKESMDTINHATQLAEQAHDVRDKIQEINNKMLYYGEYQELSPEEISEKL
VLAQKMLEEIRHRQPFLTQRELVDEEADEAYELLSQAESWQQQYNDTHSLFPVVLEQLDD
YNAKLSDLQESLDLALDHIRDAEDMNRATAARQRDHKKQHDRVREQMAGVNASLRMSSYS
LTMPRLTLSELDDIIKNASGIYAEIDGAKNELQRKLSNLSNLSHDLVQEAVDHAQNLQQE
ADELIRNLHSSDVNGLVQKALDASNVYENIANYVSEANETAEQALNITDRIYDAVSGIDT
QIIYHKDESENLLNQARELQARANSSSDEAVADTSRRVGGALARKSALKNRLNDAIKRLQ
ATERGDAQQRLGQSKLITAEANKTTMEVQQAAAPMANSLTSWSQNLQHFDSSAYNTAVDS
ARDAVGNLTEVVPQLLDQLRTVEQKRPASNVSASIQRIRELIAQTRSVASKIQVSMMFDG
QSAVEVHPKARMDDLKTFTSLSLYMKPPPVKQPELGGTADQFVLYLGSKNAKKEYMGLAI
KNDNLVYVYNLGTKDVEIPLDSKPVSSWPAYFSIVKIERVGKHGKVFLTVPSLSSTAEEK
FIKKGEFAGDDSLLDLDPEDTVFYVGGVPSNFKLPASLNLPGFVGCLELATLNNDVISLY
NFKHIYNMDPSKSVPCARDKLAFTQSRAASYFFDGSSYAVVRDITRRGKFGQVTRFDIEV
RTPADNGLVLLMVNGSMFFTLEMRNGYLHVSYDFGFSNGPVHLEDTMKKAQINDAKYHEI
SIIYHNDKKMILVVDRRHVKSMDNEKMKVPFTDIYIGGAPPEILESRTLRVHLPLDINFR
GCMKGFQFQKKDFNLLEQTETLGVGYGCPEDSLISRRAYFNGQSFIASIQKISFFDGFEG
GFNFRTLQPNGLLFYYASGSDVFSISLDNGTVIMDVKGIKVQSADKQYNDGLSHFIITSV
SPARYELIVDKSRLGSKNPTKGKVEQTQAGEKKFYFGGSPISPQYANFTGCISNAYFTRL
DRDVEVEDFQRYSEKVHTSLYECPIESSPLFLLHKKGKNSSKPETSQNKKGEKSKDAPSW
DPVGLKFLERNVPRDSHCHLSNSPRAIEHAYQYGGTANSRQEFEHLKGDFGEKSQFSIRL
KTRSSHGMIFYVSDQEENDFMALFLAHGRLVFMFNVGHKKLKIRSQEKYNDGLWHDVIFI
REKSSGRLIIDGLRVLEESLPPTGATWKIKGPIYLGGVAPGKAVKNVQINSVYSFSGCLS
NLQLNGASITSASQTFSVTPCFEGPMETGTYFSTEGGYVVLDESFNIGLKFEIAFEVRPR
SSSGTLVHGHSVNGEYLNVHMKNGQVIVKVNNGIRDFSTLVTPKQSLCDGRWHRITVIRD
SNVVQLDVDSEVNHVVGPLNPKPVDQREPVFVGGVPESLLTPRLAPGRPFTGCIRHFVID
GRPVSFSKAALVSGAVSINSCPAA
NT seq 5475 nt   +upstreamnt  +downstreamnt
atggctttgagctcagcctgctacgcagtcctgcccctgtggctcctctggggcgccgcc
tgctcccgcgccgcgtctggggacgacagcgcttttccttttgacattgaagggagctca
gtggtcggccaactagacccgcctgagaccatcgaggcccgcgtggctcccggaggcctg
ccgcctgctgccaagaaatgcgacgcaggattcttttttgcccagtcgggagaatgtttg
ccctgtgactgtaatggtaattccaacgagtgcttggatggctctggactctgtgtgcac
tgccagcggaacacaacaggcgagcactgtgagaaatgtctaaatggttatatcggagat
gccatcaggggagcaccccgcgtctgccagccgtgcgcctgccccctgcctcatgttgcc
aattttgcagaatcctgctataggaaaaatggagctgttcggtgtatttgtaaagaaaac
tatgctggacctaactgtgaaagatgtgctcctggttactatggaaaccccttgctgatt
ggaagcacctgtaagaaatgtgactgcagtggaaattcagatcctaacctgatctttgaa
gactgcgatgaggtcactggccagtgtaggaattgcttacgcaacaccaccggattcaag
tgtgaacactgcgcacctggctactatggggatgccaggagagccaagaactgtgcagtg
tgcagttgtgggggaggcccatgtgacagcgtaaccggagaatgcttggaagaaggttct
gaaccccctacaggcatggagtgcccaaccataagctgcgataagtgcatctgggatctg
actgatgaccttcggttagcagcgctctccattgaagaaagcaaatctggtctgttgagt
gtgtcatctggtgccgctgctcataggcatgtgaatgaaatcaactccaccatctacctc
ctcaaaacaaaattgtcagaaagagaaaaccagtatgtcctaagaaagatacaaatcaac
aatgctgaaaacacaatgaaaagccttctctctgacctggaggaattagctggaagggaa
agtcaagccacaagaaagagccagttggctcagaaggaaagcatggataccattaaccac
gcaactcagcttgcagagcaagcccacgatgtgagggataaaatccaagagatcaacaat
aagatgctctattatggggaatatcaggaacttagccctgaggaaatctctgagaagctg
gtgttggcccagaagatgcttgaggagatccgacaccgtcaaccattcctcacccaacgg
gaacttgtggacgaggaggcagacgaagcctatgaattgctgagccaggctgagagttgg
cagcagcagtacaacgatactcactctctgtttcctgtggtcctggagcagttggacgac
tataatgctaagttgtcagacctccaggagtcacttgacctggcccttgaccatatcaga
gacgctgaagacatgaacagagccacagcagccaggcagcgggatcacaagaaacagcat
gacagagtgagggaacagatggcaggggtgaacgcgtctctgaggatgtcttcctactca
ctgacaatgcctcgtctgaccctttcagaacttgatgacataataaagaatgcatcaggg
atttatgctgaaatagatggagccaaaaatgaactacaaagaaaactatccaacctaagc
aacctcagtcatgatttagttcaggaggctgttgaccatgcacagaaccttcaacaagag
gctgatgaattgatcaggaatttgcacagttcagatgtgaatgggctggtacagaaggct
ttggacgcgtcaaatgtctatgaaaatattgccaattatgttagtgaagccaacgaaaca
gcagaacaggctttgaacatcactgatcgaatttatgatgctgtgagtgggattgatacg
caaatcatttaccataaagatgagagtgagaacctcctcaatcaagccagagagctgcaa
gcaagggcaaattctagcagtgacgaggcagtggctgacaccagtaggcgcgtgggtgga
gccctagcaaggaagagtgccctcaaaaacagattaaatgatgccattaagagactacaa
gccacagagagaggtgacgctcagcagcgcctgggtcagtccaagctgatcaccgcggaa
gctaacaagaccacaatggaagtccagcaggcggccgccccgatggctaacagtctaacc
agctggtcacagaatcttcaacattttgactcatctgcttacaacactgcagtggactct
gccagagatgcagttggaaatctgacagaggttgttcctcagctcctggatcagcttcgc
acggtagagcagaagcggcctgcaagcaacgtttctgccagtatccagaggatccgagag
ctcattgctcagaccagaagcgttgccagcaagatccaagtctccatgatgtttgatggt
cagtcggcagtcgaagtgcatcccaaagccagaatggatgacttaaagacctttacgtcc
ctgagcctgtacatgaagcctcctcctgtgaagcaaccagagctgggtgggactgcagat
cagtttgtcctatatctcggaagcaaaaacgccaaaaaagagtacatgggtcttgcaatc
aaaaatgataacctggtgtacgtttataatttgggaactaaggatgtagagattcccctg
gactccaagcctgtcagttcctggcctgcttacttcagcattgtcaagattgaaagggta
ggaaaacacggaaaggtgtttttaacagtcccaagtctaagtagcaccgcagaggaaaag
tttattaaaaagggggaatttgcgggagatgactccttgttggatctggatcctgaggac
actgtgttttatgttggtggcgtgccttcaaacttcaagctccctgccagcttaaacctg
cctggcttcgttggctgcctggaattggccactttgaacaatgatgtgatcagcttgtac
aactttaagcacatctataatatggatccctccaaatcagtgccctgtgccagagataaa
ctggcctttactcagagtcgggctgccagctatttcttcgatggttctagttatgccgtg
gtgagggatatcacaaggagagggaaattcggtcaggtgactcgctttgacatagaagtt
agaacaccagctgacaacggccttgtgctcctgatggtcaatggaagtatgtttttcacc
ctggaaatgcgcaatggttacctacatgtgtcctatgactttgggtttagcaatggcccg
gtgcatcttgaagacacaatgaagaaagctcaaattaatgatgcaaaataccatgagatc
tcaatcatttaccacaatgataagaaaatgattttggtagttgacagacgacatgtcaag
agcatggacaatgaaaagatgaaagtaccttttacagacatatatattggaggagctccc
ccagaaattttagaatcaaggaccctaagagtacaccttcccctggatatcaatttcaga
ggatgcatgaagggtttccagttccaaaagaaagatttcaatttattagaacagacagaa
accctgggagttggttatggatgcccagaagactctcttatatcccgcagagcatatttc
aatggacagagtttcattgcttcaattcagaaaatatcttttttcgatggctttgaggga
ggttttaatttccgaacattacagccaaatgggttactattctattatgcttcagggtca
gacgtgttctccatttcactggataatggcactgtgatcatggatgtaaagggaatcaag
gtgcagtcggcagataagcagtataatgacgggctgtcccacttcatcattacctctgtc
tcgcccgcaaggtatgaattgatagtagataaaagcagacttgggagtaagaaccctacc
aaaggaaaagtggaacagacacaagcaggtgaaaagaagttctacttcggtggctcaccc
atcagtccccagtatgctaatttcactggctgtataagtaatgcctactttaccaggttg
gatagagatgtggaggtcgaagatttccagcggtattctgaaaaggtccatacttctctt
tatgagtgtcccattgagtcttcaccattgtttctccttcacaaaaaaggaaaaaattcc
tcaaagcctgaaacaagtcagaataaaaagggagagaaaagtaaagatgctccttcatgg
gatcctgttggcctgaaattcctagagaggaatgttccaagagactctcactgccacctt
tccaacagccctagagcaatagagcacgcctatcagtacggagggacagccaacagccgc
caagagtttgaacacttaaaaggagactttggtgaaaaatcccagttttccattcgtctg
aaaacacgttcctcccatgggatgatattctatgtctcggatcaagaagagaatgacttc
atggctctattcttagcccacggccgcttggttttcatgtttaatgttggccacaagaaa
ctgaagattagaagccaagagaaatacaatgatggtttgtggcatgacgtgatatttatt
cgggaaaagagcagcggccgactgataattgatggtcttcgagtcttagaagaaagtctt
ccccctactggagctacctggaaaatcaagggtcctatttatctgggaggtgtggctcct
ggaaaggctgtgaaaaatgtccagataaactcagtctatagcttcagtggctgcctcagc
aatctccagctcaatggggcctccatcacctctgcttctcagacatttagcgtgactcct
tgttttgaaggcccaatggaaacaggaacatacttttcaacagaaggaggatacgtggtt
ctagatgagtctttcaatattggattgaagtttgagattgcatttgaagtccgtcccaga
agcagttctggaacccttgttcacggccacagtgtcaatggggagtacctaaatgttcac
atgaaaaatgggcaggtcatagtgaaagtcaataatggtatcagagacttttccacctta
gtgacacccaagcagagtctctgtgatggcagatggcacagaattacagttattagagat
tcaaatgtggttcagttggatgtagactctgaagtgaaccatgtggttggacccctgaat
ccaaaaccagttgatcaaagggagcctgtgtttgttggaggtgttccagagtctctactg
acaccacgcttggcccctggcagacccttcacaggctgcatccgtcactttgtgattgac
gggcgcccagtgagcttcagtaaagcagccctggtcagcggtgccgtgagcatcaactcc
tgtccagcagcctga

KEGG   Physeter catodon (sperm whale): 102984074
Entry
102984074         CDS       T06011                                 

Gene name
COL2A1
Definition
(RefSeq) collagen alpha-1(II) chain isoform X1
  KO
K19719  collagen type II alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102984074 (COL2A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102984074 (COL2A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102984074 (COL2A1)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102984074 (COL2A1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102984074 (COL2A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102984074 (COL2A1)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102984074 (COL2A1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 102984074
NCBI-ProteinID: XP_007104431
UniProt: A0A2Y9EMB7
LinkDB
Position
6
AA seq 1487 aa
MVRLGAPQTLVLLTLLVAAVLRCHGQDVQKAGSCVQDGQRYNDKDVWKPEPCRICVCDTG
TVLCDDIICEDMKDCLSPETPFGECCPICSTDLTTASGQPGPKGQKGEPGDIKDIVGPKG
PPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFA
AQMASGFDEKAGGAQMGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVAGPMG
PRGPSGPPGKPGDDGEAGKAGKPGERGLPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKG
EAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVG
PAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGSDGIPGAKG
SAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQG
APGPAGEEGKRGARGEPGGVGPAGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPNGLAG
PKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGTRGQPG
VMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPG
PSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGSQGLQGPRG
LPGTPGTDGPKGAAGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPG
KDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSGTAGARGAPGERGETGPPGPAGFAGPPG
ADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPG
AAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGDPGLQGPSGPPGEKGEPG
DDGPSGPDGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPG
PVGPPGLTGPAGEPGREGSPGADGPSGRDGAAGVKGDRGETGPVGAPGAPGAPGSPGPAG
PVGKQGDRGEAGAQGPMGPSGPAGARGMPGPQGPRGDKGETGEAGERGLKGHRGFTGLQG
LPGPPGPAGDQGTSGPAGPSGPRGPPGPIGPSGKDGANGIPGPIGPPGPRGRSGETGPAG
PPGNPGPPGPPGPPGPGIDMSAFAGLGQREKGPDPLQYMRADEAAGNLRQHDAEVDATLK
SLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMDTG
ETCVYPRPASVPKKNWWGSKSKDKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRL
LSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTVLKDGCTK
HTGKWGKTMIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL
NT seq 4464 nt   +upstreamnt  +downstreamnt
atggtccgcctcggggctccccagacgctggtgctgctgacgcttctcgtcgccgctgtc
cttcggtgtcacggccaggatgtccagaaggctggcagctgtgtgcaggacgggcagagg
tataacgataaggatgtgtggaagcccgagccctgccggatctgtgtctgtgacactggg
actgtcctctgcgacgacataatctgtgaagacatgaaagactgcctcagccccgagacc
cccttcggagagtgctgccccatctgctcaactgacctcaccactgccagtgggcaacca
ggaccaaagggacagaaaggagaacctggagacatcaaggatattgtaggacccaaagga
cctcctgggcctcagggacctgcaggtgaacaaggacccagaggtgatcgtggtgacaaa
ggtgaaaaaggtgctcctggacctcgtggcagagatggagagcctgggacccctggaaac
cctggcccccctggccctcctggcccccctggcccccctggtcttggtggaaactttgct
gctcagatggcttcaggatttgatgagaaggctggtggcgcccagatgggagtaatgcag
ggaccaatgggccccatgggacctcggggacctccaggccctgctggtgctcccggacct
caaggatttcaaggcaaccctggtgaacctggggaacccggcgttgctggtcccatgggt
ccccgcggtccttctggcccccctggaaaacctggtgatgatggtgaagctggaaaggct
ggaaaacctggcgagagaggccttcctggtcctcagggtgctcgcggcttcccaggaacc
ccaggccttcctggtgtcaaaggtcacagaggttacccaggtctagacggtgctaaggga
gaagctggtgctccgggtgtgaagggtgagagtggttccccaggagagaacggttctcca
ggcccaatgggtccccgtggcctgcctggtgagagaggacggaccggccctgctggcgct
gcgggtgctcggggcaacgatggccaacccggccccgcagggcctccgggtcccgtgggt
cctgctggcggtcctggcttccccggtgctcctggtgccaagggtgaagctggccccacc
ggtgctcgaggtcctgaaggtgcccaaggtcctcgcggtgaacctggtactcctgggtcc
cccgggccggctggtgcctctggtaaccctggatctgatgggattcctggagctaaagga
tctgctggtgcccctggcattgctggtgctcccggcttccccgggccccgtggtccaccc
ggccctcaaggtgcaactggtcctctgggcccgaaaggtcaaacgggtgagcctggtatt
gctggcttcaaaggtgaacaaggccccaagggagaaccgggccctgctggtccccaagga
gcccccggtcctgctggcgaagaaggcaaaagaggggcccgtggagagcctggtggtgtt
gggcccgctggtccccctggagagagaggtgctcctggcaaccgtgggttcccaggtcag
gatggtctggcaggtcccaagggagcccctggagagcgagggcccaatggccttgctggt
cccaagggagccaatggtgaccctggccgtcctggagaacctggcctgcctggagcccgg
ggtctcactggtcgccctggtgatgctggtcctcaaggcaaagttggtccttctggagcc
cctggtgaagatggtcgccctggacctccaggtccgcaggggactcgtgggcagcctggt
gtcatgggtttccctggccccaaaggcgccaatggtgagcctggcaaagctggtgagaaa
ggactgcctggtgctcccggtctgagaggtctccctggcaaagatggtgagacaggagct
gcagggccccctggacccgctggacctgccggtgaacgaggcgagcagggtgctcctggg
ccatctgggttccagggacttcctggcccgcccggtcccccaggtgaaggtggaaaacca
ggtgaccagggcgttcctggtgaagctggagctcccggcctcgtgggtcccaggggtgaa
cgaggtttcccaggggaacgtggctctcccgggtcccagggcctccaggggccccgcggc
ctccctggcactcctggcactgatggtcccaaaggcgcagctggcccagctggcccccct
ggggctcagggccctccaggtctgcaggggatgcccggtgagagaggagcagctggcatc
gctgggcccaagggagacaggggtgatgttggtgagaaaggccccgagggagcccccggg
aaggacggtggacgaggtctgactggccccattggcccccctggcccggccggtgccaac
ggtgagaagggagaagttggacctcctggtccttcaggaactgctggtgctcgaggtgcc
ccgggtgaacgtggagagactggaccccccgggcctgctggatttgccggtcctcctggt
gctgacggccagcccggtgccaaaggcgagcaaggagaggctggccagaaaggtgacgct
ggtgccccgggtcctcagggcccctctggagctcccgggcctcagggtcctactggtgtg
actggtcctaaaggagcccgaggtgctcaaggccccccgggggccaccggattccccgga
gccgcaggccgcgtcggacccccaggttccaatggcaaccctggaccccccggcccccct
ggtccttctggaaaagatggtcccaaaggcgctcgaggagacagcggcccccctggccga
gctggtgaccctggcctccaaggtccttccggaccccctggcgagaagggagagcctgga
gatgacggtccttctggtcccgacggtcctccaggtccccagggtctggctggtcagagg
ggcatcgtgggtctgcctgggcagcgtggtgagcgtggattccccggcctgcccggcccg
tcgggtgagcctggcaagcagggagctcctggagcatctggagaccgaggtccccctggc
cccgtgggtcctcccggcctgactggtcctgcgggtgaacctggacgcgagggaagccct
ggtgctgacggcccctctggcagagacggtgcagctggagtcaagggtgatcgtggtgag
accggccccgtgggtgcccccggagccccaggggcccctggctctcctggccccgccggc
ccagtcggcaagcagggagaccgaggagaagctggtgcgcaaggccccatggggccctca
ggaccagctggagcccggggaatgccgggccctcaaggcccccgaggtgacaaaggagaa
acgggagaggctggcgagaggggactgaagggacaccgtggcttcactggtctgcagggt
ctgcccggcccccctggtcctgctggagaccaaggtacttctggtcctgccggtccttct
ggccccagaggtcctcctggccccatcggcccctctggcaaagatggcgctaatggaatc
cccggccccattggacctcctggaccccgtggacgttcaggcgaaactggccctgctggt
cctcctggaaatcctggaccccctggccctccaggtccccctggccctggcattgacatg
tccgcctttgctggcctaggccagagagagaagggccccgaccccctgcagtacatgcgg
gccgatgaggcagccggcaacctgagacagcacgacgccgaggtggatgccacactcaag
tccctcaacaaccagatcgagagcatccgcagccccgagggatcccgcaagaaccccgct
cgcacctgccgggacctgaaactctgccaccctgagtggaagagcggagattactggatc
gacccgaaccagggctgcaccctggacgccatgaaggttttctgcaacatggacactggc
gagacctgcgtctacccccgcccagccagcgttcccaagaagaattggtggggcagcaag
agcaaggacaagaaacacatctggtttggagaaaccatcaacggtggcttccacttcagc
tatggagatgacaacctggctcccaacaccgccaacgtccagatgaccttcctgcgcctg
ctgtccaccgagggctcccagaacatcacctaccactgcaagaacagcattgcctacctg
gacgaagccgccggcaacctcaagaaggccctgctcatccagggctccaacgacgtggag
atccgggctgagggcaacagcaggttcacgtatactgttctgaaggatggctgcacgaaa
cacaccggtaagtggggcaagactatgatcgagtaccggtcgcagaagacctcacgcctc
cccatcattgacattgcacccatggacataggagggcccgagcaggaattcggtgtggac
atagggcctgtctgcttcttgtaa

KEGG   Physeter catodon (sperm whale): 102984195
Entry
102984195         CDS       T06011                                 

Gene name
COL1A1
Definition
(RefSeq) collagen alpha-1(I) chain
  KO
K06236  collagen type I alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04611  Platelet activation
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102984195 (COL1A1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102984195 (COL1A1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102984195 (COL1A1)
 09150 Organismal Systems
  09151 Immune system
   04611 Platelet activation
    102984195 (COL1A1)
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102984195 (COL1A1)
  09154 Digestive system
   04974 Protein digestion and absorption
    102984195 (COL1A1)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    102984195 (COL1A1)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102984195 (COL1A1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102984195 (COL1A1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102984195 (COL1A1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102984195 (COL1A1)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102984195 (COL1A1)
SSDB
Motif
Pfam: Collagen COLFI VWC
Other DBs
NCBI-GeneID: 102984195
NCBI-ProteinID: XP_007128607
UniProt: A0A2Y9FQF2
LinkDB
Position
Unknown
AA seq 1463 aa
MFSFVDLRLLLLLAATALLTHGQEEGQEEGQEEDIPPVTCVQNGLRYHDRDVWKPVPCQI
CVCDNGNVLCDDVICDEIKNCPNAKVPAGECCPVCPEGEASPTDQETTGVEGPKGDTGPR
GPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQMSYGYDEKSTGISVPGPM
GPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRP
GERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQM
GPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGSTGPAGPPGFPGAVGAKGEAGPQGSR
GSEGPQGVRGEPGPPGPAGAAGPAGNPGTDGQAGAKGANGAPGIAGAPGFPGARGPSGPQ
GPSGPPGPKGNSGEPGAPGNKGDTGAKGEPGPTGIQGPPGPAGEEGKRGARGEPGPAGLP
GPPGERGGPGSRGFPGSDGVAGPKGPAGERGAPGPAGPKGSPGEAGRPGEAGLPGAKGLT
GSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGSRGQAGVMGFPGPKGAAGEPGKAGERGVP
GPPGAAGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGESGKPGEQ
GAPGDIGAPGPSGARGERGFPGERGVQGPSGPAGPRGSNGAPGNDGAKGDAGAPGAPGNQ
GAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGAPGKDGVRGLTGPIGPPGPAGAPGDK
GETGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGAP
GAAGPTGPPGPIGNVGAPGPKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPSGPA
GKEGSKGPRGETGPAGRAGEVGPPGPPGPAGEKGAPGADGPAGAPGSPGPQGIAGQRGVV
GLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAE
GSPGRDGSPGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPIGPV
GARGPTGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPR
GPPGSAGTPGKDGLNGLPGPIGPPGPRGRTGDAGPAGPPGPPGPPGPPGPPSGGYDFSFL
PQPPQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRD
LKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTEPSVAQKNWYISKNPKDK
RHVWYGESMTGGFQFEYGSQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQT
GNLKKALLLQGSNEIEIRAEGNSRFTYSVTYDGCTSHTGAWGKTVIEYKTTKTSRLPIID
VAPLDVGAPDQEFGIDVGPVCFL
NT seq 4392 nt   +upstreamnt  +downstreamnt
atgttcagctttgtggacctccggctcctgctcctcttagcggccaccgccctcctgacg
cacggccaagaggagggccaagaagaaggccaagaagaagacatcccaccagtcacctgc
gtacagaacggcctcaggtaccatgaccgtgacgtgtggaaacccgtgccctgccagatc
tgtgtctgcgacaacggcaacgtgttgtgcgatgacgtaatctgcgacgaaatcaagaac
tgtcccaacgccaaagtccccgcgggcgagtgctgccccgtctgcccggaaggcgaggcg
tcacccacggaccaagaaaccactggagtcgagggacccaaaggagacactggcccccga
ggcccaaggggacccgctggcccccctggccgagatggcattcctggacaacctggactt
cccggaccccccggacccccaggacctcccggaccccctggcctcggaggaaactttgct
ccccagatgtcttacggctatgatgagaaatcaacaggaatttctgtgcctggccccatg
ggtccttctggtcctcgtggtctccctggcccccctggcgcacctggtccccaaggtttc
caaggcccccctggtgagcctggcgagcccggagcctcaggtcccatgggtccccgtggt
ccccctggcccccctggcaagaacggagatgatggtgaagctggaaagcctggtcgtcct
ggtgagcgcgggcctcctggacctcagggtgctcggggattgcccggaacagctggcctc
cctggaatgaagggacacaggggtttcagtggtttggatggtgccaagggagatgccggt
cctgctggtcccaagggtgagcctggtagccctggtgaaaatggagctcctggtcagatg
ggcccccgtggtctgcctggtgagagaggtcgccctggagcccctggccctgctggtgct
cgtggaaatgatggtgctactggtgctgctggaccccctggttccactggccccgctggt
cctcctggtttccctggtgctgttggtgctaagggtgaagctggtccccaaggatcccga
ggctctgaaggtccccaaggtgtgcgtggtgagcccggcccccctggccctgccggtgcc
gctggccctgccggaaaccctggtactgatggacaagctggtgctaaaggtgccaatggc
gctcctggcattgctggtgctcctggcttcccgggcgcccgaggcccctctgggccccag
ggtcccagtggcccccctggtcccaagggtaacagcggtgaacctggcgctcctggcaac
aaaggagacactggcgccaagggagagcctggccccactggtattcaaggtccccctggc
cctgctggggaagaaggaaagcgaggagcccgaggtgaacccggacctgctggcctgcct
ggaccccctggcgagcgtggtggacctggtagccgcggtttccctggctcagacggtgtt
gctggtcccaagggtcccgctggtgaacgtggtgctcctggccctgctggccccaaaggt
tctcctggtgaagctggtcgccccggtgaagctggtctgcctggtgccaagggtctgact
ggaagccctggcagcccaggtcctgatggcaaaactggcccccctggtcctgctggtcaa
gatggccgccccggacccccaggccctcctggtagccgtggtcaggccggtgtgatgggt
ttccctgggcctaaaggtgctgctggagagcctggcaaggctggagaacgaggcgttcct
ggaccccccggcgctgctggtcctgctggcaaagacggagaagctggagctcagggacct
cccggacctgctggccctgctggtgagagaggtgaacaaggccctgctggctcccccgga
ttccagggtctccctggccctgctggtcctcctggtgaatcaggcaaacccggtgaacag
ggtgctcctggagacattggtgcccccggcccctctggagcaagaggcgagagaggtttc
cccggcgagcgtggtgtgcaagggccctccggtcctgcaggtccccgtgggtccaacggt
gcccctggcaatgatggtgctaagggcgatgctggtgcccctggagcccctggtaaccag
ggcgcccctggccttcaaggaatgcctggtgaacgaggtgcagcaggtctcccaggccct
aagggtgacagaggcgatgctggtcccaaaggtgctgatggtgctcctggcaaagatggc
gtccgtggtctgactggtcccattggtcctcctggccccgctggcgcccccggtgacaag
ggtgaaactggtcccagcggccctgctggtcccactggagctcgtggtgcccccggagac
cgtggtgagcctggtccccccggccctgctggcttcgctggcccccctggtgctgatggc
caacctggtgctaaaggtgaacctggtgatgctggtgctaaaggcgatgctggtgccccc
ggcgctgctggacccactggacctcctggtcccattggtaacgttggtgctcccggaccc
aaaggtgctcgtggcagcgctggtccccctggtgctactggtttccctggtgctgctggc
cgagtcggtccccctgggccctctggaaatgctggaccccctggaccttctggtcctgct
ggcaaagaaggcagcaaaggcccccgtggtgagactggccccgctgggcgtgctggtgaa
gtcggtcctcccggtccccctggccccgctggtgagaaaggagcccctggtgctgatgga
cctgctggcgctcccggctctcccggacctcagggtattgctggacagcgtggcgtggtc
ggcctgcccggtcagcgaggagaaagaggcttccctggtcttcctggcccctctggtgaa
cccggcaaacaaggtccttctggagcaagtggtgaacgtggcccccctggtcccatgggc
ccccccggattggctggaccccctggcgagtctggacgtgagggagcccctggtgctgaa
ggatcccctggacgagatggttctccgggccccaagggtgaccgtggtgagaccggccct
gctggaccccctggtgctcctggtgcccctggtgcccccggccccgtcggccctgctggc
aagagcggcgatcgtggtgagactggtcctgctggtcctgctggtcccattggccccgtt
ggtgcccgcggccccactggaccccaaggcccccgtggtgacaagggtgagacaggcgaa
cagggcgacagaggcattaagggtcaccgtggcttctctggtctccagggtccccctggc
cctcccggctctcctggtgagcaaggtccctccggagcttctggtcctgctggtccccgc
ggtccccctggctctgctggtactcctggcaaagatggactcaacggtctcccaggcccc
atcggtccccctggtcctcgtggtcgcactggtgatgctggtcctgctggtccccccggc
cctcccggaccccctggtccccctggtcctcccagcggtggttacgacttcagcttcctg
ccccagccacctcaagagaaggctcacgatggtggccgctactaccgggccgatgatgcc
aatgtggtccgtgaccgtgacctcgaggtggacaccactctcaagagcctgagccagcag
atcgagaacatccggagccctgaaggcagccgcaagaaccctgcccgcacctgccgtgac
ctcaagatgtgccactccgactggaagagcggagaatactggattgaccccaaccaaggc
tgcaacctggatgccattaaggtgttctgcaacatggagacaggtgagacctgcgtgtat
cccactgagcccagcgtggcccagaagaactggtacatcagcaagaaccccaaggacaag
aggcacgtctggtacggcgagagcatgaccggcggattccagttcgagtacggcagccag
ggctccgatcctgccgatgtggccatccagctgactttcctgcgcctgatgtccactgag
gcctcccagaacatcacctaccactgcaagaacagcgtggcctatatggaccagcagacc
ggcaacctcaagaaggccctgctcctccagggctccaacgagattgagatccgggccgag
ggcaacagccgcttcacctacagcgtcacctacgacggctgcacgagtcacaccggagcc
tggggcaagacagtgatcgaatacaaaaccaccaagacctcccgcttgcccatcatcgat
gtggcccccttggacgttggcgccccagaccaggaattcggcatcgacgttggccctgtc
tgcttcctgtaa

KEGG   Physeter catodon (sperm whale): 102984497
Entry
102984497         CDS       T06011                                 

Gene name
COL6A6
Definition
(RefSeq) collagen alpha-6(VI) chain
  KO
K06238  collagen type VI alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102984497 (COL6A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102984497 (COL6A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102984497 (COL6A6)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102984497 (COL6A6)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102984497 (COL6A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102984497 (COL6A6)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102984497 (COL6A6)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102984497 (COL6A6)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102984497 (COL6A6)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   102984497 (COL6A6)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Integrin_beta
Other DBs
NCBI-GeneID: 102984497
NCBI-ProteinID: XP_007115525
UniProt: A0A2Y9F759
LinkDB
Position
1
AA seq 2282 aa
MKMLLTLLLMIICSHVSVNQHSGPEYADVVFLVDSSDHLGTKSFPFVKTFINKMITSLPI
EAHKYRVGLAQYSDGLHREFQLSTFESRGPMLKHLKKNFRFLGGSLRIGNTLQEVHKTYF
SNGRDRRHFPLILVVLASAESEDAVEEASEALRNDGVRIISVGMQGVSEETLKAMATDQF
HYSLRTVRDLSTFSQNMTKILKDAAQYKEGAVNDILVEVCQGPSVADVVFLLDMSINGSW
EDFDYLKEFLEESVSALDIKENCMRVGLVAYSNETKVINPLSSGINKSEILQDIQSLSPQ
AGKAYTGAALRKIRKEVFSVQHGSRKNQGVPQIAVLVTHSPSEDNVTKAAVNLRRQGVTI
FTMGIEGASDAQLEKIASHPAEQYVSKLKTFSDLAAHNQTFLKKLRNQITHTVSVISERT
ETLKSGCVDTEEADIYLLIDGSGSTQATDFQEMKTFLSEVVEMFNIAPQKVRVGAVQYAD
RWDLEFEINKYTNKHDVGKAIENIRQLGGNTNTGAALNFTLGLLQKAKRQRENRVPCHLV
VLTNGVSQDSILGPANRLREELIHVYAIGVKEANETQLREIAGEEKRVYYVHDFDALKDI
RNQVVQEICAEEACKEMKADIMFLVDSSGSIGRENFIKMKTFMKNLVRKSQIGGDRVQIG
VVQFSDINKEEFQLNRYMSQGEISDAIDRMAHIGETTLTGSALTFVSQYFSPAKGARPNI
RKFLILITDGEAQDTVKDPAVALRQEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVENFD
ILQHIEDDLVFGICSPREECKRIEVLDVVFVIDSSGSIDHDEYNIMKDFMIDLVKKANVG
KNHVRFGALKYADDPEVLFYLDNLGTKRQVISVLQNDQPMGGNTYTAKALGFSDHMFTEA
RGSRLHKGVPQVLIVITDGESHDADTLNATAKALRDKGILVLAVGIAGANAVELLAMAGS
SDKYFFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSDSIHPDDFKKMKRF
LASVVQDFDVSVNRVHIGAAQFSHTYRPEFALGTFIGKKEISFQIENIQQIFGYTHIGSA
LRQVGHYFQPDMGSRINTGTPQVLLVLTDGQSQDEVAQAAEELRHKGINIYSVGIGDVDD
QQLVQITGTADKKLTVHNFDELKKVKKRIVRNICTSGSESNCFVDIVVGFDISTQQNGQT
LLEGQSWMESYLQDILRAIRSLSGVSCEVGTETQVSVAFQVTNAMEKYSPKFEIYSENIL
NSLKDVTVKRPSLLNANLLSSLWGAFQNKSAARGKVVLLFSDGLDDDVEKLEQKSDELRK
EGLNALITVALDGATNSNELADLLYIEFGKGFEYRTQLTLGMRDLGSRLSKHLVSVAERT
CCCLFCKCIGGDGTVGDPGPAGKKGPPGFKGSEGYLGEEGIAGERGTTGPMGEQGTKGCY
GDKGPKGNRGLNGQEGEVGESGIDGLNGEQGDHGLPGAKGEKGDEGAQGIPGERGIPGDH
GAKGLCGDPGVPGFDNSIEGPKGLKGEYGRQGRRGWPGPPGTPGSRRKTAVHGQRGHTGP
QGKTGIPGPDGLAGSLGLKGPQGPRGEAGMKGEKGGLGSKGPQGLPGPAGEAGSQGRLGS
QGNKGEPGDVGEKGAIGLRGPRGLPGDDGNLGYGNVGSKGAKGQEGFPGEIGPKGEVGDP
GGPGETGPKGTRGKTVSAGLPGEPGSPGELGPPGRKGVKGAKGLPSFSTCELIQHVQDHS
PGRHGALRCPVHPTELVFALDQSRGVTEPEFARMKEMMTSLLGGLRVRENHCSAGARVAV
LTYNSHARLLIRFSDAYRKDRLLRQIEALPYERSTSSRDIGKAMRFVSRNVFKRMLPGAH
TRRIATFFSGGPSADAQTITTAGLEFSTLDIIPVVIAFSQVPAIKHSFVIDDTGRFQVIM
VPHRIDSAPALEGLQRCTFCYDICKPDASCDQARPLAMQSYMDTAFLLDGSRHVGSAEFE
DIRGFLGALLDHFEITPEPETSVTGDRVALLSHAPPHFLPNTQRSPVRSEFNLTTYKSKR
LMKRHVEESVQQLNGDAFIGHALQWTLNNVFLSTPNLRRNKVIFVISAGETSHMDRETLK
KESLRAKCQGYALFVFSLGPSWNDQELEDLASYPLDHHLVQLGRIHKPDHRYGVKFAKAF
ISSVRRAINKYPPINIKAKCYRLSSADTQQPPRQLRSFLPGPHKATLKEDVLQKAKFFQH
KKYLSRIARGSRDDTVQNFTRNTSHTFKSGKGVIKTAPKQHDKEVLEEVSLRKHGKTLGL
NN
NT seq 6849 nt   +upstreamnt  +downstreamnt
atgaaaatgctgctcactttgctccttatgataatttgttcccatgtttccgtcaaccaa
cattctggccctgagtatgctgatgtggtgtttctggtggacagctctgaccacctggga
actaaatccttcccgtttgtaaagacattcatcaacaaaatgatcaccagcctccccata
gaggcccacaagtaccgtgtgggcctggcccagtacagcgatgggctccacagagaattc
cagctcagcaccttcgagagcagaggccccatgttgaagcacctcaagaagaacttcagg
ttcctgggtggctccctgcggattgggaacactctccaggaggtacacaaaacctacttc
tcaaacgggagggacaggagacattttcccctgatcctggtggtcctggcttcggcggag
tctgaggatgccgtggaagaggcctccgaggcgctgcgcaacgatggggtgaggatcatc
tccgtggggatgcagggggtgtccgaggagaccctgaaggccatggccacagaccagttc
cactacagcctccggacagtccgagacctcagcacgttctcccaaaacatgacgaagatt
ctcaaggatgcggctcagtacaaggaaggagcagtcaatgacattcttgtggaagtttgc
caaggcccttctgtggctgatgttgtgttcctgttggatatgtccatcaacggcagctgg
gaggactttgactatcttaaagaattcctggaagaaagtgtatctgctctagacatcaag
gaaaactgcatgagggttggccttgtggcctatagcaatgagacaaaggtgattaaccca
ctgagcagtggcataaacaagtcggagattctccaggatatacagagcctctctccccag
gctgggaaggcctacactggagctgccttgagaaagatcaggaaggaggtcttcagcgtg
cagcacggcagtaggaagaatcagggggtgccacagatagctgtgctggtgacccacagc
ccatctgaggacaacgtgaccaaggctgctgtgaacctccggcgccagggtgtgaccatc
ttcaccatgggcatagagggggccagcgacgcccagctggagaagattgcatctcaccct
gccgagcagtacgtctccaaactgaaaaccttctctgacctggctgcccacaaccagaca
tttctaaagaaactgcggaaccagatcacacacacagtctctgtcatttccgaaaggact
gaaaccctcaaatctggttgtgtggacactgaggaagcagacatctatctgctcatcgat
ggctctgggagcacccaggccacggacttccaggaaatgaagaccttcctttcagaggtt
gtagagatgttcaacatcgctccccagaaggtccgagttggggctgtccagtatgctgac
cgctgggacttggaatttgagatcaataaatacaccaacaagcatgatgtgggaaaggcc
atcgagaacatcaggcagctgggcgggaacacaaacacgggcgcagccttgaacttcacc
ctggggctgttgcagaaagcaaagaggcagcgggaaaacagagtgccatgccaccttgtt
gtcctgaccaacggcgtgtcccaggatagcatcttggggcccgcaaacagactgagagag
gagctcatccacgtctatgccatcggggtcaaggaggccaacgagacacagctgcgagaa
atcgcaggcgaggagaagcgggtgtactatgtgcatgactttgatgccttgaaagacata
agaaaccaggttgttcaagaaatctgtgccgaagaagcctgcaaagagatgaaggcggac
atcatgtttctggtggacagttctggcagtataggacgtgaaaacttcatcaaaatgaaa
acatttatgaaaaacctggtgagaaaatctcagattgggggagatagggtgcaaatcggt
gtagtccagttcagtgacatcaataaggaggagtttcagctcaacagatatatgtcccag
ggggaaatttcagatgcaatagaccgaatggctcacattggagaaactaccttgaccggt
agcgccctgaccttcgtgtctcagtacttcagccctgccaagggggcccggcccaacatc
aggaagttcctcatcctcatcacggacggtgaggctcaggacacagtcaaggacccagca
gtggcacttcggcaagaaggcataattatctactctgtgggggtgtttggctccaatgtc
acccagcttgaggagatcagtgggaggccagagatggttttttatgttgagaattttgac
attctgcagcacattgaagacgaccttgtttttggaatatgcagcccccgtgaagaatgc
aagcgcattgaagttttagacgtcgtgtttgtaattgacagttctggcagcattgaccat
gatgagtataatatcatgaaggacttcatgattgacttggtgaaaaaagctaatgtgggc
aagaatcatgtccggtttggggctctgaagtatgcggatgacccagaggtgctgttttat
ctggataaccttggcacaaaaaggcaggtgatctcggtgctccaaaatgaccagcccatg
gggggcaacacttacaccgccaaggccttgggcttctcagaccacatgttcactgaagcc
cggggaagccgcctgcacaagggggtcccccaggtcctcatcgtaatcaccgatggggaa
tcccatgatgccgatacgctcaatgccacagccaaggccttgcgggacaagggcatcctt
gtcctggccgtggggatcgctggtgccaatgctgtggagctgctggccatggcaggatcg
agtgacaagtacttctttgtggaaacttttggaggcctgaaggggatattttcagatgtg
tccgccagtgtgtgtaactcttcaaaagtagattgtgaaattgaaaaagtagacctcgtg
ttcctcatggatggttcggatagcattcatccagatgacttcaagaagatgaagagattc
ttggcatctgtcgttcaagacttcgatgtcagtgtcaacagagtgcatataggagctgcc
cagttcagccacacctaccggccggagtttgcactgggaactttcatcggcaaaaaggag
atctcctttcagattgaaaacatccagcagatctttggctacacgcacattggctctgct
ctccggcaggtagggcactacttccagccagacatgggcagccggatcaacacaggtacc
ccgcaggtgctgctggtcctcacagatggccagtcccaagacgaagtggctcaggccgcc
gaagagctgagacacaaagggatcaacatttactccgtgggcatcggggatgtggatgac
caacagctcgttcagatcaccgggaccgccgataaaaaactgacagttcacaactttgac
gaactgaaaaaggtcaagaagaggatcgttcgcaacatctgtacctcggggagtgagagc
aattgttttgtggacattgtggtgggatttgacatctcaactcagcagaatgggcagact
ttgcttgaaggtcagtcttggatggaaagttaccttcaagacatcttacgtgccatcagg
tccctcagtggggtaagctgtgaggtgggcacggagactcaggttagtgtggcttttcaa
gtgaccaatgccatggaaaaatattcccccaagtttgagatctacagtgaaaacatactg
aacagcttgaaggatgtaacagttaaaagaccatctcttctcaacgcaaacctcttgagt
tctctgtggggtgcatttcagaataaatcagctgctcggggaaaggtggttctcttattt
tcagatggactggatgatgatgttgaaaaacttgaacaaaaatctgatgaacttagaaaa
gaaggcctgaatgccctcatcacagttgctctggatggagccaccaattccaatgaactg
gctgaccttctctacattgaatttgggaaaggatttgagtataggacacagctcactctt
gggatgagggatcttgggagccgactgtcaaagcacctggtcagtgttgcagaaaggaca
tgctgctgtttattctgcaagtgcattggaggagatggtacagtgggagaccctggacca
gcagggaaaaagggacccccaggttttaaaggcagtgaaggctacctgggagaggagggc
attgctggggaaagaggaaccaccggaccaatgggagagcaaggtactaaaggatgctat
ggtgacaaaggtcctaagggaaacaggggactaaatggacaggagggtgaagttggggaa
agtggaattgatgggttaaatggagaacagggtgaccatggtcttccaggagcaaaagga
gaaaaaggagatgaaggagcccagggaatcccaggagagagaggaattcctggtgaccat
ggagcaaagggcctgtgtggagaccctggagttcctggatttgacaatagcatagaagga
cccaagggcttgaaaggggaatatggaagacagggaagaagaggctggccaggcccccct
ggaaccccaggctccagaagaaagacagcagttcatggccagaggggacatacaggccca
caggggaaaacaggcatcccaggcccagatggacttgcggggtcactgggacttaagggt
cctcagggcccaagaggagaggctggtatgaaaggagaaaaaggaggtctgggaagtaaa
ggtccccaggggcttccaggaccagcaggagaggcagggagtcaaggccgtttgggaagc
caaggaaataaaggagaacctggcgatgtgggagaaaagggagccattggccttcggggt
cctcgaggcttgccgggtgatgatggcaacctaggttatggtaatgttggaagtaaagga
gcaaaggggcaagaaggattccctggagaaattggacctaagggtgaggttggggaccct
ggtggtccaggagaaactggacccaagggcactagaggcaagacggtgtctgctgggctt
ccaggagagccgggatcccctggggagctgggacctcctggacgcaagggtgtgaaaggg
gccaaaggattgccttccttctccacatgtgagctcatccagcatgtgcaggaccacagt
cctggcagacatggagccctgcggtgccctgtgcatcccacggagctggtgttcgcgctg
gaccagtcccggggcgtcacggagccagagttcgcacggatgaaggagatgatgacctcc
ctgctgggcggtctccgggtccgggagaaccactgctctgcgggcgctcgcgtggccgtg
ctcacctacaactcccacgccaggctcctcatccgcttctccgacgcctacaggaaggac
agactcctcagacagatcgaggcccttccttacgagcggtccacgtccagcagggacatt
ggcaaggccatgaggtttgtctccaggaatgtcttcaagcggatgctcccaggggctcac
accagaagaatcgccacattcttcagtggcggcccatctgcagatgcccagaccatcacg
acagccggcctggagttcagcaccctggacattatcccggtggtgatcgccttcagccag
gtgcctgccatcaagcactcgtttgtgattgacgacactggcagattccaagtcatcatg
gttccacacaggattgactctgcgccagcactagagggactccagcggtgcactttctgc
tatgatatatgcaagccggatgcttcttgcgaccaagccagaccactcgccatgcaatct
tacatggacactgctttccttctggatggctcccggcatgtgggaagtgcagaatttgaa
gacataagaggcttcctgggagcactgttagatcactttgaaatcaccccagagcctgag
acctctgtcactggagaccgggtggccctattaagccatgctcccccccacttcctaccc
aacactcagaggagtcctgttagaagtgaattcaaccttaccacctacaagagtaaacgc
ctcatgaagaggcacgtggaagaatcagttcagcagctcaatggagatgcttttattggt
catgccttacagtggactctgaacaatgtcttcttgagtacacccaatctgagaagaaat
aaagtcatttttgtgatatctgctggggaaaccagtcacatggatagggaaaccttaaag
aaagaatccttgagagccaagtgtcaaggttatgccttatttgtgttttcccttggccct
tcttggaatgaccaggaactggaagatctagccagctaccctttggatcaccacttggtc
cagcttggccgcattcataagcctgaccacagatatggtgtgaagtttgcgaaggccttt
ataagctcagtcaggcgtgcaatcaacaaatatccaccaataaacatcaaagcaaagtgc
tatagactcagctctgcagatacgcagcagcccccgcgccagttgcgaagcttccttcct
ggaccacataaagctaccctcaaagaagatgtattacagaaggcaaaattctttcaacat
aaaaaatatctgtcaagaatagcaagaggcagcagagatgatactgttcaaaattttacg
agaaacacatcccatacctttaaaagtgggaaaggggtgataaaaacggctcccaaacaa
catgataaagaagtgctagaagaggttagcctcaggaagcatggtaagactctgggctta
aataactaa

KEGG   Physeter catodon (sperm whale): 102984824
Entry
102984824         CDS       T06011                                 

Gene name
COL4A6
Definition
(RefSeq) collagen alpha-6(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102984824 (COL4A6)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102984824 (COL4A6)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102984824 (COL4A6)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102984824 (COL4A6)
  09154 Digestive system
   04974 Protein digestion and absorption
    102984824 (COL4A6)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102984824 (COL4A6)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102984824 (COL4A6)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102984824 (COL4A6)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102984824 (COL4A6)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102984824 (COL4A6)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102984824 (COL4A6)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102984824 (COL4A6)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102984824 (COL4A6)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102984824 (COL4A6)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102984824
NCBI-ProteinID: XP_023983830
UniProt: A0A2Y9T4V1
LinkDB
Position
21
AA seq 1669 aa
MKKVVMDGEGRPGPIGIQGPVGPQGFPGPSGLSGLKGERGSPGPLGPYGPKGDKGPMGVP
GFLGINGIPGHPGQPGPRGPPGLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGEP
ILAQGSFKGMKGEPGLPGLDGITGPQGAPGSPGAVGPIGPPGLQGPPGPPGFPGPDGNMG
LGFQGEKGVKGDVGLPGPAGPPPSTGELEFMGFPKGKKGSKGEPGLQGFPGISGPPDFPG
LGGFGIAGEKGEKGIPGLPGPRGPTGLEGVQGPPGSQGKKGSSGFPGLNGFPGIKGEKGD
IGLPGPDVFIDTDGAVISGYPGDPGMPGLPGLKGDEGIQGLPGPPGAPGLPALPGGPGAL
GLQGVPGLKGDRGNPGRTTIGAAGLPGRDGLPGLPGLPGPPGPAFETGTLQSTEPGLPGL
RGERGPKGNPGLKGIKGDSGFCSCDGGVPNIGPPGEPGLPGPPGLLGLPGLKGTRGDPGS
GGAQGPSGYPGVFGPRGLAGPKGKKGEATLSTESGMPGDQGDPGPQGLPGETGAPGKDGV
PGLPGLPGFQGDGGQGFPGEKGLPGLPGEKGRTGPIGPPGIGLPGSSGPRGLPGDKGVDG
FPGQPGPPGIPGDCCCREKVGRGDLDTEGGITLPCIIPGPYGPSGFPGAPGFPGPNGARG
LPGTPGQPGLRGNKGEPGSPGLVHLPELPGFPGPRGEKGLPGFPGLPGKDGLPGQVGSPG
LPGPKGAPGDIIGAENGAPGEQGLQGLPGDRGVPGDSGLTGPKGLLGKSGLLGPKGERGS
PGVPGRAGQPGPRGSGGLFGIKGKPGLPGAPGFPGASGHPGKKGPRGDIGPPASAGKRGL
SGLKGLPGPPGLIGFLGNSGLPGATGLPGLPGPKGEKGSVGLIGFPGIPGLPGIPGASGL
KGIPGSVGRVGPSGQAGSPGEKGDRGDPGPVGVPSPRLPVLNLHLTGDKGSQGSAGSIGF
PGPRGDKGEAGRPGPPGLPGAPGFPSVVKGLSGRPGPPGSTGLRGLPGLKGSPGITGFPG
IPGESGSQGLNGAPGLPGASGLPGLKGDRGQTLGISGSPGPKGQPGEPGFKGVKGKDGPV
GDVGFPGNKGEDGKAGLSGDVGLPASSGLPGVAGMRGNPGLPGSSGHPGAAGPLGPPGLI
GTKGFPGLPGLHGLNGLPGTKGTHGTPGPSITGVPGPAGLPGPKGEKGSPGIGIGAPGKP
GVRGPKGSRGFPGLQGRAGLPGAPGLSLPSVIAGQPGDPGRPGLDGERGRPGPPGPPGPP
GPSSDQGDPGDPGFPGIPGPQGPKGDQGIPGFSGLPGELGLKGMRGGPGFMGTPGKVGPP
GDPGLPGMKGKAGPRGLSGPRGAPGQTPIAEAIQAPPGPMGLTGIDGIPGLMGDPGIQGP
GGLQGFKGLPGIPGKDGPNGLPGPPGALGDPGLPGLQGPPGFEGAPGRPGPFGRAGMPGQ
SVRVGYTLVKHSQSEQVPLCPVGMSQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTM
PFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVGQTQIPQYISRCSVCEAPSQAIAVHS
QDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVSPGSCLEDFRATPFIECSGARGT
CHYFANKYSFWLTTVEERQQFGEEPVSETLKAGQLHTRVSRCQVCMKSV
NT seq 5010 nt   +upstreamnt  +downstreamnt
atgaaaaaagtggtgatggatggagaggggcgaccaggaccaattggaattcaaggtcca
gtgggtcctcaaggatttcctggccctagtggtttatcagggttgaaaggagaaaggggc
tccccaggacctctgggaccgtatggaccaaaaggagataagggtcccatgggagttcct
ggctttcttggcatcaatgggattccgggccaccctggacagccaggacccagaggccca
cctggcctggatggctgtaatggaacccaaggagctgttggatttccaggccctgatggc
tatcccgggcttctcggaccacccgggcttcctggtcagaaaggctctaaaggtgaacct
atccttgcccaaggtagtttcaaaggaatgaagggggagcctgggctgcccggactggat
ggaatcactggcccacaaggagcacccggatctccaggagctgtgggacccataggacca
ccaggattgcaaggtcctccaggcccccctggtttccctggtcctgatgggaatatgggg
ttaggtttccaaggagagaaaggagtcaagggggatgttggcctgcctggccctgcagga
cccccaccatctactggggaactggaattcatgggatttcccaaagggaagaaaggatcc
aagggtgaaccagggcttcagggcttcccaggaataagtggtcctccagatttccctgga
ttgggtggatttggaattgctggagaaaaaggagaaaagggaatccctggtttgccggga
cctaggggtcccacgggtttagaaggagtccaaggccctccagggagtcagggcaagaag
gggtcttcaggtttccccgggcttaatggattcccaggaattaagggtgaaaagggtgac
attggcctgccaggcccggatgttttcatcgatacagatggtgctgtgatctcaggttat
cctggagaccccggtatgccaggcctcccaggccttaaaggagacgaaggcatccagggc
ctgcctggccctcctggtgcccctggattaccagctttaccgggtggccccggtgctcta
gggctccagggagttccaggcctgaagggggaccgaggaaaccccggccgcaccacaatt
ggggcagctggcctccctgggagagatggtttgccaggcctaccaggcctaccaggccca
cccggtccagcatttgagacgggaactctacagagcacagagccagggttgcctggtctc
cgaggagaacgaggtccgaaaggcaacccaggcctcaaaggaatcaaaggggactcaggt
ttttgttcttgtgatggtggtgtcccaaacattggaccacctggggagccaggcctgcct
gggccaccaggtctcctgggccttccgggcctgaaaggaacgagaggagatccaggctct
gggggtgcacagggcccatcagggtatccaggtgtatttgggcctcgaggtctggcaggc
cccaaaggaaagaaaggggaagcgactctcagtacagaatcagggatgccaggagatcag
ggtgatcctggtccccaggggctccctggtgagacaggagccccaggcaaggacggagta
ccaggtttaccaggcctgccaggctttcagggtgatggtggacagggcttcccaggtgag
aaggggttaccaggacttcctggtgaaaaaggccgcactggtccaattggccccccggga
attgggctaccaggatcttctggacctcgtggacttcctggagataaaggggtagatgga
ttcccagggcaaccaggcccccctgggattccaggtgactgctgctgcagggagaaggtt
gggcgaggagacttagacacagagggaggtatcaccttgccttgtattattcctgggcca
tacggtccatcaggattcccaggagctcccggattcccaggccctaacggggcccgtggc
ctccctgggactccaggccagcctggattgcgtggaaataaaggagagcccggaagtcca
ggattggttcacctccctgaattgccaggatttcctggacctcgtggagagaagggcttg
cctggatttcctgggctccctggaaaagatggcttgcctgggcaagttggcagtcctggc
ttaccaggtcccaagggagcccctggtgacatcattggtgctgaaaacggtgctcctggg
gagcaaggcctacagggactgccaggggacagaggagttcctggagactctggccttaca
ggacccaagggtttgcttgggaagtcgggcttgctcggccccaaaggtgagcggggcagc
cctggagtaccaggccgcgcaggacagccaggcccccgggggtctggtggtctattcggc
atcaagggcaaacccgggctcccaggagcaccaggctttccaggcgcttcaggacatcct
ggcaagaaaggtccaagaggcgatataggtccccctgcatcagctggaaagagaggcctg
tctgggctgaaaggccttccgggacctccagggctaattggcttcctggggaactccggc
ttgccaggggccactggactgccaggactgccaggtccgaagggtgagaagggatctgtt
ggactgataggttttccagggattccaggtctccctggtattcctggtgcaagtggatta
aagggaattcctgggtcggtgggaagagttggaccatctggacaggctggtagccctggt
gaaaaaggagacagaggcgacccggggccagtcggagtacccagccccagacttcccgtg
ctgaacctccacctcacaggagacaaaggctctcagggctcagctggatccatcggattt
cctgggcccagaggtgacaaaggtgaggctggccgccccgggccaccaggcttgcctgga
gctcctggcttccccagcgtcgtcaagggacttagtgggagaccaggcccccctggctcc
acaggactacggggcttacccggcctgaaggggtcccctggaatcacaggtttcccagga
ataccaggagaaagtggttcacagggtctcaatggagctcccgggctcccaggagcatct
ggtctcccaggtttaaaaggagatcgaggccagacacttggaatttctggtagcccagga
cccaagggacaacctggggagcctggtttcaaaggtgtgaaaggaaaagatggaccagtt
ggtgatgtgggtttcccaggaaacaaaggtgaagatgggaaagctggtctttctggagat
gttggccttcctgcctcctcaggactccccggggtcgcaggcatgagaggaaatccaggg
cttccaggttcttcaggccacccaggggcagctgggcccctgggaccccctggcctaata
ggaaccaaaggtttccctggacttcctggtttacatggactgaatgggctaccaggaacc
aagggaacccatggaactccaggacctagtataaccggtgtgcctgggccagctggcctg
cctggtcccaaaggagaaaaaggttctccaggaattggcatcggagccccaggaaagcca
ggcgtgagaggaccaaaaggtagccgaggtttcccaggtctccagggccgtgctggtctc
cctggtgccccaggcctctccttgccctcagtcatagcaggacagcctggcgaccccggg
cgaccaggcctagatggagaacgaggccgcccaggccccccgggccccccaggcccccct
gggccatcctcggatcaaggcgacccgggagaccctggcttccctggaattcctggccct
caagggcccaagggagaccaaggaattccaggtttctctggcctccctggagagctagga
ctgaaaggcatgagaggtgggcctggcttcatggggactccaggcaaagttgggccacct
ggagacccaggacttccggggatgaaggggaaggcagggccaagaggcttatccggcccc
cgaggcgctcctggacaaacaccaattgcagaagccatccaggctcctcctggacccatg
ggtctaacaggcatcgatggcatccctggcctcatgggggaccctgggatccaaggccct
ggaggcctacaaggcttcaaaggtttacctggcatcccgggcaaagatggccccaatgga
ctccctggcccacctggggctcttggtgatcctggtctccctggactgcaaggccctcca
ggatttgaaggagctccagggaggccgggccccttcgggagggctggaatgcctgggcag
agcgtgagagtgggctacaccttggtgaagcacagccagtcggaacaggtgcccctgtgc
cccgtcgggatgagccagctgtgggtgggttatagcctgctgttcgtggaggggcaggag
aaagcccacaaccaggacctggggtttgctggctcctgcctgccccgcttcagcaccatg
cccttcatctactgcaacatcaacgaagtgtgccactatgcccggcgcaacgataaatcc
tactggctctccaccaccgcgcccatccccatgatgcccgtcggccagacgcagattccc
cagtacatcagccgctgctctgtgtgtgaggcgccctcgcaagccattgccgtgcacagc
caggacatcaccatcccgcagtgccccctgggctggcgcagcctctggattgggtactcc
ttcctcatgcacacggccgccggcgctgagggcggcggccagtccctggtctcccctggc
tcctgcctagaggatttccgagccactcctttcatcgagtgcagtggtgcccggggcacc
tgccactactttgccaacaagtacagtttctggctgacgacggtggaagagaggcagcag
tttggggaggagcctgtatctgagacgctgaaagccgggcagctccacacccgggtcagc
cgctgccaggtgtgtatgaaaagcgtgtag

KEGG   Physeter catodon (sperm whale): 102984868
Entry
102984868         CDS       T06011                                 

Gene name
TNXB
Definition
(RefSeq) LOW QUALITY PROTEIN: tenascin-X
  KO
K06252  tenascin
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05165  Human papillomavirus infection
pcad05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102984868 (TNXB)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102984868 (TNXB)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102984868 (TNXB)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    102984868 (TNXB)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102984868 (TNXB)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102984868 (TNXB)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102984868 (TNXB)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_2 Pur_ac_phosph_N EGF_Tenascin DUF2369 DUF4959 fn3_2 CBX7_C
Other DBs
NCBI-GeneID: 102984868
NCBI-ProteinID: XP_028334630
UniProt: A0A455AE44
LinkDB
Position
18
AA seq 3885 aa
MMPAQGSLTSNLALLVLLGTVRAGPFSPRSNVTLPAPRPPPQPGGRTEQPGGGGPSSQLY
EHTVEGGEKQVVFTHRINLPTSGSCGCPPGTEPPVPASEVQALRVRLEILEELVKGLKEQ
CTGGCCPAAAQAGTGQTDVRSLCSLHGVFDLSRCACSCEPGWGGPTCSDPTDAAAPPSSP
PSASRSCPDDCNDQGRCVRGRCVCFPGYTGPSCSWPSCPGDCHGRGRCVQGVCVCRAGFS
GDDCSLRSCPRGCSQRGRCENGRCVCKPGYTGEDCGARSCPRGCSQKGRCEDGRCVCDPG
YAGEDCGSRSCPWDCGEGGRCVDGRCVCWPGYAGEDCSTRTCPRDCRGRGRCEEGECICD
PGYSGDDCGVRSCPGDCNQRGRCEDGRCVCWPGYSGPDCGARACPRDCRGRGRCENGVCV
CNAGYSGEDCGARSCPGDCSGRGRCESGRCVCWPGYTGRDCGTRACPGDCRGRCVDGRCV
CNPGFTGEDCGKRRCPGDCRGRGRCEDGVCLCDVGYEGEDCGARSCPRGCQGRGQCLEGR
CVCDDGYEGEDCGVRRCTRDCSQRGVCQDGVCACWEGYTGEDCSLRTCPSNCHRRGRCED
GRCVCDSGYTGPYCATRSCPADCRGRGRCVQGVCVCHAGYSGEDCGQEEPPASACLGGCG
PRELCRAGQCVCVEGFRGPDCAIQTCPGDCRGRGECREGSCVCQDGYAGEDCGEEVPAVE
GMRMHLLEETMVRTEWTRAPGNVDAYEIQFIPTTEGPSPPFTARVPSSASAYDQRGLAPG
QEYQVTVRALRGTSWGPPASKTITTMIDGPQDLRVVAVTPTTLELNWLRPQAEVDQFVVS
YVSAGNQRVRLEVPSEADGTLLTGLMPGVEYVVTVTAERGRAVSYPASVRANTGSSPSDL
LGATDEPPPSGPSTTQGAQAPVLQQRPQELAELRVLGRDKTGHLRVAWTAQPDTFAHFQL
RLRVPEGPGAHEELLPGDVRQALVPSPPPGSPYELSLRGVPPGGEPSAPLIYQGIMDRDG
EKPGQPLAPPRLGKLTVTDMTSNSLLLHWTVPEGEFDSFVIQYKDRDRPQVVPVEGPQRS
ALISNLDVGRKYKFILYGLVGKKRHGPLVAEAKILPQTDSSSVTPPRLGKLWVTDPTPDS
LHLSWTVPEGQFDSFVVQYRDRDGRPQVVPVEAPERSVIVSPLDPDHKYRFTLFGIADKK
RHGPLTADGTTAAEKREEPRHPESPEQPLLGELTVTGTTPDSLRLSWTVTQGSFDSFMVQ
YKDAQGRAQAVPIRGHENEVTIPGLESDRKYKMNLYGLHGRQRVGPVSVVATTAPQEGVD
ETPSPTAPSTEAPEPREEPLLGELMVTGSSPDSLSLSWTVPQGHFDSFTIQYRDRDGQPK
VVRVGGEESEVTAGGLEPGRKYKMHLYGLHGGRRVGPVSTVGVTAPQPEETPPATEPPLE
PRLGELTVTDVTPSSVGLAWTVPEGQFDSFVVQYKDRDGQPQVVPVAADQREATVPGLEP
ARKYKMNFYGLHGGQRVGPLSVVAMTAPGTEPPLEPRLGELTVTDVTPDSVDLSWTVPEG
EFDSFVVQYRDRDRQPQEVPVAADQREVTIPGLEPSRKYKFLLFGIQDGKRHSPVSVEAK
TVARGDTSPGAPPRLGELWVTDPTPDSLRLSWTVPEGHFDSFVVQFKDRDRPRVVPAEGH
ERSVTIGPLDASRKYRFLLYGLLGKRRHGPLTAEGTTETRRAVDEAGTKPRLGEELQVTS
VTSDSVGLSWTVPEGQFDSFVVQYKDRDGQPQVVPVEGSLREVSVSGLDPARRYKLLLYG
LYEGKRVGPISTVAMTAPREEVEAETEAPSPPASEPRLGEVTVEESTPHTLHLSWTVTEG
EFDSFEVQYTDEDGQLQVVHLGGNQNDITLSGLESDHRYLVSLYGLHDGQRVGPAHVEAL
TAPNEEENEPSESPSTTPEPHVTPRLGELAVTDATPESLSLSWTIPEGQFDHFLIQYKNG
DGQPKAVRVPGYEDGVTISGLEPDRKYKMNLYGFHDRQRVGPVSVIGVTTAEEDTPSPTE
MEETPSPTEPSTEASEAPEEPLLGELMVTGSSPDSLSLSWTVPQGHFDSFTVQYRDRDGQ
PKVMRVPGHQDGVTISGLEPDHKYKMNLYGFHDRQRVGPVSVIGVTSEWTVEAPGWDLVG
GSLSLRXWVSGVQGTPQLEADVSPCQYKGRDGPQVVRVGGEESEVTAGGLEPGRKYKMHL
YGLHGGRRVGPVSTVGVTAPKEEPPASPALKPQLGELTVTDATPDSLSLFWNVPEGQFDQ
FLIQYKNGDGQPKAVRVPGHQDGVTISGLEPDHKYKMNLYGFHSGQRMGPVSTVGLTASE
NDQEMTPAPTDLPTTAPEPPVKPHLGELAVTDATPESLSLSWTIPEGQFDHFLIQYKNGD
GQPKAVRVPGYEDGVTISGLEPDHKYKMNLYGFHDSQRVGPVSVIGVTSEWTVEAPGPTE
PSTEASEAPEEPLLGELTVTGSSPDSLSLSWTVPQGHFDSFTVQYKGRDGPQVVRVGGEE
SEVTAGGLEPGRKYKMHLYGLHGGRRVGPVSTVGVTALQEVVEETPSPTEPSTEAPEPPK
EPLLGELTVTGSSPDSLSLSWTVPQGHFDSFTVQYKGRDGPQVVRVGGEESEVTAGGLEP
GRKYKMHLYGLHGGRRVGPVSTVGVTDPDYKAMTTQAPSTAAPEPPVKPQLGELAVSDAT
PESLSLSWTIPEGQFDHFLIQYKNGDGQPKAVRVPGYEDGVTISGLEPDHKYKMNLYGFH
DSQRVGPVSVIGVTTAEEDTPSPTEMEETPSPTEPSTEASEAPEEPLLGELTVTGSSPDS
LSLSWTVPQGHFDSFTVQYKGRDGPQVVRVGGEESEVTAGGLEPGRKYKMHLYGLHGGRR
VGPVSTVGVTASLNTEPPVAARLGELAVAAVTSDTVHLSWTVEQGPFDSFLVQYKDVQEQ
PQAVPVGGHLREVSVSGLAPGHKYKFLLFGLQDEKQHGPVSADAKTLPDTKPSLRLGELT
VTDVTPDSVGLSWTVPEGEFDSFVVQYRDRDGQPHVVPVASDQREITIPGLEPSSKYKFL
LYGLAGRKRLGPISAEGSTAPLKTERQRPPRLGELTVTDETSDSLRLLWTVAQGPFDSFV
VQYRDTDGQPSAVPVAADQREVTIEGLEPGRKYKFLLYGLLRGQRLGPVSVLGMTAPEEA
MPAPWHLATEAPEPPEGPRLGALAVKDVSPDALRLSWSVAQGPFDSFVVQYQSTGGEPQA
LLVGGDQNKVLVSGLEPSTSYKFFLYGLHEGKRLGPVSAEGTTGPAPAGQTPGEPGPRLS
HLSVTDVTTNSLRLDWEAPPEAFDSFLLRFGVPSPSTLELHPRSPLQRELTVPGTRRSAV
LRDLSPGTLYTLTLYGLRGPHKADSVQGTARTLSPALERPRDLQFSEIGETSAKVSWMPP
PSRVDSFKVSYQLADGGEPQSVQVDGRARTQKLEGLVPGAHYDVTVVSVRGFEESEPLTG
FLTTVPDGPTQLRALNLTEGSALLLWQPPQAPVDTYDIKVTAPGAPSLQGSAPGSAVDYP
LQGLELHTNYTATLHGLRGPSLTSPASITFTTGLEAPQDLEAKEVTPRTALLTWIAPPVP
PTGYLLSFNTPGGQTQEILLPGGVTSHQLHGLFPSTPYSVWLRAMWDHSFTPPVSTSFTT
GGLRIPFPRDCGEEMQNGPSTSRTTTIFLNGNRERPLNVFCDMETDGGGWLVFQRRMDGK
TDFWRDWEAYAHGFGNISAEFWLGNEALHSLTKAGDYSLRVDLRDGDEAVFAQYDSFQVD
SADEYYRLHLEGYRGTAGDSMSYHSGSVFSARDRDPNNLLISCAVSYRGAWWYRNCHYAN
LNGLYGSTVDHQGVSWYYWKGFDFSVPFTEMKLRPRSYRPPGRGG
NT seq 11658 nt   +upstreamnt  +downstreamnt
atgatgccagcccaaggttccctaacctccaacctggccctcctggtgctgctgggcaca
gtcagagcaggccctttctctccacggtccaatgtgacgctcccagccccacggcctcct
ccccagccaggaggccgcacagaacagccaggagggggaggcccttcttctcagctttat
gagcacaccgtggaaggaggggagaagcaagtggtgttcacccaccgcattaacctgccc
acttcaggtagctgtggctgtcccccaggcactgagccccccgtccctgcttcagaggtg
caggccctgagggtccggctggagatcctggaggagctggtgaaggggctcaaggaacag
tgcactgggggatgttgtcctgctgctgcccaggctggcacaggccagacggacgtgcgg
agcctctgcagtctccatggcgtgtttgacctgagccgctgcgcctgctcctgcgagcca
ggctggggtgggcccacctgctcagaccccacggatgccgcggcgcccccctcctctccc
ccctcagcttcccggtcctgcccagatgactgcaatgatcagggtcgctgcgtccgtggt
cgctgcgtgtgcttccctggctacaccggccccagctgtagctggccctcctgcccgggg
gactgccacggccgcgggcgctgcgtgcagggcgtgtgcgtgtgccgcgcgggcttctcc
ggggacgactgcagcctgcgctcctgccctcggggctgcagccagaggggacgctgcgag
aacgggcgctgcgtgtgcaaaccaggctacactggcgaggactgtggggcgaggagctgc
ccccgaggttgcagccagaaggggcgctgcgaggacgggcgctgcgtctgtgaccctggc
tacgctggcgaggactgcggctctcggagctgcccgtgggactgtggcgagggcgggcgc
tgtgtggacggccgctgcgtgtgctggcccgggtacgcgggcgaggactgcagcacgcgg
acctgcccccgcgactgtcggggccgagggcgctgcgaggaaggcgaatgcatctgcgac
ccaggctacagcggggacgattgcggagtgcgcagctgcccgggcgactgcaaccaaagg
ggccgctgcgaggacggccgctgcgtgtgctggcctgggtactcggggcccgactgcggc
gcccgcgcttgcccgcgcgactgtcggggccgcgggcgctgcgagaacggcgtgtgcgtg
tgcaacgcgggctacagcggcgaggactgcggcgcgcgcagctgtcctggggactgtagc
ggccggggccgttgcgagagcggccgctgcgtgtgttggcccgggtacacaggccgggac
tgcggcacgcgcgcctgccctggcgactgtcgcgggcgctgcgtggacggccgctgcgtg
tgtaacccgggcttcacaggcgaggactgcgggaaacgacggtgtcccggggactgccgc
ggtcgcggccgctgcgaggatggcgtgtgcttgtgcgacgtgggctacgagggtgaagac
tgcggcgcgcgcagctgccccaggggttgccagggtcgcggccagtgtctggaggggcga
tgcgtgtgcgacgacggctacgagggtgaagactgcggcgtgagacggtgcacgcgcgac
tgcagccagcgcggcgtgtgccaggatggtgtgtgcgcctgttgggagggctacacgggc
gaggactgcagcctccgtacctgcccctccaactgtcaccggcgcggccgctgtgaggac
gggcgctgcgtgtgcgactccggctacaccggcccctactgcgccacccgaagctgcccg
gcggactgccggggccgtggacgttgtgtgcagggcgtgtgcgtgtgccacgcgggctac
agcggcgaggactgtgggcaggaagagcctcccgccagcgcctgcctcgggggctgcggg
cctcgggaactgtgccgcgcaggccagtgtgtatgcgtcgagggcttccgaggccccgac
tgcgccatccagacgtgccccggggactgccgcggtcggggagagtgtcgtgagggcagc
tgcgtctgccaagatggctatgcaggggaggactgcggggaagaggtgccagccgttgag
ggcatgaggatgcacctcctggaggagacgatggttcggacagagtggacccgggctcct
ggcaatgtggatgcctacgaaattcagttcatccccacgacagaggggccgagcccccca
ttcacagcacgggtgcccagctctgcctcagcctatgaccagagaggactggcccccggt
caggagtaccaggtcactgtccgtgcccttcgagggactagctggggccctcctgcctcc
aagaccatcaccaccatgatcgatggcccccaggacctccgagtggtggctgtgacacca
accacgctggagctcaactggctgcgcccccaggctgaggtggaccagttcgtggtgtcc
tacgtcagtgccggcaaccagagggtgcggctggaagtgccctctgaggcagatgggacg
ctgctgaccggcctgatgccaggcgtagaatacgtggtgaccgtcactgccgagcggggc
cgggcagtaagctacccggcttctgtcagggccaacacagggtcctccccctcagacctc
ctgggggccactgatgagcctcctccctcaggcccttcaacgactcaaggggctcaggcc
cctgtcctgcagcagcgaccccaggagctggcagagttgagggtgctgggcagagacaag
acagggcacctccgcgtggcctggaccgcccagcctgacacctttgcccacttccagctg
cgcctacgggtgcccgaagggcctggggcacatgaggaactactgccaggggatgtccgt
caggctctggtgccctcaccccctcctggatccccctatgagctgtcacttcgtggggtc
ccccccgggggcgagccctctgcccctctcatctaccaaggcattatggacagggatggg
gagaagcctgggcagcctttggccccgccacgcctgggcaagctgacagtgacggacatg
acttccaactccctgcttctgcactggacggtccctgagggcgagtttgactccttcgtg
atccaatacaaagacagggacaggccccaggtggtgcctgtagagggaccccagcgctca
gccctcatctccaacctggacgtcggccgcaagtacaagttcatcctgtacgggctcgtg
ggcaagaagaggcatggccccctggtggctgaagccaagatcttgcctcagactgattcc
agctcagtgactccaccccgcctgggaaagctgtgggtgacagatcccaccccagattcg
ctgcacctctcctggactgtccctgagggccagtttgactccttcgtggtccaatacagg
gacagggatggacggccccaggtggtgcccgtggaggctcccgagcgctcggtcattgtc
tccccgctggaccccgaccacaagtacagattcactctgtttgggattgccgacaagaag
cggcacggccccctcacggccgatggcaccactgccgcagagaagagagaggagccccgc
cacccagagtcccctgagcagcccctgctgggggagctgacagtgactggcaccacccca
gactccctgcgcctgtcctggacggtgacccagggctcttttgactccttcatggtccag
tacaaggatgcgcaggggcgggcccaggcagtgcccatcagggggcatgagaatgaggtc
accatccccggcctggagtccgaccggaagtataagatgaacctctatgggcttcatggc
aggcagcgcgtggggcctgtgtccgtggtggccaccacagccccgcaggaaggcgtggac
gagacccccagccccacagcgcccagcacagaggcaccagagccccgcgaggagcccctc
ctgggggagctgatggtgacaggatcctccccggattcgctgagcctctcctggactgtc
ccccagggccacttcgactctttcaccatccagtacagagacagggatgggcagcccaag
gtggtgcgtgtcgggggcgaggagagtgaggtgaccgctgggggcctggagcccgggcgc
aagtacaagatgcacctgtacggcctgcacggggggcggcgtgtgggccccgtgtccacc
gtgggcgtgaccgctccacaaccagaagagactcctccagccaccgagccccccctggag
ccacgccttggggagctgacagtgacagatgtgacccccagctccgtgggccttgcatgg
acagtccctgagggccagtttgactccttcgtggtccagtacaaggacagggacgggcag
ccccaagtggtgcctgtggccgcagaccagcgagaggccactgtccccggcctggagcct
gcacgcaaatacaagatgaacttctacgggctacatggtgggcagcgcgtgggccccctc
tctgtggtagccatgaccgccccaggcacagagcccccccttgagcctcgtctgggggag
ctgacagtgacggacgtgaccccggactccgtggacctctcctggacagtccccgagggt
gaattcgactccttcgtggtccagtacagggacagggacaggcagccccaggaggtgcct
gtggctgcagaccagcgtgaggtcaccatccccggcctggagcccagtaggaaatacaag
ttcctgctctttgggatccaggatgggaaacgtcacagcccagtctctgtggaggcaaag
acagttgctcgaggtgacaccagcccaggagccccaccccgcctcggggagctgtgggtg
acagaccccaccccagactcactgcgcctctcctggacagtccccgagggccactttgac
tcctttgtggtccagttcaaggacagggacaggccccgggtggtgcctgcagagggccac
gagcgctcagtcaccatcggccctctggacgccagccgcaagtacagattcctcctctac
ggcctcctgggcaagaggcgccatggccccctcaccgccgagggcaccacagagacccgg
agagctgtggacgaggctggaacaaaaccacgtctgggggaggagctgcaggtgaccagc
gtgacatcagactccgtgggcctctcgtggacggtccccgagggccagtttgactccttt
gtggtccagtacaaggacagagatgggcaaccccaggtggtgcccgtggagggcagcctc
agggaggtcagtgtctcaggcctggaccctgcccgcaggtacaagctgctgctctacggg
ctatacgagggcaagcgtgtgggtcccatctccaccgtcgccatgaccgcccccagggaa
gaagtcgaagctgagactgaggcccccagccctccagcatccgagccccgtctgggggag
gtgactgtggaggagagcaccccacacaccctgcatctctcctggaccgtgactgaggga
gaatttgactccttcgaggtccagtacacagacgaggatgggcaactccaagtagtccat
ttagggggcaaccagaatgacatcaccctctctggcctggaatccgaccacagatacctg
gtgagcctgtacggtctccacgatgggcagcgtgtgggtcctgcgcacgtcgaggccctg
acagccccaaatgaggaagagaatgaaccttcagaatctccctctaccactcctgagcct
cacgtcacgcctcgcctgggggagctagcagtgacagacgccacccccgaatccctgagc
ctctcctggaccatccccgagggccagtttgaccacttcctgatccagtacaagaacggg
gacgggcagcccaaggcagtgagggtgccgggatacgaggacggggtcaccatctcgggc
ctggagccagaccgcaagtacaagatgaacctgtacggcttccatgacagacagcgtgtg
ggccccgtctctgtcatcggggtgacaactgcagaggaagacacccccagccccacagag
atggaggagacccccagccccacagaacccagcacagaggcctcggaggcccccgaggag
cccctcctgggggagctgatggtgacaggatcctccccggactcgctgagcctctcctgg
actgtcccccagggccacttcgactctttcaccgtacagtacagagacagggatgggcag
cccaaggtgatgcgggtgccagggcaccaggacggggtcaccatctcgggcctggagcca
gaccacaagtacaagatgaacctgtatggcttccatgacagacagcgcgtgggccccgtc
tcagtcatcggggtgacaagtgagtggacggtggaagccccagggtgggacctcgtggga
ggatcactctctctcaggtgatgggtgagtggggtgcaggggacccctcaactggaggct
gatgtgtccccctgccagtacaagggcagggacgggccccaggtggtgcgtgtcgggggc
gaggagagtgaggtgaccgctgggggcctggagcccgggcgcaagtacaagatgcacctg
tacggcctgcacggggggcggcgtgtgggccccgtgtccaccgtgggcgtgaccgcccct
aaagaggagccgcctgccagccccgctttgaagccacagctgggagagctgactgtgacg
gacgccacccccgactccctgagcctcttctggaacgtccccgagggccagtttgaccaa
ttcctgatccagtacaagaatggagatgggcagcccaaggcggtgagggtgccggggcac
caagacggggtcaccatctcgggcctggagccagaccacaagtacaagatgaacctgtac
ggcttccacagcggccagcgtatgggccctgtctccactgtcggtttaactgcctcagaa
aacgaccaagaaatgaccccagccccaacagacttgcccaccacagcccccgagcctccc
gtcaagccgcacctgggggagctggcagtgacagacgccacccctgaatccctgagcctc
tcctggaccatccccgagggccagtttgaccacttcctgatccagtacaagaacggggac
gggcagcccaaggcagtgagggtgccgggatacgaggacggggtcaccatctcgggcctg
gagccagaccacaagtacaagatgaacctgtacggcttccatgacagccagcgcgtgggc
cccgtctcagtcatcggggtgacaagtgagtggacggtggaagccccaggccccacagaa
cccagcacagaggcctcggaggcccccgaggagcccctcctgggggagctgacggtgaca
ggatcctccccggactcgctgagcctctcctggaccgtcccccagggccacttcgactcc
ttcaccgtccagtacaagggcagggacgggccccaggtggtgcgtgtcgggggcgaggag
agtgaggtgaccgctgggggcctggagcccgggcgcaagtacaagatgcacctgtacggc
ctgcacggggggcggcgtgtgggccccgtgtccaccgtgggcgtgaccgccctccaggag
gttgtggaggagacccccagccccacagaacccagcacagaggccccggagccccccaaa
gagcctctcttgggggaactgacggtaacaggatcctccccagactcgctgagcctctcc
tggaccgtcccccagggccacttcgactccttcaccgtccagtacaagggcagggacggg
ccccaggtggtgcgtgtcgggggcgaggagagtgaggtgaccgctgggggcctggagccc
gggcgcaagtacaagatgcacctgtacggcctgcacggggggcggcgtgtgggccccgtg
tccaccgtgggcgtgaccgatccagactacaaagccatgaccacccaagccccgtccact
gcggcccccgagcctcccgtcaagccgcagctgggggagctggcagtgtcagacgccacc
cccgaatccctgagcctctcctggaccatccccgagggccagtttgaccacttcctgatc
cagtacaagaacggggacgggcagcccaaggcagtgagggtgccgggatacgaggacggg
gtcaccatctcgggcctggagccagaccacaagtacaagatgaacctgtacggcttccat
gacagccagcgcgtgggccccgtctctgtcatcggggtgacaactgcagaggaagacacc
cccagccccacagagatggaggagacccccagccccacagaacccagcacagaggcctcg
gaggcccccgaggagcccctcctgggggagctgacggtgacaggatcctccccggactcg
ctgagcctctcctggaccgtcccccagggccacttcgactccttcaccgtccagtacaag
ggcagggacgggccccaggtggtgcgtgtcgggggcgaggagagtgaggtgaccgctggg
ggcctggagcccgggcgcaagtacaagatgcacctgtacggcctgcacggggggcggcgt
gtgggccccgtgtccaccgtgggcgtgaccgcctccctgaatacagagccccccgtggca
gcccgcctgggggagctggctgtggcagccgtgacctcagacacagtgcacctctcctgg
acagtggagcagggccccttcgactccttcctggtccaatacaaggatgtgcaggagcag
ccccaggcagtgcccgtgggtggacacctccgagaggtctcggtttcgggtctggccccg
ggccacaagtacaagttcctactcttcggactccaggatgagaaacaacacggtccagtc
tctgcagacgcaaagaccctcccagacactaaaccttctctccgcctgggggagctgaca
gtgacggacgtgaccccagactccgtgggcctctcctggacggtccccgagggcgaattt
gactccttcgtggttcagtacagggacagggacgggcagccccacgtggtgcctgtggcc
tcagaccagcgcgagatcaccatccccggcctggagcccagtagcaaatacaagtttctg
ctctatgggctggcaggcaggaaacgactgggccccatctctgctgaaggcagcacggct
cccctgaagacggagcggcagcgcccaccccgcctgggggagctgacagtgacagacgag
acctcggattctctgcgcctcttgtggacagtggcccagggcccctttgactcctttgtg
gtccagtacagggacacagatgggcagccctcggcggtgcccgtggccgcagaccagagg
gaggtcaccatagagggcctggagcctggcaggaagtacaaatttctgctctatgggctc
ctcagaggacagcgcctgggccccgtctctgtcctgggaatgacagccccagaagaggcc
atgccagcaccctggcacctggccacggaggcccccgagccccctgaagggccccgcctg
ggggcactggcagtgaaggacgtgtccccggacgccctacgcctctcatggagcgtggcc
cagggcccctttgactccttcgtggtccagtatcagagcacaggtggggagccccaggcc
ttgctcgtgggtggcgaccagaacaaggtcctcgtgtcaggcctggagcccagcacctcc
tacaagttcttcctctacggcttgcatgaagggaagcgcctggggcccgtctcagccgag
ggcaccacagggccagctcctgctggccagacccccggagagccagggccgcgcctatcc
cacctgtcggtgactgatgtgaccaccaattcgctgcggctcgactgggaggccccaccc
gaggcctttgactccttcctgctccgctttggggtcccgtcaccgagcactctggagctg
catccacgttccccgctgcagcgggagctgacggtgccagggacacggcgctcggctgtg
ctccgggacctgagtccagggaccctgtacacccttacgttgtatgggctgcgcgggcct
cacaaggccgacagcgtccagggcacagcccggaccctcagcccagctctggagcgcccc
cgtgacctccagttcagcgagatcggggagacgtcagccaaggtcagctggatgcccccc
ccgtccagagtggacagcttcaaagtttcctaccagctggcagatggaggggagccgcag
agtgttcaggtggatggccgagcccggacccagaaactcgaggggctcgtcccaggcgct
cactacgacgtcaccgtggtctccgtccgtggctttgaggagagtgagcctctcacaggc
ttccttaccacggttcctgacggccccacccagctgcgtgcactgaacctgacggaagga
tccgccctgctgctttggcagcccccccaggccccggtggatacctatgacatcaaggtc
acagccccaggagcgccctcgctacagggctcggcccctggcagcgccgtggactacccc
ctgcaaggccttgagctccacaccaactacacggcgaccctgcatggccttcggggcccc
agcctcacctccccagccagcatcaccttcaccacagggttggaggccccccaggacttg
gaggccaaggaggtgaccccccgcacagccctgctcacttggatcgcgcccccagtcccc
ccaactggctacctgctcagcttcaacacccctggtggacagacccaggagatcctgctc
ccaggaggggtcacctctcaccagcttcacggcctctttccctccaccccctatagcgtg
tggctccgggccatgtgggaccacagcttcactccgcccgtgtccacttccttcaccact
ggtggacttcggatccccttcccccgggactgtggggaggagatgcagaacgggcccagc
acctcaaggaccaccaccatcttcctcaacggcaaccgcgagcggcccctgaacgtcttt
tgtgacatggagaccgatgggggcggctggctggtgttccagcgccgcatggatggaaaa
actgacttctggagggactgggaggcctatgcccacggttttgggaacatctccgcggag
ttctggctgggcaacgaggccctgcacagcctgaccaaagccggcgactactccttgcgt
gtggacctgcgggatggggacgaggccgtgtttgcccagtacgactccttccaagtagac
tcggccgatgagtactaccgcctccacctggaaggctaccgcggcactgcaggggactcc
atgagctaccacagcggcagcgtcttctctgcccgtgaccgagaccccaacaacttgctc
atctcctgcgccgtctcttaccgaggggcctggtggtataggaactgccactatgccaac
ctcaacgggctctacgggagcacagtggaccaccagggagtgagctggtactactggaag
ggctttgacttctccgtgcccttcacggaaatgaagctgagaccaagaagctaccggccc
ccgggcaggggaggctga

KEGG   Physeter catodon (sperm whale): 102985243
Entry
102985243         CDS       T06011                                 

Gene name
RELN
Definition
(RefSeq) reelin isoform X1
  KO
K06249  reelin [EC:3.4.21.-]
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05017  Spinocerebellar ataxia
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102985243 (RELN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102985243 (RELN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102985243 (RELN)
 09160 Human Diseases
  09164 Neurodegenerative disease
   05017 Spinocerebellar ataxia
    102985243 (RELN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102985243 (RELN)
Enzymes [BR:pcad01000]
 3. Hydrolases
  3.4  Acting on peptide bonds (peptidases)
   3.4.21  Serine endopeptidases
    3.4.21.-  
     102985243 (RELN)
SSDB
Motif
Pfam: BNR EGF_2 EGF_Tenascin Reeler EB hEGF
Other DBs
NCBI-GeneID: 102985243
NCBI-ProteinID: XP_023971724
UniProt: A0A2Y9S785
LinkDB
Position
5
AA seq 3460 aa
MERSGWAPRTFLLALVLGATLRVRAAVGYYPRFSPFFFLCTHHGELEGDGEQGEVLISLH
IAGHPTYYVPGQEYHVTISTSTFFDGLLVTGLYTSTSVQASQSIRGSSAFGFGIMSDHQF
GNQFMCSVVASHVSHLPTTNLSFVWIAPPAGTGCVNFMATATHKGQIIFKDALAQQLCEQ
GAPTEATMHPHLAEIHSDSIILRDDFDSYHQKELNPNIWVECNNCETGEQCGAIMHGNAV
TFCEPYGPRELITTGLNTTTASVLQFSIGSGSCRFSYSDPCIIVSYAKNNTVDWIQLEKI
RAPSNVSTIIHILYLPEDAKGENVQFQWKQENLRVGEVYEACWALDNVLVINSAHRQVVL
EDNLDPIDTGNWLFFPGATVKHSCQSDGNSIYFHGNEGSEFNFATTRDVDLSTEDIQEQW
SEEFESQPTGWDILGAVIGTECGTIESGLSMVFLKDGERKICTPYLDTTGYGNLRFYFVM
GGICDPGESHENDIILYAKTEGRKEHITLDTLSYSSYKVPSLVSVVINPELQTPATRFCL
RQKNHQGHNRNVWAIDFFHVLPVLPSTMSHMIQFSINLGCGTQQPGNSVSLEFSTNHGRS
WSLLHTECLPEICAGPHLPHSTIYSSENYSGWNRITIPLPNAALTRDTRICWRQMGPILG
NMWAIDNVYVGPSCLKFCSGRGQCTRHGCKCDPGFSGPACEMASQTFPMFISESFGSSRL
SSYHNFYSIRGAEVSFGCGVLASGKALVFNKDGRRQLITSFLDSSQSRFLQFTLRLGSKS
VLSTCKAPDQPGEGVLLHYSYDNGITWKLLEHYSYLNYHEPRIISVELPDDARQFGIQFR
WWQPYHSSQGEDVWAIDEIIMTSVLFNSISLDFTNLVEVTQSLGFYLGNVQPYCGHDWTL
CFTGDSKLASSMRYVETQSMQIGASYMIQFSLVMGCGQKYTPHMDNQVKLEYSTNHGLTW
HLVQEECLPSMPSCQEFTSASIYHASEFTQWRRVIVLLPQKTWSSATRFRWSQSYYTAQD
EWALDSIYIGQQCPNMCSGHGSCDHGVCRCDQGYQGTECHPEAALPSTIMSDFENPNAWE
SDWQEVIGGEIVKPEEGCGVISSGSSLYFSKAGKRRLVSWDLDTSWVDFVQFYMQIGGES
AACNKPDSREEGILLQYSNNGGIQWHLLAEMYFSDFSKPRFVYLELPAAAKTPCTSFRWW
QPVFSGEDYDQWAVDDIIILSEKQKQIIPVVNPTLPQNFYEKPAFDYPMNQMSVWLMLAN
EGMVKNETFCSATPSAMVFGKSDGDRFAVTRDLTLKPGYVLQFKLNIGCANQFSSAAPVL
LQYSHDAGMSWFLVKEGCYPASAGKGCEGNSRELSEPTLYHTGDFEEWTRITIVIPRSLA
SSKTRFRWIQESSSQKNVPPFGLDGVYISEPCPSYCSGHGDCVSGVCFCDLGYTATQGTC
VSIVPNHSEMFDRFEGKLSPLWYKITGGQIGTGCGTLNDGKSLYFSGPGKREARTIPLDT
RNIRLVQFYIQIGSKTSGITCIKPRARNEGLVVQYSNDNGILWHLLRELDFMSFLEPQII
SIDLPREAKTPATAFRWWQPQHGKHSAQWALDDVLIGMNDSSQTGFQDKFDGSLDLQANW
YRIQGGQVNTDCLSMDTALIFTENIGKPRYAETWDFHVSASTFLQFEMSMGCSKPFSDSH
SVQLQYSLNNGRDWHLVTEECVPPTIGCLHYTESSIYTSERFQNWKRITVYLPLSTISPR
TRFRWIQANYTMGADSWAIDNVVLASGCPWMCSGRGICDAGRCVCDRGFGGTYCVPVVPL
PSILKDDFNGNLHPDLWPEVYGAERGNLNGETIKSGTSLIFKGEGLRMLISRDLDCTNTM
YVQFSLRFIAKGTPERSHSILLQFSANGGITWHLMDEFYFPQMTNILFINVPLPYTAQTN
ATRFRLWQPYNNGKKEEIWIVDDFIIDGNNLNNPVMLLDTFDFGPREDNWFFYPGGNIGL
YCPYSSKGAPEEDSAMVFVSNEVGEHSITTRDLNVNENTIIQFEINVGCSTDSSSADPVR
LEFSRDFGATWHLLLPLCYHSSSHVSSLCSTEHHPSSTYYAGTTQGWRREVVHFGKLHLC
GSVRFRWYQGFYSAGSQPVTWAIDNVYIGPQCEEMCNGHGSCINGTKCICDPGYSGPTCK
ISTKNPDFLKDDFEGQLESDRFLLMSGGKPSRKCGILSSGNNLFFNEDGLRMLMTRDLDL
SHARFVQFFMRLGCGKGVPDPRSQPVLLQYSLNGGLSWSLLQEFLFSNSSNVGRYIALEI
PLKARSASTRLRWWQPSENGHFYSPWVIDQILIGGNISGNTVLEDDFTTLESRKWLLHPG
GTKMPVCGSTGDALVFIEKASTRYVVTTDIAVNEDSFLQIDFAASCSVTDSCYAIELEYS
VDLGLSWHPLVRDCLPTNVECNRYHLQRILVSDTFNKWTRITLPLPPYTRSQATRFRWHQ
PAPFDKQQTWAIDNVYIGDGCPDMCSGHGRCIQGNCVCDEQWSGLYCDEPQTSLPTQLKD
NFNRAPSNQNWLTVNGGKLSTVCGAVASGMALHFSGGCSRLLVTVDLNLTNSEFIQFYFM
YGCLITPNNRNQGVLLEYSLNGGITWNLLMEIFYDQYSKPGFVNILLPSDAKEIATRFRW
WQPRHDGLDQNDWAIDNVLISGSADQRTVMLDTFSSAPVPQHERSPSDAGPVGRIAFDMF
MEDKTAVNEHWLFHDDCTVERFCDSPDGVMICGSHDGREVYAVTHDLTPTEGWIMQFKIS
VGCKVSEKVTQNQIHVQYSTDFGVSWNYLVPQCLPADPKCSGSVSQPSVFFPTKGWKRIT
YPLPDSLVGNPVRFRFYQKHSDMQWAIDNFYLGPECLDSCRGHGDCLKEQCICDPGYSGP
NCYLTHTLKTFLKERFDSEEIKPDLWMSLEGGSTCTECGILAEDTALYFGGSTVRQAITQ
DLDLRGAKFLQYWGRIGSENNMTSCHRPICRKEGVLLDYSTDGGITWTLLHEMDYQKYIS
VRHDYILLPEDALTNTTRLRWWQPFVISNGLVVSGVERAQWALDNILIGGAEINPSQLVD
TFDDEGTSHEENWSFYPNAVRTAGFCGNPSFHLYWPNKKKDKTHNALSSRELIIQPGYMM
QFKIVVGCEATSCGDLHSVMLEYTKDARSDSWQLVQTQCLPSSSNSIGCSPFQFHEATIY
NAVNSSSWKRITIQLPDHVSSSATQFRWIQKGEETEKQSWAIDHVYIGEACPKLCSGHGY
CTTGAICICDESFQGDDCSVFSHDLPSYIKDNFESARVTEANWETIQGGVIGSGCGQLAP
YAHGDSLYFNGCQIRQAATKPLDLTRASKIMFVLQIGSTSQTDSCNSDLSGPHAVDKAVL
LQYSVNNGITWHVIAQHQPKDFTQAQRVSYNVPLEARMKGVLLRWWQPRHNGTGHDQWAL
DHVEVVLVSTRKQNYMMNFSRQHGLRHFYNRRRRSLRRYP
NT seq 10383 nt   +upstreamnt  +downstreamnt
atggagcgcagtggctgggcgccgcggactttcctcctggcgctggtgctgggggcgacg
ctgagggtgcgcgccgcggtgggctattacccccgcttttctcccttctttttcctgtgc
acccaccacggggagctggaaggggatggggagcagggcgaggtgctcatttccctgcac
attgcgggccaccccacctactacgtaccgggacaagaataccatgtgacaatttcaaca
agcaccttctttgatggcttactggtgacaggactgtacacatctacaagtgttcaggct
tcacagagcatcagagggtccagtgcttttggatttgggatcatgtctgaccaccagttt
ggtaaccagtttatgtgcagtgtggtggcctctcatgtgagtcatctgcccacaaccaac
ctcagcttcgtctggattgctccacctgctggcacaggctgtgtgaatttcatggctaca
gcaacacacaaaggccagattatcttcaaagatgctttagcccagcaattgtgtgaacaa
ggagctccaacagaggccactatgcacccgcatctagctgaaattcatagcgacagtatt
atcctacgagatgactttgactcctaccaccaaaaggaattgaatccaaacatatgggtt
gaatgtaacaactgtgagactggagaacagtgtggtgcaattatgcatggcaatgctgtc
accttctgtgagccatatggtccaagagaattgattaccacaggccttaatacaacaaca
gcttctgtcctccagttttccattggttcaggttcatgtcgctttagttattcagaccct
tgcatcattgtgtcatatgccaagaataatactgtggactggattcagcttgagaaaatt
agagccccttccaatgtcagcacaatcatccatatcctctaccttcctgaagacgccaaa
ggggagaacgtccagtttcagtggaagcaggaaaatctccgagtaggtgaagtatatgaa
gcctgctgggctttagataacgtcctggtcatcaactcagctcacagacaagttgtttta
gaagataatctggatccaatagacacgggcaactggcttttcttcccaggagctacagtg
aagcatagctgtcagtcagatgggaactccatttatttccatggaaatgaaggcagtgag
ttcaattttgccaccacccgggatgtagatctttctacagaggatattcaagagcagtgg
tcagaagaatttgagagccagcctacagggtgggacatcttgggagctgtcattggtaca
gaatgtggaacaatagaatcaggtttatcaatggtcttcctcaaagatggagagaggaaa
atatgcactccttacctggacacaactggttacgggaacctgaggttctactttgttatg
ggagggatttgtgaccctggagagtctcatgaaaatgatatcatcctgtatgcaaagacc
gaaggaagaaaagagcacattacactggataccctttcctattcctcatataaggttcca
tctttggtttctgtagtcatcaatcctgaacttcagactcctgctaccagattttgtctc
agacaaaagaaccatcaaggacataataggaatgtctgggccatagattttttccacgtc
ttacctgttctcccttctacaatgtctcacatgatacagttttccattaatctgggatgt
ggaacacagcaacctgggaacagtgtcagcttagagttttctaccaaccatgggcgttcc
tggtccctgctccacactgaatgtttgcctgagatctgtgctggaccgcaccttccccac
agcacaatctactcctcggaaaactacagtgggtggaaccgaataacgattcccctgcct
aacgcagcactaacccgggataccagaatttgctggagacaaatgggaccaatccttgga
aatatgtgggcaattgataatgtttatgttggtccatcgtgtctcaaattctgttctggc
agaggacagtgcactcgacatggttgcaaatgtgaccctgggttttctggcccagcttgt
gagatggcatcccagacattcccaatgtttatttctgaaagctttggcagttccaggctc
tcctcttaccataacttttactctatccgtggtgctgaagtcagctttggctgtggtgtc
ttggccagtggtaaggccctggttttcaacaaagatgggaggcgtcagctaattacatct
ttccttgacagctcacaatccaggtttctccagttcacattgaggctggggagcaagtct
gttctgagcacatgcaaagctcctgaccagcctggtgaaggtgttttattgcattattct
tatgataatgggataacttggaaactcttggagcattattcatatctcaactatcatgag
cccagaataatctctgtagaattaccagatgatgccagacagtttggaattcagttcaga
tggtggcaaccatatcattcttcccagggagaggatgtatgggctattgatgagattatc
atgacatctgtacttttcaacagcattagtcttgactttaccaatcttgtggaggtcact
caatctctgggattctacctgggaaatgttcagccatactgtggccacgactggaccctt
tgttttacaggagattctaaacttgcctcaagtatgcgctatgtggaaacacaatcaatg
cagataggagcatcctatatgattcagttcagcttggtgatgggctgtggccagaaatac
actccacacatggacaaccaggtgaaactagagtattcaaccaaccatggcctcacctgg
cacctcgtacaagaggaatgccttccaagtatgccgagttgtcaggaatttacatcagca
agtatttaccatgccagtgagttcacacagtggagaagagtcatagtgcttcttccgcag
aaaacttggtccagcgccacccgcttccgctggagtcagagctattacacagcccaagat
gagtgggcgttagacagcatttacattgggcagcagtgcccaaacatgtgcagtgggcac
ggctcctgcgaccatggcgtgtgcaggtgtgaccagggataccaaggcactgagtgccac
ccagaagctgcccttccttccacgatcatgtcggactttgagaacccaaatgcttgggag
tctgactggcaagaagttattgggggagaaattgtaaaaccagaagaagggtgtggagtc
atctcttctggatcatctctgtatttcagcaaggctggaaaaagacggttggtgagctgg
gatctggatacttcctgggtggactttgtccagttctacatgcagatcggtggagagagt
gctgcgtgcaacaagcccgacagcagagaggagggcatcctcctccagtacagcaacaat
gggggcatccagtggcacctgctggcagaaatgtacttctcagacttcagcaaacccaga
tttgtctatctggagctgccagctgctgccaagaccccttgcaccagtttccgctggtgg
cagcctgtgttctcaggggaagactatgaccagtgggctgtcgatgacatcatcattctg
tccgagaagcaaaaacaaatcatcccagttgtcaacccaactttaccccagaacttttat
gagaagccggcttttgattaccctatgaatcaaatgagtgtgtggttgatgttggctaac
gaaggaatggttaaaaatgaaactttctgctctgccacgccatcagccatggtgtttgga
aaatcagatggggatcgatttgcagtaactcgagatttgacgctgaagcctggatatgtg
ctgcagtttaagctaaacattgggtgtgccaatcaattcagcagcgctgccccagttctt
cttcagtactctcacgatgctgggatgtcctggtttctggtgaaagaaggctgttaccca
gcttctgcaggcaaaggatgtgaaggaaactccagagaactgagtgagcccaccctgtat
cacacaggggactttgaagaatggacaagaatcaccattgttattccaaggtctcttgca
tccagtaagaccagattccgatggatccaagagagcagctcacagaagaatgtacctccc
tttggtttggatggcgtgtacatatcagagccttgtcccagttactgcagtggccatggg
gactgcgtttcaggggtgtgtttctgtgacctggggtacactgccacacaaggaacctgt
gtgtctattgtccctaaccacagtgagatgttcgataggtttgaggggaagctcagcccg
ttgtggtacaagataaccggaggccagattggaacgggctgtggaacacttaacgatggc
aagtctctgtacttcagtggccccggaaaaagggaagcaaggaccatccctctggacacc
aggaatatcagactcgttcagttttatatacaaattggaagcaaaacttcagggattacc
tgcatcaaaccaagagctagaaatgaagggcttgttgttcagtattcaaatgacaatggg
atactctggcatttgcttcgagagttggacttcatgtcatttctggaaccacagatcatt
tccatcgaccttccacgagaggcaaagacacctgcaacagcttttcgatggtggcaaccc
caacatgggaagcattcagcccagtgggccttggatgatgtccttataggaatgaatgac
agctctcaaactgggtttcaagacaaatttgatggctctttagatttgcaagccaactgg
tatcgaatccaaggaggtcaagtcaatactgactgtctctctatggatactgctctgata
ttcactgaaaacatagggaaacctcgttatgctgagacctgggactttcatgtgtcagca
tcgactttcttgcagtttgaaatgagcatgggctgtagcaagccctttagcgactcccac
agtgtacagctccagtattctctgaacaatggcagggactggcatcttgtcaccgaagag
tgtgttcctccaaccattggctgtctgcactacacagaaagttccatttacacctccgag
agattccagaattggaagcggatcactgtctaccttccactctccaccatctctcccagg
acccggttcagatggatccaagccaactacaccatgggggctgactcctgggctattgat
aacgttgtgctggcctcggggtgcccttggatgtgctcaggacgagggatttgtgatgct
ggacgctgtgtgtgtgaccggggctttggtggaacctactgtgttcctgtcgttcctctg
ccctcgattcttaaagatgatttcaacgggaacttacatcctgacctttggcctgaagta
tatggtgcagagagggggaatctgaatggtgaaaccatcaaatctggaacatcgctcatt
tttaaaggggaagggctaagaatgctcatttcaagagatctagactgtaccaatacaatg
tatgtccagttttcacttcgatttatagcaaaaggtaccccagagaggtcccactctatt
ctattacaattctctgccaatgggggaatcacctggcacctgatggatgaattttacttt
cctcaaatgaccaacatacttttcattaatgttcccttgccatacactgcccaaaccaac
gctacgagattcagactctggcaaccttataacaatggtaagaaagaagaaatctggatt
gttgatgacttcatcatcgatggaaataatttaaacaaccctgtgatgcttctagatact
tttgactttgggcccagagaggacaattggtttttctaccctggtggtaacattggtctt
tattgcccatattcttccaaaggagcacccgaggaagattcagctatggtgtttgtttca
aatgaagttggtgagcattccattactactcgtgacctgaatgtgaatgagaacaccatc
atacaatttgagatcaatgtcggatgctcaactgatagctcctctgctgatccggtcaga
ctggaattttccagggactttggggcgacctggcacctgctgctgcccctctgctaccac
agcagcagccatgtcagctccttatgctccacggagcaccacccaagcagcacctactac
gcggggaccacccagggctggaggagggaggttgtgcactttgggaagctgcacctttgc
ggatctgtgcgtttcagatggtaccaaggattttattctgctggctctcagccagtgaca
tgggccattgataatgtttatatcggtccccaatgtgaagagatgtgtaacggacacggg
agttgtatcaatggaaccaagtgtatatgtgatcctggctactcaggtccaacctgtaaa
ataagcaccaaaaatcctgattttctcaaggatgattttgaaggtcagctagaatctgat
agattcttattaatgagcggtggaaagccatctcggaagtgtggaatcctttccagtgga
aacaacctctttttcaatgaagatggcttacgcatgttgatgacacgagatctggattta
tcacatgctagatttgtgcagttcttcatgagactgggatgtggtaaaggtgttccagac
cccaggagccaacctgtgcttctacaatattctctcaatggtggcctctcctggagtctc
cttcaagagttccttttcagcaactccagcaacgttggcaggtacatagccctggagata
cctttgaaagcccgttctgcttctacacgtctccgctggtggcaaccatctgaaaatggg
cacttctacagtccctgggtaatcgaccagattcttattggaggaaatatttctggtaat
acagtcttggaggatgatttcacaactctggaaagtaggaaatggctgcttcaccccgga
ggcaccaagatgcctgtgtgtggctctactggtgatgccctagtcttcatcgaaaaggcc
agcacccgttatgtggtcaccacagacattgctgtgaatgaggattcattcctacagata
gattttgccgcctcctgctcggtcacagactcttgttatgctattgagttggaatactca
gtagaccttggattgtcatggcacccattggtaagggactgtctgcctaccaatgttgaa
tgcaatcgctatcacctacagcggatcctggtgtcagatactttcaacaaatggaccaga
attactctgcctctcccaccttacaccaggtcccaagccactcgtttccgttggcatcaa
ccagctccttttgacaagcagcagacgtgggcaatagataatgtctatattggggatggc
tgcccagatatgtgcagtggccatggaaggtgtatccaaggaaactgtgtctgtgacgag
cagtggagtggcctgtactgtgatgagccccagacctcccttccaacccaactcaaagac
aacttcaatcgggccccatccaaccagaactggctgactgtgaatggagggaaattaagt
actgtgtgtggagctgtggcttcgggaatggctctccatttcagtgggggttgcagtcgg
ctgttggtcacagtggatctaaacctcactaattctgagtttatccaattttacttcatg
tatggatgcctgatcacaccaaacaaccgtaaccaaggtgttctcttggaatattctctc
aatggaggcattacctggaacctgctaatggaaatcttctatgaccagtacagtaaacct
ggattcgtgaatatccttctcccctctgatgctaaagagattgctactcgtttccgctgg
tggcagccaagacacgacggcctggaccaaaacgactgggccattgacaacgtcctcatc
tcagggtctgccgaccagaggaccgtcatgctggataccttcagcagcgccccagtgccc
cagcacgagcgttcaccctcggacgctggccctgtgggaagaattgctttcgacatgttt
atggaggacaaaaccgcagtaaatgagcactggctattccatgatgattgtacagtagaa
agattctgtgactcccctgatggtgtcatgatttgtggtagtcacgatggaagagaagtg
tatgcagtgacccatgacctgacccccactgaaggctggatcatgcaattcaagatctct
gttggatgtaaagtatctgaaaaagtcacccagaatcaaattcacgtgcagtactctact
gactttggtgtgagctggaattatctggtccctcagtgcctacctgcagacccaaaatgc
tctggaagtgtttctcaaccatctgtattctttccaactaaaggatggaaaaggatcacc
tacccacttcctgacagtttagtcggaaatccagtaaggtttaggttctatcagaagcac
tcagatatgcagtgggcaattgacaatttctacctgggccctgaatgcttggacagctgc
agaggccatggggactgcttaaaggaacagtgcatctgtgatccgggatactctgggcca
aactgctacttgactcacactctgaagactttcctgaaggaacgctttgacagtgaagaa
atcaagcctgatttatggatgtccttggaaggtggaagtacttgtactgagtgtgggatt
cttgcagaggacactgcactctattttgggggatccacagtgagacaagctattacccaa
gatttagatctcagaggggcaaaattcctgcaatactgggggcgcatcggtagtgagaat
aacatgacctcctgccatcggcccatctgccggaaggaaggcgtgctgttggactactct
actgatggaggaattacttggactttgctccatgagatggattatcagaaatacatttct
gtcagacacgactacatccttcttcccgaggatgccctcaccaacacaactcgacttcgc
tggtggcagccttttgtgatcagcaatggacttgtggtctccggggtagagcgtgctcag
tgggcactagacaacattttgattggcggagcagaaatcaatcccagtcaactggtcgac
acttttgatgatgaaggcacttcccatgaagaaaactggagtttttaccctaatgcagtg
aggacagcaggattctgtggcaatccatccttccacctctactggccaaataaaaagaag
gacaagactcacaatgcgctgtcctccagagaactcattatacagccaggatacatgatg
cagtttaaaattgtggtgggttgtgaagctacttcttgtggtgaccttcattccgtaatg
ttggagtacactaaggatgcaaggtccgattcctggcagctcgttcagacccagtgcctt
ccttcctcttcaaacagcattggctgctcccccttccagttccatgaagccaccatctac
aacgctgtcaacagctccagctggaagagaatcaccatccagctgcctgaccacgtctcc
tccagtgcaacacagtttcgctggatccagaagggtgaagaaacggagaagcaaagctgg
gcaattgaccacgtgtacatcggagaggcttgccccaagctctgcagtgggcatggatac
tgtaccaccggcgccatctgcatctgcgatgaaagtttccaaggagatgattgctctgtt
ttcagtcatgatcttcccagttatattaaagataattttgagtccgcaagagtcactgag
gcaaactgggagactattcaaggtggagtgataggaagtggctgtgggcagctggcaccc
tatgcccatggagactcactctattttaacggctgtcaaataaggcaagctgccaccaaa
cctctggatctcactcgagcaagcaaaatcatgtttgttttgcaaattgggagtacatcg
cagacagacagctgcaacagcgacctgagtggcccccacgccgtggacaaggccgtgctg
ctgcagtacagtgtcaacaacgggatcacctggcacgtcatcgcgcagcaccagccgaag
gacttcacacaggctcagagggtgtcctacaacgtccccctggaggcacggatgaaagga
gttttattgcgctggtggcagccacgccacaatggaacaggtcatgatcaatgggctttg
gaccatgtggaggtcgtcctagtaagcactcgcaaacaaaattacatgatgaatttttca
cgacaacatgggctcaggcacttctacaacagaagacgaaggtcacttagacgataccca
tga

KEGG   Physeter catodon (sperm whale): 102986411
Entry
102986411         CDS       T06011                                 

Gene name
VTN
Definition
(RefSeq) vitronectin
  KO
K06251  vitronectin
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04610  Complement and coagulation cascades
pcad05165  Human papillomavirus infection
pcad05205  Proteoglycans in cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102986411 (VTN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102986411 (VTN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102986411 (VTN)
 09150 Organismal Systems
  09151 Immune system
   04610 Complement and coagulation cascades
    102986411 (VTN)
 09160 Human Diseases
  09161 Cancer: overview
   05205 Proteoglycans in cancer
    102986411 (VTN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102986411 (VTN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102986411 (VTN)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102986411 (VTN)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102986411 (VTN)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102986411 (VTN)
SSDB
Motif
Pfam: Hemopexin Somatomedin_B Ribosomal_S13_N CXCR4_N
Other DBs
NCBI-GeneID: 102986411
NCBI-ProteinID: XP_007121821
UniProt: A0A2Y9FHN0
LinkDB
Position
Unknown
AA seq 479 aa
MASPRPLLMLALLAWVVLADQESCKGRCTEGFNADRKCQCDELCSYYQSCCSDYVVECKP
QVTRGDVFTLPEDEYSFHDYREETRSNTSVQAQPESPTQPFVVQAQPEATLEQAPVLNPE
EEAPGPERGDSEPEVGPLRPETGNLGVSESPAEEEQCSGKPFDAFTDLKNGSLFAFRGLY
CYELDEKAVRPGYPKLIRDVWGIEGPLDAAFTRINCQGKTYLFKGSQYWRFEDGVLDPDF
PRNISEGFKGIPDNVDAAFALPAHSYFGRERVYFFKGKQYWEYEFQQQPSQEECEGSSQS
AVFQHFALMQRDSWENIFRLLFWGRSSGGAGKPRFISQDWLGLPGQVDAAMAGHIYISGS
APRSSRTKMTKSARRNRKRYRSRRGRGRGRGRSQNPYRQSRSTWLSWFSSEEIGLGDYDN
YKMDWLVPATCEPTQSVYFFSGDKYYRANLRTRRVDAVFPPYPRSIAKYWLGCSVPAHE
NT seq 1440 nt   +upstreamnt  +downstreamnt
atggcatccccaaggccccttctgatgctggccctgctggcgtgggttgttctggctgac
caagagtcgtgcaagggccgctgcactgagggcttcaacgccgacaggaagtgtcagtgt
gacgagctctgctcttactaccagagctgctgctccgactacgtggtcgagtgcaagccc
caagtgactcgtggggacgtattcactctgccagaagacgagtacagcttccatgactac
cgcgaggagaccagatccaataccagcgtccaggcacagccagagagccccacccagcct
tttgtcgtgcaggcccagcctgaagcgactcttgagcaggcacctgttctgaaccctgag
gaagaagccccaggacctgagcggggggactcagagcctgaggtggggcccctcaggccc
gagactggtaatctaggggtctccgagtccccagcagaggaagagcagtgcagcgggaag
ccctttgatgccttcaccgacctcaagaatggttccctctttgccttccgagggctctac
tgctatgagctggatgaaaaggccgtgaggcctgggtaccccaagctcatccgcgatgtc
tggggcatcgagggtccccttgatgctgccttcacccgcatcaactgtcaggggaagacc
tacctcttcaagggtagtcagtactggcgctttgaggatggtgtcctggaccccgatttc
ccccgcaacatctctgaaggcttcaagggcattccggacaacgtggacgcagccttcgcc
ctccccgctcatagctactttggccgggagcgggtctacttcttcaagggcaaacagtac
tgggagtacgagttccagcagcagcccagtcaagaggagtgcgaaggcagctcccagtcg
gccgtgtttcaacactttgccctgatgcagcgggacagctgggagaacatcttcagactt
cttttctggggccgttcctctggtggtgctggaaagcccaggttcatcagccaggactgg
ctcggtttgcctggacaagtggatgcggccatggctggccacatctacatctcaggctca
gctccccgctcctctcggaccaagatgaccaagtctgcgcgtcgtaatcgtaaacgctac
cgctcacgccgtggccgtggtcgtggccgtggccgaagccagaacccctaccggcaatct
cgttccacctggctgtcctggttctccagtgaggagataggcctgggagactatgataac
tacaagatggactggcttgtgcctgccacctgtgagcccacccagagcgtctacttcttc
tcaggagacaagtactaccgagcgaaccttcgcacgcggcgagtggatgctgtgttccct
ccctacccacgctccatcgccaagtactggctaggctgctcagtccctgcccacgagtag

KEGG   Physeter catodon (sperm whale): 102986665
Entry
102986665         CDS       T06011                                 

Gene name
COL6A3
Definition
(RefSeq) collagen alpha-3(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102986665 (COL6A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102986665 (COL6A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102986665 (COL6A3)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102986665 (COL6A3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102986665 (COL6A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102986665 (COL6A3)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102986665 (COL6A3)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102986665 (COL6A3)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102986665 (COL6A3)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   102986665 (COL6A3)
SSDB
Motif
Pfam: VWA VWA_2 Collagen VWA_3 Kunitz_BPTI fn3 HC2
Other DBs
NCBI-GeneID: 102986665
NCBI-ProteinID: XP_007114152
UniProt: A0A2Y9F4S9
LinkDB
Position
2
AA seq 3214 aa
MRKHRHLPLVAIFCLFFSGFSFTHAQQQADVKNGAAADIMFLVDSSWSIGKEHFQLVQEF
LYDVIKSLAVGENDFRFALVWFNGSPDTEFLLNTYRSKQEVLSHVSNMSYTGGSNQTGKG
LEYVMRNHLTEVAGSRASDGVPQVIVVLTDGHSKDALALPLAELKSADVNVFAIGVEDAD
EGALKETASEPLNVHMFNLENFTSLHDIVGNLVACVHSSMTPERAGGTETLKDITAQDSA
DIIFLIDGSNNTGSVNFAVIRDFLVNLLERLSVGTQQIQVGVVQYSDEPRTLFSLNSYST
KAQVLDAVKALGFIGGELANVGLALDFVLENHFTRAGGSRVEEGVPQVLVLISAGPSSDE
IRDAVIALKQASIFSFGLGAQAAPKAELQHIATNDNLVFTVPEFRSFGDVQEQLLPYIVG
VAQRHIVLQPPTIVTQVIEINKRDIVFLVDGSSALGLVSFNAIRDFVAKVIQRLEIGQDY
IQVAVAQYADTVRPEFYFNTYPTKREVVTAVRKMKPMDGSALYTGSALDFVRNNLFTGAA
GYRAAEGVPKLLVLITGGKSLDEVSQPAQELKRSSIMAFAIGNKVANRAELEEIAFDSSL
VFIPAEFRAAPLQGVLPGLLAPLRTLSGTTEVHVNKRDIIFLLDGSFNVGKTNFPYVRDF
VMNVVNSLDVGSDHIRVGLVQFSDTPVTEFSLNTYQTKADLLAHLRQLQLKGGLGLNTGA
ALSYVHDEHFTEAGGSRIRDRVPQLLLLFAAGQSEDSYLQAANALARAGILTFCVGTSQA
NKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILFL
FDGSANLVGQFPAVRDFLYKVIDELDVKPDRTRVAVAQYSDDVRVESRFDEHQSKPEILN
LVKRMKIKTGKALNLGYALDYAQRYIFVKSTGSRVEDGVLQFLVLLVAGRSSDRVDRPAL
NLKQSGVVPFILQAKNADPAELEQIVPSPAFILAAESLPKIGDLQAQIVNLLKSVQNGAP
TPVSGEKDVVFLIDGSEGVRNGFPLLKEFVQRVVESLDVGADRVRVAVVQYSDRTRPEFY
LNSYMDQQSIVGAIRGLTLLGGPTPNTGAALDFVLRNILIGSAGSRIAEGVPQLLIVLTA
DRSGDDVRGPSVVLKRGGAVPIGVGIGNADITEMQTISLIPDFAVVIPTFRQLGTVQQVI
SERVTQLSREELSRLRASVVPPTTPGVGGKRDVVFLIDGSQSASPEFQYIRTLIERLVDY
LDVGFDMTRVAVIQFSEDPRVEFLLNAHSSKDEVQNAVRRLRPKGGRQINIGGALEYVSR
NIFKRPLGSRIEEGVPQFLVLISSGKSDDEVDEPAVELKQFGVAPLTIARNADQEQLVKI
SLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQVQQLLASTRYPPPAVESDAADIVFLI
DSSDSVRPDGIAHIRDFVSRIVRRLNIGPNKVRIGVVQFSNDVFPEFYLKTYKSQANVLD
AIRRLRFKGGSPLNTGKALEFVARNFFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRYSQ
VISSSGIVSLGIGDRNIDRMELQTITNDSRLVFTVREFRELPNIEERVMNSFGPSKVTPA
PPGVDTPSPSRPETKKADIVFLLDGSINFRRDSFQEVLHFVSEIVDTVYEGGDSIQVGLV
QYNSDPTDEFFLKDFSTKQQIIDAINKVVYKGGRHANTKVGLEHLRLNHFVPEAGSRLDQ
RVPQIAFVITGGRSVEDAQEASLALTQRGVKVFAVGVKNIDSEEVGKIASNSATAFRVGN
VQELSELSEQVLETLHDAMHETLCPGVPDVSKVCNLDVILGFDGSRDQNIFVAQRSFESK
MDTILNRISQMQRISCSGSQLPMVRVSVVANTPSGPVEAFDFAEYQPELFEKFQNMRTQH
PYVLTADTLKLYQNKFQQASPDSVKVVIHFTDGVDGDLADLQRASEQLRQEGVRALIFVG
LERVSNLEQLMQLEFGRGFMYSRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQRG
DRGPIGSIGPKGIPGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGIKGSRGFPG
EKGELGEIGLDGLDGEDGDKGLPGISGEKGNPGRRGDKGPKGDKGERGDVGIRGDPGNSG
QDSQQRGPKGETGDIGPMGLPGTDGVSGGPGEPGKSGGVGRRGPSGAKGNKGGPGQPGSV
GEQGTRGAQGPPGPTGPPGLIGEQGILGPRGSGGTAGAPGERGRTGPLGRKGEPGDPGPK
GSVGNRGPRGETGDDGRDGVGSEGRRGKKGERGFPGYPGPKGYPGEPGTDGALGPKGIRG
RRGNSGPPGVAGQKGDPGYPGPSGYKGSRGDSMDQCALVQSIKDKCPCCYGPLECPVFPT
ELAFALDTSEGVTQDTFSRMRDVVLKIVDDLTIAESNCPRGARVAVVTYNNEVTTEIRFA
DSKKKSVLQDKIKNLQVALTSKQQSLETAMSFVARNTFKRVRNGFLMRKVSVFFSNKPTT
ESPQLREAVLKLSDAGITPLFLTSQEDRQLVNALQINNTAVGHALVLPASGDLTDFLKKV
LTCHVCLDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDIAFILDSSESTTLFQFNEMRKY
IEYLVQQLDMSPDPKASQHFARVAVVQHAPYESMGNASVPPVKVEFSLTDYGSKEKLLAF
LGSRMTQLQGTRALGSAIDYTIENIFESAPNPRDLKLVVLMLTGEVKNQQLEEAQRAILQ
AKCKGYFFVILGIGRKVNVKELYSFASEPNDIFFKLMDKSTELNEEPLMRFGRLLPSFIS
SKNAFYLSPDIRKQCDWFQGDQPAKNLVQFGYKQINVPNNVTSSPTSKPVTTAKPVTTTT
KPVTVVNLPTSKPASMRPVAERPVAGRPMATKPEAVKSTATKPEAVKSTATKPEAVKSTA
TKPEAAKPLASPVATKPEATKPEVSKTATVRSAVATRPAAAKPAPARPPAAAKPMAAKPE
APRPQAAKLAATRPATAKPMALYLAVKAPREVHASEITENSAKLHWERPEPPSPYLYNLT
ITSAHDQSPVLKQNLTVTDRVIGGLLPGQTYHVTVICYLRSQVRAIYQGSFSTKKIQPPP
LQTARSASSSTINLVVSAERLAGSKTDICKLPKEEGTCRDFILKWYYDSVTESCARFWYG
GCGGNENRFDSQDECEKVCPPVLIKPGVIAAIGT
NT seq 9645 nt   +upstreamnt  +downstreamnt
atgaggaagcatcggcatttgcctttagtggccatcttttgcctttttttctcaggcttt
tcctttactcatgcacagcaacaagcagatgtcaaaaatggcgctgctgccgatataatg
tttctagtggattcctcttggagcattggaaaggagcatttccaacttgttcaagagttt
ctatatgatgttataaaatctttagctgtgggagaaaatgatttccgttttgctctggtc
tggttcaacggcagcccagataccgagttcctgttaaatacgtatcgttctaaacaagaa
gtcctctcccatgtttccaacatgtcttatactgggggaagcaatcagactggaaaagga
ttagaatacgtaatgcgaaatcacctcactgaggttgccggaagccgggccagtgatgga
gtgcctcaggttatcgtagtgttaaccgatggacactcgaaggatgcccttgctctgcct
ttagcggaacttaaatctgctgatgttaacgtgtttgcaattggagttgaggatgcagat
gaaggagcgttaaaagaaacagcaagtgaaccgctcaatgtgcatatgttcaacctagag
aattttacctcacttcatgacatagtaggaaacttagtggcctgtgtgcattcatccatg
actccagaaagggctggaggcacagagacccttaaagacatcacagcacaagactctgct
gacattattttcctaattgatggatcaaacaacaccggaagtgtcaatttcgcagtcatt
cgcgacttccttgtaaatctccttgagagactctcagttggaactcagcagatccaagtg
ggggtggtccagtatagcgatgagcccagaaccctgttctccttaaacagttattccacc
aaggctcaggttctggacgcagtgaaagccctggggttcatcggtggggagttggccaat
gtcggccttgcccttgattttgtgctggagaatcacttcacccgtgcagggggcagccga
gtggaggaaggggttccccaggtgctggtcctcataagtgctgggccttctagtgatgaa
atccgagacgcggtcatagcactgaagcaggctagcattttctcgtttggcctcggagcc
caggccgcccccaaggcagagcttcagcacatagctaccaatgacaacttggtgtttact
gtcccggaattccgtagctttggggacgtccaggagcaattactgccgtacattgttggc
gtggcccaaaggcacattgtcttgcaaccgccaaccattgtcacccaagtcattgaaatc
aacaagagggacatagttttcctggtggatggctcttctgcactgggactggtcagcttc
aatgcaatccgcgacttcgttgccaaggtcatccagaggctggagatcggacaggattat
atccaagtggcagtggctcagtatgcagacacagtgaggccggagttttatttcaatacc
taccccaccaagagggaggtcgtgaccgccgtgaggaagatgaagcccatggatggctcg
gccttgtacacaggctctgctctggactttgttcggaacaacctgttcacgggggcggcc
ggctatcgggccgccgagggggtccctaagcttttggtgctgattacaggtggtaagtcc
ttagatgaagtcagccagcctgcccaggagctgaagagaagcagcatcatggcctttgcc
atcgggaacaaggtggccaaccgggctgagctggaagagatcgccttcgattcctccctg
gtgttcattcctgccgagttccgagccgcccctttgcagggtgtgctgcctggcttgctg
gcgcctctcaggaccctctctggaaccactgaagttcacgtaaacaaaagggatatcatc
tttcttttggatggatcgttcaacgttggaaagaccaatttcccttatgtgcgggacttt
gtcatgaacgtagttaacagccttgatgttggaagcgaccatattcgtgtcggtttagtg
caatttagtgacactccggtgacggagttctctctaaacacctaccagaccaaggcagat
ttgctagctcacctgaggcagctgcagctcaaggggggtttgggcctgaacacgggcgca
gccctaagctatgtccatgacgagcactttaccgaagctggtggcagcaggatccgggac
cgtgtgccgcaactcctgctcctgttcgcagccgggcagtctgaggactcctatttgcaa
gctgccaacgccctagcacgcgcaggcatcctgacattttgtgtgggcactagccaggcc
aacaaggcagagcttgagcagattgcttttaacccgagcctggtgtatctcatggatgat
ttcagctccctgcctgctctgcctcagcagctgattcagcccctaaccacttatgttagt
ggaggtgtggaggaagtgccactcgcccagccagagagcaagcgagacatcctgttcctc
tttgatggctcagccaatctcgtgggccagtttcctgccgtccgcgacttcctctacaag
gtcatcgatgagctggatgtgaaacctgacaggacccgggttgcggtggctcagtacagc
gacgatgtcagggtggagtcccgtttcgatgagcaccagagcaagcccgagatcctgaat
ctcgtgaagagaatgaagatcaagacgggcaaagccctcaacctgggctacgccctggac
tacgcgcagaggtacatctttgtgaagtccactggaagccgggtcgaggatggcgtgctt
cagttcctggtgctgctggtggcgggaaggtcatccgatcgtgtggacaggccagctctc
aacctgaaacagagtggggtggtgcctttcatcctccaggccaagaacgcagaccctgcc
gagttggaacagatagtaccgtcccccgcctttatcctggctgcggagtcacttcccaag
attggagaccttcaagcacagatcgtaaacctcttaaaatcggtgcaaaatggggcgcca
acaccagtttcaggtgaaaaggatgtggtgtttctgattgacggctccgagggtgtcagg
aatggcttcccgctgttgaaagagtttgtccagagagtggtggagagcctggacgtgggt
gcggaccgggtgcgcgtagctgtggtgcagtacagcgaccggaccaggcccgagttctac
ctgaattcctacatggaccagcagagcatcgtcggtgccatccgcgggctgaccctgctg
ggcgggccaacccccaacaccggggccgccttggactttgtcctgaggaacatcctgatt
ggttccgccggcagccgcatagcagaaggggtcccccagctcctgatcgtcctcacggcc
gacaggtctggggatgatgtgagaggcccctcggtggtcctgaagaggggaggggccgtg
cccatcggcgtcggcatcgggaacgctgacatcaccgagatgcagaccatctccctcatc
ccggacttcgcagtggtcattcccaccttccggcagctggggaccgtccaacaggtcatc
tctgagagagtgacccagctgagccgcgaggagctgagcaggttgcgggcatctgtggtc
cctccaacgaccccgggtgttggcggcaagagggatgtggtctttctcatcgatgggtcc
caaagcgccagccccgagtttcagtacatccgtaccctcatcgagaggctggttgactac
ttggatgtgggcttcgacatgacccgggtggcggtcatccagttcagcgaggaccccagg
gtggagttcctgctgaacgcccactccagcaaggacgaggtgcagaacgcggtgaggcgg
ctgcggcccaaaggcgggcggcagatcaacattgggggcgccctggagtatgtgtcaagg
aatatcttcaagaggcctctggggagccggattgaagagggtgtccctcagttcctggtc
ctcatttcttccgggaagtctgacgatgaggtagatgaaccggcagtagagctcaagcaa
tttggcgtggcgccacttacaatcgcaaggaacgcagatcaggagcagttggtgaagatc
tccctgagccctgaatacgtgttctcagtgagcaccttccgggagctgcccagcctggag
cagaaactgctgacacccatcacaaccctaacctccgagcaggttcagcagctcctggcc
agcacgcgctaccctcctccagccgtggagagtgatgctgcagacatcgtcttcctaatc
gacagctctgacagtgtgaggccggacggcattgcacacatccgggactttgttagcagg
atcgtgaggagacttaacattggccctaataaagtgagaatcggggtcgtgcagttcagc
aatgatgtcttcccagagttttacctgaagacctataaatcccaggccaatgtgctggat
gccatacgtcgcctgaggttcaaaggggggtctccgctgaacactggcaaagctctggaa
ttcgtggcaaggaatttctttgtcaagtctgccgggagccggatagaagatggggtgccc
caacacctggttctgtttctgggtggcaaatcccaggacgatgtttccaggtattcccag
gtgataagctcctcggggattgtgagtttagggataggagaccgaaacattgacagaatg
gagctgcagaccatcaccaacgactccagactagtcttcacggttcgggagttcagagag
ctccccaacatagaagaaagagtcatgaactcctttggaccctccaaggtcacccctgca
cctccgggagtggacacgccctctccctcacggccagagacaaagaaagcagacatcgtg
ttcctgttggatggttccatcaatttccggagggacagtttccaggaagtgctccatttc
gtgtctgaaattgtggacacggtttatgaagggggagactccatccaagtggggctggtc
cagtacaactctgaccccactgacgaattcttcctgaaggacttttccaccaagcagcag
attattgatgccatcaacaaggtggtctacaaaggggggaggcacgcaaacaccaaggtg
ggcctggagcacctgcggctgaatcacttcgtgccagaggcaggcagtcgcctggatcag
agagtcccgcagattgcctttgtgatcaccggaggaaggtcggtggaggatgcccaggag
gcgagcctggcgctcacccagagaggcgtcaaagtgttcgcggtgggcgtgaagaacatc
gactcagaggaggttgggaagattgcgtccaacagtgccacggcgttccgagtggggaac
gtccaggagctgtcggaattgagcgagcaggttttggaaaccctgcatgacgcgatgcat
gaaaccttatgtcctggtgtgcctgatgtttccaaagtctgtaacctggatgtgattctg
ggctttgatggttcaagagatcagaacatatttgtggctcagaggagctttgagtccaag
atggacaccatcctgaatagaatcagccagatgcaaagaatcagctgcagcggcagccag
ctgcccatggtgcgggtgtccgtggtggccaacacgccctcaggccccgtggaggccttt
gactttgctgagtaccagccagagctgtttgagaagttccagaacatgcgcacccagcac
ccctacgtcctcactgcggacacgctgaagctctaccagaacaagttccagcaggcctcg
ccagacagtgtgaaggtggtcattcattttactgatggggtggatggagatctggctgat
ttacaaagagcatcggagcaactccgacaagaaggcgtccgagctctgatcttcgtgggc
cttgaacgtgtctccaacttggagcagctgatgcagctggagtttgggcgaggattcatg
tacagccggcccctgagactgaacttgctggacctagactatgaactagcagagcaactt
gacaacattgccgagaaagcctgctgtggggttccctgcaagtgctctggacaaagagga
gaccgagggcccattggcagcattgggccaaagggtattcctggagaagatggctatcga
ggttatcctggtgacgagggtggacctggtgagcgcggcccgcctggtgtcaacggcact
caaggtttccagggctgccccgggcagagaggcataaagggctctcgtggattcccagga
gagaagggtgaattaggagaaattggactggatggtcttgacggtgaagacggagacaaa
ggattgcctggtatttctggagagaaagggaatccaggaagaaggggtgacaaaggaccc
aaaggagacaaaggagaaagaggggatgttgggatcagaggtgacccgggtaactcagga
caggacagccagcagagaggacccaaaggagaaaccggagacatcggccccatgggtctc
cccgggacagatggggtgtccggcgggcccggagaacccgggaaaagcggcggcgttggc
cgaaggggaccgtcaggagctaagggcaacaagggcggtcccggccagccgggctctgtg
ggagagcaggggaccagaggtgcacagggtcctcctggccccaccggtcctccaggcctg
atcggtgaacaaggcattcttggacctcggggaagcggggggaccgcaggtgctcctgga
gaacgtggcagaaccgggcccctgggaagaaagggcgagcccggagatccaggacccaag
ggaagcgtcgggaaccggggccctcgtggggagacgggagatgacggacgagatggagtt
ggcagtgaaggacgcagaggcaaaaaaggagagagaggattccccggatacccaggacca
aagggctatcctggtgagccggggacagacggagcactgggacccaaaggcatcagaggc
cgaaggggaaattcaggacctccaggggtagctggacagaagggagaccctggttaccca
ggaccatctggttacaaaggcagcagaggtgactcgatggaccaatgtgccctggtccag
agcatcaaagataaatgtccttgctgctacgggcccctggaatgccctgtcttcccgacg
gagctggccttcgccttggacacctccgagggggtcacccaggacactttcagccggatg
agggatgtggtcctgaagatcgtggacgacctgaccatcgccgagagcaactgcccgcgg
ggggcccgcgtggccgtggtcacctacaacaatgaggtcaccacggagatccgctttgct
gactccaagaagaagtccgttctccaggacaagatcaagaaccttcaggtggctctgacg
tccaagcaacagagtctggaaactgccatgtccttcgtggccagaaacacgtttaagcgg
gtgcggaatggattcctgatgaggaaggtgtctgttttcttcagcaataaacccacgacg
gagtctccgcagctcagggaggccgtgctcaagctctcagatgctgggatcacaccactg
ttcctcacgagccaggaggaccggcagctggtcaacgccctgcagatcaataacacagcg
gtgggtcatgcccttgtcctaccagccagcggagacctcacggacttcttgaagaaagtc
ctgacttgtcacgtttgcttggacatctgcaacatcgacccatcctgtggatttggcagc
tggaggccttccttcagggaccggagagcagcgggcagcgacgtggatatcgacatcgct
ttcatcttagatagctccgagtccaccactctgttccagttcaacgagatgaggaagtac
atagagtacctggtccaacagctggacatgagcccggaccccaaggcatcccagcacttc
gccagagtagcggtcgtgcagcatgcaccctatgagtccatgggcaacgccagcgtgccg
cccgtgaaggtggaattctccctgactgactacggctccaaggagaagctcctggccttc
ctcggcagcaggatgacacagctccaggggaccagggctttgggcagtgccatcgactat
accattgagaacatctttgaaagtgcacccaacccacgggacctgaagctcgtggtcctg
atgctgacgggcgaggtgaagaaccagcagctggaggaggcccagagagccatcctgcag
gccaaatgcaagggttacttcttcgtgatcctgggcattggtaggaaggtgaacgtcaag
gagctgtacagcttcgccagtgaaccgaacgacatcttcttcaaactcatggacaagtct
actgagctcaacgaagagcctctgatgcgctttgggaggctgttgccatctttcatcagc
agtaagaatgctttttacctgtccccagatatcaggaaacagtgtgattggttccaaggg
gaccaaccggcaaagaatcttgtgcagtttggctacaaacaaataaatgttccaaataat
gttacttcaagtcctacatctaaaccagtgaccacagcaaagccagtgaccaccaccacc
aaaccagtaacagtggtaaacctgccgacctcaaagccagcttccatgagacctgtggcc
gagagacccgtggctgggagacccatggccaccaagccggaggctgtcaagtccacggcc
accaagcctgaggctgtcaagtccacggccaccaagccggaggctgtcaagtccacggcc
accaagcccgaggccgccaagcccttggccagccctgtggccaccaagcccgaggccacc
aagcccgaggtttcgaagacggccacagttagatccgcagtggccacgaggcccgcagca
gcaaagccggcacctgccagaccccctgctgctgccaaacccatggccgcaaagcccgag
gcccccaggccccaggcagccaaactggccgccaccaggccagccactgctaagcccatg
gcactctatcttgcagttaaggcaccccgagaggtccacgcgtctgaaatcacagagaac
agcgccaaactccactgggagaggcccgagccccccagcccttatttatacaacctcacc
atcacctcggcccacgatcagtccccggttctgaagcagaacctcacggtcacggaccgc
gtcatcgggggcctgctccctgggcagacgtaccatgtcacggtgatctgttacctaagg
tctcaggtcagagccatctaccaaggcagtttcagtacaaagaaaattcagcctccacct
ctgcaaacagcgaggtcagcttctagttcaaccatcaatctagtggtgagcgcagaacgg
ctggctgggagtaaaacagatatttgtaagttgccaaaagaagaaggaacttgcagggac
tttatattaaaatggtactatgattccgttaccgaaagctgcgccagattctggtatgga
ggttgtggtggaaatgaaaacagatttgattcgcaggatgaatgtgaaaaggtttgccct
cctgttctcatcaagcccggagtcattgctgcgattgggacctaa

KEGG   Physeter catodon (sperm whale): 102986674
Entry
102986674         CDS       T06011                                 

Gene name
TNN
Definition
(RefSeq) tenascin-N isoform X1
  KO
K06252  tenascin
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05165  Human papillomavirus infection
pcad05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102986674 (TNN)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102986674 (TNN)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102986674 (TNN)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    102986674 (TNN)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102986674 (TNN)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102986674 (TNN)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102986674 (TNN)
SSDB
Motif
Pfam: fn3 Fibrinogen_C DUF4998 Pur_ac_phosph_N EGF_2 Interfer-bind EGF_Tenascin Ax_dynein_light Packaging_FI
Other DBs
NCBI-GeneID: 102986674
NCBI-ProteinID: XP_023989474
UniProt: A0A2Y9TKE1
LinkDB
Position
4
AA seq 1296 aa
MGLQGMFCFPLGLLFGSVLLVASAPATLESSGCSEKEPQVTVSHTYKIDVPKSALVQVEA
DPQPLRDAGASLLAPGEDEEQSIIFRHNIRLQTPQKDCELAGSIQDLLARVKKLEEEMAE
VKERCSAQRCCPGAAGLSHHCSGHGTFSLDTCSCHCEQGWEGAGCERPSCPGACSGHGRC
VDGRCLCEQPYVGADCAYPACAENCSGHGVCVRGVCQCHEDFMSEDCSERRCPGDCSGHG
FCDTGECYCEEGFTGLDCTQVVAPQDLQLLKSTEDSLLVSWEPSSQVDHYLLSYYPLGKE
LSIKQIQVPKEQHSYEILGLLPGTKYIVTLRNVKKEISSSPQHLLATTDLAVLGTAWVTD
ETENSLDVEWENPPTKVDYYKLQYGPLTGQEVAEVTVPKSSDPKSRYDITGLQPGTEYKI
TVVPMRGDLEGKPILLNGRTEIDSPANVVTDRVTEDTAMVSWNPVRAAIDKYVVRYISTD
GETKDTAVHREQSSTVLMGLKPGEVYKVYVWAERGNQESKKADTKALTDIDSPQNLVTDR
VTENTATISWDPVQAVIDRYLVCYTSADGDSGEVPVGKEQSSTVLTGLRPGMEYTVHVWA
QKGDRESRKADTTAPTEIDSPQNLVTDQVTENTATVSWDPVQAVIDRYLVCYTSADGDSG
EVPVGKEQSSTVLTGLRPGVEYTVHVWAQKGVRESRKADTTALTDIDSPQNLVTDQVTEN
TATISWNPVQAVIDRYLVRYTSADGDSGEVPVGKEQSGTVLTGLRPGVEYTVHVWAQKGD
RESRKADTMAPTDIDSPRNLVTDQVTENTATVSWDPVQAVIDRYLVRYTSADGDSGEVPV
GKEQSGTVLTGLRPGVEYTVHVWAQKGDRESRKADTTAPTDIDSPRNLVTDQVTENTATV
SWDPVQAVIDRYLVRYTSADGDSGEVPVGKEQSGTVLTGLRPGVEYTVHVWAQKGDRESR
KADTTAPTDIDPPQNLHPSAVTQSGGVLTWTPPSAQIDGYILTYQFPDGTVKEVQLGRGD
QRFELQGLEQGVTLPVSLVAFKGDRRSRSVSTTLSTVGARFPHPSDCSQVQQNSNVASGL
YTVYLHGDASRPLQVYCDMDTDGGGWIVFQRRNTGQLDFFKRWRTYVEGFGDPTREFWLG
LDKLHNLTTGTPTRYEVRVDLQTANESAYAIYDSFRVASSKERYRLTVGKYRGTAGDALT
YHNGWKFTTFDRDNDIALSNCALTHHGGWWYKNCHLANPNGRYGETKHSEGVNWEPWKGH
EFSIPFVELKIRPHGYSGEHVLGRKQRTLGENSRTF
NT seq 3891 nt   +upstreamnt  +downstreamnt
atgggtctccaggggatgttctgcttccccctggggctcctgtttggctctgtgctcttg
gtggcctcagccccggccactctcgagtcttctggctgcagcgagaaggagccacaggtc
actgtcagccatacctacaagattgatgtgcccaagtctgccctggtccaggtggaggct
gaccctcagcccctccgtgatgctggagcctcactcctggccccgggggaggatgaggaa
cagagcatcatcttcaggcacaacatccgcctgcagacaccccagaaggactgcgagtta
gcaggcagcatccaggacctcctggctcgggtgaagaagttggaggaagagatggcggag
gtgaaggaacggtgtagtgcccagcgctgctgcccgggagctgctggtctgagccaccac
tgcagcggccacgggaccttctccctggacacctgcagctgccactgcgagcagggctgg
gagggcgccggctgcgagcggccctcctgccccggcgcgtgcagcggccacgggcgctgc
gtggacggccgctgcctgtgcgagcagccctacgtgggggccgactgcgcctaccccgcc
tgcgcggagaactgcagcgggcacggcgtgtgcgtgcgcggcgtctgccagtgccacgag
gacttcatgtctgaggactgcagcgagcggcgctgccccggcgactgcagcggccatggc
ttctgtgacacgggcgagtgctactgcgaggagggcttcaccggcctcgactgcacccag
gtggtcgccccccaggatctgcagctgctcaaaagcacagaggattccctgctggtgagc
tgggagccctccagccaggtggaccactacctcctcagctactaccccctggggaaggag
ctctctatcaagcagatccaagtgcccaaggagcagcacagctatgagatccttggcttg
ctgcctggaaccaagtacatagtcaccctgcgcaacgtgaagaaagagatttccagcagc
ccacagcatctacttgccaccacagatcttgctgtgcttggcactgcctgggtgacggat
gagactgagaactcccttgatgtggagtgggagaaccccccgaccaaggtggactactac
aagctgcagtatggccccctgacagggcaagaggtggccgaggtcaccgtgcccaagagc
agtgaccccaaaagccgatatgacatcacgggtctgcagccggggacagaatataagatc
acagttgtccccatgaggggagacctcgagggcaagccgattctcctgaatggcaggaca
gaaattgatagtccagccaatgtggtcactgatcgagtgacagaagacacagcaatggtc
tcctggaacccagtccgggctgccattgataagtatgtggtgcgctacatctccactgac
ggggagacgaaggacacagctgtacacagggagcagagcagcaccgtcctgatgggcctg
aagccgggagaggtgtacaaagtctacgtgtgggctgagaggggcaaccaggagagcaag
aaggctgacaccaaggccctcacagacattgacagcccacaaaacctggtgaccgaccgg
gtgacagagaacacggccacaatctcctgggaccccgtgcaggccgtcatcgacaggtac
ctggtgtgctacacctctgctgacggagactccggggaggttccggtggggaaggagcag
agcagcaccgttctgacgggcctgaggccgggcatggagtacacggtccacgtgtgggcc
cagaagggggaccgggagagcaggaaggccgacaccacagccccgacagaaatcgacagc
ccccaaaacctggtgactgaccaggtgacagagaacacggccactgtctcctgggacccg
gtgcaggctgtgattgacaggtacctggtgtgctacacctctgctgacggagactccggg
gaggttccggtggggaaggagcagagcagcaccgtcctgacgggcctgaggccgggcgtg
gagtacacggtgcacgtgtgggcccagaagggagtccgggagagcaggaaggccgacacc
acggccctgacagacattgacagcccccaaaacctggtgaccgaccaggtgacagagaac
acagccacgatctcctggaacccggtgcaggccgtgattgacaggtacctggtgcgctac
acctctgctgacggagactccggggaggttccggtggggaaggagcagagcggcaccgtc
ctgacgggcctgaggccgggcgtggagtacacggtgcacgtgtgggcccagaagggggac
cgggagagcaggaaggccgacaccatggccccgacagacattgacagcccccgaaacctg
gtgaccgaccaggtgacagagaacacggccactgtctcctgggacccggttcaggctgtg
attgacaggtacctggtgcgctacacctctgctgacggagactccggggaggttccggtg
gggaaggagcagagcggcaccgtcctgacgggcctgaggccgggcgtggagtacacggtg
cacgtgtgggcccagaagggggaccgggagagcaggaaggccgacaccacagccccgaca
gacattgacagcccccgaaacctggtgaccgaccaggtgacagagaacacggccactgtc
tcctgggacccggtgcaggccgtgattgacaggtacctggtgcgctacacctctgctgac
ggagactccggggaggttccggtggggaaggagcagagcggcaccgtcctgacgggcctg
aggccgggcgtggagtacacggtgcacgtgtgggcccagaagggggaccgggagagcagg
aaggccgacaccacggccccgacagacattgaccctccccaaaaccttcatccgtctgct
gtaacacagtctggtggggtattgacctggacacccccttctgctcagatcgatggctat
attctgacctaccagttcccagatggcactgtcaaggaggtgcagcttggaagaggggac
cagaggtttgagttgcaaggccttgagcagggtgtcaccctccctgtctccttggttgcc
tttaagggtgatcgccggagcaggagtgtatccaccaccctttccacagttggtgcccgt
tttccacacccttcggattgcagtcaagttcagcagaatagcaacgtcgccagtggtctc
tataccgtctacctgcacggggatgccagtcggcccctgcaggtgtactgtgacatggac
acggatggaggcggctggatcgtcttccagaggcggaacactgggcagctagatttcttc
aagcgctggcggacctacgtggaaggcttcggggaccccacgagggagttctggcttgga
cttgacaagctacacaacctcaccacaggcacgcccacccgctatgaggtgagagtggac
ttacagactgccaacgaatctgcctatgccatatatgactccttccgagtagcctccagc
aaggagcggtacaggctgacggttgggaaatacagaggcacagcaggggacgctcttact
taccacaatggatggaagtttacaacctttgacagagacaatgatatcgccctcagcaac
tgtgccctgacgcatcatggcggctggtggtacaagaactgccacctggccaaccctaat
ggcagatacggggagacaaagcatagtgagggggtgaactgggagccatggaaaggacat
gaattctccattccttttgtggagttgaagatccgccctcatggctacagcggggagcat
gtgctgggcagaaagcagcgaacactgggagaaaattcaagaacattctga

KEGG   Physeter catodon (sperm whale): 102986739
Entry
102986739         CDS       T06011                                 

Gene name
THBS3
Definition
(RefSeq) thrombospondin-3 isoform X1
  KO
K04659  thrombospondin 2/3/4/5
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04145  Phagosome
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05144  Malaria
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102986739 (THBS3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102986739 (THBS3)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    102986739 (THBS3)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102986739 (THBS3)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102986739 (THBS3)
  09174 Infectious disease: parasitic
   05144 Malaria
    102986739 (THBS3)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pcad04131]
    102986739 (THBS3)
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102986739 (THBS3)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102986739 (THBS3)
   04990 Domain-containing proteins not elsewhere classified [BR:pcad04990]
    102986739 (THBS3)
Membrane trafficking [BR:pcad04131]
 Endocytosis
  Phagocytosis
   Opsonins
    102986739 (THBS3)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102986739 (THBS3)
  Exosomal proteins of colorectal cancer cells
   102986739 (THBS3)
  Exosomal proteins of bladder cancer cells
   102986739 (THBS3)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102986739 (THBS3)
Domain-containing proteins not elsewhere classified [BR:pcad04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   102986739 (THBS3)
SSDB
Motif
Pfam: TSP_C TSP_3 COMP EGF_CA cEGF EGF_3 EGF_MSP1_1
Other DBs
NCBI-GeneID: 102986739
NCBI-ProteinID: XP_007108943
UniProt: A0A2Y9EVT5
LinkDB
Position
Unknown
AA seq 956 aa
METQELRGALALLLLCTFTSASQDLQVIDLLTVGESRQMVAVVEKIRTALLTAGDIYLLS
TFRLPPKQGGVLFGLYSRQDNTRWLEASVVGKINKVLVRYQREDGRVHAVNLQQAGLADG
RTHTALLRLRGPSRPSPALQLYVDCKLGDQHAGLPALAPIPPAEVSGLEIRTGQKAYLRM
QGFVESMKMILGGSMARVGALSECPFQGDESIHSAVTNALHSILGEQTKALVTQLTLFNQ
ILVELRDDIRDQVKEMSLIRNTIMECQVCGFHEQRSHCSPNPCFRGVDCMEVYEYPGYRC
GPCPPGLQGNGTHCTDINECAHADPCFPGSSCVNTMPGFHCEACPRGYKGTRVSGVGIDY
ARASKQVCNDIDECNDGNNGGCDPNSICTNTVGSFKCGPCRLGFLGNQSQGCFPARTCHS
PTHSPCHVHAHCLFERNGAVSCSCNVGWAGNGNVCGPDTDIDGYPDQALPCMDNNKHCKQ
DNCLLTPNSGQEDADNDGVGDQCDDDADGDGIKNVEDNCRLFPNKDQQNSDTDSFGDACD
NCPNVPNNDQKDTDGNGEGDACDNDVDGDGIPNGLDNCPKVPNPLQTDRDEDGVGDACDS
CPEMSNPTQTDADSDLVGDVCDTNEDSDGDGHQDTKDNCPQLPNSSQLDSDNDGLGDECD
GDDDNDGVPDYVPPGPDNCRLIPNPNQKDSDGNGVGDVCEDDFDNDAVVDPLDVCPESAE
VTLTDFRAYQTVVLDPEGDAQIDPNWVVLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGT
FHVNTVTDDDYAGFLFSYQDSGRFYVVMWKQTEQTYWQATPFRAVAQPGLQLKAVTSVSG
PGEHLRNALWHTGHTPDQVRLLWTDPRNVGWRDKTSYRWQLLHRPQVGYIRVKLYEGSQL
VADSGVIIDTSMRGGRLGVFCFSQENIIWSNLQYRCNDTVPEDFEPFRRQLLQERV
NT seq 2871 nt   +upstreamnt  +downstreamnt
atggagacgcaggaacttcgaggggccctggctcttctcctcctttgcactttcacatct
gccagtcaggacctgcaggtgattgacctgctgactgtgggcgagtcccggcagatggta
gctgtggtggagaagatacggacagccctgctcactgctggggacatctatctcttgtcc
accttccgcctgccccccaagcagggtggtgtcctcttcggcctctactctcgccaagac
aacacacgatggctggaggcctctgttgtgggcaagatcaacaaagtgctggtgcggtac
cagcgggaggatggcagagtccatgcagtgaacctacagcaagcgggcctggctgacggg
cgcacacacacggctctcctgcgactccgaggtccgtccagacccagccctgccctacag
ctctatgtggactgcaaactgggcgaccagcatgctggccttccggccttggcccccatt
cctccagcggaagtcagtgggctggagattaggacgggacagaaggcttatttgaggatg
cagggcttcgtggaatctatgaaaatgattctgggcgggtccatggcccgggtcggagcc
ctgagtgagtgtccgttccagggggatgagtctatccacagtgcagtgaccaacgcactc
cactccatcctaggggagcagaccaaggcgctggtcacccaactcaccctcttcaaccag
atcctggtagagctgcgggatgacatccgagaccaggtgaaagaaatgtccctgatccga
aacaccatcatggagtgtcaggtgtgcggcttccacgaacagcgttcccactgcagcccc
aacccctgcttccgaggtgtggactgcatggaagtgtacgagtaccccggctaccgctgt
gggccctgcccccctggcctacagggcaacggcacccactgcacagacatcaatgagtgt
gctcacgcggacccttgtttcccgggctccagctgcgtcaacaccatgcctggcttccac
tgtgaggcctgtccccgaggctacaaaggcacacgggtgtctggtgtgggcatcgactat
gcccgggctagcaaacaggtctgcaacgatattgatgaatgcaatgatggtaacaatggt
ggctgcgacccaaactccatctgcaccaatactgtgggttctttcaagtgtggtccctgt
cgcttgggcttcttgggcaaccagagccagggctgcttcccagcccggacctgccacagc
ccgacccacagcccctgccatgtccacgcacactgtctctttgaacgcaatggtgcagtg
tcctgctcgtgtaacgtgggctgggccgggaacgggaatgtgtgtgggcccgacacggac
atcgatggctacccggaccaggcgctgccctgcatggacaacaacaaacactgcaagcag
gacaactgccttctgacacccaactctgggcaggaagatgctgataacgacggtgtgggg
gaccagtgtgatgatgatgctgacggggacgggatcaagaatgttgaggacaactgccgg
ctgtttcccaacaaggaccagcaaaactcagatacagattcatttggtgatgcctgtgac
aactgccccaacgttcccaacaatgaccagaaagacacagatggcaacggggaaggggat
gcctgtgacaacgacgtggatggggatggcatccctaatggattggacaattgccctaaa
gttcccaaccccctgcaaacagacagggatgaggacggggtgggagatgcttgtgacagc
tgccctgaaatgagcaatcccacccagacagatgcagacagcgacctggtgggggatgtc
tgtgacaccaatgaagacagtgatggcgatggacatcaggacactaaggacaattgccca
cagctgcccaacagctcccagctggactcagacaatgacgggcttggagatgagtgtgac
ggggacgatgacaatgacggtgtcccagattacgtgcctcctggtcctgacaactgtcgc
ctgatacctaatcccaatcagaaggactcagatggcaatggcgttggtgatgtgtgtgag
gatgattttgacaatgatgcagtggttgaccccctggatgtgtgccctgaaagtgcagag
gtaaccctcacggactttcgggcctatcagactgtcgtcctggatcctgagggtgatgct
cagattgacccaaactgggtcgtgctcaaccagggcatggaaatcgttcagaccatgaac
agtgaccccggcctggcagttggatatacagccttcaatggtgtggactttgaaggcacc
ttccatgtgaacacagtgactgatgatgactacgcaggctttctcttcagctatcaggac
agcggccgcttctacgtggtcatgtggaagcaaacggaacagacctactggcaggccaca
cctttccgggctgttgcccagcccgggctacagctcaaggcagtgacatcagtgtctggc
ccaggcgagcacctcaggaatgccctgtggcatacaggtcacacccctgatcaggtacgg
ctgctgtggactgacccacgaaatgtgggctggcgtgacaagacttcctaccgctggcag
ctgctgcaccggcctcaagttggctacattcgggtgaagctttatgagggttcccagcta
gtggccgattctggggtgatcattgacacatccatgcgaggggggcgtcttggtgtattc
tgcttctcccaagaaaacatcatttggtccaatctccagtatcgatgcaatgacacagtg
cccgaggactttgagccattccggaggcagctgctccaggaaagagtgtga

KEGG   Physeter catodon (sperm whale): 102987169
Entry
102987169         CDS       T06011                                 

Gene name
COL6A2
Definition
(RefSeq) collagen alpha-2(VI) chain isoform X1
  KO
K06238  collagen type VI alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102987169 (COL6A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102987169 (COL6A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102987169 (COL6A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102987169 (COL6A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102987169 (COL6A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102987169 (COL6A2)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102987169 (COL6A2)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   102987169 (COL6A2)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102987169 (COL6A2)
 Hyaluronan
  Extracellular matrix or blood plasma proteins
   102987169 (COL6A2)
SSDB
Motif
Pfam: VWA Collagen VWA_2 VWA_3
Other DBs
NCBI-GeneID: 102987169
NCBI-ProteinID: XP_007121465
UniProt: A0A2Y9FH40
LinkDB
Position
8
AA seq 1027 aa
MSGRTGAAKMLPGPCSALLLWALLGAVHAQQQEIISPGTSERNSCPEKADCPVHVYFVLD
TSESITMQSPTDSLLYHMQQFVLQFISQLQDELYLDQVALSWRYGGLHFSDLVEVFSPPG
SDRASFTKSLQGISSFRRGTFTDCMLANMTQEVRRHVGKGVVNFAVIITDGHVTGSPCGG
IKLQAERAREEGIRLFAVPPNLKLNEQGLRDIANTPHELYRNNYATMRPDSTEIDQDTIN
RIIKVMKHEAYGECYKVSCLEIPGPPGPKGYRGQKGAKGNMGEPGEPGQKGRQGDPGIEG
PIGFPGPKGVPGFKGEKGEFGAEGRKGAPGLAGKNGTDGQKGKLGRIGPPGCKGDSGGRG
PDGYVGEAGSPGEPGDRGSKGDPGRPGRRGPPGENGAKGSKGYQGNTGAPGSPGLKGAKG
GPGPRGPKGEPGRRGDPGSKGSPGSDGPKGEKGDPGPEGPRGLAGEVGNKGAKGDRGLPG
PRGPQGTVGEPGKQGSRGDPGDAGPRGDSGQPGPKGDPGRPGFSYPGPRGAPGEKGEPGP
RGPEGSRGDFGSKGEPGRKGQKGESADPGPPGEPGPRGQRGAPGPEGEPGPPGDPGLTEC
DVMTYVRETCGCCDCEKRCGALDVVFVIDSSESIGYTNFTLEKNFVINVVNRLGAIAKDP
KSETGTRVGVVQYSHDGTFEAIQLDDERIDSLASFKEAVKSLEWIAGGTWTPSALKFAYN
QLIKESRRQKTRVFAVVITDGRHDPRDDDLNLRALCNHDVTVTAIGIGDMFHEKHESENL
YSIACDKPHQVRNMTLFSDLVAEKFIDDMEDVLCPDPQIVCPDLPCQTELYVAQCTQRPV
DIVFLLDGSERLGEQNFDKARRFVEEVSRRLTLARRDDDPLNARVALLQFGGPLEQQVAF
PLTSNLTVIQEALASARYLNSFSHVGAGIVHAINQVVRGARPGARRHAELDFVFLTDGVT
GNDSLEEAVHSMRKQNVVPTVVAVGGDVDADVLSKISLGDPAAVFREKDYDSLAQPGFFD
RFIRWIC
NT seq 3084 nt   +upstreamnt  +downstreamnt
atgtcggggcgcacgggcgctgccaagatgctcccaggcccctgctccgccctcctgctc
tgggccctcctgggggccgtccacgctcagcagcaggagatcatcagccccggcacctcc
gagaggaacagctgcccagagaaggccgactgcccggtccacgtgtacttcgtgctggac
acgtcggagagcatcaccatgcagtcccccaccgacagcctgctctaccacatgcagcag
ttcgtgctgcagttcatcagccagctgcaggacgagctctacctggaccaggtggccctg
agctggcgctacggcggcctgcacttctccgacctggtggaggtgttcagcccgcccggc
agcgaccgcgcctccttcaccaagagcctgcagggcatcagctccttccgccgcggcacc
ttcactgactgcatgctggccaacatgacccaggaggtccggcggcacgtgggcaagggg
gtggtcaacttcgcggtgatcatcaccgatggccacgtcacgggcagcccgtgcgggggc
atcaagctgcaggcggagcgggcccgcgaggagggcatccggctcttcgcggtgccgccc
aacctgaagctgaacgagcagggcctgcgggacatcgccaacacgccgcacgagctctac
cggaacaactacgccaccatgcggcccgactccaccgagatcgaccaggacaccatcaac
cgcatcatcaaggtcatgaaacatgaagcctacggagagtgctacaaggtgagctgtctg
gagatccccgggccccccggccccaagggctaccgcggacagaagggcgccaagggcaac
atgggtgagcccggagagcctggacagaaggggcgacagggagatccaggcatcgaaggc
cccattggattcccaggacccaagggtgttcctggtttcaaaggagagaagggcgagttt
ggagccgaagggcggaagggcgcccccggcctggcgggcaagaacgggacggatggacag
aagggcaagttggggcgcatcggacctcctggctgcaagggagactccgggggtcggggc
ccggatggttacgtgggggaagccggcagccctggggagccaggagatcgaggctccaag
ggggaccctggccgccccggacgcagagggcccccgggggaaaacggggcaaaaggaagc
aagggctaccaaggcaacaccggagccccaggaagtcccggcctgaaaggagccaagggt
gggcctgggccccgaggccccaaaggcgagcccgggcgcaggggagaccctggaagcaag
ggcagcccaggcagcgatggccccaagggcgagaagggggaccctggccctgaggggccc
cggggcctggctggagaggttggcaacaaaggagccaagggagaccgaggtttacctgga
cccagaggtccccaggggactgttggagagcccgggaagcagggatctcggggagaccct
ggggacgctggcccccgcggagactcgggacagcctggccccaagggagaccccggcagg
cctggcttcagctacccgggaccccgaggagcgcctggagagaaaggcgagcctggcccc
cgcggccccgaggggagcagaggagactttgggtccaaaggagagcccgggaggaaaggc
cagaagggcgagtctgcagatcctggtccccctggcgagccaggcccgcggggccagaga
ggagcgccaggacccgagggagagcctggccccccaggagaccccggcctcacggagtgc
gacgtcatgacctacgtgagggagacgtgcgggtgctgtgactgtgagaagcgctgtggc
gctctggacgtggtgttcgtcatcgacagctccgagagcatcggctacaccaacttcacc
ctggagaagaacttcgtcatcaacgtggtcaacaggctgggggccatcgccaaggacccc
aagtcggagacggggacccgcgtgggcgtcgtgcagtacagccacgacggcaccttcgag
gccatccagctggacgacgagcgcatcgactcactggccagcttcaaggaggccgtcaag
agccttgagtggatcgcgggcggcacctggacgccctcggccctcaagttcgcctacaac
cagctcatcaaggagagccggcgccagaagacccgcgtgttcgcggtggtcatcacggac
ggccgccacgacccccgcgacgacgacctcaatctgcgggcgctgtgcaaccacgacgtg
acggtgacggccatcggcatcggggacatgttccacgagaagcacgagagcgagaacctg
tactccatcgcctgcgacaagccgcaccaggtgcgcaacatgacgctcttctccgacctg
gtggccgagaagttcatcgacgacatggaggacgtgctgtgcccggacccccagatcgtg
tgcccggaccttccctgccaaacagagctgtacgtggcccagtgcacgcagcggcccgtg
gacatcgtcttcctcctggacggctccgagcggctgggcgagcagaacttcgacaaggcg
cggcgcttcgtggaggaggtgtcccggcggctgaccctggcgcgcagggacgacgacccg
ctcaacgcgcgcgtggcgctgctgcagttcggaggcccgctcgagcagcaggtggccttc
ccgctgacctccaacctgaccgtcatccaggaggcgctggctagcgcgcgctacctcaac
tccttctcgcacgtgggcgcgggcatcgtgcacgccatcaaccaggtggtgcgcggcgcg
cggcccggggcgcgccgccacgccgagctggacttcgtgttcctcacggacggcgtcacg
ggcaacgacagcctggaggaggcggtgcactccatgcgcaagcagaacgtggtgcccacc
gtggtggccgtgggcggcgacgtggacgcggacgtgctctccaagatcagcctgggcgac
ccggccgccgtcttccgcgagaaggactacgacagcctggcccagcctggcttcttcgac
aggttcatccgctggatctgctag

KEGG   Physeter catodon (sperm whale): 102987616
Entry
102987616         CDS       T06011                                 

Gene name
TNR
Definition
(RefSeq) tenascin-R
  KO
K06252  tenascin
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05165  Human papillomavirus infection
pcad05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102987616 (TNR)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102987616 (TNR)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102987616 (TNR)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    102987616 (TNR)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102987616 (TNR)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102987616 (TNR)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102987616 (TNR)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_Tenascin EGF_2 Pur_ac_phosph_N DUF4959 Lzipper-MIP1
Other DBs
NCBI-GeneID: 102987616
NCBI-ProteinID: XP_007117870
UniProt: A0A2Y9FBD7
LinkDB
Position
4
AA seq 1358 aa
MGADGETVVLKNMLIGVNLILLGSMLKPSECQLEVTTERVQRQAVEEEGGADSYNTSGKE
QPVVFNHVYNINVPLDSLCSSGLEASAEQEVSAEDEVLTEYTGQTSDHESQVTFTHKINL
PKKACPCASSAQVLQELLSRIEMLEREVSVLRDQCTSNCCQESAATGQLDYIPHCSGHGN
FSLESCGCICDQGWFGKNCSEPYCPLGCSSRGVCVDGQCICDNEYSGDDCSELRCPTDCS
SRGLCVDGECVCEEAYTGEGCSELRCPGDCSGKGRCTNGTCLCQEGYAGEDCGQQRCLNA
CSGRGHCQEGLCFCEEGYQGPDCSAVAPPEDLRVAGISDRSIELEWDGPMAVTEYVISYQ
PTALEGLQLQQRVPGDWSGVTISELEPGLTYNISVYAVISNILSLPITAKVATHLSTPQG
LRFKTITETTVEVQWEPFSFSFDGWEISFIPKNNEGGVIAQLPSDVTSFNQTGLKPGEEY
TVNVVALKEQARSPPTSASVSTVIDGPTHILVRDVSDTVAFVEWTPPRAKVDFILLKYGL
VGGEGGKTTFRLQPPLSQYSVQALRPGSQYEVWVSAVRGTNESESTTTQFTTEIDAPKNL
RLGSRTATSLDLEWDNSEAEVQEYKIVYSTLAGERYHELLVPKGVGPTTRATLTDLVPGT
EYGIGISAIMNSKQSVPATMNARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITF
TPSSGIASEVTVPKDRTSYTLTDLEPGAEYIISITAERGRQQSLESTVDAFTGFRPISHL
HFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVSLDATKRHAVLMGLQPATEY
IVNLVAVHGTVTSEPIMGSITTGIDPPKDITISNVTKDSVMVSWSPPVASFDYYRVSYRP
TQVGRLDSSVVPNTVTEFTITKLYPATEYEISLNSVRGREESERICTLVHTAMDNPVDLT
ATNITPTGALLQWKAPVSEVENYVIVLTHFAVAGETILVDGDSEEFQLVDLLPRTHYTVT
MYAASGPLTSGTISTNFSTLLDAPANLTASEVTRQSALISWQPPRAETENYVLTYKSTDG
SRKELIVDAEDTWIRLEGLSESTDYTVLLQAAQDTERSSITSTAFTTGGRVFPHPQDCAQ
HLMNGDTLSGVYTIFLNGELSRKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWAEYRVG
FGNLEDEFWLGLDNIHEITSQGRYELRVDMRDGQDAAFAYYDKFSVEGSRSLYKLRIGGY
NGTAGDSLSYHQGRPFSTEDRDNDIAVTNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQG
INWYHWKGHEFSIPFVEMKMRPYSHRLMAGRKRRSLQF
NT seq 4077 nt   +upstreamnt  +downstreamnt
atgggggcagacggggaaacagtggtcttgaagaacatgctcattggcgtcaacctgatt
cttctgggctccatgctcaagccctcagagtgtcagctggaggtcaccacggaaagggtc
cagagacaggcggtggaggaggaaggaggcgctgacagctacaatacatccggcaaagag
cagccggtggtcttcaaccacgtgtacaacattaacgtgcccctggacagcctctgttcc
tcggggctggaggcctcggccgagcaggaggtgagtgccgaagacgaggtgctgacagag
tacacgggccagacctcggaccacgagagccaggtcaccttcacccacaagatcaacctc
cccaaaaaggcctgcccatgcgccagctccgctcaggtgctgcaggagctgctgagccga
atcgagatgctggagagggaggtgtcagtgcttcgggaccagtgcaccagcaactgctgc
caggaaagtgcggccacaggacaactggattatatccctcactgcagtggccatggcaac
tttagcctcgagtcctgtggctgcatctgcgatcaaggctggtttggcaagaactgctcg
gagccctactgccccctaggctgctccagtaggggtgtgtgtgtggatggccagtgcatc
tgtgacaacgagtacagcggcgacgactgctcggagctccggtgcccgacagactgcagc
tcccgggggctgtgcgtggacggggagtgtgtctgtgaagaggcctacactggcgagggc
tgcagcgagctgaggtgccctggggactgttcggggaaggggagatgtaccaacggtacc
tgcctgtgccaggagggctacgctggtgaggactgcggccagcagcggtgtctgaacgcc
tgcagtgggcgaggacactgccaggaggggctctgcttctgtgaagagggctaccagggc
cctgactgctcagcagttgcccctccagaggacttgcgagtggctggtatcagcgacagg
tccattgagctggaatgggacgggccgatggcagtgacggaatatgtgatctcttaccag
ccaacggccctggagggcctccagctccagcagcgggtgcctggagattggagtggtgtc
accatctcggagctggagccaggtctcacctacaacatcagcgtctacgctgtcattagc
aacatcctcagccttcccatcacggccaaggtggccacccatctctccactcctcaaggg
ctacgattcaagacgatcacagagaccaccgtggaggtgcagtgggaacccttctcattc
tccttcgatgggtgggagatcagcttcattccaaagaacaatgaaggaggggtgattgcc
cagctccccagtgatgttacatccttcaaccagacaggattaaagcctggggaggaatat
actgtcaatgtggtggctctgaaagagcaagcccggagcccccctacctcggccagcgtc
tccactgtcattgatgggcccacgcacatcctggttcgagatgtctccgacactgtagcc
ttcgtggagtggaccccacctcgagccaaagtcgatttcattctcttgaagtacggcttg
gtgggcggggaaggagggaagaccaccttccgactgcagcctcccctgagccagtactcg
gtacaggccctgcggcccggctcccaatacgaggtgtgggtcagcgcagtccgtggaacc
aacgagagtgagtccaccaccacccagttcacaacagagattgacgcccccaagaacctg
cggcttggttcccgcacagcaaccagccttgaccttgagtgggacaacagtgaagcagag
gttcaggagtacaagatcgtgtacagcaccttggcgggcgagcggtaccatgagctgctg
gtccccaagggcgtcggtccaaccaccagagccacgctcacagatctagtgcctggcact
gagtatggaattggaatatctgccatcatgaactcaaagcaaagcgtaccagccaccatg
aatgccaggactgaacttgacagtcctcgagacctcatggtgacagcctcctcagaaacc
tccatctccctcatctggaccaaggccagtggccccattgaccactaccgaattacattt
actccatcctctgggatcgcctccgaagtcaccgtgcccaaggacaggacctcgtacaca
ctgacagatctagagcccggggcagagtacatcatttcaatcacagctgagaggggtcgg
cagcagagcttggagtccacggtggatgccttcacaggcttccgccccatctcccatttg
cacttttctcacgtgacctcctccagtgtgaacatcacctggagtgacccatcccctcca
gcagacagactcattctgaactacagcccccgggatgaagaggaagagatgatggaggtc
tctctggatgccaccaagaggcatgctgtcctgatgggcctgcagccagccaccgagtac
attgtgaacttggtggcagtacatggcacagtgacctctgagcccatcatgggctccatc
accacaggaattgatccccccaaagacatcacaatcagcaatgtgaccaaggactcagtt
atggtctcctggagccctcctgttgcgtcttttgattactaccgagtatcatatcggcca
acacaagtgggacgactggacagctcagtggtgcccaacacagtgacagaattcaccatc
accaagctgtacccagctaccgaatatgaaatcagcctcaacagcgtgcggggcagggag
gagagcgagcggatctgcactcttgtgcacacagccatggacaatcctgtggatctgact
gctaccaacatcactccaacaggagctctgctgcagtggaaggcaccagtgagtgaagtg
gagaactacgtcattgttctcacacactttgcagttgctggagagaccatcctggttgat
ggagacagtgaggaattccagcttgttgacctgcttcctaggacccactataccgtcact
atgtatgccgccagtgggcctctcaccagtggcacgatcagcaccaacttctctaccctc
ctggacgctcctgcaaatctgacagcaagtgaagtcaccagacaaagtgccctgatctcc
tggcagcctcccagagcagagactgagaattacgtcttgacctacaaatccactgatgga
agccgcaaggagctgattgtggatgcggaggacacgtggatccgactggagggcctgtct
gagagcacagactacacggtgctcctgcaggcggcccaggacacggagcggagcagcatc
acctccaccgccttcaccacagggggccgggtcttccctcatcctcaagactgtgcccag
catttgatgaacggagacactctgagcggggtttacaccatcttcctcaacggagagcta
agccggaagttacaggtgtactgcgatatgacaacggacgggggtggctggattgtattc
cagaggcggcagaatggccaaactgactttttccggaaatgggctgagtaccgcgttggc
tttgggaacctggaggatgagttttggctggggctggacaacatacacgagatcacgtcc
cagggccgctacgagctgcgcgtggacatgcgtgacggccaggacgccgccttcgcctac
tacgacaagttctctgtcgagggcagcagaagcctgtacaaactccgcataggaggctac
aacggcaccgcaggggactccctcagctatcatcagggccgccctttctccacagaggac
agagacaacgacattgcagttaccaactgtgccatgtcatacaagggggcttggtggtat
aagaactgccaccggaccaacctcaatgggaaatacggggagtccaggcacagccagggg
atcaactggtaccattggaaaggccacgagttctccatcccctttgtggaaatgaagatg
cgcccctacagccaccgtctcatggcggggaggaaacggcggtccttgcagttctaa

KEGG   Physeter catodon (sperm whale): 102987884
Entry
102987884         CDS       T06011                                 

Gene name
LAMB2
Definition
(RefSeq) laminin subunit beta-2 isoform X1
  KO
K06243  laminin, beta 2
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102987884 (LAMB2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102987884 (LAMB2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102987884 (LAMB2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102987884 (LAMB2)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102987884 (LAMB2)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102987884 (LAMB2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102987884 (LAMB2)
   05145 Toxoplasmosis
    102987884 (LAMB2)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N F5_F8_type_C
Other DBs
NCBI-GeneID: 102987884
NCBI-ProteinID: XP_007129943
UniProt: A0A2Y9FSU9
LinkDB
Position
Unknown
AA seq 1801 aa
MEWAAGEQGRDLRGQPGPWQLLLGLLLSELATALAQALAPDVPGCSRGSCYPATGDLLVG
RADRLTASSTCGLHGPQPYCIVSHLQDEKKCFLCDSRRPFSARDNPNSHRIQNVVTSFAP
QRRAAWWQSENGVPVVTIQLDLEAEFHFTHLIMTFKTFRPAAMLVERSADFGRTWHVYRY
FSYDCGADFPGVPLAPPRHWDDVVCESRYSEIEPSTEGEVIYRVLDPAIPIPDPYSPWIQ
NLLKITNLRVNLTRLHTLGDNLLDPRREIREKYYYALYELVVRGNCFCYGHASQCAPAPG
APAHAEGMVHGACICKHNTRGFNCEQCQDFYHDLPWHPAEDGHSHACRKCECHGHTYSCH
FDMAIYLASGNVSGGVCDGCQHNTAGRQCELCRPFFYRDPTKDLRDPAVCRSCDCDPMGS
QDGGRCDPHDDPALGLVSGQCRCKEHVVGSRCQQCRDGFFGLSASDPLGCQQCQCDTRGT
VPGGTPCDRNSGACFCKRLVTGRGCNRCLPGHWGLSHDLLGCRPCDCDVGGALDPQCNEA
TGQCRCRQNMVGRRCEQVQPGYFRPFLDHLIWEAEEARGQVPDVVERPVTPGGIPSWTGP
GFVRLREGQALEFLVASVPRAMDYDLLLRLEPQVPEQWAEMELTVQRPGPVSAHSPCGHV
LPKDDHIPGTLRPGTRYMVFPRPVCLEPGISYKLHLKLVRTGGGAQPEAPYSGPSLLIDS
LVLLPRVLVLEMFSGGDAASLERRATFEHYRCHEEGLVPSKTLPSEACAPLLISLSTLVY
NGALPCQCDPQGSLSSECNPHGGQCLCKPAVVGRRCDLCAPGYYGFGPTGCQACQCSPEG
ALSGLCEATNGQCPCRTGAFGLRCDRCQRGQWGFPNCRPCVCNGHADECDPHTGACLGCR
DHTGGEHCERCIAGFHGDPRLPYGGHCRPCPCPEGPGSRRHFATSCHRDGYSQQIVCHCR
AGYTGLRCEACAPGHFGDPSRPGGQCQPCECSGNIDPTDPDACDPHSGQCLRCLHHTEGP
HCAHCKPGFHGQAARQSCHRCTCNLLGTDPQQCPSTDRCNCDPSSGQCPCLPNVQGPSCD
RCAPNFWNLTSGHGCQPCACHPSRARGPTCNEFTGQCHCRAGFGGRTCSECQELHWGDPG
LQCRACDCDPRGIDTPQCHRSTGHCSCRLGMSGVRCDQCARGFSGVFPACHPCHACFGDW
DRVVQDLAARTRRLEQWAQELQQTGVLGAFESSFWHMQEKLGTVQGIVGARNASAASTAQ
LVEATEELRREIGEATEHLTQLEAELTDVQDENFNANHALSSLERDGLALNLTLRQLDQH
LDLLKHSNFLGAYDSIRHAHSLSAEAERRANMSALTVPSPVSNSAGTRHRTEVLMAARRK
DFNRKHMANQQALGELSAHTHALSLTGINELVCGSPGDAPCATSPCGGAGCRDEDGQPRC
GGLSCSGAAAMADLALGRARHTQAELQRALAEGGGILSQVAETRRQAGEAQQRAQAALDK
ANASRGQVEQANQELRELIQSVKDFLSQEGADPDSIEMVATRVLELSIPASPEQIRHLAG
EIAERVRSLADVDTILARTVGDVHRAEQLLKDAQRARSRAEGEKQKAETVQAALEEAQRA
QGAAQSAIQGAVLDTQDTEQTLHQVQERMAGTEKALSSAGERAQQLDGLLEALKLKRAGN
SLAASSAEETAGSAQGRAREAEQLLKGPLGDQYQTVRALAERKAQGVLAAQTRAEQLRDE
ARGLLQAAQDKLQRLQELEGTYEENERALEGKAAQLDGLEARMRSVLQAINLQVQIYNTC
Q
NT seq 5406 nt   +upstreamnt  +downstreamnt
atggagtgggccgcaggagaacaagggagggacctgcggggacagcctgggccctggcag
cttctattgggcctgctgctgagtgagctggccaccgccctggcccaggccctggccccg
gatgtgccaggctgttcgcggggaagctgctaccccgccacaggggacctgctggtgggc
cgtgctgacagactgaccgcctcatccacctgtggcttgcacgggccccagccctactgc
atcgtcagtcacctgcaggacgagaagaaatgcttcctgtgtgactcccggcgccccttc
tctgctagagacaacccaaacagccatcgcatccagaatgtagttaccagcttcgcacca
caacgccgggcagcctggtggcagtcagagaatggtgtccccgtggtcaccatccagctg
gacttggaggctgagttccatttcacgcacctcattatgaccttcaagacgtttcgccct
gctgccatgctggtggagcgctcagcagactttggacgcacctggcacgtgtacagatat
ttctcctatgactgtggggctgacttcccaggagtcccactggcccccccacggcactgg
gatgacgtagtctgtgagtcacgctattcagagattgagccatctactgaaggcgaggtc
atctatcgtgtgctggaccctgcaatccctattccagacccctatagcccatggatccag
aacctgctaaagatcaccaacctacgggtgaacctgacacggctacacacgctcggagac
aacctgcttgacccacggcgggagatccgcgagaagtactattatgccctgtacgagctg
gttgtgcgtggcaactgcttctgctatggacatgcctcacagtgtgcacccgccccaggg
gcaccagcccatgctgagggcatggttcatggggcctgcatctgcaaacacaacactcgt
ggcttcaactgtgagcagtgtcaggatttctatcatgatctgccctggcatccggctgag
gacggccacagtcacgcctgcaggaagtgtgagtgccatgggcatacctacagctgccac
ttcgacatggccatatacctggcatctggcaacgtgagtggaggtgtgtgtgatgggtgt
cagcacaacacagctgggcgccagtgtgagctctgccgacccttcttctaccgtgaccca
accaaggacctgcgagacccagccgtgtgccgctcctgtgactgtgaccccatgggttcc
caagacggtggtcgctgtgatccccatgatgatcctgcactggggctggtctcaggccag
tgtcgctgcaaagaacatgtggtgggctctcgctgccagcagtgccgtgatggcttcttt
gggctcagtgccagtgaccctctaggctgccagcaatgtcagtgtgatacacggggcaca
gtgccggggggcaccccttgtgaccgcaacagtggagcctgtttctgcaagcgtctagtg
actggacgtggctgcaaccgctgtctgcctggccactggggcctgagccacgacttactt
ggctgccgtccatgtgactgcgatgtcggtggtgccctggatccccagtgcaacgaggcc
acaggtcagtgccgctgccgccagaacatggtggggcgacgctgtgagcaggtgcagcct
ggttacttccgacccttccttgaccacctaatttgggaggctgaggaagcccgggggcag
gtacctgatgtggtggagcgcccggtgacccctggggggattccatcttggactggccca
ggctttgttaggctgcgggaaggccaggcactggagttcctggtggcctctgtaccaaga
gccatggactatgacctgctactgcgcttggagccccaggtacctgagcaatgggcagag
atggaactgactgtgcagcgcccagggcctgtatctgcccacagcccatgtgggcacgtg
ctgcccaaggatgaccacatcccagggactctgcgaccaggcaccaggtacatggtgttt
cccagacctgtctgccttgagcctggcatctcctacaagctgcatctcaagctggtgcga
acaggaggaggtgcccagcctgaggccccctattctggacccagcctactcattgactcg
ctggtgctgctgccccgtgtcctggtgctggagatgtttagtgggggtgatgcggcttct
ctggagcgccgtgccacctttgaacactaccgctgccatgaggagggtctggtgcccagc
aagacccttccctctgaggcctgcgcccccctcctcatcagcctgtccacactggtctac
aacggagccctgccctgtcagtgtgacccccagggctcactgagctctgagtgcaacccc
catggcggtcagtgcctgtgtaaacctgcagtggttgggcgccgctgtgacctctgtgcc
cctggctactatggctttggccccacaggctgtcaagcctgccagtgcagccctgagggg
gcactcagtggcctgtgtgaagcaaccaatgggcaatgcccctgccgaactggtgcattt
gggcttcgctgcgaccgctgccagcgtggccagtggggattccctaactgccggccgtgt
gtctgcaacgggcatgcagacgaatgtgacccccacacaggcgcttgcctgggctgccgt
gaccacacagggggtgagcactgtgaaaggtgcattgctggcttccacggggacccacgg
ctgccatatgggggccattgccggccgtgtccctgccctgaaggccctgggagccggcgg
cactttgctacttcttgccatcgggatgggtactcccagcagatagtgtgccactgcagg
gcaggctacacagggctgcggtgcgaagcttgtgcccctgggcactttggggacccatca
aggccaggcggccagtgccaaccatgtgagtgcagtggtaacattgaccccacggaccct
gatgcctgtgacccccactcggggcaatgcctgcgctgcttacaccacacagaggggccg
cactgtgcccactgtaagcctggcttccatgggcaggctgcccgacagagctgtcaccgc
tgcacctgcaacctgctgggcacagatccccagcagtgcccatccactgaccggtgcaac
tgtgacccaagcagtgggcagtgcccatgcctgcccaatgtccagggccctagctgtgac
cgctgtgcccccaacttctggaaccttaccagtggccatggctgccagccctgtgcctgc
cacccaagccgagccagaggccccacctgcaatgagttcacaggacagtgccactgccgt
gctggcttcggtgggcgaacctgttctgagtgccaggagctccactggggagaccctggg
ttgcagtgccgcgcctgtgattgtgaccctcgtgggatagacacacctcagtgtcaccgt
tccacaggccactgcagctgccgcttgggcatgtctggcgtgcgctgtgaccagtgtgcc
cgtggcttctcgggcgtctttcctgcctgccacccctgccacgcgtgcttcggggactgg
gaccgtgtggtacaggacctggctgctcgtacacggcgcctggagcagtgggcgcaggag
ctgcagcagacgggtgtgctgggtgcctttgagagcagcttctggcacatgcaggagaag
ctgggcactgtgcaggggattgtgggtgcccgtaatgcctcagctgcctccactgcacag
ctcgtggaggccacagaggagttgcggcgtgaaattggggaggccactgagcacctgacc
cagctggaagcagagctcacagatgtgcaggacgagaatttcaatgccaaccatgcacta
agcagtctagagcgagacgggcttgcacttaatctcacactgcggcagcttgaccagcat
ctggacctgctcaagcattcaaatttcctgggtgcctatgacagcatccgccatgcccac
agcctgtctgcagaagcagaacgtcgtgccaatatgtcagccctgacagtgcccagccct
gtgagcaactcagcaggcacccggcaccggacagaggtgctgatggctgcccgaaggaag
gacttcaaccgcaagcacatggccaaccagcaggcactgggcgagctctctgcccatacc
catgccctgagcctgacaggcataaatgaactggtgtgtgggtccccgggagatgcaccc
tgtgctacgagcccttgcgggggtgccggctgtcgggacgaggatgggcagccccgttgc
gggggcctcagctgcagtggggcagcagccatggcggatctggcgctgggtcgggcccgc
cacacacaggcagaactgcagcgggcactggcagaaggtggtggcatcctcagccaggta
gctgagacccgtcggcaggcaggcgaggcacagcagcgggcccaggcagccctggacaag
gctaatgcttccaggggacaggtggaacaggccaaccaggaactgcgggaacttatccag
agtgtgaaggacttcctcagccaggagggggctgatcctgatagcattgagatggtggcc
acacgggtgctagagctctccatcccagcgtcacctgagcagatccggcacttggcgggc
gagattgcagagcgggtccggagcctggcagatgtggacacgatcctggcgcgtactgtg
ggagacgtgcatcgggcagagcagctactgaaggacgcacagcgggcacggagccgggct
gagggtgagaaacagaaggcagagacagtacaggcagcgctggaggaggcccagcgggca
cagggtgctgctcagagtgccatccagggggcagtgcttgacacacaggacacagagcag
accctgcaccaggtgcaggagaggatggcaggtacagagaaggcattgagctctgcaggt
gagcgggctcagcaattggatggtctcctggaggctctgaaattgaagcgagcagggaat
agcctggcagcctctagcgctgaagaaacagctggcagtgcccagggtcgtgcccgggaa
gctgaacagctgctgaagggcccactaggtgaccagtaccagacagtgagggccctggct
gagcgcaaggcccagggtgtgctggctgcgcagacacgggcagaacaactgcgggatgag
gctcgcggcttgttgcaggctgctcaagacaagctgcagcggttgcaagagctggaaggc
acttatgaggagaatgagcgggcgctggagggcaaagcggctcagctggacgggctggag
gccaggatgcgcagtgtgcttcaagccatcaacttgcaggtccagatctacaacacctgc
cagtga

KEGG   Physeter catodon (sperm whale): 102988692
Entry
102988692         CDS       T06011                                 

Gene name
COL4A4
Definition
(RefSeq) collagen alpha-4(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102988692 (COL4A4)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102988692 (COL4A4)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102988692 (COL4A4)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102988692 (COL4A4)
  09154 Digestive system
   04974 Protein digestion and absorption
    102988692 (COL4A4)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102988692 (COL4A4)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102988692 (COL4A4)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102988692 (COL4A4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102988692 (COL4A4)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102988692 (COL4A4)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102988692 (COL4A4)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102988692 (COL4A4)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102988692 (COL4A4)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102988692 (COL4A4)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102988692
NCBI-ProteinID: XP_028338109
UniProt: A0A455APU9
LinkDB
Position
2
AA seq 1686 aa
MALVRRSFRWIKPLATDSWSLILILFSVQHAYGGGKKFIGPCGGRDCSLCQCFPEKGSRG
QPGPPGPQGPIGPLGPPGPIGIPGEKGMRGDSGLPGAAGEKGDKGPTGVPGFPGLDGIPG
HPGPPGSRGKSGIDGYNGSRGAPGLPGERGALGPQGSPGLLGKKGEKGNSVFILGVIKGI
QGDRGDPGPPGLPGPRGARGPAGPVGYPGESGLAGPPGPPGSPGLKGNPGVGVKGQMGDP
GEVGQQGSPGPTLLVQPPDSGLYKGEKGIKGTPGITGPPGPPGPKGEPGIGTKGEKGIPG
FPGPRGDPGSYGSPGFLGLKGEPGVFGDPGPFGFLGPKGDPGDRGHPGPPGVLATPALPL
KGPPGDPGHPGRYGETGAVGPPGPPGPPGRPGEVCAGMMGPPGPRGFSGHPGFPGAAGIP
ARADSAPGKPGNQGPPGLPGEPGLPGPPGSDVVYCSAGPPGPRGIKGKVGPPGRRGSKGE
KGSTGLCACQPGPSGPPGAPGLPGRQGSKGDLGLPGRLGEKGYAGLPGARGPPGPPGKPG
TSGPPGSKGEKGDLVVSRVKGHKGERGPDGPPGFPGQQGQHGQDGYAGEKGDPGPLGDHE
DAAPGDKGPPGPPGPPGRAGPRGPPGLGFPGPPGERGQPGAPGCPGKRGPDGWKGQKGDT
IPCNVTYPGRPGPPGFDGPPGPKGFPGRRGAPGLRCLDGEKGQRGKPGISEIPGPPGFHG
DMGDPGFGGEKGSSPVGPPGSPGLPGMNGQKGPPGDSAIGYPGPPGKRGLLGVPGSKGLR
GDPGRPGAAGPAGMPGFPGLKGPKGREGSAGFPGIPGPPGHSCERGAPGRPGPPGLPGAP
GSPGAPGWKGQQGGMGPPGPAGMKGLPGVPGRPGADGPPGPPGVPGPTGDDALPGLPGPK
GPQGLPGFLGFPGERGKPGPDGHPGRKGEHGEKGWPGSLGDGGVKGDKGARGPPGYEGEM
DIISIKGETGEPGPPGDGGFPGEEGDKGHPGIPGKKGEPGSCGPPGFHRGEPGRNGQPGL
PGPPGPPGSPGLRGLIGFPGFPGDQGEPGSPGPPGLSGIDGTRGPKGYKGEPASQFGLPG
PKGEPGSPGYPGHLGAPGEQGLPGVQGPGGPPGRPGPPGSSGPAGCPGNPGLPGLQGHPG
EMGDPGPRGLVGDPGTPGLPGIKGPSGSPGLNGLHGLKGQKGSKGTSGSHKTGPPGPVGM
PGPKGETGDPGSPGISPPGLFGDKGPPGPPGRPGPPGPAGAPGRALKGDIPDPGLPGDQG
PPGPDGPRGIPGPPGPPGIVDLLKGEPGDCGLPGPPGPPGPPGPPGRKGFPGCDGKDGQK
GPMGFPGLQGPPGTPGPPGEKGLPGPPGRQGPSGPPGSRGEPGPPADLDACPRIPGLPGI
PGPRGPEGAMGLPGTRGPPGPGCKGEPGLDGRRGEDGLPGSPGPPGHRGDAGEAGCSGVP
GPPGPVGDPGPKGFGPGYLSGFLLVLHSQTDGEPTCPTGMPRLWTGYSLLYLEGQEKAHS
QDLGLAGSCLPMFSTLPFAHCNIHQVCQYARRNDRSYWLASTAPLPVTPLSEEEIRPYIS
RCAVCEAPAQAVAVHSQDQSIPPCPRAWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLE
DFRAAPFLECQGRQGTCHFFANEYSFWLTTVRPDLQFSSAPSPATLKESQAQRQKISRCQ
VCVKHS
NT seq 5061 nt   +upstreamnt  +downstreamnt
atggcattggtaaggcgctctttcaggtggatcaagcccttggccacagattcctggtcg
ctcatacttattctcttttctgtacaacatgcgtacgggggtggaaagaagtttattggt
ccctgcgggggaagagattgctctctttgccagtgctttcctgaaaaggggtctcggggt
caaccaggaccaccagggccacagggtcctattggacccctggggccaccaggacccatt
ggaattccaggagagaaagggatgagaggtgacagtggtcttcctggggcagcaggtgaa
aagggtgataagggtccaaccggtgttcctggattcccaggtttggatgggatacctggg
cacccagggcctcctggatccagaggtaaatctggcatagatggctacaatggctcaaga
ggtgctccagggcttccaggagaaagaggagctcttggcccacaaggctccccaggcctt
cttgggaaaaaaggagaaaaaggaaattcagtgttcattttaggtgtcattaaaggtatt
cagggtgacagaggggacccaggacctcccggcttaccaggcccaaggggcgcgagagga
ccagcaggcccagtgggatatccaggagagtcagggttagcgggacctccaggccctcct
gggagtccgggtttgaagggtaatcctggcgtgggagtaaaggggcaaatgggagacccg
ggtgaggttggccaacagggttctccaggacccacccttctggtgcagccacctgattct
ggtctctataaaggagaaaagggtattaaaggaacgcctggaatcactggacctcccgga
ccaccaggacccaagggggaacccggtattgggacaaaaggagagaaaggtattcctgga
tttccaggacctcggggggaccctggctcctatggatctccaggttttctaggattaaag
ggggaaccaggtgtgtttggagatcctgggccatttggatttcttggtccaaagggagat
cctggagaccgtgggcacccaggaccaccaggggttttggcaactccagctcttccactc
aaaggccctccaggggatccagggcaccctggccgctatggagaaacgggggctgttgga
ccacctggtccccctggtccccctggtagaccaggggaagtctgtgcaggcatgatggga
ccccctgggccacgagggttttctggtcatccaggatttccaggggcagctggtattcct
gcgagagctgattctgctccaggaaaaccaggcaaccagggaccacctgggttgcctgga
gagccagggttgccgggacctccaggatcagatgttgtgtattgtagtgctgggcctcct
ggaccacgaggaataaaaggcaaagtgggtcctccaggaagaagaggctcaaaaggagaa
aaaggaagcacagggctctgtgcctgccagcctggtccctcaggcccacccggcgctcca
ggacttcctgggaggcaagggagtaaaggagacttgggactccctgggcggcttggagaa
aaaggttacgcaggccttcctggtgccagaggacctccagggccaccaggaaaacctggt
acctcaggaccacctggcagcaaaggagaaaaaggtgacctggttgtatcaagagtgaaa
gggcacaaaggagaaagaggtcctgatgggcccccaggatttccagggcagcagggacaa
cacggtcaagatggatatgctggagaaaaaggggacccaggaccgctgggagatcatgaa
gatgcagccccaggtgataaagggcctcctggaccaccaggccccccgggcagagcagga
cccagggggcctccaggactgggatttcctggtccaccaggagagagagggcaaccagga
gctccaggctgcccaggcaagcggggccctgatggctggaagggtcagaaaggtgataca
attccttgtaatgtaacctaccctgggaggccaggcccaccaggttttgatggacctcca
ggtccaaagggatttccaggtcgtcggggagctcctgggttgaggtgtttggatggggaa
aagggtcaacgtggcaaaccaggaatctcagaaatacctggtccacctggttttcatggt
gatatgggagatccaggttttggaggtgaaaaggggtcctcccctgtcggacccccaggc
tctccaggtttacctggaatgaatggtcagaaaggacctccaggagactctgctattggc
tacccaggacccccaggaaagaggggtcttttaggagtaccagggtcaaaaggactcaga
ggtgatcctggacgaccgggggctgcagggccagctggcatgcccggattcccgggtctc
aaaggtcccaaagggagagagggaagcgctgggtttccagggatcccaggtccacccggc
cattcctgtgaaagaggcgctccagggagaccagggccaccagggctccctggagctcca
ggaagtccaggtgccccaggttggaaaggacaacaagggggcatggggcctcctggaccc
gctggaatgaagggcctccccggagtcccaggacggccaggggcagatggacccccaggg
cccccgggagtcccaggccccactggggatgatgcgctacctggtcttccaggcccaaag
ggaccccaggggctgcccggcttcctgggttttccgggagagagaggaaaacctgggcca
gacggacaccctggcagaaagggagaacacggagaaaagggttggcccggctccctggga
gacggaggagtgaaaggtgacaaaggagcaagaggacccccaggatatgaaggagaaatg
gatattatttctataaagggggaaaccggggaacctggacctcctggagatggtggattc
ccaggagaagaaggtgataaaggccatcctgggataccagggaagaaaggagagccggga
agctgtggaccacctggatttcacagaggggagcccgggaggaatgggcagccaggcctt
cctggacccccaggccctccaggctcacctgggctgagagggctcattggctttccagga
tttccaggtgaccagggtgagccaggttctccagggccccctggactttcaggaattgat
ggtacaagaggacctaaaggatacaaaggggaacctgcaagtcagtttggcctacctggt
ccaaagggtgagcccggtagccctggatatccaggacatttgggagcacctggagagcag
ggcttgcccggtgttcaaggacctggaggaccacccggaaggccaggaccacctggctcc
tccggaccagcggggtgtccaggtaatccagggctgcctgggctgcagggacatccagga
gaaatgggggatcctgggccaagaggcctcgtgggggatccagggacaccaggtcttccg
ggaataaaaggtccctccgggtcgcccggcctgaacggcttgcacggtttaaagggtcag
aaaggcagcaaaggcacttcaggttcccataaaacgggcccacctggtccagtgggaatg
cccgggccaaaaggcgagacaggagaccccgggagcccaggaatttctcctcccggactt
ttcggagacaaaggtcccccaggtcccccagggagacctggaccgcctggtcctgcaggt
gccccaggaagagctcttaagggtgacattccggacccaggtctgcctggagatcaggga
cctcctggccccgatggtccaagaggaatacccgggcctccagggccccctgggattgtt
gaccttctgaaaggtgaaccaggtgactgtggtctgccagggcctccaggtcccccaggc
ccgccaggccctccaggacgcaaaggcttcccaggatgtgatggaaaagatggccagaaa
ggaccaatgggattcccggggctgcaggggccacctggaactcctgggccacctggagag
aagggtttacctggacctccaggcagacaggggccctcgggtcccccaggttccagaggt
gaacctgggccccctgcagacttggatgcctgcccccgaatcccggggcttcctgggata
ccaggcccaagaggcccggaaggagccatggggctccctggaacgagaggccccccagga
ccagggtgcaaaggagagcccgggctggatggcaggagaggcgaggatggccttccaggg
tcccctgggcctccgggacacagaggtgacgcgggagaagcgggctgctcaggagtacca
ggccctcctgggcccgtgggggatcctgggcccaaagggtttggccctggatacctcagt
ggcttcctcctggttctccacagtcagacagatggagaacccacctgccccacgggcatg
cccaggctctggaccgggtatagtctgttatacctggaaggacaggagaaagctcacagt
caagaccttggtctggcgggctcctgccttcccatgttcagcacgctgcccttcgcccac
tgcaacatccaccaagtgtgccagtacgcccggaggaacgacaggtcctattggctggcg
agcaccgcgccgttgcccgtgacgccgctctcggaggaggagatccgcccgtacatcagc
cgctgcgccgtgtgcgaggcccccgcgcaagccgtggccgtgcacagtcaggaccagtcc
atcccgccgtgtccgcgggcctggaggagcctctggattgggtactctttcctgatgcac
acaggcgctggggaccaaggaggcggccaggccctcatgtcacccggcagctgcctggag
gatttcagagcggcaccgttcctggaatgccaaggacggcagggaacttgccactttttt
gcgaatgagtatagcttctggctgacaactgtgagacctgacttgcagttctcctccgca
ccctcaccggccaccttgaaagaaagccaagcccagcgccagaaaatcagcaggtgccag
gtctgtgtgaagcatagctag

KEGG   Physeter catodon (sperm whale): 102989664
Entry
102989664         CDS       T06011                                 

Definition
(RefSeq) laminin subunit gamma-3
  KO
K06247  laminin, gamma 3
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102989664
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102989664
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102989664
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102989664
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102989664
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102989664
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102989664
   05145 Toxoplasmosis
    102989664
SSDB
Motif
Pfam: Laminin_EGF Laminin_B XhlA
Other DBs
NCBI-GeneID: 102989664
NCBI-ProteinID: XP_028342764
UniProt: A0A455B2L4
LinkDB
Position
Unknown
AA seq 895 aa
QLTGLQILDQARPELHPCPSSSCLLTHHKEADWPILPSYRVPRPLSHTLPRLLPTACNCS
GHSEECMFDRELFRSSGHGGRCLRCRDHTAGPHCERCRENFYRWRPRTPCQPCDCHPAGS
LRLQCNDSGTCICKPTVTGWKCDRCLPGFHSLSEGGCRPCTCNPAGSLGTCDPRSGRCPC
KENVEGNLCDRCRPGTFSLQSHSPAGCSSCFCYGHSRACAAAARFREHHIVSDFRQGAEG
WRARSTGGPEHPAQWSPRGLLLDPEDKEDLTAPEKFLGDQRFSYGQPLTLTFRVPPRGSP
VPARLRLEGAGLTSTLRRSSSPSPLDARQPGEVQLKFLLQETSEDVDLPLPPFHFQRLLT
NLTALHIQASGHSARLSGPVFLTEVRLTSARRGLSPPASWVETCSCPTGYTGQFCESCAL
GYKRETPLGGPYASCVPCTCNQHGTCDPNTGICLCGHHTEGPSCEHCLPGFYGNPFTGQA
DDCQPCPCPGQSACVTIPESREVVCTHCPPGQRGWRCEICDDGSFGDPLGLSGAPQPCQL
CQCSGNVDPNAVGNCDPLSGHCLRCLHNTTGVHCESCQEGFYGSALSPRPADRCVPCSCD
QAGSVSEQRPCDPVTGQCSCLPHVTGRDCGHCSPGFYDLQPGRGCRSCKCHPLGSQGDQC
HPKTGQCPCRPGVEGQACDRCQLGFFGFSIKGCRACRCSPLGAASAQCHENSTCVCRPGF
VGYKCDLCQDNFFLTAGGTHCQECPSCYALVKKEAAKLKARLTLMEGWLQGSNCGRTWGP
LDILQGEAPRGNVYQGHPLLQGAQEDFLEQVTGLEGAVKAAREQLQALSRNAHCAQARAE
KTCVQLADLEAVLESSEEEILHAATILASLAIPQEGLGQPTNWSHLAIEARALAR
NT seq 2689 nt   +upstreamnt  +downstreamnt
atcaattgactggtctccagatccttgatcaagccaggcctgaactccatccttgccctt
cctccagctgcctgctcactcaccacaaggaagctgactggcccatactcccctcctacc
gtgtgccacggcctctctcacacaccctcccccgcctcctacccacagcttgcaactgca
gcggccactctgaggagtgcatgtttgaccgggagctcttccgcagctcgggccatggtg
ggcgttgcctccgctgccgtgaccacacagccgggccacactgtgagcgctgccgggaga
acttctatcgctggaggccacggacgccatgccagccctgcgactgccacccagcaggct
ccctgcgcctccagtgcaatgactccggcacctgcatctgcaagcccacggtgacgggct
ggaagtgtgaccgctgcctgcccggcttccactcgctcagtgagggcggctgcagaccct
gtacctgcaatcccgctggcagcctgggcacctgtgacccccgcagtggccgctgtccct
gcaaagagaatgtggaaggcaacctgtgtgacagatgtcgcccgggcacatttagcctgc
agtcccacagcccagccggctgcagcagctgcttctgctacggccactccagggcgtgtg
cggccgctgcccgcttccgggagcaccacatcgtctccgacttccgccagggagccgagg
gctggcgggccagaagcacggggggcccagagcatcctgcacaatggagcccgagggggc
tcctcctggaccccgaggacaaggaggacctcacagcaccagagaagttcctgggagacc
agcggttcagctatgggcagcccctcacactgacattccgggtcccccccaggggctccc
ccgtccctgcgaggctgaggctggaaggggcgggtctgacctcgactctgcggcgttcca
gctcgcccagccccctggacgccaggcagccgggggaggtacagctcaagttcctgttgc
aagagacctccgaggacgtggaccttccgctgccccccttccacttccagcggctgctca
ccaatctgactgctctgcacatccaggccagtggccacagtgccaggctttccggcccag
tgttcctgactgaggtccggctcacgtcggcccggcgggggctctcccccccagcctcct
gggtggagacttgctcatgtcccacgggctacacgggccagttctgtgagtcctgtgctc
tgggatacaagagggagacaccactggggggtccctatgccagctgtgttccctgcacct
gtaaccagcatggcacctgtgaccccaacacagggatctgcctatgcggccaccacaccg
agggcccgtcctgcgagcactgcttgccaggtttctacggcaaccccttcacaggccaag
ccgatgactgccagccctgtccgtgccctggacagtcggcctgcgtgaccatcccagaga
gcagagaggtggtgtgtacccactgccccccgggccagagagggtggcgctgtgagatct
gtgatgatggctcttttggggacccgctggggctctctggggccccccagccctgccagc
tgtgtcagtgcagtgggaacgtggaccccaacgcagtgggcaactgtgaccccctgtctg
gccactgcctgcggtgcctacacaacacgacgggtgtccactgtgagagctgccaggaag
gcttctacgggagcgccctgtcccctcggcccgcagacagatgcgtgccctgcagctgcg
accaggcgggctcagtcagtgagcagagaccctgtgacccagtgaccggccagtgttcct
gcctgcctcatgtgaccggacgagactgtggccactgcagccctggcttctacgacctcc
agcctggcaggggctgccggagctgcaagtgccacccgctgggctcccagggggaccagt
gccaccccaagaccgggcagtgcccctgccgtccaggcgtcgagggccaggcctgcgaca
gatgccagctgggtttcttcggcttctccatcaagggctgccgggcctgcaggtgctccc
cgctgggcgccgcctcggcccagtgtcatgagaacagcacgtgtgtgtgcaggcccggct
ttgtgggctacaagtgtgacctctgccaggacaacttcttcctcacggccggcggcacac
actgccaggagtgcccgtcctgctacgccctggtgaagaaggaggctgccaagctgaagg
ccagactgaccctgatggaagggtggctgcaggggtcaaactgtggcaggacctggggac
cactggacatcctacagggagaggccccacgggggaacgtctaccagggccaccccctgc
tgcaaggggcccaggaagacttcctggagcaggtgacaggccttgagggtgctgtgaagg
ctgcccgggagcagctgcaggcactgagcagaaatgcccactgtgcccaggccagagctg
agaagacctgcgtccagctggcagacctagaggcggtgctggagtcctcagaagaggaga
ttctgcatgcagccaccatccttgcatctctggcgattcctcaggaagggctcggccagc
ccaccaactggagccacctggccatagaggcccgtgccctcgccaggag

KEGG   Physeter catodon (sperm whale): 102989754
Entry
102989754         CDS       T06011                                 

Gene name
COL9A2
Definition
(RefSeq) LOW QUALITY PROTEIN: collagen alpha-2(IX) chain
  KO
K08131  collagen type IX alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04974  Protein digestion and absorption
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102989754 (COL9A2)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102989754 (COL9A2)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102989754 (COL9A2)
 09150 Organismal Systems
  09154 Digestive system
   04974 Protein digestion and absorption
    102989754 (COL9A2)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102989754 (COL9A2)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00535 Proteoglycans [BR:pcad00535]
    102989754 (COL9A2)
Proteoglycans [BR:pcad00535]
 Extracellular matrix (ECM) proteoglycans
  Collagen family
   102989754 (COL9A2)
SSDB
Motif
Pfam: Collagen
Other DBs
NCBI-GeneID: 102989754
NCBI-ProteinID: XP_007114315
UniProt: A0A2Y9F377
LinkDB
Position
3
AA seq 829 aa
MAQGMGRGLWVGGRGCVLEIRLDRDLGRTSASSERNGGVIPVPHPGGSSDNPADAPGKPE
ARMEGGRWRLDGAGRSWAEGAGLPPRVLSGDRPPRAPAPPRRIFRRPDPPRPRAASAAAP
EPQRSVRAWSRHPSSSSPPPAMAPAAARRSLLVLLQVLGLALAQIRGPPGERGPPGPPGP
PGVPGSDGIDGDKGPPGKAGPLGPKGEPGKAGPDGPDGKPGIDGLTGAKGEPGPMGIPGV
KGQPGLPGPPGLPGPGFAGPPGPAGPVGLPGEMGLTGPKGDPGPEGPAGPPGPPGKPGRP
GTIQGLEGSADFLCPTNCPAGVKGPPGLQGVKGHPGKRGVLGDSGRQGKPGPKGDVGASG
EQGIPGPPGPQGVRGYPGVAGPKGETGPHGYKGMVGSIGAAGSPGEEGPQGPPGRAGEKG
DVGSQGIRGPQGITGPKGATGPPGTDGKDGTPGTPGMKGSAGQVGRPGNPGHQGLVGVPG
HPGTKGGPGDKGEPGRQGFPGVSGSPGKEGEPGPRGEIGPQGIMGQKGDHGERGPVGQPG
PQGRQGPKGEQGPPGIPGPQGLPGIKGDKGSPGKTGPRGGVGDPGVAGLPGEKGEKGESG
EPGPKGQQGVRGEAGYPGPSGDAGAPGVQGYPGPPGPRGLAGNRGLPGQPGRQGVAGRDA
SDQHIEDVVLKMLQEQLAEMAVSAKREALGATGMMGLPGPPGPPGYPGKQGPIGHPGPRG
IPGIVGAVGQIGNTGPKGKRGEKGDRGEVGRGHPGMPGPPGIPGLPGRPGQAINGKDGDR
GIPGAPGEAGRPGLPGPMGLPGFCEPAACLGASAYTSARLTEPGSIKGP
NT seq 2490 nt   +upstreamnt  +downstreamnt
atggctcaggggatgggccgcggactgtgggtcgggggaaggggctgcgtcctggagatc
aggctggacagggacctgggaaggacctctgcctcctcagagagaaacggaggcgtgatc
cccgtcccgcaccctggcggctcttcggataacccggccgatgccccggggaagcccgag
gcgaggatggagggtgggcgctggcggctggacggagcggggcggagctgggccgagggc
gccggcctccctccccgggtcctctcaggtgacagacccccgcgggcccccgccccgccc
cgccgcatattcaggaggccggacccgccccgtccgagagcagctagcgcggccgcgccg
gagcctcagcggtcggtgcgtgcgtggagccggcacccatcctcctcgagtccgccgccc
gccatggcccctgcggccgcccgccgcagcctcctggttctcctgcaggtgctcgggctc
gccctggcgcagatcagaggtcctccaggagagcgggggcccccgggccccccggggcca
ccgggagtgcctggatccgacggaatcgacggtgacaaggggccccctgggaaagctggc
cctctgggacctaagggagagcctggcaaagcagggccagatgggccagacggcaagccc
gggattgacggtctaactggagccaagggggagcctggtcccatggggatccctggagtc
aagggccagcctgggctcccaggtccccccggcctgccgggccctggcttcgccggacct
cctggaccagctggacctgttggcctccctggtgagatgggactcacgggccccaagggg
gatcctggaccagagggaccagcagggcccccggggccccctgggaaaccgggccgccca
ggaaccattcagggtctggaaggcagcgcggacttcctgtgtccaaccaactgtccagcg
ggcgtgaaagggcccccggggctgcagggagtgaaggggcatcctggcaagcgcggggtt
ctgggagattccggccgccaggggaagccgggtcccaagggagatgtgggtgcctctgga
gagcaaggcatccctggaccgccgggtccccagggcgtcaggggctacccgggcgtggcg
ggacccaaaggagagacgggtcctcacgggtacaaaggcatggtgggctccatcggcgcc
gccgggtcaccgggtgaggaaggtcctcaggggccaccaggccgagctggggagaagggt
gacgtgggcagccaaggtatccgaggaccccagggaataacaggcccaaagggagcaacc
ggcccaccaggcactgacggcaaggacgggaccccaggcacacctggcatgaagggcagc
gcaggacaggtggggcggccaggaaacccgggccaccagggcctagtgggtgtgcccggc
caccctgggacaaaaggaggccctggagacaagggtgagccaggccggcaaggcttccca
ggagtctctggttcccccgggaaggagggagagccagggcctcgaggagaaatcggtccc
cagggcatcatggggcagaagggtgaccacggtgagagggggccagtggggcagccaggc
cctcaaggccgacagggccccaagggggaacagggtccccccggaattccagggccccaa
ggcttgccaggcatcaagggagacaagggctccccagggaagaccgggcctcgcggcgga
gtgggcgacccgggggtggctggcctcccaggagagaaaggcgagaagggcgagtctggc
gagccggggcccaagggacagcaaggagtccgcggagaagccggctatccgggccccagc
ggggatgcaggcgccccgggggtgcagggctaccccgggcccccgggtcctcgaggactg
gctggaaaccgaggcttgcccggacagcccgggaggcagggcgtagcgggccgagatgcc
agtgaccagcacatcgaggatgtcgtgctgaagatgctgcaagagcaactggcagagatg
gctgtgagtgccaagcgggaggctctgggtgcgaccgggatgatgggtctcccaggaccc
cctgggcctcctgggtacccaggcaaacagggacccattgggcaccctggccctcggggc
attcctggcatcgtgggagccgtgggtcagattggcaacaccgggcccaagggaaaacgt
ggagagaagggtgataggggagaagttggacgagggcatcccgggatgcctgggcctcca
gggatcccaggactgcctggccggcctggccaggcaatcaacggcaaagatggagatcga
gggatcccaggggccccaggagaggcaggccgacctggcctgccaggccccatggggctg
ccaggcttctgtgagcctgcggcctgccttggagcctcagcctacacgtctgcacgcctc
acagagcctggatccatcaaggggccatga

KEGG   Physeter catodon (sperm whale): 102992563
Entry
102992563         CDS       T06011                                 

Gene name
COL4A3
Definition
(RefSeq) collagen alpha-3(IV) chain
  KO
K06237  collagen type IV alpha
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad04926  Relaxin signaling pathway
pcad04933  AGE-RAGE signaling pathway in diabetic complications
pcad04974  Protein digestion and absorption
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102992563 (COL4A3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102992563 (COL4A3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102992563 (COL4A3)
 09150 Organismal Systems
  09152 Endocrine system
   04926 Relaxin signaling pathway
    102992563 (COL4A3)
  09154 Digestive system
   04974 Protein digestion and absorption
    102992563 (COL4A3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102992563 (COL4A3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102992563 (COL4A3)
  09167 Endocrine and metabolic disease
   04933 AGE-RAGE signaling pathway in diabetic complications
    102992563 (COL4A3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102992563 (COL4A3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102992563 (COL4A3)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04147 Exosome [BR:pcad04147]
    102992563 (COL4A3)
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102992563 (COL4A3)
Exosome [BR:pcad04147]
 Exosomal proteins
  Exosomal proteins of other cancer cells
   102992563 (COL4A3)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102992563 (COL4A3)
SSDB
Motif
Pfam: Collagen C4
Other DBs
NCBI-GeneID: 102992563
NCBI-ProteinID: XP_023980394
UniProt: A0A2Y9SXA0
LinkDB
Position
2
AA seq 1670 aa
MRPGTAPRPPALLFLLQLALLVAAPTAGKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGPP
GQKGYQGPEGLPGPQGPKGSPGLPGLTGPKGVRGTTGLPGFSGPPGLPGIPGHPGPYGPV
GLPGCNGSKGEQGFPGLRGTPGYPGLPGAIGLKGEKGAPAEGEGVALDGKGDPGLPGAPG
FQGLPGLPGFPGPIGPPGPPGFFGFPGTMGPPGPKGHMGDNVIGQKGERGMKGLTGPPGP
PGTVIVTLTGLVNRTDLKGEKGDQGAVGQSGRPGPSGPPGESYGSEKGAPGEPGPQGKPG
KDGAPGFPGTEGAKGSKGFPGLRGEDGIKGWKGDMGPPGFPGPTEYYDAYQEKGDEGIPG
PPGPKGAPGPQGPSGPPGAPGSPGSSRPGLQGAPGVPGVKGSKGEQGPPGKNAAGLPGSP
GWPGSPGPTGLPGPPGPPGGIVFRRGPPGDSGLPGHTGSPGIPGVDGPKGEPGLLCTQCP
WVPGPPGPPGLPGLDGVKGFPGGQGAAGIKGSPGSPGNAGLPGFPGFPGAQGHPGLKGEK
GEASQPEGQVGAPGDPGLRGHPGRKGLDGIPGTPGVKGLPGPKGEPALSGEKGDQGLPGD
PGIPGSPGPAGPAGPSDYGPWGEPGPKGTQGLPGAPGPPGEAGPKGEIGVLTPVPGPPGP
PGPTGRAGPQGPPGIPGSTGKCDLGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSPG
CPGEMGKPGLPGKPGLPGVKGEPGLAMPGEPGAPGLPGERGNSGENGEIGLPGLPGLPGI
PGTGGLDGPRGDPGQPGPPGERGPPGRCKESLSGAQGLPGLNGLKGQQGRRGETGPKGDP
GIPGLDRSGFPGEPGPPGMPGHQGEMGPPGTKGYPGNPGFLGPPGEKGMIGMMGVPGNVG
PPGPPGKPGTLGQRGSFGIPGVKGERGSPGAKGEQGDKGTPGPPGIFRLVGDKGEPGLKG
LAGKPGEKGNRGIPGLPGFKGLQGPPGPPGSPGPRGDPGSIGNPGEPGPRGVPGSMGNMG
VPGTKGMKGTMGFPGLAGRPGLPGTHGLQGDKGEPGYSEGTRPGPPGPKGDPGLPGDMGK
KGDRGPPGTPGRSGPAAPEGVPGNPGSPGHPGKPGPDGDLGSKGIKGFPGSPGIKGPPGP
PGFPGNPGPAGERGNQGRDGIPGPAGEKGETGLLGALPGPRGKPGPQGAKGDRGSPGLPG
LPGRKGPVGDAGPRGPIGMTGPQGPPGFPGSIIPGQKGDRGPPGRRGNPGEPGPPGPQGS
PVEGIKGDKGFMGQPGPRGPPGTVGDMGPPGHPGAPGAPGLPGVRGDPGFYGFPGMKGEK
GNSGFLGPSGPPGRIGPKGPPGARGDPGTVKIISLPGSPGPPGPAGEPGVQGEPGPPGLP
GNSGPCGPRGKPGKDGLPGTPGPIGEKGNKGCKGEQGQPGSDGLPGLKGRPGDTGPPARG
TAVRGFVFTRHSQTTAIPSCPEGTEPLYRGFSLLFVQGNEHSHGQDLGTLGSCLQRFTTM
PFLFCNINDVCNFASRNDYSYWLSTPALMPVDMAPITGRALEPYISRCTVCEGPAIAIAV
HSQTTDIPSCPPGWISLWKGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFIECHGRG
TCNYYSNSYSFWLASLDPKRMFRKPIPSTVKAGELEKIISRCQVCMKRRH
NT seq 5013 nt   +upstreamnt  +downstreamnt
atgcgccccgggacggcgccgaggccgccggcgttgctgttcctgctccagctggcgctg
ctggtggccgcgcccactgccggcaagggctgtgtctgtaaagacaaaggccagtgcttc
tgtgatggggccaaaggggagaagggggagaaaggcttcccaggaccccctgggcctccc
ggccagaaaggataccaaggtccggaaggcctgcctggaccacaggggcccaagggctct
ccaggacttccaggactcactggtcccaaaggcgtgaggggaacaactggattaccagga
ttttcaggtcctcctggacttccaggcatcccaggccatcctggaccgtatggacctgtc
ggtttgccaggatgcaatggttctaagggtgaacaagggtttccaggacttcgggggaca
ccaggctacccaggcctcccaggtgccattggtttgaaaggagagaaaggggctcctgct
gaaggagaaggtgtagcccttgatggaaaaggtgaccctgggttgccaggagctccagga
ttccagggtttgccgggccttccaggttttcccggacctattggtccacctggacctcca
ggattctttggcttcccaggaaccatgggaccgccaggacctaagggtcacatgggcgat
aacgtgataggacaaaaaggagagaggggtatgaaaggattaacaggaccgcccggacca
ccaggaacagttattgtgacactaaccggcctggttaacagaacggatctcaagggggag
aagggagaccagggagccgtgggccagtctggacgccctggaccctcagggccacctgga
gaatcttacggatcagaaaaaggtgctcctggagaaccaggcccacagggaaagcctgga
aaagatggtgcccccggtttccctggcactgagggagccaaaggcagcaagggtttccct
gggttaaggggcgaagatggcattaaggggtggaaaggagacatgggccctccaggattt
cctggtccaacagaatattatgatgcataccaggaaaagggagatgaaggaattcccggc
ccgccagggcccaagggagctcctggcccacagggtcccagtggtccccctggagcccct
ggaagtcctgggtcatcaagacctggcctccaaggagcccctggagtaccaggagtaaaa
ggaagtaaaggggagcaaggacccccaggaaagaacgcagcggggcttcccgggtcccct
ggctggcctggttcaccaggccctacagggttgccaggacctccaggaccaccaggtggt
attgtttttcgcagaggtccacctggagacagtggacttccaggccatacagggtctcca
ggaatcccaggagtcgatgggcccaaaggggagccaggccttttgtgcacacagtgtcct
tgggtcccagggcctccagggcccccaggattgccggggttagatggcgtaaaaggattc
ccaggaggacaaggggcagctggcattaaaggaagcccagggtccccaggaaacgctggt
cttccaggatttccgggattcccaggtgctcagggtcacccaggacttaaaggagaaaaa
ggtgaagcatctcagccagagggacaagtgggtgccccaggggatccagggctccgagga
catcctggaagaaagggcttggatggaattcctggaactccgggagtaaaaggattacca
ggacctaagggagaaccggccctgagtggggagaagggggaccagggtcttccaggggat
cctggcatccctgggtccccaggacctgcaggaccagctggaccatcagactatggacct
tggggagagcctggtccaaagggcacccaaggacttcccggagcccctggaccacctgga
gaagccggtcctaagggagaaatcggtgttttgacaccagttccagggcccccaggacct
cctgggcccacaggccgtgctggcccccaaggtccacctggtatccctggatccacagga
aaatgtgatctgggtcttcctgggcctgatggtgaaccaggaattccaggaatcggattc
cctgggcctcctggacctaagggagaccaaggttttccaggaacaaaaggatcaccaggc
tgtcctggagaaatgggaaagccagggttacctggaaagccaggcctcccaggagtcaag
ggagaaccaggactagccatgcctggagaaccaggagcaccaggtttgccaggagaaaga
ggcaactctggggaaaatggagaaattggactccctggacttccaggtcttcctggaatt
ccaggaactggagggcttgatggaccacgaggggatccagggcagcctggaccacctgga
gaaagaggccccccaggaaggtgcaaagaaagtctcagtggagctcaaggacttccaggc
ttaaatggattgaaagggcaacaaggcagaagaggcgaaacggggccaaagggagaccca
ggtattccaggcttggataggtcaggatttcccggagaacctggaccaccaggaatgccg
ggtcatcaaggggagatgggaccacctggtacaaaaggatatccaggaaatccaggattt
ttagggccaccaggtgaaaaaggaatgattgggatgatgggggttcccggaaacgttggc
cctccggggcctcctgggaagccaggcaccctgggacagagggggagctttggaattcca
ggagtaaaaggtgagagaggatccccaggtgccaaaggggaacaaggagataaaggaact
ccagggcctcctggaatattccgcttagtcggggacaaaggggaaccaggactcaaagga
cttgcagggaagcctggtgagaaaggaaacagaggcattccagggttaccaggtttcaaa
ggactccaagggccacctggaccaccaggatcaccaggccccagaggagatccaggcagc
attgggaatcctggtgagccaggaccacgtggtgtgccaggaagcatgggaaacatgggg
gtgccaggtactaaaggaatgaagggaactatgggattcccaggtctagctggaagacca
ggcctcccaggtactcatggtcttcaaggagataagggagagccaggttattcagaaggt
acaaggccaggaccaccgggaccaaagggagatccaggactgccaggtgacatgggaaag
aaaggagatagagggccacctggcacacctggacgttcgggccctgctgcaccagaggga
gttcctgggaatcccggaagccctggccacccaggaaagcccggtcctgatggtgatttg
gggtctaaaggaatcaaaggtttccctggctctccaggaataaaaggccctccaggccct
ccaggattcccaggaaatcctggaccagcgggggagagaggtaatcaaggacgtgatgga
attcctggtccagctggagaaaaaggagaaacaggcttgctgggagcacttccaggccca
agagggaaacctggtcctcaaggagccaaaggagacaggggatccccaggcttgccgggt
ctccctggcaggaaagggccagtgggagatgctgggcctcgagggcccattggcatgaca
ggacctcaagggccaccaggctttcctggttcaatcatccctggccaaaaaggagatcga
ggtccacctggcagaagaggaaacccaggtgagcctggtccccctggacctcaagggagt
cctgtagaaggcataaaaggagacaagggatttatgggccagcctggcccaagaggtccg
cctggaactgtaggagacatggggccaccaggtcatccgggagcaccaggtgccccaggt
ctgccaggggtcagaggtgatcctggattctatggatttccaggcatgaaaggggagaag
ggtaattcaggatttctgggaccaagtggacctccagggcgaattgggccaaaaggacca
cctggtgcacgtggagaccctggcacagtgaagatcatctcccttccaggaagcccaggc
ccgcctggcccagctggagaaccaggggtgcaaggagaacctgggcccccggggctaccg
ggaaactcaggaccctgtgggccaagaggtaaaccaggcaaggatggactaccaggaacc
cctggaccaattggagaaaaaggcaacaaaggttgtaaaggagagcaaggacagcctggg
tccgatgggctgcctggcttgaaggggagacctggcgacactgggccacctgcgcggggg
acagcggtgaggggctttgtcttcacccggcacagccagaccacggcgattccctcctgt
ccagaagggacggaaccactctatagggggttttctcttctctttgtacaaggaaatgaa
cactcccatggacaggacctgggaactcttggcagctgcctgcagagatttactacaatg
ccattcttattctgcaatatcaatgatgtatgtaatttcgcatctcgaaacgattattca
tactggctgtcaacaccagctctgatgccagtggacatggctccaattactggccgggcc
ctggagccttatattagcagatgcactgtctgtgaaggtcctgcaattgccatagctgtt
cacagccagactactgatatcccctcatgtcctcctggctggatttctctctggaaagga
ttttctttcatcatgttcacaagtgcaggttcagagggcgctgggcaggcactggcatcc
cccggttcctgcttagaagaattccgagccagtccatttatagaatgtcatggaagagga
acatgcaactactattcaaattcctacagtttctggttggcttcattagaccccaaaaga
atgttcagaaaacctattccatcaactgtgaaagctggggagttagaaaagataataagt
cgctgtcaggtgtgcatgaagaggagacattga

KEGG   Physeter catodon (sperm whale): 102992685
Entry
102992685         CDS       T06011                                 

Gene name
LAMA1
Definition
(RefSeq) laminin subunit alpha-1
  KO
K05637  laminin, alpha 1/2
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
pcad05410  Hypertrophic cardiomyopathy
pcad05412  Arrhythmogenic right ventricular cardiomyopathy
pcad05414  Dilated cardiomyopathy
pcad05416  Viral myocarditis
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102992685 (LAMA1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102992685 (LAMA1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102992685 (LAMA1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102992685 (LAMA1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102992685 (LAMA1)
  09166 Cardiovascular disease
   05410 Hypertrophic cardiomyopathy
    102992685 (LAMA1)
   05412 Arrhythmogenic right ventricular cardiomyopathy
    102992685 (LAMA1)
   05414 Dilated cardiomyopathy
    102992685 (LAMA1)
   05416 Viral myocarditis
    102992685 (LAMA1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102992685 (LAMA1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102992685 (LAMA1)
   05145 Toxoplasmosis
    102992685 (LAMA1)
SSDB
Motif
Pfam: Laminin_G_1 Laminin_EGF Laminin_G_2 Laminin_B Laminin_I Laminin_N Laminin_II Laminin_G_3 Hepar_II_III DUF948
Other DBs
NCBI-GeneID: 102992685
NCBI-ProteinID: XP_023976369
UniProt: A0A2Y9SKV5
LinkDB
Position
19
AA seq 3374 aa
MALVAARDKNLGSSSQETLSWECRVACVGPGAEQGDAQEGGSGGEVRHSETGSECGFLLL
SGYAAPSSGSPHKLPVVRNRPGPLGCGLEERPARQRRRRGVPASGSAAGPRWGGKGQQGK
YFLGSIRPGQLRQISPPSPSIPLSPPAPQLRARGCSGSTPTPPPSWGMALAVPADRARGP
RGGGLGGRESLWGGVSGEGDESSRGGGEVSGGRSLHGEGSRSGREVSLGRGLRAGGGGLQ
GEGSLAKAGGAGPGTAEVGGGVSEGPAGSADSRSAESQRAVDAFYRSHPSGRDEMRGGGA
GLALLASLLWVAAQGQQRGLFPAILNLASNAYISTNATCGEKGPETYCKLVEHVPRRAMR
NAQCRLCDGSSANPKERHPISNAIDGTNNWWQSPSIQNGREYHWVTVTLDLRQVFQVAYV
IIKAANAPRPGNWILEHSLDGMEFSPWQYYAVSDTECLTHYNITPRRGPPTYRADDEVIC
TSYYSRLVPLEHGEIHTSLINGRPSSDDLSPKLLEFTSARYIRLRLQRIRTLNADLMTLS
HRDPKDLDPIVTRRYYYSIKDISVGGMCICYGHASSCPWDETIKKFQCQCEHNTCGDSCN
RCCPGYHQQPWRPGTVSSGNTCEECNCHNKAKDCYYDENVANQKRSLNTAGQFRGGGVCI
DCLQKTMGINCETCVDGYYRPHRVSPYEDNPCHPCDCDPLGSLSSVCIKNDLHSDLHRGK
WPGQCPCKEGYAGEKCDRCQFGYKGYPACVPCDCNPAGSVNEEPCSEACLCKENVEGKHC
DRCKPGFYNLEERNPQGCSECFCFGVSDVCDSLSWPISQVKDMSGWLVTDLVSSNQLRSQ
QDALGGRQQISINSSAVVQRLTSQYYWSAPEAYLGNKLTAFGGFLKYAVSYEVPVETADG
DPMSHADVIIKGNGLTLSTRAEGLLLQPYEEHLNAVRLVPEHFRDFHSRREVDRDQLMTV
LANVTHLLIRANYNSAKMALYRLDSVSLDTASPNVIDLSLATAVEHCECPQGYVGISCES
CLPGYYRVDGILFGGICQPCECHGHAAECDVHGVCFACQHNTTGNHCERCLPGFYGLPSR
GTPGDCQPCACPLSAASNNFSPTCHLNDGDEVFCDQCAPGYAGDWCERCADGYFGNPTVP
GDSCVPCNCSGNVDPLEAGSCDSVTGECLRCVGNTDGPHCERCAYGFYGDAVTAKNCRAC
ECHGQGSFSAACHPETGLCDCKPHVTGQQCNQCLHGYYGLDTGLGCLPCNCSASGSLSDD
CTEEGQCRCVPGVAGERCDRCARGFYAYQDGGCTPCDCAHTQNTCDPESGECVCPPHTRG
AACEECADGHWGHDPELGCQACNCSSAGSASHQCDVLTGHCQCKPAFGGQTCHQCSLGYR
DFPDCVACDCDLRGTLADTCGREQGLCSCAPETGTCSCKENVIGPQCSQCRAGTFALDEA
NPRGCTPCFCFGLSQLCSEAEGYVRMPVTLGSDQPLLRVVSQSNLKGTTEGVYYQAPDVL
LDAMTVRRHVHTEPFYWRLPRQFQGDQLMAYGSKLRYSVAFYSSSGFGTSNLEPQVLIKG
GRTRKQVIYVDAPAPKNGLRQEQEVGMKENFWKYFNSVSEEPVTRSDFMSVLSNIEYILI
KASYGQGLQQSRISNISMEVGRKTEGSHPKREVASLLEKCLCPPGTAGFSCQDCAPGYHR
GKLPEGDGRRPRPLLAPCVPCNCSKHSDTCDPETGKCLNCSHNTAGDHCHVCAPGYYGKV
TGSASDCSLCTCPHGLPASFSPTCVLEGDHDFRCDACFLGYEGQYCERCSLGYHGDPQMP
GGTCQRCDCSPHGSVHSDCDRGSGQCVCRPGAAGLRCEGCEPRHLLVESDCVSCDDECVG
VLLNDMDHVGDAILSVNLSGIIPVPYGILSNLENTTKYLRDSLLKENIQKELAKIQLEGV
SEQTEDLQRKLDRVLTRSQHVTRATERILDKSRDLLTFTEKLQADVQEIIKRAATLNQTL
DEDVQLPSSTLQNMQKNITSLLEIIQKRNFLQLHQNATLELKAAEDLLSQIQKNYQKSQE
ELEVLKEAASSLLSRHNSKVQAAEELLREAERKTQESGRLLPVVRANLREFNDKKLRVQE
EQNLTSVLIAEGRGLLDAATAPANASGKALAHLEHHRDELLLWTAKIRHHVDDLVMQMSQ
RRALDLVYRAEDHATELQRLAGALDSGLGNVSHVSLNATSATHVHSNIRSLIEESEKLVK
DALRTGSKASTASESLVSNGKVALQRSAEFLKEGNSLSRKHGGIALELSELKNTAERFQE
NADKITKQTNESLLILRAIPEGVRDKGAKIKELATSANRSSMSTLKNIVGLGQKLLNTST
DLSRVNATLRETDKLLRDSSVTALLAGRKVKDVETQANLLFDRLKPLKMLEENLSRNLSE
IKLLISQARKQAASIKVAVSADRDCIRAYQPQISSTNYNSLTLNVKTSEPDNLLFYLGSS
SADFLAVEMRRGKVAFLWDLGSGSTRLEFPDFPIDDSKWHSIYVTRFGNIGSLSVKEMSA
SQKPPPKTSKSPGTANVLDVNNSTLMFVGGLGGQIKKSPAVKVTHFKGCMGEAFLNGQSI
GLWNYIEREGKCHGCFGSPQNEDASFHFDGSGYSVVEKMLRATVTQIIMLFSTFSPNGLL
LYLASNGTKDFLSIDLVDGRVRVTVDLGSGPLALITDRRYNNGTWYKIAFQRNRKQGLLA
VIDAYNTSYKETKQGETPGASSDLNRLDKDPIYVGGLPRSRIVRKGVTSKSYVGCIKNLE
ISRSTFDLLRNSYGVRKGCILEPVRSVSFLRGGYVELPPKSLSPDSELLATFATRNSSGI
ILAALGQHGEKQGHRQAHGPFFSIMLIEGHIEVHVNPGDGTSLRRALLHAPTGTYGDGQE
HSISLIRSGRIITVQLDETAPVEMKLGPSAESRTISVSNLYIGGVPEGEGTPMLKMRSSF
HGCIQNLIFNMELLDLTSATGYEHVDLDSCLLSERPKLALHGEDGELPPEPQALPSPELC
AVDRAPEYVPNARQFGLAEGSHLVLPFNQLAVRKRLLVQLSIRTFASSGLVYYMAHQNQV
DYATLQLHGGRLHFMFDLGKGRTKVSHPALLSDGQWHTVKTEYFKRKGFMTVDGQESPMV
TTGGDATTLDVEGKLYLGGLPSEYRARNIGNITHSIPACIGEVTVNSKQLDKDSPASAFA
VNRCYAAAQEGTFFEGSGYAALVKEGYRVRSDVNITLEFRTSSENGVLLGISSAKVDAIG
LEIVNGKLLFHVNNGAGRITATYEPKAPNTLCDGKWHTLQANKSKHRVVLIVDGNAARAE
SLHTQSTSADTNNPIYVGGYPADVKQNCLSSQTSFRGCLRKLTLIKGPQVQSYDFSTAFD
LQGVFPHSCPGTES
NT seq 10125 nt   +upstreamnt  +downstreamnt
atggccttggtggcagctcgagacaagaatttggggagctcttcccaggaaaccctgtcg
tgggagtgccgagtggcctgtgtgggcccaggggcagagcaaggagacgcccaggaggga
gggtctggaggagaagttagacattcggaaactggctcagagtgcgggtttctcctgttg
agtggatacgcagcgccatctagtggctccccgcacaaactgcccgtggtccggaacagg
ccggggcccctcgggtgcgggctggaggagcggccggcccggcagcggcggcggcgcggg
gtcccggcctcgggctcggctgcagggccgcgctggggcgggaagggtcaacagggcaaa
tactttctcggatccatccgccccggccaacttcggcagatctccccccccagcccctcc
atcccactttctccgcccgccccccaactccgagcccgcggctgcagcgggtcgaccccc
acgccacccccgagttgggggatggctctggcggtccccgcggaccgagcgcgggggcct
cgtgggggaggccttggggggagggagtctctgtggggaggggtctccggagagggggac
gaaagttctaggggcgggggggaggtctccggggggaggagcctccatggggagggctct
cggagtgggcgggaggtctccttggggaggggtctccgggcagggggaggaggtctccag
ggggaggggtctctggcgaaggccggcggggcggggccgggcacggcggaggtcggcggg
ggtgtgtccgaggggccggcgggcagcgcagactcgcgctccgccgagtcgcagcgcgcg
gtcgacgctttctaccggagccacccgagcggcagagacgagatgcgtggcggcggcgcg
gggctggcgctcctggcctcgctgctctgggtcgccgcgcagggccagcagagagggctg
ttccctgccattctcaaccttgccagcaatgcctacatcagcaccaacgcgacctgtggg
gagaaggggccggagacctactgcaagctggtggagcacgtgcccaggagggccatgcgc
aacgcccagtgccggctctgcgacggcagcagtgccaaccccaaagaacgccatcccata
tcaaatgccatcgatggcacgaataactggtggcagagtcccagcatccaaaacgggaga
gagtaccactgggtcacagtcactctggacttgagacaggtctttcaagtcgcttatgtc
atcatcaaagctgctaatgcccctcggcctggaaactggattttggagcattctctggat
ggcatggagttcagcccttggcagtactatgcagttagtgacacggaatgtttgactcat
tacaatatcactccaagacgggggccgcccacttacagggcagatgatgaagtgatctgc
acctcctattactccaggctggtgccgcttgagcacggagagattcatacatcgctaatc
aatggcagaccaagctctgatgatctttcacccaagttgttggaatttacttctgcacga
tacatccgcctgcgcttacagcgcattcgaaccctcaatgccgatcttatgaccctcagc
caccgggaccctaaagacctcgaccctattgtcacaagacgatattactattcaataaaa
gacatttctgttggaggaatgtgtatttgttatggccatgctagtagctgcccatgggat
gaaactataaagaaattccagtgtcagtgtgaacataacacgtgtggtgacagctgcaat
cggtgttgtcctgggtaccatcagcagccctggaggcctggcactgtttcttctggtaac
acgtgtgaagaatgcaattgccacaataaagccaaagactgttactatgatgaaaatgtg
gcgaatcagaagagaagtctgaatactgccggacagttcagaggaggaggggtttgcatc
gattgcctgcagaaaaccatgggcatcaactgtgaaacctgcgtggatggatattacaga
ccacacagggtgtcaccttatgaagacaacccttgtcatccctgtgactgtgaccctctg
gggtccctcagttctgtctgtattaagaatgaccttcattctgacttacacagagggaag
tggccgggtcagtgtccatgcaaggaaggttacgcaggagaaaaatgtgatcgctgccag
tttggttataagggttaccccgcgtgtgtgccctgtgactgtaacccagccggcagcgtg
aatgaggaaccctgctccgaggcttgtctctgtaaggagaatgttgaggggaaacattgt
gatcgatgcaaaccgggattctataacttggaggaaaggaacccccagggctgctcggaa
tgcttctgctttggtgtctctgacgtttgcgacagcctctcctggcccatcagtcaggtg
aaagatatgtctggctggctggtcaccgacttggtcagttccaaccagctccggtcccag
caggacgcgctgggcgggcgtcagcagatcagcatcaacagctcggcggtggtgcagagg
ctgacctcccagtattactggtcggccccagaggcctacctgggaaacaagctgacggcg
tttggtggattcctgaagtacgcggtgtcttacgaggtcccagtggagacggcggacggt
gaccccatgtctcacgccgacgtcatcattaaggggaatgggctcactttaagcacgcgg
gcagagggcctgttattgcagccctatgaagagcacttgaacgcggttagactcgtgcct
gagcacttccgagattttcatagcaggcgggaggtagatcgcgaccagctgatgaccgtg
cttgccaacgtgacgcatctcttgatcagagccaactacaattctgcaaagatggctctt
tacaggttggattctgtctctctggacacagccagccctaacgttatagacctgtccctg
gccacggcggtggagcactgcgagtgtccccaaggctatgtgggcatctcttgtgagtcc
tgtctccctggctattaccgcgtggatggaatactctttggaggaatctgtcagccctgt
gaatgccacggccatgcagccgagtgtgacgttcatggcgtttgctttgcatgccagcac
aacaccaccgggaatcactgcgagcgctgcctgcccggcttctacgggctgccttcccga
gggacccctggggactgccagccgtgtgcctgccctctctccgcggcctccaacaatttc
agccccacctgccacctcaacgacggagatgaagtgttctgtgaccagtgcgccccggga
tacgcgggggattggtgcgagagatgtgcagacggttactttggaaacccaacagtgccc
ggagattcttgtgtcccgtgtaactgcagtggcaatgtggaccccttggaggccgggagc
tgtgattccgtcaccggagaatgcctgagatgcgttgggaacaccgatggcccccactgt
gagaggtgtgcctacgggttctatggggacgccgtgactgcgaaaaactgccgcgcctgt
gaatgccatggccaaggctccttttctgctgcctgccatcctgagactggactctgtgac
tgcaaaccgcacgtgaccggacagcagtgtaaccagtgcctgcacggctactacgggttg
gacacggggctcgggtgcctgccctgtaactgcagcgcgtcgggctccctgtcggacgac
tgcacggaggaaggccagtgtcgctgtgtcccaggtgtagctggggagaggtgtgacagg
tgtgcccgcggcttctatgcctaccaggacggtggctgtacaccctgtgactgcgctcac
actcagaacacctgtgacccggagtcgggggagtgtgtctgcccgcctcatactcggggt
gcggcgtgcgaggagtgtgcagacggacactggggccacgacccggagctcgggtgccag
gcctgcaattgcagtagcgccgggtcagccagtcatcagtgcgacgtgctcaccggccat
tgccaatgtaagcctgcgtttggtggacagacctgccaccagtgctcactggggtacaga
gactttccggactgtgtggcctgtgactgcgacctgagagggacactggccgacacctgt
ggccgggagcagggtctctgcagctgtgccccagagaccggaacctgctcttgcaaggaa
aatgtcattggccctcaatgcagtcagtgtcgagctggcacctttgctctggacgaggcg
aaccctcgggggtgcaccccgtgcttctgctttgggctgtcacagctctgctcagaggcc
gagggctacgtgaggatgccggtaacgctgggctcagatcagcctcttctgcgtgtggtt
tctcagagtaacctcaagggtacgacggagggggtgtattaccaggcccctgatgtcctc
ctggatgccatgaccgtcaggcggcatgtccacacagagccgttttactggcggctgccc
aggcaatttcagggagaccagctcatggcctacggcagcaaactacggtacagcgtggcc
ttctattcttccagtgggttcggcacctccaacctggagcctcaggtgctcatcaaagga
ggccggaccagaaagcaagtcatctatgtggacgctcctgcccccaagaacggactgaga
caagagcaggaagtggggatgaaagagaatttttggaaatactttaactctgtttctgaa
gaacctgtcacacgctcagattttatgtctgttcttagcaacattgagtacatcctcatc
aaagcatcttatggccaaggattacagcaaagcagaatctcaaatatttctatggaggtt
ggcagaaagactgaagggtcgcaccccaagagggaggtggcatcccttttagagaagtgt
ctctgtcctcctggcacagcgggattctcatgtcaggactgcgcacctggttaccacaga
gggaagcttccagaaggtgatggcaggagaccgcgccctctgctggctccctgtgtacct
tgcaattgcagcaaacacagtgacacctgtgaccctgaaacggggaagtgtctgaactgc
agccacaacaccgccggagaccactgccacgtgtgtgcccccgggtactacgggaaggtg
accggctcggccagcgactgctctctctgcacctgtcctcacggcctccctgccagtttt
agtcccacttgtgtcttggaaggtgatcacgatttccgctgtgatgcctgctttctgggc
tacgaaggacaatattgtgaaaggtgctccttgggctaccatggggaccctcagatgccg
ggcggcacctgccagaggtgtgactgcagcccgcacggctctgtccacagtgactgtgac
cgcgggtccgggcagtgcgtctgcaggccgggggccgcggggctccgctgtgagggctgt
gaaccaaggcaccttctggtggagagcgactgtgtgtcctgtgacgatgaatgtgtaggc
gtgctgctgaatgacatggaccatgtgggtgatgccatcctctctgtgaacctctccggc
atcatccctgtcccgtatgggattttgtcaaacctggaaaatacaacgaaatatctccgg
gactctttattaaaagaaaatatacagaaagaactggcaaagattcagcttgaaggtgtc
tcagaacaaacagaggacctgcaaaggaagctggacagggtgttaacacggagccagcac
gtgaccagggccaccgaaagaatcctcgacaagagtcgcgacctcctgacgtttactgag
aagctgcaggcagacgtccaagaaatcatcaaaagggcagcaactctaaatcagaccttg
gatgaagatgtccagttacccagttctactcttcagaatatgcaaaagaacattacctca
ttgctggaaatcattcagaaaaggaatttcctgcagttgcaccaaaatgccacccttgaa
ctcaaggctgctgaagatttattgtcacaaattcagaaaaattaccagaaatcacaggaa
gagttggaggtattaaaagaagcagcaagcagcctcctttcaagacacaacagtaaagtg
caggccgcagaagagctcctgagggaagcagagaggaagacccaggaaagcggtcgcctg
ctacccgttgtcagggccaacctgcgagagttcaacgataaaaagctgcgtgttcaagaa
gaacaaaacttgacgtcagtgctaatcgccgaaggaagaggactgcttgatgctgccact
gcacctgcaaatgcttcaggaaaggctctagcacatttagagcatcaccgggatgagctg
cttctatggactgccaaaatcaggcaccacgtagatgacctggtcatgcagatgtcccag
aggagagcattggaccttgtctacagagcggaggatcatgccactgagctccagagactg
gcaggtgctctggacagtggccttggaaatgttagtcatgtgtccctaaatgcgaccagt
gcaacccatgtccattccaacatccggagcctgattgaagaatcagagaagttggtgaaa
gatgctctcaggactgggagcaaggcgagcacggcctcagaatctcttgtttctaacggg
aaagtggctctccagcgcagtgctgaatttctaaaagaaggcaatagcctgagcagaaag
catggaggtatcgcattggaactgagtgaattgaaaaatactgcagagagatttcaagag
aatgctgataaaattactaagcagaccaatgaatcactcttgatacttagagcaattcct
gaaggggtcagggacaaaggcgccaaaatcaaagagctggccacatctgcaaatcggagc
tctatgagcactctaaagaacatcgtgggattgggccagaagctgttgaatacgtccact
gacctgtccagggttaacgccacgttacgagaaacagacaaactcctccgtgactcctca
gtgaccgctttgttagctggaagaaaagtcaaagatgtggaaacacaagcgaacctttta
tttgatcggctgaagcctttgaagatgttagaagagaacctgagcagaaacctatcagaa
atcaaactgctgatcagccaggcccggaagcaagcagcgtctatcaaagttgctgtgtct
gcagaccgagactgtatccgggcctaccagcctcagatttcttctaccaactataacagc
ttaacactaaatgtgaagacaagcgaacctgacaaccttctcttctacctggggagtagc
agtgctgatttcctggcagtggagatgcggcgagggaaagtggccttcctctgggatttg
ggctccgggtccacacgtttggaatttccagacttcccgattgatgacagcaaatggcac
agtatctatgtaaccaggtttggaaacattggttcgttgagtgtaaaggaaatgagcgca
tctcaaaagccaccaccaaaaacaagtaaatcccctggaacagctaatgttctggatgta
aacaattcaacactaatgtttgttggagggcttggaggacagatcaagaagtctcctgct
gtgaaggttactcattttaaaggctgcatgggagaggccttcttgaatggacagtcgatt
ggcctatggaactatattgaaagggaggggaagtgccatggctgctttggaagcccccaa
aatgaagatgcttccttccattttgatgggagtgggtattctgtcgtggagaagatgctc
cgggctactgtgacgcagataattatgctttttagtaccttttcacctaatgggctgctt
ctctacctggcttcaaatggcaccaaagactttttatccatcgacctggtcgacggcaga
gttagagttacagttgacctgggttctgggcctcttgctcttattaccgacagacgctat
aataacggaacctggtacaaaattgccttccagcgaaacagaaagcaaggactcctagca
gttattgatgcatataacaccagctacaaagaaaccaagcaaggggaaactccaggagca
tcttctgacctcaatcgtctagataaggatccaatttatgtgggtggattacctaggtca
agaattgtaaggaaaggtgtcaccagcaaaagctacgtgggctgtatcaaaaacctggaa
atatctagatcgacctttgatttactcagaaactcctacggagtgagaaaaggctgtata
ctggagcccgtccgaagcgttagcttcttgagaggcggctacgttgaattgccacccaag
tccttgtcaccagattcggaattgttggcaacatttgccaccaggaacagcagcggcatc
atcctggctgccctgggccagcatggggagaagcaaggtcaccggcaggcccacgggccc
ttcttttccatcatgctaattgaaggccacattgaagtgcatgttaatcctggggatgga
accagcctgagaagagctctcctgcatgctcccacgggcacatatggcgacggtcaagag
cattccatctccttgataaggagcgggagaattatcactgtccaactggatgagacagct
cctgtggaaatgaagctgggcccatcagcagaaagcaggacaataagtgtgtccaacctg
tacataggtggtgttccagagggcgaggggacacccatgctcaagatgagaagctcattc
catggctgtatccaaaacctgatctttaacatggaacttttggatctcactagtgccact
ggctatgaacacgtggacttggactcctgcttgctctcagaacggccaaagctggctctc
cacggagaggacggcgaactcccgccagagccccaggctttaccgagtccggaattgtgt
gccgtggacagagccccggagtacgtccccaacgcccgccagttcggcctcgcagaaggc
agccacttggtgctgccctttaatcagctggctgtccgaaagaggctcttggtccagcta
agtatccgaacattcgcctccagtggcctggtttactacatggctcaccagaaccaggtt
gactacgccacgctccagctgcacgggggccgcctccacttcatgttcgatcttgggaag
ggcaggacaaaagtctcgcaccctgcactgctcagcgacggccagtggcacacggtcaag
acagagtactttaaaaggaagggcttcatgacggttgacggccaggaatcgcccatggtg
accacggggggagatgctactacgttggatgtggaagggaagttgtacctaggaggcctt
ccctccgagtacagggccaggaacattggaaatatcacccacagcatccccgcttgcatt
ggggaggtgacagtgaacagcaaacagctggacaaggacagcccagcgtctgcatttgca
gtaaacaggtgctatgcagcggcccaggaaggaactttctttgaaggaagtggatatgca
gcccttgtcaaagaaggctacagagtacgatcagatgtgaacattacgctggagtttcgt
acctcctcagagaatggtgtcctcctgggcatcagcagtgccaaagtggatgccatcgga
ttagagattgtaaatggcaagctcttgtttcatgttaacaatggtgctggtaggataaca
gccacatacgaacccaaagctcccaatactctctgtgatggaaaatggcacacacttcag
gcaaacaaaagcaaacatcgcgtggttctgattgtcgatgggaatgcagctcgagctgaa
agtctacacacccagtctacctctgctgacaccaacaatcccatttatgttggtggctat
cctgctgacgtaaagcaaaactgcctgagcagccagacctccttccggggctgtttgaga
aaactcactctaattaagggcccacaggtgcaatcctatgacttcagcacagcttttgac
ctacaaggagttttccctcattcctgtcctgggactgagtcctga

KEGG   Physeter catodon (sperm whale): 102993220
Entry
102993220         CDS       T06011                                 

Gene name
LAMB3
Definition
(RefSeq) laminin subunit beta-3
  KO
K06244  laminin, beta 3
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102993220 (LAMB3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102993220 (LAMB3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102993220 (LAMB3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102993220 (LAMB3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102993220 (LAMB3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102993220 (LAMB3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102993220 (LAMB3)
   05145 Toxoplasmosis
    102993220 (LAMB3)
SSDB
Motif
Pfam: Laminin_N Laminin_EGF Laminin_II BNR_6
Other DBs
NCBI-GeneID: 102993220
NCBI-ProteinID: XP_028341954
UniProt: A0A455B0D8
LinkDB
Position
Unknown
AA seq 1172 aa
MRPLLLLYFVLPSVLCAQQACSRGACYPPVGDLLIGRTRFLRASSTCGLAKPETYCTQYG
EWQMKCCKCDSRLPHNYNGHRVENVVSSSGPMRWWQSQNDVSPVSLQLDLDRKFQLQDLM
MDFKGPTPAGMLIERSSDFGKTWQVYQYLAADCTSAFPRVRQGQPQSWQDARCQPLPQRP
NGRLDGAKVQLNVMDLASGIPATQSQKIQELGKITNLRINFTRLAPVPQRGYHPPSAYYA
VSQLRLQGSCFCHGHADRCAPNSRAPAGPSTTVQVHEVCVCQHNTAGPNCEHCAPFYNNR
PWKPADDQDPHECQRCDCNGHSETCHFDPAVFAASQGAHGGVCDNCQHHTEGKNCERCQL
HYFRNRRPGAPVQETCIPCECDPDGAVPGAPCDPVTGQCVCKEQVQGERCDLCKPGFTGL
TYANPQGCHRCDCSVLGARRDMPCDEESGRCLCLPHVVGPKCDQCARYHWKLASGRGCEP
CACDPHNSLSPQCNEFTGQCPCREGFGGLTCSTAAVRQCPDRTYGDAAMGCRACDCDFRG
TEGPGCDKSSGRCLCRPGSTGPRCDRCQRGYCDRYPVCVACHPCFQTYDAGLRGRALRLS
SLRNATASLWPGPGLEDRGLASRMLDTKNKLEQIQAILGGASVTEQEAAQVANAIVSIRQ
TLQGLQVDLPLEEETLSLPGDLENLDRSFNRLLVTYQSKREQFEKISSADPSGAFRMLTA
AHQRSSQAAQQVADSSRLLPQLRDSRREVERLEQQLGGAGGAGGPQLAALRLQMASLPDL
TPTINKLCGGPREMACTPGACPGELCPRDNGTACGSHCRGALPRAGGAFRTAGQVAEQLR
GFNAQLQQTRQMIRAAEEATLQVQSDAQRLEIQVSTSRSQMEEDVTRTRLLIQQVRDFLS
DPDTDIATIQEVSEAVLALWLPTDSATVLRKMNEIQDIAARLPSVDLVLSQTKQDIARAR
RLQAEAEQARSQAHTVEGQVEDVLGNLRQGTLALQEARDTMQGTRRSLQLIQDRVAEVQQ
VLGPAERLVTGLMEQLGGFRARMEELGRRARQQRVQAAQAQQLAEEASKRALSAQEGFER
VKQKYAELKDRLGRSPMLGEQGSRILSIKMEAEELFGETMEMMDRMKDVESELLRGSQAI
MLRSADLTGLEKHVEQIRNHISGRALYYATCK
NT seq 3519 nt   +upstreamnt  +downstreamnt
atgaggccgctccttctcctgtattttgtcttgcccagtgtcctgtgtgcccagcaagcc
tgctcccgtggggcctgctatccacccgttggggacctgctcattgggaggacccggttt
ctccgagcttcatctacctgtggcctggccaagcctgagacctactgcacccagtatggt
gagtggcagatgaaatgctgcaagtgtgactccaggttgcctcacaattacaacggtcac
cgagtggagaatgtggtctcatcctcgggccccatgcgctggtggcagtcacagaacgat
gtgagccctgtctctctgcagctggacctagacaggaaattccaacttcaagacctcatg
atggattttaaggggcccacgcctgctggcatgctgatcgagcgctcctcggacttcggc
aagacctggcaggtgtaccaatacctagctgccgactgcacgtctgccttcccccgggtc
cgccagggccagcctcagagctggcaggacgctcggtgccagcctctgccccagaggcct
aacgggcgcctggatggggccaaggtccaacttaacgttatggatttagcctctgggatc
ccagcaactcaaagtcaaaaaattcaagagctggggaagatcacaaacttgagaatcaac
ttcaccaggctggcccctgtgccccagaggggctatcaccctcccagcgcctactatgct
gtttcccagctgcgtctgcaggggagctgtttctgtcacggccatgctgaccgctgtgcc
cccaattccagagcccctgccggcccctccaccactgtgcaggtccatgaggtctgcgtc
tgccagcacaacaccgccggccccaactgtgaacactgtgcacccttctacaacaacagg
ccctggaaacccgcagatgaccaggacccccatgaatgccaaaggtgtgactgcaacgga
cactcagagacgtgtcacttcgacccagccgtgtttgccgccagccagggggcacatgga
ggtgtgtgtgacaactgccagcaccacactgagggcaagaactgtgagcggtgtcagttg
cactatttccggaaccggcgtcccggtgctcccgttcaggagacgtgcatcccctgcgag
tgtgatccagacggggcagtgccaggagctccctgtgacccagtgaccgggcagtgtgtg
tgcaaggagcaagtgcagggggaacgctgtgacctgtgcaagccgggatttacaggactc
acctatgccaacccgcagggctgccaccgctgtgactgcagcgtcctgggtgcccggcgg
gacatgccgtgtgatgaggagagtggacgctgcctgtgtctgccccacgtggtgggcccc
aaatgtgaccagtgcgctcgctaccactggaagctggccagcggccggggctgcgagcca
tgtgcctgcgacccgcacaactccctcagcccccagtgcaacgagttcacagggcagtgc
ccctgtcgggaagggtttggtggcctgacctgcagcactgcagccgtccgccagtgtccc
gacaggacctacggagatgcagctatgggatgccgagcctgtgactgtgacttccgggga
acggagggaccaggctgcgacaagtcctcgggccgctgtctctgccgccccggctcgacc
gggccacgctgcgaccggtgccagcgaggctactgtgaccgctacccggtgtgtgtggcc
tgccacccctgcttccagacttacgacgccggcctccgggggcgggccctccgcctcagc
agcctccgcaatgccaccgccagcctgtggcccgggccgggcctggaggaccgcggcctg
gcatctcggatgctggacacaaagaacaagctggagcagatccaagccatcctcggcggt
gcctcggtcacggagcaggaggcggcccaggtggccaatgccatcgtctccatcagacag
actctccagggcctgcaggtggatctgcccctagaggaggagaccttgtccctcccggga
gacctggagaatctggacagaagcttcaatcgcctccttgttacgtatcagagcaagagg
gagcagtttgaaaaaataagcagtgccgatccttcaggagccttccggatgctgaccgcg
gcccaccagcggtcatcccaggctgctcagcaggtcgcggacagctcccgcctgctgccc
cagctcagggacagccggagagaggtggagaggctggagcagcagctgggaggggcagga
ggcgccggtggcccccagctcgcggccctgaggctgcagatggcttccttgcctgatctg
acacccaccatcaacaagctctgtgggggccccagggagatggcttgcaccccgggagca
tgccctggggagctgtgtccccgagacaacggcacggcctgtggctcccactgcaggggt
gctctccccagggcaggtggggccttccggacggcagggcaggtggccgagcagctgcgg
ggcttcaacgcccagctccagcagaccaggcagatgatcagggcagccgaggaagccacc
ttgcaggtgcagtcagatgctcagcgtctggagatacaggtgagcaccagccgctcccag
atggaggaagacgtcacacgcacgcgactcctcatccagcaggtccgggacttcctctcg
gaccctgacactgacatagccaccatccaggaggtcagtgaggccgtgctggccctgtgg
ctgcccacagattctgccacagtcctgcggaagatgaatgagatccaggacatcgcagcc
aggctccccagcgtggacctggtgctttctcagaccaagcaggacatcgctcgggctcgc
aggctccaggctgaggctgagcaggccaggagtcaagcccacacagtggagggccaggtg
gaggatgtgctggggaacctgcgacagggcacgctggcactgcaggaggcccgggacacc
atgcaaggcacccgccgctcccttcagcttatccaagacagggttgctgaggttcagcag
gtgctggggccagcggaaagactagtgaccggcctgatggagcagctgggtggcttccga
gcacggatggaggagctcggccgccgggccaggcagcagcgggtgcaggcagcccaggcc
cagcagctggcagaggaggccagcaagcgggcactgagtgcccaggagggatttgagaga
gtaaagcaaaaatatgctgagttgaaggaccggttgggtcggagccccatgctgggagag
cagggcagccggatcctgagcatcaagatggaggcggaggaactgtttggggaaaccatg
gagatgatggacaggatgaaagacgtggagtcggagctgcttcgggggagccaggccatc
atgcttcgctcggcggacctgacggggctggagaagcatgtggagcagatccggaaccac
atcagtgggcgcgcgctctactacgccacctgcaagtga

KEGG   Physeter catodon (sperm whale): 102994673
Entry
102994673         CDS       T06011                                 

Gene name
IBSP
Definition
(RefSeq) bone sialoprotein 2
  KO
K06253  integrin binding sialoprotein
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05165  Human papillomavirus infection
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102994673 (IBSP)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102994673 (IBSP)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102994673 (IBSP)
 09160 Human Diseases
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102994673 (IBSP)
SSDB
Motif
Pfam: BSP_II Autophagy_act_C SAPS Presenilin Mpp10 Nop25
Other DBs
NCBI-GeneID: 102994673
NCBI-ProteinID: XP_007122818
UniProt: A0A2Y9FH62
LinkDB
Position
7
AA seq 317 aa
MKTAFILLSILGMACALSMKNLHRRAKLEDSEENGVFKYRPRYYLYKHAYFYPPLKRFPV
QSSSDLSEENGNGDSSEEEEEEETSNEEQNNEENEDSDGNEDEESEAENTTLSITTLGYG
EETTPGTGNIGLAAIQLPTKAGDIGKKATKEEESDEEEEEENENEAEMDDNEQGINGTST
NSTEIDIGHGSSGGDNGEEEGEEGVTEANAEGTTVAGGQDNGGSKAAASPNGGYEPTTPP
QEIYGTTPPPLGETTTPGYEGEYEQTGTSEYDNGYEIYENENGEPRGDNYRAYEDEYSYY
KGRSYDGYDGQDYYYHQ
NT seq 954 nt   +upstreamnt  +downstreamnt
atgaagacggcttttattttgctcagcattttgggaatggcctgtgctctctcaatgaaa
aatttgcatcgaagagccaaattagaggattctgaagaaaatggggtctttaagtacagg
ccccgatattatctttacaagcatgcctacttttatcctcctctaaaacgatttccggtt
cagagcagtagtgacttgtctgaagaaaatggaaatggtgatagctcagaagaggaggaa
gaagaggagacttcaaatgaagaacaaaacaatgaagagaatgaagattctgatggaaat
gaagacgaagagtcggaggctgagaacaccaccctttccattaccacacttggttatgga
gaggagaccacacctggaacagggaatataggtctagctgccatccagcttcccacgaag
gctggggatataggaaagaaggctacaaaagaggaggaaagtgatgaagaagaagaggaa
gaaaatgaaaatgaagcagaaatggatgataatgagcaaggcataaatggcactagcacc
aacagcacagagatagacattggccatggcagcagtggtggggacaatggagaggaggaa
ggcgaagaaggtgtcactgaagccaatgcagaaggaaccacagtggctggagggcaggac
aatggtggctctaaggcagcggcctctccgaatggtggatatgaacctacaactccaccc
caggagatctacgggactaccccgccaccattgggggaaactaccacccctggatatgag
ggggagtatgaacaaacaggcaccagtgagtacgacaatggatatgaaatctacgaaaat
gaaaatggagaacctcgtggggacaattaccgagcctatgaggatgagtacagctactat
aaagggcggagctacgacggctatgacggtcaagattactactaccaccagtga

KEGG   Physeter catodon (sperm whale): 102994805
Entry
102994805         CDS       T06011                                 

Gene name
TNC
Definition
(RefSeq) tenascin isoform X1
  KO
K06252  tenascin
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05165  Human papillomavirus infection
pcad05206  MicroRNAs in cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102994805 (TNC)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102994805 (TNC)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102994805 (TNC)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    102994805 (TNC)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102994805 (TNC)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102994805 (TNC)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102994805 (TNC)
SSDB
Motif
Pfam: fn3 Fibrinogen_C EGF_2 EGF_Tenascin Pur_ac_phosph_N fn3_2 DUF4998 EGF
Other DBs
NCBI-GeneID: 102994805
NCBI-ProteinID: XP_023981944
UniProt: A0A2Y9T1D7
LinkDB
Position
9
AA seq 2201 aa
MGVATRLLAGIFLSLLALPAEGGVLKKVIRHKRQSGVNVTLPEENQPVVFNHVYNIKLPV
GSQCSVDLESASREKDLVPLSEPRESFQEHTVDGENQIVFTHRINIPRRACGCVAAPDVK
ELLSRLEELENLVSSLREQCTLGAGCCFQPAEGRLDTKPFCSGRGNFSTEGCGCVCEPGW
KGPNCSEPECPGSCHLRGQCLDGQCMCDKGFTGEDCGQLACPSDCNDQGKCMNGTCVCFE
GYSGVDCSQEPCPVPCSEEHGRCVDGRCVCRDGFAGEDCNEPLCLHNCRGRGRCVENECV
CDEGFTGDDCGELICPNDCFDRGRCVNGTCYCDEGFTGEDCGQLACPHACHGRGRCDEGQ
CVCDEGFAGADCSEKRCPSDCHDRGRCLDGHCECDDGFTGADCGELRCPRDCSGHGRCVN
GQCVCDEGHTAEDCGQLRCPHDCHGRGRCVQGRCECELGFQGYDCGEMSCPHDCHQHGRC
VNGMCVCDDAYMGEDCRELRCPRDCSQRGRCVDGRCVCEDGFAGPDCADLSCPGDCYGRG
RCVDGQCVCHEGFTGKDCGQRRCPDDCHGQGRCVDGRCVCHEGFTGLDCGQRSCPNDCSN
WGQCVSGRCICNEGYTGEDCSQVSPPKDLIVTEVTEETINLAWENEMRVTEYLIVYTPTH
EDGLEMQFRVPGNQTSTTIQELEPGVEYLIRVFAILENKKSIPVSARVATYLPTPEGLKF
KSIRETSVEVEWDPVDIAFETWEVIFRNMNKEDEEEITKSLRRPETTYRQTGLAPGQEYE
ISLHIVKNNTRGPGLKRVTTTRLDAPSQIEVKDVTDTTALITWSKPLAEIDSIELTYGIK
DVPGDRTSIDLTHEENQYSIGNLKPDTEYEVSLISHRADMSSNPARETFTTGLDAPRNLR
RISQTDNSITLEWRNVKAAADSYRIKYAPISGGDHAEVEVPRRQQTTTRTTLTGLRPGTE
YGVGVSAVKGDKESDPATINAATDLDAPKDLRVSELTETSVTLIWQTPLAKYDRYRLNYS
LPSGQPVEVQLPRGTTSYVLKGLEPGQAYTILLTAEKGRHKSKPARVQASTDHAPEMENL
TVTKSGWDGLKLNWTTADQAYEHFVIQVQEANRVEAAQNLTVPGSLRAVDIQGLKAATPY
RVTIYGVIRGYRTPVLSAEASTGEAPYLGEVTVSEVGWDALKLHWTAPEGAYEQFLIQVQ
ETDTEEAAQNLTVPGGLRSVDLPGLKAATHYSVTIRGVTRDFSTAPLSVEVLTEEVPDLG
NLTVTEVSWDALRLDWTSPDGIYEQFVIEIREADQAQEVHRLTVPGSLCSAEIPGLRAGT
PYTVTLRGEVRGRSTQPLAVEVLTAELPQLGHLAVTEAGWDGLKLNWTTDDQAYEHFVIQ
VQEANRVEAAQNLTVPGSLRAVDIQGLKAATPYRVTIYGVIRGYRTPVLSAEASTAGEPE
IGNLNVSDVTPESFNLSWTATDGAFKTFTIEIIDSNRFLETMEYNVSGAERTAHISGLRP
NNDFIIYLSGLGPSIRIKTISATATTEALPLLENLTISDINPYGFTVSWMASENAFDSFL
VTVVDSGKLLDPQEFTLSGTQRKLELRGLITGIGYEVTVSGFTQGHQTKPLRAEIVTEAE
PEVDNLLVSDATPDGFRLSWTADEGVFDSFVLKIRDTKKQSEPLEITLLDPERTRDITGL
REATEYEIELYGISSGRRSQPVSAIATTAMGSPKEIVFSDITENSATVSWKAPTTQVESF
RVTSVPTAGGTPSVVTVDGAETQTRLLRLLPGVEYLVSVIALKGFEESKPVSGMLTTALD
GPSGLVTANITDSEALAMWQPAIAPVDNYVISYTGGREPEITRTVSGNTVEYALTNLEPA
TEYTLRISAEKGPQKSSTITTKFTTDLDSPRDFTATEVQSETAVLTWRPPRASVTGYLLV
YESVDGTTKEVVVDPDTTSYSLAGLSPSTHYMAKIQALNGPLRSKIIRTTFTTTGVLYPF
PRDCSQAMLNGDTTSGVYPIYLNNDKTQKQEVFCDMTSDGGGWVVFLRRKNGREDFYRNW
KAYAAGFGDLKEEFWLGLDTLNKITAQGQYELRVDLRDHGQSAYAVYDKFSVGDARTRYR
LKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAITNCALSYKGAFWYKNCHRVNLMGRYGD
NNHSQGVNWFHWKGHEYSIQFAEMKLRPSSFRNLEGRRKRA
NT seq 6606 nt   +upstreamnt  +downstreamnt
atgggggtcgcgactcggctgttggcaggcatcttcctatctctgcttgcccttccggcc
gaaggtggcgtcctcaagaaagtcatccggcacaagcgacagagtggggtgaatgtcacc
ctgccagaggagaaccagccggtggtgtttaatcatgtctacaacatcaagctgcctgtg
gggtcccagtgctcggtggatctggaatcagccagcagggagaaagacttggtcccactg
tcagagcccagggaaagcttccaggagcacacggtagacggggaaaaccagatcgtcttc
acacaccgcatcaacatcccccgccgggcctgtggttgtgtcgccgctcctgatgtcaag
gagcttctgagcagactggaggagctagagaacctggtgtcttccctgcgggagcagtgc
accttgggagcaggctgctgcttccagcctgccgaaggccgcctggacaccaagcccttc
tgcagcggccggggcaacttcagcacggaaggctgtggctgtgtgtgtgaaccgggctgg
aaaggccccaactgctctgagcctgaatgtccaggcagctgtcacctgcgaggccagtgc
ctggacgggcagtgcatgtgtgacaagggcttcacaggggaggactgtggccagctggcc
tgtcccagcgactgcaatgaccagggcaagtgcatgaacgggacctgcgtctgcttcgaa
ggctactcgggggtggactgcagccaggagccctgcccggtgccctgcagcgaggagcac
ggccgctgcgtggacggccgctgtgtgtgccgggacggcttcgccggcgaggactgcaac
gagcccctgtgtctgcacaactgccgcggccgcgggcgctgcgtggagaacgagtgcgtg
tgcgatgagggcttcacgggcgacgactgcggcgagctcatctgccccaacgactgcttc
gaccgcggccgctgcgtcaacggcacctgctactgcgacgagggcttcacgggtgaggac
tgcggccagctcgcctgcccccacgcctgccacggccgcggccgctgcgacgagggccag
tgcgtgtgcgacgagggcttcgcgggtgcggactgcagcgagaagcgctgtccctctgac
tgccacgaccgcggccgctgcctggacgggcactgcgagtgcgatgacggcttcacgggc
gcggactgcggcgagctccggtgcccccgcgactgcagcggccacggccgctgcgtcaac
gggcagtgcgtgtgcgacgagggtcacaccgcggaggactgcgggcagctgcggtgcccc
catgactgtcacggccgcgggcgctgcgtgcaggggcggtgcgagtgcgaactcggcttc
cagggctatgactgtggcgagatgagctgcccccatgactgtcaccagcacggccgctgc
gtgaacggcatgtgtgtctgtgacgacgcctacatgggcgaggactgccgggagctgcgc
tgcccccgggactgcagccagcggggccgctgcgtggacgggcggtgcgtgtgcgaggac
ggcttcgccggccccgactgcgcagacctctcctgtcccggcgactgctacggccggggc
cgctgcgtggacggccagtgcgtgtgccacgagggcttcacgggcaaggactgcgggcag
cgcaggtgtcccgacgactgtcacggccagggccgctgcgtggacggccggtgcgtgtgc
cacgagggcttcactgggctggactgcgggcagcgctcctgccccaacgactgcagcaac
tgggggcagtgcgtctccggccgctgcatctgcaatgagggctacaccggagaagactgc
tcccaagtgtcccctcccaaagacctcatcgtgacagaagtgacagaagagaccataaac
ctggcctgggaaaatgagatgcgggtcacagagtacctcatcgtgtacacgcccactcac
gaagacggcctagaaatgcagttccgcgtccctggaaaccagacgtccaccaccatccag
gagttggagcctggcgtggagtacctcatccgcgtgttcgccatcctggagaacaagaag
agcattcctgtcagcgccagggtggccacttacctgcctacacctgaaggtctaaaattc
aagtccatcagggagacatctgtggaagtggagtgggatcctgtggacatcgcttttgaa
acgtgggaggtcatcttccggaatatgaataaagaagatgaggaagagatcaccaaaagc
ctgcggaggccagagaccacgtaccggcaaactggcctagctccggggcaagagtatgag
atatctctgcacatcgtgaaaaacaatacccggggcccaggcctgaagagggtgacaacg
acccgcttggacgcccccagccagatcgaagtgaaagacgttacggacaccaccgctctg
atcacctggtccaagcccctggccgagatcgacagcattgagctcacatacgggatcaaa
gatgtgccaggcgaccgaaccagcatcgacctcacacacgaggagaaccagtactccatc
gggaacctgaagccagacaccgagtacgaggtttccctcatctcccacagggccgacatg
tctagcaaccccgccagagagaccttcacaacaggcctggatgcgcccagaaatctccgc
cgcatctcccagacagacaacagcatcaccctggagtggaggaacgtcaaggcggccgcc
gacagttacagaattaagtacgcacccatctctggaggtgaccatgctgaggtagaagtc
cccaggaggcaacaaaccacgaccagaactacactcacaggtctgaggccaggaactgaa
tatggcgttggagtgtctgccgtgaagggggacaaggagagcgacccagccaccatcaat
gcggccacagacctggacgcgcccaaggacctacgggtttctgaacttacggagaccagc
gtgaccctgatctggcagacgccgctggccaagtatgaccgttaccgcttaaactacagt
cttccctcgggccagccggtggaggtgcagctccccagaggcaccacctcctacgtcctg
aaaggcctggaacctgggcaggcatacaccatcctcctcacggcagagaagggcaggcac
aagagcaagccggcacgggtacaggcatccacagaccacgcccctgaaatggaaaacctc
accgtgaccaaatctggctgggatggcctcaaactcaactggaccacagctgaccaggcc
tatgagcactttgtcattcaggtgcaggaggccaacagggtggaggcagctcagaacctc
acggtgcccggcagcctgcgggctgtggacatccagggcctcaaggccgccaccccttat
agagtcaccatctacggggtgatccggggctatagaacaccagtgctctctgctgaggcc
tccacaggagaagctccctatttgggagaggtcacggtgtccgaggtgggctgggatgcc
ctcaagctccactggacggctccggaaggagcctacgagcagtttctcattcaggtgcag
gaaactgacacagaggaggcagcccaaaacctcactgtcccgggaggactgaggtccgtg
gacctgcctgggctcaaggcagccactcactatagcgtcaccatccgcggggtgactcgg
gacttcagcacagcccctctctctgttgaagtcttgacagaggaggttccagatctgggg
aacctcacggtgactgaggttagctgggatgccctcagactggactggaccagcccagat
gggatctatgagcagtttgtcattgagatccgggaggctgaccaggcccaagaagttcac
cgtctcacggttcctggcagcctgtgctccgcggaaatcccaggcctcagggctggtacc
ccttatacagtcaccctgcgcggcgaggtcaggggccgcagcactcagccccttgctgtg
gaggtcctcacagcggagctcccccagctgggacacttagccgtgaccgaggccggctgg
gatggcctcaaactcaactggaccacagatgaccaggcctatgagcactttgtcattcag
gtgcaggaggccaacagggtggaggcagctcagaacctcacggtgcccggcagcctgcgg
gctgtggacatccagggcctcaaggccgccaccccttatagagtcaccatctacggggtg
atccggggctatagaacaccagtgctgtctgctgaggcctccacagccggagaacctgaa
atcggaaacttaaatgtttccgacgtcactcctgagagcttcaatctctcctggacagct
accgatggggccttcaagacctttaccattgaaattattgattccaataggttcctggaa
acgatggaatataacgtctctggtgctgaacgaactgctcacatctcagggctccgccct
aataatgattttattatctacctctctggactcggtcccagcatccggatcaaaaccatc
agtgccacggccaccacagaggccctgccccttctggaaaacctaaccatttccgacatt
aatccctacgggttcacagtttcctggatggcatcggagaatgcctttgacagcttccta
gtaacggtggtggattctgggaagctgctggacccccaggaattcacactttcaggaacc
cagaggaagctggagcttagaggcctcataactggcattggctatgaggtcacggtctct
ggcttcacccaagggcaccaaaccaagcccttgagggctgagattgttacagaagccgag
ccagaagttgacaaccttctggtttcagatgccaccccagacggtttccgtctgtcctgg
acagctgatgagggggtcttcgacagctttgttctcaaaatcagagataccaaaaagcag
tctgaaccactggaaataaccctgcttgaccccgaacgtaccagggacataacaggtctc
agagaggccactgaatacgaaattgaactctatggaataagcagtggaaggcgatcccag
ccagtcagtgctatagccacaacagccatgggctccccgaaggaaatcgttttctcagac
atcactgaaaactcagccactgtcagctggaaggcacccactacccaggtggagagcttc
cgggttacctctgtacccactgcaggaggtacaccctcagtggtaactgtggatggggcc
gagactcagaccaggctgctgagactcttacctggagtggagtacctcgtcagcgtcatc
gccctgaagggctttgaagaaagcaagcctgtctcggggatgctcaccacagctctggat
ggcccatctggcctggtgacagccaacatcaccgactcagaagccttggccatgtggcag
ccggccatcgcccccgtggacaattatgtcatctcctacacaggggggagagagccagaa
attacccgcacggtgtccgggaacacggtggagtatgcgctgaccaacctcgagcctgcc
acggaatacacgctgaggatctcggcggagaaagggccccagaagagctcaactatcacc
accaagttcacgacagacctcgattctccaagagacttcactgctactgaggttcagtcg
gaaactgccgttctcacgtggagacctccccgggcatctgtcactggttacctattggtg
tatgaatccgtggacggtacaaccaaggaggttgttgtggatccagacaccacctcttat
agcctggcagggctgagcccatccacccactacatggccaagatccaggcactgaatggg
cccctgagaagcaagatcatcaggaccaccttcaccacaactggagtcctgtacccattc
cccagggactgctcccaagccatgctgaatggagacacgacctctggcgtctaccccatt
tatctgaacaacgacaagacccagaagcaggaagtcttctgtgacatgacctctgacggg
ggtggatgggtcgtgttcctgagacgcaaaaatggacgtgaggatttctatcgaaactgg
aaggcctacgctgctggatttggggacctaaaagaagaattctggcttgggctggacacc
ctgaacaaaatcacagcccaaggtcagtacgagctccgggtggacctgcgggaccacggg
cagtcagcctacgccgtctacgacaagttcagcgtgggagacgcccggactcgctaccgg
ctgaaggtggaggggtacagcgggacagcaggtgactccatggcctaccacaacggcaga
tccttctccacctttgacaaggacacagactcggccatcaccaactgtgccctgtcctac
aagggggctttctggtacaagaactgccaccgcgtcaacctgatggggagatacggggac
aataaccacagtcagggtgttaactggttccactggaagggccacgagtattcaatccag
tttgctgagatgaagctgagacccagcagcttccgaaacctcgaaggcaggcgcaagcgg
gcataa

KEGG   Physeter catodon (sperm whale): 102995420
Entry
102995420         CDS       T06011                                 

Gene name
THBS1
Definition
(RefSeq) thrombospondin-1
  KO
K16857  thrombospondin 1
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04015  Rap1 signaling pathway
pcad04115  p53 signaling pathway
pcad04145  Phagosome
pcad04151  PI3K-Akt signaling pathway
pcad04350  TGF-beta signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05144  Malaria
pcad05165  Human papillomavirus infection
pcad05205  Proteoglycans in cancer
pcad05206  MicroRNAs in cancer
pcad05219  Bladder cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04015 Rap1 signaling pathway
    102995420 (THBS1)
   04350 TGF-beta signaling pathway
    102995420 (THBS1)
   04151 PI3K-Akt signaling pathway
    102995420 (THBS1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102995420 (THBS1)
 09140 Cellular Processes
  09141 Transport and catabolism
   04145 Phagosome
    102995420 (THBS1)
  09143 Cell growth and death
   04115 p53 signaling pathway
    102995420 (THBS1)
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102995420 (THBS1)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    102995420 (THBS1)
   05205 Proteoglycans in cancer
    102995420 (THBS1)
  09162 Cancer: specific types
   05219 Bladder cancer
    102995420 (THBS1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102995420 (THBS1)
  09174 Infectious disease: parasitic
   05144 Malaria
    102995420 (THBS1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:pcad04131]
    102995420 (THBS1)
  09183 Protein families: signaling and cellular processes
   00536 Glycosaminoglycan binding proteins [BR:pcad00536]
    102995420 (THBS1)
   04990 Domain-containing proteins not elsewhere classified [BR:pcad04990]
    102995420 (THBS1)
Membrane trafficking [BR:pcad04131]
 Endocytosis
  Phagocytosis
   Opsonins
    102995420 (THBS1)
Glycosaminoglycan binding proteins [BR:pcad00536]
 Heparan sulfate / Heparin
  Extracellular matrix molecules
   102995420 (THBS1)
Domain-containing proteins not elsewhere classified [BR:pcad04990]
 Thrombospondin domain-containing proteins
  Thrombospondins
   102995420 (THBS1)
SSDB
Motif
Pfam: TSP_C TSP_3 TSP_1 TSP1_spondin VWC EGF_3 cEGF TSP1_ADAMTS EGF_CA Laminin_G_3 Laminin_G_2 EGF TSP1_CCN Laminin_G_1
Other DBs
NCBI-GeneID: 102995420
NCBI-ProteinID: XP_007103269
UniProt: A0A2Y9EJH2
LinkDB
Position
11
AA seq 1170 aa
MGLAWGLSVLLLWHVCGSNRIPESGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFR
IEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLAVERKDHSGQVFSVV
SNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELD
VPIQSIFTRDLASAARLRIAKGGVNDNFQGVLQNVRFVFGTTPEDILRNKGCSSSTSVLL
TLDNNVVNGSSPAIRTNYIGHKTKDLQAMCGISCDELSNMVLELRGLRTIVTTLQDSIRK
VTEENKELVNELRRPPLCYHNGVQYRNSEEWTVDSCTECRCQNSVTICKKVSCPIMPCSN
ATVPDGECCPRCWPSDSADDGWSPWSEWTSCSVTCGNGIQQRGRSCDSLNNRCEGSSVQT
RTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARE
TKACQKDACPINGGWGPWSPWDICSITCGGGVQKRSRLCNNPTPQFGGKDCIGDVTENQI
CNKQDCPIDGCLSNPCFADVKCTSYPDGSWKCGACPPGYSGNGILCKDVDECKEVPDACF
NHNGEHRCENTDPGYNCLPCPPRFTGSQPFGRGVEHATTNKQVCKPRNPCTDGTHDCNKN
AKCNYLSHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNEDLVCVANATYHCKKDNCPN
LPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYN
HNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEH
NPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDN
DGIPDDRDNCRLVPNPDQKDSDGDGRGDACKDDFDQDNVPDIDDICPENVDISETDFRRF
QMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDD
DYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNAL
WHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYD
KTYAGGRLGLFVFSQEMVFFSDLKYECRDS
NT seq 3513 nt   +upstreamnt  +downstreamnt
atggggctggcctggggactcagtgtcctgctcctgtggcatgtgtgcggctccaaccgc
attccagagtctgggggagacaacagcgtgtttgacatctttgaactcacgggggctgcc
cgcaagggttctgggcgccgactggtgaagggtcctgacccttccagcccagctttccgc
atcgaggatgccaacctgatcccccctgtgcctgacgacaagttccaagacctcgtggat
gctgtgcgggcggagaaaggtttcctcctcctggcctccctgaggcaaatgaagaagacc
cggggcacgctgctggccgtggagcggaaggaccactccggccaggtcttcagcgtggtc
tccaatggcaaggcgggcaccctggacctgagcctgaccgtgcaggggaagcagcatgtg
gtgtcggtggaagaagcactcctggcgaccggccagtggaagagcatcaccctgtttgtg
caggaggacagggcccagctgtacatcgactgtgaaaagatggagaatgctgagctggac
gttcccatccaaagcatcttcaccagggacttggccagcgctgccaggctccgcatcgcc
aaaggaggtgtcaatgacaacttccagggcgtgctgcagaatgtgagatttgtctttgga
accacaccagaagacatcctcaggaacaaaggctgctccagctctaccagtgtccttctc
acccttgacaacaatgtggtgaacggttccagccctgccatccgcaccaactacattggc
cacaagacaaaggatctgcaagccatgtgcggcatctcctgtgacgagctatccaacatg
gtcctggagctcaggggcctgcgcaccatcgtgaccacgctgcaggacagcatccgcaaa
gtgactgaagagaacaaagagctggtcaatgagctgaggaggcccccgctctgctaccac
aacggggtccagtacaggaatagcgaggagtggacggtggacagctgcactgagtgtcgc
tgccagaactcagttaccatctgcaaaaaagtgtcctgcccaatcatgccctgctccaat
gccacagttcctgatggagaatgctgcccgcggtgttggcccagcgattctgcagacgat
ggctggtccccatggtctgagtggacctcttgctctgtgacgtgtggcaacggaatccag
cagcgcggccgctcctgcgacagcctcaacaacagatgcgagggctcctccgtgcagacc
cggacctgccacattcaggagtgtgacaagagatttaaacaggatggcggctggagccac
tggtccccgtggtcgtcttgttctgtaacgtgtggagacggtgtgatcacaaggatccgg
ctgtgcaactctcccagcccccagatgaatgggaagccatgtgagggcgaagctcgggag
accaaagcctgccagaaagatgcctgccccatcaatggaggctggggaccctggtcgcca
tgggacatctgttccatcacctgtggaggaggggtgcagaaacgtagtcggctctgcaac
aaccccacaccccagtttggaggcaaggactgcattggtgatgtgacagaaaaccagatc
tgcaacaagcaggactgtcccatcgatggatgcctgtccaatccctgctttgctgacgtc
aagtgtaccagctatcctgatggcagctggaagtgtggtgcttgtcccccgggctacagc
ggaaacggcatcctgtgcaaagatgttgacgagtgcaaagaagtccctgatgcctgcttc
aatcacaacggagagcacaggtgtgagaacacagaccccggctacaactgcctgccctgc
ccgccacgcttcactggctcgcagcccttcggccggggcgtggaacatgccaccaccaac
aaacaggtgtgcaagccccgcaacccctgcacagacggcacacacgactgcaacaagaat
gccaagtgcaactacctgagccactacagtgaccccatgtaccgctgcgagtgcaagccc
ggctacgccggcaacggcatcatctgtggggaggacacagacctggacggctggcccaac
gaggacctggtgtgcgtggccaatgcaacttaccactgcaaaaaggataattgccccaac
cttcccaactcagggcaggaagactatgacaaggacggaatcggcgatgcctgtgacgat
gatgatgacaatgataaaattccagatgacagggacaactgtccgttccattacaaccca
gcccagtacgactatgacagagacgacgtgggagaccgctgtgacaactgcccctacaac
cacaacccagaccaggctgacacggacaacaatggggaaggagacgcctgcgcggcagac
attgatggggacggaatcctcaatgaacgggacaactgccaatatgtctacaacgtggac
cagagagacactgacatggatggggttggagatcagtgtgacaactgccccctggaacac
aatccagatcagctcgactctgattcggaccgcattggagatacctgtgacaacaatcag
gacattgatgaagacggccaccagaacaacctggacaactgtccctacgtgcccaacgcc
aaccaggccgaccacgacaaggacggcaagggcgatgcctgcgaccatgacgatgacaat
gacggcattcctgatgacagggacaactgcaggcttgtgcccaatcctgaccagaaggat
tctgatggtgatggtcgaggtgatgcttgcaaagatgattttgaccaggacaacgtgcca
gacatcgatgacatctgtcccgaaaacgttgatatcagtgagaccgatttccgccgattc
cagatgattcctctagatcccaaagggacatcccaaaatgaccctaactgggttgtacgc
catcagggtaaagaactcgtccagactgtcaactgtgatcctggacttgctgtaggttat
gatgagtttaacgctgtggacttcagcggcaccttcttcatcaacaccgaaagggatgat
gactatgccggatttgtctttggctaccagtccagcagccgcttctacgttgtgatgtgg
aagcaagtcactcagtcctactgggacaccaaccccacaagggctcagggatactccggc
ctttctgtgaaggtcgtgaactccaccacggggcctggcgagcacctgcggaatgcgctg
tggcacacaggaaacacccctggccaggtacgcacgctgtggcatgaccctcgtcacata
ggctggaaagatttcaccgcctacagatggcgtctgagccacaggccaaagacgggtttc
attagagtggtgatgtatgaagggaagaaaatcatggccgactcaggacccatctatgac
aaaacctatgctggcggtaggctagggttgttcgtcttctctcaagaaatggtgttcttc
tctgacctgaaatatgaatgcagagattcctaa

KEGG   Physeter catodon (sperm whale): 102996119
Entry
102996119         CDS       T06011                                 

Gene name
LAMB1
Definition
(RefSeq) laminin subunit beta-1
  KO
K05636  laminin, beta 1
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    102996119 (LAMB1)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    102996119 (LAMB1)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    102996119 (LAMB1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    102996119 (LAMB1)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    102996119 (LAMB1)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    102996119 (LAMB1)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    102996119 (LAMB1)
   05145 Toxoplasmosis
    102996119 (LAMB1)
SSDB
Motif
Pfam: Laminin_EGF Laminin_N CorA F5_F8_type_C
Other DBs
NCBI-GeneID: 102996119
NCBI-ProteinID: XP_007117747
UniProt: A0A2Y9FB62
LinkDB
Position
5
AA seq 1786 aa
MGRLQMFAFGFLALCGAQVSAQEPDFSYGCAEGSCYPATGDLLIGRAQKLSVTSTCGLYK
PEPYCIVSHLQEDKKCFICNSQDPYHETLNPDSHLIENVVTTFAPNRLKIWWQSENGVEN
VTIQLDLEAEFHFTHLIMTFKTFRPAAMLIERSSDFGKTWGVYRYFAYDCESSFPGISTG
PMKKVDDIICDSRYSDIEPSTEGEVIFRALDPAFRIEDPYSPRIQNLLKITNLRIKFVKL
HTLGDNLLDSRMEIREKYYYAVYDMVIRGNCFCYGHASECAPVDGFHTEGEGMVHGHCMC
RHNTKGLNCEQCMDFYHDLPWRPAEGRNSNACKKCNCNEHSSSCHFDMAVYLATGNVSGG
VCDDCQHNTMGRSCEQCKPFYYQHPERDVRDPNLCEQCTCDPAGSQNKGICDGYTDFSAG
LIAGQCRCKLHVEGEHCDICKEGFYGLSTEDPFGCKACTCSPLGTIPGRSPCDSETGYCY
CKRLVTGQHCDQCLPEHWGLSNDLDGCRPCDCDLGGALNNSCSAESGQCACRPHMIGRQC
NGVESGYYFPTLDHYIYEAEEANLGPGVSIVERQYIQDRIPSWTGAGFVRVPEGAYLEFF
IDNIPYSMEYDILIRYEPQLPDHWEKAVITVQRPGKIPPSSRCGNTVPDDDNQVVSLSPG
SRYVVLPRPVCLEKGVNYTVRLELPQYTSSDSRVDSPYTLIDSLVLMPFCKSLDIFTVGG
SGDGVVTNSAWETFQRYRCLENSRSVVKTPMTDVCRNIIFSISALLHQTGLACECDPQGS
LSSVCDPNGGQCQCRPNVVGRRCDRCAPGTFGFGPSGCKPCECHLQGSVDAFCDPITGQC
HCFQGVYTRQCDRCLPGYWGFPSCQPCQCNGHANDCDSVTGECLSCQDYTTGHNCERCLA
GYYGDPIIGSGDHCRPCPCPDGPDSGRQFASSCYQDPVTLQLACVCNPGYIGSRCDDCAS
GFFGNPSDVGGTCQPCRCHRNIDATDPEACDKETGRCLKCLYHTEGEHCQLCRLGYYGDA
LQQDCRKCVCNYLGTVQEPCDGSDCQCDRITGQCLCLPNVIGQNCDRCAPNTWQLASGTG
CDPCNCHAAHSFGPSCNEFTGQCQCMPGFGGRTCSECQELFWGDPSVECRACDCDPRGIE
TPQCDQSTGQCVCVEGVEGPRCDKCTRGYSGVFPDCMPCHQCFAVWDVIIAELTNRTRKF
LEKAKALKISGVIGPYRETVDSVEKKVNEIRDVLAQSPAAEPLKNVGNLFEEAEKLTKDV
TEKMAQVEVNLSDTATQNNSTARELDALQTEAESLDNTVKELAEQLEFIKNSDIRGALDS
ITKYFQMSLEAEERVNASTTDSSGTVEQSARTRDRVEDLMLERESQFREKQEEQARLLDE
LAGKLQSLDLSAAAEMTCGTPPGASCSETECGGPNCRTDEGQKQCGGPGCGGLVAVAHGA
WQKAMDFDRDVLSALAEVERLSKMVSEAKLRADEAKQNAQDVLLKTNATKEKVDKSNEDL
RNLIKQIRNFLTQDSADLDSIEAVANEVLKMEMPSTPQQLQNLTEDIRERVESLSQVEVI
LQQSAADIARAEMLLEEAQRASKSATDVKVTADMVKEALEEAEKAQIAAEKAIKQADEDI
QGTQNLLTSIESETAASEETLFNASQRLSELERNVEELKRKAAQNSGEAEYIEKVVYTVK
QSAEDVKKTLDGELDEKYKKVENIIAQKTEESADARRKAELLQNEAKTLLAQANSKLQLL
KDLERKYEDNQKYLEDKAQELVRLEGEVRSLLKDISQKVAVYSTCL
NT seq 5361 nt   +upstreamnt  +downstreamnt
atggggcggctccagatgttcgcgttcggtttcttagccctgtgcggagcccaagtgagt
gctcaagaacccgatttcagctacggctgcgcggagggcagctgctacccggccaccggc
gaccttctcatcggccgagcccagaagctctcggtgacctcgacctgcgggctgtacaaa
cccgagccttactgtatcgtcagccatctccaggaggacaaaaaatgcttcatatgcaat
tcacaagatccttatcatgaaaccctgaaccctgacagtcatctcattgaaaatgtggtc
acgacatttgctccaaaccgtcttaagatctggtggcaatccgaaaatggcgtggaaaat
gtaactatccaactggatttggaagcagaattccatttcactcatctcatcatgaccttc
aagacattccgtccagccgctatgctgatagaacgatcgtccgactttggaaaaacctgg
ggtgtgtacagatactttgcctatgattgtgagagctcgtttcccggcatttccaccggc
cccatgaaaaaagtcgatgatataatttgtgattcccgatattctgacattgaaccctca
acggaaggagaggtgatttttcgtgctttagatcctgctttcagaatagaagacccttac
agtccaaggatacagaatctcttaaaaatcaccaacttgagaatcaagtttgtgaaacta
catactttgggagataaccttttggattccagaatggaaatcagagaaaagtactattac
gcggtttacgatatggtgattcgaggaaactgcttctgctatggccatgccagcgagtgt
gccccggtggatggattccacacagaaggagaaggaatggttcacggacactgcatgtgc
aggcataacaccaagggcttaaactgtgaacagtgcatggacttctaccatgatttacct
tggagacccgccgaaggtcgaaacagcaatgcctgtaaaaaatgtaactgcaacgaacat
tccagctcttgccacttcgacatggccgtctacttggccactggcaacgtcagcggaggg
gtgtgtgatgattgtcagcacaacacgatggggcgtagctgcgagcagtgcaaaccgttt
tactaccagcacccggagagggacgtccgggatcctaatctctgtgaacaatgtacctgt
gacccagctggttctcagaacaaggggatctgtgacggttacacggatttttctgccggt
ctcattgctggtcagtgtcggtgtaaattacacgtggaaggagaacattgtgatatctgc
aaagaaggcttctatggtttaagcactgaagatccatttggttgtaaagcttgtacttgc
agtcctctgggaacaattcctggcaggagtccttgtgattctgagactggttactgctac
tgtaagcgtctggtgacaggacagcactgtgaccagtgtctgccagagcactggggctta
agcaacgacttggatggatgtagaccttgtgactgtgaccttgggggagccttaaacaat
agttgctctgcggagtcaggccagtgcgcctgccggccccacatgattggacgccagtgc
aacggagtggaatctggttactacttccccaccttggatcactacatctacgaagcagaa
gaagccaacttggggcctggggtcagcatagtggagcggcagtatatccaggaccgtatt
ccttcctggactggagcaggcttcgtccgagtgcctgaaggggcttatttggagttcttc
attgacaacataccatattctatggagtatgacatcctaattcgctatgagccacagctc
cccgaccactgggagaaagctgtcatcactgtgcagcgacctgggaagattccacccagc
agccgatgtggaaacactgttcctgatgatgacaaccaggtggtgtccctatcgcctggc
tccaggtatgtggtcctcccccgccccgtgtgcttggagaagggggtgaactacacggtg
cggttagagctgccccagtacacctcctcagacagccgtgtggacagcccctacacgctg
atcgattctcttgttctcatgccattctgtaagtcgctggacatcttcaccgtgggaggc
tcaggagatggggtcgtcaccaacagtgcctgggaaacctttcagagataccgatgcctg
gagaacagcagaagcgtcgtgaagactccgatgaccgatgtttgcagaaacatcatcttt
agcatctccgccctgttacaccagacaggcctggcttgtgagtgtgaccctcagggctcc
ttaagttccgtctgcgaccccaacggaggccagtgtcagtgccggccaaacgtggttgga
agaaggtgtgacagatgtgcgcccggcacctttggctttggccccagcgggtgcaaacct
tgtgagtgccatctgcaaggatccgtcgatgctttctgtgaccccatcactggccagtgc
cactgtttccagggggtgtacacgcggcagtgtgaccggtgcttacccgggtactggggc
tttccaagctgccagccctgccagtgcaacggccatgccaatgactgcgactcagtgacg
ggggaatgcctgagctgccaggactacaccacggggcataactgtgaaaggtgcttggct
ggttactatggagatcccatcattggatcaggagatcactgccgtccttgtccttgtcca
gatggtcccgacagtggacgccagtttgccagtagctgttatcaagaccctgttacttta
cagcttgcctgtgtctgtaatcctggatacatcggctccagatgtgacgactgcgcctcg
ggcttctttggcaatccctcggacgtggggggcacgtgtcagccttgccggtgtcaccgc
aacatcgacgcgacggacccagaagcctgtgacaaggagaccgggaggtgcctcaagtgc
ctctaccacacggagggggagcactgccagctctgccgcctcggttactatggcgacgcc
ctccagcaggattgtcgcaagtgtgtgtgcaattacctgggcaccgtgcaggagccctgt
gacggctctgactgccagtgcgacagaatcaccgggcagtgcttgtgtctccctaacgtg
atcggccagaactgtgaccgctgtgcacccaacacctggcagctggccagcgggacgggg
tgtgacccgtgcaactgccatgctgctcattcctttgggccgtcttgtaacgagttcaca
gggcagtgccagtgcatgcctggcttcggaggccgcacgtgcagcgagtgccaggagctt
ttctggggggaccccagcgtggagtgccgagcctgtgactgcgacccccggggcatcgag
acaccgcagtgtgaccagtccaccggccagtgtgtctgcgtcgagggtgtggagggtcca
cgctgtgacaagtgcactcgggggtactcgggcgtcttccccgactgcatgccctgccac
cagtgctttgctgtttgggacgtgatcatcgccgagctgaccaacaggacccggaagttc
ctggagaaggccaaggccttgaagatcagtggcgtgattgggccttaccgggagactgtg
gactcggtggagaagaaagtcaatgagatcagagacgtcctggcccagagcccagccgcc
gagccactgaagaacgtcggaaacctctttgaggaagcagagaaactaaccaaagatgtt
acagaaaagatggctcaagtagaagtaaatttatctgacacagctacgcaaaacaacagc
acggccagagaactagatgctctacagacagaagcagaaagcctggataacacagtgaag
gagcttgctgaacaattggaatttatcaaaaattcagatattcggggcgccttggacagc
atcaccaagtatttccagatgtctctcgaggcagaggagcgcgtgaacgcttccaccaca
gattccagtggcactgtggagcagtcggcccgcactcgggacagagtggaagacttgatg
ttggagcgagagtcccagttcagggaaaaacaggaggagcaggcccgccttctggatgaa
ctggcaggcaaactacaaagtcttgacctttcagctgctgccgaaatgacgtgtggaaca
cctccaggagcctcctgttccgagaccgaatgtggcggcccaaactgcagaactgacgaa
ggacagaagcaatgtggggggcccggctgtggcgggctggtcgctgtcgcacatggcgcc
tggcagaaagccatggactttgaccgagatgtcctgagtgccctggctgaggtggaacgg
ctctccaagatggtctctgaagcaaaactgagggcagatgaggcaaaacagaatgctcaa
gatgttctgttgaaaaccaacgctaccaaagaaaaagtggacaagagcaatgaggatctg
agaaatctgatcaagcaaatcaggaactttttgacccaggatagtgctgacctagacagc
attgaagcagttgctaacgaagtactgaaaatggaaatgcctagcaccccacagcagtta
cagaacttgacagaagatattcgtgaacgagttgaaagcctttctcaagtagaggttatt
ctacagcagagtgctgctgacattgccagagctgagatgttgttagaagaagctcaaaga
gccagcaaaagtgcaacagatgttaaagtcactgcagacatggtaaaggaagctctggaa
gaagcagaaaaggcccagattgcagcagagaaggcaattaaacaagccgatgaagacatc
caaggaacccagaacctgctaacttcaatcgagtctgaaacagcagcttccgaggaaacc
ttgttcaatgcctcccagcgcctcagtgagctggagagaaatgtggaagaacttaagcgg
aaagctgcccagaactctggggaggcagaatatattgaaaaagtagtatatactgtgaaa
caaagtgcagaagatgttaagaagactctggatggtgaacttgatgaaaagtacaagaaa
gtagaaaatataattgcccaaaaaactgaagagtcagctgatgccagaaggaaagctgaa
ctgctacaaaacgaagcaaaaacacttttggctcaagcaaacagcaagctgcaactcctg
aaagatttagaaagaaaatatgaggacaatcaaaaatatctagaagataaagctcaagaa
ttagtaagactggaaggagaagtccgttcactcctaaaggatataagccagaaagtcgct
gtttatagcacctgcttgtaa

KEGG   Physeter catodon (sperm whale): 114487451
Entry
114487451         CDS       T06011                                 

Gene name
LAMC3
Definition
(RefSeq) laminin subunit gamma-3
  KO
K06247  laminin, gamma 3
Organism
pcad  Physeter catodon (sperm whale)
Pathway
pcad04151  PI3K-Akt signaling pathway
pcad04510  Focal adhesion
pcad04512  ECM-receptor interaction
pcad05145  Toxoplasmosis
pcad05146  Amoebiasis
pcad05165  Human papillomavirus infection
pcad05200  Pathways in cancer
pcad05222  Small cell lung cancer
Brite
KEGG Orthology (KO) [BR:pcad00001]
 09130 Environmental Information Processing
  09132 Signal transduction
   04151 PI3K-Akt signaling pathway
    114487451 (LAMC3)
  09133 Signaling molecules and interaction
   04512 ECM-receptor interaction
    114487451 (LAMC3)
 09140 Cellular Processes
  09144 Cellular community - eukaryotes
   04510 Focal adhesion
    114487451 (LAMC3)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    114487451 (LAMC3)
  09162 Cancer: specific types
   05222 Small cell lung cancer
    114487451 (LAMC3)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    114487451 (LAMC3)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    114487451 (LAMC3)
   05145 Toxoplasmosis
    114487451 (LAMC3)
SSDB
Motif
Pfam: Laminin_N Laminin_EGF F5_F8_type_C KORA
Other DBs
NCBI-GeneID: 114487451
NCBI-ProteinID: XP_028353183
UniProt: A0A455BZH7
LinkDB
Position
13
AA seq 788 aa
MAAVAALLLGLALLAPRAAGAGMGACYDGAGRPQRCLPVFENAAFGRLAEASHTCGRPPE
DFCPHVGAQGAGAQCQRCDAADPERHHNASYLTDFHSQDDSTWWQSPSVAFGVQYPTSVN
ITLNLGKAYEITYVRLKFHTSRPESFAIYKRSRAGGPWEPYQYYSASCQTTYGKPEGQYL
RPGEDERVALCTSEFSDISPLSGGNVAFSTLEGRPSAYNFEESPVLQDWVTSTELLISLD
RLNTFGDDIFKDPKVLQSYYYAVSDFSVGGRCKCNGHASECGPNEEGRLVCRCRHNTAGT
DCERCLPFFRDRPWARATAEAASECLRPQAALRCPGPHATWLALPNLTDTLEGMLPSYFF
QERGPRLRVTVGRVLSYSHRDTTTRIEATARRALLTSNSSYVLLWSLVEGRTALEAQQEL
EERYQEVYAAQRALGTAVAEVLPEAERVLAAVQQVGADAAQRLASPAAPAAPPQKSQART
LGLKVQALEKTVTSRERVVTEAAQALQATAQAVLHKTEPLTQLHREARAALTWASSSVQA
AMVTVTGARTLLADLEGMKPRFPRPKDQAALRRKAAIVQDRLLADSKKKTKQVERMLGNA
ASVSSSAKKKGREAELLAKDGAQLAEALLREGKQEHRRAGRLSSQTRSMLRQASQQVLSS
EARRQQPEAADRVGAGLSEMQRQIRESRTSLEKDTEALSELLARLGSLDTLGAPARALNE
TQRALERLRLQLGPPGALQGKLRLLEQESAQQQLQIQSFESSLAEIRADKQNLEAILHSL
PESCASWQ
NT seq 2367 nt   +upstreamnt  +downstreamnt
atggcggcggtggccgcgctcctgctgggtctggcgctcctggcgccgcgggcggccggc
gcgggcatgggcgcgtgttacgacggcgcggggcgcccgcagcgctgcctgcccgtgttc
gagaacgcggcgttcggcaggctagccgaggcctcccacacgtgcggccgcccgcccgag
gacttctgtccgcacgtgggcgctcagggagcgggggcgcagtgccagcgctgcgacgcc
gccgaccccgagcgccaccacaacgcctcctacctcaccgacttccacagccaggacgac
agcacctggtggcagagcccgtccgtggccttcggggtgcagtaccccacctcggtcaat
atcaccctcaacctggggaaagcttacgagatcacctacgtacggctgaagttccacacc
agccgccctgagagcttcgccatctacaagcgcagccgggccggtggtccctgggagccc
taccagtactacagcgcctcctgccagacgacctacggcaagcccgagggccagtacctg
cgccccggcgaggacgagcgcgtggccttatgcacctccgagttcagcgacatctccccg
ctgagcgggggcaacgtggccttctccaccctggagggccggcccagtgcctacaacttt
gaggagagccccgtgttgcaggactgggtcaccagcactgagctcctcatctccctagac
cggctcaacacgtttggggatgacatcttcaaggaccccaaggtgctgcaatcctactac
tacgccgtgtcggacttctctgtgggtggcaggtgcaagtgcaacgggcacgccagcgag
tgtggccccaacgaggagggccggctggtctgccgatgccggcacaacaccgcgggcaca
gactgcgagcgctgcctgcccttcttccgggaccgcccgtgggcccgggccacagccgag
gcggccagcgagtgtctgcgcccccaggcggcccttcgctgccccggcccccatgcgacg
tggctggcactgcctaacctcacagacacgctggaggggatgttgccatcctactttttc
caggaacgggggccgcgcctccgggtgacggtcgggcgggtattgtcatacagtcacagg
gacaccaccaccaggattgaggctactgctcggagggccctgctcacctccaacagcagt
tacgtgcttctctggagtctggtggaggggagaacagccctggaggcccagcaggagcta
gaggaaaggtaccaggaggtatacgcggcccagagggcgctgggcacggccgtggcagag
gtgctgcctgaagctgagagggtgctggccgccgtgcagcaagtcggcgcagatgcagcc
cagcgcctggcctcaccggctgcccctgcggcaccgcctcagaagtcccaggccaggacc
cttggcctgaaggtgcaggccctggagaagacggtcacatcgagagagcgcgtggtcacc
gaggctgcccaggccctccaggctaccgcccaggctgtgctgcacaagacagagcctctt
actcagctgcaccgggaagccagagctgccctgacctgggcttcctcatctgtccaggct
gccatggtgactgtcacaggagccaggactctgctggctgacctggaaggaatgaagcca
aggtttcctcggcccaaggaccaggccgcactgaggaggaaggcagccatcgtccaggac
aggctccttgcagactccaaaaagaagaccaagcaggtggagaggatgctgggaaatgcg
gcatctgtctcctccagtgccaagaagaaaggcagggaagccgagctgttggccaaggac
ggtgcccagctggccgaggccttgctgagggaggggaagcaagagcaccgccgggccggc
cggctctccagccagacgcggtcgatgctccgacaggcctcccagcaggtgctctcctca
gaagcacgcagacagcagccagaagcagctgatcgggtgggtgccgggctgagcgagatg
cagcggcagatccgagaatcacgcacctctctggagaaggacaccgaagccttgtcggag
ctgcttgccaggctggggtcactggacaccctcggagcccccgcccgggccctgaatgag
acccagcgggcactggagcgcctgaggctgcaactgggcccaccgggggcgctgcaaggg
aaactgaggctcttggagcaggagtcggcacagcagcagctgcagatccagagcttcgag
agcagccttgctgagatccgtgctgacaagcagaatctggaggccattttgcatagcctg
cccgagagctgtgccagctggcagtga

DBGET integrated database retrieval system