KEGG   Homo sapiens (human): 4583
Entry
4583              CDS       T01001                                 
Symbol
MUC2, MLP, MUC-2, SMUC
Name
(RefSeq) mucin-2 precursor
  KO
K10955  mucin-2
Organism
hsa  Homo sapiens (human)
Pathway
hsa05146  Amoebiasis
hsa05226  Gastric cancer
Network
nt06240  Transcription (cancer)
nt06261  Gastric cancer
  Element
N00250  CDX2-overexpression to transcriptional activation
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09162 Cancer: specific types
   05226 Gastric cancer
    4583 (MUC2)
  09174 Infectious disease: parasitic
   05146 Amoebiasis
    4583 (MUC2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   04131 Membrane trafficking [BR:hsa04131]
    4583 (MUC2)
Membrane trafficking [BR:hsa04131]
 Others
  Mucins
   Secretory mucins
    4583 (MUC2)
SSDB
Motif
Pfam: VWD C8 Mucin2_WxxW VWF TIL_OTOGL_Mucin TIL VWC Pacifastin_I
Other DBs
NCBI-GeneID: 4583
NCBI-ProteinID: NP_002448
OMIM: 158370
HGNC: 7512
UniProt: Q02817 A0A3S8TMF2
Structure
LinkDB
Position
11:1074874..1110508
AA seq 5130 aa
MGLPLARLAAVCLALSLAGGSELQTEGRTRNHGHNVCSTWGNFHYKTFDGDVFRFPGLCD
YNFASDCRGSYKEFAVHLKRGPGQAEAPAGVESILLTIKDDTIYLTRHLAVLNGAVVSTP
HYSPGLLIEKSDAYTKVYSRAGLTLMWNREDALMLELDTKFRNHTCGLCGDYNGLQSYSE
FLSDGVLFSPLEFGNMQKINQPDVVCEDPEEEVAPASCSEHRAECERLLTAEAFADCQDL
VPLEPYLRACQQDRCRCPGGDTCVCSTVAEFSRQCSHAGGRPGNWRTATLCPKTCPGNLV
YLESGSPCMDTCSHLEVSSLCEEHRMDGCFCPEGTVYDDIGDSGCVPVSQCHCRLHGHLY
TPGQEITNDCEQCVCNAGRWVCKDLPCPGTCALEGGSHITTFDGKTYTFHGDCYYVLAKG
DHNDSYALLGELAPCGSTDKQTCLKTVVLLADKKKNVVVFKSDGSVLLNELQVNLPHVTA
SFSVFRPSSYHIMVSMAIGVRLQVQLAPVMQLFVTLDQASQGQVQGLCGNFNGLEGDDFK
TASGLVEATGAGFANTWKAQSSCHDKLDWLDDPCSLNIESANYAEHWCSLLKKTETPFGR
CHSAVDPAEYYKRCKYDTCNCQNNEDCLCAALSSYARACTAKGVMLWGWREHVCNKDVGS
CPNSQVFLYNLTTCQQTCRSLSEADSHCLEGFAPVDGCGCPDHTFLDEKGRCVPLAKCSC
YHRGLYLEAGDVVVRQEERCVCRDGRLHCRQIRLIGQSCTAPKIHMDCSNLTALATSKPR
ALSCQTLAAGYYHTECVSGCVCPDGLMDDGRGGCVVEKECPCVHNNDLYSSGAKIKVDCN
TCTCKRGRWVCTQAVCHGTCSIYGSGHYITFDGKYYDFDGHCSYVAVQDYCGQNSSLGSF
SIITENVPCGTTGVTCSKAIKIFMGRTELKLEDKHRVVIQRDEGHHVAYTTREVGQYLVV
ESSTGIIVIWDKRTTVFIKLAPSYKGTVCGLCGNFDHRSNNDFTTRDHMVVSSELDFGNS
WKEAPTCPDVSTNPEPCSLNPHRRSWAEKQCSILKSSVFSICHSKVDPKPFYEACVHDSC
SCDTGGDCECFCSAVASYAQECTKEGACVFWRTPDLCPIFCDYYNPPHECEWHYEPCGNR
SFETCRTINGIHSNISVSYLEGCYPRCPKDRPIYEEDLKKCVTADKCGCYVEDTHYPPGA
SVPTEETCKSCVCTNSSQVVCRPEEGKILNQTQDGAFCYWEICGPNGTVEKHFNICSITT
RPSTLTTFTTITLPTTPTTFTTTTTTTTPTSSTVLSTTPKLCCLWSDWINEDHPSSGSDD
GDRETFDGVCGAPEDIECRSVKDPHLSLEQLGQKVQCDVSVGFICKNEDQFGNGPFGLCY
DYKIRVNCCWPMDKCITTPSPPTTTPSPPPTSTTTLPPTTTPSPPTTTTTTPPPTTTPSP
PITTTTTPPPTTTPSPPISTTTTPPPTTTPSPPTTTPSPPTTTPSPPTTTTTTPPPTTTP
SPPTTTPITPPASTTTLPPTTTPSPPTTTTTTPPPTTTPSPPTTTPITPPTSTTTLPPTT
TPSPPPTTTTTPPPTTTPSPPTTTTPSPPTITTTTPPPTTTPSPPTTTTTTPPPTTTPSP
PTTTPITPPTSTTTLPPTTTPSPPPTTTTTPPPTTTPSPPTTTTPSPPITTTTTPPPTTT
PSSPITTTPSPPTTTMTTPSPTTTPSSPITTTTTPSSTTTPSPPPTTMTTPSPTTTPSPP
TTTTTTLPPTTTSSPLTTTPLPPSITPPTFSPFSTTTPTTPCVPLCNWTGWLDSGKPNFH
KPGGDTELIGDVCGPGWAANISCRATMYPDVPIGQLGQTVVCDVSVGLICKNEDQKPGGV
IPMAFCLNYEINVQCCECVTQPTTMTTTTTENPTPTPITTTTTVTPTPTPTSTQSTTPTP
ITTTNTVTPTPTPTGTQTPTPTPITTTTTMVTPTPTITSTQTPTPTPITTTTVTPTPTPT
STQRTTPTSITTTTTVTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGTQTPTTTPISTTT
TVTPTPTPTGTQTLTPTPITTTTTVTPTPTPTGTQTPTSTPITTTTTVTPTPTPTGTQTP
TLTPITTTTTVTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGTKSTTPTSITTTTMVTPT
PPPTGTQTPTTTPITTTTTVTPTPTPTGTQTPTPTPITTTTTVTPTPTPTGTQTPTSTPI
TTNTTVTPTPTPTGTPSTTLTPITTTTTVTPTPTPTGTQTPTSTPISTTTMVTPTPTPTG
TQTPTPTPISTTTTVTPTPTPTGTQTPTPTPITTTTTVTPTPTPTGTQTPTSTPITTTTT
VTPTPTPTGTQTPTTTPITTNTTVTPTPTPTGTQTPTTVLITTTTTMTPTPTPTSTKSTT
VTPITTTTTVTPTPTPTGTQSTTLTPITTTTTVTPTPTPTGIQTPTTTPISTTTTVTPTP
TPTGTQTPTSTPITTTTTVTPTPTPTGTQTPTSTPISTTTTVTPTATPTGTQTPTLTPIT
TTTTVTPTPTPTGTKSTTPTSITTTTTVTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGT
QTPTPTPITTTTTVTPTPTPTSTQTPTSTPITTTTTVTPTPTPTGTQTPTTTPITTTTTV
TPTPTPTGTQAPTPTAITTTTTGTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGTQSPTP
TAITTTTTVTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGTQSTTLTPITTTTTVTPTPT
PTGTQTPTSTPITTTITVTPTPTPTGTQTPTPTPISTTTTVTPTPTPTGTQTPTSTPITT
TTTVTPTPTPTGTQTPTTTPISTTTTVTPTPTPTGTQTPTSTPITTTTTVTPTPTPTGTQ
TPTTTPISTTTTVTPTPTPTGTQTPTSTPITTTTTVTPTPTPTGTQTPTPTPITTTTTVT
PTPTPTGTQTPTSTPITTTTTVTPTPTPTGTQTPTPTPITTTTTVTPTPTPTGTQTPTPT
PITTTTTVTPTPTPTGTQTPTSTPITTTTTVTPTPTPTGTQTPTTTPITTTTTVTPTPTP
TGTQSTTLTPITTTTTVTPTPTPTGTQTPTSTPITTITTVTPTPTPTGTQTPTPTPISTT
TTVTPTPTPTGTQTPTMTPITTTTTVTPTPTPTGTQTPTTTPISTTTTVTPTPTPTGTQT
PTSTPITTTTTVTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGTQSTTLTPITTTTTVTP
TPTPTGTQTPTPTPISTTTTVTPTPTPTGTQTPTMTPITTTTTVTPTPTPTGTQTPTTTP
ISTTTTVTPTPTPTGTQTPRSTPITTTTKVTPTPTPTGTQTPTPTPITTTTTVTPTPTPT
GTQAPTPAAITTTSTVTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGTQSTTLTPITTTT
TVTPTPTPTGTQTPTSTPITTTTTVTPTPTPTGTQTPTPTPISTTSTVTPTPTPTGTQTP
TMTPITTTTTVTPTPTPTGTQTPTTTPISTTTTVTPTPTPTGTQNPTSTPITTTTTVTPT
PTPTGTQTPTMTPITTTTTVTPTPTPTGTQAPTPTAITTTTTVTPTPTPTGTQTPTTTPI
TTTTTVTPTPIPTGTQSTTLTPITTTTTVTPTPTPTGTQTPTPIPISTTTTVTPTPTPTG
TQTPTMTPITTTTTVTPTPTPTGTQTPTTTPISTTTTVTPTPTPTGTQTPTSTPITTTTT
VTPTPIPTGTQTPTTTPITTTTTVTPTPTPTGTQAPTPTAITTTTTVTPTPTPTGTQTPT
TTPITTTTTVTPTPIPTGTQSTTLTPITTTTTVTPTPTPTSTQTPTPTPISTTTTVTPTP
TPTGTQTPTMTPITTTTTVTPTPTPTGTQTPTTTPISTTTTVTPTPTPTGTQTPTSTPIT
TTTTVTPTPTSTGTQTPTTTPITTTTTVTPTPTPTGTQAPTPTAITTTSTVTPTPTPTGT
QTPTTTPITTTTTVTPTPTPTGTQSPTPTAITTTTTVTPTPTPTGTQTPTSTPITTTTTV
TPTPTPTGTQTPTPTPISTTTTVTPTPTPTGTQTPTTTPITTTTTVTPTPTPTGTQTPTT
VLITTTTTMTPTPTPTSTKSTTVTPITTTTTVTATPTPTGTQTPTMIPISTTTTVTPTPT
PTTGSTGPPTHTSTAPIAELTTSNPPPESSTPQTSRSTSSPLTESTTLLSTLPPAIEMTS
TAPPSTPTAPTTTSGGHTLSPPPSTTTSPPGTPTRGTTTGSSSAPTPSTVQTTTTSAWTP
TPTPLSTPSIIRTTGLRPYPSSVLICCVLNDTYYAPGEEVYNGTYGDTCYFVNCSLSCTL
EFYNWSCPSTPSPTPTPSKSTPTPSKPSSTPSKPTPGTKPPECPDFDPPRQENETWWLCD
CFMATCKYNNTVEIVKVECEPPPMPTCSNGLQPVRVEDPDGCCWHWECDCYCTGWGDPHY
VTFDGLYYSYQGNCTYVLVEEISPSVDNFGVYIDNYHCDPNDKVSCPRTLIVRHETQEVL
IKTVHMMPMQVQVQVNRQAVALPYKKYGLEVYQSGINYVVDIPELGVLVSYNGLSFSVRL
PYHRFGNNTKGQCGTCTNTTSDDCILPSGEIVSNCEAAADQWLVNDPSKPHCPHSSSTTK
RPAVTVPGGGKTTPHKDCTPSPLCQLIKDSLFAQCHALVPPQHYYDACVFDSCFMPGSSL
ECASLQAYAALCAQQNICLDWRNHTHGACLVECPSHREYQACGPAEEPTCKSSSSQQNNT
VLVEGCFCPEGTMNYAPGFDVCVKTCGCVGPDNVPREFGEHFEFDCKNCVCLEGGSGIIC
QPKRCSQKPVTHCVEDGTYLATEVNPADTCCNITVCKCNTSLCKEKPSVCPLGFEVKSKM
VPGRCCPFYWCESKGVCVHGNAEYQPGSPVYSSKCQDCVCTDKVDNNTLLNVIACTHVPC
NTSCSPGFELMEAPGECCKKCEQTHCIIKRPDNQHVILKPGDFKSDPKNNCTFFSCVKIH
NQLISSVSNITCPNFDASICIPGSITFMPNGCCKTCTPRNETRVPCSTVPVTTEVSYAGC
TKTVLMNHCSGSCGTFVMYSAKAQALDHSCSCCKEEKTSQREVVLSCPNGGSLTHTYTHI
ESCQCQDTVCGLPTGTSRRARRSPRHLGSG
NT seq 12474 nt   +upstreamnt  +downstreamnt
atggggctgccactagcccgcctggcggctgtgtgcctggccctgtctttggcagggggc
tcggagctccagacagagggcagaacccgaaaccacggccacaacgtctgcagcacctgg
ggcaacttccactacaagaccttcgacggggacgtcttccgcttccccggcccctgcgac
tacaacttcgcctccgactgccgaggctcctacaaggaatttgctgtgcacctgaagcgg
ggtccgggccaggctgaggcccccgccggggtggagtccatcctgctgaccatcaaggat
gacaccatctacctcacccgccacctggctgtgcttaacggggccgtggtcagcaccccg
cactacagccccgggctgctcattgagaagagcgatgcctacaccaaagtctactcccgc
gccggcctcaccctcatgtggaaccgggaggatgcactcatgctggagctggacactaag
ttccggaaccacacctgtggcctctgcggggactacaacggcctgcagagctattcagaa
ttcctctctgacggcgtgctcttcagtcccctggagtttgggaacatgcagaagatcaac
cagcccgatgtggtgtgtgaggatcccgaggaggaggtggcccccgcatcctgctccgag
caccgcgccgagtgtgagaggctgctgaccgccgaggccttcgcggactgtcaggacctg
gtgccgctggagccgtatctgcgcgcctgccagcaggaccgctgccggtgcccgggcggt
gacacctgcgtctgcagcaccgtggccgagttctcccgccagtgctcccacgccggcggc
cggcccgggaactggaggaccgccacgctctgccccaagacctgccccgggaacctggtg
tacctggagagcggctcgccctgcatggacacctgctcacacctggaggtgagcagcctg
tgcgaggagcaccgcatggacggctgtttctgcccagaaggcaccgtatatgacgacatc
ggggacagtggctgcgttcctgtgagccagtgccactgcaggctgcacggacacctgtac
acaccgggccaggagatcaccaatgactgcgagcagtgtgtctgtaacgctggccgctgg
gtgtgcaaagacctgccctgccccggcacctgtgccctggaaggcggctcccacatcacc
accttcgatgggaagacgtacaccttccacggggactgctactatgtcctggccaagggt
gaccacaacgattcctacgctctcctgggcgagctggccccctgtggctccacagacaag
cagacctgcctgaagacggtggtgctgctggctgacaagaagaagaatgtggtggtcttc
aagtccgatggcagtgtactgctcaacgagctgcaggtgaacctgccccacgtgaccgcg
agcttctctgtcttccgcccgtcttcctaccacatcatggtgagcatggccattggcgtc
cggctgcaggtgcagctggccccagtcatgcaactctttgtgacactggaccaggcctcc
caggggcaggtgcagggcctctgcgggaacttcaacggcctggaaggtgacgacttcaag
acggccagcgggctggtggaggccacgggggccggctttgccaacacctggaaggcacag
tcaacctgccatgacaagctggactggttggacgatccctgctccctgaacatcgagagc
gccaactacgccgagcactggtgctccctcctgaagaagacagagaccccctttggcagg
tgccactcggctgtggaccctgctgagtattacaagaggtgcaaatatgacacgtgtaac
tgtcagaacaatgaggactgcctgtgcgccgccctgtcctcctacgcgcgcgcctgcacc
gccaagggcgtcatgctgtggggctggcgggagcatgtctgcaacaaggatgtgggctcc
tgccccaactcgcaggtcttcctgtacaacctgaccacctgccagcagacctgccgctcc
ctctccgaggccgacagccactgtctcgagggctttgcgcctgtggacggctgcggctgc
cctgaccacaccttcctggacgagaagggccgctgcgtacccctggccaagtgctcctgt
taccaccgcggtctctacctggaggcgggggacgtggtcgtcaggcaggaagaacgatgt
gtgtgccgggatgggcggctgcactgtaggcagatccggctgatcggccagagctgcacg
gccccaaagatccacatggactgcagcaacctgactgcactggccacctcgaagccccga
gccctcagctgccagacgctggccgccggctattaccacacagagtgtgtcagtggctgt
gtgtgccccgacgggctgatggatgacggccggggtggctgcgtggtggagaaggaatgc
ccttgcgtccataacaacgacctgtattcttccggcgccaagatcaaggtggactgcaat
acctgcacctgcaagagaggacgctgggtgtgcacccaggctgtgtgccatggcacctgc
tccatttacgggagtggccactacatcacctttgacgggaagtactacgactttgacgga
cactgctcctacgtggctgttcaggactactgcggccagaactcctcactgggctcattc
agcatcatcaccgagaacgtcccctgtggcactacgggcgtcacctgctccaaggccatc
aagatcttcatggggaggacggagctgaagttggaagacaagcaccgtgtggtgatccag
cgtgatgagggtcaccacgtggcctacaccacgcgggaggtgggccagtacctggtggtg
gagtccagcacgggcatcatcgtcatctgggacaagaggaccaccgtgttcatcaagctg
gctccctcctacaagggcaccgtgtgtggcctgtgtgggaactttgaccaccgctccaac
aacgacttcaccacgcgggaccacatggtggtgagcagcgagctggacttcgggaacagc
tggaaggaggcccccacctgcccagatgtgagcaccaaccccgagccctgcagcctgaac
ccgcaccgccgctcctgggccgagaagcagtgcagcatcctcaaaagcagcgtgttcagc
atctgccacagcaaggtggaccccaagcccttctacgaggcctgtgtgcacgactcgtgc
tcctgtgacacgggtggggactgtgagtgcttctgctctgccgtggcctcctacgcccag
gagtgtaccaaagagggggcctgcgtgttctggaggacgccggacctgtgccccatattc
tgcgactactacaaccctccgcatgagtgtgagtggcactatgagccatgtgggaaccgg
agcttcgagacctgcaggaccatcaatggcatccactccaacatctccgtgtcctacctg
gagggctgctacccccggtgccccaaggacaggcccatctatgaggaggatctgaagaag
tgtgtcactgcagacaagtgtggctgctatgtcgaggacacccactacccacctggagca
tcggttcccaccgaggagacctgcaagtcctgcgtgtgtaccaactcctcccaagtcgtc
tgcaggccggaggaaggaaagattcttaaccagacccaggatggcgccttctgctactgg
gagatctgtggccccaacgggacggtggagaagcacttcaacatctgttccattacgaca
cgcccgtccaccctgaccaccttcaccaccatcaccctccccaccacccccaccaccttc
accactaccaccaccaccaccaccccgacctccagcacagttttatcaacaactccgaag
ctgtgctgcctctggtctgactggatcaatgaggaccaccccagcagtggcagcgacgac
ggtgaccgagaaacatttgatggggtctgcggggcccctgaggacatcgagtgcaggtcg
gtcaaggatccccacctcagcttggagcagctaggccagaaggtgcagtgtgatgtctct
gttgggttcatttgcaagaatgaagaccagtttggaaatggaccatttggactgtgttac
gactacaagatacgtgtcaattgttgctggcccatggataagtgtatcaccactcccagc
cctccaactaccactcccagccctccaccaaccagcacgaccacccttccaccaaccacc
acccccagccctccaaccaccaccacaaccacccctccaccaaccaccacccccagccct
ccaataaccaccacgaccacccctccaccaaccaccactcccagccctccaataagcacc
acaaccacccctccaccaaccaccactcccagccctccaaccaccactcccagccctcca
accaccactcccagccctccaacaaccaccacaaccacccctccaccaaccaccactccc
agccctccaacgactacgcccatcactccaccagccagcactaccacccttccaccaacc
accactcccagccctccaacaaccaccacaaccacccctccaccaaccaccactcccagt
cctccaacgactacgcccatcactccaccaaccagcactactacccttccaccaaccacc
actcccagccctccaccaaccaccacaaccacccctccaccaaccaccactcccagccct
ccaacaaccaccactcccagtcctccaacaatcaccacaaccacccctccaccaaccacc
actcccagccctccaacaaccaccacgaccacccttccaccaaccaccacttccagccct
ctaacaactactcctctacctccatcaataactcctcctacattttcaccattctcaacg
acaacccctactaccccatgcgtgcctctctgcaattggactggctggctggattctgga
aaacccaactttcacaaaccaggtggagacacagaattgattggagacgtctgtggacca
ggctgggcagctaacatctcttgcagagccaccatgtatcctgatgttcccattggacag
cttggacaaacagtggtgtgtgatgtctctgtggggctgatatgcaaaaatgaagaccaa
aagccaggtggggtcatccctatggccttctgcctcaactacgagatcaacgttcagtgc
tgtgagtgtgtcacccaacccaccaccatgacaaccaccaccacagagaacccaactccg
acaccaatcaccaccaccactacggtgaccccaaccccaacacccaccagcacacagagt
acaacaccaacacccatcaccaccaccaatacggtaaccccaaccccaacccccactggc
acacagaccccaaccccgacacccatcaccaccaccaccactatggtgaccccaacacca
acaatcaccagcacacagaccccaaccccgacacccatcaccaccactacggtgacccca
accccaacacccaccagcacacagagaacaacaccgacatccatcaccaccaccaccacg
gtgaccccaaccccaacacccaccggcacacagaccccaaccacgacacccatcaccacc
accaccacggtgaccccaaccccaacacccaccggcacacagaccccaacaacgacaccc
atcaccaccaccaccatggtgaccccaaccccaacacccactggaacacagacccaaacc
ccaacacccatcaccaccaccactacggtgaccccaacccctacacccaccggcacacag
accccaacatcgacacccatcagcaccaccactacggtgaccccaacaccaacacccacc
ggcacacagaccccaaccctgacacccatcaccaccaccactacggtgaccccaacccca
acacccaccggcacacagaccccaaccacgacacccatcaccaccaccactacggtgacc
ccaaccccaacacccaccggcacaaagagtacaaccccgacatccatcaccaccaccact
atggtgaccccaaccccaccacccactggcacacagaccccaaccacgacacccatcacc
accaccactacggtgaccccaaccccaacacccaccggcacacagaccccaaccccgaca
cccatcaccaccaccaccacggtgaccccaaccccaacacccaccggcacacagacccca
acatcgacacccatcaccaccaacactacggtgaccccaaccccaacaccaaccggcaca
ccgagtacaaccctgacacccatcaccaccaccactatggtgaccccaaccccaacaccc
accggcacacagaccccaacatcgacacccatcagcaccaccactacggtgaccccaacc
tcaacacccaccggcacacagaccccaaccccgacacccatctccaccaccactacggtg
accccaaccccgacacccatctccaccaccactacagtgaccccaaccccaacacccacc
ggcacacagaccccaaccatgacacccatcaccaccaccaccacggtgaccccaacccca
acacccaccggcacacagaccccaacaacgacacccatcagcaccaccaccacagtgacc
ccaaccccaacacccaccggcacacagaccccaacatcgacacccatcaccaccaccact
acggtgaccccaaccccaacacccaccggcacacagaccccaaccacgacacccatcacc
accaccaccacggtgaccccaaccccaacacccaccggcacacagagtacaaccctgaca
cccatcaccaccaccaccacggtgacaccaaccccaacacccaccggcacacagacccca
accccgacacccatctccaccaccactacggtgaccccaaccccaacacccaccggcaca
cagaccccaaccacgacacccatcaccaccaccaccacggtgaccccaaccccaacaccc
accggcacacagaccccaacaacgacacccatcagcaccaccaccacggtgaccccaacc
ccaacacccaccggcacacagaccccaacatcgacacccatcaccaccaccactacggtg
accccaaccccaacacccaccggcacacagaccccaaccacgacacccatcaccaccacc
accacggtgaccccaaccccaacacccactggcacacaggccccaaccccaacagccatc
accaccaccactacggtgaccccaaccccaacacccaccggcacacagaccccaacaacg
acacccatcaccaccaccaccatggtgaccccaaccccaacacccaccggcacacagacc
ccaacatcgacacccatcaccaccaccactacggtgaccccaaccccaacacccaccggc
acacagaccccaaccccgacacccatctccaccaccactacggtgaccccaaccccaaca
cccaccggcacacagaccccaaccatgacacccatcaccaccaccaccacggtgacccca
accccaacacccaccggcacacagaccccaacaacgacacccatcagcaccaccaccacg
gtgaccccaaccccaacacccaccggcacacagaccccaacatcgacacccatcaccacc
accactacggtgaccccaaccccaacacccaccggcacacagaccccaaccccgacaccc
atcaccaccaccaccacggtgaccccaaccccaacacccaccggcacacagaccccaaca
tcgacacccatcaccaccaccactacggtgaccccaaccccaacacccaccggcacacag
accccaaccacgacacccatcaccaccaccaccacggtgaccccaaccccaacacccacc
ggcacacagagtacaaccctgacacccatcaccaccaccaccacggtgacaccaacccca
acacccaccggcacacaaaacccaacatcaacacccatcaccaccaccactacggtgacc
ccaaccccaaaacccaccggcacacagaccccaaccccaacacccatctccaccaccaat
aaggtgaccccaaccccaacacccaccggcacacagaccccaaccatgacacccatcacc
accaccaccacggtgaccccaaccccaacacccaccggcacacagaccccaacatcgaca
cccatcaccaccaccactacggtgaccccaaccccaacacccaccggcacacagacccca
accatgacacccatcaccaccaccaccacggtgaccccaaccccaacacccactggcaca
caggccccaaccccaacagccatcaccaccaccactacggtgaccccaaccccaacaccc
accggcacacagaccccaaccacgacacccatcaccaccaccaccacggtgaccccaacc
ccaacacccaccggcacacagagtacaaccctgacacccatcaccaccaccaccacggtg
acaccaaccccaacacccaccggcacacagaccccaaccccgacacccatctccaccacc
actacggtgaccccaaccccaacacccaccggcacacagaccccaaccatgacacccatc
accaccaccaccacggtgaccccaaccccaacacccaccggcacacagaccccaacaacg
acacccatcagcaccaccaccacggtgaccccaaccccaacacccaccggcacacagacc
ccaacatcgacacccatcaccaccaccactacggtgaccccaaccccaacacccaccggc
acacagaccccaaccacgacacccatcaccaccaccaccacggtgaccccaaccccaaca
cccactggcacacaggccccaaccccaacagccatcaccaccaccagtacggtgacccca
accccaacacccaccggcacacagaccccaaccacgacacccatcaccaccaccactacg
gtgacaccaaccccaacacccaccggcacacagtccccaaccccaacagccatcaccacc
accactacggtgaccccaaccccaacacccaccggcacacagaccccaacattgacgccc
atcaccaccaccactacggtgaccccaaccccaacacccaccggcacacagaccccaacc
ccgacacccatctccaccaccactacggtgaccccaaccccaacacccaccggcacacag
accccaaccacgacacccatcaccaccaccaccacggtgaccccaaccccgacacccacc
ggcacacagaccccaaccacggtactcatcaccaccaccactacgatgaccccaacccca
acacccaccagcacaaagagtacaaccgtgacacccatcaccaccacaactacggtgacc
gcaaccccaacacccaccggcacacagaccccaaccatgatacccatcagcaccaccact
acggtgaccccaaccccaacacccaccactggaagcacggggccccccacccacacaagc
acagcaccgattgctgagttgaccacatccaatcctccgcctgagtcctcaacccctcag
acctctcggtccacctcttcccctctcacggagtcaaccacccttctgagtaccctacca
cctgccattgagatgaccagcacggccccaccctccacacccacggcacccacgaccacg
agcggaggccacacactgtctccaccgcccagcaccaccacgtcccctccaggcaccccc
actcgcggtaccacgactgggtcatcttcagcccccacccccagcactgtgcagacgacc
accaccagtgcctggacccccacgccgaccccactctccacacccagcatcatcaggacc
acaggcctgaggccctacccttcctctgtgcttatctgctgtgtcctgaacgacacctac
tacgcaccaggtgaggaggtgtacaacggcacatacggagacacctgttatttcgtcaac
tgctcactgagctgtacgttggagttctataactggtcctgcccatccacgccctcccca
acacccacgccctccaagtcgacgcccacgccttccaagccatcgtccacgccctccaag
ccgacgcccggcaccaagccccccgagtgcccagactttgatcctcccagacaggagaac
gagacttggtggctgtgcgactgcttcatggccacgtgcaagtacaacaacacggtggag
atcgtgaaggtggagtgtgagccgccgcccatgcccacctgctccaacggcctccaaccc
gtgcgcgtcgaggaccccgacggctgctgctggcactgggagtgcgactgctactgcacg
ggctggggcgacccgcactatgtcaccttcgacggactctactacagctaccagggcaac
tgcacctacgtgctggtggaggagatcagcccctccgtggacaacttcggagtttacatc
gacaactaccactgcgatcccaacgacaaggtgtcctgcccccgcaccctcatcgtgcgc
cacgagacccaggaggtgctgatcaagaccgtgcatatgatgcccatgcaggtgcaggtg
caggtgaacaggcaggcggtggcactgccctacaagaagtacgggctggaggtgtaccag
tctggcatcaactacgtggtggacatccccgagctgggtgtcctcgtctcctacaatggc
ctgtccttctccgtcaggctgccctaccaccggtttggcaacaacaccaagggccagtgt
ggcacctgcaccaacaccacctccgacgactgcattctgcccagcggggagatcgtctcc
aactgtgaggctgcggctgaccagtggctggtgaacgacccctccaagccacactgcccc
cacagcagctccacgaccaagcgcccggccgtcactgtgcccgggggcggtaaaacgacc
ccacacaaggactgcaccccatctcccctctgccagctcatcaaggacagcctgtttgcc
cagtgccacgcactggtgcccccgcagcactactacgatgcctgcgtgttcgacagctgc
ttcatgccgggctcgagcctggagtgcgccagtctgcaggcctacgcagccctctgtgcc
cagcagaacatctgcctcgactggcggaaccacacgcatggggcctgcttggtggagtgc
ccatctcacagggagtaccaggcctgtggccctgcagaagagcccacgtgcaaatccagc
tcctcccagcagaacaacacagtcctggtggaaggctgcttctgtcctgagggcaccatg
aactacgctcctggctttgatgtctgcgtgaagacctgcggctgtgtgggacctgacaat
gtgcccagagagtttggggagcacttcgagttcgactgcaagaactgtgtctgcctggag
ggtggaagtggcatcatctgccaacccaagaggtgcagccagaagcccgttacccactgc
gtggaagacggcacctacctcgccacggaggtcaaccctgccgacacctgctgcaacatt
accgtctgcaagtgcaacaccagcctgtgcaaagagaagccctccgtgtgcccgctggga
ttcgaagtgaagagcaagatggtgcctggaaggtgctgtcctttctactggtgtgagtcc
aagggggtgtgtgttcacgggaatgctgagtaccagcccggttctccagtttattcctcc
aagtgccaggactgcgtgtgcacggacaaggtggacaacaacaccctgctcaacgtcatc
gcctgcacccacgtgccctgcaacacctcctgcagccctggcttcgaactcatggaggcc
cccggggagtgctgtaagaagtgtgaacagacgcactgtatcatcaaacggcccgacaac
cagcacgtcatcctgaagcccggggacttcaagagcgacccgaagaacaactgcacattc
ttcagctgcgtgaagatccacaaccagctcatctcgtccgtctccaacatcacctgcccc
aactttgatgccagcatttgcatcccgggctccatcacattcatgcccaatggatgctgc
aagacctgcacccctcgcaatgagaccagggtgccctgctccaccgtccccgtcaccacg
gaggtttcgtacgccggctgcaccaagaccgtcctcatgaatcattgctccgggtcctgc
gggacatttgtcatgtactcggccaaggcccaggccctggaccacagctgctcctgctgc
aaagaggagaaaaccagccagcgtgaggtggtcctgagctgccccaatggcggctcgctg
acacacacctacacccacatcgagagctgccagtgccaggacaccgtctgcgggctcccc
accggcacctcccgccgggcccggcgctcccctaggcatctggggagcgggtga

KEGG   Homo sapiens (human): 1015
Entry
1015              CDS       T01001                                 
Symbol
CDH17, CDH16, HPT-1, HPT1
Name
(RefSeq) cadherin-17 isoform 1 precursor
  KO
K06811  cadherin 17, LI cadherin
Organism
hsa  Homo sapiens (human)
Pathway
hsa04519  Cadherin signaling
hsa05226  Gastric cancer
Network
nt06240  Transcription (cancer)
nt06261  Gastric cancer
nt06549  Cadherin signaling
  Element
N00250  CDX2-overexpression to transcriptional activation
N01999  Clasical cadherin-catenin cell adhesion system
N02000  Clasical cadherin to Hippo signaling pathway
N02001  Clasical cadherin to Wnt signaling pathway
N02012  Clasical cadherin-p120-Rho signaling pathway
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09133 Signaling molecules and interaction
   04519 Cadherin signaling
    1015 (CDH17)
 09160 Human Diseases
  09162 Cancer: specific types
   05226 Gastric cancer
    1015 (CDH17)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   04515 Cell adhesion molecules [BR:hsa04515]
    1015 (CDH17)
Cell adhesion molecules [BR:hsa04515]
 Cadherins
  Major cadherins
   1015 (CDH17)
SSDB
Motif
Pfam: Cadherin Cadherin_FAT4_N RET_CLD1 Cadherin_CELSR2_9th SKG6
Other DBs
NCBI-GeneID: 1015
NCBI-ProteinID: NP_001138135
OMIM: 603017
HGNC: 1756
UniProt: Q12864
Structure
LinkDB
Position
8:complement(94127162..94217278)
AA seq 832 aa
MILQAHLHSLCLLMLYLATGYGQEGKFSGPLKPMTFSIYEGQEPSQIIFQFKANPPAVTF
ELTGETDNIFVIEREGLLYYNRALDRETRSTHNLQVAALDANGIIVEGPVPITIKVKDIN
DNRPTFLQSKYEGSVRQNSRPGKPFLYVNATDLDDPATPNGQLYYQIVIQLPMINNVMYF
QINNKTGAISLTREGSQELNPAKNPSYNLVISVKDMGGQSENSFSDTTSVDIIVTENIWK
APKPVEMVENSTDPHPIKITQVRWNDPGAQYSLVDKEKLPRFPFSIDQEGDIYVTQPLDR
EEKDAYVFYAVAKDEYGKPLSYPLEIHVKVKDINDNPPTCPSPVTVFEVQENERLGNSIG
TLTAHDRDEENTANSFLNYRIVEQTPKLPMDGLFLIQTYAGMLQLAKQSLKKQDTPQYNL
TIEVSDKDFKTLCFVQINVIDINDQIPIFEKSDYGNLTLAEDTNIGSTILTIQATDADEP
FTGSSKILYHIIKGDSEGRLGVDTDPHTNTGYVIIKKPLDFETAAVSNIVFKAENPEPLV
FGVKYNASSFAKFTLIVTDVNEAPQFSQHVFQAKVSEDVAIGTKVGNVTAKDPEGLDISY
SLRGDTRGWLKIDHVTGEIFSVAPLDREAGSPYRVQVVATEVGGSSLSSVSEFHLILMDV
NDNPPRLAKDYTGLFFCHPLSAPGSLIFEATDDDQHLFRGPHFTFSLGSGSLQNDWEVSK
INGTHARLSTRHTEFEEREYVVLIRINDGGRPPLEGIVSLPVTFCSCVEGSCFRPAGHQT
GIPTVGMAVGILLTTLLVIGIILAVVFIRIKKDKGKDNVESAQASEVKPLRS
NT seq 2499 nt   +upstreamnt  +downstreamnt
atgatacttcaggcccatcttcactccctgtgtcttcttatgctttatttggcaactgga
tatggccaagaggggaagtttagtggacccctgaaacccatgacattttctatttatgaa
ggccaagaaccgagtcaaattatattccagtttaaggccaatcctcctgctgtgactttt
gaactaactggggagacagacaacatatttgtgatagaacgggagggacttctgtattac
aacagagccttggacagggaaacaagatctactcacaatctccaggttgcagccctggac
gctaatggaattatagtggagggtccagtccctatcaccataaaagtgaaggacatcaac
gacaatcgacccacgtttctccagtcaaagtacgaaggctcagtaaggcagaactctcgc
ccaggaaagcccttcttgtatgtcaatgccacagacctggatgatccggccactcccaat
ggccagctttattaccagattgtcatccagcttcccatgatcaacaatgtcatgtacttt
cagatcaacaacaaaacgggagccatctctcttacccgagagggatctcaggaattgaat
cctgctaagaatccttcctataatctggtgatctcagtgaaggacatgggaggccagagt
gagaattccttcagtgataccacatctgtggatatcatagtgacagagaatatttggaaa
gcaccaaaacctgtggagatggtggaaaactcaactgatcctcaccccatcaaaatcact
caggtgcggtggaatgatcccggtgcacaatattccttagttgacaaagagaagctgcca
agattcccattttcaattgaccaggaaggagatatttacgtgactcagcccttggaccga
gaagaaaaggatgcatatgttttttatgcagttgcaaaggatgagtacggaaaaccactt
tcatatccgctggaaattcatgtaaaagttaaagatattaatgataatccacctacatgt
ccgtcaccagtaaccgtatttgaggtccaggagaatgaacgactgggtaacagtatcggg
acccttactgcacatgacagggatgaagaaaatactgccaacagttttctaaactacagg
attgtggagcaaactcccaaacttcccatggatggactcttcctaatccaaacctatgct
ggaatgttacagttagctaaacagtccttgaagaagcaagatactcctcagtacaactta
acgatagaggtgtctgacaaagatttcaagaccctttgttttgtgcaaatcaacgttatt
gatatcaatgatcagatccccatctttgaaaaatcagattatggaaacctgactcttgct
gaagacacaaacattgggtccaccatcttaaccatccaggccactgatgctgatgagcca
tttactgggagttctaaaattctgtatcatatcataaagggagacagtgagggacgcctg
ggggttgacacagatccccataccaacaccggatatgtcataattaaaaagcctcttgat
tttgaaacagcagctgtttccaacattgtgttcaaagcagaaaatcctgagcctctagtg
tttggtgtgaagtacaatgcaagttcttttgccaagttcacgcttattgtgacagatgtg
aatgaagcacctcaattttcccaacacgtattccaagcgaaagtcagtgaggatgtagct
ataggcactaaagtgggcaatgtgactgccaaggatccagaaggtctggacataagctat
tcactgaggggagacacaagaggttggcttaaaattgaccacgtgactggtgagatcttt
agtgtggctccattggacagagaagccggaagtccatatcgggtacaagtggtggccaca
gaagtaggggggtcttccttgagctctgtgtcagagttccacctgatccttatggatgtg
aatgacaaccctcccaggctagccaaggactacacgggcttgttcttctgccatcccctc
agtgcacctggaagtctcattttcgaggctactgatgatgatcagcacttatttcggggt
ccccattttacattttccctcggcagtggaagcttacaaaacgactgggaagtttccaaa
atcaatggtactcatgcccgactgtctaccaggcacacagagtttgaggagagggagtat
gtcgtcttgatccgcatcaatgatgggggtcggccacccttggaaggcattgtttcttta
ccagttacattctgcagttgtgtggaaggaagttgtttccggccagcaggtcaccagact
gggatacccactgtgggcatggcagttggtatactgctgaccacccttctggtgattggt
ataattttagcagttgtgtttatccgcataaagaaggataaaggcaaagataatgttgaa
agtgctcaagcatctgaagtcaaacctctgagaagctga

KEGG   Homo sapiens (human): 83998
Entry
83998             CDS       T01001                                 
Symbol
REG4, GISP, REG-IV, RELP
Name
(RefSeq) regenerating islet-derived protein 4 isoform 1 precursor
  KO
K22244  regenerating islet-derived protein 4
Organism
hsa  Homo sapiens (human)
Pathway
hsa05226  Gastric cancer
Network
nt06240  Transcription (cancer)
nt06261  Gastric cancer
  Element
N00250  CDX2-overexpression to transcriptional activation
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09162 Cancer: specific types
   05226 Gastric cancer
    83998 (REG4)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01002 Peptidases and inhibitors [BR:hsa01002]
    83998 (REG4)
  09183 Protein families: signaling and cellular processes
   04091 Lectins [BR:hsa04091]
    83998 (REG4)
Peptidases and inhibitors [BR:hsa01002]
 Peptidase inhibitors
  Family I63
   83998 (REG4)
Lectins [BR:hsa04091]
 C-Type lectins
  Others
   83998 (REG4)
SSDB
Motif
Pfam: Lectin_C
Other DBs
NCBI-GeneID: 83998
NCBI-ProteinID: NP_001152824
OMIM: 609846
HGNC: 22977
UniProt: Q9BYZ8
Structure
LinkDB
Position
1:complement(119794017..119811460)
AA seq 158 aa
MASRSMRLLLLLSCLAKTGVLGDIIMRPSCAPGWFYHKSNCYGYFRKLRNWSDAELECQS
YGNGAHLASILSLKEASTIAEYISGYQRSQPIWIGLHDPQKRQQWQWIDGAMYLYRSWSG
KSMGGNKHCAEMSSNNNFLTWSSNECNKRQHFLCKYRP
NT seq 477 nt   +upstreamnt  +downstreamnt
atggcttccagaagcatgcggctgctcctattgctgagctgcctggccaaaacaggagtc
ctgggtgatatcatcatgagacccagctgtgctcctggatggttttaccacaagtccaat
tgctatggttacttcaggaagctgaggaactggtctgatgccgagctcgagtgtcagtct
tacggaaacggagcccacctggcatctatcctgagtttaaaggaagccagcaccatagca
gagtacataagtggctatcagagaagccagccgatatggattggcctgcacgacccacag
aagaggcagcagtggcagtggattgatggggccatgtatctgtacagatcctggtctggc
aagtccatgggtgggaacaagcactgtgctgagatgagctccaataacaactttttaact
tggagcagcaacgaatgcaacaagcgccaacacttcctgtgcaagtaccgaccatag

KEGG   Homo sapiens (human): 5243
Entry
5243              CDS       T01001                                 
Symbol
ABCB1, ABC20, CD243, CLCS, ENPAT, GP170, MDR1, P-GP, PGY1, p-170
Name
(RefSeq) ATP-dependent translocase ABCB1 isoform 2
  KO
K05658  ATP-binding cassette, subfamily B (MDR/TAP), member 1 [EC:7.6.2.2]
Organism
hsa  Homo sapiens (human)
Pathway
hsa02010  ABC transporters
hsa04976  Bile secretion
hsa05206  MicroRNAs in cancer
hsa05226  Gastric cancer
Network
nt06240  Transcription (cancer)
nt06261  Gastric cancer
  Element
N00250  CDX2-overexpression to transcriptional activation
Disease
H01227  Inflammatory bowel disease (IBD)
H01529  Avascular necrosis of femoral head
H02831  Acute transient encephalopathy
Drug target
Biricodar dicitrate: D03128
Elacridar hydrochloride: D03968
Encequidar (DG03103): D11782 D11783
Tariquidar: D06008
Tesmilifene hydrochloride: D06084
Valspodar: D06277
Zosuquidar trihydrochloride: D06387
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09130 Environmental Information Processing
  09131 Membrane transport
   02010 ABC transporters
    5243 (ABCB1)
 09150 Organismal Systems
  09154 Digestive system
   04976 Bile secretion
    5243 (ABCB1)
 09160 Human Diseases
  09161 Cancer: overview
   05206 MicroRNAs in cancer
    5243 (ABCB1)
  09162 Cancer: specific types
   05226 Gastric cancer
    5243 (ABCB1)
 09180 Brite Hierarchies
  09183 Protein families: signaling and cellular processes
   02000 Transporters [BR:hsa02000]
    5243 (ABCB1)
   04147 Exosome [BR:hsa04147]
    5243 (ABCB1)
   04090 CD molecules [BR:hsa04090]
    5243 (ABCB1)
Enzymes [BR:hsa01000]
 7. Translocases
  7.6  Catalysing the translocation of other compounds
   7.6.2  Linked to the hydrolysis of a nucleoside triphosphate
    7.6.2.2  ABC-type xenobiotic transporter
     5243 (ABCB1)
Transporters [BR:hsa02000]
 ABC transporters, eukaryotic type
  ABCB (MDR/TAP) subfamily
   ABCB1, 4, 5 subgroups
    5243 (ABCB1)
Exosome [BR:hsa04147]
 Exosomal proteins
  Exosomal proteins of other body fluids (saliva and urine)
   5243 (ABCB1)
CD molecules [BR:hsa04090]
 Proteins
  5243 (ABCB1)
SSDB
Motif
Pfam: ABC_membrane ABC_tran SMC_N ABC_ATPase AAA_22 Rad17 RsgA_GTPase AAA_16 AAA_5 Zeta_toxin AAA_29 AAA_14 G-alpha AAA_7 AAA_15 DUF815 DUF2207_C NTPase_1
Other DBs
NCBI-GeneID: 5243
NCBI-ProteinID: NP_000918
OMIM: 171050
HGNC: 40
UniProt: P08183 A4D1D2
Structure
LinkDB
Position
7:complement(87503017..87713295)
AA seq 1280 aa
MDLEGDRNGGAKKKNFFKLNNKSEKDKKEKKPTVSVFSMFRYSNWLDKLYMVVGTLAAII
HGAGLPLMMLVFGEMTDIFANAGNLEDLMSNITNRSDINDTGFFMNLEEDMTRYAYYYSG
IGAGVLVAAYIQVSFWCLAAGRQIHKIRKQFFHAIMRQEIGWFDVHDVGELNTRLTDDVS
KINEGIGDKIGMFFQSMATFFTGFIVGFTRGWKLTLVILAISPVLGLSAAVWAKILSSFT
DKELLAYAKAGAVAEEVLAAIRTVIAFGGQKKELERYNKNLEEAKRIGIKKAITANISIG
AAFLLIYASYALAFWYGTTLVLSGEYSIGQVLTVFFSVLIGAFSVGQASPSIEAFANARG
AAYEIFKIIDNKPSIDSYSKSGHKPDNIKGNLEFRNVHFSYPSRKEVKILKGLNLKVQSG
QTVALVGNSGCGKSTTVQLMQRLYDPTEGMVSVDGQDIRTINVRFLREIIGVVSQEPVLF
ATTIAENIRYGRENVTMDEIEKAVKEANAYDFIMKLPHKFDTLVGERGAQLSGGQKQRIA
IARALVRNPKILLLDEATSALDTESEAVVQVALDKARKGRTTIVIAHRLSTVRNADVIAG
FDDGVIVEKGNHDELMKEKGIYFKLVTMQTAGNEVELENAADESKSEIDALEMSSNDSRS
SLIRKRSTRRSVRGSQAQDRKLSTKEALDESIPPVSFWRIMKLNLTEWPYFVVGVFCAII
NGGLQPAFAIIFSKIIGVFTRIDDPETKRQNSNLFSLLFLALGIISFITFFLQGFTFGKA
GEILTKRLRYMVFRSMLRQDVSWFDDPKNTTGALTTRLANDAAQVKGAIGSRLAVITQNI
ANLGTGIIISFIYGWQLTLLLLAIVPIIAIAGVVEMKMLSGQALKDKKELEGSGKIATEA
IENFRTVVSLTQEQKFEHMYAQSLQVPYRNSLRKAHIFGITFSFTQAMMYFSYAGCFRFG
AYLVAHKLMSFEDVLLVFSAVVFGAMAVGQVSSFAPDYAKAKISAAHIIMIIEKTPLIDS
YSTEGLMPNTLEGNVTFGEVVFNYPTRPDIPVLQGLSLEVKKGQTLALVGSSGCGKSTVV
QLLERFYDPLAGKVLLDGKEIKRLNVQWLRAHLGIVSQEPILFDCSIAENIAYGDNSRVV
SQEEIVRAAKEANIHAFIESLPNKYSTKVGDKGTQLSGGQKQRIAIARALVRQPHILLLD
EATSALDTESEKVVQEALDKAREGRTCIVIAHRLSTIQNADLIVVFQNGRVKEHGTHQQL
LAQKGIYFSMVSVQAGTKRQ
NT seq 3843 nt   +upstreamnt  +downstreamnt
atggatcttgaaggggaccgcaatggaggagcaaagaagaagaacttttttaaactgaac
aataaaagtgaaaaagataagaaggaaaagaaaccaactgtcagtgtattttcaatgttt
cgctattcaaattggcttgacaagttgtatatggtggtgggaactttggctgccatcatc
catggggctggacttcctctcatgatgctggtgtttggagaaatgacagatatctttgca
aatgcaggaaatttagaagatctgatgtcaaacatcactaatagaagtgatatcaatgat
acagggttcttcatgaatctggaggaagacatgaccaggtatgcctattattacagtgga
attggtgctggggtgctggttgctgcttacattcaggtttcattttggtgcctggcagct
ggaagacaaatacacaaaattagaaaacagttttttcatgctataatgcgacaggagata
ggctggtttgatgtgcacgatgttggggagcttaacacccgacttacagatgatgtctcc
aagattaatgaaggaattggtgacaaaattggaatgttctttcagtcaatggcaacattt
ttcactgggtttatagtaggatttacacgtggttggaagctaacccttgtgattttggcc
atcagtcctgttcttggactgtcagctgctgtctgggcaaagatactatcttcatttact
gataaagaactcttagcgtatgcaaaagctggagcagtagctgaagaggtcttggcagca
attagaactgtgattgcatttggaggacaaaagaaagaacttgaaaggtacaacaaaaat
ttagaagaagctaaaagaattgggataaagaaagctattacagccaatatttctataggt
gctgctttcctgctgatctatgcatcttatgctctggccttctggtatgggaccaccttg
gtcctctcaggggaatattctattggacaagtactcactgtattcttttctgtattaatt
ggggcttttagtgttggacaggcatctccaagcattgaagcatttgcaaatgcaagagga
gcagcttatgaaatcttcaagataattgataataagccaagtattgacagctattcgaag
agtgggcacaaaccagataatattaagggaaatttggaattcagaaatgttcacttcagt
tacccatctcgaaaagaagttaagatcttgaagggtctgaacctgaaggtgcagagtggg
cagacggtggccctggttggaaacagtggctgtgggaagagcacaacagtccagctgatg
cagaggctctatgaccccacagaggggatggtcagtgttgatggacaggatattaggacc
ataaatgtaaggtttctacgggaaatcattggtgtggtgagtcaggaacctgtattgttt
gccaccacgatagctgaaaacattcgctatggccgtgaaaatgtcaccatggatgagatt
gagaaagctgtcaaggaagccaatgcctatgactttatcatgaaactgcctcataaattt
gacaccctggttggagagagaggggcccagttgagtggtgggcagaagcagaggatcgcc
attgcacgtgccctggttcgcaaccccaagatcctcctgctggatgaggccacgtcagcc
ttggacacagaaagcgaagcagtggttcaggtggctctggataaggccagaaaaggtcgg
accaccattgtgatagctcatcgtttgtctacagttcgtaatgctgacgtcatcgctggt
ttcgatgatggagtcattgtggagaaaggaaatcatgatgaactcatgaaagagaaaggc
atttacttcaaacttgtcacaatgcagacagcaggaaatgaagttgaattagaaaatgca
gctgatgaatccaaaagtgaaattgatgccttggaaatgtcttcaaatgattcaagatcc
agtctaataagaaaaagatcaactcgtaggagtgtccgtggatcacaagcccaagacaga
aagcttagtaccaaagaggctctggatgaaagtatacctccagtttccttttggaggatt
atgaagctaaatttaactgaatggccttattttgttgttggtgtattttgtgccattata
aatggaggcctgcaaccagcatttgcaataatattttcaaagattataggggtttttaca
agaattgatgatcctgaaacaaaacgacagaatagtaacttgttttcactattgtttcta
gcccttggaattatttcttttattacatttttccttcagggtttcacatttggcaaagct
ggagagatcctcaccaagcggctccgatacatggttttccgatccatgctcagacaggat
gtgagttggtttgatgaccctaaaaacaccactggagcattgactaccaggctcgccaat
gatgctgctcaagttaaaggggctataggttccaggcttgctgtaattacccagaatata
gcaaatcttgggacaggaataattatatccttcatctatggttggcaactaacactgtta
ctcttagcaattgtacccatcattgcaatagcaggagttgttgaaatgaaaatgttgtct
ggacaagcactgaaagataagaaagaactagaaggttctgggaagatcgctactgaagca
atagaaaacttccgaaccgttgtttctttgactcaggagcagaagtttgaacatatgtat
gctcagagtttgcaggtaccatacagaaactctttgaggaaagcacacatctttggaatt
acattttccttcacccaggcaatgatgtatttttcctatgctggatgtttccggtttgga
gcctacttggtggcacataaactcatgagctttgaggatgttctgttagtattttcagct
gttgtctttggtgccatggccgtggggcaagtcagttcatttgctcctgactatgccaaa
gccaaaatatcagcagcccacatcatcatgatcattgaaaaaacccctttgattgacagc
tacagcacggaaggcctaatgccgaacacattggaaggaaatgtcacatttggtgaagtt
gtattcaactatcccacccgaccggacatcccagtgcttcagggactgagcctggaggtg
aagaagggccagacgctggctctggtgggcagcagtggctgtgggaagagcacagtggtc
cagctcctggagcggttctacgaccccttggcagggaaagtgctgcttgatggcaaagaa
ataaagcgactgaatgttcagtggctccgagcacacctgggcatcgtgtcccaggagccc
atcctgtttgactgcagcattgctgagaacattgcctatggagacaacagccgggtggtg
tcacaggaagagattgtgagggcagcaaaggaggccaacatacatgccttcatcgagtca
ctgcctaataaatatagcactaaagtaggagacaaaggaactcagctctctggtggccag
aaacaacgcattgccatagctcgtgcccttgttagacagcctcatattttgcttttggat
gaagccacgtcagctctggatacagaaagtgaaaaggttgtccaagaagccctggacaaa
gccagagaaggccgcacctgcattgtgattgctcaccgcctgtccaccatccagaatgca
gacttaatagtggtgtttcagaatggcagagtcaaggagcatggcacgcatcagcagctg
ctggcacagaaaggcatctatttttcaatggtcagtgtccaggctggaacaaagcgccag
tga

DBGET integrated database retrieval system