KEGG   Sarcophilus harrisii (Tasmanian devil): 100918059
Entry
100918059         CDS       T02286                                 
Symbol
NDST2
Name
(RefSeq) bifunctional heparan sulfate N-deacetylase/N-sulfotransferase 2 isoform X1
  KO
K02577  heparan sulfate N-deacetylase/N-sulfotransferase NDST2 [EC:3.5.1.- 2.8.2.8]
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr00534  Glycosaminoglycan biosynthesis - heparan sulfate / heparin
shr01100  Metabolic pathways
Module
shr_M00059  Glycosaminoglycan biosynthesis, heparan sulfate backbone
Brite
KEGG Orthology (KO) [BR:shr00001]
 09100 Metabolism
  09107 Glycan biosynthesis and metabolism
   00534 Glycosaminoglycan biosynthesis - heparan sulfate / heparin
    100918059 (NDST2)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01003 Glycosyltransferases [BR:shr01003]
    100918059 (NDST2)
Enzymes [BR:shr01000]
 2. Transferases
  2.8  Transferring sulfur-containing groups
   2.8.2  Sulfotransferases
    2.8.2.8  [heparan sulfate]-glucosamine N-sulfotransferase
     100918059 (NDST2)
Glycosyltransferases [BR:shr01003]
 Unclassified
  Sulfotransferases
   100918059 (NDST2)
SSDB
Motif
Pfam: HSNSD-CE HSNSD_N Sulfotransfer_1 Sulfotransfer_3
Other DBs
NCBI-GeneID: 100918059
NCBI-ProteinID: XP_012409090
Ensembl: ENSSHAG00000007678
UniProt: G3W0B3
LinkDB
Position
2:612663097..612667947
AA seq 883 aa
MLQLWRVVRPARQLELHRLVLLLIAFSLASMAILAYYVSTSPKAKEPLPLPPGDCGSSGD
AGPGPARPPAPPWPPRPSETARTEPVVLVFVESVYSQLGQEIVAILESSRFRYSTELAPG
RGDMPTLTERARGRYALVVYENLLKYVNLDAWSRELLDRYCVEYGVGIIGFFRAHEHSLL
SAQLKGFPLFLHSNLGLRDYQVNPAAPLLHLTRPSRLEPGPLPGEDWTVFQSNHSTYEPV
LLASARLAEAPALGPLPRGTRLPTVVQDLGLHDGIQRVLFGHSLAFWLHKLVFVDAVAYL
TGKRLCLALDRYILVDIDDIFVGKEGTRMKVADVEALLTTQSKLRALVPNFTFNLGFSGK
FYHTGTEEEDAGDDMLLRHRKEFWWFPHMWSHMQPHLFHNRSVLADQMRLNKQFALEHGI
PTDLGYAVAPHHSGVYPIHTQLYEAWKSVWGIQVTSTEEYPHLRPARYRRGFIHNGIMVL
PRQTCGLFTHTIFYNEYPGGSRELDRSIRGGELFLTVLLNPISIFMTHLSNYGNDRLGLY
TFESLVRFLQCWTRLRLQTLPPAPLAQKYFDLFPQERSPLWQNPCDDKRHKDIWSKEKTC
DRLPKFLIVGPQKTGTTAIHFFLSLHPAVTSSFPSASTFEEIQFFSGPNYYKGIDWYMDF
FPIPSNASTDFLFEKSATYFDSEVVPRRGAALLPRAKIITVLTNPADRAYSWYQHQRAHG
DPTALNHTFYQVISAPPQAPAALRALQNRCLVPGYYSTHLQRWLTYYPSGQVLIVDGQEL
RANPAASMENIQKFLGVTPVLNYTRTLRFDEGKGFWCQGLEGGKTRCLGKSKGRRYPDMD
PESRLLLVDFFRDHNLELSKLLSRLGQPLPSWLQEELQRSGLG
NT seq 2652 nt   +upstreamnt  +downstreamnt
atgctccagctgtggagagtggtccgcccggcccggcagctggagctgcatcgcctggtg
ctgctgctcatcgccttcagcctggcctccatggccattctggcctactacgtgtccacc
agccccaaggccaaggagcccctgcccctgcccccgggcgactgcggcagcagcggggac
gcggggcccggcccggcccgacccccggccccgccctggcccccccggccctcggagacg
gcccgcaccgagcccgtggtgctggtgttcgtggagagcgtctactcgcagctgggccag
gagatcgtggccatcttggaatccagccgcttccggtacagcacagagctggcccccggc
cgcggggacatgcccacgctgaccgagcgcgccaggggccggtacgccctcgtggtctat
gagaacctgctgaagtacgtcaacctggacgcctggagccgcgagctcctggaccgctac
tgcgtggaatacggggtgggcatcatcggcttcttccgcgcccacgagcacagcctgctc
agtgcccagctcaagggcttccccctcttcctgcattccaacctggggctgcgcgactac
caggtgaaccccgccgcccccctcctgcacctgacccggcccagccgcctggagcccggg
ccgctgcccggggaggactggacggtcttccagtccaaccacagcacctacgagccggtg
ctcctggccagcgcgcgcctggccgaggcccccgccctgggcccgctgccccgcgggacg
cgcctgcccaccgtggtgcaggacctggggctgcatgacggcatccagcgcgtgctcttc
ggccacagcctcgccttctggctccacaagctggtctttgtggacgccgtcgcctacctc
accgggaagcgcctctgcctggcgctggatcgctacatcctggtggacatcgatgacatc
ttcgtgggcaaggagggcacccgcatgaaggtggctgacgtcgaggccctgctgaccacg
cagagtaaactcagggccttagtgcccaacttcacgttcaatctgggcttctcaggaaag
ttctaccacacagggacggaggaggaggacgcgggggacgacatgctcctgaggcatcgc
aaggagttctggtggttcccccacatgtggagccacatgcagccgcacctgttccacaac
cgctctgtgctggccgaccagatgcggctcaacaagcagtttgctctggagcatgggatc
cccacggatctggggtatgcagtggctccccaccactcgggcgtgtaccccatccacaca
cagctctatgaggcctggaaatccgtgtgggggatccaggtgaccagcactgaagagtat
ccccacctccgccctgcacgctaccggcgaggctttattcacaacggcatcatggtcctg
ccccgacagacctgcggcctcttcactcacactatcttctacaacgagtaccccgggggc
tcccgagagctggaccgcagcatccgggggggcgagctcttcctcaccgttctgctcaac
ccgatcagcatcttcatgacccacctgtctaactacggcaatgaccggctgggcctgtac
acgttcgagagcctggtgcgcttcctgcagtgctggacccggctgcgcctgcagactctg
cccccagcccccctggcacaaaagtactttgacctcttccctcaggaacgcagcccgctc
tggcagaacccctgtgacgacaagagacacaaagacatctggtccaaagagaagacctgt
gatcggctccccaagttccttatcgtggggccccagaaaacggggaccacggccatccac
ttcttcctgagcctgcacccggccgtgaccagcagcttccccagcgccagcacctttgag
gagatccagttcttcagcggccccaactactacaagggcatcgactggtacatggatttt
ttcccgatcccctccaatgccagcactgacttcctttttgagaaaagcgccacctatttt
gactccgaggttgtaccacgtcggggggcggcccttttgccccgtgccaagatcatcaca
gtacttaccaaccctgctgaccgggcctactcctggtaccagcatcagcgagcacacggt
gaccccacagccctgaaccacaccttctaccaagtgatctcggcgcccccgcaggctccc
gcggcgctccgggccctgcagaaccgctgcctggtccctggctactactccactcacctg
cagcgctggctcacctactacccctctgggcaggtgctgatagtggatgggcaggagcta
cgggctaacccagcggcctccatggaaaacatccagaagttcctcggtgtcacccctgtt
ctcaactacacgagaaccctcaggtttgatgaggggaagggattctggtgccaggggcta
gagggtggcaagacccgatgcctgggcaagagcaaaggacggaggtacccggacatggac
cccgagtcccgtctcctcctggttgactttttccgggaccacaatctggaactgtccaaa
cttctgagccgcctcgggcagccgctgccctcttggcttcaggaagagctgcagcgctcg
ggcctggggtga

KEGG   Sarcophilus harrisii (Tasmanian devil): 100919030
Entry
100919030         CDS       T02286                                 
Symbol
NDST1
Name
(RefSeq) bifunctional heparan sulfate N-deacetylase/N-sulfotransferase 1 isoform X1
  KO
K02576  heparan sulfate N-deacetylase/N-sulfotransferase NDST1 [EC:3.5.1.- 2.8.2.8]
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr00534  Glycosaminoglycan biosynthesis - heparan sulfate / heparin
shr01100  Metabolic pathways
Module
shr_M00059  Glycosaminoglycan biosynthesis, heparan sulfate backbone
Brite
KEGG Orthology (KO) [BR:shr00001]
 09100 Metabolism
  09107 Glycan biosynthesis and metabolism
   00534 Glycosaminoglycan biosynthesis - heparan sulfate / heparin
    100919030 (NDST1)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01003 Glycosyltransferases [BR:shr01003]
    100919030 (NDST1)
Enzymes [BR:shr01000]
 2. Transferases
  2.8  Transferring sulfur-containing groups
   2.8.2  Sulfotransferases
    2.8.2.8  [heparan sulfate]-glucosamine N-sulfotransferase
     100919030 (NDST1)
Glycosyltransferases [BR:shr01003]
 Unclassified
  Sulfotransferases
   100919030 (NDST1)
SSDB
Motif
Pfam: HSNSD-CE HSNSD_N Sulfotransfer_1 Sulfotransfer_3
Other DBs
NCBI-GeneID: 100919030
NCBI-ProteinID: XP_031809387
Ensembl: ENSSHAG00000018096
UniProt: G3X0Z8
LinkDB
Position
2:382865330..382924149
AA seq 894 aa
MTIPLRRRRLCRQASPQAVLLLLFAFCVLSVFVSAYYLYGWKRNLEPSGEVTGPDCNEEP
QITPSRLLPLKTLKVSSSSRTDPLVLVFVESLYSQLGQEIVAILESSRFKYRTEIAPGKG
DMPTLTDKDRGRFALIIYENILKYVNLDAWNRELLDKYCVEYGVGIIGFFKANENSLLSA
QLKGFPLFLHSNLGLKDCSINPKSPLLYVTRPSEVEKGLLPGDDWTVFQSNHSTYEPVLL
AKTKSAESIPHLSVGAALHTTVVQDLGLHDGIQRVLFGNNLNFWLHKLVFVDAVAFLTGK
RLSLPLDRYILVDIDDIFVGKEGTRMKVEDVKALFDTQNKLRAHIPNFTFNLGYSGKFFH
TGTDAEDDGDDLLLSYVKEFWWFPHMWSHMQPHLFHNQSVLAEQMALNKKFAVEHGIPTD
MGYAVAPHHSGVYPVHMQLYEAWKQVWDIQVTSTEEYPHLKPARYRRGFIHNGIMVLPRQ
TCGLFTHTIFYNEYPGGSSELDKIINGGELFLTVLLNPISIFMTHLSNYGNDRLGLYTFK
HLVRFLHSWTNLKLQTLPPVQLAQRYFQIFSEEKDPLWQDPCEDKRHKDIWSKEKTCDRF
PKLLIIGPQKTGTTALYLFLGMHPDLSSNYPSPETFEEIQFFNGQNYHKGIDWYMEFFPI
PSNTTSDFYFEKSANYFDSEVAPQRAAALLPKAKVLTILIDPADRAYSWYQHQRAHDDPV
ALKYTFHEVITAGSNASPKLRALQNRCLVPGWYATHIEHWLSAYHANQGLDSWLKNVSYP
EQAKILVLDGKLLRTEPAKVMDTVQKFLGVTNIIDYHKTLVYDAKKGFWCQLLEGGKTKC
LGKSKGRKYPEMDLDSRAFLRDYYRDHNIELSKLLYKMGQTLPTWLREDLQNTR
NT seq 2685 nt   +upstreamnt  +downstreamnt
atgaccatccccctgcgccgaaggaggctttgcaggcaggcgtcccctcaggccgtgctg
ctcctgctgttcgccttctgtgtgctcagcgtttttgtctctgcctattacctgtatggc
tggaagaggaacctggagccctcaggtgaggtcactgggcctgactgtaatgaggagccc
cagatcactccatcgcgcctgctccccctcaagaccctgaaggtgtcctcctcatcgcga
actgatccccttgtgctggtgtttgtggagagcctgtactcccagctgggccaggagatc
gtggccatcctggagtccagccgattcaagtaccggacggagatcgcccccgggaagggg
gacatgcccacgctcacggacaaagacaggggccgctttgcactcatcatctatgagaac
atcctcaagtacgtcaacctggacgcctggaaccgggagctgctggataagtactgcgtg
gagtacggcgtcgggatcattggcttcttcaaggccaatgagaacagtctgctgagcgct
cagctcaaaggtttccccctcttcctccattcgaacctgggcctgaaggattgcagcatc
aaccccaagtccccactgctgtatgtgacccggcccagcgaggtggagaagggcctccta
cctggggacgactggaccgtcttccagtccaaccactccacctacgagccggtgctgctg
gccaagaccaagtcggccgagtccatcccccacctgagtgtgggggccgccttgcacacc
acggtggtgcaggacctggggctgcatgacggcatccagcgggtgctcttcggcaacaac
ctcaatttctggctgcacaagttggtctttgtggacgccgtcgctttcctcactggcaag
cgactttccttgcccctggaccgctacatcctggtggacatcgacgacatttttgtgggc
aaggagggcacccggatgaaggtggaggacgtgaaggccctttttgacactcagaataaa
ctgcgcgcgcacatcccaaacttcaccttcaacctggggtactcagggaaattcttccac
acaggtacagatgctgaggatgatggcgacgacctgctcttgtcctatgtgaaggagttc
tggtggttcccccacatgtggagtcacatgcagccccacctcttccacaaccagtctgtg
ctggctgagcagatggccctcaacaagaagttcgctgtcgagcatgggatccccaccgac
atgggctacgccgtggccccccatcactctggcgtgtacccagtccacatgcagctgtat
gaggcctggaagcaggtgtgggacatccaggtgaccagcaccgaggagtatccccacctg
aagcctgcccgttaccgccggggcttcatccacaacgggatcatggtcctcccccggcag
acctgtggcctcttcacccacaccatattctacaatgagtatcccggaggctccagcgaa
ctggacaaaatcatcaatgggggagagctcttcctcactgtgctgctcaatcccatcagc
atcttcatgacccacctgtccaactacggcaatgatcgcctgggactgtacaccttcaaa
cacttggtgcgcttccttcactcttggaccaacctgaagctgcaaacgctgccaccagtc
cagctggctcaacgatactttcagatcttttctgaggagaaggaccctctgtggcaggat
ccttgtgaagacaagcggcacaaagacatctggtcaaaggaaaaaacgtgtgaccggttc
cccaaactcttaatcattggaccccaaaaaacaggcactacagccctgtatctgttcctg
ggcatgcacccggacctgagcagtaactacccaagcccagagacctttgaagagattcag
tttttcaatggccaaaactaccacaaaggcatagactggtacatggagttcttccccatc
ccctccaacaccacctctgacttctactttgagaaaagcgccaactacttcgactccgag
gtggctccccagcgggccgcggccttgctgcccaaggccaaagtcctcaccattctcata
gaccctgcggaccgagcctactcttggtaccagcatcagcgagcccatgatgaccccgtt
gccctgaagtacaccttccatgaagtgatcacggccgggtcgaacgcctccccgaagctg
cgggcacttcagaaccgctgcctagttcctggctggtacgccacccacatcgagcactgg
cttagcgcttaccacgccaaccagggactggactcctggctgaaaaatgtttcttatcct
gaacaagcaaagatcctggtgttggatggcaaactgctccgaaccgaaccggccaaagtg
atggacactgtgcagaagttccttggggtgaccaacattattgattaccataaaactctg
gtgtatgatgcaaagaaaggattctggtgtcagctgcttgaaggagggaaaaccaagtgc
ttgggcaaaagcaaaggcaggaagtacccagaaatggatttggattcacgagccttctta
agagactattaccgggaccataacatcgaactttctaagctgctttataagatgggccag
actttgcccacttggctgcgggaggacctccagaacactaggtag

KEGG   Sarcophilus harrisii (Tasmanian devil): 100925822
Entry
100925822         CDS       T02286                                 
Symbol
GLCE
Name
(RefSeq) D-glucuronyl C5-epimerase isoform X1
  KO
K01793  heparosan-N-sulfate-glucuronate 5-epimerase [EC:5.1.3.17]
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr00534  Glycosaminoglycan biosynthesis - heparan sulfate / heparin
shr01100  Metabolic pathways
Module
shr_M00059  Glycosaminoglycan biosynthesis, heparan sulfate backbone
Brite
KEGG Orthology (KO) [BR:shr00001]
 09100 Metabolism
  09107 Glycan biosynthesis and metabolism
   00534 Glycosaminoglycan biosynthesis - heparan sulfate / heparin
    100925822 (GLCE)
Enzymes [BR:shr01000]
 5. Isomerases
  5.1  Racemases and epimerases
   5.1.3  Acting on carbohydrates and derivatives
    5.1.3.17  heparosan-N-sulfate-glucuronate 5-epimerase
     100925822 (GLCE)
SSDB
Motif
Pfam: C5-epim_C Glce_b_sandwich DUF4962 EspG
Other DBs
NCBI-GeneID: 100925822
NCBI-ProteinID: XP_003755734
Ensembl: ENSSHAG00000013926
UniProt: A0A7N4PQJ6
LinkDB
Position
2:complement(515753806..515890125)
AA seq 617 aa
MRCLAARVNYKTLIVICALFTLVTVLLWNKCSSDKAIQFPRNWNKGLRADGLEKRAAASE
SDDSVNHAARQQPEEASPQEQQKAPPVVGGFNSQGNRVLGLKYEEIDCLINDEHTIKGRR
EGNEIFLPFSWVEKYFEVYGKMVQYDGYDRFEFSHSYSKVYAQRAPYHPDGVFMSFEGYN
VEVRDRVKCISGVEGVPLSTQWGPQGYFYPIQIAQYGLSHYSKNLTEKPPHIEVYETAED
KDKHSRSNDWTVPKGCFMTNVADKSRFTNVKQFVAPETSEGVSLQLGNTKDFIISFDLKL
LTNGSVSVVLETTEKNQLFTVHYVSNTQLIAFKERDIYYGIGPRTTWSTVTRDLVTDLRK
GVGLSNTKAVKQTKIMPKKVVRLIAKGKGFLDNITISTTAHMAAFFAASHWLVKNQDEKG
GWPIMVTRKLGEGFRSLEPGWYSAMAQGQAISTLVRAYLLTKDHVFLDSALRATAPYKFL
SEQHGVKAIFMNKYDWYEEYPTTPSSFVLNGFMYSLIGLYDLKETAGEKLGKEARLLYER
GMESLKAMLPLYDTGSGTIYDLRHFMLGSAPNLARWDYHTTHINQLQLLSTIDEAPVFRD
FVKRWKSYLKGNRAKHN
NT seq 1854 nt   +upstreamnt  +downstreamnt
atgcgttgcttggcagctcgggtcaactacaagactttgattgtcatctgtgcgctcttc
acattggtcacagtcctgctgtggaataagtgttccagtgataaggctattcagtttcca
cgaaactggaataaaggtctgagagcagacggattagaaaaaagagcagcagcttctgag
agtgacgatagcgtgaaccatgcagccaggcagcagccggaggaagcatccccgcaggag
cagcagaaagctccccctgtagtcggaggcttcaacagccagggcaacagagttctggga
ctcaagtatgaagagattgactgtctgataaatgatgaacacacaattaaaggaaggcga
gagggaaatgagatcttcctccctttctcttgggtagagaagtactttgaagtttatggg
aagatggtccagtacgacggctatgatcggtttgagttctctcacagttattccaaagtc
tacgcacagagagcgccttatcaccctgatggagtgtttatgtcctttgaaggctacaat
gtggaagtccgagatagagtcaagtgcataagtggtgtggaaggtgtaccactatccact
caatggggacctcaaggctatttttatccaatccagattgcacagtatggtttaagtcat
tacagcaagaatctaactgagaagcccccacacattgaagtttatgaaacagcagaagat
aaggacaaacacagtcggtctaatgattggactgtgccaaaaggctgcttcatgactaat
gtggctgataagtctagattcaccaatgttaaacaatttgttgctccagaaaccagtgag
ggcgtctctctacagcttggaaatacaaaagacttcattatttcctttgacctcaagtta
ctgacaaatgggagcgtgtctgtggttttggagaccacggagaaaaaccagctcttcaca
gtgcattatgtttcaaatactcagctgattgcttttaaagaaagagacatttactatggc
attgggcccagaactacttggagcactgtcacgagagacttggtcactgacctgaggaaa
ggagtgggtctctccaacacaaaagctgtcaagcagaccaaaataatgcccaagaaagta
gtcaggttaattgcaaaaggaaaaggattcctcgacaacatcaccatctctaccacggct
cacatggctgctttctttgcagcaagtcactggctggtcaaaaaccaggatgaaaaaggt
ggctggccaatcatggtgacacgaaagttaggcgaagggttcaggtctttagagcccggc
tggtactcagccatggcccagggacaagcgatttccaccctagtgagggcctatctctta
accaaggaccacgtattcctcgactcagctttacgggcaacggccccttataagttcttg
tcagagcaacatggcgtgaaagccatcttcatgaataaatatgactggtatgaagaatat
ccaaccacacctagctcctttgtcttaaatggcttcatgtactctctaattggactctat
gacttaaaagaaactgccggggaaaagctagggaaagaagccaggctgctgtacgagcgt
ggcatggagtccctgaaggccatgctccccttgtacgacaccggctcgggcacaatctac
gacctgcggcacttcatgctgggcagcgcgcccaacctggcccgctgggactaccacacc
acccacatcaaccagctgcagctgctgagtaccatcgacgaggccccggtcttcagagac
ttcgtcaagcggtggaagagttacctcaaaggcaacagggcaaagcacaactaa

KEGG   Sarcophilus harrisii (Tasmanian devil): 105751013
Entry
105751013         CDS       T02286                                 
Symbol
NDST4
Name
(RefSeq) bifunctional heparan sulfate N-deacetylase/N-sulfotransferase 4 isoform X1
  KO
K02579  heparan sulfate N-deacetylase/N-sulfotransferase NDST4 [EC:3.5.1.- 2.8.2.8]
Organism
shr  Sarcophilus harrisii (Tasmanian devil)
Pathway
shr00534  Glycosaminoglycan biosynthesis - heparan sulfate / heparin
shr01100  Metabolic pathways
Module
shr_M00059  Glycosaminoglycan biosynthesis, heparan sulfate backbone
Brite
KEGG Orthology (KO) [BR:shr00001]
 09100 Metabolism
  09107 Glycan biosynthesis and metabolism
   00534 Glycosaminoglycan biosynthesis - heparan sulfate / heparin
    105751013 (NDST4)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01003 Glycosyltransferases [BR:shr01003]
    105751013 (NDST4)
Enzymes [BR:shr01000]
 2. Transferases
  2.8  Transferring sulfur-containing groups
   2.8.2  Sulfotransferases
    2.8.2.8  [heparan sulfate]-glucosamine N-sulfotransferase
     105751013 (NDST4)
Glycosyltransferases [BR:shr01003]
 Unclassified
  Sulfotransferases
   105751013 (NDST4)
SSDB
Motif
Pfam: HSNSD-CE HSNSD_N Sulfotransfer_1 Sulfotransfer_3
Other DBs
NCBI-GeneID: 105751013
NCBI-ProteinID: XP_031797675
UniProt: A0A7N4P0H9
LinkDB
Position
6:122633425..123047659
AA seq 873 aa
MNLIVKVRRSFRTLILLLATFCLVSIVISAYFLYTGYKQEITLIETTAGAECADFKLLPY
RSMELKTVKPIDTSKTDPTVLLFVESQYSQLGQDIIAILESSRFQYHMVIAPGKGDIPPL
TDNGKGKYTLVIYENVLKYVTMDSWNRELLEKYCVEYSVSIIGFHKANENSLPSTKLKGF
PLNLYNNIALKDCFINPQSPLLHITKAPKFEKGPLPGEDWTIFQFNHSTYQPVLLTELQT
SKLFEVPLSSTNGYATVIQDLGLHDGIQRVLFGNNLNFWLHKLIFIDAISFLSGKKFTLS
LDRYILVDIDDIFVGKEGTRMNVKDVKALLETQNLLRTQVANFTFNLGFSGKFYHTGTEE
EDEGDDLLLRSVDEFWWFPHMWSHMQPHLFHNESSLVEQMILNKEFALEHGIPINMGYAV
APHHSGVYPVHIQLYEAWKKVWGIQVTSTEEYPHLKPARYRQGFIHNGIMVLPRQTCGLF
THTIFYKEYPGGPQELDKSIQGGELFLTILLNPISIFMTHLSNYGNDRLGLYTFVNLANF
IQSWTNLRLQTLPPVQLAHKYFELFPEQKDPLWQNPCDDKRHRDIWSRDKTCDHLPKFLV
IGPQKTGTTALYLFLLMHPSIISNLPSPKTFEEVQFFNGNNYQKGIDWYMDFFPIPSNIT
NDFLFEKSANYFHSEEAPRRAASLVPKAKIITILIDPSDRAYSWYQHQRSHEDPAALRFN
FYEVVTTGHWAPSELKTLQKRCLTPGWYAVHIERWLTYFSTSQLLIIDGQQLRSDPAAVM
DEVQKFLGVTPHYNYSEALTFDPQKGFWCQLLEGGKTKCLGKSKGRKYPSMDLESRAFLS
SYYRDHNVELSKLLHRLGQPLPSWLRQELQKVR
NT seq 2622 nt   +upstreamnt  +downstreamnt
atgaatctcattgtgaaggttcgaagaagctttcgaacattgatccttctcttagccacc
ttctgcttagtaagcattgtcatttctgcttattttctctacacgggctacaaacaagag
attacacttattgaaaccactgcaggagcagaatgtgctgacttcaaacttctaccctac
cggtcaatggaattgaagactgtcaaacctattgatacatcaaagacagacccaacagtt
ctcttatttgtggaaagccaatactctcagcttggtcaagatatcatagccatcttggag
tctagtcgatttcagtatcatatggtcattgctccaggcaaaggagacatacctcctctt
actgacaatggcaaaggaaaatatactttggttatctatgaaaatgttttgaaatatgtt
accatggactcatggaatcgagagcttttggaaaaatattgtgtggagtacagtgttagc
ataattggttttcataaagctaatgagaatagcttaccaagtacaaagttaaaaggtttt
cccttgaacctttataacaacatagccctaaaagattgctttataaatcctcagtctcct
ttgttgcatattaccaaagcaccaaagtttgagaaaggcccgctgcctggtgaagactgg
actattttccaattcaatcattcaacctaccaacctgttctcttaacagagttacagact
tcaaaactctttgaagttcccttatctagcactaatggttatgctacagtgattcaggac
ttggggcttcatgatgggattcagcgtgtcctttttggaaataatttgaacttttggttg
cacaagctcatcttcatagacgccatctctttcctgtcagggaagaagttcacattgtca
ttggacagatatatccttgtggacattgatgacatctttgttggcaaggaaggaacaagg
atgaatgtcaaagatgtaaaggcattactagagactcaaaatttactgcgcactcaggtt
gcaaattttaccttcaaccttggattttcagggaagttttaccatacaggaacggaagag
gaagatgaaggggatgacctcttgctgcgatctgtggatgagttctggtggtttcctcac
atgtggagtcacatgcaaccccatctcttccacaatgagtcatctctggtggagcagatg
attctcaacaaggaatttgcactagagcatggaattcctatcaatatgggctatgcagtg
gccccacaccattcaggggtctacccagtacatatacaactttacgaagcttggaaaaaa
gtctggggtattcaggtcacaagtacagaagagtatccacatcttaaacctgcacggtac
agacaaggcttcatccacaatggcatcatggtgctccctcgacagacatgtggattattt
acccacacaattttctataaggaatatccaggaggacctcaagaattggacaaaagtatc
caaggaggggaactttttcttactatccttctaaatcctatcagcatcttcatgacccat
ttgtccaattatgggaatgaccgcttggggttatatacctttgtgaacttggccaacttt
attcaaagctggactaacttgagactgcagactctgcctccagtgcagcttgctcataag
tactttgagctttttccagagcagaaagaccctctctggcagaatccatgtgatgataaa
cgacacagggacatctggtccagagacaaaacttgtgatcatttaccaaaattccttgtg
ataggacctcagaaaacaggtacaactgcactttatctatttctacttatgcatccttct
atcatcagcaatctccccagtccaaaaacatttgaagaagttcagttctttaatggaaac
aactatcagaagggaattgactggtatatggattttttccccatcccttctaatattacc
aatgattttctgtttgagaagagtgccaactatttccactcagaagaagctcccaggaga
gcagcatctctggttcccaaggccaagatcatcaccattcttattgatccctcagaccga
gcatactcttggtaccagcaccagcgatctcatgaggatccagctgccttgagatttaat
ttctatgaagttgttacaacaggacattgggccccctctgaattaaaaacactgcagaag
agatgcttgacccctggatggtatgcggtccacatagagagatggctaacttatttctct
acttctcagttgctgattattgatggacaacagttgagatctgacccagctgctgtgatg
gatgaagtgcagaagtttctgggagttactcctcattacaattactcagaagctctaacg
tttgaccctcaaaagggcttttggtgtcagctactagaaggaggaaaaaccaaatgcctt
gggaaaagtaaagggcgaaaatatccatcaatggatctagagtctagagcattcctctcc
agttactaccgagatcataatgtggagctgtccaaacttctgcacagactggggcaacct
ctgccatcgtggctgagacaggaactgcagaaagtaagatag

DBGET integrated database retrieval system