KEGG   Branchiostoma floridae (Florida lancelet): 118429838
Entry
118429838         CDS       T01074                                 
Name
(RefSeq) arginine-glutamic acid dipeptide repeats protein-like isoform X1
  KO
K05628  arginine-glutamic acid dipeptide repeats protein
Organism
bfo  Branchiostoma floridae (Florida lancelet)
Brite
KEGG Orthology (KO) [BR:bfo00001]
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:bfo03000]
    118429838
Transcription factors [BR:bfo03000]
 Eukaryotic type
  Zinc finger
   Cys4 GATA-factors
    118429838
SSDB
Motif
Pfam: Atrophin-1 BAH ELM2 GATA Myb_DNA-binding Myb_DNA-bind_7
Other DBs
NCBI-GeneID: 118429838
NCBI-ProteinID: XP_035696332
UniProt: A0A9J7N7R1
LinkDB
Position
14:13237003..13332544
AA seq 1214 aa
MGEEGNKQDMEPRETTRRKRYSIEETREEKPKRRLRNQPEITSWTVGGVSYKVGDCVYID
SQRADNPYYICSIQEFRMTKKETVYLEVKWFYRTSEVPDSVYHLLVQDRNSENSSGEDSV
IKESLMKSRELFISDATDSYPVSALRGRCCVHHYPDIIAAKAFEPNPNTFFYILGYNPET
RRLNGTQGEIRVGPSHQAKLPEYNPNRPDDRELQEELVWTPKVNDCDLLMYLRAARSMAA
FAGMCDGGSPDDGCLAASRDETTINALDTLHQFDYNTGKALQALVKRPVPVSADRKWTDD
ETKRFVKGLRQFGKNFFRIRKELLPEKETSELVEFYYLWKKTPAANNCRPHRRHRRQSTL
RRVRGVGSRAGNGNRPPSSEFLDLSSASETELENVDSEDSDSRDLSAYACRHCFTTTSRD
WHHGGHDKVILCTDCRIFFKKYGELRPIETPREPPPFMFKPVKEDKDDDAMNSGKAGMRT
RRNRDGVQSRHKGKASSSSPSESSSPVNHLANARHLGQKGRQSPSTGSTTSNSSDKSVKK
RKLGKGNKDEEKGSKKRHRDKSLSEESEATNLEEGRHKKSKQTSSRSESPSEAATSDSGS
INEESCSGIQEYTNEDENSSPSSPENDNDSDYEPPASTAREDSQPPSPVPIKPIPQVPRL
PTPPPMQELPATQPPVPAAAPAGPPVQLPPHPLPPPPMQEPDVPPPVPRLLPPPPPLQPA
VPLIIPKQEPKSPSPPPREPTPPREPTPPASPGTPPSSPEVPRSPRSPSPPAVVVDRQDH
VSASARFIRHLSRRDNTCSRTDLTFVPLENTKLAKKRAEATKKAEQAKHEEERKKQEDER
ETREREREQEQDREREEKQRRPPTSPPPRCSTADVQITGPHVHHQGHHGQGFPSSLSFQQ
GPFIGPDTPALRTLSEYARPHAMASQDPNMPYHLPPGFFIPGAPEHEIRERELREREMRE
REIRERELRESGFKPGYELKAMEHEQIRMSQMEMHRLWQGHPMAPGHQPGTAPPFASPFG
PYAPPGSSVLDQRERMAHPMHHMAESPHVNPVERLNAERLQAERNMALAMDPIVRLQLPG
ITPHHHQHSHSHTHVHLHPQDPLYGVPPHMIGPHPWPAGLPPPPNAGTPFQPPSPFLRGP
TPFLTREHEIHRETVLSRQYEGLIPPYMSASQQLSAQAQAEQLRLIEQQRYMQQHHEDYL
RRLQGEGDKPPGPP
NT seq 3645 nt   +upstreamnt  +downstreamnt
atgggagaggaaggtaataaacaagacatggagccacgtgaaacaacaaggagaaaacgc
tactctatcgaggaaacgagagaggagaaacccaagagacggttgcgaaatcagccggaa
ataacgtcctggactgtaggaggtgtcagttataaagtgggagattgtgtgtacatcgat
agccagcgtgcagacaacccttactacatctgctctatccaggagtttcgcatgacaaag
aaagagacagtttatctggaagtgaagtggttttacagaacgtcagaggttcctgactca
gtctaccatctcttggtacaggataggaactcagagaacagttcaggtgaagattccgtg
attaaggaatcccttatgaagtccagggaactcttcatctccgatgccacagacagttat
cctgtatctgctctcaggggaaggtgttgtgtgcatcactatcctgacatcatcgctgcc
aaagcctttgaacccaaccctaacaccttcttttacatcctgggctacaacccagagacg
aggaggttaaacggcacacaaggggagatccgagtaggccccagtcaccaggccaagctg
ccagagtacaaccctaacaggccggatgaccgagaacttcaggaggagctggtctggacc
ccaaaggtcaacgactgtgacctccttatgtacctcagggctgcaaggagtatggcagcg
ttcgctgggatgtgtgatggtggttcccctgacgacggttgcctagcagcatccagggat
gagacaaccatcaatgcccttgatacactgcaccagtttgactacaacacaggaaaggcc
ctgcaggccctggtgaaacgtcccgtccccgtatcggcggacaggaagtggacagacgat
gaaacgaaacgcttcgtgaaggggctgcggcagtttggcaagaatttcttccgtattcga
aaggagctgcttccagagaaggaaacgagcgagcttgttgagttctactatctgtggaag
aaaactccagcagcaaacaactgccgccctcaccgcaggcaccgcagacagagcacactt
aggagggtccgtggagtggggtcgcgtgctggcaacgggaacaggcctccctcttctgaa
ttcttggacctgagctctgccagcgagacagaactggagaacgtggacagtgaggacagc
gactcacgcgatctcagcgcctacgcttgcagacactgtttcactacaacttctcgagat
tggcaccacggaggccacgataaggtcatcctttgcacagactgtcggatcttcttcaag
aagtatggggagcttcggccgatagagactccgcgggaacccccgccattcatgtttaaa
cctgtcaaagaggataaggacgatgatgcaatgaattctgggaaagctggtatgaggaca
cggcgcaatagggatggggtgcaatctagacacaaggggaaggccagctccagcagtcct
tcggagtcttcctctcctgtcaatcacttggccaacgcccgtcacctgggacagaagggg
cgacagtctcccagcacgggcagcaccaccagcaacagtagtgacaagtctgtgaagaag
agaaaacttggaaagggtaataaggatgaggagaagggcagtaagaaacgtcaccgagac
aagagtctgtcagaggagtctgaggcgaccaacttagaggaaggacgccataaaaagtcg
aagcagacttctagtcgatcggagagtccgtcagaagctgccaccagtgacagtgggagc
atcaacgaagaaagctgcagtgggatacaggagtataccaatgaggatgagaactcatcg
ccatcgagtccagaaaacgacaacgacagtgactacgagccgccagcctccaccgctcgc
gaggactcgcagccgcccagtcctgttcccatcaagcccatcccgcaggtgccacgccta
cccacgccccctcccatgcaggagttaccggccacgcagccacctgtaccagctgcggca
cctgcaggcccacctgttcagctgccgccacaccccctccctccacctcccatgcaggag
ccagacgtccccccaccagtgccaagacttctccccccacccccaccactccagcctgct
gtccccctcatcatccccaaacaggaacccaagtccccctctcctccccccagagaaccc
acaccccctcgggagcccacccccccagccagcccaggcacccccccttccagcccagaa
gtgcccagaagcccccggagcccgtcaccgcctgcagtggtggttgacagacaagatcac
gttagtgcctcagcaagatttatccgccaccttagtcgtcgtgacaacacgtgcagtcgg
acggaccttacgttcgttccactggagaacaccaagctcgccaagaagcgcgccgaggcc
accaagaaggcggagcaggccaaacacgaggaagagaggaagaaacaggaggacgagagg
gagacgagagaacgggaacgggaacaggaacaggaccgggagagggaggagaaacagcgg
aggcccccgacctccccacccccgcggtgttccacagccgatgtccagatcacaggccca
catgtccaccaccaagggcaccacggccagggattcccctcgtccctgtccttccaacaa
ggacccttcatcgggccggacacccctgccttacgcacgctcagcgagtacgcccgtcca
cacgccatggcttcccaggaccccaacatgccttaccacctcccgcccggcttcttcatt
cccggcgctcccgagcacgagatacgggagcgcgaactccgcgagagggagatgagagag
cgggagattcgggaacgcgagctgagggagtcgggattcaaacctggatacgaactcaaa
gcaatggagcacgagcagatccgtatgagccagatggagatgcacagactgtggcagggc
caccccatggctcccggccaccagccgggcaccgcccctccctttgccagcccgtttgga
ccttacgcccctcccggcagctcggtactagaccagagagagaggatggcccaccccatg
caccacatggccgagtctcctcacgtcaaccctgtggaaagactaaacgcagagagactg
caggcggagcgcaacatggcgctggcgatggaccccatcgtgagattacagctgcctgga
atcacgccgcaccatcaccaacattcccactcccacacacacgtacacctgcatcctcag
gacccgctgtacggagtaccgcctcacatgattggtccccacccctggccggcgggactg
cccccgccgcctaacgcgggaaccccattccagccgccgtcgcccttcctacgcggtccg
acgccgttcctgacgcgggagcatgagattcatcgagaaacggtgctgtctcgccagtat
gaagggctgatcccaccgtacatgagcgcctcccagcagctgtcagcacaagcgcaggcc
gagcagttgagactgatcgagcagcagcggtacatgcagcagcatcatgaggactatctc
aggagactgcaaggagagggagacaaacccccaggacccccctaa

DBGET integrated database retrieval system