GLEANS Gupta Lab Evolutionary
Analysis Software
McMaster Faculty of Health Sciences
  • Home
  • Phylogeny Apps
    • Seq_rename
  • Compatibility Apps
    • None
  • CSI Apps
    • Sig_create
    • Sig_style
    • Sig_style2
  • Unsorted Apps
    • Align_dualstate
    • Clique_treeviewer
    • Seq_compile
    • Seq_order
  • Links
    • Microbial Evolution Resource
  • Log In

Log In

Forgot Password?

Enter the email associated with your account to reset your password. A new password will be sent to you.

Log Out

Are you sure you want to log out?

Change Password

Change Email

Phylogeny Apps
Seq_rename

SEQ_RENAME

Filters out the metadata for the given sequences, leaving only the species name. The species name can be optionally trucated to a set length.

Enter inputs and click "Run Program" to get started.
Required
Browse…
Optional
Uploading Files:

Program completed successfully.

Download result

  • Help
  • Examples
  • Downloads
This program filters out the metadata for each of the sequences in the given file, leaving only a truncated version of the species name.

If the "Maximum species name length" parameter is omitted, the entire species name will be included. Otherwise it is truncated to the given integer value.

The program filters only lines in the file which match raw sequence metadata. All other lines (including the sequence data itself) are ignored.

View sample input/output files. All files should be in a plain-text format (.txt, .csv, .xml, etc.).

Download files related to this app.

Sample Input Sample Output

Program Results


      

Example Input File

>gi|16129828|ref|NP_416390.1| arginine tRNA synthetase [Escherichia coli K12]
MNIQALLSEKVRQAMIAAGAPADCEPQVRQSAKVQFGDYQANGMMAVAKKLGMAPRQLAEQVLTHLDLNG
IASKVEIAGPGFINIFLDPAFLAEHVQQALASDRLGVATPEKQTIVVDYSAPNVAKEMHVGHLRSTIIGD
AAVRTLEFLGHKVIRANHVGDWGTQFGMLIAWLEKQQQENAGEMELADLEGFYRDAKKHYDEDEEFAERA
RNYVVKLQSGDEYFREMWRKLVDITMTQNQITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVE
SEGATVVFLDEFKNKEGEPMGVIIQKKDGGYLYTTTDIACAKYRYETLHADRVLYYIDSRQHQHLMQAWA
IVRKAGYVPESVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALERARRLVAEKNPDMPADELEKL
ANAVGIGAVKYADLSKNRTTDYIFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKAEIDEEQLAAAPVIIRE
DREAQLAARLLQFEETLTVVAREGTPHVMCAYLYDLAGLFSGFYEHCPILSAENEEVRNSRLKLAQLTAK
TLKLGLDTLGIETVERM

>gi|16273477|ref|NP_439728.1| arginyl-tRNA synthetase [Haemophilus influenzae Rd KW20]
MNIQSILSDKIKQAMILAGADQSCDALIRQSGKPQFGDYQANGIMAAAKKLGLNPREFAQKVLDNLQLSD
IAEKLEIAGPGFINIFLNPTWLTTEISAALSHKNLGIQATNKQTVVIDYSSPNVAKEMHVGHLRSTIIGD
AVARTLEFLGHNVIRANHVGDWGTQFGMLIAYLEKMQNEHASEMELQDLEAFYREAKKHYDEDEVFAEKA
RNYVVKLQSGDEYCRTMWKRLVDITMQQNQHNYARLNVTLTEKDVMGESLYNPMLPSIVKDLKKQGLAVE
NDGALVVYLDEFKNKDGDPMGVIVQKKDGGFLYTTTDIAAAKYRYETLKANRALVFSDTRQSQHMQQAWL
ITRKAGYVPDSFSLEHKNFGMMLGKDGKPFKTRTGGTVKLADLLDEAIERATVLINEKNTNLSNDEKEAV
IEAVGIGAVKYADLSKNRTTDYVFDWDNMLSFEGNTAPYMQYAYTRIRSIFNKTDINSTALLAAPLTIKD
DKERTLAIKLLQFEEAVQTVGKEGTPHVLCAYLYELAGIFSSFYEHCPILNAEDESIKLSRLKLALLTEK
TLKQGLTLLGIKTVEKM

>gi|9656621|gb|AAF95220.1| arginyl-tRNA synthetase [Vibrio cholerae O1 biovar eltor str. N16961]
MICLTSKMMAFLPYFLELKGKRVNIQALINDRVSQAIEAAGAPAGTPALVRQSAKAQFGDYQANGIMGAA
KQLGTNPREFAQKVLDVLNLEGIASKTEIAGPGFINIFLSEEFLAAQAEAALADARLGVAQEAPKTIVAD
YSAPNVAKEMHVGHLRSTIIGDAVVRTLEFLGHKVIRANHIGDWGTQFGMLIANLERVQKASGEVSMELS
DLEAFYRESKKLYDEDEQFAETARNYVVKLQGGDPFCLEMWKKLVDVTMIQNQRNYDRLNVSLTRENVMG
ESMYNDMLPQIVSDLKAKGLAVEDDGAQVVFLEEFKNKDGEPMGVIIQKRDGGFLYTTTDIACAKYRYET
LGADRVLYFIDSRQHQHLMQAWTIVRKAGYIPENVSLEHHAFGMMLGKDGRPFKTRAGGTVRLADLLDEA
QERAKALIESKNPELSAEEKANIANTVAMAAVKYADLSKHRTTDYVFDWDNMLAFEGNTAPYMQYAYTRV
ASVFAKAGVDMNELTGHIQITEEKEKALIAKLLQFEEAVQSVAREGQPHIMCSYLFELAGIFSSFYEACP
ILVAEQESIKQSRLKLAALTAKTIKQGLALLGIDTLERM

>gi|15677359|ref|NP_274514.1| arginyl-tRNA synthetase [Neisseria meningitidis MC58]
MNLHQTVEHEAAAAFAAAGIADSPIVLQPTKNAEHGDFQINGVMGAAKKAKQNPRELAQKVAEALADNAV
IESAEVAGPGFINLRLRPEFLAQNIQTALNDARFGVAKTDKPQTVVIDYSSPNLAKEMHVGHLRSSIIGD
SISRVLAFMGNTVIRQNHVGDWGTQFGMLVAYLVEQQKDNAAFELADLEQFYRAAKVRFDEDPAFADTAR
EYVVKLQGGDETVLALWKQFVDISLSHAQAVYDTLGLKLRPEDVAGESKYNDDLQPVVDDLVQKGLAVED
DGAKVVFLDEFKNKEGEPAAFIVQKQGGGFLYASTDLACLRYRIGRLKADRLLYVVDHRQALHFEQLFTT
SRKAGYLPENVGAAFIGFGTMMGKDGKPFKTRSGDTVKLVDLLTEAVERATALVKEKNPELGADEAAKIG
KTVGIGAVKYADLSKNRTSDYVFDWDAMLSFEGNTAPYLQYAYTRVQSVFRKAGEWDANAPTVLTEPLEK
QLAAELLKFEDVLQSVADTAYPHYLAAYLYQIATLFSRFYEACPILKAEGASRNSRLQLAKLTGDTLKQG
LDLLGIDVLDVM

>gi|15600244|ref|NP_253738.1| arginyl-tRNA synthetase [Pseudomonas aeruginosa PAO1]
MKDTIRQLIQQALDQLTADGTLPAGLTPDIQVENTKDRSHGDFASNIAMMLAKPAGMKPRDLAARLVEAI
PAHEQLAKVEIAGPGFLNFFQDHVWLAASLDRALADERLGVRKAGPAQRVVIDLSSPNLAKEMHVGHLRS
TIIGDAVARVLEFLGDTVIRQNHVGDWGTQFGMLLAYLEEQPVDAEAELHDLEVFYRAAKKRFDESPEFA
DRARELVVKLQAGDPDCLRLWTRFNEISLSHCQKVYDRLGVKLSMADVMGESAYNDDLAQVVADLTAKGL
LTEDNGALCVFLEEFKNAEGNPLPVIVQKAGGGYLYATTDLAAMRYRHNVLHADRVLYFVDQRQALHFQQ
VFEVARRAGFVPAGMELEHMGFGTMNGADGRPFKTRDGGTVKLIDLLEEAESRAYALVKERNEQRAERGE
EPFDEVQLREIGRVVGIDSVKYADLSKHRTSDYSFNFELMLSFEGNTAPYLLYACTRVASVFRKLGQGRE
QLGGKIVLEQPQELALAAQLAQFGDLINNVALKGVPHLLCAYLYELAGLFSSFYEHCPILTAEDPAQKDS
RLRLAALTGRTLEQGLELLGLKTLERM

>gi|22298368|ref|NP_681615.1| arginyl-tRNA-synthetase [Thermosynechococcus elongatus BP-1]
MVAPIKILGDRLRRALQAALPLDTYPQPLLVPASQVKFGDYQSNVCLSLAKQLGKAPRELAQEVVPHLEV
EDLCQPVEIAGPGFLNFRLKPEFLAATLQAARGSDRLGIPPAREPRRVVVDFSSPNIAKEMHVGHLRSTI
IGDCIARILEFQGHTVLRLNHVGDWGTQFGMLIAYLDEVYPDALTTANALDLGDLVTFYKKAKQRFDSDP
EFQQKARAKVVALQQGEEQSRRAWQLLCEQSRREFQKIYDLLDIQLTERGESFYNPFLPAVIEDLAACGL
LVEDQGAKVVFLEGFTNKEGQPQPLIIQKSDGGYNYATTDLAALRYRIDKDQADWIIYVTDVGQSTHFAQ
VFQVAQRAGWVPPHVTLTHVPFGLVLGEDGKRLKTRSGETIRLIDLLTEAIARSRADLEQRLATEGRTES
PEFIDTVARAIGIGAVKYADLSQNRNSNYVFSYDKMLSLQGNTAPYLLYAYVRVQGLTRRGDIDWCTLSP
DSPLLLEDETEQHLAKHLVQLEETLDLVSTELLPNRLCQYLFELSQLFNQFYDRCPILSAPQPTKQSRLT
LAYLTAQTLKLGLSLLGIPVLDRI

>gi|17231209|ref|NP_487757.1| arginyl-tRNA-synthetase [Nostoc sp. PCC 7120]
MNATQEQLKIKLEQALVAAFGDEYAGVDPILVSASNPKFGDYQANVALSLSKKLGQQPRAIASAIVEKLD
VSEICEKPEIAGPGFINLKLKTAYLEAQLNTIQADTRLGVPTAKHPQREIVDFSSPNIAKEMHVGHLRST
IIGDSIARILEFRGHDVLRLNHVGDWGTQFGMLITYLREVSPEALTTANALDIGDLVSFYRQAKQRFDAD
EAFQETARQEVVRLQAGAADTLHAWKLLCEQSRQEFQVIYDLLDVKLTERGESFYNPLLPTVVENLEKSG
LLVENQGAKCVFLDGFTNREGEPLPLIVQKSDGGYNYATTDLAALRYRIQKDEAKRIIYITDAGQANHFA
QFFQVARKAGWIPDDVELVHVPFGLVLGEDGKKFKTRSGDTVRLRDLLDEAISRAHADVEVRLKAEEREE
TAEFIDKVAEVVGISAVKYADLSQNRTSNYIFSYDKMLDLKGNTAPYMLYAYARIQGISRKGEINFADLG
DNAKVILQHETEFALAKYLLQLGEVISTVEEDLSPNRLCEYLYELSKRFNAFYDRNQGVQVLSAEEPLRT
SRLVLCDLTARTLKLGLSLLGIQVLERM

>gi|33864335|ref|NP_895895.1| Arginyl-tRNA synthetase [Prochlorococcus marinus str. MIT 9313]
MQAHFASEFMLSLAHALESQLRAAIDRAFPEAAASARESGTGLDPQLAPASKPEFGDFQANAALPLAKPL
KQPPRQIAAAIVDQLMVDTAFTAICLTPEIAGPGFINLTVRPECLAAEVQARLADARLGVPLVEGDNDGQ
QPTPVVVDFSSPNIAKEMHVGHLRSTIIGDSLARVLEFRGHPVLRLNHVGDWGTQFGMLITHLKQVAPEA
LETADAVDLGDLVVFYRQAKQRFDDDEAFQTTSREEVVKLQGGDPISLKAWSLLCDQSRREFQKIYDRLD
VRLNERGESFYNAYLESVVEDLNVSGLLVSDDGAQCVFLEGVTGKDGKPLPVIVQKSDGGFNYATTDLAA
MRYRFAAPPQGDGARRVIYVTDAGQANHFAGVFQVAQRAGWIPDAGRLQHVPFGLVQGEDGKKLKTRAGD
TVRLRELLDEAVERAESDLRRRLQEEGRDEDESFIEQVATTVGLAAVKYADLSQNRITNYQFSFDRMLAL
QGNTAPYLLYAVVRIAGIARKGGDLDVTTAELQFSETQEWALVRELLKFDAVIAEVEEELLPNRLCTYLF
ELSQVFNRFYDQVPVLKAEQPSRSCRLALCRLTADTLKLGLSLLGIPTLERM

>gi|15605181|ref|NP_219967.1| Arginyl tRNA Transferase [Chlamydia trachomatis D/UW-3/CX]
MTTLLSFLTSLCSAAIHQAFPELEELTLDITPSTKEHFGHYQCNDAMKLARVLHKSPRAIAESIVAHIPP
TPFSSIEIAGAGFINFTFSKEFLASQLQTFSKELANGFRAASPQKVIIDFSSPNIAKDMHVGHLRSTIIG
DCLARCFSFVGHDVLRLNHIGDWGTAFGMLITYLQETSQEAIHQLEDLTALYKKAHARFAEDSEFKKRSQ
HNVVALQSGDAQALALWKQICSVSEKSFQTIYSILDVELHTRGESFYNPFLAEVVADLESKNLVTLSDGA
KCVFHEAFSIPLMIQKSDGGYNYATTDVAAMRYRIQQDQADRILIVTDSGQSLHFQLLEATCLAAGYLPS
KGIFSHVGFGLVLDTQGRKFKTRSGENIKLRELLDTAVEKAKESLKAHRPDISEEELAYQGPILGINAIK
YADLSSHRINDYVFSFEKMLRFEGNTAMSLLYAYVRIQGIKRRMGLESPPQEGPLAVHEPAEEALALTLL
RFPEILDLTLRELCPHFLTDYLYALTNKFNAFFRDCHIEGSDSQQERLYLCGLTERTLSTGMHLLGLKTL
NHL

>gi|15836101|ref|NP_300625.1| arginyl tRNA transferase [Chlamydophila pneumoniae J138]
MSTLLSILSVICSQAIAKAFPNLEDWAPEITPSTKEHFGHYQCNDAMKLARVLKKAPRAIAEAIVAELPQ
EPFSLIEIAGAGFINFTFSPVFLNQQLEHFKDALKLGFQVSQPKKIIIDFSSPNIAKDMHVGHLRSTIIG
DSLARIFSYVGHDVLRLNHIGDWGTAFGMLITYLQENPCDYSDLEDLTSLYKKAYVCFTNDEEFKKRSQQ
NVVALQAKDPQAIAIWEKICETSEKAFQKIYDILDIVVEKRGESFYNPFLPEIIEDLEKKGLLTVSNDAK
CVFHEAFSIPFMVQKSDGGYNYATTDLAAMRYRIEEDHADKIIIVTDLGQSLHFQLLEATAIAAGYLQPG
IFSHVGFGLVLDPQGKKLKTRSGENVKLRELLDTAIEKAEEALREHRPELTDEAIQERAPVIGINAIKYS
DLSSHRTSDYVFSFEKMLRFEGNTAMFLLYAYVRIQGIKRRLGISQLSLEGPPEIQEPAEELLALTLLRF
PEALESTIKELCPHFLTDYLYNLTHKFNGFFRDSHIQDSPYAKSRLFLCALAEQVLATGMHLLGLKTLER
L

>gi|21221735|ref|NP_627514.1| putative arginyl-tRNA synthetase [Streptomyces coelicolor A3(2)]
MASVTSLSDSVQQHLASALTATRPEAAGADPLLRRSDRADYQANGILALAKKTKANPRELAAEVVARITT
GDELIEDVEVSGPGFLNITVADRAITANLAARLADGERLGVPLKQDAGTTVVDYAQPNVAKEMHVGHLRS
AVIGDALRSMLDFTGEKTIGRHHIGDWGTQFGMLIQYLFEHPGELAPAGDIDGEQAMSNLNRVYKASRAV
FDTDEEFKERARRRVVALQSGDKETLDLWQQFVDESKVYFYSVFEKLDMEIRDEEIVGESAYNDGMPETA
RLLEEMGVAVRSEGALVVFFDEIRGKDDQPVPLIVQKADGGFGYAASDLTAIRNRVQDLHATTLLYVVDV
RQSLHFRMVFETARRAGWLGDEVTAHNMGYGTVLGADGKPFKTRAGETVRLEDLLDEAVQRAAEVVREKA
RDLTEDEIQERAAQVGIGAVKYADLSTSPNRDYKFDLDQMVSLNGDTSVYLQYAYARIQSILRKAGEVRP
AAHPELALHEAERALGLHLDAFGPTVFEAAAEYAPHKLAAYLYQLASLYTTFYDKCPVLKAETPEQVENR
LFLCDLTARTLHRGMALLGIRTPERL

>gi|32473878|ref|NP_866872.1| arginyl-tRNA synthetase [Rhodopirellula baltica SH 1]
MHLPNVLQARFVQALEPLTDSPSDYAGMIRPAADPKFGDYQSNAAMPLAKRVGKTSRDVAAELVQNLNVT
DLFEEPEVAGPGFINLRLKDSVLFDSIQQMLLDERVGVSKTTDPKKVIVDFSSPNVAKPMHVGHIRSTVI
GDCLARTLRFYGEDVVTDNHLGDWGTQFGIIIYGYRNFGDPAKVAANPVPELSALYRLTNQLIEYQKAKQ
SLATMADKLATAKSDAKTAKEVSDQSESDENLKPKDKKKLRKNAEAATRRVASIEADMKSLKAKIDAVDS
DTELSKLASEHSDVDVAVLRETAKLHEGDPENLALWKEFLPHCQDEINRIYDRLNVQFDHTLGESFYHDR
LAGVVDHLTTLGLTTKSDGAICVFLEGFDSPMIIQKRDGAFLYATTDLATLQYRRDEFQPDEILYVVDSR
QGEHFKKFFAMAEPLGMAEVQLVHVNFGTVLGPDGRPMKTRSGSLIGLESLLNDAVSRAKEVVCNPDRLA
TMDPPMGGEEQQQIAEIVGIGAIKYADLSHHRTSDYKFDVDKMVALEGNTATYVQYSYARTQSILRRASD
GEGLPAFEQAIEQAAATQPMTFTHPNERSLALMLMRFEEAIEQVRLNYAPNALCDYLFETAKTYSSFNES
CRVLGNDDPAVMQTRLALVVLTGRVLKKGLSLLGIDVAERM

>gi|34762844|ref|ZP_00143829.1| Arginyl-tRNA synthetase [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
MKITSKELTDIFQKHVESLFPNKELKPVEITVATNENFGDYQCNFAMINSKIIGDNPRKIAEEIKNNFSC
GDVVEKLEVAGPGFINIFLSDKYISNSIKKIGENYDFSFLNRKGKVIIDFSSPNIAKRMHIGHLRSTIIG
EAVCRIYKFLGYDVVADNHIGDWGTQFGKLIVGYRKWLNREAYEKNAIEELERVYVKFSDEAEKDPSLED
LARAELKKVQDGEEENTKLWKEFITESLKEYNKLYKRLDVHFDTYYGESFYNDMMGDVVKELVDKKIAVD
DDGAKVVFFDEKDNLFPCIVQKKDGAYLYSTSDIATVKFRKNTYDVNRMIYLTDARQQDHFKQFFKITDM
LGWNIEKYHIWFGIIRFADGILSTRKGNVIKLEELLDEAHSRAYDVVNEKNPNLSEEEKQNIAEVVGVSS
VKYADLSQNKQSDIIFEWDKMLSFEGNTAPYLLYTYARIQSILRKVTEQNIDLNKNIEIKTDNKFEKSLA
TYLLVFPISVLKAAETFKPNLIADYLYELSKKLNSFYNNCPILNQDIETLKSRALLIKKTGEVLKEGLGL
LGIPVLNKM

>gi|16127589|ref|NP_422153.1| arginyl-tRNA synthetase [Caulobacter crescentus CB15]
MNDLKRSLSEAAAAAFQAAGLPPEFGRVTASDRPDLADFQCNGALAAAKSAKRNPREIAVQVVDILKGDP
RLASVEIAGVGFINMRVSDEALSARAREIASDDRTGAQLLETPRRVLIDYAGPNVAKPMHVGHLRASIIG
ESVKRLYRFRGDDVVGDAHFGDWGFQMGLLISAIMDEDPFINALMEKLPEAPRGFSSADEAKVMAEFEKR
ITLADLDRIYPAASVRQKEDPAFKERARKATAELQNGRFGYRLLWRHFVNVSRVALEREFHALGVDFDLW
KGESDVNDLIEPMVLQLEAKGLLVQDQGARIVRVAREGDKRDVPPLLVVSSEGSAMYGTTDLATILDRRK
SFDPHLILYCVDQRQADHFETVFRAAYLAGYAEEGALEHIGFGTMNGADGKPFKTRAGGVLKLHDLIEMA
REKARERLREAGLGAELSEEQFEDTAHKVGVAALKFADLQNFRGTSYVFDLDRFTSFEGKTGPYLLYQSV
RIKSVLRRAAESGAVAGRVEIHEPAERDLAMLLDAFEGALQEAYDKKAPNFVAEHAYKLAQSFSKFYAAC
PIMSADTETLRASRLTLAETTLRQLELALDLLGIEAPERM

>gi|15903931|ref|NP_359481.1| Arginyl-tRNA synthetase(arginine--tRNA ligase) (ARGRS) [Streptococcus pneumoniae R6]
MNTKELIASELSSIIDSLDQEAILKLLETPKNSEMGDIAFPAFSLAKVERKAPQMIAAELAEKMNSQAFE
KVVATGPYVNFFLDKSAISAQVLQAVTTEKEHYADQNIGKQENVVIDMSSPNIAKPFSIGHLRSTVIGDS
LSHIFQKIGYQTVKVNHLGDWGKQFGMLIVAYKKWGDEEAVKAHPIDELLKLYVRINAEAENDPSLDEEA
REWFRKLENGDEEALALWQWFRDESLVEFNRLYNELKVEFDSYNGEAFYNDKMDAVVDILSEKGLLLESE
GAQVVNLEKYGIEHPALIKKSDGATLYITRDLAAALYRKNEYQFAKSIYVVGQEQSAHFKQLKAVLQEMG
YDWSDDITHVPFGLVTKEGKKLSTRKGNVILLEPTVAEAVSRAKVQIEAKNPELENKDQVAHAVGVGAIK
FYDLKTDRTNGYDFDLEAMVSFEGETGPYVQYAYARIQSILRKADFKPETAGNYSLNDTESWEIIKLIQD
FPRIINRAADNFEPSIIAKFAISLAQSFNKYYAHTRILDESPERDSRLALSYATAVVLKEALRLLGVEAP
EKM

>gi|18310643|ref|NP_562577.1| arginine-tRNA ligase [Clostridium perfringens str. 13]
MDYKKLVAERIKEHVDLELENIEKLIEIPPKPEMGDFAFPCFQLAKVMRKAPNMIAAELAEKINKEGFER
VECLGPYLNFFVDKVAFSKNIISKVLEEGDKYGSSKIGEGKNVVVEYSSPNIAKPFHVGHLFTTAIGHSL
YRMLNFEGYNPIRINHLGDWGTQFGKLISAYKRWGNEEALEEAPINELLRIYVKFHDEAENNPELEDEGR
MYFKKLEDGDQEAVALWERFKDLSLKEFNKIYDMLGVDFDSWAGESFYNDKMDKVVEELEKANILTESNG
AKVVMLDEYNMPPCIVVKSDGASIYATRDLAAASYRHKTYNFDKCIYVVGKDQILHFNQVFKTLELAGNE
WAKNCVHIPFGLVKFADRKLSTRKGNVVLLEDLLNEAIDKTRETIEEKNPQLENKEEVAKKIGIGAILFT
YLKNSRERDIVFDWKEMLSFDGETGPYVQYSYARAKSILRKAEEQKITAEPDFTKLTSKEEFELAKTLEG
LQKAVILGIDKLEPSVVTRYSIEVAKAFNKFYNNHTVLNVEDEGLKAARLELIKATAQVIKNALFLIGID
VVEKM

>gi|15926285|ref|NP_373818.1| arginyl-tRNA synthetase [Staphylococcus aureus subsp. aureus N315]
MNIIDQVKQTLVEEIAASINKAGLADEIPDIKIEVPKDTKNGDYATNIAMVLTKIAKRNPREIAQAIVDN
LDTEKAHVKQIDIAGPGFINFYLDNQYLTAIIPEAIEKGDQFGHVNESKGQNVLLEYVSANPTGDLHIGH
ARNAAVGDALANILTAAGYNVTREYYINDAGNQITNLARSIETRFFEALGDNSYSMPEDGYNGKDIIEIG
KDLAEKHPEIKDYSEEARLKEFRKLGVEYEMAKLKNDLAEFNTHFDNWFSETSLYEKGEILEVLAKMKEL
GYTYEADGATWLRTTDFKDDKDRVLIKNDGTYTYFLPDIAYHFDKVKRGNDILIDLFGADHHGYINRLKA
SLETFGVDSNRLEIQIMQMVRLMENGKEVKMSKRTGNAITLREIMDEVGVDAARYFLTMRSPDSHFDFDM
ELAKEQSQDNPVYYAQYAHARICSILKQAKEQGIEVTAANDFTTITNEKAIELLKKVADFEPTIESAAEH
RSAHRITNYIQDLAAHFHKFYNAEKVLTDDIEKTKAHVAMIEAVRITLKNALAMVGVSAPESM

>gi|39649773|emb|CAE28295.1| arginyl-tRNA synthetase [Rhodopseudomonas palustris CGA009]
MAELPMSTHLFARLLSRVHAVCAALIEEGALPAGIDLSRVVVEPPKDASHGDMATNAAMVLAKDAKAKPR
DLADKIADKLRAEELIDQVAIAGPGFINLTLKPAVWAEALRAVLDAGAGYGRSTVGGGEKVNVEYVSANP
TGPMHVGHCRGAVFGDALANLLDTAGYDVTREYYINDAGAQVDVLARSAFLRYREALGETIGEIPEGLYP
GDYLKPVGEALKAEHGAALKDMPEAQWLPTVRATAIAMMMEAIKGDLAALNITHEVFFSERSLIEGGRNR
VAETIEFLRAKGDVYQGRLPPPKGAPVEDYEDREQTLFRATAYGDDVDRPLLKSDGSYTYFASDIAYHKV
KFDAGFANMVDVWGADHGGYIKRMQAAIQAVTAGKGALDVKIVQLVRLLRNGEPVKMSKRAGDFVTLREV
VDEVGSDAVRFMMLFRKNDAVLDFDLAKVIEQSKDNPVFYVQYGHARGHSIFRNAREVVPDLPEDSKARA
AMLRQAPLERLNDPAELELLKRLALYPRIVEAAAQAHEPHRIAFYLNELASEFHALWTHGRDLPHLRFII
NNDAEITRARLAMVQGVVSVLASGLAILGVTAPDEMR

>gi|16801767|ref|NP_472035.1| arginyl tRNA synthetase [Listeria innocua Clip11262]
MNVMQENQIKLIEHIKQAVVQAVGLEETEVPEILLEVPKDKKHGDYSTNIAMQLARVAKKAPRQIAESIV
PELKKDTKLIKEVEIAGPGFINFYLDNAYLTDLVPVILTEDKKYGESDFGKGEKFQIEFVSANPTGDLHL
GHARGAAIGDSLANIMKMAGFDVSREYYINDAGNQINNLVLSAEARYFEALGLESEFPEDGYRGSDIIAL
GKDLAAKYGDKYVNASEEERRSVFRVDALAFETGKLRADLEEFRVSFDEWFSETSLYEENKVLPALERLR
ENGYIYEQDGATWLRTTDFEDDKDRVLIKSDGSYTYFLPDIAYHLNKLERGFDVLIDIWGADHHGYIPRM
RAAIEALGYSPNQLEVEIIQLVHLFEDGVQVKMSKRTGKSVTMRDLIEEVGLDATRYFFAMRSSDTHMNF
DMSLAKSTSNDNPVYYVQYAHARISSILRSGKEQGLEVSKDANMSLLETEAEYDLLKVLGEFADVVAEAA
VKRAPHRIVRYLNDLATAFHRFYNSNKVLDMDNLEVTKARLALIKTAQITLRNGLTLLGVSAPEKM

>gi|17545006|ref|NP_518408.1| PROBABLE ARGINYL-TRNA SYNTHETASE (ARGININE--TRNA LIGASE) PROTEIN [Ralstonia solanacearum GMI1000]
MLPSHKQTISQLLSDAVGTLLPEGTNRPEIVLERPKQAAHGDIACNVALQLAKPLGTNPRELANRIADGI
RADARGQRLVSAVEIAGPGFINLRLSPTARTDVLAAVFAEGDRYGAADLHDGAPVLVEFVSANPTGPLHV
GHGRQAALGDALAALLEWQGHKVHREFYYNDAGVQIHNLAVSVQARARGFKPGDTGWPEAAYNGDYIADI
AADYLAGKTVRASDGEPVTGARDVENIEAIRRFAVTYLRNEQDIDLQAFGVKFDHYYLESSLYADGKVQQ
TVDALIAAGKTYEQEGALWLRTTDDGDDKDRVMRKSDGSYTYFVPDVAYHTTKWGRGFTQVINVQGSDHH
GTIARVRAGLQGLDLGIPKGYPDYVLHKMVTVMKDGAEVKISKRAGSYVTVRDLIEWSNGDAESEAGVDT
IRACVESGAPNWPGRFTRGRDAVRFFLLSRKADTEFVFDVDLALKQSDENPVYYVQYAHARICSVFEQWH
AREGGDAASLAGADLAAVAGPEASPQAVALVQRIAAFPDMLADAARELAPHAVAFYLRDLAGDFHAFYNA
DRVLVDDDAVKRARLALLAATRQVLRNGLAVIGVSAPQKM

>gi|23336003|ref|ZP_00121233.1| COG0018: Arginyl-tRNA synthetase [Bifidobacterium longum DJO10A]
MSPEALSELISSIAHNLVAAGQAGALTDELIPPVDKLAVMRPKDRAHGDWASNIAMQLAKKAGMKPRDLA
EPFAAALAEADGIAKVEVAGPGFINITLDSASAAAVVDTVLAAGAMTDTDKHLNKVNEYGRNAHLGGQTL
NLEFVSANPTGPIHIGGTRWAAVGDAMARVLEANGAKVVREYYFNDHGEQINRFAKSLVAAWAEANNLGE
AGYQTETPCDGYKGAYINEIAARVQAEAESDGVDLTALAHQDQGLNDDGEPLGEADTEVREEFRKRAVPM
MFDEIQKSMKDFRVNFDVWFHENSLYADGKVDAAIEELKSRGDIFDKDGATWFESTKHGDDKDRVIIKSN
GEFAYFAADIAYYWDKRHRAENPADVAIYMLGADHHGYIGRMMAMCAAFGDEPGKNMQILIGQLVNVMKD
GKPVRMSKRAGNVVTIDDLVSVVGVDAARYSLARSDYNQNFDIDLALLASHTNDNPVYYVQYAHARSKNV
DRNAAVAGISYEGADLALLDTEADGEVLAALAQFPSVLATAADDRQPHKVARYLEELAATYHKWYNVERV
VPMALTDPETRGDDEARKALEIAKNPEPARAAARLKLNDAVQQVIANGLDLLGVTAPEKM

>gi|15889018|ref|NP_354699.1| AGR_C_3144p [Agrobacterium tumefaciens str. C58]
MNIFADFDTRIKNALETLDLVKENREKVDFSRITVESPRDLSHGDVATNAAMVLAKPLGTNPRALAELLV
PALQADGDVDGVNVAGPGFINLKVSVGYWQRLLADMIGQGVDFGRSTVGAGQKINVEYVSANPTGPMHVG
HCRGAVVGDTLANLLAFAGYGVTKEYYINDAGSQIDVLARSVFLRYREALGEDIGSIPSGLYPGDYLVPV
GQALADEYGIKLRAMPEEKWLPIVKDKAIDAMMVMIREDLALLNVRHDVFFSERTLHEGNGGPILSAIND
LTFKGHVYKGTLPPPKGELPDDWEDREQTLFRSTEVGDDMDRALMKSDGSYTYFAADVAYFKNKFDRGFS
EMIYVLGADHGGYVKRLEAVARAVSEGKSKLTVLLCQLVKLFRDGEPVKMSKRSGDFVTLRDVVDEVGRD
PVRFMMLYRKNSEPLDFDFAKVTEQSKDNPVFYVQYAHARCKSIFRQAQEAFPGLAPSAEDMAASVALIS
DINELQLVAKLAEYPRLIESAALSHEPHRLAFYLYDLAGSFHGHWNKGKDHQELRFINDKNRELSIARLG
LVNAVANVLKSGLTLLGADAPDEMR

>gi|17987372|ref|NP_540006.1| ARGINYL-TRNA SYNTHETASE [Brucella melitensis 16M]
MNIFADFDARIKKTLQDIDLKPKDGGELDLSRIGVEPPRDASHGDIATNAAMVLSKAVGQNPRELAARIA
EALKADEDVESVDVAGPGFINLRLKASYWQRELLVMLNEGTDFGRSRLGAGKKVNVEYVSANPTGPMHVG
HCRGAVVGDVLANLLKFAGYDVVKEYYINDAGAQIDVLARSVMLRYREALGESIGEIPAGLYPGDYLVRV
GQELAGEFGTKLLEMPEAEALAIVKDRTIDAMMAMIRADLDALNVHHDVFYSERKLHVDHARAIRNAIND
LTLKGHVYKGKLPPPKGQLPEDWEDREQTLFRSTEVGDDIDRPLMKSDGSFTYFAGDVAYFKDKYDHGFN
EMIYVLGADHGGYVKRLEAVARAVSDGKAKLTVLLCQLVKLFRNGEPVRMSKRAGEFITLRDVVDEVGRD
PVRFMMLYRKNDAPLDFDFAKVTEQSKDNPVFYVQYASARCHSVFRQAADQLGLVDLDRVAMGSHFEKLT
DESEIALVRKLAEYPRLIESAAIHQEPHRLAFYLYDLASSFHSQWNRGAENPDLRFIKVNDPDLSLARLG
LVQVVSDVLTSGLTIIGADAPTEMR

>gi|34540109|ref|NP_904588.1| arginyl-tRNA synthetase [Porphyromonas gingivalis W83]
MSILQKLENSAAAAVKALYGTDPMEGQIQLQKTKREFKGHLTLVVFPFVKMSRKSPEATATEIGEWLLAN
ESAVSAIEVVKGFLNLTIAPRVWLELLNEIRADINFGHKVATEDSPLVMVEYSSPNTNKPLHLGHVRNNL
LGYSLSEIMKANGYRVVKTNIVNDRGIHICKSMLAWQKWGDGVTPEKAGKKGDHLIGDFYVLFDKHYKAE
LNSLMAEGKSKEEAEAASTLMAEAREMLRLWEAGDEKVVDLWRTMNQWVYDGFDATYKMMGVDFDKIYYE
SETYLVGKEEVLRGLEEGLFVKHSDGSVWADLTKDGLDEKLLLRADGTSVYMTQDIGTAKMRFNDYPINR
MIYVVGNEQNYHFQVLSILLDRLGFEFGKGLVHFSYGMVELPEGKMKSREGTVVDADDLMDEMIRTAAEI
AAEAGKAAEMDEEESREVARIVGLGSLKYFILKVDPRKNMTFNPKESIDFNGNTGSFVQYTYARIRSLMR
RAEAAGYDIPSQLPTDLPLSEKEEALIQKVSEYAEVVSEAGHSYSPALIANYIYDLVKEYNQFYHDFSVL
KEEDERIRAFRLALSEVVALTMRKGFALLGIEMPERM

>gi|15603944|ref|NP_220459.1| ARGINYL-TRNA SYNTHETASE (argS) [Rickettsia prowazekii str. Madrid E]
MNIFNQLKQDIIAASQKLYNNKEIANTATIETPKDSFNGDLSSNIAMIIASKESIAPREVALKFKEVLVT
LPYIASIEIAGPGFINFTIKAESWQAAIKDILQHEEKFFEIDIDKNSNINIEYVSANPTGPMHIGHARGA
VYGDVLARILQKVGYSVTKEYYVNDAGSQINDLVSTVLLRYKEALGEPITIPVGLYPGEYLIPLGEILSK
EYGNKLLTMNDVERFKIIKSFAVEKMLDLNRKDLADLGIKHDVFFSEQSLYDKGEIEKTVKLLERMGLIY
EGTLPAPKGKVHEDWEYRVQKLFKSTNYGDSQDRPIEKADGSWSYFASDLAYAKDKIDRGANHLIYVLGA
DHSGYVKRIEAIVKALGQEKVKVDVKICQLVNFVENGVPIKMSKRLGSFASVQDVNKEVGKDIIRFMMLT
RQNDKPLDFDLVKVKEQSRENPIFYVQYAHVRTKSILSKARELMPEAYNSFKEGKYNLSLLSSEEEIEII
KLLAAWTKTLEASVKYFEPHRIAFYLINLASKFHSMWNFGKENSDYRFIIENNKELTLARLALASVIQKI
IASGLEVIGVEPMVTM

>gi|33591373|ref|NP_879017.1| arginyl-tRNA synthetase [Bordetella pertussis Tohama I]
MRGHLRQTPGRPPGGSARPAARQTCRRHLPFCRLPMLLEQQKQLISLIQAAVAQCLPEAQAQVQLERPKV
AAHGDIATNVAMQLAKPARRNPRELAQGIVDALMAQPQARELIQDAEIAGPGFINFRLTPAARQAVVQAV
ASQADAYGRAPRNGEKVLVEFVSANPTGPLHVGHARQAALGDAICRLYDASGWDVTREFYYNDAGNQIDN
LAISVQARGRGIAPDAPDYPADGYKGDYIVEIARDFAARKSVQASDGQPVTATGDLDSLDDIRAFAVAYL
RREQDLDLQAFGLAFDNYFLESSLYASGRVQETVDTLVAKGHTYEEGGALWLRTTELGTGDDKDRVMRKS
EGGYTYFVPDVAYHKVKWERGFHHAVNIQGSDHHGTVARVRAGLQGLAGIPKDFPAYVLHKMVKVMRGGE
EVKISKRAGSYVTMRDLIDWVGRDAVRYFLIQRRADTEFVFDIDLALSKSDENPVYYIQYAHARICTMIG
NSGASAAEIAQADTALLTAPSEYALLQRLAEFPQVVALAAQELAPHHVAFWLRDCASDFHAWYNAERVLV
DEPALKLARLRLAATTRQVLANGLALLGVSAPDRM

>gi|21672856|ref|NP_660921.1| arginyl-tRNA synthetase [Chlorobium tepidum TLS]
MRAFFLPFIQDALQKAGIETDKEIQIDKPNDKKFGDFSTNIAFLVAKEARKNPRELAGQLIGLLDFPEGT
VTKTEVAGPGFINFHLAPAFFMRSAQEVLAKGEGFGCNESGKGLKAIVEYVSANPTGPLTIGRGRGGVLG
DCIANLLETQGYEVTREYYFNDAGRQMQILAESVRYRYLEKCGQVIEFPETHYQGDYIGEIAETLFIEHG
DGLAATDELTIFKEAAEAVIFSSIRKTLERLLITHDSFFNEHTLYQSREGQPSANQRVIDALDAKGFIGN
YDGATWFMTTKLGQEKDKVLIKSSGDPSYRLPDIAYHVTKFERGFDLMVNVFGADHIDEYPDVLEALKIL
GYDTSKVKIAINQFVTTTVGGQTVKMSTRKGNADLLDDLIDDVGADATRLFFIMRGKDSHLNFDVELAKK
QSKDNPVFYLQYAHARICSLVRMAEKEVGFDEATAIGAGLPLLSSEPEIDLASALLDFPDIIQSSLRQLE
PQKMVEYLHTVAERYHKFYQECPILKADEHLRTARLELSLAVRQVLRNGFKILGISAPESM

>gi|20808887|ref|NP_624058.1| Arginyl-tRNA synthetase [Thermoanaerobacter tengcongensis]
MENIVQKAKEEIKDVVLKALNEAKKEGLLNFESIQDVEVEEPKEKQHGDLATNFAMVMAREAKMAPRKIA
EIIASKMNTSGTFIEKVEVAGPGFINFFLNQNFLIETLKLIHKRGKDYGRVNLGKGKKVQVEFVSANPTG
PMHMGNARGGAIGDVLASILDYAGYNVSREFYINDAGNQIEKFGYSLEARYLQLLGIDAEVPEGGYHGED
IIDRAKEFLEIHGDKYKDVPSEERRKALIEYGLKKNIEKMKEDLVLYGIEYDVWFSEQSLYDSGEVYKVI
EELTEKGYTYEKDGALWFKMTLFGAEKDDVLVRSNGVPTYLASDIAYHKNKFVTRGFDWVINVWGADHHG
HVAPMKGAMKALGIDPNRLDVVLMQLVKLIEGGQVVRMSKRTGKMITLRDLIEEVGKDAARFFFNMRSPD
SPIEFDLDLAKQQTNENPVFYVQYAHARICSIIRQLEEMGVKIENIEDVDLGLLKEEEEVDLIKKLAYFP
EEITIAAKTLAPHRITRYVIDVASLFHSFYNSHRVKGAEENLMKARFALILAVKTVLKNALDILKVTAPE
RM

>gi|15611371|ref|NP_223022.1| ARGINYL-TRNA SYNTHETASE [Helicobacter pylori J99]
MHTLIKGVLEEILEAEVIIEYPKDREHGHYATPIAFNLAKVFKKSPLAIAEELALKIGSHEKTQGFFDRV
VACKGYINFTLSLDFLERFTQKALELKEQFGSQVKSERSQKIFLEFVSANPTGPLHIGHARGAVFGDSLA
KIARFLGHEVLCEYYVNDMGSQIRLLGVSVWLAYKEHVLKESVTYPEVFYKGEYIIEIAKKAHNDLEPSL
FKENEETIIEVLSDYAKDLMLLEIKGNLDALDIHFDSYASEKEVFKHKDAVFDRLEKANALYEKDSKTWL
KSSLYQDESDRVLIKEDKSYTYLAGDIVYHDEKFQQNYTKYINIWGADHHGYIARVKASLEFLGYDSSKL
EVLLAQMVRLLKDNEPYKMSKRAGNFILIKDVIDDVGKDALRFIFLSKRLDTHLEFDVNTLKKQDSSNPI
YYIHYANSRIHTMLEKSPFSKEEILQTPLKNLNAEEKYLLFSALSLPKAVESSFEEYGLQKMCEYAKTLA
SEFHRFYNAGKILDTPKAKELLKICLMVSLSLTNAFKLLGIEIKTKISSKD

>gi|15836752|ref|NP_297440.1| arginyl-tRNA synthetase [Xylella fastidiosa 9a5c]
MLTRFSYKRSDKITLSIATHPHPHVKAPLRALICQGIEALRSNGTLPTNTLPPDFVVERPKTRKHGDFAT
NVAMLLSKATGSNPRLLAQTLVAALPTSADIARIEIAGPGFINFHLHPVAYQRETINVLKQDNDYGRNLS
GQSRTVGVEYVSANPTGPLHVGHGRAAAIGDCLARLLEANGWNVKREFYYNDAGVQIENLVRSVQARARG
LKPGDAFWPTDAYNGEYIADIAKAYLAGDSINMVDTIITSTKNVDDTAAIHHFAVNYLRNEQNHDLAAFN
VDFDIYFLESSLYKDGKVEETVQKLINSGHTYEEGGALWLKSTHFGDDKDRVMRKSDGSYTYFVPDIAYH
LSKWQRGYERAITELGADHHGSLARVHAGLQALEIGIPPGWPEYVLHQMVTVMRGGEEVKLSKRSGGYVT
LRDLIEETSTDATRWFLIARKPDSQLTFDIDLARQKSNDNPVFYVQYAYARVCSLMHQAHEKNLNYDQTS
GMASLDQLSDNTSLCLMIEISRYPEIVQIACELLEPHLIAQYLRELAHAFHTWYHNTPVLVENAVERNAK
LTLACATRQVLANGLNLLGVGTPEKM

>gi|53715701|ref|YP_101693.1| arginyl-tRNA synthetase [Bacteroides fragilis YCH46]
MKIEDKLVTSVISGLKALYGQDVPAAQVQLQKTKKEFEGHLTLVVFPFLKMSKKGPEQTAQEIGEYLKAN
EPAVAAFNVIKGFLNLTVASATWIELLNEIHADAQYGIVSADENAPLVMIEYSSPNTNKPLHLGHVRNNL
LGNALANIVMANGNKVVKTNIVNDRGIHICKSMLAWQKYGKGETPESSGKKGDHLVGDYYVAFDKHYKAE
VAELMEKGMSKEEAEAASPLMNEAREMLVKWEAGDPEVRALWQMMNNWVYTGFDETYRKMGVGFDKIYYE
SNTYLEGKEKVMEGLEKGFFFKKEDGSVWADLTAEGLDHKLLLRGDGTSVYMTQDIGTAKLRFADYPIDK
MIYVVGNEQNYHFQVLSILLDKLGFEWGKSLVHFSYGMVELPEGKMKSREGTVVDADDLMAEMIATAKET
SQELGKLDGLTQEEADDIARIVGLGALKYFILKVDARKNMTFNPKESIDFNGNTGPFIQYTYARIRSVLR
KAAEAGIVIPEVLPANIELSEKEEGLIQMVADFAAVVRQAGEDYSPSGIANYVYDLVKEYNQFYHDFSIL
REENEDVKLFRIALSANIAKVVRLGMGLLGIEVPDRM

>gi|15807552|ref|NP_296288.1| arginyl-tRNA synthetase [Deinococcus radiodurans R1]
MDLKAQLKAAVEQAAHQMGMPVDAAIQETPANKPGDYGTPAAFQMAKAAGGNPAQIAAQLAQTVVLPAGI
RRVEATGPFLNFFLDAGAFVRGVVERPFELPKREGKVVIEHTSVNPNKELHVGHLRNVVLGDSMARILRA
AGHTVEVQNYIDDTGRQAAESLFATQHYGRVWDGVQKYDQWLGEGYVQLNADPQKPELESGIMEIMHKLE
AGELRPLVEQTVKAQLQTCFRLGARYDLLNWESDVVGSGFLAQAMNILEGSRYTSRPTEGKYAGAFIMDV
SEFMPGLEEPNVVLVRSGGTAMYAAKDIGYQFWKFGLFEGMKFKPFMQDPEGNTIWTSAPDGQPDDERRF
GHAQEVINVIDSRQDHPQTVVRSALGVAGEQEKEERSIHLSYAFVTLEGQTISGRKGIAVSADDAMDEAQ
KRALSVLQGINPDLAAREDAAEIARRIGLGAIRFAMLKAEPTRKIDFRWEQALALNGDTAPYVQYAAVRA
ANILKKAEEAGYATDGTGADWDALPDIDLVLAKQIAKLPEVAAQAARIHSPHVVAQYALDLATSFNAWYN
AKTKQGKPATNVLQSEEGLREARLALIVRLRKAFEDTLDLIGIEIPAAM

>gi|16080786|ref|NP_391614.1| arginyl-tRNA synthetase [Bacillus subtilis subsp. subtilis str. 168]
MNIAEQMKDVLKEEIKAAVLKAGLAEESQIPNVVLETPKDKTHGDYSTNMAMQLARVAKKAPRQIAEEIV
AHFDKGKASIEKLDIAGPGFINFYMNNQYLTKLIPSVLEAGEAYGETNIGNGERVQVEFVSANPTGDLHL
GHARGAAVGDSLCNVLSKAGYDVSREYYINDAGNQINNLALSVEVRYFEALGLEKPMPEDGYRGEDIIAI
GKRLAEEYGDRFVNEEESERLAFFREYGLKYELEKLRKDLENFRVPFDVWYSETSLYQNGKIDKALEALR
EKGHVYEEDGATWFRSTTFGDDKDRVLIKKDGTYTYLLPDIAYHKDKLDRGFDKLINVWGADHHGYIPRM
KAAIEALGYEKGTLEVEIIQLVHLYKNGEKMKMSKRTGKAVTMRDLIEEVGLDAVRYFFAMRSADTHMDF
DLDLAVSTSNENPVYYAQYAHARICSMLRQGEEQGLKPAADLDFSHIQSEKEYDLLKTIGGFPEAVAEAA
EKRIPHRVTNYIYDLASALHSFYNAEKVIDPENEEKSRARLALMKATQITLNNALQLIGVSAPEKM

>gi|15594939|ref|NP_212728.1| arginyl-tRNA synthetase (argS) [Borrelia burgdorferi B31]
MLKRKKMNKSVKKKIKDEINVIVTNLALSNNIKLDNININIQKPPKSDLGDISILMFEIGKTLKLPIEII
SEEIIKNLKTKYEIKAVGPYLNIKISRKEYINNTIQMVNTQKDTYGTSKYLDNKKIILEFSSPNTNKPLH
VGHLRNDVIGESLSRILKAVGAKITKINLINDRGVHICKSMLAYKKFGNGITPEKAFKKGDHLIGDFYVK
YNKYSQENENAEKEIQDLLLLWEQKDVSTIELWKKLNKWAIEGIKETYEITNTSFDKIYLESEIFKIGKN
VVLEGLEKGFCYKREDGAICIDLPSDSDEKADTKVKQKVLIRSNGTSIYLTQDLGNIAVRTKEFNFEEMI
YVVGSEQIQHFKSLFFVAEKLGLSKNKKLIHLSHGMVNLVDGKMKSREGNVIDADNLISNLIELIIPEMT
QKIENKESAKKNALNIALGAIHYYLLKSAIHKDIVFNKKESLSFTGNSGPYIQYVGARINSILEKYKALS
IPVMEKIDFELLKHEKEWEIIKIISELEENIINAAKDLNPSILTSYSYSLAKHFSTYYQEVKVIDTNNIN
LTAARIEFLKAILQTIKNCMYLLNIPYMLKM

>gi|15643850|ref|NP_228899.1| arginyl-tRNA synthetase [Thermotoga maritima MSB8]
MLVNAIRQKVSEVISKAYGSEIEFEVEIPPRKEFGDLSTNVAMKLAKTLKKNPREIAQEIVKSLDEDPSF
DRIEIMGPGFINFFLSNELLRGVVKTVLEKKDEYGRENVGNGMKVQFEYGSANPTGPFTVGHGRQIIIGD
VLSEVYKELGYDVTREMYINDAGKQIRLLAQSLWARYNQLLGVEKEIPEGGYRGEYLVDIARDLVNEIGD
RYKDLWNEEVEEFFKQTALNRILSSMKDTLEKIGSSFDVYFSEKSLIEDGTVEEVLKLLKNKDVVYEKDG
AVWLKVSAFIDEEDKVLVRSDGTYTYFMTDIAYHYKKYKRGFRKVYDIWGSDHHGHIPRMKAAMKALDIP
DDFFNVILHQFVTLKRGGEIVRMSTRAGEFVTLDELLDEVGRDATRYFFAMVDPNTHMVFDIDLAKAKSM
DNPVYYVQYAHARIHNLFSNAEKKGVKFEEGKHLELLGNEEERVLMRNLGMFNTALKEVAQMFAPNRLTN
YLQSLAESFHAFYTKHVIVDPENPELSNARLNLALATGIVLRKGLKLIGVSAPERM

>gi|46200208|ref|YP_005875.1| Arginyl-tRNA synthetase [Thermus thermophilus HB27]
MLRRALEEAIAQALKEMGVPARLKVARAPKDKPGDYGVPLFALAKELRKPPQAIAQELKDRLPLPEFVEE
AIPVGGYLNFRLRTEALLREALRPKAPFPRRPGVVLVEHTSVNPNKELHVGHLRNIALGDAIARILAYAG
REVLVLNYIDDTGRQAAETLFALRHYGLTWDGKEKYDHFAGRAYVRLHQDPEYERLQPAIEEVLHALERG
ELREEVNRILLAQMATMHALNARYDLLVWESDIVRAGLLQKALALLEQSPHVFRPREGKYAGALVMDASP
VIPGLEDPFFVLLRSNGTATYYAKDIAFQFWKMGILEGLRFRPYENPYYPGLRTSAPEGEAYTPKAEETI
NVIDVRQSHPQALVRAALALAGYPALAEKAHHLAYETVLLEGRQMSGRKGLAVSVDEVLEEATRRARAIV
EEKNPDHPDKEEAARMVALGAIRFSMVKTEPKKQIDFRYQEALSFEGDTGPYVQYAHARAHSILRKAGEW
GAPDLSQATPYERALALDLLDFEEAVLEAAEEKTPHVLAQYLLDLAASWNAYYNARENGQPATPVLTAPE
GLRELRLSLVQSLQRTLATGLDLLGIPAPEVM

>gi|46323650|ref|ZP_00224013.1| COG0018: Arginyl-tRNA synthetase [Burkholderia cepacia R1808]
MLPAHKQTLEALLADSVAQVAHALKGADAEFVIPAITLERPKVAAHGDVACNVAMQLAKPLGTNPRQLAE
RIVAALVAQPAAQGLVDAAEIAGPGFINLRVSAAAKQAVIAAVFEQGRAFGTSQREKGKRVLVEFVSANP
TGPLHVGHGRQAALGDVLANVIASQGYAVHREFYYNDAGVQIANLAISTQARARGLKPGDAGWPEAAYNG
EYIADIARDYLNGATVAAKDGEPVTGARDIENLDAIRKFAVAYLRHEQDMDLQAFGVKFDQYYLESSLYS
EGRVEKTVDALVKAGMTYEQDGALWLRTTDEGDDKDRVMRKSDGTYTYFVPDVAYHVTKWERGFTKVINI
QGSDHHGTIARVRAGLQGLHIGIPKGYPDYVLHKMVTVMRDGQEVKLSKRAGSYVTVRDLIEWSGGAAPG
QEAAPDMIDEATITRGRDAVRFFLISRKADTEFVFDIDLALKQNDENPVYYVQYAHARICSVLNELKARY
NVDVAQLPGADLSQLTSPQAVSLMQKLAEYPDLLTHAANELAPHAVAFYLRDLAGEFHSFYNAERVLVDD
EAPRNARAALLAATRQVLENGLAMLGVSAPAKM

>gi|30248381|ref|NP_840451.1| Arginyl-tRNA synthetase [Nitrosomonas europaea ATCC 19718]
MVTTTLPDFKSHCIQLLDQAARQVLPDEVGVQIELLRPKLADHGDYSSNLAMKLARRLRRNPLELAKALI
GALPDSSCVEKADVAGGGFINFFLKKTAKQQFLHAVLQAGDSFGHSRLGAGKTIQIEFVSANPTGPLHVG
HGRGAAFGASLANIMTAAGYAVTREFYVNDAGRQMDILTLSTWLRYLDLCGLSFSFPANAYRGQYVADMA
SEIYQAQGDRYAHRSDATIRQLTEISTSTTIDSEDERLDRLITAAKSILDQDYADLHNFVLTEQLADCRN
DLMEFGVEFETWFSEQSLFDSGMVARAVQLLDDKKLLYRQDGALWFRSTDFGDEKDRVVQRENGLYTYFA
SDIAYHLSKYERGFDYLLNIWGADHHGYIPRVKGAIEALSLDPGRLEIALVQFAVLYRDGKKVSMSTRSG
EFVTLRQLRQEVGNDAARFFYVLRKSDQHLDFDLDLAKSQSNDNPVYYVQYAHARICSVLGQWGGAEDIL
ARAETELLTDPAELVLLQKMIDFTDTIEAAAKERAPHLIAFFLRELAGEFHSYYNSTRFLVEDESLKITR
LALISAVRQILSKGLTLLGVTAPREM

>gi|433534|emb|CAA79710.1| arginyl-tRNA synthetase [Corynebacterium glutamicum]
MTPADLATLIKETAVEVLTSRELDTSVLPEQVVVERPRNPEHGDYATNIALQVAKKVGQNPRDLATWLAE
ALAADDAIDSAEIAGPGFLNIRLAAAAQGEIVAKILAQGETFGNSDHLSHLDVNLEFVSANPTGPIHLGG
TRWAAVGDSLGRVLEASGAKVTREYYFNDHGRQIDRFALSLLAAAKGEPTPEDGYGGEYIKEIAEAIVEK
HPEALALEPAATQELFRAEGVEMMFEHIKSSLHEFGTDFDVYYHENSLFESGAVDKAVQVLKDNGNLYEN
EGAWWLRSTEFGDDKDRVVIKSDGDAAYIAGDIAYVADKFSRGHNLNIYMLGADHHGYIARLKAAAAALG
YKPEDVEVLIGQMVNLLRDGKAVRMSKRAGTVVTLDDLVEAIGIDAARYSLIRSSVDSSLDMDLGLWESQ
SSDNPVYYVQYGHARLCSIARKAETLGVTEEGADLSLLTHDREGDLIRTLGEFPAVVKAAADLREPHRIA
RYAEELAGTFHRFYDSCHILPKADEDTAPIHTARLALAAATRQTLANALRLVGVSAPEKM

>gi|15606252|ref|NP_213630.1| arginyl-tRNA synthetase [Aquifex aeolicus VF5]
MKELVKEKVLKALKELYNTQVENFKVEKPKEEAHGDLASNVAFLLARELKKPPVNIAQELADFLSKDETF
KSVEAVKGFINFRFSEDFLKEEFKKFLLSGEAYFKEDLGKGLKVQLEYVSANPTGPLHLGHGRGAVVGDT
LARLFKFFNYDVTREYYINDAGRQVYLLGISIYYRYLEKCPERDEETFKEIKEIFEKDGYRGEYVKEIAE
RLRKLVGESLCKPEEANLKEVREKILKEESIELYYTKKYEPKDVVDLLSNYGLDLMMKEIREDLSLMDIS
FDVWFSERSLYDSGEVERLINLLKEKGYVYEKDGALWLKTSLFGDDKDRVVKRSDGTYTYFASDIAYHYN
KFKRGFEKVINVWGADHHGYIPRVKAALKMLEIPEDWLEILLVQMVKLFREGKEVKMSKRAGTFVTLREL
LDEVGKDAVRFIFLTKRSDTPLDFDVEKAKEKSSENPVYYVQYAHARISGIFREFKERYKKDVSVEELIN
YVQHLEEEAEIKLIKKVLFFKDELVDITLKREPHLLTYYLIDLAGDFHHYYNHHRILGMEENVMFSRLAL
VKGIKEVVRLGLNLMGVSAPERM

>gi|15792499|ref|NP_282322.1| arginyl-tRNA synthetase [Campylobacter jejuni subsp. jejuni NCTC 11168]
MKSIIFNEIKKILECDFALENPKDKNLAHFATPLAFSLAKELKKSPMLIASDLASKFQNHDCFESVEAVN
GYLNFRISKTFLNELANQALTNPNDFTKGEKKQESFLLEYVSANPTGPLHIGHARGAVFGDTLTRLARHL
GYKFNTEYYVNDAGNQIYLLGLSILLSVKESILHENVEYPEQYYKGEYIVDLAKEAFEKFGKEFFSEENI
PSLADWAKDKMLVLIKQNLEQAKIKIDSYVSERSYYDALNATLESLKEHKGIYEQEGKIWLASSQKGDEK
DRVIIREDGRGTYLAADIVYHKDKMSRGYGKCINIWGADHHGYIPRMKAAMEFLGFDSNNLEIILAQMVS
LLKDGEPYKMSKRAGNFILMSDVVDEIGSDALRYIFLSKKCDTHLEFDISDLQKEDSSNPVYYINYAHAR
IHQVFAKAGKKIDDVMKADLQSLNQDGVNLLFEALNLKAVLNDAFEARALQKIPDYLKNLAANFHKFYNE
NKVVGSANENDLLKLFSLVALSIKTAFSLMGIEAKNKMEH

>gi|45657950|ref|YP_002036.1| arginyl-tRNA synthetase [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130]
MSRSLWIARHMKENETLKQIVLKTLEESVNSLISSFPEVEKEAFKIKIEYSRDEKFGDYSTSFALENSKL
LKRNPIQVSKELVEILQKRTDLFEKVDFTPPGFVNFRISTSFLLNYIETSVLSGNYFPKVDLPLKINLEF
VSANPTGPLNIVSARAAANGDTMASLLKAIGHNVDKEFYINDYGNQVFLLGVSTLVRIRELKGEEGTQQE
TTDDTPIEIILEKNILPAEGYRGEYIKDIASSLLKDPKKNVTIENLLKQKKYKELAELCAVWTIENNLIW
QRKDLDAFGVEFDCYFSERTLHEADKVLSVMKDLEKSGKIFQEDGKKVFRSTEYGDDKDRVVVRDDGRPT
YLLADIAYHKDKIERGYDKIYDIWGPDHHGYISRLSGAVQSLGYKKENFKVIISQQVNLLESGQKVKMSK
RAGSFQTMSDLIGFLGKHGKDVGRYFFVMRSLDAPLDFDLDLAKDESDKNPVFYLQYAHARICSIFKEVG
DQTSKEAAAILEMSEERKRLLFWIARFPEEIFDSANAMEPHRVTNYLQSFAKAFTSFYLAKDNRLKDASK
EVRLGLARICLAAKNVLAEGLKLIGVSAPERMEKEN

>gi|39996911|ref|NP_952862.1| arginyl-tRNA synthetase [Geobacter sulfurreducens PCA]
MSQIEGSMKDAVRDLVREALERSFADGTLASGHVPDIVVEKPALEEHGDFACTAAMLMAKAEKKAPRAIA
EIIITHLNDRESLVESVEIAGPGFINFRMRTSAWCRVLRRIEREGGDYGKSEAGAGKKVQVEFVSANPTG
PLHIGHGRGAAIGDTICRLLAAIGWDVTREFYYNDAGQQIANLALSVQARCLGVEPGGPLWPTDGYQGEY
IKDVARSYLNRETVDAGDQHVTAAGDPHDVEAIRRFAVAYLRREQDQDLRAFDVGFDVYFLESSLYAEGR
VDDVVQRIIAKGHAYEQDGALWLRTTEFGDDKDRVMRKSDGSYTYFVPDVAYHLNKWERGFIRVVNEQGA
DHHSTITRVRAGLQALDAGIPKGWPEYVLHQMVTVMRGGEEVKISKRAGSYVTLRDLVDEVGRDATRFFF
LMRKPDSQLVFDIDLAKQQTLENPVYYVQYAHARICSIFENAADKGVVPPTVDQASLESLGTPEELTLVK
LLSSFPEIVEGSALNFEPHRITYYLQELAGAFHSFYNKNRVITEDADLTGARLLLLHSTATVIRNGLGLL
GVSAPEKM

>gi|15840742|ref|NP_335779.1| arginyl-tRNA synthetase [Mycobacterium tuberculosis CDC1551]
MTPADLAELLKATAAAVLAERGLDASALPQMVTVERPRIPEHGDYASNLAMQLAKKVGTNPRELAGWLAE
ALTKVDGIASAEVAGPGFINMRLETAAQAKVVTSVIDAGHSYGHSLLLAGRKVNLEFVSANPTGPIHIGG
TRWAAVGDALGRLLTTQGADVVREYYFNDHGAQIDRFANSLIAAAKGEPTPQDGYAGSYITNIAEQVLQK
APDALSLPDAELRETFRAIGVDLMFDHIKQSLHEFGTDFDVYTHEDSMHTGGRVENAIARLRETGNIYEK
DGATWLRTSAFGDDKDRVVIKSDGKPAYIAGDLAYYLDKRQRGFDLCIYMLGADHHGYIARLKAAAAAFG
DDPATVEVLIGQMVNLVRDGQPVRMSKRAGTVLTLDDLVEAIGVDAARYSLIRSSVDTAIDIDLALWSSA
SNENPVYYVQYAHARLSALARNAAELALIPDTNHLELLNHDKEGTLLRTLGEFPRVLETAASLREPHRVC
RYLEDLAGDYHRFYDSCRVLPQGDEQPTDLHTARLALCQATRQVIANGLAIIGVTAPERM

>gi|34556632|ref|NP_906447.1| ARGINYL-TRNA SYNTHETASE [Wolinella succinogenes DSM 1740]
MHHTIKHLLETTLGFSVVLEKPKDKNHGHYATPAAFSLAKELKKNPALIAQELSKKLSEIEVFESVQSVG
GYINFRLKQGFLDAQASLALSQGREFGKGDKQGSILLEYVSANPTGPLHIGHARGAVLGDALSRIGRHLG
YALETEYYVNDAGNQIHLLGLSIYLAGRDSLLSLPVTYPEQYYRGEYIVDIAKEALKKWGEKAFADEAFI
PELSLFGKELMLEEIRSNLADTHIHFDHYVSEKSLYPRWEETYALLQSHQGCYEGGGKVWLRSSAHGDEK
DRVIVRESGEPTYLAGDIIYHADKFARPYDRYINIWGADHHGYIARVKAAIEFLGHDSSKLEVLLSQMVT
LLKGGQPYKMSKRAGNFILMRDVLEDIGADALRFIFLSKKPDTHLEFDVDDLNKEDSSNPIFYINYAHAR
IHTMLGKSSLDSQEIEAASLEGLEDSIFDLLFLSLQLPQVLEDSFENRAIQKVAEYLRALAGEFHKFYNE
HKILETPQEAALLKVCKVVALSLSQGLALLGITAKERM

>gi|16120221|ref|NP_395809.1| ArgS [Halobacterium sp. NRC-1]
MLYNLRQELLAGIRAATSDAGYDYEVDQSAIELEDITDEEKGEFSSPISFSIAAAAGAPPVDVAAAIADA
HRSNGLPAEVEAVTVEGGHINYHADTTDLADATLSTILRDGSEYGTRTDADPDTILADVSSPNIAKPLHV
GHLRNTILSDAVMNILEARGHDVTRDNHLGDWGVQFGNLMHEYTEFGDEATLEDDAIEHLLDLYQQFEQR
DSMLADLEDDETVTDQFADAVTEERDYHADSGKEWFTRLEQGDEDATALWERFRTVSIDRFKQTYDDLDV
AFDVWNGESFYAQEGWNDVIIEKAIENDVAMRGEGESVYIPVYPDDYENVGDPQAADVDASLDRARQMRE
ANDDLEDADFDPFYIVKSDGSTLYGTRDLATIEYRIEEYDADQSVYVVANEQNQYFQQLFVAARKMGYND
IKLKHIDYGLISLPEGSMSTRKGQIITAREVLDRAQDRAEEIIAEKGRIDDAEAQSVATKIALATIKYEM
VAAKRERDTTFDIDESVALEGDTGPYVQYAATRGYSILDGADAAPEIDDLDPSVFNDTDVELLFELARYP
LVLERCEERYDAAPLAHYLLQLAHVFNSFYHKNAVLDAENARTERLLLTKATTQIFDNGLGLLGIDVLEE
M

>gi|21227450|ref|NP_633372.1| Arginyl-tRNA synthetase [Methanosarcina mazei Go1]
MFLELKAQATSILKEAIRKAGFEVEDSELQFETSPHADLASRAAFRLAGIHRQNPKDLASRIVSAVEIPE
GSFIGKVSAAGPYINFFAGKHYLNGTVNAVLKEKEKFGCGAPKDRILLEHTSANPNGPLHVGHIRNSIIG
DTLARILRRAGYDVEVQYYVNDMGRQIAVVSWACERFELDLSRKSDSAIADVYIKANVELDKNPGYIKEI
DALMEKVEAGDVRTIEHFDKAVSLAVAGIKETLLRLNVAHDKFVSESTFLKSGAVHDIVERIKATGRTKT
DKGALVVDLSDYGFEKTLVIQRSNGTSLYTTRDLAYHEWKAGQADRIIDVFGADHKLISGQLRATLNAIG
VKEPEVVIFEFVSLPEGSMSTRRGQFISADDLFDRVTGAAFEQVETRRPETSYEFKKQVAEAVGLGAVRY
DIVRVSPEKSTVFNWKEALDFEKQGAPYIQYSHARACSILEKAKEEAAWNPDKEIDPSLLVEDSEIDLIK
KMAMFDSVIDLGARELKPHVLAIYARELADAFNQFYRFVPVIAAEDENVRAARLALVDCARVVLANSLDT
LGIIAPESM

>gi|16081423|ref|NP_393761.1| arginine--tRNA ligase related protein [Thermoplasma acidophilum DSM 1728]
MLLFQDLRKDIYEIVSKRFRISENDVYLDDTGHSDITIRVFRILKSPDGGENAVMEIVRSISEKDYVEKA
LSEGGYINVWIKRTYMLREVLESIEKSGTYPDVFQEAERVSVEHTSANPTGPLHIGRARNSIIGDSIYRI
LSRYGYRTVRQYFVNDSGKQMISLYTAYIKYGGPITIENLLENYQKIYREMEKDQSIEKEIEKNIERYEN
ADPEVFGTLRKIAGVMLDGIASTLKRIGIEFDEFDWESDLLLNGSVRKAIDMLETKEEDSARYIEISGKK
VFLTRKDGTTLYFARDIAYHLFKAENSEWIIDVLGEDHKDHAKSLNHVLKEMLKLENRVSFMYYSFITLE
TGKMSTRRGNIVTLQDLVDRTYDEALKIVNEKRPDLSEEERKKIAEVIASSAVRYSIIRVSAPKPITFRW
EEALNFESNSAPFIMYSHARAASILDKAPEPEQSYGMDMPKEEADLVKAMYVYPYYLKDAAQDLKPDLIA
AYLISLVQKFNDFYGACRVIGTDPLTYARRIRIVKAYKQILSDAGDLIGIKMLDQM

>gi|15639817|ref|NP_219267.1| arginyl-tRNA synthetase (argS) [Treponema pallidum subsp. pallidum str. Nichols]
MQDLCEMWRHAVARVLSQLQGPAVEPVEGAQLVMEEPPEPGMGDIAFPLFLFAKRVRRSPAQLAQQLCTL
LEEDTSMCAYGTPQARGPYLNVFLNKECVAAHTLDAIFAQGERYGHTQYLQGKRIMVEFSSPNTNKPLHV
GHLRNNAIGESLSRIIAFCGADVFKVNIINDRGVHICKSMCAYQKFAHGKTPAHTGIKSDRFVGDWYVQF
NRYAQQYPEEAEHDVRDLLQRWESADPHVRALWRTMNEWALRGIKQTYERTGISFDKLYFESETYTKGRE
EVRRGLACGVFYQMEDNSIWVDLSSLGLDKKALLRSDGTTMYITQDIGTAIFRAQDWPFDQLLYVVGNEQ
NYHFKVLFFVLRLLGYPWAQQLHHVSYGMVNLPHGRMKSREGTVVDADDILDRLHSAAEEEIAKKGRENA
LKHAQCIAENVAIAALHYFLLQVSPQKDMVFHPEESLSFNGNTGPYLQYMGARISSLLKKVQEDVEQKGP
REVRCDPALLTHEAEWELVKALARFPACVTRAAQGHDPSVITGYLYTLSKSFSRFYHDCPILCEARPDYA
CARLELVRAVRIVLRTAMRLVLIPFLEEM

>gi|39575708|emb|CAE79875.1| argS [Bdellovibrio bacteriovorus HD100]
MIKHDSIRLLATNLLKDAIGRAYPDFSASEDDIYKALVNPPKSDLGDLAFGCFILAKALKTAPPQVATAV
AAQMKGATAVAAGPYINIRFDEQTHGEQVLATILDGSYFKKPLMEKSPKTMIEYSQPNTHKELHVGHMRN
LCLGDAIVRMLRYSGREIVSSTFPGDMGTHVAKCLWYMKKHNQEPVPETEKGEWLGRMYSKANLLLEDQN
GTPQEDINRQELTAILHQLEGKTGPYYDLWLETREWSIELMKKVYAWADVTFDEWYFESEMDSPSAAWVK
QLYAEGKLEMSQGAIGKDLESEKLGFCMLLKSDGTGLYATKDLLLAKHKFEDVKIEKSVYVVDMRQALHF
KQVFRVLEILGFEQAKNCFHLQYNYVELPDGAMSSRKGNIVPLRELVHRMEDHVKTTYLSRYKGEWSEED
VEKIAGQVAKGAIFYGMLRMDTNKKIVFDMNEWLKLDGESGPFVQYSYARISSLGRKFPRTAGAKIDWSR
LNHASERQLMQSLGGFNTAVAAAAENFKPSAICTYLYDLAKSFNVFYHECPIGTEADVATREARLALSEA
VGLTLKNGLAVLGMPAPEKM

Example Output File

>EscherichiacoliK12
MNIQALLSEKVRQAMIAAGAPADCEPQVRQSAKVQFGDYQANGMMAVAKKLGMAPRQLAEQVLTHLDLNG
IASKVEIAGPGFINIFLDPAFLAEHVQQALASDRLGVATPEKQTIVVDYSAPNVAKEMHVGHLRSTIIGD
AAVRTLEFLGHKVIRANHVGDWGTQFGMLIAWLEKQQQENAGEMELADLEGFYRDAKKHYDEDEEFAERA
RNYVVKLQSGDEYFREMWRKLVDITMTQNQITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVE
SEGATVVFLDEFKNKEGEPMGVIIQKKDGGYLYTTTDIACAKYRYETLHADRVLYYIDSRQHQHLMQAWA
IVRKAGYVPESVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALERARRLVAEKNPDMPADELEKL
ANAVGIGAVKYADLSKNRTTDYIFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKAEIDEEQLAAAPVIIRE
DREAQLAARLLQFEETLTVVAREGTPHVMCAYLYDLAGLFSGFYEHCPILSAENEEVRNSRLKLAQLTAK
TLKLGLDTLGIETVERM

>HaemophilusinfluenzaeRdKW20
MNIQSILSDKIKQAMILAGADQSCDALIRQSGKPQFGDYQANGIMAAAKKLGLNPREFAQKVLDNLQLSD
IAEKLEIAGPGFINIFLNPTWLTTEISAALSHKNLGIQATNKQTVVIDYSSPNVAKEMHVGHLRSTIIGD
AVARTLEFLGHNVIRANHVGDWGTQFGMLIAYLEKMQNEHASEMELQDLEAFYREAKKHYDEDEVFAEKA
RNYVVKLQSGDEYCRTMWKRLVDITMQQNQHNYARLNVTLTEKDVMGESLYNPMLPSIVKDLKKQGLAVE
NDGALVVYLDEFKNKDGDPMGVIVQKKDGGFLYTTTDIAAAKYRYETLKANRALVFSDTRQSQHMQQAWL
ITRKAGYVPDSFSLEHKNFGMMLGKDGKPFKTRTGGTVKLADLLDEAIERATVLINEKNTNLSNDEKEAV
IEAVGIGAVKYADLSKNRTTDYVFDWDNMLSFEGNTAPYMQYAYTRIRSIFNKTDINSTALLAAPLTIKD
DKERTLAIKLLQFEEAVQTVGKEGTPHVLCAYLYELAGIFSSFYEHCPILNAEDESIKLSRLKLALLTEK
TLKQGLTLLGIKTVEKM

>VibriocholeraeO1biovareltorstr.N16961
MICLTSKMMAFLPYFLELKGKRVNIQALINDRVSQAIEAAGAPAGTPALVRQSAKAQFGDYQANGIMGAA
KQLGTNPREFAQKVLDVLNLEGIASKTEIAGPGFINIFLSEEFLAAQAEAALADARLGVAQEAPKTIVAD
YSAPNVAKEMHVGHLRSTIIGDAVVRTLEFLGHKVIRANHIGDWGTQFGMLIANLERVQKASGEVSMELS
DLEAFYRESKKLYDEDEQFAETARNYVVKLQGGDPFCLEMWKKLVDVTMIQNQRNYDRLNVSLTRENVMG
ESMYNDMLPQIVSDLKAKGLAVEDDGAQVVFLEEFKNKDGEPMGVIIQKRDGGFLYTTTDIACAKYRYET
LGADRVLYFIDSRQHQHLMQAWTIVRKAGYIPENVSLEHHAFGMMLGKDGRPFKTRAGGTVRLADLLDEA
QERAKALIESKNPELSAEEKANIANTVAMAAVKYADLSKHRTTDYVFDWDNMLAFEGNTAPYMQYAYTRV
ASVFAKAGVDMNELTGHIQITEEKEKALIAKLLQFEEAVQSVAREGQPHIMCSYLFELAGIFSSFYEACP
ILVAEQESIKQSRLKLAALTAKTIKQGLALLGIDTLERM

>NeisseriameningitidisMC58
MNLHQTVEHEAAAAFAAAGIADSPIVLQPTKNAEHGDFQINGVMGAAKKAKQNPRELAQKVAEALADNAV
IESAEVAGPGFINLRLRPEFLAQNIQTALNDARFGVAKTDKPQTVVIDYSSPNLAKEMHVGHLRSSIIGD
SISRVLAFMGNTVIRQNHVGDWGTQFGMLVAYLVEQQKDNAAFELADLEQFYRAAKVRFDEDPAFADTAR
EYVVKLQGGDETVLALWKQFVDISLSHAQAVYDTLGLKLRPEDVAGESKYNDDLQPVVDDLVQKGLAVED
DGAKVVFLDEFKNKEGEPAAFIVQKQGGGFLYASTDLACLRYRIGRLKADRLLYVVDHRQALHFEQLFTT
SRKAGYLPENVGAAFIGFGTMMGKDGKPFKTRSGDTVKLVDLLTEAVERATALVKEKNPELGADEAAKIG
KTVGIGAVKYADLSKNRTSDYVFDWDAMLSFEGNTAPYLQYAYTRVQSVFRKAGEWDANAPTVLTEPLEK
QLAAELLKFEDVLQSVADTAYPHYLAAYLYQIATLFSRFYEACPILKAEGASRNSRLQLAKLTGDTLKQG
LDLLGIDVLDVM

>PseudomonasaeruginosaPAO1
MKDTIRQLIQQALDQLTADGTLPAGLTPDIQVENTKDRSHGDFASNIAMMLAKPAGMKPRDLAARLVEAI
PAHEQLAKVEIAGPGFLNFFQDHVWLAASLDRALADERLGVRKAGPAQRVVIDLSSPNLAKEMHVGHLRS
TIIGDAVARVLEFLGDTVIRQNHVGDWGTQFGMLLAYLEEQPVDAEAELHDLEVFYRAAKKRFDESPEFA
DRARELVVKLQAGDPDCLRLWTRFNEISLSHCQKVYDRLGVKLSMADVMGESAYNDDLAQVVADLTAKGL
LTEDNGALCVFLEEFKNAEGNPLPVIVQKAGGGYLYATTDLAAMRYRHNVLHADRVLYFVDQRQALHFQQ
VFEVARRAGFVPAGMELEHMGFGTMNGADGRPFKTRDGGTVKLIDLLEEAESRAYALVKERNEQRAERGE
EPFDEVQLREIGRVVGIDSVKYADLSKHRTSDYSFNFELMLSFEGNTAPYLLYACTRVASVFRKLGQGRE
QLGGKIVLEQPQELALAAQLAQFGDLINNVALKGVPHLLCAYLYELAGLFSSFYEHCPILTAEDPAQKDS
RLRLAALTGRTLEQGLELLGLKTLERM

>ThermosynechococcuselongatusBP-1
MVAPIKILGDRLRRALQAALPLDTYPQPLLVPASQVKFGDYQSNVCLSLAKQLGKAPRELAQEVVPHLEV
EDLCQPVEIAGPGFLNFRLKPEFLAATLQAARGSDRLGIPPAREPRRVVVDFSSPNIAKEMHVGHLRSTI
IGDCIARILEFQGHTVLRLNHVGDWGTQFGMLIAYLDEVYPDALTTANALDLGDLVTFYKKAKQRFDSDP
EFQQKARAKVVALQQGEEQSRRAWQLLCEQSRREFQKIYDLLDIQLTERGESFYNPFLPAVIEDLAACGL
LVEDQGAKVVFLEGFTNKEGQPQPLIIQKSDGGYNYATTDLAALRYRIDKDQADWIIYVTDVGQSTHFAQ
VFQVAQRAGWVPPHVTLTHVPFGLVLGEDGKRLKTRSGETIRLIDLLTEAIARSRADLEQRLATEGRTES
PEFIDTVARAIGIGAVKYADLSQNRNSNYVFSYDKMLSLQGNTAPYLLYAYVRVQGLTRRGDIDWCTLSP
DSPLLLEDETEQHLAKHLVQLEETLDLVSTELLPNRLCQYLFELSQLFNQFYDRCPILSAPQPTKQSRLT
LAYLTAQTLKLGLSLLGIPVLDRI

>Nostocsp.PCC7120
MNATQEQLKIKLEQALVAAFGDEYAGVDPILVSASNPKFGDYQANVALSLSKKLGQQPRAIASAIVEKLD
VSEICEKPEIAGPGFINLKLKTAYLEAQLNTIQADTRLGVPTAKHPQREIVDFSSPNIAKEMHVGHLRST
IIGDSIARILEFRGHDVLRLNHVGDWGTQFGMLITYLREVSPEALTTANALDIGDLVSFYRQAKQRFDAD
EAFQETARQEVVRLQAGAADTLHAWKLLCEQSRQEFQVIYDLLDVKLTERGESFYNPLLPTVVENLEKSG
LLVENQGAKCVFLDGFTNREGEPLPLIVQKSDGGYNYATTDLAALRYRIQKDEAKRIIYITDAGQANHFA
QFFQVARKAGWIPDDVELVHVPFGLVLGEDGKKFKTRSGDTVRLRDLLDEAISRAHADVEVRLKAEEREE
TAEFIDKVAEVVGISAVKYADLSQNRTSNYIFSYDKMLDLKGNTAPYMLYAYARIQGISRKGEINFADLG
DNAKVILQHETEFALAKYLLQLGEVISTVEEDLSPNRLCEYLYELSKRFNAFYDRNQGVQVLSAEEPLRT
SRLVLCDLTARTLKLGLSLLGIQVLERM

>Prochlorococcusmarinusstr.MIT9313
MQAHFASEFMLSLAHALESQLRAAIDRAFPEAAASARESGTGLDPQLAPASKPEFGDFQANAALPLAKPL
KQPPRQIAAAIVDQLMVDTAFTAICLTPEIAGPGFINLTVRPECLAAEVQARLADARLGVPLVEGDNDGQ
QPTPVVVDFSSPNIAKEMHVGHLRSTIIGDSLARVLEFRGHPVLRLNHVGDWGTQFGMLITHLKQVAPEA
LETADAVDLGDLVVFYRQAKQRFDDDEAFQTTSREEVVKLQGGDPISLKAWSLLCDQSRREFQKIYDRLD
VRLNERGESFYNAYLESVVEDLNVSGLLVSDDGAQCVFLEGVTGKDGKPLPVIVQKSDGGFNYATTDLAA
MRYRFAAPPQGDGARRVIYVTDAGQANHFAGVFQVAQRAGWIPDAGRLQHVPFGLVQGEDGKKLKTRAGD
TVRLRELLDEAVERAESDLRRRLQEEGRDEDESFIEQVATTVGLAAVKYADLSQNRITNYQFSFDRMLAL
QGNTAPYLLYAVVRIAGIARKGGDLDVTTAELQFSETQEWALVRELLKFDAVIAEVEEELLPNRLCTYLF
ELSQVFNRFYDQVPVLKAEQPSRSCRLALCRLTADTLKLGLSLLGIPTLERM

>ChlamydiatrachomatisD/UW-3/CX
MTTLLSFLTSLCSAAIHQAFPELEELTLDITPSTKEHFGHYQCNDAMKLARVLHKSPRAIAESIVAHIPP
TPFSSIEIAGAGFINFTFSKEFLASQLQTFSKELANGFRAASPQKVIIDFSSPNIAKDMHVGHLRSTIIG
DCLARCFSFVGHDVLRLNHIGDWGTAFGMLITYLQETSQEAIHQLEDLTALYKKAHARFAEDSEFKKRSQ
HNVVALQSGDAQALALWKQICSVSEKSFQTIYSILDVELHTRGESFYNPFLAEVVADLESKNLVTLSDGA
KCVFHEAFSIPLMIQKSDGGYNYATTDVAAMRYRIQQDQADRILIVTDSGQSLHFQLLEATCLAAGYLPS
KGIFSHVGFGLVLDTQGRKFKTRSGENIKLRELLDTAVEKAKESLKAHRPDISEEELAYQGPILGINAIK
YADLSSHRINDYVFSFEKMLRFEGNTAMSLLYAYVRIQGIKRRMGLESPPQEGPLAVHEPAEEALALTLL
RFPEILDLTLRELCPHFLTDYLYALTNKFNAFFRDCHIEGSDSQQERLYLCGLTERTLSTGMHLLGLKTL
NHL

>ChlamydophilapneumoniaeJ138
MSTLLSILSVICSQAIAKAFPNLEDWAPEITPSTKEHFGHYQCNDAMKLARVLKKAPRAIAEAIVAELPQ
EPFSLIEIAGAGFINFTFSPVFLNQQLEHFKDALKLGFQVSQPKKIIIDFSSPNIAKDMHVGHLRSTIIG
DSLARIFSYVGHDVLRLNHIGDWGTAFGMLITYLQENPCDYSDLEDLTSLYKKAYVCFTNDEEFKKRSQQ
NVVALQAKDPQAIAIWEKICETSEKAFQKIYDILDIVVEKRGESFYNPFLPEIIEDLEKKGLLTVSNDAK
CVFHEAFSIPFMVQKSDGGYNYATTDLAAMRYRIEEDHADKIIIVTDLGQSLHFQLLEATAIAAGYLQPG
IFSHVGFGLVLDPQGKKLKTRSGENVKLRELLDTAIEKAEEALREHRPELTDEAIQERAPVIGINAIKYS
DLSSHRTSDYVFSFEKMLRFEGNTAMFLLYAYVRIQGIKRRLGISQLSLEGPPEIQEPAEELLALTLLRF
PEALESTIKELCPHFLTDYLYNLTHKFNGFFRDSHIQDSPYAKSRLFLCALAEQVLATGMHLLGLKTLER
L

>StreptomycescoelicolorA3(2)
MASVTSLSDSVQQHLASALTATRPEAAGADPLLRRSDRADYQANGILALAKKTKANPRELAAEVVARITT
GDELIEDVEVSGPGFLNITVADRAITANLAARLADGERLGVPLKQDAGTTVVDYAQPNVAKEMHVGHLRS
AVIGDALRSMLDFTGEKTIGRHHIGDWGTQFGMLIQYLFEHPGELAPAGDIDGEQAMSNLNRVYKASRAV
FDTDEEFKERARRRVVALQSGDKETLDLWQQFVDESKVYFYSVFEKLDMEIRDEEIVGESAYNDGMPETA
RLLEEMGVAVRSEGALVVFFDEIRGKDDQPVPLIVQKADGGFGYAASDLTAIRNRVQDLHATTLLYVVDV
RQSLHFRMVFETARRAGWLGDEVTAHNMGYGTVLGADGKPFKTRAGETVRLEDLLDEAVQRAAEVVREKA
RDLTEDEIQERAAQVGIGAVKYADLSTSPNRDYKFDLDQMVSLNGDTSVYLQYAYARIQSILRKAGEVRP
AAHPELALHEAERALGLHLDAFGPTVFEAAAEYAPHKLAAYLYQLASLYTTFYDKCPVLKAETPEQVENR
LFLCDLTARTLHRGMALLGIRTPERL

>RhodopirellulabalticaSH1
MHLPNVLQARFVQALEPLTDSPSDYAGMIRPAADPKFGDYQSNAAMPLAKRVGKTSRDVAAELVQNLNVT
DLFEEPEVAGPGFINLRLKDSVLFDSIQQMLLDERVGVSKTTDPKKVIVDFSSPNVAKPMHVGHIRSTVI
GDCLARTLRFYGEDVVTDNHLGDWGTQFGIIIYGYRNFGDPAKVAANPVPELSALYRLTNQLIEYQKAKQ
SLATMADKLATAKSDAKTAKEVSDQSESDENLKPKDKKKLRKNAEAATRRVASIEADMKSLKAKIDAVDS
DTELSKLASEHSDVDVAVLRETAKLHEGDPENLALWKEFLPHCQDEINRIYDRLNVQFDHTLGESFYHDR
LAGVVDHLTTLGLTTKSDGAICVFLEGFDSPMIIQKRDGAFLYATTDLATLQYRRDEFQPDEILYVVDSR
QGEHFKKFFAMAEPLGMAEVQLVHVNFGTVLGPDGRPMKTRSGSLIGLESLLNDAVSRAKEVVCNPDRLA
TMDPPMGGEEQQQIAEIVGIGAIKYADLSHHRTSDYKFDVDKMVALEGNTATYVQYSYARTQSILRRASD
GEGLPAFEQAIEQAAATQPMTFTHPNERSLALMLMRFEEAIEQVRLNYAPNALCDYLFETAKTYSSFNES
CRVLGNDDPAVMQTRLALVVLTGRVLKKGLSLLGIDVAERM

>Fusobacteriumnucleatumsubsp.vincentiiATCC49256
MKITSKELTDIFQKHVESLFPNKELKPVEITVATNENFGDYQCNFAMINSKIIGDNPRKIAEEIKNNFSC
GDVVEKLEVAGPGFINIFLSDKYISNSIKKIGENYDFSFLNRKGKVIIDFSSPNIAKRMHIGHLRSTIIG
EAVCRIYKFLGYDVVADNHIGDWGTQFGKLIVGYRKWLNREAYEKNAIEELERVYVKFSDEAEKDPSLED
LARAELKKVQDGEEENTKLWKEFITESLKEYNKLYKRLDVHFDTYYGESFYNDMMGDVVKELVDKKIAVD
DDGAKVVFFDEKDNLFPCIVQKKDGAYLYSTSDIATVKFRKNTYDVNRMIYLTDARQQDHFKQFFKITDM
LGWNIEKYHIWFGIIRFADGILSTRKGNVIKLEELLDEAHSRAYDVVNEKNPNLSEEEKQNIAEVVGVSS
VKYADLSQNKQSDIIFEWDKMLSFEGNTAPYLLYTYARIQSILRKVTEQNIDLNKNIEIKTDNKFEKSLA
TYLLVFPISVLKAAETFKPNLIADYLYELSKKLNSFYNNCPILNQDIETLKSRALLIKKTGEVLKEGLGL
LGIPVLNKM

>CaulobactercrescentusCB15
MNDLKRSLSEAAAAAFQAAGLPPEFGRVTASDRPDLADFQCNGALAAAKSAKRNPREIAVQVVDILKGDP
RLASVEIAGVGFINMRVSDEALSARAREIASDDRTGAQLLETPRRVLIDYAGPNVAKPMHVGHLRASIIG
ESVKRLYRFRGDDVVGDAHFGDWGFQMGLLISAIMDEDPFINALMEKLPEAPRGFSSADEAKVMAEFEKR
ITLADLDRIYPAASVRQKEDPAFKERARKATAELQNGRFGYRLLWRHFVNVSRVALEREFHALGVDFDLW
KGESDVNDLIEPMVLQLEAKGLLVQDQGARIVRVAREGDKRDVPPLLVVSSEGSAMYGTTDLATILDRRK
SFDPHLILYCVDQRQADHFETVFRAAYLAGYAEEGALEHIGFGTMNGADGKPFKTRAGGVLKLHDLIEMA
REKARERLREAGLGAELSEEQFEDTAHKVGVAALKFADLQNFRGTSYVFDLDRFTSFEGKTGPYLLYQSV
RIKSVLRRAAESGAVAGRVEIHEPAERDLAMLLDAFEGALQEAYDKKAPNFVAEHAYKLAQSFSKFYAAC
PIMSADTETLRASRLTLAETTLRQLELALDLLGIEAPERM

>StreptococcuspneumoniaeR6
MNTKELIASELSSIIDSLDQEAILKLLETPKNSEMGDIAFPAFSLAKVERKAPQMIAAELAEKMNSQAFE
KVVATGPYVNFFLDKSAISAQVLQAVTTEKEHYADQNIGKQENVVIDMSSPNIAKPFSIGHLRSTVIGDS
LSHIFQKIGYQTVKVNHLGDWGKQFGMLIVAYKKWGDEEAVKAHPIDELLKLYVRINAEAENDPSLDEEA
REWFRKLENGDEEALALWQWFRDESLVEFNRLYNELKVEFDSYNGEAFYNDKMDAVVDILSEKGLLLESE
GAQVVNLEKYGIEHPALIKKSDGATLYITRDLAAALYRKNEYQFAKSIYVVGQEQSAHFKQLKAVLQEMG
YDWSDDITHVPFGLVTKEGKKLSTRKGNVILLEPTVAEAVSRAKVQIEAKNPELENKDQVAHAVGVGAIK
FYDLKTDRTNGYDFDLEAMVSFEGETGPYVQYAYARIQSILRKADFKPETAGNYSLNDTESWEIIKLIQD
FPRIINRAADNFEPSIIAKFAISLAQSFNKYYAHTRILDESPERDSRLALSYATAVVLKEALRLLGVEAP
EKM

>Clostridiumperfringensstr.13
MDYKKLVAERIKEHVDLELENIEKLIEIPPKPEMGDFAFPCFQLAKVMRKAPNMIAAELAEKINKEGFER
VECLGPYLNFFVDKVAFSKNIISKVLEEGDKYGSSKIGEGKNVVVEYSSPNIAKPFHVGHLFTTAIGHSL
YRMLNFEGYNPIRINHLGDWGTQFGKLISAYKRWGNEEALEEAPINELLRIYVKFHDEAENNPELEDEGR
MYFKKLEDGDQEAVALWERFKDLSLKEFNKIYDMLGVDFDSWAGESFYNDKMDKVVEELEKANILTESNG
AKVVMLDEYNMPPCIVVKSDGASIYATRDLAAASYRHKTYNFDKCIYVVGKDQILHFNQVFKTLELAGNE
WAKNCVHIPFGLVKFADRKLSTRKGNVVLLEDLLNEAIDKTRETIEEKNPQLENKEEVAKKIGIGAILFT
YLKNSRERDIVFDWKEMLSFDGETGPYVQYSYARAKSILRKAEEQKITAEPDFTKLTSKEEFELAKTLEG
LQKAVILGIDKLEPSVVTRYSIEVAKAFNKFYNNHTVLNVEDEGLKAARLELIKATAQVIKNALFLIGID
VVEKM

>Staphylococcusaureussubsp.aureusN315
MNIIDQVKQTLVEEIAASINKAGLADEIPDIKIEVPKDTKNGDYATNIAMVLTKIAKRNPREIAQAIVDN
LDTEKAHVKQIDIAGPGFINFYLDNQYLTAIIPEAIEKGDQFGHVNESKGQNVLLEYVSANPTGDLHIGH
ARNAAVGDALANILTAAGYNVTREYYINDAGNQITNLARSIETRFFEALGDNSYSMPEDGYNGKDIIEIG
KDLAEKHPEIKDYSEEARLKEFRKLGVEYEMAKLKNDLAEFNTHFDNWFSETSLYEKGEILEVLAKMKEL
GYTYEADGATWLRTTDFKDDKDRVLIKNDGTYTYFLPDIAYHFDKVKRGNDILIDLFGADHHGYINRLKA
SLETFGVDSNRLEIQIMQMVRLMENGKEVKMSKRTGNAITLREIMDEVGVDAARYFLTMRSPDSHFDFDM
ELAKEQSQDNPVYYAQYAHARICSILKQAKEQGIEVTAANDFTTITNEKAIELLKKVADFEPTIESAAEH
RSAHRITNYIQDLAAHFHKFYNAEKVLTDDIEKTKAHVAMIEAVRITLKNALAMVGVSAPESM

>RhodopseudomonaspalustrisCGA009
MAELPMSTHLFARLLSRVHAVCAALIEEGALPAGIDLSRVVVEPPKDASHGDMATNAAMVLAKDAKAKPR
DLADKIADKLRAEELIDQVAIAGPGFINLTLKPAVWAEALRAVLDAGAGYGRSTVGGGEKVNVEYVSANP
TGPMHVGHCRGAVFGDALANLLDTAGYDVTREYYINDAGAQVDVLARSAFLRYREALGETIGEIPEGLYP
GDYLKPVGEALKAEHGAALKDMPEAQWLPTVRATAIAMMMEAIKGDLAALNITHEVFFSERSLIEGGRNR
VAETIEFLRAKGDVYQGRLPPPKGAPVEDYEDREQTLFRATAYGDDVDRPLLKSDGSYTYFASDIAYHKV
KFDAGFANMVDVWGADHGGYIKRMQAAIQAVTAGKGALDVKIVQLVRLLRNGEPVKMSKRAGDFVTLREV
VDEVGSDAVRFMMLFRKNDAVLDFDLAKVIEQSKDNPVFYVQYGHARGHSIFRNAREVVPDLPEDSKARA
AMLRQAPLERLNDPAELELLKRLALYPRIVEAAAQAHEPHRIAFYLNELASEFHALWTHGRDLPHLRFII
NNDAEITRARLAMVQGVVSVLASGLAILGVTAPDEMR

>ListeriainnocuaClip11262
MNVMQENQIKLIEHIKQAVVQAVGLEETEVPEILLEVPKDKKHGDYSTNIAMQLARVAKKAPRQIAESIV
PELKKDTKLIKEVEIAGPGFINFYLDNAYLTDLVPVILTEDKKYGESDFGKGEKFQIEFVSANPTGDLHL
GHARGAAIGDSLANIMKMAGFDVSREYYINDAGNQINNLVLSAEARYFEALGLESEFPEDGYRGSDIIAL
GKDLAAKYGDKYVNASEEERRSVFRVDALAFETGKLRADLEEFRVSFDEWFSETSLYEENKVLPALERLR
ENGYIYEQDGATWLRTTDFEDDKDRVLIKSDGSYTYFLPDIAYHLNKLERGFDVLIDIWGADHHGYIPRM
RAAIEALGYSPNQLEVEIIQLVHLFEDGVQVKMSKRTGKSVTMRDLIEEVGLDATRYFFAMRSSDTHMNF
DMSLAKSTSNDNPVYYVQYAHARISSILRSGKEQGLEVSKDANMSLLETEAEYDLLKVLGEFADVVAEAA
VKRAPHRIVRYLNDLATAFHRFYNSNKVLDMDNLEVTKARLALIKTAQITLRNGLTLLGVSAPEKM

>RalstoniasolanacearumGMI1000
MLPSHKQTISQLLSDAVGTLLPEGTNRPEIVLERPKQAAHGDIACNVALQLAKPLGTNPRELANRIADGI
RADARGQRLVSAVEIAGPGFINLRLSPTARTDVLAAVFAEGDRYGAADLHDGAPVLVEFVSANPTGPLHV
GHGRQAALGDALAALLEWQGHKVHREFYYNDAGVQIHNLAVSVQARARGFKPGDTGWPEAAYNGDYIADI
AADYLAGKTVRASDGEPVTGARDVENIEAIRRFAVTYLRNEQDIDLQAFGVKFDHYYLESSLYADGKVQQ
TVDALIAAGKTYEQEGALWLRTTDDGDDKDRVMRKSDGSYTYFVPDVAYHTTKWGRGFTQVINVQGSDHH
GTIARVRAGLQGLDLGIPKGYPDYVLHKMVTVMKDGAEVKISKRAGSYVTVRDLIEWSNGDAESEAGVDT
IRACVESGAPNWPGRFTRGRDAVRFFLLSRKADTEFVFDVDLALKQSDENPVYYVQYAHARICSVFEQWH
AREGGDAASLAGADLAAVAGPEASPQAVALVQRIAAFPDMLADAARELAPHAVAFYLRDLAGDFHAFYNA
DRVLVDDDAVKRARLALLAATRQVLRNGLAVIGVSAPQKM

>BifidobacteriumlongumDJO10A
MSPEALSELISSIAHNLVAAGQAGALTDELIPPVDKLAVMRPKDRAHGDWASNIAMQLAKKAGMKPRDLA
EPFAAALAEADGIAKVEVAGPGFINITLDSASAAAVVDTVLAAGAMTDTDKHLNKVNEYGRNAHLGGQTL
NLEFVSANPTGPIHIGGTRWAAVGDAMARVLEANGAKVVREYYFNDHGEQINRFAKSLVAAWAEANNLGE
AGYQTETPCDGYKGAYINEIAARVQAEAESDGVDLTALAHQDQGLNDDGEPLGEADTEVREEFRKRAVPM
MFDEIQKSMKDFRVNFDVWFHENSLYADGKVDAAIEELKSRGDIFDKDGATWFESTKHGDDKDRVIIKSN
GEFAYFAADIAYYWDKRHRAENPADVAIYMLGADHHGYIGRMMAMCAAFGDEPGKNMQILIGQLVNVMKD
GKPVRMSKRAGNVVTIDDLVSVVGVDAARYSLARSDYNQNFDIDLALLASHTNDNPVYYVQYAHARSKNV
DRNAAVAGISYEGADLALLDTEADGEVLAALAQFPSVLATAADDRQPHKVARYLEELAATYHKWYNVERV
VPMALTDPETRGDDEARKALEIAKNPEPARAAARLKLNDAVQQVIANGLDLLGVTAPEKM

>Agrobacteriumtumefaciensstr.C58
MNIFADFDTRIKNALETLDLVKENREKVDFSRITVESPRDLSHGDVATNAAMVLAKPLGTNPRALAELLV
PALQADGDVDGVNVAGPGFINLKVSVGYWQRLLADMIGQGVDFGRSTVGAGQKINVEYVSANPTGPMHVG
HCRGAVVGDTLANLLAFAGYGVTKEYYINDAGSQIDVLARSVFLRYREALGEDIGSIPSGLYPGDYLVPV
GQALADEYGIKLRAMPEEKWLPIVKDKAIDAMMVMIREDLALLNVRHDVFFSERTLHEGNGGPILSAIND
LTFKGHVYKGTLPPPKGELPDDWEDREQTLFRSTEVGDDMDRALMKSDGSYTYFAADVAYFKNKFDRGFS
EMIYVLGADHGGYVKRLEAVARAVSEGKSKLTVLLCQLVKLFRDGEPVKMSKRSGDFVTLRDVVDEVGRD
PVRFMMLYRKNSEPLDFDFAKVTEQSKDNPVFYVQYAHARCKSIFRQAQEAFPGLAPSAEDMAASVALIS
DINELQLVAKLAEYPRLIESAALSHEPHRLAFYLYDLAGSFHGHWNKGKDHQELRFINDKNRELSIARLG
LVNAVANVLKSGLTLLGADAPDEMR

>Brucellamelitensis16M
MNIFADFDARIKKTLQDIDLKPKDGGELDLSRIGVEPPRDASHGDIATNAAMVLSKAVGQNPRELAARIA
EALKADEDVESVDVAGPGFINLRLKASYWQRELLVMLNEGTDFGRSRLGAGKKVNVEYVSANPTGPMHVG
HCRGAVVGDVLANLLKFAGYDVVKEYYINDAGAQIDVLARSVMLRYREALGESIGEIPAGLYPGDYLVRV
GQELAGEFGTKLLEMPEAEALAIVKDRTIDAMMAMIRADLDALNVHHDVFYSERKLHVDHARAIRNAIND
LTLKGHVYKGKLPPPKGQLPEDWEDREQTLFRSTEVGDDIDRPLMKSDGSFTYFAGDVAYFKDKYDHGFN
EMIYVLGADHGGYVKRLEAVARAVSDGKAKLTVLLCQLVKLFRNGEPVRMSKRAGEFITLRDVVDEVGRD
PVRFMMLYRKNDAPLDFDFAKVTEQSKDNPVFYVQYASARCHSVFRQAADQLGLVDLDRVAMGSHFEKLT
DESEIALVRKLAEYPRLIESAAIHQEPHRLAFYLYDLASSFHSQWNRGAENPDLRFIKVNDPDLSLARLG
LVQVVSDVLTSGLTIIGADAPTEMR

>PorphyromonasgingivalisW83
MSILQKLENSAAAAVKALYGTDPMEGQIQLQKTKREFKGHLTLVVFPFVKMSRKSPEATATEIGEWLLAN
ESAVSAIEVVKGFLNLTIAPRVWLELLNEIRADINFGHKVATEDSPLVMVEYSSPNTNKPLHLGHVRNNL
LGYSLSEIMKANGYRVVKTNIVNDRGIHICKSMLAWQKWGDGVTPEKAGKKGDHLIGDFYVLFDKHYKAE
LNSLMAEGKSKEEAEAASTLMAEAREMLRLWEAGDEKVVDLWRTMNQWVYDGFDATYKMMGVDFDKIYYE
SETYLVGKEEVLRGLEEGLFVKHSDGSVWADLTKDGLDEKLLLRADGTSVYMTQDIGTAKMRFNDYPINR
MIYVVGNEQNYHFQVLSILLDRLGFEFGKGLVHFSYGMVELPEGKMKSREGTVVDADDLMDEMIRTAAEI
AAEAGKAAEMDEEESREVARIVGLGSLKYFILKVDPRKNMTFNPKESIDFNGNTGSFVQYTYARIRSLMR
RAEAAGYDIPSQLPTDLPLSEKEEALIQKVSEYAEVVSEAGHSYSPALIANYIYDLVKEYNQFYHDFSVL
KEEDERIRAFRLALSEVVALTMRKGFALLGIEMPERM

>Rickettsiaprowazekiistr.MadridE
MNIFNQLKQDIIAASQKLYNNKEIANTATIETPKDSFNGDLSSNIAMIIASKESIAPREVALKFKEVLVT
LPYIASIEIAGPGFINFTIKAESWQAAIKDILQHEEKFFEIDIDKNSNINIEYVSANPTGPMHIGHARGA
VYGDVLARILQKVGYSVTKEYYVNDAGSQINDLVSTVLLRYKEALGEPITIPVGLYPGEYLIPLGEILSK
EYGNKLLTMNDVERFKIIKSFAVEKMLDLNRKDLADLGIKHDVFFSEQSLYDKGEIEKTVKLLERMGLIY
EGTLPAPKGKVHEDWEYRVQKLFKSTNYGDSQDRPIEKADGSWSYFASDLAYAKDKIDRGANHLIYVLGA
DHSGYVKRIEAIVKALGQEKVKVDVKICQLVNFVENGVPIKMSKRLGSFASVQDVNKEVGKDIIRFMMLT
RQNDKPLDFDLVKVKEQSRENPIFYVQYAHVRTKSILSKARELMPEAYNSFKEGKYNLSLLSSEEEIEII
KLLAAWTKTLEASVKYFEPHRIAFYLINLASKFHSMWNFGKENSDYRFIIENNKELTLARLALASVIQKI
IASGLEVIGVEPMVTM

>BordetellapertussisTohamaI
MRGHLRQTPGRPPGGSARPAARQTCRRHLPFCRLPMLLEQQKQLISLIQAAVAQCLPEAQAQVQLERPKV
AAHGDIATNVAMQLAKPARRNPRELAQGIVDALMAQPQARELIQDAEIAGPGFINFRLTPAARQAVVQAV
ASQADAYGRAPRNGEKVLVEFVSANPTGPLHVGHARQAALGDAICRLYDASGWDVTREFYYNDAGNQIDN
LAISVQARGRGIAPDAPDYPADGYKGDYIVEIARDFAARKSVQASDGQPVTATGDLDSLDDIRAFAVAYL
RREQDLDLQAFGLAFDNYFLESSLYASGRVQETVDTLVAKGHTYEEGGALWLRTTELGTGDDKDRVMRKS
EGGYTYFVPDVAYHKVKWERGFHHAVNIQGSDHHGTVARVRAGLQGLAGIPKDFPAYVLHKMVKVMRGGE
EVKISKRAGSYVTMRDLIDWVGRDAVRYFLIQRRADTEFVFDIDLALSKSDENPVYYIQYAHARICTMIG
NSGASAAEIAQADTALLTAPSEYALLQRLAEFPQVVALAAQELAPHHVAFWLRDCASDFHAWYNAERVLV
DEPALKLARLRLAATTRQVLANGLALLGVSAPDRM

>ChlorobiumtepidumTLS
MRAFFLPFIQDALQKAGIETDKEIQIDKPNDKKFGDFSTNIAFLVAKEARKNPRELAGQLIGLLDFPEGT
VTKTEVAGPGFINFHLAPAFFMRSAQEVLAKGEGFGCNESGKGLKAIVEYVSANPTGPLTIGRGRGGVLG
DCIANLLETQGYEVTREYYFNDAGRQMQILAESVRYRYLEKCGQVIEFPETHYQGDYIGEIAETLFIEHG
DGLAATDELTIFKEAAEAVIFSSIRKTLERLLITHDSFFNEHTLYQSREGQPSANQRVIDALDAKGFIGN
YDGATWFMTTKLGQEKDKVLIKSSGDPSYRLPDIAYHVTKFERGFDLMVNVFGADHIDEYPDVLEALKIL
GYDTSKVKIAINQFVTTTVGGQTVKMSTRKGNADLLDDLIDDVGADATRLFFIMRGKDSHLNFDVELAKK
QSKDNPVFYLQYAHARICSLVRMAEKEVGFDEATAIGAGLPLLSSEPEIDLASALLDFPDIIQSSLRQLE
PQKMVEYLHTVAERYHKFYQECPILKADEHLRTARLELSLAVRQVLRNGFKILGISAPESM

>Thermoanaerobactertengcongensis
MENIVQKAKEEIKDVVLKALNEAKKEGLLNFESIQDVEVEEPKEKQHGDLATNFAMVMAREAKMAPRKIA
EIIASKMNTSGTFIEKVEVAGPGFINFFLNQNFLIETLKLIHKRGKDYGRVNLGKGKKVQVEFVSANPTG
PMHMGNARGGAIGDVLASILDYAGYNVSREFYINDAGNQIEKFGYSLEARYLQLLGIDAEVPEGGYHGED
IIDRAKEFLEIHGDKYKDVPSEERRKALIEYGLKKNIEKMKEDLVLYGIEYDVWFSEQSLYDSGEVYKVI
EELTEKGYTYEKDGALWFKMTLFGAEKDDVLVRSNGVPTYLASDIAYHKNKFVTRGFDWVINVWGADHHG
HVAPMKGAMKALGIDPNRLDVVLMQLVKLIEGGQVVRMSKRTGKMITLRDLIEEVGKDAARFFFNMRSPD
SPIEFDLDLAKQQTNENPVFYVQYAHARICSIIRQLEEMGVKIENIEDVDLGLLKEEEEVDLIKKLAYFP
EEITIAAKTLAPHRITRYVIDVASLFHSFYNSHRVKGAEENLMKARFALILAVKTVLKNALDILKVTAPE
RM

>HelicobacterpyloriJ99
MHTLIKGVLEEILEAEVIIEYPKDREHGHYATPIAFNLAKVFKKSPLAIAEELALKIGSHEKTQGFFDRV
VACKGYINFTLSLDFLERFTQKALELKEQFGSQVKSERSQKIFLEFVSANPTGPLHIGHARGAVFGDSLA
KIARFLGHEVLCEYYVNDMGSQIRLLGVSVWLAYKEHVLKESVTYPEVFYKGEYIIEIAKKAHNDLEPSL
FKENEETIIEVLSDYAKDLMLLEIKGNLDALDIHFDSYASEKEVFKHKDAVFDRLEKANALYEKDSKTWL
KSSLYQDESDRVLIKEDKSYTYLAGDIVYHDEKFQQNYTKYINIWGADHHGYIARVKASLEFLGYDSSKL
EVLLAQMVRLLKDNEPYKMSKRAGNFILIKDVIDDVGKDALRFIFLSKRLDTHLEFDVNTLKKQDSSNPI
YYIHYANSRIHTMLEKSPFSKEEILQTPLKNLNAEEKYLLFSALSLPKAVESSFEEYGLQKMCEYAKTLA
SEFHRFYNAGKILDTPKAKELLKICLMVSLSLTNAFKLLGIEIKTKISSKD

>Xylellafastidiosa9a5c
MLTRFSYKRSDKITLSIATHPHPHVKAPLRALICQGIEALRSNGTLPTNTLPPDFVVERPKTRKHGDFAT
NVAMLLSKATGSNPRLLAQTLVAALPTSADIARIEIAGPGFINFHLHPVAYQRETINVLKQDNDYGRNLS
GQSRTVGVEYVSANPTGPLHVGHGRAAAIGDCLARLLEANGWNVKREFYYNDAGVQIENLVRSVQARARG
LKPGDAFWPTDAYNGEYIADIAKAYLAGDSINMVDTIITSTKNVDDTAAIHHFAVNYLRNEQNHDLAAFN
VDFDIYFLESSLYKDGKVEETVQKLINSGHTYEEGGALWLKSTHFGDDKDRVMRKSDGSYTYFVPDIAYH
LSKWQRGYERAITELGADHHGSLARVHAGLQALEIGIPPGWPEYVLHQMVTVMRGGEEVKLSKRSGGYVT
LRDLIEETSTDATRWFLIARKPDSQLTFDIDLARQKSNDNPVFYVQYAYARVCSLMHQAHEKNLNYDQTS
GMASLDQLSDNTSLCLMIEISRYPEIVQIACELLEPHLIAQYLRELAHAFHTWYHNTPVLVENAVERNAK
LTLACATRQVLANGLNLLGVGTPEKM

>BacteroidesfragilisYCH46
MKIEDKLVTSVISGLKALYGQDVPAAQVQLQKTKKEFEGHLTLVVFPFLKMSKKGPEQTAQEIGEYLKAN
EPAVAAFNVIKGFLNLTVASATWIELLNEIHADAQYGIVSADENAPLVMIEYSSPNTNKPLHLGHVRNNL
LGNALANIVMANGNKVVKTNIVNDRGIHICKSMLAWQKYGKGETPESSGKKGDHLVGDYYVAFDKHYKAE
VAELMEKGMSKEEAEAASPLMNEAREMLVKWEAGDPEVRALWQMMNNWVYTGFDETYRKMGVGFDKIYYE
SNTYLEGKEKVMEGLEKGFFFKKEDGSVWADLTAEGLDHKLLLRGDGTSVYMTQDIGTAKLRFADYPIDK
MIYVVGNEQNYHFQVLSILLDKLGFEWGKSLVHFSYGMVELPEGKMKSREGTVVDADDLMAEMIATAKET
SQELGKLDGLTQEEADDIARIVGLGALKYFILKVDARKNMTFNPKESIDFNGNTGPFIQYTYARIRSVLR
KAAEAGIVIPEVLPANIELSEKEEGLIQMVADFAAVVRQAGEDYSPSGIANYVYDLVKEYNQFYHDFSIL
REENEDVKLFRIALSANIAKVVRLGMGLLGIEVPDRM

>DeinococcusradioduransR1
MDLKAQLKAAVEQAAHQMGMPVDAAIQETPANKPGDYGTPAAFQMAKAAGGNPAQIAAQLAQTVVLPAGI
RRVEATGPFLNFFLDAGAFVRGVVERPFELPKREGKVVIEHTSVNPNKELHVGHLRNVVLGDSMARILRA
AGHTVEVQNYIDDTGRQAAESLFATQHYGRVWDGVQKYDQWLGEGYVQLNADPQKPELESGIMEIMHKLE
AGELRPLVEQTVKAQLQTCFRLGARYDLLNWESDVVGSGFLAQAMNILEGSRYTSRPTEGKYAGAFIMDV
SEFMPGLEEPNVVLVRSGGTAMYAAKDIGYQFWKFGLFEGMKFKPFMQDPEGNTIWTSAPDGQPDDERRF
GHAQEVINVIDSRQDHPQTVVRSALGVAGEQEKEERSIHLSYAFVTLEGQTISGRKGIAVSADDAMDEAQ
KRALSVLQGINPDLAAREDAAEIARRIGLGAIRFAMLKAEPTRKIDFRWEQALALNGDTAPYVQYAAVRA
ANILKKAEEAGYATDGTGADWDALPDIDLVLAKQIAKLPEVAAQAARIHSPHVVAQYALDLATSFNAWYN
AKTKQGKPATNVLQSEEGLREARLALIVRLRKAFEDTLDLIGIEIPAAM

>Bacillussubtilissubsp.subtilisstr.168
MNIAEQMKDVLKEEIKAAVLKAGLAEESQIPNVVLETPKDKTHGDYSTNMAMQLARVAKKAPRQIAEEIV
AHFDKGKASIEKLDIAGPGFINFYMNNQYLTKLIPSVLEAGEAYGETNIGNGERVQVEFVSANPTGDLHL
GHARGAAVGDSLCNVLSKAGYDVSREYYINDAGNQINNLALSVEVRYFEALGLEKPMPEDGYRGEDIIAI
GKRLAEEYGDRFVNEEESERLAFFREYGLKYELEKLRKDLENFRVPFDVWYSETSLYQNGKIDKALEALR
EKGHVYEEDGATWFRSTTFGDDKDRVLIKKDGTYTYLLPDIAYHKDKLDRGFDKLINVWGADHHGYIPRM
KAAIEALGYEKGTLEVEIIQLVHLYKNGEKMKMSKRTGKAVTMRDLIEEVGLDAVRYFFAMRSADTHMDF
DLDLAVSTSNENPVYYAQYAHARICSMLRQGEEQGLKPAADLDFSHIQSEKEYDLLKTIGGFPEAVAEAA
EKRIPHRVTNYIYDLASALHSFYNAEKVIDPENEEKSRARLALMKATQITLNNALQLIGVSAPEKM

>BorreliaburgdorferiB31
MLKRKKMNKSVKKKIKDEINVIVTNLALSNNIKLDNININIQKPPKSDLGDISILMFEIGKTLKLPIEII
SEEIIKNLKTKYEIKAVGPYLNIKISRKEYINNTIQMVNTQKDTYGTSKYLDNKKIILEFSSPNTNKPLH
VGHLRNDVIGESLSRILKAVGAKITKINLINDRGVHICKSMLAYKKFGNGITPEKAFKKGDHLIGDFYVK
YNKYSQENENAEKEIQDLLLLWEQKDVSTIELWKKLNKWAIEGIKETYEITNTSFDKIYLESEIFKIGKN
VVLEGLEKGFCYKREDGAICIDLPSDSDEKADTKVKQKVLIRSNGTSIYLTQDLGNIAVRTKEFNFEEMI
YVVGSEQIQHFKSLFFVAEKLGLSKNKKLIHLSHGMVNLVDGKMKSREGNVIDADNLISNLIELIIPEMT
QKIENKESAKKNALNIALGAIHYYLLKSAIHKDIVFNKKESLSFTGNSGPYIQYVGARINSILEKYKALS
IPVMEKIDFELLKHEKEWEIIKIISELEENIINAAKDLNPSILTSYSYSLAKHFSTYYQEVKVIDTNNIN
LTAARIEFLKAILQTIKNCMYLLNIPYMLKM

>ThermotogamaritimaMSB8
MLVNAIRQKVSEVISKAYGSEIEFEVEIPPRKEFGDLSTNVAMKLAKTLKKNPREIAQEIVKSLDEDPSF
DRIEIMGPGFINFFLSNELLRGVVKTVLEKKDEYGRENVGNGMKVQFEYGSANPTGPFTVGHGRQIIIGD
VLSEVYKELGYDVTREMYINDAGKQIRLLAQSLWARYNQLLGVEKEIPEGGYRGEYLVDIARDLVNEIGD
RYKDLWNEEVEEFFKQTALNRILSSMKDTLEKIGSSFDVYFSEKSLIEDGTVEEVLKLLKNKDVVYEKDG
AVWLKVSAFIDEEDKVLVRSDGTYTYFMTDIAYHYKKYKRGFRKVYDIWGSDHHGHIPRMKAAMKALDIP
DDFFNVILHQFVTLKRGGEIVRMSTRAGEFVTLDELLDEVGRDATRYFFAMVDPNTHMVFDIDLAKAKSM
DNPVYYVQYAHARIHNLFSNAEKKGVKFEEGKHLELLGNEEERVLMRNLGMFNTALKEVAQMFAPNRLTN
YLQSLAESFHAFYTKHVIVDPENPELSNARLNLALATGIVLRKGLKLIGVSAPERM

>ThermusthermophilusHB27
MLRRALEEAIAQALKEMGVPARLKVARAPKDKPGDYGVPLFALAKELRKPPQAIAQELKDRLPLPEFVEE
AIPVGGYLNFRLRTEALLREALRPKAPFPRRPGVVLVEHTSVNPNKELHVGHLRNIALGDAIARILAYAG
REVLVLNYIDDTGRQAAETLFALRHYGLTWDGKEKYDHFAGRAYVRLHQDPEYERLQPAIEEVLHALERG
ELREEVNRILLAQMATMHALNARYDLLVWESDIVRAGLLQKALALLEQSPHVFRPREGKYAGALVMDASP
VIPGLEDPFFVLLRSNGTATYYAKDIAFQFWKMGILEGLRFRPYENPYYPGLRTSAPEGEAYTPKAEETI
NVIDVRQSHPQALVRAALALAGYPALAEKAHHLAYETVLLEGRQMSGRKGLAVSVDEVLEEATRRARAIV
EEKNPDHPDKEEAARMVALGAIRFSMVKTEPKKQIDFRYQEALSFEGDTGPYVQYAHARAHSILRKAGEW
GAPDLSQATPYERALALDLLDFEEAVLEAAEEKTPHVLAQYLLDLAASWNAYYNARENGQPATPVLTAPE
GLRELRLSLVQSLQRTLATGLDLLGIPAPEVM

>BurkholderiacepaciaR1808
MLPAHKQTLEALLADSVAQVAHALKGADAEFVIPAITLERPKVAAHGDVACNVAMQLAKPLGTNPRQLAE
RIVAALVAQPAAQGLVDAAEIAGPGFINLRVSAAAKQAVIAAVFEQGRAFGTSQREKGKRVLVEFVSANP
TGPLHVGHGRQAALGDVLANVIASQGYAVHREFYYNDAGVQIANLAISTQARARGLKPGDAGWPEAAYNG
EYIADIARDYLNGATVAAKDGEPVTGARDIENLDAIRKFAVAYLRHEQDMDLQAFGVKFDQYYLESSLYS
EGRVEKTVDALVKAGMTYEQDGALWLRTTDEGDDKDRVMRKSDGTYTYFVPDVAYHVTKWERGFTKVINI
QGSDHHGTIARVRAGLQGLHIGIPKGYPDYVLHKMVTVMRDGQEVKLSKRAGSYVTVRDLIEWSGGAAPG
QEAAPDMIDEATITRGRDAVRFFLISRKADTEFVFDIDLALKQNDENPVYYVQYAHARICSVLNELKARY
NVDVAQLPGADLSQLTSPQAVSLMQKLAEYPDLLTHAANELAPHAVAFYLRDLAGEFHSFYNAERVLVDD
EAPRNARAALLAATRQVLENGLAMLGVSAPAKM

>NitrosomonaseuropaeaATCC19718
MVTTTLPDFKSHCIQLLDQAARQVLPDEVGVQIELLRPKLADHGDYSSNLAMKLARRLRRNPLELAKALI
GALPDSSCVEKADVAGGGFINFFLKKTAKQQFLHAVLQAGDSFGHSRLGAGKTIQIEFVSANPTGPLHVG
HGRGAAFGASLANIMTAAGYAVTREFYVNDAGRQMDILTLSTWLRYLDLCGLSFSFPANAYRGQYVADMA
SEIYQAQGDRYAHRSDATIRQLTEISTSTTIDSEDERLDRLITAAKSILDQDYADLHNFVLTEQLADCRN
DLMEFGVEFETWFSEQSLFDSGMVARAVQLLDDKKLLYRQDGALWFRSTDFGDEKDRVVQRENGLYTYFA
SDIAYHLSKYERGFDYLLNIWGADHHGYIPRVKGAIEALSLDPGRLEIALVQFAVLYRDGKKVSMSTRSG
EFVTLRQLRQEVGNDAARFFYVLRKSDQHLDFDLDLAKSQSNDNPVYYVQYAHARICSVLGQWGGAEDIL
ARAETELLTDPAELVLLQKMIDFTDTIEAAAKERAPHLIAFFLRELAGEFHSYYNSTRFLVEDESLKITR
LALISAVRQILSKGLTLLGVTAPREM

>Corynebacteriumglutamicum
MTPADLATLIKETAVEVLTSRELDTSVLPEQVVVERPRNPEHGDYATNIALQVAKKVGQNPRDLATWLAE
ALAADDAIDSAEIAGPGFLNIRLAAAAQGEIVAKILAQGETFGNSDHLSHLDVNLEFVSANPTGPIHLGG
TRWAAVGDSLGRVLEASGAKVTREYYFNDHGRQIDRFALSLLAAAKGEPTPEDGYGGEYIKEIAEAIVEK
HPEALALEPAATQELFRAEGVEMMFEHIKSSLHEFGTDFDVYYHENSLFESGAVDKAVQVLKDNGNLYEN
EGAWWLRSTEFGDDKDRVVIKSDGDAAYIAGDIAYVADKFSRGHNLNIYMLGADHHGYIARLKAAAAALG
YKPEDVEVLIGQMVNLLRDGKAVRMSKRAGTVVTLDDLVEAIGIDAARYSLIRSSVDSSLDMDLGLWESQ
SSDNPVYYVQYGHARLCSIARKAETLGVTEEGADLSLLTHDREGDLIRTLGEFPAVVKAAADLREPHRIA
RYAEELAGTFHRFYDSCHILPKADEDTAPIHTARLALAAATRQTLANALRLVGVSAPEKM

>AquifexaeolicusVF5
MKELVKEKVLKALKELYNTQVENFKVEKPKEEAHGDLASNVAFLLARELKKPPVNIAQELADFLSKDETF
KSVEAVKGFINFRFSEDFLKEEFKKFLLSGEAYFKEDLGKGLKVQLEYVSANPTGPLHLGHGRGAVVGDT
LARLFKFFNYDVTREYYINDAGRQVYLLGISIYYRYLEKCPERDEETFKEIKEIFEKDGYRGEYVKEIAE
RLRKLVGESLCKPEEANLKEVREKILKEESIELYYTKKYEPKDVVDLLSNYGLDLMMKEIREDLSLMDIS
FDVWFSERSLYDSGEVERLINLLKEKGYVYEKDGALWLKTSLFGDDKDRVVKRSDGTYTYFASDIAYHYN
KFKRGFEKVINVWGADHHGYIPRVKAALKMLEIPEDWLEILLVQMVKLFREGKEVKMSKRAGTFVTLREL
LDEVGKDAVRFIFLTKRSDTPLDFDVEKAKEKSSENPVYYVQYAHARISGIFREFKERYKKDVSVEELIN
YVQHLEEEAEIKLIKKVLFFKDELVDITLKREPHLLTYYLIDLAGDFHHYYNHHRILGMEENVMFSRLAL
VKGIKEVVRLGLNLMGVSAPERM

>Campylobacterjejunisubsp.jejuniNCTC11168
MKSIIFNEIKKILECDFALENPKDKNLAHFATPLAFSLAKELKKSPMLIASDLASKFQNHDCFESVEAVN
GYLNFRISKTFLNELANQALTNPNDFTKGEKKQESFLLEYVSANPTGPLHIGHARGAVFGDTLTRLARHL
GYKFNTEYYVNDAGNQIYLLGLSILLSVKESILHENVEYPEQYYKGEYIVDLAKEAFEKFGKEFFSEENI
PSLADWAKDKMLVLIKQNLEQAKIKIDSYVSERSYYDALNATLESLKEHKGIYEQEGKIWLASSQKGDEK
DRVIIREDGRGTYLAADIVYHKDKMSRGYGKCINIWGADHHGYIPRMKAAMEFLGFDSNNLEIILAQMVS
LLKDGEPYKMSKRAGNFILMSDVVDEIGSDALRYIFLSKKCDTHLEFDISDLQKEDSSNPVYYINYAHAR
IHQVFAKAGKKIDDVMKADLQSLNQDGVNLLFEALNLKAVLNDAFEARALQKIPDYLKNLAANFHKFYNE
NKVVGSANENDLLKLFSLVALSIKTAFSLMGIEAKNKMEH

>LeptospirainterrogansserovarCopenhagenistr.FiocruzL1-130
MSRSLWIARHMKENETLKQIVLKTLEESVNSLISSFPEVEKEAFKIKIEYSRDEKFGDYSTSFALENSKL
LKRNPIQVSKELVEILQKRTDLFEKVDFTPPGFVNFRISTSFLLNYIETSVLSGNYFPKVDLPLKINLEF
VSANPTGPLNIVSARAAANGDTMASLLKAIGHNVDKEFYINDYGNQVFLLGVSTLVRIRELKGEEGTQQE
TTDDTPIEIILEKNILPAEGYRGEYIKDIASSLLKDPKKNVTIENLLKQKKYKELAELCAVWTIENNLIW
QRKDLDAFGVEFDCYFSERTLHEADKVLSVMKDLEKSGKIFQEDGKKVFRSTEYGDDKDRVVVRDDGRPT
YLLADIAYHKDKIERGYDKIYDIWGPDHHGYISRLSGAVQSLGYKKENFKVIISQQVNLLESGQKVKMSK
RAGSFQTMSDLIGFLGKHGKDVGRYFFVMRSLDAPLDFDLDLAKDESDKNPVFYLQYAHARICSIFKEVG
DQTSKEAAAILEMSEERKRLLFWIARFPEEIFDSANAMEPHRVTNYLQSFAKAFTSFYLAKDNRLKDASK
EVRLGLARICLAAKNVLAEGLKLIGVSAPERMEKEN

>GeobactersulfurreducensPCA
MSQIEGSMKDAVRDLVREALERSFADGTLASGHVPDIVVEKPALEEHGDFACTAAMLMAKAEKKAPRAIA
EIIITHLNDRESLVESVEIAGPGFINFRMRTSAWCRVLRRIEREGGDYGKSEAGAGKKVQVEFVSANPTG
PLHIGHGRGAAIGDTICRLLAAIGWDVTREFYYNDAGQQIANLALSVQARCLGVEPGGPLWPTDGYQGEY
IKDVARSYLNRETVDAGDQHVTAAGDPHDVEAIRRFAVAYLRREQDQDLRAFDVGFDVYFLESSLYAEGR
VDDVVQRIIAKGHAYEQDGALWLRTTEFGDDKDRVMRKSDGSYTYFVPDVAYHLNKWERGFIRVVNEQGA
DHHSTITRVRAGLQALDAGIPKGWPEYVLHQMVTVMRGGEEVKISKRAGSYVTLRDLVDEVGRDATRFFF
LMRKPDSQLVFDIDLAKQQTLENPVYYVQYAHARICSIFENAADKGVVPPTVDQASLESLGTPEELTLVK
LLSSFPEIVEGSALNFEPHRITYYLQELAGAFHSFYNKNRVITEDADLTGARLLLLHSTATVIRNGLGLL
GVSAPEKM

>MycobacteriumtuberculosisCDC1551
MTPADLAELLKATAAAVLAERGLDASALPQMVTVERPRIPEHGDYASNLAMQLAKKVGTNPRELAGWLAE
ALTKVDGIASAEVAGPGFINMRLETAAQAKVVTSVIDAGHSYGHSLLLAGRKVNLEFVSANPTGPIHIGG
TRWAAVGDALGRLLTTQGADVVREYYFNDHGAQIDRFANSLIAAAKGEPTPQDGYAGSYITNIAEQVLQK
APDALSLPDAELRETFRAIGVDLMFDHIKQSLHEFGTDFDVYTHEDSMHTGGRVENAIARLRETGNIYEK
DGATWLRTSAFGDDKDRVVIKSDGKPAYIAGDLAYYLDKRQRGFDLCIYMLGADHHGYIARLKAAAAAFG
DDPATVEVLIGQMVNLVRDGQPVRMSKRAGTVLTLDDLVEAIGVDAARYSLIRSSVDTAIDIDLALWSSA
SNENPVYYVQYAHARLSALARNAAELALIPDTNHLELLNHDKEGTLLRTLGEFPRVLETAASLREPHRVC
RYLEDLAGDYHRFYDSCRVLPQGDEQPTDLHTARLALCQATRQVIANGLAIIGVTAPERM

>WolinellasuccinogenesDSM1740
MHHTIKHLLETTLGFSVVLEKPKDKNHGHYATPAAFSLAKELKKNPALIAQELSKKLSEIEVFESVQSVG
GYINFRLKQGFLDAQASLALSQGREFGKGDKQGSILLEYVSANPTGPLHIGHARGAVLGDALSRIGRHLG
YALETEYYVNDAGNQIHLLGLSIYLAGRDSLLSLPVTYPEQYYRGEYIVDIAKEALKKWGEKAFADEAFI
PELSLFGKELMLEEIRSNLADTHIHFDHYVSEKSLYPRWEETYALLQSHQGCYEGGGKVWLRSSAHGDEK
DRVIVRESGEPTYLAGDIIYHADKFARPYDRYINIWGADHHGYIARVKAAIEFLGHDSSKLEVLLSQMVT
LLKGGQPYKMSKRAGNFILMRDVLEDIGADALRFIFLSKKPDTHLEFDVDDLNKEDSSNPIFYINYAHAR
IHTMLGKSSLDSQEIEAASLEGLEDSIFDLLFLSLQLPQVLEDSFENRAIQKVAEYLRALAGEFHKFYNE
HKILETPQEAALLKVCKVVALSLSQGLALLGITAKERM

>Halobacteriumsp.NRC-1
MLYNLRQELLAGIRAATSDAGYDYEVDQSAIELEDITDEEKGEFSSPISFSIAAAAGAPPVDVAAAIADA
HRSNGLPAEVEAVTVEGGHINYHADTTDLADATLSTILRDGSEYGTRTDADPDTILADVSSPNIAKPLHV
GHLRNTILSDAVMNILEARGHDVTRDNHLGDWGVQFGNLMHEYTEFGDEATLEDDAIEHLLDLYQQFEQR
DSMLADLEDDETVTDQFADAVTEERDYHADSGKEWFTRLEQGDEDATALWERFRTVSIDRFKQTYDDLDV
AFDVWNGESFYAQEGWNDVIIEKAIENDVAMRGEGESVYIPVYPDDYENVGDPQAADVDASLDRARQMRE
ANDDLEDADFDPFYIVKSDGSTLYGTRDLATIEYRIEEYDADQSVYVVANEQNQYFQQLFVAARKMGYND
IKLKHIDYGLISLPEGSMSTRKGQIITAREVLDRAQDRAEEIIAEKGRIDDAEAQSVATKIALATIKYEM
VAAKRERDTTFDIDESVALEGDTGPYVQYAATRGYSILDGADAAPEIDDLDPSVFNDTDVELLFELARYP
LVLERCEERYDAAPLAHYLLQLAHVFNSFYHKNAVLDAENARTERLLLTKATTQIFDNGLGLLGIDVLEE
M

>MethanosarcinamazeiGo1
MFLELKAQATSILKEAIRKAGFEVEDSELQFETSPHADLASRAAFRLAGIHRQNPKDLASRIVSAVEIPE
GSFIGKVSAAGPYINFFAGKHYLNGTVNAVLKEKEKFGCGAPKDRILLEHTSANPNGPLHVGHIRNSIIG
DTLARILRRAGYDVEVQYYVNDMGRQIAVVSWACERFELDLSRKSDSAIADVYIKANVELDKNPGYIKEI
DALMEKVEAGDVRTIEHFDKAVSLAVAGIKETLLRLNVAHDKFVSESTFLKSGAVHDIVERIKATGRTKT
DKGALVVDLSDYGFEKTLVIQRSNGTSLYTTRDLAYHEWKAGQADRIIDVFGADHKLISGQLRATLNAIG
VKEPEVVIFEFVSLPEGSMSTRRGQFISADDLFDRVTGAAFEQVETRRPETSYEFKKQVAEAVGLGAVRY
DIVRVSPEKSTVFNWKEALDFEKQGAPYIQYSHARACSILEKAKEEAAWNPDKEIDPSLLVEDSEIDLIK
KMAMFDSVIDLGARELKPHVLAIYARELADAFNQFYRFVPVIAAEDENVRAARLALVDCARVVLANSLDT
LGIIAPESM

>ThermoplasmaacidophilumDSM1728
MLLFQDLRKDIYEIVSKRFRISENDVYLDDTGHSDITIRVFRILKSPDGGENAVMEIVRSISEKDYVEKA
LSEGGYINVWIKRTYMLREVLESIEKSGTYPDVFQEAERVSVEHTSANPTGPLHIGRARNSIIGDSIYRI
LSRYGYRTVRQYFVNDSGKQMISLYTAYIKYGGPITIENLLENYQKIYREMEKDQSIEKEIEKNIERYEN
ADPEVFGTLRKIAGVMLDGIASTLKRIGIEFDEFDWESDLLLNGSVRKAIDMLETKEEDSARYIEISGKK
VFLTRKDGTTLYFARDIAYHLFKAENSEWIIDVLGEDHKDHAKSLNHVLKEMLKLENRVSFMYYSFITLE
TGKMSTRRGNIVTLQDLVDRTYDEALKIVNEKRPDLSEEERKKIAEVIASSAVRYSIIRVSAPKPITFRW
EEALNFESNSAPFIMYSHARAASILDKAPEPEQSYGMDMPKEEADLVKAMYVYPYYLKDAAQDLKPDLIA
AYLISLVQKFNDFYGACRVIGTDPLTYARRIRIVKAYKQILSDAGDLIGIKMLDQM

>Treponemapallidumsubsp.pallidumstr.Nichols
MQDLCEMWRHAVARVLSQLQGPAVEPVEGAQLVMEEPPEPGMGDIAFPLFLFAKRVRRSPAQLAQQLCTL
LEEDTSMCAYGTPQARGPYLNVFLNKECVAAHTLDAIFAQGERYGHTQYLQGKRIMVEFSSPNTNKPLHV
GHLRNNAIGESLSRIIAFCGADVFKVNIINDRGVHICKSMCAYQKFAHGKTPAHTGIKSDRFVGDWYVQF
NRYAQQYPEEAEHDVRDLLQRWESADPHVRALWRTMNEWALRGIKQTYERTGISFDKLYFESETYTKGRE
EVRRGLACGVFYQMEDNSIWVDLSSLGLDKKALLRSDGTTMYITQDIGTAIFRAQDWPFDQLLYVVGNEQ
NYHFKVLFFVLRLLGYPWAQQLHHVSYGMVNLPHGRMKSREGTVVDADDILDRLHSAAEEEIAKKGRENA
LKHAQCIAENVAIAALHYFLLQVSPQKDMVFHPEESLSFNGNTGPYLQYMGARISSLLKKVQEDVEQKGP
REVRCDPALLTHEAEWELVKALARFPACVTRAAQGHDPSVITGYLYTLSKSFSRFYHDCPILCEARPDYA
CARLELVRAVRIVLRTAMRLVLIPFLEEM

>BdellovibriobacteriovorusHD100
MIKHDSIRLLATNLLKDAIGRAYPDFSASEDDIYKALVNPPKSDLGDLAFGCFILAKALKTAPPQVATAV
AAQMKGATAVAAGPYINIRFDEQTHGEQVLATILDGSYFKKPLMEKSPKTMIEYSQPNTHKELHVGHMRN
LCLGDAIVRMLRYSGREIVSSTFPGDMGTHVAKCLWYMKKHNQEPVPETEKGEWLGRMYSKANLLLEDQN
GTPQEDINRQELTAILHQLEGKTGPYYDLWLETREWSIELMKKVYAWADVTFDEWYFESEMDSPSAAWVK
QLYAEGKLEMSQGAIGKDLESEKLGFCMLLKSDGTGLYATKDLLLAKHKFEDVKIEKSVYVVDMRQALHF
KQVFRVLEILGFEQAKNCFHLQYNYVELPDGAMSSRKGNIVPLRELVHRMEDHVKTTYLSRYKGEWSEED
VEKIAGQVAKGAIFYGMLRMDTNKKIVFDMNEWLKLDGESGPFVQYSYARISSLGRKFPRTAGAKIDWSR
LNHASERQLMQSLGGFNTAVAAAAENFKPSAICTYLYDLAKSFNVFYHECPIGTEADVATREARLALSEA
VGLTLKNGLAVLGMPAPEKM
Copyright Protected. Copyright / Disclaimer / Privacy / Accessibility / Contact McMaster University A McMaster University Website