SEQ_RENAME
Filters out the metadata for the given sequences, leaving only the species name. The species name can be optionally trucated to a set length.
Enter inputs and click "Run Program" to get started.
This program filters out the metadata for each of the sequences in the given file,
leaving only a truncated version of the species name.
If the "Maximum species name length" parameter is omitted, the entire species name will be included. Otherwise it is truncated to the given integer value.
The program filters only lines in the file which match raw sequence metadata. All other lines (including the sequence data itself) are ignored.
If the "Maximum species name length" parameter is omitted, the entire species name will be included. Otherwise it is truncated to the given integer value.
The program filters only lines in the file which match raw sequence metadata. All other lines (including the sequence data itself) are ignored.
View sample input/output files. All files should be in a plain-text format (.txt, .csv, .xml, etc.).
Download files related to this app.
Program Results
Example Input File
>gi|16129828|ref|NP_416390.1| arginine tRNA synthetase [Escherichia coli K12] MNIQALLSEKVRQAMIAAGAPADCEPQVRQSAKVQFGDYQANGMMAVAKKLGMAPRQLAEQVLTHLDLNG IASKVEIAGPGFINIFLDPAFLAEHVQQALASDRLGVATPEKQTIVVDYSAPNVAKEMHVGHLRSTIIGD AAVRTLEFLGHKVIRANHVGDWGTQFGMLIAWLEKQQQENAGEMELADLEGFYRDAKKHYDEDEEFAERA RNYVVKLQSGDEYFREMWRKLVDITMTQNQITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVE SEGATVVFLDEFKNKEGEPMGVIIQKKDGGYLYTTTDIACAKYRYETLHADRVLYYIDSRQHQHLMQAWA IVRKAGYVPESVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALERARRLVAEKNPDMPADELEKL ANAVGIGAVKYADLSKNRTTDYIFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKAEIDEEQLAAAPVIIRE DREAQLAARLLQFEETLTVVAREGTPHVMCAYLYDLAGLFSGFYEHCPILSAENEEVRNSRLKLAQLTAK TLKLGLDTLGIETVERM >gi|16273477|ref|NP_439728.1| arginyl-tRNA synthetase [Haemophilus influenzae Rd KW20] MNIQSILSDKIKQAMILAGADQSCDALIRQSGKPQFGDYQANGIMAAAKKLGLNPREFAQKVLDNLQLSD IAEKLEIAGPGFINIFLNPTWLTTEISAALSHKNLGIQATNKQTVVIDYSSPNVAKEMHVGHLRSTIIGD AVARTLEFLGHNVIRANHVGDWGTQFGMLIAYLEKMQNEHASEMELQDLEAFYREAKKHYDEDEVFAEKA RNYVVKLQSGDEYCRTMWKRLVDITMQQNQHNYARLNVTLTEKDVMGESLYNPMLPSIVKDLKKQGLAVE NDGALVVYLDEFKNKDGDPMGVIVQKKDGGFLYTTTDIAAAKYRYETLKANRALVFSDTRQSQHMQQAWL ITRKAGYVPDSFSLEHKNFGMMLGKDGKPFKTRTGGTVKLADLLDEAIERATVLINEKNTNLSNDEKEAV IEAVGIGAVKYADLSKNRTTDYVFDWDNMLSFEGNTAPYMQYAYTRIRSIFNKTDINSTALLAAPLTIKD DKERTLAIKLLQFEEAVQTVGKEGTPHVLCAYLYELAGIFSSFYEHCPILNAEDESIKLSRLKLALLTEK TLKQGLTLLGIKTVEKM >gi|9656621|gb|AAF95220.1| arginyl-tRNA synthetase [Vibrio cholerae O1 biovar eltor str. N16961] MICLTSKMMAFLPYFLELKGKRVNIQALINDRVSQAIEAAGAPAGTPALVRQSAKAQFGDYQANGIMGAA KQLGTNPREFAQKVLDVLNLEGIASKTEIAGPGFINIFLSEEFLAAQAEAALADARLGVAQEAPKTIVAD YSAPNVAKEMHVGHLRSTIIGDAVVRTLEFLGHKVIRANHIGDWGTQFGMLIANLERVQKASGEVSMELS DLEAFYRESKKLYDEDEQFAETARNYVVKLQGGDPFCLEMWKKLVDVTMIQNQRNYDRLNVSLTRENVMG ESMYNDMLPQIVSDLKAKGLAVEDDGAQVVFLEEFKNKDGEPMGVIIQKRDGGFLYTTTDIACAKYRYET LGADRVLYFIDSRQHQHLMQAWTIVRKAGYIPENVSLEHHAFGMMLGKDGRPFKTRAGGTVRLADLLDEA QERAKALIESKNPELSAEEKANIANTVAMAAVKYADLSKHRTTDYVFDWDNMLAFEGNTAPYMQYAYTRV ASVFAKAGVDMNELTGHIQITEEKEKALIAKLLQFEEAVQSVAREGQPHIMCSYLFELAGIFSSFYEACP ILVAEQESIKQSRLKLAALTAKTIKQGLALLGIDTLERM >gi|15677359|ref|NP_274514.1| arginyl-tRNA synthetase [Neisseria meningitidis MC58] MNLHQTVEHEAAAAFAAAGIADSPIVLQPTKNAEHGDFQINGVMGAAKKAKQNPRELAQKVAEALADNAV IESAEVAGPGFINLRLRPEFLAQNIQTALNDARFGVAKTDKPQTVVIDYSSPNLAKEMHVGHLRSSIIGD SISRVLAFMGNTVIRQNHVGDWGTQFGMLVAYLVEQQKDNAAFELADLEQFYRAAKVRFDEDPAFADTAR EYVVKLQGGDETVLALWKQFVDISLSHAQAVYDTLGLKLRPEDVAGESKYNDDLQPVVDDLVQKGLAVED DGAKVVFLDEFKNKEGEPAAFIVQKQGGGFLYASTDLACLRYRIGRLKADRLLYVVDHRQALHFEQLFTT SRKAGYLPENVGAAFIGFGTMMGKDGKPFKTRSGDTVKLVDLLTEAVERATALVKEKNPELGADEAAKIG KTVGIGAVKYADLSKNRTSDYVFDWDAMLSFEGNTAPYLQYAYTRVQSVFRKAGEWDANAPTVLTEPLEK QLAAELLKFEDVLQSVADTAYPHYLAAYLYQIATLFSRFYEACPILKAEGASRNSRLQLAKLTGDTLKQG LDLLGIDVLDVM >gi|15600244|ref|NP_253738.1| arginyl-tRNA synthetase [Pseudomonas aeruginosa PAO1] MKDTIRQLIQQALDQLTADGTLPAGLTPDIQVENTKDRSHGDFASNIAMMLAKPAGMKPRDLAARLVEAI PAHEQLAKVEIAGPGFLNFFQDHVWLAASLDRALADERLGVRKAGPAQRVVIDLSSPNLAKEMHVGHLRS TIIGDAVARVLEFLGDTVIRQNHVGDWGTQFGMLLAYLEEQPVDAEAELHDLEVFYRAAKKRFDESPEFA DRARELVVKLQAGDPDCLRLWTRFNEISLSHCQKVYDRLGVKLSMADVMGESAYNDDLAQVVADLTAKGL LTEDNGALCVFLEEFKNAEGNPLPVIVQKAGGGYLYATTDLAAMRYRHNVLHADRVLYFVDQRQALHFQQ VFEVARRAGFVPAGMELEHMGFGTMNGADGRPFKTRDGGTVKLIDLLEEAESRAYALVKERNEQRAERGE EPFDEVQLREIGRVVGIDSVKYADLSKHRTSDYSFNFELMLSFEGNTAPYLLYACTRVASVFRKLGQGRE QLGGKIVLEQPQELALAAQLAQFGDLINNVALKGVPHLLCAYLYELAGLFSSFYEHCPILTAEDPAQKDS RLRLAALTGRTLEQGLELLGLKTLERM >gi|22298368|ref|NP_681615.1| arginyl-tRNA-synthetase [Thermosynechococcus elongatus BP-1] MVAPIKILGDRLRRALQAALPLDTYPQPLLVPASQVKFGDYQSNVCLSLAKQLGKAPRELAQEVVPHLEV EDLCQPVEIAGPGFLNFRLKPEFLAATLQAARGSDRLGIPPAREPRRVVVDFSSPNIAKEMHVGHLRSTI IGDCIARILEFQGHTVLRLNHVGDWGTQFGMLIAYLDEVYPDALTTANALDLGDLVTFYKKAKQRFDSDP EFQQKARAKVVALQQGEEQSRRAWQLLCEQSRREFQKIYDLLDIQLTERGESFYNPFLPAVIEDLAACGL LVEDQGAKVVFLEGFTNKEGQPQPLIIQKSDGGYNYATTDLAALRYRIDKDQADWIIYVTDVGQSTHFAQ VFQVAQRAGWVPPHVTLTHVPFGLVLGEDGKRLKTRSGETIRLIDLLTEAIARSRADLEQRLATEGRTES PEFIDTVARAIGIGAVKYADLSQNRNSNYVFSYDKMLSLQGNTAPYLLYAYVRVQGLTRRGDIDWCTLSP DSPLLLEDETEQHLAKHLVQLEETLDLVSTELLPNRLCQYLFELSQLFNQFYDRCPILSAPQPTKQSRLT LAYLTAQTLKLGLSLLGIPVLDRI >gi|17231209|ref|NP_487757.1| arginyl-tRNA-synthetase [Nostoc sp. PCC 7120] MNATQEQLKIKLEQALVAAFGDEYAGVDPILVSASNPKFGDYQANVALSLSKKLGQQPRAIASAIVEKLD VSEICEKPEIAGPGFINLKLKTAYLEAQLNTIQADTRLGVPTAKHPQREIVDFSSPNIAKEMHVGHLRST IIGDSIARILEFRGHDVLRLNHVGDWGTQFGMLITYLREVSPEALTTANALDIGDLVSFYRQAKQRFDAD EAFQETARQEVVRLQAGAADTLHAWKLLCEQSRQEFQVIYDLLDVKLTERGESFYNPLLPTVVENLEKSG LLVENQGAKCVFLDGFTNREGEPLPLIVQKSDGGYNYATTDLAALRYRIQKDEAKRIIYITDAGQANHFA QFFQVARKAGWIPDDVELVHVPFGLVLGEDGKKFKTRSGDTVRLRDLLDEAISRAHADVEVRLKAEEREE TAEFIDKVAEVVGISAVKYADLSQNRTSNYIFSYDKMLDLKGNTAPYMLYAYARIQGISRKGEINFADLG DNAKVILQHETEFALAKYLLQLGEVISTVEEDLSPNRLCEYLYELSKRFNAFYDRNQGVQVLSAEEPLRT SRLVLCDLTARTLKLGLSLLGIQVLERM >gi|33864335|ref|NP_895895.1| Arginyl-tRNA synthetase [Prochlorococcus marinus str. MIT 9313] MQAHFASEFMLSLAHALESQLRAAIDRAFPEAAASARESGTGLDPQLAPASKPEFGDFQANAALPLAKPL KQPPRQIAAAIVDQLMVDTAFTAICLTPEIAGPGFINLTVRPECLAAEVQARLADARLGVPLVEGDNDGQ QPTPVVVDFSSPNIAKEMHVGHLRSTIIGDSLARVLEFRGHPVLRLNHVGDWGTQFGMLITHLKQVAPEA LETADAVDLGDLVVFYRQAKQRFDDDEAFQTTSREEVVKLQGGDPISLKAWSLLCDQSRREFQKIYDRLD VRLNERGESFYNAYLESVVEDLNVSGLLVSDDGAQCVFLEGVTGKDGKPLPVIVQKSDGGFNYATTDLAA MRYRFAAPPQGDGARRVIYVTDAGQANHFAGVFQVAQRAGWIPDAGRLQHVPFGLVQGEDGKKLKTRAGD TVRLRELLDEAVERAESDLRRRLQEEGRDEDESFIEQVATTVGLAAVKYADLSQNRITNYQFSFDRMLAL QGNTAPYLLYAVVRIAGIARKGGDLDVTTAELQFSETQEWALVRELLKFDAVIAEVEEELLPNRLCTYLF ELSQVFNRFYDQVPVLKAEQPSRSCRLALCRLTADTLKLGLSLLGIPTLERM >gi|15605181|ref|NP_219967.1| Arginyl tRNA Transferase [Chlamydia trachomatis D/UW-3/CX] MTTLLSFLTSLCSAAIHQAFPELEELTLDITPSTKEHFGHYQCNDAMKLARVLHKSPRAIAESIVAHIPP TPFSSIEIAGAGFINFTFSKEFLASQLQTFSKELANGFRAASPQKVIIDFSSPNIAKDMHVGHLRSTIIG DCLARCFSFVGHDVLRLNHIGDWGTAFGMLITYLQETSQEAIHQLEDLTALYKKAHARFAEDSEFKKRSQ HNVVALQSGDAQALALWKQICSVSEKSFQTIYSILDVELHTRGESFYNPFLAEVVADLESKNLVTLSDGA KCVFHEAFSIPLMIQKSDGGYNYATTDVAAMRYRIQQDQADRILIVTDSGQSLHFQLLEATCLAAGYLPS KGIFSHVGFGLVLDTQGRKFKTRSGENIKLRELLDTAVEKAKESLKAHRPDISEEELAYQGPILGINAIK YADLSSHRINDYVFSFEKMLRFEGNTAMSLLYAYVRIQGIKRRMGLESPPQEGPLAVHEPAEEALALTLL RFPEILDLTLRELCPHFLTDYLYALTNKFNAFFRDCHIEGSDSQQERLYLCGLTERTLSTGMHLLGLKTL NHL >gi|15836101|ref|NP_300625.1| arginyl tRNA transferase [Chlamydophila pneumoniae J138] MSTLLSILSVICSQAIAKAFPNLEDWAPEITPSTKEHFGHYQCNDAMKLARVLKKAPRAIAEAIVAELPQ EPFSLIEIAGAGFINFTFSPVFLNQQLEHFKDALKLGFQVSQPKKIIIDFSSPNIAKDMHVGHLRSTIIG DSLARIFSYVGHDVLRLNHIGDWGTAFGMLITYLQENPCDYSDLEDLTSLYKKAYVCFTNDEEFKKRSQQ NVVALQAKDPQAIAIWEKICETSEKAFQKIYDILDIVVEKRGESFYNPFLPEIIEDLEKKGLLTVSNDAK CVFHEAFSIPFMVQKSDGGYNYATTDLAAMRYRIEEDHADKIIIVTDLGQSLHFQLLEATAIAAGYLQPG IFSHVGFGLVLDPQGKKLKTRSGENVKLRELLDTAIEKAEEALREHRPELTDEAIQERAPVIGINAIKYS DLSSHRTSDYVFSFEKMLRFEGNTAMFLLYAYVRIQGIKRRLGISQLSLEGPPEIQEPAEELLALTLLRF PEALESTIKELCPHFLTDYLYNLTHKFNGFFRDSHIQDSPYAKSRLFLCALAEQVLATGMHLLGLKTLER L >gi|21221735|ref|NP_627514.1| putative arginyl-tRNA synthetase [Streptomyces coelicolor A3(2)] MASVTSLSDSVQQHLASALTATRPEAAGADPLLRRSDRADYQANGILALAKKTKANPRELAAEVVARITT GDELIEDVEVSGPGFLNITVADRAITANLAARLADGERLGVPLKQDAGTTVVDYAQPNVAKEMHVGHLRS AVIGDALRSMLDFTGEKTIGRHHIGDWGTQFGMLIQYLFEHPGELAPAGDIDGEQAMSNLNRVYKASRAV FDTDEEFKERARRRVVALQSGDKETLDLWQQFVDESKVYFYSVFEKLDMEIRDEEIVGESAYNDGMPETA RLLEEMGVAVRSEGALVVFFDEIRGKDDQPVPLIVQKADGGFGYAASDLTAIRNRVQDLHATTLLYVVDV RQSLHFRMVFETARRAGWLGDEVTAHNMGYGTVLGADGKPFKTRAGETVRLEDLLDEAVQRAAEVVREKA RDLTEDEIQERAAQVGIGAVKYADLSTSPNRDYKFDLDQMVSLNGDTSVYLQYAYARIQSILRKAGEVRP AAHPELALHEAERALGLHLDAFGPTVFEAAAEYAPHKLAAYLYQLASLYTTFYDKCPVLKAETPEQVENR LFLCDLTARTLHRGMALLGIRTPERL >gi|32473878|ref|NP_866872.1| arginyl-tRNA synthetase [Rhodopirellula baltica SH 1] MHLPNVLQARFVQALEPLTDSPSDYAGMIRPAADPKFGDYQSNAAMPLAKRVGKTSRDVAAELVQNLNVT DLFEEPEVAGPGFINLRLKDSVLFDSIQQMLLDERVGVSKTTDPKKVIVDFSSPNVAKPMHVGHIRSTVI GDCLARTLRFYGEDVVTDNHLGDWGTQFGIIIYGYRNFGDPAKVAANPVPELSALYRLTNQLIEYQKAKQ SLATMADKLATAKSDAKTAKEVSDQSESDENLKPKDKKKLRKNAEAATRRVASIEADMKSLKAKIDAVDS DTELSKLASEHSDVDVAVLRETAKLHEGDPENLALWKEFLPHCQDEINRIYDRLNVQFDHTLGESFYHDR LAGVVDHLTTLGLTTKSDGAICVFLEGFDSPMIIQKRDGAFLYATTDLATLQYRRDEFQPDEILYVVDSR QGEHFKKFFAMAEPLGMAEVQLVHVNFGTVLGPDGRPMKTRSGSLIGLESLLNDAVSRAKEVVCNPDRLA TMDPPMGGEEQQQIAEIVGIGAIKYADLSHHRTSDYKFDVDKMVALEGNTATYVQYSYARTQSILRRASD GEGLPAFEQAIEQAAATQPMTFTHPNERSLALMLMRFEEAIEQVRLNYAPNALCDYLFETAKTYSSFNES CRVLGNDDPAVMQTRLALVVLTGRVLKKGLSLLGIDVAERM >gi|34762844|ref|ZP_00143829.1| Arginyl-tRNA synthetase [Fusobacterium nucleatum subsp. vincentii ATCC 49256] MKITSKELTDIFQKHVESLFPNKELKPVEITVATNENFGDYQCNFAMINSKIIGDNPRKIAEEIKNNFSC GDVVEKLEVAGPGFINIFLSDKYISNSIKKIGENYDFSFLNRKGKVIIDFSSPNIAKRMHIGHLRSTIIG EAVCRIYKFLGYDVVADNHIGDWGTQFGKLIVGYRKWLNREAYEKNAIEELERVYVKFSDEAEKDPSLED LARAELKKVQDGEEENTKLWKEFITESLKEYNKLYKRLDVHFDTYYGESFYNDMMGDVVKELVDKKIAVD DDGAKVVFFDEKDNLFPCIVQKKDGAYLYSTSDIATVKFRKNTYDVNRMIYLTDARQQDHFKQFFKITDM LGWNIEKYHIWFGIIRFADGILSTRKGNVIKLEELLDEAHSRAYDVVNEKNPNLSEEEKQNIAEVVGVSS VKYADLSQNKQSDIIFEWDKMLSFEGNTAPYLLYTYARIQSILRKVTEQNIDLNKNIEIKTDNKFEKSLA TYLLVFPISVLKAAETFKPNLIADYLYELSKKLNSFYNNCPILNQDIETLKSRALLIKKTGEVLKEGLGL LGIPVLNKM >gi|16127589|ref|NP_422153.1| arginyl-tRNA synthetase [Caulobacter crescentus CB15] MNDLKRSLSEAAAAAFQAAGLPPEFGRVTASDRPDLADFQCNGALAAAKSAKRNPREIAVQVVDILKGDP RLASVEIAGVGFINMRVSDEALSARAREIASDDRTGAQLLETPRRVLIDYAGPNVAKPMHVGHLRASIIG ESVKRLYRFRGDDVVGDAHFGDWGFQMGLLISAIMDEDPFINALMEKLPEAPRGFSSADEAKVMAEFEKR ITLADLDRIYPAASVRQKEDPAFKERARKATAELQNGRFGYRLLWRHFVNVSRVALEREFHALGVDFDLW KGESDVNDLIEPMVLQLEAKGLLVQDQGARIVRVAREGDKRDVPPLLVVSSEGSAMYGTTDLATILDRRK SFDPHLILYCVDQRQADHFETVFRAAYLAGYAEEGALEHIGFGTMNGADGKPFKTRAGGVLKLHDLIEMA REKARERLREAGLGAELSEEQFEDTAHKVGVAALKFADLQNFRGTSYVFDLDRFTSFEGKTGPYLLYQSV RIKSVLRRAAESGAVAGRVEIHEPAERDLAMLLDAFEGALQEAYDKKAPNFVAEHAYKLAQSFSKFYAAC PIMSADTETLRASRLTLAETTLRQLELALDLLGIEAPERM >gi|15903931|ref|NP_359481.1| Arginyl-tRNA synthetase(arginine--tRNA ligase) (ARGRS) [Streptococcus pneumoniae R6] MNTKELIASELSSIIDSLDQEAILKLLETPKNSEMGDIAFPAFSLAKVERKAPQMIAAELAEKMNSQAFE KVVATGPYVNFFLDKSAISAQVLQAVTTEKEHYADQNIGKQENVVIDMSSPNIAKPFSIGHLRSTVIGDS LSHIFQKIGYQTVKVNHLGDWGKQFGMLIVAYKKWGDEEAVKAHPIDELLKLYVRINAEAENDPSLDEEA REWFRKLENGDEEALALWQWFRDESLVEFNRLYNELKVEFDSYNGEAFYNDKMDAVVDILSEKGLLLESE GAQVVNLEKYGIEHPALIKKSDGATLYITRDLAAALYRKNEYQFAKSIYVVGQEQSAHFKQLKAVLQEMG YDWSDDITHVPFGLVTKEGKKLSTRKGNVILLEPTVAEAVSRAKVQIEAKNPELENKDQVAHAVGVGAIK FYDLKTDRTNGYDFDLEAMVSFEGETGPYVQYAYARIQSILRKADFKPETAGNYSLNDTESWEIIKLIQD FPRIINRAADNFEPSIIAKFAISLAQSFNKYYAHTRILDESPERDSRLALSYATAVVLKEALRLLGVEAP EKM >gi|18310643|ref|NP_562577.1| arginine-tRNA ligase [Clostridium perfringens str. 13] MDYKKLVAERIKEHVDLELENIEKLIEIPPKPEMGDFAFPCFQLAKVMRKAPNMIAAELAEKINKEGFER VECLGPYLNFFVDKVAFSKNIISKVLEEGDKYGSSKIGEGKNVVVEYSSPNIAKPFHVGHLFTTAIGHSL YRMLNFEGYNPIRINHLGDWGTQFGKLISAYKRWGNEEALEEAPINELLRIYVKFHDEAENNPELEDEGR MYFKKLEDGDQEAVALWERFKDLSLKEFNKIYDMLGVDFDSWAGESFYNDKMDKVVEELEKANILTESNG AKVVMLDEYNMPPCIVVKSDGASIYATRDLAAASYRHKTYNFDKCIYVVGKDQILHFNQVFKTLELAGNE WAKNCVHIPFGLVKFADRKLSTRKGNVVLLEDLLNEAIDKTRETIEEKNPQLENKEEVAKKIGIGAILFT YLKNSRERDIVFDWKEMLSFDGETGPYVQYSYARAKSILRKAEEQKITAEPDFTKLTSKEEFELAKTLEG LQKAVILGIDKLEPSVVTRYSIEVAKAFNKFYNNHTVLNVEDEGLKAARLELIKATAQVIKNALFLIGID VVEKM >gi|15926285|ref|NP_373818.1| arginyl-tRNA synthetase [Staphylococcus aureus subsp. aureus N315] MNIIDQVKQTLVEEIAASINKAGLADEIPDIKIEVPKDTKNGDYATNIAMVLTKIAKRNPREIAQAIVDN LDTEKAHVKQIDIAGPGFINFYLDNQYLTAIIPEAIEKGDQFGHVNESKGQNVLLEYVSANPTGDLHIGH ARNAAVGDALANILTAAGYNVTREYYINDAGNQITNLARSIETRFFEALGDNSYSMPEDGYNGKDIIEIG KDLAEKHPEIKDYSEEARLKEFRKLGVEYEMAKLKNDLAEFNTHFDNWFSETSLYEKGEILEVLAKMKEL GYTYEADGATWLRTTDFKDDKDRVLIKNDGTYTYFLPDIAYHFDKVKRGNDILIDLFGADHHGYINRLKA SLETFGVDSNRLEIQIMQMVRLMENGKEVKMSKRTGNAITLREIMDEVGVDAARYFLTMRSPDSHFDFDM ELAKEQSQDNPVYYAQYAHARICSILKQAKEQGIEVTAANDFTTITNEKAIELLKKVADFEPTIESAAEH RSAHRITNYIQDLAAHFHKFYNAEKVLTDDIEKTKAHVAMIEAVRITLKNALAMVGVSAPESM >gi|39649773|emb|CAE28295.1| arginyl-tRNA synthetase [Rhodopseudomonas palustris CGA009] MAELPMSTHLFARLLSRVHAVCAALIEEGALPAGIDLSRVVVEPPKDASHGDMATNAAMVLAKDAKAKPR DLADKIADKLRAEELIDQVAIAGPGFINLTLKPAVWAEALRAVLDAGAGYGRSTVGGGEKVNVEYVSANP TGPMHVGHCRGAVFGDALANLLDTAGYDVTREYYINDAGAQVDVLARSAFLRYREALGETIGEIPEGLYP GDYLKPVGEALKAEHGAALKDMPEAQWLPTVRATAIAMMMEAIKGDLAALNITHEVFFSERSLIEGGRNR VAETIEFLRAKGDVYQGRLPPPKGAPVEDYEDREQTLFRATAYGDDVDRPLLKSDGSYTYFASDIAYHKV KFDAGFANMVDVWGADHGGYIKRMQAAIQAVTAGKGALDVKIVQLVRLLRNGEPVKMSKRAGDFVTLREV VDEVGSDAVRFMMLFRKNDAVLDFDLAKVIEQSKDNPVFYVQYGHARGHSIFRNAREVVPDLPEDSKARA AMLRQAPLERLNDPAELELLKRLALYPRIVEAAAQAHEPHRIAFYLNELASEFHALWTHGRDLPHLRFII NNDAEITRARLAMVQGVVSVLASGLAILGVTAPDEMR >gi|16801767|ref|NP_472035.1| arginyl tRNA synthetase [Listeria innocua Clip11262] MNVMQENQIKLIEHIKQAVVQAVGLEETEVPEILLEVPKDKKHGDYSTNIAMQLARVAKKAPRQIAESIV PELKKDTKLIKEVEIAGPGFINFYLDNAYLTDLVPVILTEDKKYGESDFGKGEKFQIEFVSANPTGDLHL GHARGAAIGDSLANIMKMAGFDVSREYYINDAGNQINNLVLSAEARYFEALGLESEFPEDGYRGSDIIAL GKDLAAKYGDKYVNASEEERRSVFRVDALAFETGKLRADLEEFRVSFDEWFSETSLYEENKVLPALERLR ENGYIYEQDGATWLRTTDFEDDKDRVLIKSDGSYTYFLPDIAYHLNKLERGFDVLIDIWGADHHGYIPRM RAAIEALGYSPNQLEVEIIQLVHLFEDGVQVKMSKRTGKSVTMRDLIEEVGLDATRYFFAMRSSDTHMNF DMSLAKSTSNDNPVYYVQYAHARISSILRSGKEQGLEVSKDANMSLLETEAEYDLLKVLGEFADVVAEAA VKRAPHRIVRYLNDLATAFHRFYNSNKVLDMDNLEVTKARLALIKTAQITLRNGLTLLGVSAPEKM >gi|17545006|ref|NP_518408.1| PROBABLE ARGINYL-TRNA SYNTHETASE (ARGININE--TRNA LIGASE) PROTEIN [Ralstonia solanacearum GMI1000] MLPSHKQTISQLLSDAVGTLLPEGTNRPEIVLERPKQAAHGDIACNVALQLAKPLGTNPRELANRIADGI RADARGQRLVSAVEIAGPGFINLRLSPTARTDVLAAVFAEGDRYGAADLHDGAPVLVEFVSANPTGPLHV GHGRQAALGDALAALLEWQGHKVHREFYYNDAGVQIHNLAVSVQARARGFKPGDTGWPEAAYNGDYIADI AADYLAGKTVRASDGEPVTGARDVENIEAIRRFAVTYLRNEQDIDLQAFGVKFDHYYLESSLYADGKVQQ TVDALIAAGKTYEQEGALWLRTTDDGDDKDRVMRKSDGSYTYFVPDVAYHTTKWGRGFTQVINVQGSDHH GTIARVRAGLQGLDLGIPKGYPDYVLHKMVTVMKDGAEVKISKRAGSYVTVRDLIEWSNGDAESEAGVDT IRACVESGAPNWPGRFTRGRDAVRFFLLSRKADTEFVFDVDLALKQSDENPVYYVQYAHARICSVFEQWH AREGGDAASLAGADLAAVAGPEASPQAVALVQRIAAFPDMLADAARELAPHAVAFYLRDLAGDFHAFYNA DRVLVDDDAVKRARLALLAATRQVLRNGLAVIGVSAPQKM >gi|23336003|ref|ZP_00121233.1| COG0018: Arginyl-tRNA synthetase [Bifidobacterium longum DJO10A] MSPEALSELISSIAHNLVAAGQAGALTDELIPPVDKLAVMRPKDRAHGDWASNIAMQLAKKAGMKPRDLA EPFAAALAEADGIAKVEVAGPGFINITLDSASAAAVVDTVLAAGAMTDTDKHLNKVNEYGRNAHLGGQTL NLEFVSANPTGPIHIGGTRWAAVGDAMARVLEANGAKVVREYYFNDHGEQINRFAKSLVAAWAEANNLGE AGYQTETPCDGYKGAYINEIAARVQAEAESDGVDLTALAHQDQGLNDDGEPLGEADTEVREEFRKRAVPM MFDEIQKSMKDFRVNFDVWFHENSLYADGKVDAAIEELKSRGDIFDKDGATWFESTKHGDDKDRVIIKSN GEFAYFAADIAYYWDKRHRAENPADVAIYMLGADHHGYIGRMMAMCAAFGDEPGKNMQILIGQLVNVMKD GKPVRMSKRAGNVVTIDDLVSVVGVDAARYSLARSDYNQNFDIDLALLASHTNDNPVYYVQYAHARSKNV DRNAAVAGISYEGADLALLDTEADGEVLAALAQFPSVLATAADDRQPHKVARYLEELAATYHKWYNVERV VPMALTDPETRGDDEARKALEIAKNPEPARAAARLKLNDAVQQVIANGLDLLGVTAPEKM >gi|15889018|ref|NP_354699.1| AGR_C_3144p [Agrobacterium tumefaciens str. C58] MNIFADFDTRIKNALETLDLVKENREKVDFSRITVESPRDLSHGDVATNAAMVLAKPLGTNPRALAELLV PALQADGDVDGVNVAGPGFINLKVSVGYWQRLLADMIGQGVDFGRSTVGAGQKINVEYVSANPTGPMHVG HCRGAVVGDTLANLLAFAGYGVTKEYYINDAGSQIDVLARSVFLRYREALGEDIGSIPSGLYPGDYLVPV GQALADEYGIKLRAMPEEKWLPIVKDKAIDAMMVMIREDLALLNVRHDVFFSERTLHEGNGGPILSAIND LTFKGHVYKGTLPPPKGELPDDWEDREQTLFRSTEVGDDMDRALMKSDGSYTYFAADVAYFKNKFDRGFS EMIYVLGADHGGYVKRLEAVARAVSEGKSKLTVLLCQLVKLFRDGEPVKMSKRSGDFVTLRDVVDEVGRD PVRFMMLYRKNSEPLDFDFAKVTEQSKDNPVFYVQYAHARCKSIFRQAQEAFPGLAPSAEDMAASVALIS DINELQLVAKLAEYPRLIESAALSHEPHRLAFYLYDLAGSFHGHWNKGKDHQELRFINDKNRELSIARLG LVNAVANVLKSGLTLLGADAPDEMR >gi|17987372|ref|NP_540006.1| ARGINYL-TRNA SYNTHETASE [Brucella melitensis 16M] MNIFADFDARIKKTLQDIDLKPKDGGELDLSRIGVEPPRDASHGDIATNAAMVLSKAVGQNPRELAARIA EALKADEDVESVDVAGPGFINLRLKASYWQRELLVMLNEGTDFGRSRLGAGKKVNVEYVSANPTGPMHVG HCRGAVVGDVLANLLKFAGYDVVKEYYINDAGAQIDVLARSVMLRYREALGESIGEIPAGLYPGDYLVRV GQELAGEFGTKLLEMPEAEALAIVKDRTIDAMMAMIRADLDALNVHHDVFYSERKLHVDHARAIRNAIND LTLKGHVYKGKLPPPKGQLPEDWEDREQTLFRSTEVGDDIDRPLMKSDGSFTYFAGDVAYFKDKYDHGFN EMIYVLGADHGGYVKRLEAVARAVSDGKAKLTVLLCQLVKLFRNGEPVRMSKRAGEFITLRDVVDEVGRD PVRFMMLYRKNDAPLDFDFAKVTEQSKDNPVFYVQYASARCHSVFRQAADQLGLVDLDRVAMGSHFEKLT DESEIALVRKLAEYPRLIESAAIHQEPHRLAFYLYDLASSFHSQWNRGAENPDLRFIKVNDPDLSLARLG LVQVVSDVLTSGLTIIGADAPTEMR >gi|34540109|ref|NP_904588.1| arginyl-tRNA synthetase [Porphyromonas gingivalis W83] MSILQKLENSAAAAVKALYGTDPMEGQIQLQKTKREFKGHLTLVVFPFVKMSRKSPEATATEIGEWLLAN ESAVSAIEVVKGFLNLTIAPRVWLELLNEIRADINFGHKVATEDSPLVMVEYSSPNTNKPLHLGHVRNNL LGYSLSEIMKANGYRVVKTNIVNDRGIHICKSMLAWQKWGDGVTPEKAGKKGDHLIGDFYVLFDKHYKAE LNSLMAEGKSKEEAEAASTLMAEAREMLRLWEAGDEKVVDLWRTMNQWVYDGFDATYKMMGVDFDKIYYE SETYLVGKEEVLRGLEEGLFVKHSDGSVWADLTKDGLDEKLLLRADGTSVYMTQDIGTAKMRFNDYPINR MIYVVGNEQNYHFQVLSILLDRLGFEFGKGLVHFSYGMVELPEGKMKSREGTVVDADDLMDEMIRTAAEI AAEAGKAAEMDEEESREVARIVGLGSLKYFILKVDPRKNMTFNPKESIDFNGNTGSFVQYTYARIRSLMR RAEAAGYDIPSQLPTDLPLSEKEEALIQKVSEYAEVVSEAGHSYSPALIANYIYDLVKEYNQFYHDFSVL KEEDERIRAFRLALSEVVALTMRKGFALLGIEMPERM >gi|15603944|ref|NP_220459.1| ARGINYL-TRNA SYNTHETASE (argS) [Rickettsia prowazekii str. Madrid E] MNIFNQLKQDIIAASQKLYNNKEIANTATIETPKDSFNGDLSSNIAMIIASKESIAPREVALKFKEVLVT LPYIASIEIAGPGFINFTIKAESWQAAIKDILQHEEKFFEIDIDKNSNINIEYVSANPTGPMHIGHARGA VYGDVLARILQKVGYSVTKEYYVNDAGSQINDLVSTVLLRYKEALGEPITIPVGLYPGEYLIPLGEILSK EYGNKLLTMNDVERFKIIKSFAVEKMLDLNRKDLADLGIKHDVFFSEQSLYDKGEIEKTVKLLERMGLIY EGTLPAPKGKVHEDWEYRVQKLFKSTNYGDSQDRPIEKADGSWSYFASDLAYAKDKIDRGANHLIYVLGA DHSGYVKRIEAIVKALGQEKVKVDVKICQLVNFVENGVPIKMSKRLGSFASVQDVNKEVGKDIIRFMMLT RQNDKPLDFDLVKVKEQSRENPIFYVQYAHVRTKSILSKARELMPEAYNSFKEGKYNLSLLSSEEEIEII KLLAAWTKTLEASVKYFEPHRIAFYLINLASKFHSMWNFGKENSDYRFIIENNKELTLARLALASVIQKI IASGLEVIGVEPMVTM >gi|33591373|ref|NP_879017.1| arginyl-tRNA synthetase [Bordetella pertussis Tohama I] MRGHLRQTPGRPPGGSARPAARQTCRRHLPFCRLPMLLEQQKQLISLIQAAVAQCLPEAQAQVQLERPKV AAHGDIATNVAMQLAKPARRNPRELAQGIVDALMAQPQARELIQDAEIAGPGFINFRLTPAARQAVVQAV ASQADAYGRAPRNGEKVLVEFVSANPTGPLHVGHARQAALGDAICRLYDASGWDVTREFYYNDAGNQIDN LAISVQARGRGIAPDAPDYPADGYKGDYIVEIARDFAARKSVQASDGQPVTATGDLDSLDDIRAFAVAYL RREQDLDLQAFGLAFDNYFLESSLYASGRVQETVDTLVAKGHTYEEGGALWLRTTELGTGDDKDRVMRKS EGGYTYFVPDVAYHKVKWERGFHHAVNIQGSDHHGTVARVRAGLQGLAGIPKDFPAYVLHKMVKVMRGGE EVKISKRAGSYVTMRDLIDWVGRDAVRYFLIQRRADTEFVFDIDLALSKSDENPVYYIQYAHARICTMIG NSGASAAEIAQADTALLTAPSEYALLQRLAEFPQVVALAAQELAPHHVAFWLRDCASDFHAWYNAERVLV DEPALKLARLRLAATTRQVLANGLALLGVSAPDRM >gi|21672856|ref|NP_660921.1| arginyl-tRNA synthetase [Chlorobium tepidum TLS] MRAFFLPFIQDALQKAGIETDKEIQIDKPNDKKFGDFSTNIAFLVAKEARKNPRELAGQLIGLLDFPEGT VTKTEVAGPGFINFHLAPAFFMRSAQEVLAKGEGFGCNESGKGLKAIVEYVSANPTGPLTIGRGRGGVLG DCIANLLETQGYEVTREYYFNDAGRQMQILAESVRYRYLEKCGQVIEFPETHYQGDYIGEIAETLFIEHG DGLAATDELTIFKEAAEAVIFSSIRKTLERLLITHDSFFNEHTLYQSREGQPSANQRVIDALDAKGFIGN YDGATWFMTTKLGQEKDKVLIKSSGDPSYRLPDIAYHVTKFERGFDLMVNVFGADHIDEYPDVLEALKIL GYDTSKVKIAINQFVTTTVGGQTVKMSTRKGNADLLDDLIDDVGADATRLFFIMRGKDSHLNFDVELAKK QSKDNPVFYLQYAHARICSLVRMAEKEVGFDEATAIGAGLPLLSSEPEIDLASALLDFPDIIQSSLRQLE PQKMVEYLHTVAERYHKFYQECPILKADEHLRTARLELSLAVRQVLRNGFKILGISAPESM >gi|20808887|ref|NP_624058.1| Arginyl-tRNA synthetase [Thermoanaerobacter tengcongensis] MENIVQKAKEEIKDVVLKALNEAKKEGLLNFESIQDVEVEEPKEKQHGDLATNFAMVMAREAKMAPRKIA EIIASKMNTSGTFIEKVEVAGPGFINFFLNQNFLIETLKLIHKRGKDYGRVNLGKGKKVQVEFVSANPTG PMHMGNARGGAIGDVLASILDYAGYNVSREFYINDAGNQIEKFGYSLEARYLQLLGIDAEVPEGGYHGED IIDRAKEFLEIHGDKYKDVPSEERRKALIEYGLKKNIEKMKEDLVLYGIEYDVWFSEQSLYDSGEVYKVI EELTEKGYTYEKDGALWFKMTLFGAEKDDVLVRSNGVPTYLASDIAYHKNKFVTRGFDWVINVWGADHHG HVAPMKGAMKALGIDPNRLDVVLMQLVKLIEGGQVVRMSKRTGKMITLRDLIEEVGKDAARFFFNMRSPD SPIEFDLDLAKQQTNENPVFYVQYAHARICSIIRQLEEMGVKIENIEDVDLGLLKEEEEVDLIKKLAYFP EEITIAAKTLAPHRITRYVIDVASLFHSFYNSHRVKGAEENLMKARFALILAVKTVLKNALDILKVTAPE RM >gi|15611371|ref|NP_223022.1| ARGINYL-TRNA SYNTHETASE [Helicobacter pylori J99] MHTLIKGVLEEILEAEVIIEYPKDREHGHYATPIAFNLAKVFKKSPLAIAEELALKIGSHEKTQGFFDRV VACKGYINFTLSLDFLERFTQKALELKEQFGSQVKSERSQKIFLEFVSANPTGPLHIGHARGAVFGDSLA KIARFLGHEVLCEYYVNDMGSQIRLLGVSVWLAYKEHVLKESVTYPEVFYKGEYIIEIAKKAHNDLEPSL FKENEETIIEVLSDYAKDLMLLEIKGNLDALDIHFDSYASEKEVFKHKDAVFDRLEKANALYEKDSKTWL KSSLYQDESDRVLIKEDKSYTYLAGDIVYHDEKFQQNYTKYINIWGADHHGYIARVKASLEFLGYDSSKL EVLLAQMVRLLKDNEPYKMSKRAGNFILIKDVIDDVGKDALRFIFLSKRLDTHLEFDVNTLKKQDSSNPI YYIHYANSRIHTMLEKSPFSKEEILQTPLKNLNAEEKYLLFSALSLPKAVESSFEEYGLQKMCEYAKTLA SEFHRFYNAGKILDTPKAKELLKICLMVSLSLTNAFKLLGIEIKTKISSKD >gi|15836752|ref|NP_297440.1| arginyl-tRNA synthetase [Xylella fastidiosa 9a5c] MLTRFSYKRSDKITLSIATHPHPHVKAPLRALICQGIEALRSNGTLPTNTLPPDFVVERPKTRKHGDFAT NVAMLLSKATGSNPRLLAQTLVAALPTSADIARIEIAGPGFINFHLHPVAYQRETINVLKQDNDYGRNLS GQSRTVGVEYVSANPTGPLHVGHGRAAAIGDCLARLLEANGWNVKREFYYNDAGVQIENLVRSVQARARG LKPGDAFWPTDAYNGEYIADIAKAYLAGDSINMVDTIITSTKNVDDTAAIHHFAVNYLRNEQNHDLAAFN VDFDIYFLESSLYKDGKVEETVQKLINSGHTYEEGGALWLKSTHFGDDKDRVMRKSDGSYTYFVPDIAYH LSKWQRGYERAITELGADHHGSLARVHAGLQALEIGIPPGWPEYVLHQMVTVMRGGEEVKLSKRSGGYVT LRDLIEETSTDATRWFLIARKPDSQLTFDIDLARQKSNDNPVFYVQYAYARVCSLMHQAHEKNLNYDQTS GMASLDQLSDNTSLCLMIEISRYPEIVQIACELLEPHLIAQYLRELAHAFHTWYHNTPVLVENAVERNAK LTLACATRQVLANGLNLLGVGTPEKM >gi|53715701|ref|YP_101693.1| arginyl-tRNA synthetase [Bacteroides fragilis YCH46] MKIEDKLVTSVISGLKALYGQDVPAAQVQLQKTKKEFEGHLTLVVFPFLKMSKKGPEQTAQEIGEYLKAN EPAVAAFNVIKGFLNLTVASATWIELLNEIHADAQYGIVSADENAPLVMIEYSSPNTNKPLHLGHVRNNL LGNALANIVMANGNKVVKTNIVNDRGIHICKSMLAWQKYGKGETPESSGKKGDHLVGDYYVAFDKHYKAE VAELMEKGMSKEEAEAASPLMNEAREMLVKWEAGDPEVRALWQMMNNWVYTGFDETYRKMGVGFDKIYYE SNTYLEGKEKVMEGLEKGFFFKKEDGSVWADLTAEGLDHKLLLRGDGTSVYMTQDIGTAKLRFADYPIDK MIYVVGNEQNYHFQVLSILLDKLGFEWGKSLVHFSYGMVELPEGKMKSREGTVVDADDLMAEMIATAKET SQELGKLDGLTQEEADDIARIVGLGALKYFILKVDARKNMTFNPKESIDFNGNTGPFIQYTYARIRSVLR KAAEAGIVIPEVLPANIELSEKEEGLIQMVADFAAVVRQAGEDYSPSGIANYVYDLVKEYNQFYHDFSIL REENEDVKLFRIALSANIAKVVRLGMGLLGIEVPDRM >gi|15807552|ref|NP_296288.1| arginyl-tRNA synthetase [Deinococcus radiodurans R1] MDLKAQLKAAVEQAAHQMGMPVDAAIQETPANKPGDYGTPAAFQMAKAAGGNPAQIAAQLAQTVVLPAGI RRVEATGPFLNFFLDAGAFVRGVVERPFELPKREGKVVIEHTSVNPNKELHVGHLRNVVLGDSMARILRA AGHTVEVQNYIDDTGRQAAESLFATQHYGRVWDGVQKYDQWLGEGYVQLNADPQKPELESGIMEIMHKLE AGELRPLVEQTVKAQLQTCFRLGARYDLLNWESDVVGSGFLAQAMNILEGSRYTSRPTEGKYAGAFIMDV SEFMPGLEEPNVVLVRSGGTAMYAAKDIGYQFWKFGLFEGMKFKPFMQDPEGNTIWTSAPDGQPDDERRF GHAQEVINVIDSRQDHPQTVVRSALGVAGEQEKEERSIHLSYAFVTLEGQTISGRKGIAVSADDAMDEAQ KRALSVLQGINPDLAAREDAAEIARRIGLGAIRFAMLKAEPTRKIDFRWEQALALNGDTAPYVQYAAVRA ANILKKAEEAGYATDGTGADWDALPDIDLVLAKQIAKLPEVAAQAARIHSPHVVAQYALDLATSFNAWYN AKTKQGKPATNVLQSEEGLREARLALIVRLRKAFEDTLDLIGIEIPAAM >gi|16080786|ref|NP_391614.1| arginyl-tRNA synthetase [Bacillus subtilis subsp. subtilis str. 168] MNIAEQMKDVLKEEIKAAVLKAGLAEESQIPNVVLETPKDKTHGDYSTNMAMQLARVAKKAPRQIAEEIV AHFDKGKASIEKLDIAGPGFINFYMNNQYLTKLIPSVLEAGEAYGETNIGNGERVQVEFVSANPTGDLHL GHARGAAVGDSLCNVLSKAGYDVSREYYINDAGNQINNLALSVEVRYFEALGLEKPMPEDGYRGEDIIAI GKRLAEEYGDRFVNEEESERLAFFREYGLKYELEKLRKDLENFRVPFDVWYSETSLYQNGKIDKALEALR EKGHVYEEDGATWFRSTTFGDDKDRVLIKKDGTYTYLLPDIAYHKDKLDRGFDKLINVWGADHHGYIPRM KAAIEALGYEKGTLEVEIIQLVHLYKNGEKMKMSKRTGKAVTMRDLIEEVGLDAVRYFFAMRSADTHMDF DLDLAVSTSNENPVYYAQYAHARICSMLRQGEEQGLKPAADLDFSHIQSEKEYDLLKTIGGFPEAVAEAA EKRIPHRVTNYIYDLASALHSFYNAEKVIDPENEEKSRARLALMKATQITLNNALQLIGVSAPEKM >gi|15594939|ref|NP_212728.1| arginyl-tRNA synthetase (argS) [Borrelia burgdorferi B31] MLKRKKMNKSVKKKIKDEINVIVTNLALSNNIKLDNININIQKPPKSDLGDISILMFEIGKTLKLPIEII SEEIIKNLKTKYEIKAVGPYLNIKISRKEYINNTIQMVNTQKDTYGTSKYLDNKKIILEFSSPNTNKPLH VGHLRNDVIGESLSRILKAVGAKITKINLINDRGVHICKSMLAYKKFGNGITPEKAFKKGDHLIGDFYVK YNKYSQENENAEKEIQDLLLLWEQKDVSTIELWKKLNKWAIEGIKETYEITNTSFDKIYLESEIFKIGKN VVLEGLEKGFCYKREDGAICIDLPSDSDEKADTKVKQKVLIRSNGTSIYLTQDLGNIAVRTKEFNFEEMI YVVGSEQIQHFKSLFFVAEKLGLSKNKKLIHLSHGMVNLVDGKMKSREGNVIDADNLISNLIELIIPEMT QKIENKESAKKNALNIALGAIHYYLLKSAIHKDIVFNKKESLSFTGNSGPYIQYVGARINSILEKYKALS IPVMEKIDFELLKHEKEWEIIKIISELEENIINAAKDLNPSILTSYSYSLAKHFSTYYQEVKVIDTNNIN LTAARIEFLKAILQTIKNCMYLLNIPYMLKM >gi|15643850|ref|NP_228899.1| arginyl-tRNA synthetase [Thermotoga maritima MSB8] MLVNAIRQKVSEVISKAYGSEIEFEVEIPPRKEFGDLSTNVAMKLAKTLKKNPREIAQEIVKSLDEDPSF DRIEIMGPGFINFFLSNELLRGVVKTVLEKKDEYGRENVGNGMKVQFEYGSANPTGPFTVGHGRQIIIGD VLSEVYKELGYDVTREMYINDAGKQIRLLAQSLWARYNQLLGVEKEIPEGGYRGEYLVDIARDLVNEIGD RYKDLWNEEVEEFFKQTALNRILSSMKDTLEKIGSSFDVYFSEKSLIEDGTVEEVLKLLKNKDVVYEKDG AVWLKVSAFIDEEDKVLVRSDGTYTYFMTDIAYHYKKYKRGFRKVYDIWGSDHHGHIPRMKAAMKALDIP DDFFNVILHQFVTLKRGGEIVRMSTRAGEFVTLDELLDEVGRDATRYFFAMVDPNTHMVFDIDLAKAKSM DNPVYYVQYAHARIHNLFSNAEKKGVKFEEGKHLELLGNEEERVLMRNLGMFNTALKEVAQMFAPNRLTN YLQSLAESFHAFYTKHVIVDPENPELSNARLNLALATGIVLRKGLKLIGVSAPERM >gi|46200208|ref|YP_005875.1| Arginyl-tRNA synthetase [Thermus thermophilus HB27] MLRRALEEAIAQALKEMGVPARLKVARAPKDKPGDYGVPLFALAKELRKPPQAIAQELKDRLPLPEFVEE AIPVGGYLNFRLRTEALLREALRPKAPFPRRPGVVLVEHTSVNPNKELHVGHLRNIALGDAIARILAYAG REVLVLNYIDDTGRQAAETLFALRHYGLTWDGKEKYDHFAGRAYVRLHQDPEYERLQPAIEEVLHALERG ELREEVNRILLAQMATMHALNARYDLLVWESDIVRAGLLQKALALLEQSPHVFRPREGKYAGALVMDASP VIPGLEDPFFVLLRSNGTATYYAKDIAFQFWKMGILEGLRFRPYENPYYPGLRTSAPEGEAYTPKAEETI NVIDVRQSHPQALVRAALALAGYPALAEKAHHLAYETVLLEGRQMSGRKGLAVSVDEVLEEATRRARAIV EEKNPDHPDKEEAARMVALGAIRFSMVKTEPKKQIDFRYQEALSFEGDTGPYVQYAHARAHSILRKAGEW GAPDLSQATPYERALALDLLDFEEAVLEAAEEKTPHVLAQYLLDLAASWNAYYNARENGQPATPVLTAPE GLRELRLSLVQSLQRTLATGLDLLGIPAPEVM >gi|46323650|ref|ZP_00224013.1| COG0018: Arginyl-tRNA synthetase [Burkholderia cepacia R1808] MLPAHKQTLEALLADSVAQVAHALKGADAEFVIPAITLERPKVAAHGDVACNVAMQLAKPLGTNPRQLAE RIVAALVAQPAAQGLVDAAEIAGPGFINLRVSAAAKQAVIAAVFEQGRAFGTSQREKGKRVLVEFVSANP TGPLHVGHGRQAALGDVLANVIASQGYAVHREFYYNDAGVQIANLAISTQARARGLKPGDAGWPEAAYNG EYIADIARDYLNGATVAAKDGEPVTGARDIENLDAIRKFAVAYLRHEQDMDLQAFGVKFDQYYLESSLYS EGRVEKTVDALVKAGMTYEQDGALWLRTTDEGDDKDRVMRKSDGTYTYFVPDVAYHVTKWERGFTKVINI QGSDHHGTIARVRAGLQGLHIGIPKGYPDYVLHKMVTVMRDGQEVKLSKRAGSYVTVRDLIEWSGGAAPG QEAAPDMIDEATITRGRDAVRFFLISRKADTEFVFDIDLALKQNDENPVYYVQYAHARICSVLNELKARY NVDVAQLPGADLSQLTSPQAVSLMQKLAEYPDLLTHAANELAPHAVAFYLRDLAGEFHSFYNAERVLVDD EAPRNARAALLAATRQVLENGLAMLGVSAPAKM >gi|30248381|ref|NP_840451.1| Arginyl-tRNA synthetase [Nitrosomonas europaea ATCC 19718] MVTTTLPDFKSHCIQLLDQAARQVLPDEVGVQIELLRPKLADHGDYSSNLAMKLARRLRRNPLELAKALI GALPDSSCVEKADVAGGGFINFFLKKTAKQQFLHAVLQAGDSFGHSRLGAGKTIQIEFVSANPTGPLHVG HGRGAAFGASLANIMTAAGYAVTREFYVNDAGRQMDILTLSTWLRYLDLCGLSFSFPANAYRGQYVADMA SEIYQAQGDRYAHRSDATIRQLTEISTSTTIDSEDERLDRLITAAKSILDQDYADLHNFVLTEQLADCRN DLMEFGVEFETWFSEQSLFDSGMVARAVQLLDDKKLLYRQDGALWFRSTDFGDEKDRVVQRENGLYTYFA SDIAYHLSKYERGFDYLLNIWGADHHGYIPRVKGAIEALSLDPGRLEIALVQFAVLYRDGKKVSMSTRSG EFVTLRQLRQEVGNDAARFFYVLRKSDQHLDFDLDLAKSQSNDNPVYYVQYAHARICSVLGQWGGAEDIL ARAETELLTDPAELVLLQKMIDFTDTIEAAAKERAPHLIAFFLRELAGEFHSYYNSTRFLVEDESLKITR LALISAVRQILSKGLTLLGVTAPREM >gi|433534|emb|CAA79710.1| arginyl-tRNA synthetase [Corynebacterium glutamicum] MTPADLATLIKETAVEVLTSRELDTSVLPEQVVVERPRNPEHGDYATNIALQVAKKVGQNPRDLATWLAE ALAADDAIDSAEIAGPGFLNIRLAAAAQGEIVAKILAQGETFGNSDHLSHLDVNLEFVSANPTGPIHLGG TRWAAVGDSLGRVLEASGAKVTREYYFNDHGRQIDRFALSLLAAAKGEPTPEDGYGGEYIKEIAEAIVEK HPEALALEPAATQELFRAEGVEMMFEHIKSSLHEFGTDFDVYYHENSLFESGAVDKAVQVLKDNGNLYEN EGAWWLRSTEFGDDKDRVVIKSDGDAAYIAGDIAYVADKFSRGHNLNIYMLGADHHGYIARLKAAAAALG YKPEDVEVLIGQMVNLLRDGKAVRMSKRAGTVVTLDDLVEAIGIDAARYSLIRSSVDSSLDMDLGLWESQ SSDNPVYYVQYGHARLCSIARKAETLGVTEEGADLSLLTHDREGDLIRTLGEFPAVVKAAADLREPHRIA RYAEELAGTFHRFYDSCHILPKADEDTAPIHTARLALAAATRQTLANALRLVGVSAPEKM >gi|15606252|ref|NP_213630.1| arginyl-tRNA synthetase [Aquifex aeolicus VF5] MKELVKEKVLKALKELYNTQVENFKVEKPKEEAHGDLASNVAFLLARELKKPPVNIAQELADFLSKDETF KSVEAVKGFINFRFSEDFLKEEFKKFLLSGEAYFKEDLGKGLKVQLEYVSANPTGPLHLGHGRGAVVGDT LARLFKFFNYDVTREYYINDAGRQVYLLGISIYYRYLEKCPERDEETFKEIKEIFEKDGYRGEYVKEIAE RLRKLVGESLCKPEEANLKEVREKILKEESIELYYTKKYEPKDVVDLLSNYGLDLMMKEIREDLSLMDIS FDVWFSERSLYDSGEVERLINLLKEKGYVYEKDGALWLKTSLFGDDKDRVVKRSDGTYTYFASDIAYHYN KFKRGFEKVINVWGADHHGYIPRVKAALKMLEIPEDWLEILLVQMVKLFREGKEVKMSKRAGTFVTLREL LDEVGKDAVRFIFLTKRSDTPLDFDVEKAKEKSSENPVYYVQYAHARISGIFREFKERYKKDVSVEELIN YVQHLEEEAEIKLIKKVLFFKDELVDITLKREPHLLTYYLIDLAGDFHHYYNHHRILGMEENVMFSRLAL VKGIKEVVRLGLNLMGVSAPERM >gi|15792499|ref|NP_282322.1| arginyl-tRNA synthetase [Campylobacter jejuni subsp. jejuni NCTC 11168] MKSIIFNEIKKILECDFALENPKDKNLAHFATPLAFSLAKELKKSPMLIASDLASKFQNHDCFESVEAVN GYLNFRISKTFLNELANQALTNPNDFTKGEKKQESFLLEYVSANPTGPLHIGHARGAVFGDTLTRLARHL GYKFNTEYYVNDAGNQIYLLGLSILLSVKESILHENVEYPEQYYKGEYIVDLAKEAFEKFGKEFFSEENI PSLADWAKDKMLVLIKQNLEQAKIKIDSYVSERSYYDALNATLESLKEHKGIYEQEGKIWLASSQKGDEK DRVIIREDGRGTYLAADIVYHKDKMSRGYGKCINIWGADHHGYIPRMKAAMEFLGFDSNNLEIILAQMVS LLKDGEPYKMSKRAGNFILMSDVVDEIGSDALRYIFLSKKCDTHLEFDISDLQKEDSSNPVYYINYAHAR IHQVFAKAGKKIDDVMKADLQSLNQDGVNLLFEALNLKAVLNDAFEARALQKIPDYLKNLAANFHKFYNE NKVVGSANENDLLKLFSLVALSIKTAFSLMGIEAKNKMEH >gi|45657950|ref|YP_002036.1| arginyl-tRNA synthetase [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] MSRSLWIARHMKENETLKQIVLKTLEESVNSLISSFPEVEKEAFKIKIEYSRDEKFGDYSTSFALENSKL LKRNPIQVSKELVEILQKRTDLFEKVDFTPPGFVNFRISTSFLLNYIETSVLSGNYFPKVDLPLKINLEF VSANPTGPLNIVSARAAANGDTMASLLKAIGHNVDKEFYINDYGNQVFLLGVSTLVRIRELKGEEGTQQE TTDDTPIEIILEKNILPAEGYRGEYIKDIASSLLKDPKKNVTIENLLKQKKYKELAELCAVWTIENNLIW QRKDLDAFGVEFDCYFSERTLHEADKVLSVMKDLEKSGKIFQEDGKKVFRSTEYGDDKDRVVVRDDGRPT YLLADIAYHKDKIERGYDKIYDIWGPDHHGYISRLSGAVQSLGYKKENFKVIISQQVNLLESGQKVKMSK RAGSFQTMSDLIGFLGKHGKDVGRYFFVMRSLDAPLDFDLDLAKDESDKNPVFYLQYAHARICSIFKEVG DQTSKEAAAILEMSEERKRLLFWIARFPEEIFDSANAMEPHRVTNYLQSFAKAFTSFYLAKDNRLKDASK EVRLGLARICLAAKNVLAEGLKLIGVSAPERMEKEN >gi|39996911|ref|NP_952862.1| arginyl-tRNA synthetase [Geobacter sulfurreducens PCA] MSQIEGSMKDAVRDLVREALERSFADGTLASGHVPDIVVEKPALEEHGDFACTAAMLMAKAEKKAPRAIA EIIITHLNDRESLVESVEIAGPGFINFRMRTSAWCRVLRRIEREGGDYGKSEAGAGKKVQVEFVSANPTG PLHIGHGRGAAIGDTICRLLAAIGWDVTREFYYNDAGQQIANLALSVQARCLGVEPGGPLWPTDGYQGEY IKDVARSYLNRETVDAGDQHVTAAGDPHDVEAIRRFAVAYLRREQDQDLRAFDVGFDVYFLESSLYAEGR VDDVVQRIIAKGHAYEQDGALWLRTTEFGDDKDRVMRKSDGSYTYFVPDVAYHLNKWERGFIRVVNEQGA DHHSTITRVRAGLQALDAGIPKGWPEYVLHQMVTVMRGGEEVKISKRAGSYVTLRDLVDEVGRDATRFFF LMRKPDSQLVFDIDLAKQQTLENPVYYVQYAHARICSIFENAADKGVVPPTVDQASLESLGTPEELTLVK LLSSFPEIVEGSALNFEPHRITYYLQELAGAFHSFYNKNRVITEDADLTGARLLLLHSTATVIRNGLGLL GVSAPEKM >gi|15840742|ref|NP_335779.1| arginyl-tRNA synthetase [Mycobacterium tuberculosis CDC1551] MTPADLAELLKATAAAVLAERGLDASALPQMVTVERPRIPEHGDYASNLAMQLAKKVGTNPRELAGWLAE ALTKVDGIASAEVAGPGFINMRLETAAQAKVVTSVIDAGHSYGHSLLLAGRKVNLEFVSANPTGPIHIGG TRWAAVGDALGRLLTTQGADVVREYYFNDHGAQIDRFANSLIAAAKGEPTPQDGYAGSYITNIAEQVLQK APDALSLPDAELRETFRAIGVDLMFDHIKQSLHEFGTDFDVYTHEDSMHTGGRVENAIARLRETGNIYEK DGATWLRTSAFGDDKDRVVIKSDGKPAYIAGDLAYYLDKRQRGFDLCIYMLGADHHGYIARLKAAAAAFG DDPATVEVLIGQMVNLVRDGQPVRMSKRAGTVLTLDDLVEAIGVDAARYSLIRSSVDTAIDIDLALWSSA SNENPVYYVQYAHARLSALARNAAELALIPDTNHLELLNHDKEGTLLRTLGEFPRVLETAASLREPHRVC RYLEDLAGDYHRFYDSCRVLPQGDEQPTDLHTARLALCQATRQVIANGLAIIGVTAPERM >gi|34556632|ref|NP_906447.1| ARGINYL-TRNA SYNTHETASE [Wolinella succinogenes DSM 1740] MHHTIKHLLETTLGFSVVLEKPKDKNHGHYATPAAFSLAKELKKNPALIAQELSKKLSEIEVFESVQSVG GYINFRLKQGFLDAQASLALSQGREFGKGDKQGSILLEYVSANPTGPLHIGHARGAVLGDALSRIGRHLG YALETEYYVNDAGNQIHLLGLSIYLAGRDSLLSLPVTYPEQYYRGEYIVDIAKEALKKWGEKAFADEAFI PELSLFGKELMLEEIRSNLADTHIHFDHYVSEKSLYPRWEETYALLQSHQGCYEGGGKVWLRSSAHGDEK DRVIVRESGEPTYLAGDIIYHADKFARPYDRYINIWGADHHGYIARVKAAIEFLGHDSSKLEVLLSQMVT LLKGGQPYKMSKRAGNFILMRDVLEDIGADALRFIFLSKKPDTHLEFDVDDLNKEDSSNPIFYINYAHAR IHTMLGKSSLDSQEIEAASLEGLEDSIFDLLFLSLQLPQVLEDSFENRAIQKVAEYLRALAGEFHKFYNE HKILETPQEAALLKVCKVVALSLSQGLALLGITAKERM >gi|16120221|ref|NP_395809.1| ArgS [Halobacterium sp. NRC-1] MLYNLRQELLAGIRAATSDAGYDYEVDQSAIELEDITDEEKGEFSSPISFSIAAAAGAPPVDVAAAIADA HRSNGLPAEVEAVTVEGGHINYHADTTDLADATLSTILRDGSEYGTRTDADPDTILADVSSPNIAKPLHV GHLRNTILSDAVMNILEARGHDVTRDNHLGDWGVQFGNLMHEYTEFGDEATLEDDAIEHLLDLYQQFEQR DSMLADLEDDETVTDQFADAVTEERDYHADSGKEWFTRLEQGDEDATALWERFRTVSIDRFKQTYDDLDV AFDVWNGESFYAQEGWNDVIIEKAIENDVAMRGEGESVYIPVYPDDYENVGDPQAADVDASLDRARQMRE ANDDLEDADFDPFYIVKSDGSTLYGTRDLATIEYRIEEYDADQSVYVVANEQNQYFQQLFVAARKMGYND IKLKHIDYGLISLPEGSMSTRKGQIITAREVLDRAQDRAEEIIAEKGRIDDAEAQSVATKIALATIKYEM VAAKRERDTTFDIDESVALEGDTGPYVQYAATRGYSILDGADAAPEIDDLDPSVFNDTDVELLFELARYP LVLERCEERYDAAPLAHYLLQLAHVFNSFYHKNAVLDAENARTERLLLTKATTQIFDNGLGLLGIDVLEE M >gi|21227450|ref|NP_633372.1| Arginyl-tRNA synthetase [Methanosarcina mazei Go1] MFLELKAQATSILKEAIRKAGFEVEDSELQFETSPHADLASRAAFRLAGIHRQNPKDLASRIVSAVEIPE GSFIGKVSAAGPYINFFAGKHYLNGTVNAVLKEKEKFGCGAPKDRILLEHTSANPNGPLHVGHIRNSIIG DTLARILRRAGYDVEVQYYVNDMGRQIAVVSWACERFELDLSRKSDSAIADVYIKANVELDKNPGYIKEI DALMEKVEAGDVRTIEHFDKAVSLAVAGIKETLLRLNVAHDKFVSESTFLKSGAVHDIVERIKATGRTKT DKGALVVDLSDYGFEKTLVIQRSNGTSLYTTRDLAYHEWKAGQADRIIDVFGADHKLISGQLRATLNAIG VKEPEVVIFEFVSLPEGSMSTRRGQFISADDLFDRVTGAAFEQVETRRPETSYEFKKQVAEAVGLGAVRY DIVRVSPEKSTVFNWKEALDFEKQGAPYIQYSHARACSILEKAKEEAAWNPDKEIDPSLLVEDSEIDLIK KMAMFDSVIDLGARELKPHVLAIYARELADAFNQFYRFVPVIAAEDENVRAARLALVDCARVVLANSLDT LGIIAPESM >gi|16081423|ref|NP_393761.1| arginine--tRNA ligase related protein [Thermoplasma acidophilum DSM 1728] MLLFQDLRKDIYEIVSKRFRISENDVYLDDTGHSDITIRVFRILKSPDGGENAVMEIVRSISEKDYVEKA LSEGGYINVWIKRTYMLREVLESIEKSGTYPDVFQEAERVSVEHTSANPTGPLHIGRARNSIIGDSIYRI LSRYGYRTVRQYFVNDSGKQMISLYTAYIKYGGPITIENLLENYQKIYREMEKDQSIEKEIEKNIERYEN ADPEVFGTLRKIAGVMLDGIASTLKRIGIEFDEFDWESDLLLNGSVRKAIDMLETKEEDSARYIEISGKK VFLTRKDGTTLYFARDIAYHLFKAENSEWIIDVLGEDHKDHAKSLNHVLKEMLKLENRVSFMYYSFITLE TGKMSTRRGNIVTLQDLVDRTYDEALKIVNEKRPDLSEEERKKIAEVIASSAVRYSIIRVSAPKPITFRW EEALNFESNSAPFIMYSHARAASILDKAPEPEQSYGMDMPKEEADLVKAMYVYPYYLKDAAQDLKPDLIA AYLISLVQKFNDFYGACRVIGTDPLTYARRIRIVKAYKQILSDAGDLIGIKMLDQM >gi|15639817|ref|NP_219267.1| arginyl-tRNA synthetase (argS) [Treponema pallidum subsp. pallidum str. Nichols] MQDLCEMWRHAVARVLSQLQGPAVEPVEGAQLVMEEPPEPGMGDIAFPLFLFAKRVRRSPAQLAQQLCTL LEEDTSMCAYGTPQARGPYLNVFLNKECVAAHTLDAIFAQGERYGHTQYLQGKRIMVEFSSPNTNKPLHV GHLRNNAIGESLSRIIAFCGADVFKVNIINDRGVHICKSMCAYQKFAHGKTPAHTGIKSDRFVGDWYVQF NRYAQQYPEEAEHDVRDLLQRWESADPHVRALWRTMNEWALRGIKQTYERTGISFDKLYFESETYTKGRE EVRRGLACGVFYQMEDNSIWVDLSSLGLDKKALLRSDGTTMYITQDIGTAIFRAQDWPFDQLLYVVGNEQ NYHFKVLFFVLRLLGYPWAQQLHHVSYGMVNLPHGRMKSREGTVVDADDILDRLHSAAEEEIAKKGRENA LKHAQCIAENVAIAALHYFLLQVSPQKDMVFHPEESLSFNGNTGPYLQYMGARISSLLKKVQEDVEQKGP REVRCDPALLTHEAEWELVKALARFPACVTRAAQGHDPSVITGYLYTLSKSFSRFYHDCPILCEARPDYA CARLELVRAVRIVLRTAMRLVLIPFLEEM >gi|39575708|emb|CAE79875.1| argS [Bdellovibrio bacteriovorus HD100] MIKHDSIRLLATNLLKDAIGRAYPDFSASEDDIYKALVNPPKSDLGDLAFGCFILAKALKTAPPQVATAV AAQMKGATAVAAGPYINIRFDEQTHGEQVLATILDGSYFKKPLMEKSPKTMIEYSQPNTHKELHVGHMRN LCLGDAIVRMLRYSGREIVSSTFPGDMGTHVAKCLWYMKKHNQEPVPETEKGEWLGRMYSKANLLLEDQN GTPQEDINRQELTAILHQLEGKTGPYYDLWLETREWSIELMKKVYAWADVTFDEWYFESEMDSPSAAWVK QLYAEGKLEMSQGAIGKDLESEKLGFCMLLKSDGTGLYATKDLLLAKHKFEDVKIEKSVYVVDMRQALHF KQVFRVLEILGFEQAKNCFHLQYNYVELPDGAMSSRKGNIVPLRELVHRMEDHVKTTYLSRYKGEWSEED VEKIAGQVAKGAIFYGMLRMDTNKKIVFDMNEWLKLDGESGPFVQYSYARISSLGRKFPRTAGAKIDWSR LNHASERQLMQSLGGFNTAVAAAAENFKPSAICTYLYDLAKSFNVFYHECPIGTEADVATREARLALSEA VGLTLKNGLAVLGMPAPEKM
Example Output File
>EscherichiacoliK12 MNIQALLSEKVRQAMIAAGAPADCEPQVRQSAKVQFGDYQANGMMAVAKKLGMAPRQLAEQVLTHLDLNG IASKVEIAGPGFINIFLDPAFLAEHVQQALASDRLGVATPEKQTIVVDYSAPNVAKEMHVGHLRSTIIGD AAVRTLEFLGHKVIRANHVGDWGTQFGMLIAWLEKQQQENAGEMELADLEGFYRDAKKHYDEDEEFAERA RNYVVKLQSGDEYFREMWRKLVDITMTQNQITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVE SEGATVVFLDEFKNKEGEPMGVIIQKKDGGYLYTTTDIACAKYRYETLHADRVLYYIDSRQHQHLMQAWA IVRKAGYVPESVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALERARRLVAEKNPDMPADELEKL ANAVGIGAVKYADLSKNRTTDYIFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKAEIDEEQLAAAPVIIRE DREAQLAARLLQFEETLTVVAREGTPHVMCAYLYDLAGLFSGFYEHCPILSAENEEVRNSRLKLAQLTAK TLKLGLDTLGIETVERM >HaemophilusinfluenzaeRdKW20 MNIQSILSDKIKQAMILAGADQSCDALIRQSGKPQFGDYQANGIMAAAKKLGLNPREFAQKVLDNLQLSD IAEKLEIAGPGFINIFLNPTWLTTEISAALSHKNLGIQATNKQTVVIDYSSPNVAKEMHVGHLRSTIIGD AVARTLEFLGHNVIRANHVGDWGTQFGMLIAYLEKMQNEHASEMELQDLEAFYREAKKHYDEDEVFAEKA RNYVVKLQSGDEYCRTMWKRLVDITMQQNQHNYARLNVTLTEKDVMGESLYNPMLPSIVKDLKKQGLAVE NDGALVVYLDEFKNKDGDPMGVIVQKKDGGFLYTTTDIAAAKYRYETLKANRALVFSDTRQSQHMQQAWL ITRKAGYVPDSFSLEHKNFGMMLGKDGKPFKTRTGGTVKLADLLDEAIERATVLINEKNTNLSNDEKEAV IEAVGIGAVKYADLSKNRTTDYVFDWDNMLSFEGNTAPYMQYAYTRIRSIFNKTDINSTALLAAPLTIKD DKERTLAIKLLQFEEAVQTVGKEGTPHVLCAYLYELAGIFSSFYEHCPILNAEDESIKLSRLKLALLTEK TLKQGLTLLGIKTVEKM >VibriocholeraeO1biovareltorstr.N16961 MICLTSKMMAFLPYFLELKGKRVNIQALINDRVSQAIEAAGAPAGTPALVRQSAKAQFGDYQANGIMGAA KQLGTNPREFAQKVLDVLNLEGIASKTEIAGPGFINIFLSEEFLAAQAEAALADARLGVAQEAPKTIVAD YSAPNVAKEMHVGHLRSTIIGDAVVRTLEFLGHKVIRANHIGDWGTQFGMLIANLERVQKASGEVSMELS DLEAFYRESKKLYDEDEQFAETARNYVVKLQGGDPFCLEMWKKLVDVTMIQNQRNYDRLNVSLTRENVMG ESMYNDMLPQIVSDLKAKGLAVEDDGAQVVFLEEFKNKDGEPMGVIIQKRDGGFLYTTTDIACAKYRYET LGADRVLYFIDSRQHQHLMQAWTIVRKAGYIPENVSLEHHAFGMMLGKDGRPFKTRAGGTVRLADLLDEA QERAKALIESKNPELSAEEKANIANTVAMAAVKYADLSKHRTTDYVFDWDNMLAFEGNTAPYMQYAYTRV ASVFAKAGVDMNELTGHIQITEEKEKALIAKLLQFEEAVQSVAREGQPHIMCSYLFELAGIFSSFYEACP ILVAEQESIKQSRLKLAALTAKTIKQGLALLGIDTLERM >NeisseriameningitidisMC58 MNLHQTVEHEAAAAFAAAGIADSPIVLQPTKNAEHGDFQINGVMGAAKKAKQNPRELAQKVAEALADNAV IESAEVAGPGFINLRLRPEFLAQNIQTALNDARFGVAKTDKPQTVVIDYSSPNLAKEMHVGHLRSSIIGD SISRVLAFMGNTVIRQNHVGDWGTQFGMLVAYLVEQQKDNAAFELADLEQFYRAAKVRFDEDPAFADTAR EYVVKLQGGDETVLALWKQFVDISLSHAQAVYDTLGLKLRPEDVAGESKYNDDLQPVVDDLVQKGLAVED DGAKVVFLDEFKNKEGEPAAFIVQKQGGGFLYASTDLACLRYRIGRLKADRLLYVVDHRQALHFEQLFTT SRKAGYLPENVGAAFIGFGTMMGKDGKPFKTRSGDTVKLVDLLTEAVERATALVKEKNPELGADEAAKIG KTVGIGAVKYADLSKNRTSDYVFDWDAMLSFEGNTAPYLQYAYTRVQSVFRKAGEWDANAPTVLTEPLEK QLAAELLKFEDVLQSVADTAYPHYLAAYLYQIATLFSRFYEACPILKAEGASRNSRLQLAKLTGDTLKQG LDLLGIDVLDVM >PseudomonasaeruginosaPAO1 MKDTIRQLIQQALDQLTADGTLPAGLTPDIQVENTKDRSHGDFASNIAMMLAKPAGMKPRDLAARLVEAI PAHEQLAKVEIAGPGFLNFFQDHVWLAASLDRALADERLGVRKAGPAQRVVIDLSSPNLAKEMHVGHLRS TIIGDAVARVLEFLGDTVIRQNHVGDWGTQFGMLLAYLEEQPVDAEAELHDLEVFYRAAKKRFDESPEFA DRARELVVKLQAGDPDCLRLWTRFNEISLSHCQKVYDRLGVKLSMADVMGESAYNDDLAQVVADLTAKGL LTEDNGALCVFLEEFKNAEGNPLPVIVQKAGGGYLYATTDLAAMRYRHNVLHADRVLYFVDQRQALHFQQ VFEVARRAGFVPAGMELEHMGFGTMNGADGRPFKTRDGGTVKLIDLLEEAESRAYALVKERNEQRAERGE EPFDEVQLREIGRVVGIDSVKYADLSKHRTSDYSFNFELMLSFEGNTAPYLLYACTRVASVFRKLGQGRE QLGGKIVLEQPQELALAAQLAQFGDLINNVALKGVPHLLCAYLYELAGLFSSFYEHCPILTAEDPAQKDS RLRLAALTGRTLEQGLELLGLKTLERM >ThermosynechococcuselongatusBP-1 MVAPIKILGDRLRRALQAALPLDTYPQPLLVPASQVKFGDYQSNVCLSLAKQLGKAPRELAQEVVPHLEV EDLCQPVEIAGPGFLNFRLKPEFLAATLQAARGSDRLGIPPAREPRRVVVDFSSPNIAKEMHVGHLRSTI IGDCIARILEFQGHTVLRLNHVGDWGTQFGMLIAYLDEVYPDALTTANALDLGDLVTFYKKAKQRFDSDP EFQQKARAKVVALQQGEEQSRRAWQLLCEQSRREFQKIYDLLDIQLTERGESFYNPFLPAVIEDLAACGL LVEDQGAKVVFLEGFTNKEGQPQPLIIQKSDGGYNYATTDLAALRYRIDKDQADWIIYVTDVGQSTHFAQ VFQVAQRAGWVPPHVTLTHVPFGLVLGEDGKRLKTRSGETIRLIDLLTEAIARSRADLEQRLATEGRTES PEFIDTVARAIGIGAVKYADLSQNRNSNYVFSYDKMLSLQGNTAPYLLYAYVRVQGLTRRGDIDWCTLSP DSPLLLEDETEQHLAKHLVQLEETLDLVSTELLPNRLCQYLFELSQLFNQFYDRCPILSAPQPTKQSRLT LAYLTAQTLKLGLSLLGIPVLDRI >Nostocsp.PCC7120 MNATQEQLKIKLEQALVAAFGDEYAGVDPILVSASNPKFGDYQANVALSLSKKLGQQPRAIASAIVEKLD VSEICEKPEIAGPGFINLKLKTAYLEAQLNTIQADTRLGVPTAKHPQREIVDFSSPNIAKEMHVGHLRST IIGDSIARILEFRGHDVLRLNHVGDWGTQFGMLITYLREVSPEALTTANALDIGDLVSFYRQAKQRFDAD EAFQETARQEVVRLQAGAADTLHAWKLLCEQSRQEFQVIYDLLDVKLTERGESFYNPLLPTVVENLEKSG LLVENQGAKCVFLDGFTNREGEPLPLIVQKSDGGYNYATTDLAALRYRIQKDEAKRIIYITDAGQANHFA QFFQVARKAGWIPDDVELVHVPFGLVLGEDGKKFKTRSGDTVRLRDLLDEAISRAHADVEVRLKAEEREE TAEFIDKVAEVVGISAVKYADLSQNRTSNYIFSYDKMLDLKGNTAPYMLYAYARIQGISRKGEINFADLG DNAKVILQHETEFALAKYLLQLGEVISTVEEDLSPNRLCEYLYELSKRFNAFYDRNQGVQVLSAEEPLRT SRLVLCDLTARTLKLGLSLLGIQVLERM >Prochlorococcusmarinusstr.MIT9313 MQAHFASEFMLSLAHALESQLRAAIDRAFPEAAASARESGTGLDPQLAPASKPEFGDFQANAALPLAKPL KQPPRQIAAAIVDQLMVDTAFTAICLTPEIAGPGFINLTVRPECLAAEVQARLADARLGVPLVEGDNDGQ QPTPVVVDFSSPNIAKEMHVGHLRSTIIGDSLARVLEFRGHPVLRLNHVGDWGTQFGMLITHLKQVAPEA LETADAVDLGDLVVFYRQAKQRFDDDEAFQTTSREEVVKLQGGDPISLKAWSLLCDQSRREFQKIYDRLD VRLNERGESFYNAYLESVVEDLNVSGLLVSDDGAQCVFLEGVTGKDGKPLPVIVQKSDGGFNYATTDLAA MRYRFAAPPQGDGARRVIYVTDAGQANHFAGVFQVAQRAGWIPDAGRLQHVPFGLVQGEDGKKLKTRAGD TVRLRELLDEAVERAESDLRRRLQEEGRDEDESFIEQVATTVGLAAVKYADLSQNRITNYQFSFDRMLAL QGNTAPYLLYAVVRIAGIARKGGDLDVTTAELQFSETQEWALVRELLKFDAVIAEVEEELLPNRLCTYLF ELSQVFNRFYDQVPVLKAEQPSRSCRLALCRLTADTLKLGLSLLGIPTLERM >ChlamydiatrachomatisD/UW-3/CX MTTLLSFLTSLCSAAIHQAFPELEELTLDITPSTKEHFGHYQCNDAMKLARVLHKSPRAIAESIVAHIPP TPFSSIEIAGAGFINFTFSKEFLASQLQTFSKELANGFRAASPQKVIIDFSSPNIAKDMHVGHLRSTIIG DCLARCFSFVGHDVLRLNHIGDWGTAFGMLITYLQETSQEAIHQLEDLTALYKKAHARFAEDSEFKKRSQ HNVVALQSGDAQALALWKQICSVSEKSFQTIYSILDVELHTRGESFYNPFLAEVVADLESKNLVTLSDGA KCVFHEAFSIPLMIQKSDGGYNYATTDVAAMRYRIQQDQADRILIVTDSGQSLHFQLLEATCLAAGYLPS KGIFSHVGFGLVLDTQGRKFKTRSGENIKLRELLDTAVEKAKESLKAHRPDISEEELAYQGPILGINAIK YADLSSHRINDYVFSFEKMLRFEGNTAMSLLYAYVRIQGIKRRMGLESPPQEGPLAVHEPAEEALALTLL RFPEILDLTLRELCPHFLTDYLYALTNKFNAFFRDCHIEGSDSQQERLYLCGLTERTLSTGMHLLGLKTL NHL >ChlamydophilapneumoniaeJ138 MSTLLSILSVICSQAIAKAFPNLEDWAPEITPSTKEHFGHYQCNDAMKLARVLKKAPRAIAEAIVAELPQ EPFSLIEIAGAGFINFTFSPVFLNQQLEHFKDALKLGFQVSQPKKIIIDFSSPNIAKDMHVGHLRSTIIG DSLARIFSYVGHDVLRLNHIGDWGTAFGMLITYLQENPCDYSDLEDLTSLYKKAYVCFTNDEEFKKRSQQ NVVALQAKDPQAIAIWEKICETSEKAFQKIYDILDIVVEKRGESFYNPFLPEIIEDLEKKGLLTVSNDAK CVFHEAFSIPFMVQKSDGGYNYATTDLAAMRYRIEEDHADKIIIVTDLGQSLHFQLLEATAIAAGYLQPG IFSHVGFGLVLDPQGKKLKTRSGENVKLRELLDTAIEKAEEALREHRPELTDEAIQERAPVIGINAIKYS DLSSHRTSDYVFSFEKMLRFEGNTAMFLLYAYVRIQGIKRRLGISQLSLEGPPEIQEPAEELLALTLLRF PEALESTIKELCPHFLTDYLYNLTHKFNGFFRDSHIQDSPYAKSRLFLCALAEQVLATGMHLLGLKTLER L >StreptomycescoelicolorA3(2) MASVTSLSDSVQQHLASALTATRPEAAGADPLLRRSDRADYQANGILALAKKTKANPRELAAEVVARITT GDELIEDVEVSGPGFLNITVADRAITANLAARLADGERLGVPLKQDAGTTVVDYAQPNVAKEMHVGHLRS AVIGDALRSMLDFTGEKTIGRHHIGDWGTQFGMLIQYLFEHPGELAPAGDIDGEQAMSNLNRVYKASRAV FDTDEEFKERARRRVVALQSGDKETLDLWQQFVDESKVYFYSVFEKLDMEIRDEEIVGESAYNDGMPETA RLLEEMGVAVRSEGALVVFFDEIRGKDDQPVPLIVQKADGGFGYAASDLTAIRNRVQDLHATTLLYVVDV RQSLHFRMVFETARRAGWLGDEVTAHNMGYGTVLGADGKPFKTRAGETVRLEDLLDEAVQRAAEVVREKA RDLTEDEIQERAAQVGIGAVKYADLSTSPNRDYKFDLDQMVSLNGDTSVYLQYAYARIQSILRKAGEVRP AAHPELALHEAERALGLHLDAFGPTVFEAAAEYAPHKLAAYLYQLASLYTTFYDKCPVLKAETPEQVENR LFLCDLTARTLHRGMALLGIRTPERL >RhodopirellulabalticaSH1 MHLPNVLQARFVQALEPLTDSPSDYAGMIRPAADPKFGDYQSNAAMPLAKRVGKTSRDVAAELVQNLNVT DLFEEPEVAGPGFINLRLKDSVLFDSIQQMLLDERVGVSKTTDPKKVIVDFSSPNVAKPMHVGHIRSTVI GDCLARTLRFYGEDVVTDNHLGDWGTQFGIIIYGYRNFGDPAKVAANPVPELSALYRLTNQLIEYQKAKQ SLATMADKLATAKSDAKTAKEVSDQSESDENLKPKDKKKLRKNAEAATRRVASIEADMKSLKAKIDAVDS DTELSKLASEHSDVDVAVLRETAKLHEGDPENLALWKEFLPHCQDEINRIYDRLNVQFDHTLGESFYHDR LAGVVDHLTTLGLTTKSDGAICVFLEGFDSPMIIQKRDGAFLYATTDLATLQYRRDEFQPDEILYVVDSR QGEHFKKFFAMAEPLGMAEVQLVHVNFGTVLGPDGRPMKTRSGSLIGLESLLNDAVSRAKEVVCNPDRLA TMDPPMGGEEQQQIAEIVGIGAIKYADLSHHRTSDYKFDVDKMVALEGNTATYVQYSYARTQSILRRASD GEGLPAFEQAIEQAAATQPMTFTHPNERSLALMLMRFEEAIEQVRLNYAPNALCDYLFETAKTYSSFNES CRVLGNDDPAVMQTRLALVVLTGRVLKKGLSLLGIDVAERM >Fusobacteriumnucleatumsubsp.vincentiiATCC49256 MKITSKELTDIFQKHVESLFPNKELKPVEITVATNENFGDYQCNFAMINSKIIGDNPRKIAEEIKNNFSC GDVVEKLEVAGPGFINIFLSDKYISNSIKKIGENYDFSFLNRKGKVIIDFSSPNIAKRMHIGHLRSTIIG EAVCRIYKFLGYDVVADNHIGDWGTQFGKLIVGYRKWLNREAYEKNAIEELERVYVKFSDEAEKDPSLED LARAELKKVQDGEEENTKLWKEFITESLKEYNKLYKRLDVHFDTYYGESFYNDMMGDVVKELVDKKIAVD DDGAKVVFFDEKDNLFPCIVQKKDGAYLYSTSDIATVKFRKNTYDVNRMIYLTDARQQDHFKQFFKITDM LGWNIEKYHIWFGIIRFADGILSTRKGNVIKLEELLDEAHSRAYDVVNEKNPNLSEEEKQNIAEVVGVSS VKYADLSQNKQSDIIFEWDKMLSFEGNTAPYLLYTYARIQSILRKVTEQNIDLNKNIEIKTDNKFEKSLA TYLLVFPISVLKAAETFKPNLIADYLYELSKKLNSFYNNCPILNQDIETLKSRALLIKKTGEVLKEGLGL LGIPVLNKM >CaulobactercrescentusCB15 MNDLKRSLSEAAAAAFQAAGLPPEFGRVTASDRPDLADFQCNGALAAAKSAKRNPREIAVQVVDILKGDP RLASVEIAGVGFINMRVSDEALSARAREIASDDRTGAQLLETPRRVLIDYAGPNVAKPMHVGHLRASIIG ESVKRLYRFRGDDVVGDAHFGDWGFQMGLLISAIMDEDPFINALMEKLPEAPRGFSSADEAKVMAEFEKR ITLADLDRIYPAASVRQKEDPAFKERARKATAELQNGRFGYRLLWRHFVNVSRVALEREFHALGVDFDLW KGESDVNDLIEPMVLQLEAKGLLVQDQGARIVRVAREGDKRDVPPLLVVSSEGSAMYGTTDLATILDRRK SFDPHLILYCVDQRQADHFETVFRAAYLAGYAEEGALEHIGFGTMNGADGKPFKTRAGGVLKLHDLIEMA REKARERLREAGLGAELSEEQFEDTAHKVGVAALKFADLQNFRGTSYVFDLDRFTSFEGKTGPYLLYQSV RIKSVLRRAAESGAVAGRVEIHEPAERDLAMLLDAFEGALQEAYDKKAPNFVAEHAYKLAQSFSKFYAAC PIMSADTETLRASRLTLAETTLRQLELALDLLGIEAPERM >StreptococcuspneumoniaeR6 MNTKELIASELSSIIDSLDQEAILKLLETPKNSEMGDIAFPAFSLAKVERKAPQMIAAELAEKMNSQAFE KVVATGPYVNFFLDKSAISAQVLQAVTTEKEHYADQNIGKQENVVIDMSSPNIAKPFSIGHLRSTVIGDS LSHIFQKIGYQTVKVNHLGDWGKQFGMLIVAYKKWGDEEAVKAHPIDELLKLYVRINAEAENDPSLDEEA REWFRKLENGDEEALALWQWFRDESLVEFNRLYNELKVEFDSYNGEAFYNDKMDAVVDILSEKGLLLESE GAQVVNLEKYGIEHPALIKKSDGATLYITRDLAAALYRKNEYQFAKSIYVVGQEQSAHFKQLKAVLQEMG YDWSDDITHVPFGLVTKEGKKLSTRKGNVILLEPTVAEAVSRAKVQIEAKNPELENKDQVAHAVGVGAIK FYDLKTDRTNGYDFDLEAMVSFEGETGPYVQYAYARIQSILRKADFKPETAGNYSLNDTESWEIIKLIQD FPRIINRAADNFEPSIIAKFAISLAQSFNKYYAHTRILDESPERDSRLALSYATAVVLKEALRLLGVEAP EKM >Clostridiumperfringensstr.13 MDYKKLVAERIKEHVDLELENIEKLIEIPPKPEMGDFAFPCFQLAKVMRKAPNMIAAELAEKINKEGFER VECLGPYLNFFVDKVAFSKNIISKVLEEGDKYGSSKIGEGKNVVVEYSSPNIAKPFHVGHLFTTAIGHSL YRMLNFEGYNPIRINHLGDWGTQFGKLISAYKRWGNEEALEEAPINELLRIYVKFHDEAENNPELEDEGR MYFKKLEDGDQEAVALWERFKDLSLKEFNKIYDMLGVDFDSWAGESFYNDKMDKVVEELEKANILTESNG AKVVMLDEYNMPPCIVVKSDGASIYATRDLAAASYRHKTYNFDKCIYVVGKDQILHFNQVFKTLELAGNE WAKNCVHIPFGLVKFADRKLSTRKGNVVLLEDLLNEAIDKTRETIEEKNPQLENKEEVAKKIGIGAILFT YLKNSRERDIVFDWKEMLSFDGETGPYVQYSYARAKSILRKAEEQKITAEPDFTKLTSKEEFELAKTLEG LQKAVILGIDKLEPSVVTRYSIEVAKAFNKFYNNHTVLNVEDEGLKAARLELIKATAQVIKNALFLIGID VVEKM >Staphylococcusaureussubsp.aureusN315 MNIIDQVKQTLVEEIAASINKAGLADEIPDIKIEVPKDTKNGDYATNIAMVLTKIAKRNPREIAQAIVDN LDTEKAHVKQIDIAGPGFINFYLDNQYLTAIIPEAIEKGDQFGHVNESKGQNVLLEYVSANPTGDLHIGH ARNAAVGDALANILTAAGYNVTREYYINDAGNQITNLARSIETRFFEALGDNSYSMPEDGYNGKDIIEIG KDLAEKHPEIKDYSEEARLKEFRKLGVEYEMAKLKNDLAEFNTHFDNWFSETSLYEKGEILEVLAKMKEL GYTYEADGATWLRTTDFKDDKDRVLIKNDGTYTYFLPDIAYHFDKVKRGNDILIDLFGADHHGYINRLKA SLETFGVDSNRLEIQIMQMVRLMENGKEVKMSKRTGNAITLREIMDEVGVDAARYFLTMRSPDSHFDFDM ELAKEQSQDNPVYYAQYAHARICSILKQAKEQGIEVTAANDFTTITNEKAIELLKKVADFEPTIESAAEH RSAHRITNYIQDLAAHFHKFYNAEKVLTDDIEKTKAHVAMIEAVRITLKNALAMVGVSAPESM >RhodopseudomonaspalustrisCGA009 MAELPMSTHLFARLLSRVHAVCAALIEEGALPAGIDLSRVVVEPPKDASHGDMATNAAMVLAKDAKAKPR DLADKIADKLRAEELIDQVAIAGPGFINLTLKPAVWAEALRAVLDAGAGYGRSTVGGGEKVNVEYVSANP TGPMHVGHCRGAVFGDALANLLDTAGYDVTREYYINDAGAQVDVLARSAFLRYREALGETIGEIPEGLYP GDYLKPVGEALKAEHGAALKDMPEAQWLPTVRATAIAMMMEAIKGDLAALNITHEVFFSERSLIEGGRNR VAETIEFLRAKGDVYQGRLPPPKGAPVEDYEDREQTLFRATAYGDDVDRPLLKSDGSYTYFASDIAYHKV KFDAGFANMVDVWGADHGGYIKRMQAAIQAVTAGKGALDVKIVQLVRLLRNGEPVKMSKRAGDFVTLREV VDEVGSDAVRFMMLFRKNDAVLDFDLAKVIEQSKDNPVFYVQYGHARGHSIFRNAREVVPDLPEDSKARA AMLRQAPLERLNDPAELELLKRLALYPRIVEAAAQAHEPHRIAFYLNELASEFHALWTHGRDLPHLRFII NNDAEITRARLAMVQGVVSVLASGLAILGVTAPDEMR >ListeriainnocuaClip11262 MNVMQENQIKLIEHIKQAVVQAVGLEETEVPEILLEVPKDKKHGDYSTNIAMQLARVAKKAPRQIAESIV PELKKDTKLIKEVEIAGPGFINFYLDNAYLTDLVPVILTEDKKYGESDFGKGEKFQIEFVSANPTGDLHL GHARGAAIGDSLANIMKMAGFDVSREYYINDAGNQINNLVLSAEARYFEALGLESEFPEDGYRGSDIIAL GKDLAAKYGDKYVNASEEERRSVFRVDALAFETGKLRADLEEFRVSFDEWFSETSLYEENKVLPALERLR ENGYIYEQDGATWLRTTDFEDDKDRVLIKSDGSYTYFLPDIAYHLNKLERGFDVLIDIWGADHHGYIPRM RAAIEALGYSPNQLEVEIIQLVHLFEDGVQVKMSKRTGKSVTMRDLIEEVGLDATRYFFAMRSSDTHMNF DMSLAKSTSNDNPVYYVQYAHARISSILRSGKEQGLEVSKDANMSLLETEAEYDLLKVLGEFADVVAEAA VKRAPHRIVRYLNDLATAFHRFYNSNKVLDMDNLEVTKARLALIKTAQITLRNGLTLLGVSAPEKM >RalstoniasolanacearumGMI1000 MLPSHKQTISQLLSDAVGTLLPEGTNRPEIVLERPKQAAHGDIACNVALQLAKPLGTNPRELANRIADGI RADARGQRLVSAVEIAGPGFINLRLSPTARTDVLAAVFAEGDRYGAADLHDGAPVLVEFVSANPTGPLHV GHGRQAALGDALAALLEWQGHKVHREFYYNDAGVQIHNLAVSVQARARGFKPGDTGWPEAAYNGDYIADI AADYLAGKTVRASDGEPVTGARDVENIEAIRRFAVTYLRNEQDIDLQAFGVKFDHYYLESSLYADGKVQQ TVDALIAAGKTYEQEGALWLRTTDDGDDKDRVMRKSDGSYTYFVPDVAYHTTKWGRGFTQVINVQGSDHH GTIARVRAGLQGLDLGIPKGYPDYVLHKMVTVMKDGAEVKISKRAGSYVTVRDLIEWSNGDAESEAGVDT IRACVESGAPNWPGRFTRGRDAVRFFLLSRKADTEFVFDVDLALKQSDENPVYYVQYAHARICSVFEQWH AREGGDAASLAGADLAAVAGPEASPQAVALVQRIAAFPDMLADAARELAPHAVAFYLRDLAGDFHAFYNA DRVLVDDDAVKRARLALLAATRQVLRNGLAVIGVSAPQKM >BifidobacteriumlongumDJO10A MSPEALSELISSIAHNLVAAGQAGALTDELIPPVDKLAVMRPKDRAHGDWASNIAMQLAKKAGMKPRDLA EPFAAALAEADGIAKVEVAGPGFINITLDSASAAAVVDTVLAAGAMTDTDKHLNKVNEYGRNAHLGGQTL NLEFVSANPTGPIHIGGTRWAAVGDAMARVLEANGAKVVREYYFNDHGEQINRFAKSLVAAWAEANNLGE AGYQTETPCDGYKGAYINEIAARVQAEAESDGVDLTALAHQDQGLNDDGEPLGEADTEVREEFRKRAVPM MFDEIQKSMKDFRVNFDVWFHENSLYADGKVDAAIEELKSRGDIFDKDGATWFESTKHGDDKDRVIIKSN GEFAYFAADIAYYWDKRHRAENPADVAIYMLGADHHGYIGRMMAMCAAFGDEPGKNMQILIGQLVNVMKD GKPVRMSKRAGNVVTIDDLVSVVGVDAARYSLARSDYNQNFDIDLALLASHTNDNPVYYVQYAHARSKNV DRNAAVAGISYEGADLALLDTEADGEVLAALAQFPSVLATAADDRQPHKVARYLEELAATYHKWYNVERV VPMALTDPETRGDDEARKALEIAKNPEPARAAARLKLNDAVQQVIANGLDLLGVTAPEKM >Agrobacteriumtumefaciensstr.C58 MNIFADFDTRIKNALETLDLVKENREKVDFSRITVESPRDLSHGDVATNAAMVLAKPLGTNPRALAELLV PALQADGDVDGVNVAGPGFINLKVSVGYWQRLLADMIGQGVDFGRSTVGAGQKINVEYVSANPTGPMHVG HCRGAVVGDTLANLLAFAGYGVTKEYYINDAGSQIDVLARSVFLRYREALGEDIGSIPSGLYPGDYLVPV GQALADEYGIKLRAMPEEKWLPIVKDKAIDAMMVMIREDLALLNVRHDVFFSERTLHEGNGGPILSAIND LTFKGHVYKGTLPPPKGELPDDWEDREQTLFRSTEVGDDMDRALMKSDGSYTYFAADVAYFKNKFDRGFS EMIYVLGADHGGYVKRLEAVARAVSEGKSKLTVLLCQLVKLFRDGEPVKMSKRSGDFVTLRDVVDEVGRD PVRFMMLYRKNSEPLDFDFAKVTEQSKDNPVFYVQYAHARCKSIFRQAQEAFPGLAPSAEDMAASVALIS DINELQLVAKLAEYPRLIESAALSHEPHRLAFYLYDLAGSFHGHWNKGKDHQELRFINDKNRELSIARLG LVNAVANVLKSGLTLLGADAPDEMR >Brucellamelitensis16M MNIFADFDARIKKTLQDIDLKPKDGGELDLSRIGVEPPRDASHGDIATNAAMVLSKAVGQNPRELAARIA EALKADEDVESVDVAGPGFINLRLKASYWQRELLVMLNEGTDFGRSRLGAGKKVNVEYVSANPTGPMHVG HCRGAVVGDVLANLLKFAGYDVVKEYYINDAGAQIDVLARSVMLRYREALGESIGEIPAGLYPGDYLVRV GQELAGEFGTKLLEMPEAEALAIVKDRTIDAMMAMIRADLDALNVHHDVFYSERKLHVDHARAIRNAIND LTLKGHVYKGKLPPPKGQLPEDWEDREQTLFRSTEVGDDIDRPLMKSDGSFTYFAGDVAYFKDKYDHGFN EMIYVLGADHGGYVKRLEAVARAVSDGKAKLTVLLCQLVKLFRNGEPVRMSKRAGEFITLRDVVDEVGRD PVRFMMLYRKNDAPLDFDFAKVTEQSKDNPVFYVQYASARCHSVFRQAADQLGLVDLDRVAMGSHFEKLT DESEIALVRKLAEYPRLIESAAIHQEPHRLAFYLYDLASSFHSQWNRGAENPDLRFIKVNDPDLSLARLG LVQVVSDVLTSGLTIIGADAPTEMR >PorphyromonasgingivalisW83 MSILQKLENSAAAAVKALYGTDPMEGQIQLQKTKREFKGHLTLVVFPFVKMSRKSPEATATEIGEWLLAN ESAVSAIEVVKGFLNLTIAPRVWLELLNEIRADINFGHKVATEDSPLVMVEYSSPNTNKPLHLGHVRNNL LGYSLSEIMKANGYRVVKTNIVNDRGIHICKSMLAWQKWGDGVTPEKAGKKGDHLIGDFYVLFDKHYKAE LNSLMAEGKSKEEAEAASTLMAEAREMLRLWEAGDEKVVDLWRTMNQWVYDGFDATYKMMGVDFDKIYYE SETYLVGKEEVLRGLEEGLFVKHSDGSVWADLTKDGLDEKLLLRADGTSVYMTQDIGTAKMRFNDYPINR MIYVVGNEQNYHFQVLSILLDRLGFEFGKGLVHFSYGMVELPEGKMKSREGTVVDADDLMDEMIRTAAEI AAEAGKAAEMDEEESREVARIVGLGSLKYFILKVDPRKNMTFNPKESIDFNGNTGSFVQYTYARIRSLMR RAEAAGYDIPSQLPTDLPLSEKEEALIQKVSEYAEVVSEAGHSYSPALIANYIYDLVKEYNQFYHDFSVL KEEDERIRAFRLALSEVVALTMRKGFALLGIEMPERM >Rickettsiaprowazekiistr.MadridE MNIFNQLKQDIIAASQKLYNNKEIANTATIETPKDSFNGDLSSNIAMIIASKESIAPREVALKFKEVLVT LPYIASIEIAGPGFINFTIKAESWQAAIKDILQHEEKFFEIDIDKNSNINIEYVSANPTGPMHIGHARGA VYGDVLARILQKVGYSVTKEYYVNDAGSQINDLVSTVLLRYKEALGEPITIPVGLYPGEYLIPLGEILSK EYGNKLLTMNDVERFKIIKSFAVEKMLDLNRKDLADLGIKHDVFFSEQSLYDKGEIEKTVKLLERMGLIY EGTLPAPKGKVHEDWEYRVQKLFKSTNYGDSQDRPIEKADGSWSYFASDLAYAKDKIDRGANHLIYVLGA DHSGYVKRIEAIVKALGQEKVKVDVKICQLVNFVENGVPIKMSKRLGSFASVQDVNKEVGKDIIRFMMLT RQNDKPLDFDLVKVKEQSRENPIFYVQYAHVRTKSILSKARELMPEAYNSFKEGKYNLSLLSSEEEIEII KLLAAWTKTLEASVKYFEPHRIAFYLINLASKFHSMWNFGKENSDYRFIIENNKELTLARLALASVIQKI IASGLEVIGVEPMVTM >BordetellapertussisTohamaI MRGHLRQTPGRPPGGSARPAARQTCRRHLPFCRLPMLLEQQKQLISLIQAAVAQCLPEAQAQVQLERPKV AAHGDIATNVAMQLAKPARRNPRELAQGIVDALMAQPQARELIQDAEIAGPGFINFRLTPAARQAVVQAV ASQADAYGRAPRNGEKVLVEFVSANPTGPLHVGHARQAALGDAICRLYDASGWDVTREFYYNDAGNQIDN LAISVQARGRGIAPDAPDYPADGYKGDYIVEIARDFAARKSVQASDGQPVTATGDLDSLDDIRAFAVAYL RREQDLDLQAFGLAFDNYFLESSLYASGRVQETVDTLVAKGHTYEEGGALWLRTTELGTGDDKDRVMRKS EGGYTYFVPDVAYHKVKWERGFHHAVNIQGSDHHGTVARVRAGLQGLAGIPKDFPAYVLHKMVKVMRGGE EVKISKRAGSYVTMRDLIDWVGRDAVRYFLIQRRADTEFVFDIDLALSKSDENPVYYIQYAHARICTMIG NSGASAAEIAQADTALLTAPSEYALLQRLAEFPQVVALAAQELAPHHVAFWLRDCASDFHAWYNAERVLV DEPALKLARLRLAATTRQVLANGLALLGVSAPDRM >ChlorobiumtepidumTLS MRAFFLPFIQDALQKAGIETDKEIQIDKPNDKKFGDFSTNIAFLVAKEARKNPRELAGQLIGLLDFPEGT VTKTEVAGPGFINFHLAPAFFMRSAQEVLAKGEGFGCNESGKGLKAIVEYVSANPTGPLTIGRGRGGVLG DCIANLLETQGYEVTREYYFNDAGRQMQILAESVRYRYLEKCGQVIEFPETHYQGDYIGEIAETLFIEHG DGLAATDELTIFKEAAEAVIFSSIRKTLERLLITHDSFFNEHTLYQSREGQPSANQRVIDALDAKGFIGN YDGATWFMTTKLGQEKDKVLIKSSGDPSYRLPDIAYHVTKFERGFDLMVNVFGADHIDEYPDVLEALKIL GYDTSKVKIAINQFVTTTVGGQTVKMSTRKGNADLLDDLIDDVGADATRLFFIMRGKDSHLNFDVELAKK QSKDNPVFYLQYAHARICSLVRMAEKEVGFDEATAIGAGLPLLSSEPEIDLASALLDFPDIIQSSLRQLE PQKMVEYLHTVAERYHKFYQECPILKADEHLRTARLELSLAVRQVLRNGFKILGISAPESM >Thermoanaerobactertengcongensis MENIVQKAKEEIKDVVLKALNEAKKEGLLNFESIQDVEVEEPKEKQHGDLATNFAMVMAREAKMAPRKIA EIIASKMNTSGTFIEKVEVAGPGFINFFLNQNFLIETLKLIHKRGKDYGRVNLGKGKKVQVEFVSANPTG PMHMGNARGGAIGDVLASILDYAGYNVSREFYINDAGNQIEKFGYSLEARYLQLLGIDAEVPEGGYHGED IIDRAKEFLEIHGDKYKDVPSEERRKALIEYGLKKNIEKMKEDLVLYGIEYDVWFSEQSLYDSGEVYKVI EELTEKGYTYEKDGALWFKMTLFGAEKDDVLVRSNGVPTYLASDIAYHKNKFVTRGFDWVINVWGADHHG HVAPMKGAMKALGIDPNRLDVVLMQLVKLIEGGQVVRMSKRTGKMITLRDLIEEVGKDAARFFFNMRSPD SPIEFDLDLAKQQTNENPVFYVQYAHARICSIIRQLEEMGVKIENIEDVDLGLLKEEEEVDLIKKLAYFP EEITIAAKTLAPHRITRYVIDVASLFHSFYNSHRVKGAEENLMKARFALILAVKTVLKNALDILKVTAPE RM >HelicobacterpyloriJ99 MHTLIKGVLEEILEAEVIIEYPKDREHGHYATPIAFNLAKVFKKSPLAIAEELALKIGSHEKTQGFFDRV VACKGYINFTLSLDFLERFTQKALELKEQFGSQVKSERSQKIFLEFVSANPTGPLHIGHARGAVFGDSLA KIARFLGHEVLCEYYVNDMGSQIRLLGVSVWLAYKEHVLKESVTYPEVFYKGEYIIEIAKKAHNDLEPSL FKENEETIIEVLSDYAKDLMLLEIKGNLDALDIHFDSYASEKEVFKHKDAVFDRLEKANALYEKDSKTWL KSSLYQDESDRVLIKEDKSYTYLAGDIVYHDEKFQQNYTKYINIWGADHHGYIARVKASLEFLGYDSSKL EVLLAQMVRLLKDNEPYKMSKRAGNFILIKDVIDDVGKDALRFIFLSKRLDTHLEFDVNTLKKQDSSNPI YYIHYANSRIHTMLEKSPFSKEEILQTPLKNLNAEEKYLLFSALSLPKAVESSFEEYGLQKMCEYAKTLA SEFHRFYNAGKILDTPKAKELLKICLMVSLSLTNAFKLLGIEIKTKISSKD >Xylellafastidiosa9a5c MLTRFSYKRSDKITLSIATHPHPHVKAPLRALICQGIEALRSNGTLPTNTLPPDFVVERPKTRKHGDFAT NVAMLLSKATGSNPRLLAQTLVAALPTSADIARIEIAGPGFINFHLHPVAYQRETINVLKQDNDYGRNLS GQSRTVGVEYVSANPTGPLHVGHGRAAAIGDCLARLLEANGWNVKREFYYNDAGVQIENLVRSVQARARG LKPGDAFWPTDAYNGEYIADIAKAYLAGDSINMVDTIITSTKNVDDTAAIHHFAVNYLRNEQNHDLAAFN VDFDIYFLESSLYKDGKVEETVQKLINSGHTYEEGGALWLKSTHFGDDKDRVMRKSDGSYTYFVPDIAYH LSKWQRGYERAITELGADHHGSLARVHAGLQALEIGIPPGWPEYVLHQMVTVMRGGEEVKLSKRSGGYVT LRDLIEETSTDATRWFLIARKPDSQLTFDIDLARQKSNDNPVFYVQYAYARVCSLMHQAHEKNLNYDQTS GMASLDQLSDNTSLCLMIEISRYPEIVQIACELLEPHLIAQYLRELAHAFHTWYHNTPVLVENAVERNAK LTLACATRQVLANGLNLLGVGTPEKM >BacteroidesfragilisYCH46 MKIEDKLVTSVISGLKALYGQDVPAAQVQLQKTKKEFEGHLTLVVFPFLKMSKKGPEQTAQEIGEYLKAN EPAVAAFNVIKGFLNLTVASATWIELLNEIHADAQYGIVSADENAPLVMIEYSSPNTNKPLHLGHVRNNL LGNALANIVMANGNKVVKTNIVNDRGIHICKSMLAWQKYGKGETPESSGKKGDHLVGDYYVAFDKHYKAE VAELMEKGMSKEEAEAASPLMNEAREMLVKWEAGDPEVRALWQMMNNWVYTGFDETYRKMGVGFDKIYYE SNTYLEGKEKVMEGLEKGFFFKKEDGSVWADLTAEGLDHKLLLRGDGTSVYMTQDIGTAKLRFADYPIDK MIYVVGNEQNYHFQVLSILLDKLGFEWGKSLVHFSYGMVELPEGKMKSREGTVVDADDLMAEMIATAKET SQELGKLDGLTQEEADDIARIVGLGALKYFILKVDARKNMTFNPKESIDFNGNTGPFIQYTYARIRSVLR KAAEAGIVIPEVLPANIELSEKEEGLIQMVADFAAVVRQAGEDYSPSGIANYVYDLVKEYNQFYHDFSIL REENEDVKLFRIALSANIAKVVRLGMGLLGIEVPDRM >DeinococcusradioduransR1 MDLKAQLKAAVEQAAHQMGMPVDAAIQETPANKPGDYGTPAAFQMAKAAGGNPAQIAAQLAQTVVLPAGI RRVEATGPFLNFFLDAGAFVRGVVERPFELPKREGKVVIEHTSVNPNKELHVGHLRNVVLGDSMARILRA AGHTVEVQNYIDDTGRQAAESLFATQHYGRVWDGVQKYDQWLGEGYVQLNADPQKPELESGIMEIMHKLE AGELRPLVEQTVKAQLQTCFRLGARYDLLNWESDVVGSGFLAQAMNILEGSRYTSRPTEGKYAGAFIMDV SEFMPGLEEPNVVLVRSGGTAMYAAKDIGYQFWKFGLFEGMKFKPFMQDPEGNTIWTSAPDGQPDDERRF GHAQEVINVIDSRQDHPQTVVRSALGVAGEQEKEERSIHLSYAFVTLEGQTISGRKGIAVSADDAMDEAQ KRALSVLQGINPDLAAREDAAEIARRIGLGAIRFAMLKAEPTRKIDFRWEQALALNGDTAPYVQYAAVRA ANILKKAEEAGYATDGTGADWDALPDIDLVLAKQIAKLPEVAAQAARIHSPHVVAQYALDLATSFNAWYN AKTKQGKPATNVLQSEEGLREARLALIVRLRKAFEDTLDLIGIEIPAAM >Bacillussubtilissubsp.subtilisstr.168 MNIAEQMKDVLKEEIKAAVLKAGLAEESQIPNVVLETPKDKTHGDYSTNMAMQLARVAKKAPRQIAEEIV AHFDKGKASIEKLDIAGPGFINFYMNNQYLTKLIPSVLEAGEAYGETNIGNGERVQVEFVSANPTGDLHL GHARGAAVGDSLCNVLSKAGYDVSREYYINDAGNQINNLALSVEVRYFEALGLEKPMPEDGYRGEDIIAI GKRLAEEYGDRFVNEEESERLAFFREYGLKYELEKLRKDLENFRVPFDVWYSETSLYQNGKIDKALEALR EKGHVYEEDGATWFRSTTFGDDKDRVLIKKDGTYTYLLPDIAYHKDKLDRGFDKLINVWGADHHGYIPRM KAAIEALGYEKGTLEVEIIQLVHLYKNGEKMKMSKRTGKAVTMRDLIEEVGLDAVRYFFAMRSADTHMDF DLDLAVSTSNENPVYYAQYAHARICSMLRQGEEQGLKPAADLDFSHIQSEKEYDLLKTIGGFPEAVAEAA EKRIPHRVTNYIYDLASALHSFYNAEKVIDPENEEKSRARLALMKATQITLNNALQLIGVSAPEKM >BorreliaburgdorferiB31 MLKRKKMNKSVKKKIKDEINVIVTNLALSNNIKLDNININIQKPPKSDLGDISILMFEIGKTLKLPIEII SEEIIKNLKTKYEIKAVGPYLNIKISRKEYINNTIQMVNTQKDTYGTSKYLDNKKIILEFSSPNTNKPLH VGHLRNDVIGESLSRILKAVGAKITKINLINDRGVHICKSMLAYKKFGNGITPEKAFKKGDHLIGDFYVK YNKYSQENENAEKEIQDLLLLWEQKDVSTIELWKKLNKWAIEGIKETYEITNTSFDKIYLESEIFKIGKN VVLEGLEKGFCYKREDGAICIDLPSDSDEKADTKVKQKVLIRSNGTSIYLTQDLGNIAVRTKEFNFEEMI YVVGSEQIQHFKSLFFVAEKLGLSKNKKLIHLSHGMVNLVDGKMKSREGNVIDADNLISNLIELIIPEMT QKIENKESAKKNALNIALGAIHYYLLKSAIHKDIVFNKKESLSFTGNSGPYIQYVGARINSILEKYKALS IPVMEKIDFELLKHEKEWEIIKIISELEENIINAAKDLNPSILTSYSYSLAKHFSTYYQEVKVIDTNNIN LTAARIEFLKAILQTIKNCMYLLNIPYMLKM >ThermotogamaritimaMSB8 MLVNAIRQKVSEVISKAYGSEIEFEVEIPPRKEFGDLSTNVAMKLAKTLKKNPREIAQEIVKSLDEDPSF DRIEIMGPGFINFFLSNELLRGVVKTVLEKKDEYGRENVGNGMKVQFEYGSANPTGPFTVGHGRQIIIGD VLSEVYKELGYDVTREMYINDAGKQIRLLAQSLWARYNQLLGVEKEIPEGGYRGEYLVDIARDLVNEIGD RYKDLWNEEVEEFFKQTALNRILSSMKDTLEKIGSSFDVYFSEKSLIEDGTVEEVLKLLKNKDVVYEKDG AVWLKVSAFIDEEDKVLVRSDGTYTYFMTDIAYHYKKYKRGFRKVYDIWGSDHHGHIPRMKAAMKALDIP DDFFNVILHQFVTLKRGGEIVRMSTRAGEFVTLDELLDEVGRDATRYFFAMVDPNTHMVFDIDLAKAKSM DNPVYYVQYAHARIHNLFSNAEKKGVKFEEGKHLELLGNEEERVLMRNLGMFNTALKEVAQMFAPNRLTN YLQSLAESFHAFYTKHVIVDPENPELSNARLNLALATGIVLRKGLKLIGVSAPERM >ThermusthermophilusHB27 MLRRALEEAIAQALKEMGVPARLKVARAPKDKPGDYGVPLFALAKELRKPPQAIAQELKDRLPLPEFVEE AIPVGGYLNFRLRTEALLREALRPKAPFPRRPGVVLVEHTSVNPNKELHVGHLRNIALGDAIARILAYAG REVLVLNYIDDTGRQAAETLFALRHYGLTWDGKEKYDHFAGRAYVRLHQDPEYERLQPAIEEVLHALERG ELREEVNRILLAQMATMHALNARYDLLVWESDIVRAGLLQKALALLEQSPHVFRPREGKYAGALVMDASP VIPGLEDPFFVLLRSNGTATYYAKDIAFQFWKMGILEGLRFRPYENPYYPGLRTSAPEGEAYTPKAEETI NVIDVRQSHPQALVRAALALAGYPALAEKAHHLAYETVLLEGRQMSGRKGLAVSVDEVLEEATRRARAIV EEKNPDHPDKEEAARMVALGAIRFSMVKTEPKKQIDFRYQEALSFEGDTGPYVQYAHARAHSILRKAGEW GAPDLSQATPYERALALDLLDFEEAVLEAAEEKTPHVLAQYLLDLAASWNAYYNARENGQPATPVLTAPE GLRELRLSLVQSLQRTLATGLDLLGIPAPEVM >BurkholderiacepaciaR1808 MLPAHKQTLEALLADSVAQVAHALKGADAEFVIPAITLERPKVAAHGDVACNVAMQLAKPLGTNPRQLAE RIVAALVAQPAAQGLVDAAEIAGPGFINLRVSAAAKQAVIAAVFEQGRAFGTSQREKGKRVLVEFVSANP TGPLHVGHGRQAALGDVLANVIASQGYAVHREFYYNDAGVQIANLAISTQARARGLKPGDAGWPEAAYNG EYIADIARDYLNGATVAAKDGEPVTGARDIENLDAIRKFAVAYLRHEQDMDLQAFGVKFDQYYLESSLYS EGRVEKTVDALVKAGMTYEQDGALWLRTTDEGDDKDRVMRKSDGTYTYFVPDVAYHVTKWERGFTKVINI QGSDHHGTIARVRAGLQGLHIGIPKGYPDYVLHKMVTVMRDGQEVKLSKRAGSYVTVRDLIEWSGGAAPG QEAAPDMIDEATITRGRDAVRFFLISRKADTEFVFDIDLALKQNDENPVYYVQYAHARICSVLNELKARY NVDVAQLPGADLSQLTSPQAVSLMQKLAEYPDLLTHAANELAPHAVAFYLRDLAGEFHSFYNAERVLVDD EAPRNARAALLAATRQVLENGLAMLGVSAPAKM >NitrosomonaseuropaeaATCC19718 MVTTTLPDFKSHCIQLLDQAARQVLPDEVGVQIELLRPKLADHGDYSSNLAMKLARRLRRNPLELAKALI GALPDSSCVEKADVAGGGFINFFLKKTAKQQFLHAVLQAGDSFGHSRLGAGKTIQIEFVSANPTGPLHVG HGRGAAFGASLANIMTAAGYAVTREFYVNDAGRQMDILTLSTWLRYLDLCGLSFSFPANAYRGQYVADMA SEIYQAQGDRYAHRSDATIRQLTEISTSTTIDSEDERLDRLITAAKSILDQDYADLHNFVLTEQLADCRN DLMEFGVEFETWFSEQSLFDSGMVARAVQLLDDKKLLYRQDGALWFRSTDFGDEKDRVVQRENGLYTYFA SDIAYHLSKYERGFDYLLNIWGADHHGYIPRVKGAIEALSLDPGRLEIALVQFAVLYRDGKKVSMSTRSG EFVTLRQLRQEVGNDAARFFYVLRKSDQHLDFDLDLAKSQSNDNPVYYVQYAHARICSVLGQWGGAEDIL ARAETELLTDPAELVLLQKMIDFTDTIEAAAKERAPHLIAFFLRELAGEFHSYYNSTRFLVEDESLKITR LALISAVRQILSKGLTLLGVTAPREM >Corynebacteriumglutamicum MTPADLATLIKETAVEVLTSRELDTSVLPEQVVVERPRNPEHGDYATNIALQVAKKVGQNPRDLATWLAE ALAADDAIDSAEIAGPGFLNIRLAAAAQGEIVAKILAQGETFGNSDHLSHLDVNLEFVSANPTGPIHLGG TRWAAVGDSLGRVLEASGAKVTREYYFNDHGRQIDRFALSLLAAAKGEPTPEDGYGGEYIKEIAEAIVEK HPEALALEPAATQELFRAEGVEMMFEHIKSSLHEFGTDFDVYYHENSLFESGAVDKAVQVLKDNGNLYEN EGAWWLRSTEFGDDKDRVVIKSDGDAAYIAGDIAYVADKFSRGHNLNIYMLGADHHGYIARLKAAAAALG YKPEDVEVLIGQMVNLLRDGKAVRMSKRAGTVVTLDDLVEAIGIDAARYSLIRSSVDSSLDMDLGLWESQ SSDNPVYYVQYGHARLCSIARKAETLGVTEEGADLSLLTHDREGDLIRTLGEFPAVVKAAADLREPHRIA RYAEELAGTFHRFYDSCHILPKADEDTAPIHTARLALAAATRQTLANALRLVGVSAPEKM >AquifexaeolicusVF5 MKELVKEKVLKALKELYNTQVENFKVEKPKEEAHGDLASNVAFLLARELKKPPVNIAQELADFLSKDETF KSVEAVKGFINFRFSEDFLKEEFKKFLLSGEAYFKEDLGKGLKVQLEYVSANPTGPLHLGHGRGAVVGDT LARLFKFFNYDVTREYYINDAGRQVYLLGISIYYRYLEKCPERDEETFKEIKEIFEKDGYRGEYVKEIAE RLRKLVGESLCKPEEANLKEVREKILKEESIELYYTKKYEPKDVVDLLSNYGLDLMMKEIREDLSLMDIS FDVWFSERSLYDSGEVERLINLLKEKGYVYEKDGALWLKTSLFGDDKDRVVKRSDGTYTYFASDIAYHYN KFKRGFEKVINVWGADHHGYIPRVKAALKMLEIPEDWLEILLVQMVKLFREGKEVKMSKRAGTFVTLREL LDEVGKDAVRFIFLTKRSDTPLDFDVEKAKEKSSENPVYYVQYAHARISGIFREFKERYKKDVSVEELIN YVQHLEEEAEIKLIKKVLFFKDELVDITLKREPHLLTYYLIDLAGDFHHYYNHHRILGMEENVMFSRLAL VKGIKEVVRLGLNLMGVSAPERM >Campylobacterjejunisubsp.jejuniNCTC11168 MKSIIFNEIKKILECDFALENPKDKNLAHFATPLAFSLAKELKKSPMLIASDLASKFQNHDCFESVEAVN GYLNFRISKTFLNELANQALTNPNDFTKGEKKQESFLLEYVSANPTGPLHIGHARGAVFGDTLTRLARHL GYKFNTEYYVNDAGNQIYLLGLSILLSVKESILHENVEYPEQYYKGEYIVDLAKEAFEKFGKEFFSEENI PSLADWAKDKMLVLIKQNLEQAKIKIDSYVSERSYYDALNATLESLKEHKGIYEQEGKIWLASSQKGDEK DRVIIREDGRGTYLAADIVYHKDKMSRGYGKCINIWGADHHGYIPRMKAAMEFLGFDSNNLEIILAQMVS LLKDGEPYKMSKRAGNFILMSDVVDEIGSDALRYIFLSKKCDTHLEFDISDLQKEDSSNPVYYINYAHAR IHQVFAKAGKKIDDVMKADLQSLNQDGVNLLFEALNLKAVLNDAFEARALQKIPDYLKNLAANFHKFYNE NKVVGSANENDLLKLFSLVALSIKTAFSLMGIEAKNKMEH >LeptospirainterrogansserovarCopenhagenistr.FiocruzL1-130 MSRSLWIARHMKENETLKQIVLKTLEESVNSLISSFPEVEKEAFKIKIEYSRDEKFGDYSTSFALENSKL LKRNPIQVSKELVEILQKRTDLFEKVDFTPPGFVNFRISTSFLLNYIETSVLSGNYFPKVDLPLKINLEF VSANPTGPLNIVSARAAANGDTMASLLKAIGHNVDKEFYINDYGNQVFLLGVSTLVRIRELKGEEGTQQE TTDDTPIEIILEKNILPAEGYRGEYIKDIASSLLKDPKKNVTIENLLKQKKYKELAELCAVWTIENNLIW QRKDLDAFGVEFDCYFSERTLHEADKVLSVMKDLEKSGKIFQEDGKKVFRSTEYGDDKDRVVVRDDGRPT YLLADIAYHKDKIERGYDKIYDIWGPDHHGYISRLSGAVQSLGYKKENFKVIISQQVNLLESGQKVKMSK RAGSFQTMSDLIGFLGKHGKDVGRYFFVMRSLDAPLDFDLDLAKDESDKNPVFYLQYAHARICSIFKEVG DQTSKEAAAILEMSEERKRLLFWIARFPEEIFDSANAMEPHRVTNYLQSFAKAFTSFYLAKDNRLKDASK EVRLGLARICLAAKNVLAEGLKLIGVSAPERMEKEN >GeobactersulfurreducensPCA MSQIEGSMKDAVRDLVREALERSFADGTLASGHVPDIVVEKPALEEHGDFACTAAMLMAKAEKKAPRAIA EIIITHLNDRESLVESVEIAGPGFINFRMRTSAWCRVLRRIEREGGDYGKSEAGAGKKVQVEFVSANPTG PLHIGHGRGAAIGDTICRLLAAIGWDVTREFYYNDAGQQIANLALSVQARCLGVEPGGPLWPTDGYQGEY IKDVARSYLNRETVDAGDQHVTAAGDPHDVEAIRRFAVAYLRREQDQDLRAFDVGFDVYFLESSLYAEGR VDDVVQRIIAKGHAYEQDGALWLRTTEFGDDKDRVMRKSDGSYTYFVPDVAYHLNKWERGFIRVVNEQGA DHHSTITRVRAGLQALDAGIPKGWPEYVLHQMVTVMRGGEEVKISKRAGSYVTLRDLVDEVGRDATRFFF LMRKPDSQLVFDIDLAKQQTLENPVYYVQYAHARICSIFENAADKGVVPPTVDQASLESLGTPEELTLVK LLSSFPEIVEGSALNFEPHRITYYLQELAGAFHSFYNKNRVITEDADLTGARLLLLHSTATVIRNGLGLL GVSAPEKM >MycobacteriumtuberculosisCDC1551 MTPADLAELLKATAAAVLAERGLDASALPQMVTVERPRIPEHGDYASNLAMQLAKKVGTNPRELAGWLAE ALTKVDGIASAEVAGPGFINMRLETAAQAKVVTSVIDAGHSYGHSLLLAGRKVNLEFVSANPTGPIHIGG TRWAAVGDALGRLLTTQGADVVREYYFNDHGAQIDRFANSLIAAAKGEPTPQDGYAGSYITNIAEQVLQK APDALSLPDAELRETFRAIGVDLMFDHIKQSLHEFGTDFDVYTHEDSMHTGGRVENAIARLRETGNIYEK DGATWLRTSAFGDDKDRVVIKSDGKPAYIAGDLAYYLDKRQRGFDLCIYMLGADHHGYIARLKAAAAAFG DDPATVEVLIGQMVNLVRDGQPVRMSKRAGTVLTLDDLVEAIGVDAARYSLIRSSVDTAIDIDLALWSSA SNENPVYYVQYAHARLSALARNAAELALIPDTNHLELLNHDKEGTLLRTLGEFPRVLETAASLREPHRVC RYLEDLAGDYHRFYDSCRVLPQGDEQPTDLHTARLALCQATRQVIANGLAIIGVTAPERM >WolinellasuccinogenesDSM1740 MHHTIKHLLETTLGFSVVLEKPKDKNHGHYATPAAFSLAKELKKNPALIAQELSKKLSEIEVFESVQSVG GYINFRLKQGFLDAQASLALSQGREFGKGDKQGSILLEYVSANPTGPLHIGHARGAVLGDALSRIGRHLG YALETEYYVNDAGNQIHLLGLSIYLAGRDSLLSLPVTYPEQYYRGEYIVDIAKEALKKWGEKAFADEAFI PELSLFGKELMLEEIRSNLADTHIHFDHYVSEKSLYPRWEETYALLQSHQGCYEGGGKVWLRSSAHGDEK DRVIVRESGEPTYLAGDIIYHADKFARPYDRYINIWGADHHGYIARVKAAIEFLGHDSSKLEVLLSQMVT LLKGGQPYKMSKRAGNFILMRDVLEDIGADALRFIFLSKKPDTHLEFDVDDLNKEDSSNPIFYINYAHAR IHTMLGKSSLDSQEIEAASLEGLEDSIFDLLFLSLQLPQVLEDSFENRAIQKVAEYLRALAGEFHKFYNE HKILETPQEAALLKVCKVVALSLSQGLALLGITAKERM >Halobacteriumsp.NRC-1 MLYNLRQELLAGIRAATSDAGYDYEVDQSAIELEDITDEEKGEFSSPISFSIAAAAGAPPVDVAAAIADA HRSNGLPAEVEAVTVEGGHINYHADTTDLADATLSTILRDGSEYGTRTDADPDTILADVSSPNIAKPLHV GHLRNTILSDAVMNILEARGHDVTRDNHLGDWGVQFGNLMHEYTEFGDEATLEDDAIEHLLDLYQQFEQR DSMLADLEDDETVTDQFADAVTEERDYHADSGKEWFTRLEQGDEDATALWERFRTVSIDRFKQTYDDLDV AFDVWNGESFYAQEGWNDVIIEKAIENDVAMRGEGESVYIPVYPDDYENVGDPQAADVDASLDRARQMRE ANDDLEDADFDPFYIVKSDGSTLYGTRDLATIEYRIEEYDADQSVYVVANEQNQYFQQLFVAARKMGYND IKLKHIDYGLISLPEGSMSTRKGQIITAREVLDRAQDRAEEIIAEKGRIDDAEAQSVATKIALATIKYEM VAAKRERDTTFDIDESVALEGDTGPYVQYAATRGYSILDGADAAPEIDDLDPSVFNDTDVELLFELARYP LVLERCEERYDAAPLAHYLLQLAHVFNSFYHKNAVLDAENARTERLLLTKATTQIFDNGLGLLGIDVLEE M >MethanosarcinamazeiGo1 MFLELKAQATSILKEAIRKAGFEVEDSELQFETSPHADLASRAAFRLAGIHRQNPKDLASRIVSAVEIPE GSFIGKVSAAGPYINFFAGKHYLNGTVNAVLKEKEKFGCGAPKDRILLEHTSANPNGPLHVGHIRNSIIG DTLARILRRAGYDVEVQYYVNDMGRQIAVVSWACERFELDLSRKSDSAIADVYIKANVELDKNPGYIKEI DALMEKVEAGDVRTIEHFDKAVSLAVAGIKETLLRLNVAHDKFVSESTFLKSGAVHDIVERIKATGRTKT DKGALVVDLSDYGFEKTLVIQRSNGTSLYTTRDLAYHEWKAGQADRIIDVFGADHKLISGQLRATLNAIG VKEPEVVIFEFVSLPEGSMSTRRGQFISADDLFDRVTGAAFEQVETRRPETSYEFKKQVAEAVGLGAVRY DIVRVSPEKSTVFNWKEALDFEKQGAPYIQYSHARACSILEKAKEEAAWNPDKEIDPSLLVEDSEIDLIK KMAMFDSVIDLGARELKPHVLAIYARELADAFNQFYRFVPVIAAEDENVRAARLALVDCARVVLANSLDT LGIIAPESM >ThermoplasmaacidophilumDSM1728 MLLFQDLRKDIYEIVSKRFRISENDVYLDDTGHSDITIRVFRILKSPDGGENAVMEIVRSISEKDYVEKA LSEGGYINVWIKRTYMLREVLESIEKSGTYPDVFQEAERVSVEHTSANPTGPLHIGRARNSIIGDSIYRI LSRYGYRTVRQYFVNDSGKQMISLYTAYIKYGGPITIENLLENYQKIYREMEKDQSIEKEIEKNIERYEN ADPEVFGTLRKIAGVMLDGIASTLKRIGIEFDEFDWESDLLLNGSVRKAIDMLETKEEDSARYIEISGKK VFLTRKDGTTLYFARDIAYHLFKAENSEWIIDVLGEDHKDHAKSLNHVLKEMLKLENRVSFMYYSFITLE TGKMSTRRGNIVTLQDLVDRTYDEALKIVNEKRPDLSEEERKKIAEVIASSAVRYSIIRVSAPKPITFRW EEALNFESNSAPFIMYSHARAASILDKAPEPEQSYGMDMPKEEADLVKAMYVYPYYLKDAAQDLKPDLIA AYLISLVQKFNDFYGACRVIGTDPLTYARRIRIVKAYKQILSDAGDLIGIKMLDQM >Treponemapallidumsubsp.pallidumstr.Nichols MQDLCEMWRHAVARVLSQLQGPAVEPVEGAQLVMEEPPEPGMGDIAFPLFLFAKRVRRSPAQLAQQLCTL LEEDTSMCAYGTPQARGPYLNVFLNKECVAAHTLDAIFAQGERYGHTQYLQGKRIMVEFSSPNTNKPLHV GHLRNNAIGESLSRIIAFCGADVFKVNIINDRGVHICKSMCAYQKFAHGKTPAHTGIKSDRFVGDWYVQF NRYAQQYPEEAEHDVRDLLQRWESADPHVRALWRTMNEWALRGIKQTYERTGISFDKLYFESETYTKGRE EVRRGLACGVFYQMEDNSIWVDLSSLGLDKKALLRSDGTTMYITQDIGTAIFRAQDWPFDQLLYVVGNEQ NYHFKVLFFVLRLLGYPWAQQLHHVSYGMVNLPHGRMKSREGTVVDADDILDRLHSAAEEEIAKKGRENA LKHAQCIAENVAIAALHYFLLQVSPQKDMVFHPEESLSFNGNTGPYLQYMGARISSLLKKVQEDVEQKGP REVRCDPALLTHEAEWELVKALARFPACVTRAAQGHDPSVITGYLYTLSKSFSRFYHDCPILCEARPDYA CARLELVRAVRIVLRTAMRLVLIPFLEEM >BdellovibriobacteriovorusHD100 MIKHDSIRLLATNLLKDAIGRAYPDFSASEDDIYKALVNPPKSDLGDLAFGCFILAKALKTAPPQVATAV AAQMKGATAVAAGPYINIRFDEQTHGEQVLATILDGSYFKKPLMEKSPKTMIEYSQPNTHKELHVGHMRN LCLGDAIVRMLRYSGREIVSSTFPGDMGTHVAKCLWYMKKHNQEPVPETEKGEWLGRMYSKANLLLEDQN GTPQEDINRQELTAILHQLEGKTGPYYDLWLETREWSIELMKKVYAWADVTFDEWYFESEMDSPSAAWVK QLYAEGKLEMSQGAIGKDLESEKLGFCMLLKSDGTGLYATKDLLLAKHKFEDVKIEKSVYVVDMRQALHF KQVFRVLEILGFEQAKNCFHLQYNYVELPDGAMSSRKGNIVPLRELVHRMEDHVKTTYLSRYKGEWSEED VEKIAGQVAKGAIFYGMLRMDTNKKIVFDMNEWLKLDGESGPFVQYSYARISSLGRKFPRTAGAKIDWSR LNHASERQLMQSLGGFNTAVAAAAENFKPSAICTYLYDLAKSFNVFYHECPIGTEADVATREARLALSEA VGLTLKNGLAVLGMPAPEKM