ALIGN_DUALSTATE
This program takes an aligned CLUSTAL sequence file, and finds columns which have exactly two different types of characters. It then computes various output files based on the results.
Enter inputs and click "Run Program" to get started.
The program first finds all columns with exactly two different characters present, and where each of those two characters appears at least twice. The result is saved as the first output file. At the top of the file is listed the total number of character columns.
The second output file converts the first output into a binary format. That is, the majority character is converted to a 0 and the minority character is converted to a 1. At the top of the file is listed the number of rows, followed by the number of character columns.
The third output file is slightly different. It lists, for each row, the number of columns in which that character is the only one of its kind. That is, it lists the number of characters which are unique in a particular column (occurs exactly once in a column).
The second output file converts the first output into a binary format. That is, the majority character is converted to a 0 and the minority character is converted to a 1. At the top of the file is listed the number of rows, followed by the number of character columns.
The third output file is slightly different. It lists, for each row, the number of columns in which that character is the only one of its kind. That is, it lists the number of characters which are unique in a particular column (occurs exactly once in a column).
View sample input/output files. All files should be in a plain-text format (.txt, .csv, .xml, etc.).
Download files related to this app.
Program Results
Example Input File
Concat First Half Nostoc712 PQHLSGNEIRTRFLDFYAQRGHQILASASLVP-EDPTVLLTIAGMLPFKPIFLGQRTPEF Anaba2941 PQHLSGNEIRTRFLDFYAQRGHQILASASLVP-EDPTVLLTIAGMLPFKPIFLGQRTPEF Nosto7310 PQYLSGNEIRNTFLNFFAQRSHQILPSASLVP-EDPTVLLTIAGMLPFKPIFLGQRTPEF SynPCC680 PPVLSGPEIRQQFLNFFADRQHQILPSASLVP-EDPTVLLTIAGMLPFKPIFLGQKSAEF Crocospha TPFLTGQEIRDRFLQFYSQYKHQILPSASLVP-EDPTVLLTIAGMLPFKPIFLGQKQSDF Trichodes AKKLTGNEIRKKFLSFYAQREHTILPSASLVP-EDPTVLLTIAGMLPFKPIFLGQQPRQY SynPCC794 APALSGDQIRETFLKFFEGKGHRRLPSASLIP-EDPTVLLTIAGMLPFKPIFLGQQVAEV SynPCC630 APALSGDQIRETFLKFFEGKGHRRLPSASLIP-EDPTVLLTIAGMLPFKPIFLGQQVAEV SynJA22-1 FPALSGAAIRQTFLDFYAQRGHQVLPSASLVP-EDPTVLLTIAGMLPFKPIFLGQRDPEY SynJA3-3A SPSFSGAAIRQAFLDFYAQRGHQVLPSASLVP-EDPTVLLTIAGMLPFKPIFLGHQDPQY ThermoBP- MTALTGDQIRQKFLDFYAAKGHTILPSASLIP-DDPTVLLTIAGMLPFKPIFLGQEAPKV Gloeobact PTVLNGNAIREKFLQFFESKQHRRLPSASLVP-DDPTVLLTIAGMLPFKPVFMGKRERPA SyneRS991 ARPRSGEQIREAFLAFYEQRGHQRIASASLVP-EDPTVLLTIAGMLPFKPVFLGQQQRPA SyneWH780 ARPRTGAEIREAFLSFFEQRGHRRMPSASLVP-EDPTVLLTIAGMLPFKPIFLGQQERPA SyneCC960 SAPRSGEEIREAFLNFYAERGHKRMASASLIP-DDPTVLLTIAGMLPFKPVFLGQQERPA SyneWH810 SAPRSGEDIREAFLNFYAERGHQRMASASLIP-EDPTVLLTIAGMLPFKPVFLGQQQRPA SyneCC990 SLPQTGAEIRSAFLRFYEERGHKVMASASLIP-EDPTVLLTIAGMLPFKPVFLGQQERPA ProMIT931 SRPRTGSEIRTAFLTFFAERAHQVIPSASLVP-EDPTVLLTIAGMLPFKPVFMGQAERPA SyneWH570 SRPHSGAEIREAFLAFYEARGHRRMASASLVP-DDPTVLLTIAGMLPFKPVFLGQAEPPS ProCMP137 MKSLTGEEIRAAFLNFYAERGHEIVPSASLVP-NDPTVLLTIAGMLPFKPVFLGHQDRPS ProMIT921 SHPLTGEEIRTAFLHFFAERGHQVLPSASLVP-DDPTVLLTIAGMLPFKPVFLGHEERPS ProCMP198 NTPITGDEIRKEFLNFYHEKLHKIIPSASLIP-DDPTVMLTIAGMLPFKPVFLGLKERPS ProMIT931 SRPRTGSEIRTAFLTFFAERAHQVIPSASLVP-EDPTVLLTIAGMLPFKPVFMGQAERPA PromNATL2 PPSLSGDEIRDAFINFFVQHNHKKLASSSLIP-DDPTVLLTIAGMLPFKPIFLGLKESST Staphyloc MKKLKASEIRQKYLDFFVEKGHMVEPSAPLVPIDDDTLLWINSGVATLKKYFDGRETPKK Bsubtilis MKHLTSAEVRQMFLDFFKEKGHAVEPSASLVPHEDPSLLWINSGVATLKKYFDGRVVPEN Nostoc712 KRATTSQKCIRTNDIENVGRTKRHQTFFEMLGNFSFGDYFKEQAIAWGWEIST--EVFGL Anaba2941 KRATTSQKCIRTNDIENVGRTKRHQTFFEMLGNFSFGDYFKEQAIAWGWEIST--EVFGL Nosto7310 KRATTSQKCIRTNDIENVGRTKRHHTFFEMLGNFSFGDYFKEQAIAWGWEIST--QVFGF SynPCC680 PRATTSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKSQAIAWAWELST--QVFKL Crocospha PRATTSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKEQAIKWAWELST--KVYQL Trichodes PRATTSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKPEAIALAWELST--KIFAL SynPCC794 PRATTSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKKEAIAFAWELVT--EVFQV SynPCC630 PRATTSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKKEAIAFAWELVT--EVFQV SynJA22-1 PRVTTAQKCVRTNDIENVGRTARHHTFFEMLGNFSFGDYFKKEAIAWAWELVT--EVFGL SynJA3-3A PRVTTAQKCVRTNDIENVGRTARHHTFFEMLGNFSFGDYFKKEAITWAWELVT--EVFGL ThermoBP- PRATTAQKCLRTNDIENVGRTARHHTFFEMLGNFSFGDYFKGEAIAWAWELMT--TVYGL Gloeobact PRVTTSQKCVRTNDIENVGRTARHHTFFEMLGNFSFGDYFKREAIGWAWELVT--GVFEL SyneRS991 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKEQAIQWAWELST--EVFGL SyneWH780 PCATTSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKQQAIEWAWELST--EVFGL SyneCC960 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKQRAIEWAWELST--GVYGI SyneWH810 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKQQAIEWAWELST--QVYGI SyneCC990 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKQQAIEWAWQLST--EVYGI ProMIT931 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKQQAIEWAWELTT--EVFGL SyneWH570 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKTEAMTWAWELAT--KVFGL ProCMP137 ARVTSCQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKKEAIQWAWELSI--NVFGL ProMIT921 SRVVTSQKCIRTNDIENVGRTARHQTYFEMLGNFSFGDYFKKEAIQWAWELST--KTFGL ProCMP198 KRATSSQKCIRTNDIENVGVTARHHTFFEMLGNFSFGDYFKKEAIEWAWELVT--DIYGL ProMIT931 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKQQAIEWAWELTT--EVFGL PromNATL2 PRATSSQKCIRTNDIENVGRTARHHTFFEMLGNFSFGDYFKKEAIQWAWELST--EVFRL Staphyloc PRIVNSQKAIRTNDIENVGFTARHHTFFEMLGNFSIGDYFKQEAIEFAWEFLTSDKWMGM Bsubtilis PRIVNAQKAIRTNDIENVGKTARHHTFFEMLGNFSIGDYFKEEAITWAWEFLTSDKWIGF Nostoc712 PKERLVVSVFEEDDEAYAIWRDQIGVTEARIKRMGADDNFWVSGPTGPCGPCSEIYYDFH Anaba2941 PKERLVVSVFEEDDEAYVIWRDQIGVTEARIKRMGADDNFWVSGPTGPCGPCSEIYYDFH Nosto7310 SPQNLVVSVFEDDDEAFAIWRDQIGVPVARIKRLGEDDNFWVSGPTGPCGPCSEIYYDFH SynPCC680 PAERLVVSVFEEDDEAFAIWRDEIGIPAHRIQRMGADDNFWVSGPTGPCGPCSEIYYDFH Crocospha PPENIVVSVFENDDEAFKMWEEIIGVPPQRIKRMGEKDNFWKAGPTGPCGPCSELYYDFH Trichodes PPERLVVSVFREDDEAFAIWRDQIGIPAHRIQRMDEADNFWASGPTGPCGPCSEIYYDFH SynPCC794 PAERLAVSVFEEDDEAFAIWRDQIGVPEARIQRLGAKDNFWASGPTGPCGPCSEIYYDFH SynPCC630 PAERLAVSVFEEDDEAFAIWRDQIGVPEARIQRLGAKDNFWASGPTGPCGPCSEIYYDFH SynJA22-1 PPERLVVSVFREDEEAFALWRDEIGIPAHRIQRMGEADNFWAAGPTGPCGPCSEIYYDFK SynJA3-3A PPERLVVSVFREDEEAFALWRDAIGIPPHRIRRMGEEDNFWAAGPTGPCGPCSEIYYDFK ThermoBP- PPERLLVSVFENDDEAYDIWHRQVGLPKERIQRMGEESNFWTAGPTGPCGPCSEIYYDFY Gloeobact PPERLWVSVYEEDEEALVLWQEVAGLPPVRVRKMGADSNFWESGPTGPCGPCSEIYYDFH SyneRS991 SPKNLVVSVFREDDEGEAIWRDRVGVNPKRIIRMDEADNFWASGPTGPCGPCSEIYYDFK SyneWH780 DPKNLVVSVFREDDEAEQIWRDVVGVNPKRIIRMDEEDNFWASGPTGPCGPCSEIYYDFK SyneCC960 DPKNLVVSVFREDDEAELIWRDVVGVNPKRIIRMDEADNFWASGPTGPCGPCSEIYYDFK SyneWH810 DPRNLVVSVFREDDEAEQIWRDVVGVNTKRIIRMDEADNFWASGPTGPCGPCSEIYYDFK SyneCC990 DPKHLVVSVFREDDEAEQIWRDVVGVNPKRIIRMDEADNFWASGPTGPCGPCSEIYYDFK ProMIT931 DPKNLVVSVFREDDEAETIWRDVVGVNPKRIMRMDEADNFWASGPTGPCGPCSEIYYDFK SyneWH570 DPKHLVVSVFRDDDEAAVIWRDTVGVDPRRIVRLGEADNFWVSGPTGPCGPCSEIYYDFK ProCMP137 NPSHIVISIFREDDEAEQIWRDVIGVNPRRIIRMDEKDNFWSSGPTGPCGPCSELYYDFH ProMIT921 DPKYLVVSIFREDDDAYEIWRNIIGVNPDRIIRMDEADNFWSSGPTGPCGPCSELYYDFN ProCMP198 SAENIIVSVFHEDDDSVKIWKEDIGIHPKRIIKLGEKDNFWSSGKTGPCGPCSELYFDFK ProMIT931 DPKNLVVSVFREDDEAETIWRDVVGVNPKRIMRMDEADNFWASGPTGPCGPCSEIYYDFK PromNATL2 NPQNIVISVFKEDLEAEQIWKEVVGVDANRIIRMGAADNFWSSGATGPCGPCSELYFDFK Staphyloc EPDKLYVTIHPEDMEAYNIWHKDIGLEESRIIRIEG--NFWDIG-EGPSGPNTEIFYDRG Bsubtilis DKELLSVTVHPEDEEAYEFWAKKIGIPEERIIRLEG--NFWDIG-EGPSGPNTEIFYDRG Nostoc712 PERGDDNIDLE-----DDTRFIEFYNLVFMQYNRDASGNLTPLQNKNIDTGMGLERITQI Anaba2941 PERGDENIDLE-----DDTRFIEFYNLVFMQYNRDASGNLTPLQNKNIDTGMGLERITQI Nosto7310 PERGDENIDLE-----DDSRFIEFYNLVFMQYNRDVLGNLTPLQNKNIDTGMGLERMAQI SynPCC680 PELGDEKLDLE-----DDSRFIEFYNLVFMQYNRDNAGNLTPLEKKNIDTGMGLERMAQI Crocospha PELGDDDIDLE-----DDSRFIEFYNLVFMEYNRDADGKLTPLQNKNIDTGMGLERMAQI Trichodes PELGDDHIDLE-----DDTRFIEFYNLVFMQYNRDANGNLTPLENRNIDTGMGLERMAQI SynPCC794 PELGNDGLDLE-----DDSRFIEVYNLVFMQYNRDAAGNLTALEKQNIDTGMGLERMAQV SynPCC630 PELGNDGLDLE-----DDSRFIEVYNLVFMQYNRDAAGNLTALEKQNIDTGMGLERMAQV SynJA22-1 PELGDERIDLE-----DDSRFLEIYNLVFMELNRDSEGHLMPLAKQNIDTGLGLERLAQV SynJA3-3A PELGDEQIDLG-----DDSRFLEIYNLVFMELNRDSEGRLTPLARQNIDTGLGLERLAQV ThermoBP- PEKGLANVDLD-----DDGRFIELYNLVFMELNQDDQGHRTPLKAKNIDTGMGLERMAQV Gloeobact PERGEGDIDLD-----DDSRFLEIYNLVFMQLNRDSEGTFSELPRKNIDTGLGLERMAQV SyneRS991 PELGDDGIDLE-----DDSRFIEFYNLVFMQYNRDAEGTLTPLENRNIDTGMGLERMAQI SyneWH780 PELGDEGIDLE-----DDDRFIEFYNLVFMQSNRDAQGVLTPLANRNIDTGMGLERMAQI SyneCC960 PELGDEDIDLE-----DDDRFIEFYNLVFMQYNRDAEGTLTPLANRNIDTGMGLERMAQI SyneWH810 PELGDDGIDLE-----DDDRFIEFYNLVFMQYNRDAEGNLTPLANRNIDTGMGLERMAQI SyneCC990 PELGDEGIDLE-----DDDRFIEFYNLVFMQYNRDAEGSLTPLANRNIDTGLGLERMAQI ProMIT931 PDLGNDDIDLE-----DDGRFVEFYNLVFMQYNRDGEGNLTPLANRNIDTGMGLERMAQI SyneWH570 PELGVEHLDLE-----DDSRFIEFYNLVFMQFNRDAEGNLTPLENRNIDTGLGLERMAQI ProCMP137 PELGEEEIDLE-----DDTRFIEFYNLVFMQYNRNSSGTLTSLANCNIDTGMGLERMAQI ProMIT921 PELGNHCIDLE-----DDTRFIEFYNLVFMEFNRDSTGRLSSLSNCNIDTGMGLERMAQI ProCMP198 PEKGVQNIDLE-----DGDRFIEFYNLVFMQYNRDPDGQLTDLKYKNIDTGMGLERMAQI ProMIT931 PDLGNDDIDLE-----DDGRFVEFYNLVFMQYNRDGEGNLTPLANRNIDTGMGLERMAQI PromNATL2 PELGSDEIDLE-----DDSRFIEFYNLVFMQYNRDLKGNLEPLANCHIDTGMGLERMAQI Staphyloc EAYGQDDPAEEMYPGGENERYLEVWNLVFSEFNHNKDHSYTPLPNKNIDTGMGLERMASV Bsubtilis EAYGNDPEDPELYPGGENDRYLEVWNLVFSEFNHNPDGTYTPLPKKNIDTGMGLERMVSV Nostoc712 LQRVPNNYETDLIFPIIETAAKIAGIDYHSSDESTKVSLKVIGDHVRSVVHMIADEIRAS Anaba2941 LQQVPNNYETDLIFPIIETAAKIAGIDYHSSDETTKVSLKVIGDHVRSVVHMIADEIRAS Nosto7310 LQKVPNNYETDLIFPIIQTAAQIAGIDYHSSDEKTKVSLKVIGDHVRSVVHMIADEIRAS SynPCC680 LQKVPNNYETDLIFPIIQTAANIAGIDYAQANEKTKVSLKVIGDHVRSVVHMIADGISAS Crocospha LQKVPNNYETDLIFPIIETAANIANINYQKATEKTKVSLKVIGDHVRSVVHMISDNITAS Trichodes LQKVPNNYETDLIFPIIKTASDIADIDYRKSDDKTKVSLKVIGDHVRAVANLIADGVTAS SynPCC794 LQGVPNNYETDLIFPIIQAVAAIAQRDYASESESVKVSLKVIGDHLRAVTHLIADGVTAS SynPCC630 LQGVPNNYETDLIFPIIQAVAAIAQRDYASESESVKVSLKVIGDHLRAVTHLIADGVTAS SynJA22-1 LQGVPNNYETDLIFPIIQTAAGIAGLNYSKANPEQQISLKVIGDHARAVMHLIADGVLPS SynJA3-3A LQGVPNNYETDLIFPIVQKAAEIARVDYFQASPEQKVSLKVIGDHARAVMHLIADGVIPS ThermoBP- LQGVPNNYETDLIFPIIEAAAQRAGIQYQKANASTQTSLKVIGDHTRAVVHLIADGVTAS Gloeobact LQGVPNNYETDLVFPIIRKAEVLSGRDYFKADDQQKTSFKIIGDHTRAVVHLVADGVLPG SyneRS991 LQAVPNNYETDLIYPLIETAAQLAGVAYPALDGKGKTSLKVIGDHSRAITQLIGDGVTAS SyneWH780 LQKVPNNYETDLIFPLIQAAADRAGVDYYQLDEKGKTSLKVIGDHSRAVTQLISDGVTAS SyneCC960 LQKVPNNYETDLIFPLIQAAADLAGVDYHQLNDKGKTSLKVIGDHSRAVTQLICDGVTAS SyneWH810 LQKVPNNYETDLIFPLIQAAADLACVDYAQLDDKGKTSLKVIGDHSRAVTQLICDGVTAS SyneCC990 LQKVPNNYETDLIFPLIQAAADCAGVDYHQLDDAGKTSLKVIGDHSRAVTQLICDGVTAS ProMIT931 LQGVPNNYETDIIYPLIETAAGLAGLDYQKLDEKGKTSFKVIGDHCRAITHLICDGVTAS SyneWH570 LQGAANNYETDLIYPLIETAATLAGVNYRQLDARGQTSLKVIGDHSRAITQLIADGVTAS ProCMP137 LQGVANNYETDLIYPLLEKIASLIDVQYEDLNATIKASFKVIGDHTRACVHLIGDGVSAS ProMIT921 LQKVPNNYETDLIYPLLEKVAYLVGVDYQKTDKKTRISYKVIGDHIRACVQLISDGVSAS ProCMP198 LQKKKNNYETDLIFPIIQKASEISKIDYYSSGERTKISLKIIGDHIRAVIHLISDGVIAS ProMIT931 LQGVPNNYETDIIYPLIETAAGLAGLDYQKLDEKGKTSFKVIGDHCRAITHLICDGVTAS PromNATL2 LQKKSNNYETDLIFPLINAAALLAQIKYETTNKKNKTSLKIIGDHCRAVTHLICDGVSAS Staphyloc SQNVRTNYETDLFMPIMNEIEKVSGKQYLVNNEQ-DVAFKVIADHIRTIAFAISDGALPA Bsubtilis IQNVPTNFDTDLFVPIIKATESISGETYGKDNVK-DTAFKVIADHIRTVAFAVSDGALPS Nostoc712 NVGRGYVLRRLIRRVVRHGRLIGISGEFINQVAETAIALSESAYPNVRQRETVIKAELER Anaba2941 NVGRGYVLRRLIRRVVRHGRLIGISGEFINQVAERAIALSESAYPNVRKRETVIKAELER Nosto7310 NVGRGYVLRRLIRRVVRHGRLIGISGEFTTQVAESAIALSESAYPNVRQREAAIKAELQR SynPCC680 NLGRGYVLRRLIRRVVRHGRLLGINGEFTTKVAATAVQLAQPVYPNVLERQSLIEQELQR Crocospha NTDRGYVLRRLIRRVVRHGRLIGIEGKFINEVAETAIQLSQKAYPQVRERDSFIKAELER Trichodes NVGRGYILRRLIRRVVRHGRLIGISGEFTSKVAETAITLAEDIYPNLRERETVIKAELQR SynPCC794 NLGRGYVLRRLIRRVVRHGRLIGIDRPFTAEIAETAIALMAAQYPNLREREAAIKAELTR SynPCC630 NLGRGYVLRRLIRRVVRHGRLIGIDRPFTAEIAETAIALMAAQYPNLREREAAIKAELTR SynJA22-1 NVDRGYVLRRLIRRMVRHGRLLGIGEPFTIPVIETAIQLAEAAYPQVRERETLIKTELQR SynJA3-3A NVDRGYVLRRLIRRMVRHGRLLGIGEPFTLPVVETAIQLAEAAYPEVREREAVIKAELQR ThermoBP- NVGRGYVLRRLIRRIVRHSRLLGINGLVTPDLAQVAIDLAANVYPNVRERQAVILSELQR Gloeobact NLGRGYILRRLIRRMVRHGRMLGIAGSFLPQMATVAVELMGGAYPNLLESRELILTRLAT SyneRS991 NLGRGYILRRLLRRVVRHGRLLGIDKPFLTAMGEASIALMQSAYPQLLERREVILAELQR SyneWH780 NLGRGYILRRLLRRVVRHGRLLGIDKPFLQAMGEASIALMQSAHPQLIERREVILAELQR SyneCC960 NLGRGYILRRLLRRVVRHGRLLGIDKPFLVTMGEAAIALLKGAHPSVIERQEVILAELQR SyneWH810 NLGRGYILRRLLRRVVRHGRLLGIDKPFLVTMGEAAIDLLKGAHPGVIERQEVILAELQR SyneCC990 NLGRGYILRRLLRRVVRHGRLLGINKPFLVMMGEASIALLKDAHPSVLERQEVILAELQR ProMIT931 NLGRGYIMRRLLRRVVRHGRLVGIEKPFLQAMGEAAIALMVEAYPQLEERRKLILAELNR SyneWH570 NLGRGYILRRLLRRVVRHGRLLGITTPFLTAMGEAAIALMVDAYPQLVERRDAIMAELAR ProCMP137 NLGRGYVLRRLLRRVVRHGRLIGITKPFLVQIAEVAIELMQSAYPQLLERRQLIFKELQR ProMIT921 NLGRGYILRRLLRRIVRHGRLLGIPKPFLIDLGEVAISLMKSTYPQLLERRDVILKELQR ProCMP198 NLGRGYILRRLIRRMVRHGRLLGLKNEFLSKLASVGIKLMQENYPDLKNNCDHILSEIKI ProMIT931 NLGRGYIMRRLLRRVVRHGRLVGIEKPFLQAMGEAAIALMVEAYPQLEERRKLILAELNR PromNATL2 NLGRGYILRRLIRRMIRHGRLVGIIQPFLPQLAEIAIELMKNAYPQLLEKKKIILNELKI Staphyloc NEGRGYVLRRLLRRAVRFSQTLGINEPFMYKLVDIVADIMEPYYPNVKEKADFIKRVIKS Bsubtilis NEGRGYVLRRLLRRAVRYAKTINIHRPFMFDLVPVVAEIMADFYPEVKEKADFIAKVIKT Nostoc712 EEANFLKTLDRGEKLLAEIIAEVKTVISGKDAFTLYDTHGFPLELTQEIAEENNLTVDVE Anaba2941 EEANFLKTLDRGEKLLEEIIQQVQTIISGESAFTLYDTYGFPLELTQEIAEENNLTVDED Nosto7310 EESNFLRTLDRGEKLLEEIIQEVSTQISGESAFTLYDTYGFPLELTQEVAEENNLTVDAE SynPCC680 EEAAFLKTLERGEKLLADLMADGVTEIAGADAFTLYDTFGFPLELTQEIAEEQGITVDVE Crocospha EEATFLKTLERGEKLLAEIISKTQGQISGVDAFTLYDTFGFPLELTQEIAEESNLTVDIE Trichodes EESRFLETLERGEKLLAEIMAKPTQHISGEDAFKLYDTYGFPLELTQEIAEENGLTVDVS SynPCC794 EEQRFLETLERGEKLLAELLAAATDQIRGEDAFVLYDTYGFPLELTQEIAEEKGLTVDLA SynPCC630 EEQRFLETLERGEKLLAELLAAATDQIRGEDAFVLYDTYGFPLELTQEIAEEKGLTVDLA SynJA22-1 EEEQFLKTLERGERLLLDLFTAVPKQISGADAFKLFDTYGFPLELTQEIAQEHGFSVDLE SynJA3-3A EEEQFLKTLERGERLLFDLFANVPKQISGADAFKLFDTYGFPLELTQEIAQERGFSVDVQ ThermoBP- EEEQFLKTLDRGEKLLAEMLSPLQPQLAGRDAFVLFDTYGFPLELTQEIAAEQGIGVDVA Gloeobact EESQFLQTLETGERLLDEILGRSEKRISGVDAFMLYDTYGFPLELTMEVAAERGLEVDVA SyneRS991 EEARFLETLERGEKLLADVLAAKPKQISGAQAFELYDTYGFPLELTEEIAEEHGLTVDLD SyneWH780 EEARFLETLERGEKLLADVLAAKPKQISGEQAFELYDTYGFPLELTQEIAEEHGLAVDLA SyneCC960 EEARFLETLERGEKLLAEVLASKPTQISGAQAFELYDTYGFPLELTQEIAEEQGLAVDLD SyneWH810 EEARFLETLERGEKLLAEVLASRPTQISGAQAFELYDTYGFPLELTQEIAEEQRITVDLD SyneCC990 EESRFLETLERGEKLLSEVLDSKPKQISGAQAFELYDTYGFPLELTQEIAEEQGLEVDLA ProMIT931 EEARFLETLERGEKVLADVLVANPQMISGGQAFELYDTYGFPLELTQEIAEEHGLTVDLQ SyneWH570 EEARFLETLERGEKLLAEVLSAGPSQISGEQAFELYDTYGFPLELTEEIAEEHGLAVDLA ProCMP137 EETRFLETLEKGEKLLAELLSKAPSVITGEAAFELYDTYGFPVELTEEIAEENDLRVDMK ProMIT921 EELRFLETLERGERLLADLLSNNPKEISGEQAFELYDTYGFPLELTQEIAEENSIEVDLK ProCMP198 EEIRFRETLERGEKLLDELISSGQKMITGFKAFELYDTYGFPLELTEEIAQENNIGVDVK ProMIT931 EEARFLETLERGEKVLADVLVANPQMISGGQAFELYDTYGFPLELTQEIAEEHGLTVDLQ PromNATL2 EESRFLETLERGEKLLAEITSHECDLISGAQAFELYDTYGFPLELTEEIANEKGISVDIN Staphyloc EEERFHETLEDGLAILNELIKKATNEINGKDAFKLYDTYGFPIELTEEIAVQAGLKVDMT Bsubtilis EEERFHETLNEGLAILSEMIKKESSVISGADVFKLYDTYGFPVELTEEYAEDENMTVDHE Nostoc712 GFQKQMEIQQQGGRGAHETIDLTVQGSLDKLAEHIHATEFIGYSQATATAKVEVLLVDGV Anaba2941 DFNVEMQKQVERAKAAHETIDLTVQGSLDKLAEHIHATEFIGYSQATATAKVEVLLVDGV Nosto7310 GFEAQMEIQKGRGRDAHETIDLTVQGSLDKLAEHIQVTEFLGYTQSAATAIIEAILLEGV SynPCC680 GFEKAMQEQQERSKAAHETIDLTVQESLDKLANHIHPTEFLGYTDLQSSAIVKAVLVGGE Crocospha GFEVEMEQQQERSKAAHETIDLTVQGSLDKLAEHIHPTEFLGYTDLQSTAKVEAVLIEGK Trichodes MFDQEMKLAQIRSQSAHETIDLTAQ---DGVKLDIDKTQFLGYTDLSSPAQVMALVGDGE SynPCC794 GFEAAMAAQRQRSQAAHETIDLTVQGSLDRLAEQIHPTEFVGYGDAVATAKVTALLREGQ SynPCC630 GFEAAMAAQRQRSQAAHETIDLTVQGSLDRLAEQIHPSEFVGYGDAVATAKVTALLREGQ SynJA22-1 GFEQEMEKQRQRARAAHQTIDLTAQGSLDELADFLSQTEFLGYSQSSARGVVEALLVEGK SynJA3-3A GFEQEMEKQRQRARAAHQTLDVTAQGSLDELAEFLIETEFLGYSQSSARGVVEALLVEGK ThermoBP- EFEACMAEQRQRSQAAHETIDVTVQEGIDSLGDQLHPTQFRGYEELSLTTTVTAILVAGH Gloeobact AYEEAMEKQRERARRAAKTVDLTQGASLADLER----TDFLGYRNVASKAMVKLVLFEGE SyneRS991 GFEAAMEAQRQRAKAAAVSIDLTLQEAIDQVAADLEATAFRGYELLEQSSCVVALVVNGE SyneWH780 GFETAMEQQRQRAKAAAVSIDLTLQDAIDQVAAGLQDTEFRGYEQLEQSSSIQALVVNGE SyneCC960 GFEAAMEEQRQRAKAAAVSIDLTLQDAIDQVVADQAATCFEGYEALDHASCVQALVVNGG SyneWH810 GFEVAMEEQRQRAKAAAVSIDLTLQDAIEQVAGDQPATAFKGYDALDHPSTVQALVVNGA SyneCC990 GFEEAMEQQRQRAKAAAVSIDLTLQDAIDQVVATQKATAFQGYQRLEQPSCVQALVANGE ProMIT931 GFEQAMDQQRQRAKAAAVSIDLTLQGAIEQMAAELEATRFQGYEVLEQPCCVLALVVNGE SyneWH570 GFEAAMEAQRQRAKAASVRLDLTLQGAIEAMAEQLPATDFRGYEALEHPSQVMALVVNGE ProCMP137 GFKKAMDEQRRRAKSAAMTIDLTLQDTIDKVVSEVGETNFLGYQQLEQFSQVQAIVVNGV ProMIT921 AFEKAMQRQRIRAKAASTTIDLTLQDTLDKAVRELKPTNFKGYECLTETSTVQAIFINGE ProCMP198 GFDKEMSAQKERAKAASQIIDLTLEGSLEREIDLFDKTLFNGYDSLDSDAEIKGIFLEST ProMIT931 GFEQAMDQQRQRAKAAAVSIDLTLQGAIEQMAAELEATRFQGYEVLEQPCCVLALVVNGE PromNATL2 GFENEMAKQRKRAKEASVSIDLTEEGSIEREISLFDDTRFEGYEKLETTSTVIGIFKNNE Staphyloc TFESEMQQQRDRARQARQNS-QSMQVQSEVLKNITSASTFVGYDTATAQTTLTHLIYNGE Bsubtilis GFEEEMNQQRERARNARQDV-GSMQVQGGALRDVTVESTFVGYSQTKADANIIVLLQDGQ Nostoc712 VQEEAEAGTEVQIVLDETPFYAESGGQIGDRGYISGD--------GIVVQVEDVKKE-SD Anaba2941 VQEEAEAGTEVQIVLDKTPFYAESGGQIGDRGYISGD--------GIVVQVEDVKKE-SD Nosto7310 SQEEAEAGTQVQIVLDKTPFYAESGGQIGDRGYISGD--------GIVVRVEDVKKE-SD SynPCC680 LVDQAVAGQTVQIVLDQTPFYGESGGQIGDKGFLNGD--------NLLIRIEDVKRE-SG Crocospha SVETATAGTEIQIVLDKTPFYGESGGQIGDRGFLTGE--------DLVIRVDDVQKE-SG Trichodes QLETVQAGAQVQIVLDKTPFYAESGGQIADRGYLSGD--------SLVVRIEDVQKQ-NN SynPCC794 SVEAAEAGDRVQIVLDHTPFYAESGGQVGDRGVLTGE--------SLIVRIEDVQKE-SG SynPCC630 SVEAAEAGDRVQIVLDHTPFYAESGGQVGDRGVLTGE--------SLIVRIEDVQKE-SG SynJA22-1 SVPQVEAGQAVQVVLDRTPFYAEAGGQIGDRGYLSGD--------GLLVRIEDVQKQ-GD SynJA3-3A SVPQVEAGQSVQVVLDRTPFYAESGGQIGDRGYLAGD--------GVLVRVEDVQKR-GD ThermoBP- PATTATAGTEVQVILEATPFYAESGGQIGDRGYLASS--------DALVHIHDVQKQ-KE Gloeobact PIERASAGASVQLVLDQTPFYPEGGGQIGDRGYLSGP--------DLLVRIEDVQKR-DT SyneRS991 PAQRAVAGDSVQVVLDTTPFYGEGGGQVGDRGVLSGDGP---DGHGLIVAIEAVSRN-RS SyneWH780 PAQSASAGDAVQVVLDVTPFYGEGGGQIGDCGTLVADGQ---AGDGLIVMVESVSRN-RS SyneCC960 PSTTAKAGDAVQVVLDTTPFYGEGGGQVGDRGVLA--------GSDLIVRIESVSRS-RD SyneWH810 PASTASAGDVVQVVLDSTPFYGEGGGQVGDRGSLS--------GVDVIVAIDSVSRS-RD SyneCC990 PATSASAGDHVQIVLESTPFYGEGGGQVGDRGVLS--------GADVIVAIESVSRS-RD ProMIT931 SAERASAGDSVQIVLDTTPFYGESGGQVGDHGVLSGEGS---GGNGVIVTVDDVSRH-RN SyneWH570 PAQGARAGDAVQIVLDSTPFYGESGGQIGDRGVLAGGAPGAGAEAGVIVRIEAVSHQ-RK ProCMP137 SSQECNVGDKIDIVLNMTPFYGEGGGQIGDRGIISSAS---SDDSECLIEIDSVRRV-KG ProMIT921 LAQEAQENDVIQVALDITPFYGESGGQIGDTGILFKES---VTN--CLIEIDSVIRN-ND ProCMP198 LVKQASEGQKVLIVLDQTSFYGESGGQVGDIGTILS--------NDLEVVVDNVIRK-KN ProMIT931 SAERASAGDSVQIVLDTTPFYGESGGQVGDHGVLSGEGS---GGNGVIVTVDDVSRH-RN PromNATL2 SVKQAVQGDLVKIIVNRTPFYAESGGQIGDKGLITS--------QDLEVSVENVRKK-KN Staphyloc EVSQVEAGETVYFMLTETPFYAVSGGQVADTGIVYND--------NFEIAVSEVTKAPNG Bsubtilis LIEEAHEGESVQIILDETPFYAESGGQIGDKGYLRSE--------QAVVRIKDVQKAPNG Nostoc712 FFVHFGRIERGTLRVGDNVTAQIDRAGRRRAQANHTATHLLQAALKTIVDGGISQAGSLV Anaba2941 FFVHFGRIERGTLRVGDSVTAQIDRAGRRRAQANHTATHLLQAALKTIVDGGISQAGSLV Nosto7310 FFVHFGRIERGTLRVGDRVTAQIDPACRRRAQANHTATHLLQAALKKIVDDGISQAGSLV SynPCC680 IFIHFGRVERGTVQIGTTITATIDRACRRRAQANHTATHLLQSALKRVVDEGISQAGSLV Crocospha IFVHFCRVERGQVSVNEHLKATIDRSCRRRVQANHTATHLFTAALKKVVDDSISQAGSLG Trichodes IFVHFGRIERGRLQLGMTVNAQIDGTCRRRAQANHTATHLLQAALRSLVDSSISQAGSLV SynPCC794 FFVHYGQIERGLLQVGDSVTAQIDRACRRRAQANHTATHLLQAALKLIVDEGISQAGSLV SynPCC630 FFVHYGQIERGLLQVGDSVTAQIDRACRRRAQANHTATHLLQAALKLIVDEGISQAGSLV SynJA22-1 LFVHFGRVERGILRVGDPVQAQIDLACRRRAQAHHTATHLLQAALKRIVDPSIGQAGSLV SynJA3-3A LFVHFGWVERGILRVGDPVQAQIDLACRRRAQAHHTATHLLQAALKKVVDPSISQAGSLV ThermoBP- LFVHYGKVERGSLKVGDRVSAQIDLSCRRRVQAHHTATHLLQAALKKLIDENISQAGSLV Gloeobact VILHTGRIERGTVATGDSVQAQIDLADRRRSQAHHTATHLLQAALKKVLGESVQQQGSLV SyneRS991 VFVHSGRIERGVLSLGDVVHGQVDRACRRRAQANHTATHLLQAALKQVVDPGIGQAGSLV SyneWH780 VFVHSGRVERGALSVGDVVHGRVDRACRRRAQANHTATHLLQAALKQVVDPGIGQAGSLV SyneCC960 VFVHAGRVERGELALGDTVKAQVDRACRRRAQANHTATHLLQAALKQVVDPGIGQAGSLV SyneWH810 VFVHSGRMERGHLAVGDTVNAQVDRSCRRRAQANHTATHLLQAALKQVVDPGIGQAGSLV SyneCC990 VFVHAGRIERGQLTVGDAITAQVDRACRRRAQANHTATHLLQAALKQVVDPGIGQAGSLV ProMIT931 VFVHFGRIERGTLALGDLVNAQVDRACRRRAQANHTATHLLQAALKQVVDSGIGQAGSLV SyneWH570 LIVHHGRIERGELALGDTVTALVDRACRRRVQAHHTATHLLQAALKQLVDPSIAQAGSLV ProCMP137 AFVHSGLVKNGVLTLGDNVQCTVDSFSRRCAQANHTATHLLQAALKKAVDSDITQAGSLV ProMIT921 VFVHRGIVKNGRLQVGDIIQSQVHHINRRRAQINHTATHLLQASLKEIVGSEISQAGSLV ProCMP198 VFLHYGIVKKGILSLGQKVKTKVNDLARAKAAANHTATHLLQSALKVVVNESVGQKGSLV ProMIT931 VFVHFGRIERGTLALGDLVNAQVDRACRRRAQANHTATHLLQAALKQVVDSGIGQAGSLV PromNATL2 IFIHSGIVNTGVLEINSSVQMNVTPSFRQRTTSNHTATHLLQSALKLSIDSSVSQRGSLV Staphyloc QNLHKGVVQFGQVNVGATVSAEVNQNDRRDIQKNHSATHLLHAALKSVLGDHVNQAGSLV Bsubtilis QHVHEGVVESGTVQKGLHVTAEVEDHMRSGVIKNHTATHLLHQALKDVLGTHVNQAGSLV Nostoc712 SFDRLRFDFNSPRGLTVEEIQQVEEQINTWIAEAHSAKIELLPLAEAKARGAVAMFGEKY Anaba2941 SFDRLRFDFNSPRGLTAEEIQQVEEQINTWIAEAHSAKIEVLPLAEAKARGAVAMFGEKY Nosto7310 SFDRLRFDFNCPRALTAEEVQQVEEQVNSWIAEAHAAKVEVLPLAEAKAKGAVAMFGEKY SynPCC680 DFNRLRFDFNSPRAVTMEELQQIEDLINQWIAEAHQTEVAVMPIADAKAKGAIAMFGEKY Crocospha DFDRLKFDL--------------------------------------------------- Trichodes SFDRLRFDFNCPRGLKPEEVEQVEAQVNSWIAEAHSATVAEMPLEVAKAKGAVAMFGEKY SynPCC794 AFDRLRFDFNCPRAVTPEELRQIEDQINQWIAEAHGTVVEVMPIATAKAKGAVAMFGEKY SynPCC630 AFDRLRFDFNCPRAVTPEELRQIEDQINQWIAEAHGTVVEVMPIATAKAKGAVAMFGEKY SynJA22-1 AFDRLRFDFTLSRPVTPEELEQIENLVNTWIAEAHAAQVAIMPLAEAKARGAVAMFGEKY SynJA3-3A AFDRLRFDFTLSRPLTPEELQQVEDLVNTWIAEAHPAQVSIMPLAEAKARGAIAMFGEKY ThermoBP- AFDRLRFDFNCPRPLTREELQQIEDQINAWISESHTTHTYIMALSEAKAKGAIAMFGEKY Gloeobact AFDRLRFDFSWPKALSQAEIQQVEDLVNTWIAEAHTLSQEVMPIDQAKARGALAFFGEKY SyneRS991 DFDRLRFDFHCPRAVTAAELEQIEALINGWISEAHSLEVQEMAIETAKAAGAVAMFGEKY SyneWH780 SFDRLRFDFHCPRAVTTEELGRIESLINGWIADAHALEVQEMAIEQAKAAGAVAMFGEKY SyneCC960 DFDRLRFDFHCPTAVTPDQLQQVETLINGWINEAHALQVQEMAIDQAKAAGAVAMFGEKY SyneWH810 DFDRLRFDFHCPTAVTAEQLAQIETLINGWIAEAHCLEVQEMAIDQAKAAGAVAMFGEKY SyneCC990 DFDRLRFDFHAPQAVTTEQLGQIETLINGWIAEAHGLLVEEMAIDQAKAAGAVAMFGEKY ProMIT931 DFDRLRFDFHCARAVTAKELEQIEALINGWIMESHDLIVEEMSIQEAKAAGAVAMFGEKY SyneWH570 DFERLRFDFHCPHAISPEDLERIEERINGWIADAHALEVREMELERARSAGAVAMFGEKY ProCMP137 DFDRLRFDFHFVRPVSGAELEHIEKLINGWISEAHSLVISEMSINEAKRVGAIAMFGEKY ProMIT921 SFERLRFDFHCSSPVSSEELEKVEKKINLWISESHSLVVKEMKIDDAKQAGAVAMFGEKY ProCMP198 AFNKLRFDFNSSKPITKDQIFKVETLVNSWILENHSLNIKNMAKSEALERGAVAMFGEKY ProMIT931 DFDRLRFDFHCARAVTAKELEQIEALINGWIMESHDLIVEEMSIQEAKAAGAVAMFGEKY PromNATL2 SNHRLRFDFNAPKPLTIKELEDMEARINQWINEDHLIQIKTMPIKEAMAAGALAMFGEKY Staphyloc EADRLRFDFSHFGPMTNDEIDQVERLVNEEIWKGIDVNIQEMDIASAKEMGAMALFGEKY Bsubtilis TENRLRFDFSHFGQVTKEELEQIERIVNEKIWASIPVSIDLKPIAEAKEMGAMALFGEKY Nostoc712 GDEVRVIDFPGVSMELCGGTHVSNTAEIGVFKIISEAGVASGVRRIEAVSGLAVLDYLNV Anaba2941 GDEVRVIDFPGVSMELCGGTHVSNTAEIGVFKIISEAGVASGVRRIEAVSGLAVLDYLNV Nosto7310 GDEVRVIDFPNVSMELCGGTHVSNTAEIGVFKIISEAGVASGVRRIEAVSGPAILDYLNL SynPCC680 GAEVRVIDVPGVSLELCGGTHVANTAEIGLFKIVAETGIAAGVRRIEAVAGPSVLDYLNV Crocospha -------------IAFCSN--------------------IRGVTRIWGFN---------- Trichodes ADVVRVVDYPGVSMELCGGTHVNNTAEIGVFKIISEAGISSGVRRIEAVAGLAVLDYLKV SynPCC794 GAEVRVIDVPGVSMELCGGTHVANTAEIGLFKIISEAGVASGVRRIEAVAGPAVLEYLNV SynPCC630 GAEVRVIDVPGVSMELCGGTHVANTAEIGLFKIISEAGVASGVRRIEAVAGPAVLEYLNV SynJA22-1 GAEVRVIDFPGVSMELCGGTHVNNTAEIGLFKIIAETGVAAGIRRIEAVAGPAVLEYLNE SynJA3-3A GAEVRVVDFPGVSMELCGGTHVSNTAEIGLFKIISESGVAAGIRRIEAVAGPAVLEYLNE ThermoBP- GEQVRVLDIPGVSMELCGGTHVHNTAEIGLFKIISESGVAAGIRRIEAIAGAAVRDYLQQ Gloeobact GSEVRVIDVPGVSIELCGGTHVRNTAEIGLFKMVSETGIASGVRRIEAVAGPAVLEYLRL SyneRS991 ADVVRVVDVPGVSMELCGGTHVANTAEIGLFKIVSESGVAAGIRRIEAVAGPAVLAYLNE SyneWH780 ADVVRVVDVPGVSMELCGGTHVRNTAEIGLFKIVSESGVAAGIRRIEAVAGAAVLPYLNE SyneCC960 ADVVRVVDVPGVSMELCGGTHVTNTAEIGLFKIVAESGVAAGIRRIEAVAGPAVLAYLNE SyneWH810 ADVVRVVDVPGVSMELCGGTHVANTAEIGLFKIVAESGVAAGIRRIEAVAGPAVLAYLNE SyneCC990 ADVVRVVDVPGVSMELCGGTHVANTAEIGLFKIVGESGVAAGIRRIEAVAGASVLGYLNE ProMIT931 ADVVRVVDVPGVSMELCGGTHVANTAEIGLFKIVAESSVAAGIRRIEAVAGPAVLAYLNE SyneWH570 ADIVRVVDVPGVSMELCGGTHVANTAEIGLFRIVSESGVAAGIRRIEAVAGPAVLDYLKE ProCMP137 GEVVRVVDVPGVSKELCGGTHVTNTAEIGLFKIVSETGIAAGIRRIEAIAGQGVLDYLND ProMIT921 GTLVRVVDVPGVSMELCGGTHVANTADIGAFKIVGESGIAAGIRRIEAVAGPGVFDYFNA ProCMP198 DDEVRVVDVPSVSMELCGGTHVKTTSELGCFKIISEEGISAGVRRIEALSGQSAFEYFSD ProMIT931 ADVVRVVDVPGVSMELCGGTHVANTAEIGLFKIVAESSVAAGIRRIEAVAGPAVLAYLNE PromNATL2 GDVVRVVDVPGISMELCGGTHVTRTSQLGTFKIINETGIASGIRRIEAIAGPSVLDYFNE Staphyloc GDVVRVVNMAPFSIELCGGIHVRNTSEIGLFKIVSESGTGAGVRRIEALTGKAAFLYLED Bsubtilis GDIVRVVQVGDYSLELCGGCHVRNTAEIGLFKIVSESGIGAGTRRIEAVTGQGAYVEMNS Nostoc712 RDKVVKDLSDRFKVK-PEELPERITTLQNELRTTEKQLETLKGQLAIAKSDSLLQTADTC Anaba2941 RDKVVKDLSDRFKVK-PEELPERITTLQNELRTTEKQLETLKGQLAIAKSDSLLQTADTL Nosto7310 RDKVVKDLSDRFKVK-PEELPDRITSLQSELRNSQKELETLKVQLAIAKSDSLLQTAETV SynPCC680 REAVVKELGDRLKAK-PEEIPDRVHQLQQELKASQKQLEALKQELALQKSEQLLTQAQTV Crocospha ------------------------------------------------------------ Trichodes RDAVVKELSDRFKAK-PEELSERVSNLQQELKDSQKQLEALKGELAVAKSDQLLGNAETV SynPCC794 RDAVVRDLSDRFKAK-PEELSDRVTALQEELKANQKQLTALKAELAIAKSDALVSQAIPV SynPCC630 RDAVVRDLSDRFKAK-PEELSDRVTALQEELKANQKQLTALKAELAIAKSDALVSQAIPV SynJA22-1 RDRVVRELSAQFKAK-PQELPERVAALQAELKAAQKELEEVRSQLALLQAEGLLAQAVAV SynJA3-3A RDSVVRELSAQFKAK-PQEIPERVAALQAELKAAQRALEEARSQLALLQAERLLPQAVAV ThermoBP- RDSIVRELCDRFKAK-PEEILDRISQLQADLKAQQKALEHLKAELALAKTQALLEQAKPV Gloeobact RDTVTRQLADEFKVQ-VEQIPQRVAALTSDLKQAQKQIDSLKAALATARAEALLGQAETT SyneRS991 RDAVVKQLGERFKAQ-PAEIVDRVVQLADELKASQKALVAAREELALAKSAALAGQAVAV SyneWH780 RDAVVKQLGERFKAQ-PGEIIERVSALQDELKATGKALAAAQAELAVAKSAALAAQAVAV SyneCC960 RDVVVKQLGDRFKAQ-PAEIVDRVAALQEELKATGKALAAAQAELAVAKAGALAAKAEAV SyneWH810 RDAVVKQLGDRFKAQ-PAEIVDRVTALQEELKATGKALAAAQAELAVAKAGALAAKAEAV SyneCC990 RELVVKQLGDRFKAQ-PGEIVERVVALQDELKSTGKALIAAQAELAVAKSAALATKAVAI ProMIT931 RDEVVKKLLERLKVQ-PSEIVERVTSLQEELKSSQKALTAARAELAVAKSAALATQAVAV SyneWH570 RDTVVRALGDRFKVQ-PGEILERVSGLQDELKAASRALAAARSELALARASALATGALEV ProCMP137 RDGVVKILSERFKAQ-SNEIVDRVIALQDEVKSLTKLLVKAQDEVAFTKALSLKNKVVSL ProMIT921 RDSVVRILSERFKVQ-SNEIVDRVIALQDEVKLLGKSLIKAQEEIAFAKTSALVSKATAI ProCMP198 KNSLVSQLCDLLKAN-PNQLLDRVNSLQSELINKNKEIQKMKDEIAYFKYSSLSSSANKV ProMIT931 RDEVVKKLLERLKVQ-PSEIVERVTSLQEELKSSQKALTAARAELAVAKSAALATQAVAV PromNATL2 RDLVVKELSKSFKVQ-SYEIVERVSSLQLELKDKTKELIKVKNELALAKALGLATYAKSV Staphyloc IQEKFNTMKSQMKVKSDDQVVEKLTQLQDEEKALLKQLEQRDKEITSLKMGNIEDQVEEI Bsubtilis QISVLKQTADELKTN-IKEVPKRVAALQAELKDAQRENESLLAKLGNVEAGAILSKVKEV Nostoc712 GDYKIIVAQLEGVDPESLKSAAERLLQKIGNG-AVVLG---SVPEADKVSLVAAFSPEVN Anaba2941 GDHKIIVAQLEGVDPESLKSAAERLLQKIGNG-AVVLG---SVPEADKVSLVAAFSPEVN Nosto7310 GDHKIIVAQLENVDPESLKTAAERLLQKIGNG-AVVLG---SVPEADKVSIVAAFSPEVN SynPCC680 GEFKILVADLGTVDGESLKTAAERLQQKLGES-AVVLA---SIPEEGKVSLVAAFSPQLV Crocospha ------------------------------------------------------------ Trichodes GEFQILVAEMPGVDAEALKTAAERLQQKLDES-AVVLG---SAAEG-KVSLVAAFSKSVN SynPCC794 GDAQVLVETLTGVDAAALQTAAERLQQKLGDAGAVVLG---SSPEEGKVTLVAAFGPAII SynPCC630 GDAQVLVETLTGVDAAALQTAAERLQQKLGDAGAVVLG---SSPEEGKVTLVAAFGPAII SynJA22-1 ADLKVLVAELGSTTPEALKTAAEHLLHKLGEG-AVVLG---SVPEAGKVSLVAAFSPAVQ SynJA3-3A GNLQILAAELGSTPPEALKTAAEHLLHKLGEG-AVVLG---SVPEAGKVSLVAAFSPAVQ ThermoBP- GNSHVLIASLAGVDPQGLKTAAEWLLNKLGSG-AVVLA---TQPAADKVNLLVAASQDVV Gloeobact GGFRVLVADLGDTEPEALKSAAEHLLAKLGEGGAVVLG---SAPAADKVSLVAAFGKGAV SyneRS991 GAHQLLVARLDGVEGGGLQSAAQGLVDQLGDGAAVVLGGLPDPGDLGKVILVAAFGKAVI SyneWH780 GNFQLLVERLDGVDGSGLQGAAQSLVDQLGDGGAVVIGGLPDPADQGKVILVAAFGKDVI SyneCC960 GDFQLLVERLDGVDGAGLQGAAQSLADQLGDGAAVVIGGLPDPGDMGKVILVAAFGQQVI SyneWH810 GEFQLLVERLDGVEGAGLQGAAQSLADQLGDGAAVVIGGLPDPGDLGKVILVAAFGKQVI SyneCC990 GSFQLLVERLDGVDGTGLQGAAQSLAAQLGDGAAVVLGGLPDPSDQGKVILVAAFGKDVI ProMIT931 GEYQLLVARLDGVEGAGLQNAAQGLLDQLGDGAAVVLGGLPDPSDEGKVILVAAFGKQLI SyneWH570 GAFRVLVARLDGVEGAALQTAAQQLQEGLGSAAAVVLGGLPAPEEPAKLVLVAAFGAEVI ProCMP137 TNSQYLIERLDGVTGDAIQSVVKTLVDELGDNAAVVLAGMPDLNDQKKVILVAAFGSEII ProMIT921 KSSHYIIHRLDGVPSEALQSAAKVLVDQLGDCSAVLLAGTPTQSDPNKVILVAAFGAKTV ProCMP198 GLFSLIISQLDGLDGNSLQSAALDLTSKLGDKSVVILGGIPDKENR-KLLFVVSFGEDLV ProMIT931 GEYQLLVARLDGVEGAGLQNAAQGLLDQLGDGAAVVLGGLPDPSDEGKVILVAAFGKQLI PromNATL2 GKSKLLIRRLDGVDGSGLQSAASSLIDHLGKYSAVIFGGIPNQEIDNKLVFVAAFSPDLV Staphyloc NGYKVLVTEVDVPNAKAIRSTMDDFKSKLQDTIIILA-----SNVDDKVSMVATVP-KSL Bsubtilis DGVNVLAAKVNAKDMNHLRTMVDELKAKLGSAVIVLG-----AVQNDKVNISAGVT-KDL Nostoc712 K-KGLQAGKFIGAIAKICGGGGGGRPNLAQAGGRDASKLPAALEQAQSELKSALMARINP Anaba2941 K-KGLQAGKFIGAIAKICGGGGGGRPNLAQAGGRDASKLPTALEQAQSELKSALMARTNP Nosto7310 K-KGLQAGKFVGAIAKICGGGGGGRPNLAQAGGRDASKLPDALGQAESDLKSALMARTIP SynPCC680 KTKQLKAGQFIGAIAKICGGGGGGRPNLAQAGGRDASKLPEALATAKQTLLAELMARTVP Crocospha ------------------------------------------------------MARNIP Trichodes G-KGLQAGKFIGGIAKICGGGGGGRPNLAQAGGRDPSKLKEALESAKEQLVDGLMVRTMP SynPCC794 A-KGLKAGQFIGGIAKICGGGGGGRPNLAQAGGRDASKLPEAIAAALDQLKTAIMARSVP SynPCC630 A-KGLKAGQFIGGIAKICGGGGGGRPNLAQAGGRDASKLPEAIAAALDQLKTAIMARSVP SynJA22-1 Q-LGLKAGSFIGEIAKLTGGGGGGRPNLAQAGGKQPEKLAEALQVARERLQAELMARTVP SynJA3-3A K-LGLKAGSFIGEIAKLTGGGGGGRPNLAQAGGKQPEKLAQALQVAQERLQAELMARTVP ThermoBP- Q-RGVHAGQLVAALAQVCGGRGGGRPNFAQAGGSQPAKLAEALELAHSRLKEILMARTTP Gloeobact A-KGLNAGKFVGEVAKITGGGGGGRPNLAQAGGKQPEKLKNALSEASSKLSGALMARNIP SyneRS991 S-RGQQAGKFIGGIAKVCGGGGGGRPNLAQAGGRDGAALDRALAAAREELSAALMARAFP SyneWH780 A-AKLQAGKFIGGIAKLCGGGGGGRPNLAQAGGRDGASLDAALTAARAELESVFMARAFP SyneCC960 A-AKLQAGKFIGGIAKQCGGGGGGRPNLAQAGGRDGAALPGALDAAQAELTSAFMARDFP SyneWH810 A-AKLQAGKFIGGIAKQCGGGGGGRPNLAQAGGRDGAALPGALAAARSELAAALMARDFP SyneCC990 A-AKQQAGKFIGTIAKLCGGGGGGRPNLAQAGGRDGAALAGALETARMELTAALMARAFP ProMIT931 A-QGQQAGKFIGSIAKRCGGGGGGRPNLAQAGGRDGAALDGALEAAKVDLQQALMARDFP SyneWH570 A-AGPKAGSFIAVVAKRCGGGGGGRPQLAQAGGRDAASLDPALEQARADLIASLMARAYP ProCMP137 A-QGLHAGQFLGPIAEICGGGGGGRPNFAQAGGRDPTKLDDALDLAKERIIQSLMARAFP ProMIT921 A-HGLHAGKFLGPIAKMCGGGGGGRPNFAQAGGRDAKPLDKALDLAREQLMGALMARPFP ProCMP198 K-RGMHAGKLINDISRICSGGGGGKPNFAQAGAKDIDKLNDALEYARKDLRTKLMARDFP ProMIT931 A-QGQQAGKFIGSIAKRCGGGGGGRPNLAQAGGRDGAALDGALEAAKVDLQQALMARDFP PromNATL2 S-DGLHAGKFISGVAKMCGGGGGGRPNLAQAGGSQPQSLDLALEKANENLTQQLMARAFP Staphyloc TNN-VKAGDLIKQMAPIVGGKGGGRPDMAQGGGTQPENISKSLSFIKDYIKNL-MAREFS Bsubtilis IEKGLHAGKLVKQAAEVCGGGGGGRPDMAQAGGKQPEKLEEALASVEDWVKSVLMAREFS Nostoc712 LEKVRNIGIAAHIDAGKTTTTERILFYSGIIHKIGEVHEGTAVTDWMDQERERGITITAA Anaba2941 LEKVRNIGIAAHIDAGKTTTTERILFYSGIIHKIGEVHEGTAVTDWMDQERERGITITAA Nosto7310 LEKVRNIGIAAHIDAGKTTTTERILFYSGIIHKIGEVHEGTAVTDWMEQERERGITITAA SynPCC680 LERIRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHEGTAVTDWMAQERERGITITAA Crocospha LERVRNIGIAAHIDAGKTTTTERILFYTGIAYKLGEVHEGTATMDWMAQEQERGITITAA Trichodes IERVRNIGIAAHIDAGKTTTTERILFYSGIVHKMGEVHYGTAVTDWMAQERERGITITAA SynPCC794 LEKVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGNAVTDWMEQERERGITITAA SynPCC630 LEKVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGNAVTDWMEQERERGITITAA SynJA22-1 LERVRNIGIAAHIDAGKTTTTERILFYSGLVHKIGEVHDGTAVTDWMAQERERGITITAA SynJA3-3A LERVRNIGIAAHIDAGKTTTTERILFYSGLVHKLGEVHEGTTVTDWMAQERERGITITAA ThermoBP- LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHEGTTVTDWMEQERERGITITAA Gloeobact LERVRNIGIAAHIDAGKTTTTERILFYSGVIHKIGEVHEGNTVTDWMAQERERGITITAA SyneRS991 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA SyneWH780 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA SyneCC960 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA SyneWH810 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA SyneCC990 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA ProMIT931 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA SyneWH570 LDRVRNIGIAAHIDAGKTTTTERILFYSGVVHKMGEVHDGAAVTDWMEQERERGITITAA ProCMP137 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA ProMIT921 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA ProCMP198 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA ProMIT931 LERVRNIGIAAHIDAGKTTTTERILFYSGVVHKIGEVHDGAAVTDWMAQERERGITITAA PromNATL2 LERVRNIGIAAHIDAGKTTCTERILFYSGVVHKMGEVHDGAAVTDWMAQERERGITITAA Staphyloc LEKTRNIGIMAHIDAGKTTTTERILYYTGRIHKIGETHEGASQMDWMEQEQDRGITITSA Bsubtilis LEKTRNIGIMAHIDAGKTTTTERILFYTGRIHKIGETHEGASQMDWMEQEQERGITITSA Nostoc712 AISTSWK---------------DYQINIIDTPGHVDFTIEVERSMRVLDGVIAVFCSVGG Anaba2941 AISTSWK---------------DYQINIIDTPGHVDFTIEVERSMRVLDGVIAVFCSVGG Nosto7310 AISTSWK---------------DHQINIIDTPGHVDFTIEVERSMRVLDGVIAVFCSVGG SynPCC680 AISTDWL---------------GHHINIIDTPGHVDFTIEVERSMRVLDGVIAVFCSVGG Crocospha AISTNWL---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCSVGG Trichodes AISTKWL---------------DHQINIIDTPGHVDFTIEVERSMRVLDGIIAVFCSVGG SynPCC794 AISTSWK---------------DYRVNIIDTPGHVDFTIEVERSMRVLDGVVAVFCSVGG SynPCC630 AISTSWK---------------DYRVNIIDTPGHVDFTIEVERSMRVLDGVVAVFCSVGG SynJA22-1 AITTRWTKRDPANPSQPLSGAPEYTINIIDTPGHVDFTIEVERSMRVLDGVIAVFDSVGG SynJA3-3A AITTRWTKRDPKNPSQPLAGAPEYTINIIDTPGHVDFTIEVERSMRVLDGVIAVFDSVGG ThermoBP- AISTSWR---------------DHQINIIDTPGHVDFTIEVERSMRVLDGVIAVFCSVGG Gloeobact AITTAWTRRDPENPTQPLPGALEHKINIIDTPGHVDFTIEVERSMRVLDGVITVLCSVGG SyneRS991 AISTSWK---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG SyneWH780 AISTSWN---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG SyneCC960 AISTSWQ---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG SyneWH810 AISTSWQ---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG SyneCC990 AISTSWK---------------DHRVNIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG ProMIT931 AISTSWQ---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG SyneWH570 AISTSWK---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVVAVFCAVGG ProCMP137 AISTSWQ---------------EHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG ProMIT921 AISTSWQ---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG ProCMP198 AISTSWQ---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG ProMIT931 AISTSWQ---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG PromNATL2 AISTTWD---------------DHRINIIDTPGHVDFTIEVERSMRVLDGVIAVFCAVGG Staphyloc ATTAAWE---------------GHRVNIIDTPGHVDFTVEVERSLRVLDGAVTVLDAQSG Bsubtilis ATTAQWK---------------GYRVNIIDTPGHVDFTVEVERSLRVLDGAVAVLDAQSG Nostoc712 VQPQSETVWRQADRYKVPRIAFINKMDRTGANFYRVHEQMRDRLRANAIAIQLPIGSEND Anaba2941 VQPQSETVWRQADRYKVPRIAFINKMDRTGANFYRVHEQMRDRLRANAIAIQLPIGSEND Nosto7310 VQPQSETVWRQAERYKVPRIAFINKMDRTGANFYKVHEQIRDRLRANAIAIQLPIGSEND SynPCC680 VQPQSETVWRQAERYQVPRIAFVNKMDRTGANFFRVCQQIGDRLRANAVPVQIPIGSEAE Crocospha VQPQSETVWRQANRYHVPRIAFVNKMDRTGANFFKVYQQISDRLKANAVPIQIPIGTESE Trichodes VQSQSETVWRQADRYQVPRMAFINKMDRTGANFFKVYEQIRDRLRANAVPIQIPIGSENE SynPCC794 VQPQSETVWRQADRYSVPRIVFVNKMDRTGADFFKVYGQIRDRVRANAVPIQIPIGAESD SynPCC630 VQPQSETVWRQADRYSVPRIVFVNKMDRTGADFFKVYGQIRDRVRANAVPIQIPIGAESD SynJA22-1 VQPQSETVWRQANRYNVPRIAFVNKMDRMGANFLKVYNQIRERLKANAVPIQLPIGAEDG SynJA3-3A VQPQSETVWRQANRYNVPRIAFVNKMDRMGANFLKVYNQIRERLKANAVPIQLPIGAEDE ThermoBP- VQPQSETVWRQADRYSVPRIVFVNKMDRTGANFYKVHDQIRDRLRANAVPIQLPIGAEDQ Gloeobact VQPQTETVWRQANRYNVPRFIFVNKMDRTGANFYKVYSQVRDRLRANAVPIQLPIGAEDT SyneRS991 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVHGQIKDRLKANAVPIQLPIGAEGD SyneWH780 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVHAQIKDRLKANAAPIQLPIGAEGD SyneCC960 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVHGQIKDRLKANAVPIQLPIGAEGE SyneWH810 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVHGQIKDRLKANAVPIQLPIGAEGD SyneCC990 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVHGQIQDRLKANAVPIQLPIGAEGE ProMIT931 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVNNQIKDRLKANALPIQLPIGAEGD SyneWH570 VQPQSETVWRQADRYNVPRIVFVNKMDRTGANFLKVYDQIKDRLKANAVPLQLPIGAEGE ProCMP137 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVYDQIKDRLKANAAPIQLPIGAEGD ProMIT921 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVYGQIKDRLKANAAPIQLPIGAEGD ProCMP198 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVNQQIKDRLKANAFPIQLPIGAEGD ProMIT931 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVNNQIKDRLKANALPIQLPIGAEGD PromNATL2 VQPQSETVWRQADRYSVPRMVFVNKMDRTGADFLKVHGQIKDRLKANAVPIQLPIGAEND Staphyloc VEPQTETVWRQATTYGVPRIVFVNKMDKLGANFEYSVSTLHDRLQANAAPIQLPIGAEDE Bsubtilis VEPQTETVWRQATTYGVPRIVFVNKMDKIGADFLYSVGTLRDRLQANAHAIQLPIGAEDN Nostoc712 FKGIVDLVRKRAYMYNNDQGTDIEETDIPADLQDQVEEYYTKLVEAVAETDDDLMSKYFD Anaba2941 FKGIVDLVRKRAYIYNNDQGTDIQETDIPADLQNQVEEYYTKLVEAVAETDDALMTKYFD Nosto7310 FQGIVDLVRQRAYIYANDQGTDIQETDIPEELQAQVDEFRTKLIEAAAETDDALMAKYFE SynPCC680 FEGIVDLVRMKAYLYKNDLGTDIQEVPIPDSVKDKTEEYRLRLVESVAEADDALMEKYLE Crocospha FRGIVDLVRMRAKIYQDDLGQNIEDTEIPAEYLEQAQEYRAKLVEAVAEIDETLLEKYME Trichodes FTGIVDLVAMKALIYNDDQGTDIQETEIPADVEKLAQEYRLKLVESVAETDDALTEKYLE SynPCC794 FQGIVDLVEMKAHIYTNDLGTDILVTDIPAELQETAAEWRSKMVEAVAETDEALLDKYFE SynPCC630 FQGIVDLVEMKAHIYTNDLGTDILVTDIPAELQETAAEWRSKMVEAVAETDEALLDKYFE SynJA22-1 FCGIVDLVRMQARIYMDEIGKDIRPAPIPEEMKDLVAEYRAKLVEAVAETDEALMEKYFA SynJA3-3A FRGIVDLVRLQANIYMDEIGKDIRPAPIPEEMKDLVAEYRAKLVEAVAETDEALMEKYFA ThermoBP- FKGIVDLVRMRAKIYKDDLGKEIEDTEIPAEMTELAQEYRTKLIEAVAETDDALMEKYFE Gloeobact LSGIVDLVGMKAYVYGNDIGTDIRVEEIPADMEELVQEYRAKLIEAVSETDDVLLEKYFG SyneRS991 LSGIIDLVANKAYIYKDDLGKDIEITDVPADMADEVAEWRNTLMEAVAETDEALIEKFLE SyneWH780 LSGIIDLVENKAHIYKDDLGQNIEITDVPDDMKDQVAEWRNYLMEAVAETDEALIEKFLE SyneCC960 LSGIIDLVANKAYIYKNDLGTDIEEADVPADMADEVAEWRNTLMETVAETDEALIEKFLE SyneWH810 LSGIIDLVGNKAYIYKNDLGTDIEEAEIPAEMADEAAEWRATLMETIAETDEALIEKFLE SyneCC990 LSGIIDLVENKAHIYKDDLGQDIEITDVPEAMKDQVEEWRAFLMEKVAETDEALIEKFLD ProMIT931 LTGIIDLVANKAYLYKNDLGTDIQEAPIPSEMDDEAAEWRYKLMESVAENDEELIETFLE SyneWH570 LKGIIDLVREKAILYTNDLGTDILEGEIPENMKEEAAEWRGKLMESVAETDEELLEAYLE ProCMP137 LSGIIDLVSNKAHIYKNDLGTDIEETEIPSDMAEKAAEWRSKLMETVAETDEELIESFLE ProMIT921 LSGIIDLVANKAYIYKNDLGTDIEESDIPADMASEAAEWRAKLMETVAETDEELIEQFLE ProCMP198 LSGIIDLVSNKAYLYKNDLGTDIEEAPIPDEMKDEALEWRSKLMESVAENDEELIEIFLD ProMIT931 LTGIIDLVANKAYLYKNDLGTDIQEAPIPSEMDDEAAEWRYKLMESVAENDEELIETFLE PromNATL2 LKGIIDLVENKAYIYKDDLGKDIEQTEVPSDMVDLVSDWRSKLMESIAETEEELLEAFLE Staphyloc FEAIIDLVEMKCFKYTNDLGTEIEEIEIPEDHLDRAEEARASLIEAVAETSDELMEKYLG Bsubtilis FEGIIDLVENVAYFYEDDLGTRSDAKEIPEEYKEQAEELRNSLIEAVCELDEELMDKYLE Nostoc712 GEPLTEEEIRSALRKGTIAG---TIVPVLCGSAFKNKGVQLMLDAVVDYLPAPTEVPPIQ Anaba2941 GEALTEEEIRSALRQGTIAG---TIVPVLCGSAFKNKGVQLMLDAVVDYLPAPTEVPPIQ Nosto7310 GEELTEQEIRTALRKGTIAG---TIVPVLCGSAFKNKGVQLMLDAVVDYLPAPSEVPPIQ SynPCC680 GEELTADELVAGLRRGTIAG---TMVPVLCGSAFKNKGVQLLLDAVVDYLPSPLEVPAIE Crocospha GEEITETEIKQGLRKGTLDK---TIIPMLCGSAFKNKGVQLLLDAVVDYLPSPLDVPPIT Trichodes GEELTAEEIRKALRLATISG---TVVPILCGSAFKNKGIQLLLNAVVDYLPAPQEVPAIQ SynPCC794 DGDLSIEDIKAGLRKGVLIQGNDRLVPMLCGSAFKNKGVQLLLDAVVELLPSPQDIPPIQ SynPCC630 DGDLSIEDIKAGLRKGVLIQGNDRLVPMLCGSAFKNKGVQLLLDAVVELLPSPQDIPPIQ SynJA22-1 EEDLSEADLMAGLRKGTISG---QIVPMLCGSAFKNKGVQMLLDAVVDYLPSPVDIPAIK SynJA3-3A EEDLSEADLMAGLRKGTISG---QIVPMLCGSAFKNKGVQMLLDAVVDYLPSPIDIPAIK ThermoBP- GEELTEEEIRAALRKGTIAG---TIVPMLCGSAFKNKGVQLLLDAVVDYLPAPIDIPAIK Gloeobact GEELTEAEIKAALRKGTVAN---TIVPMLCGSAFKNKGVQQMLDAVLDYLPSPLDIPPIK SyneRS991 TGELSDEELKSGIRIGVLKH---GLVPMLCGSAFKNKGVQLVLDAVVDYLPAPVDVPPIQ SyneWH780 TGELSVEELKAGIRKGVLKH---GLVPVLCGSAFKNKGVQLVLDAVVDYLPAPIDVPPIQ SyneCC960 SGELSVDDLKKGIREGVLKH---GLVPMLCGSAFKNKGVQLVLDAVIDYLPAPVDVPPIQ SyneWH810 TGELSTEELKKGIREGVLKH---GLVPMLCGSAFKNKGVQLVLDAVIDYLPAPVDVPPIQ SyneCC990 TGELSNDELKQGIRTGVVKH---GLVPVLCGSAFKNKGVQLVLDAVVDYLPAPIDVPPIQ ProMIT931 TGELSEEQLKKGIREGVLKH---GLVPVLCGSAFKNKGVQLVLDAVVDYLPAPVDVKPIQ SyneWH570 NGELTQEQLIKGIRTGVVKH---GLVPMLCGSAFKNKGVQLVLDAVVDYLPAPIDVPPIT ProCMP137 NGELTIDQLKKGIREGVLKH---GLVPMLCGSAFKNKGVQLVLDAVIDYLPAPIDVPPIQ ProMIT921 NGELTEQQLKKGIREGVLKH---GLVPLLCGSAFKNKGVQLVLDAVVDYLPAPVDVPPIQ ProCMP198 KGELTEDQLKKGIREGVLKH---GLVPVLCGSAFKNKGVQLVLDAVVDYLPAPIDVKPIQ ProMIT931 TGELSEEQLKKGIREGVLKH---GLVPVLCGSAFKNKGVQLVLDAVVDYLPAPVDVKPIQ PromNATL2 NGELTIEQLKSGIREGVLKH---GVVPMLCGSAFKNKGVQLLLDAVVNYLPAPVDVPPIQ Staphyloc DEEISVSELKEAIRQATTNV---EFYPVLCGTAFKNKGVQLMLDAVIDYLPSPLDVKPII Bsubtilis GEELTIDELKAGIRKGTLNV---EFYPVLVGSAFKNKGVQLVLDAVLDYLPAPTDVAAIK Nostoc712 GTLP--NGDAIERRADDNEPLAALAFKIMADPYG-RLTFVRVYSGVLKKGSYVLNATKNK Anaba2941 GTLA--NGDTVERRADDNEPLAALAFKIMADPYG-RLTFVRVYSGVLKKGSYVLNATKNK Nosto7310 GLLP--NGDTIERRADDNEPLAALAFKIMADPYG-RLTFVRVYSGVLKKGSYVLNASKNK SynPCC680 GHLP--DGEVATRPAEDKAPLSALAFKVMADPFG-RLTFVRVYSGVLEKGSYVLNSTKEK Crocospha GLLK--DETEDSRKADDNEPFSALAFKIASDPYG-RLTFMRVYSGVLEKGNYVYNATKDQ Trichodes GTLP--NGELDVRPADDEAPLASLAFKIMSDPYG-RLTFLRVYSGVLAKGSYILNSTKDK SynPCC794 GTLP--DGEVALRPSSDEAPFSALAFKIMADPYG-RLTFVRVYSGILQKGSYVYNATKGK SynPCC630 GTLP--DGEVALRPSSDEAPFSALAFKIMADPYG-RLTFVRVYSGILQKGSYVYNATKGK SynJA22-1 GVLP--DGSEVSRRASDDEPFSALAFKLMSDKYG-DLTFIRVYSGVLTKGTYVLNSTKNK SynJA3-3A GVLP--DGSEVSRKASDDEPFSALAFKLMSDKYG-DLTFIRVYSGVLTKGTYVLNSTKNK ThermoBP- GRLP--DGTEVERAADDDQPLAALAFKIMSDPYG-RLTFVRVYSGVLKKGSYVLNATKGK Gloeobact GLLP--NGTEVERSADDSQPLSALAFKIMADPYG-RLTFVRVYSGILQKGSYALNASKDK SyneRS991 GVLP--DGKEAVRPSDDKAPFSALAFKVMADPYG-KLTFVRMYSGVLEKGSYVLNSTKGE SyneWH780 GVLP--NGEEAVRPSDDKAPFSALAFKVMADPYG-KLTFVRMYSGVLQKGSYVMNSTKDS SyneCC960 GVLP--DGSEAVRPSDDSAPFSALAFKVMADPYG-KLTFVRMYSGILEKGSYVLNSTKGE SyneWH810 GVLP--DGKEAVRPSDDKAPFSALAFKVMADPYG-KLTFVRMYSGILEKGSYVLNSTKGE SyneCC990 GILP--DGTEAVRPSDDKAPFSALAFKVMADPYG-KLTFVRMYSGVLEKGSYVMNSTKGI ProMIT931 GVLP--SGKEDVRPSDDNAPFSALAFKVMSDPYG-KLTFVRMYSGVLSKGSYVMNSTKDA SyneWH570 GLLP--DGTESNRPCDDSAPFSALAFKVMADPYG-KLTFVRMYSGVLQKGSYVLNSTKDK ProCMP137 GVLP--SGKDDVRPSEDNAPFSALAFKVMADPYG-KLTFVRMYSGVLEKGSYVLNSTKDA ProMIT921 GVLP--NGEEAVRPSDDSEPFSALAFKVMADPYG-KLTFVRMYSGVLEKGSYVTNSTKDI ProCMP198 GVLP--NGKEDVRPSDDNAPFSALAFKVMSDPYG-KLTFVRMYSGVLSKGSYVMNSTKDA ProMIT931 GVLP--SGKEDVRPSDDNAPFSALAFKVMSDPYG-KLTFVRMYSGVLSKGSYVMNSTKDA PromNATL2 GLLP--NGKEAVRPSDDGAPFSALAFKVMADPYG-KLTFVRMYSGVLEKGSYVLNSTKDA Staphyloc GHRASNPEEEVIAKADDSAEFAALAFKVMTDPYVGKLTFFRVYSGTMTSGSYVKNSTKGK Bsubtilis GTRP-DTNEEIERHSSDEEPFSALAFKVMTDPYVGKLTFFRVYSGTLDSGSYVKNSTKGK Nostoc712 KERISRLVLMKADDRQDVEELRAGDLGAALGLKDTLTGDTITDEGAPVILESLFIPEPVI Anaba2941 KERISRLVLMKADDRQDVEELRAGDLGAALGLKDTLTGDTITDEGSPVILESLFIPEPVI Nosto7310 KERISRLVLMKADDRQDVDELRAGDLGAALGLKDTLTGDTLCDDGSPVILESLFIPEPVI SynPCC680 KERISRLIILKADDRIEVDQLNAGDLGAVLGLKDTLTGDTLCDDQEPIILESLFVPQPVI Crocospha KERISRLIVLKSNDRIEVDELRAGDLGAAIGLRNTITGDTLCDDKHPILLESLYIPEPVI Trichodes KERISRLIVLKADDRIEVDELRAGDLGAVVGLKDTLTGDTICDKDNPIILESLYVPEPVI SynPCC794 KERVSRLIILKADDRIEVDELRAGDLGAVLGLKDTFTGDTLCDDQNPIILESLFIPEPVI SynPCC630 KERVSRLIILKADDRIEVDELRAGDLGAVLGLKDTFTGDTLCDDQNPIILESLFIPEPVI SynJA22-1 KERISRLVVLKADERLDVDELRAGDLGAVVGLKDTTTGDTLCDENAPVILESLFIPEPVI SynJA3-3A KERISRLVVLKADERLDVDELRAGDLGAVLGLKDTTTGDTLCDENAPVILESLYIPEPVI ThermoBP- KERISRLIVLKADERIEVDELRAGDLGAALGLKETFTGDTLCDESSPVILESLYIPEPVI Gloeobact KERISRLIVLKADDRIEVDELRAGDLGAVVGLKDTFTGDTLCTEDSPVILESLFIPEPVI SyneRS991 KERISRLVVLKADDREEVDELRAGDLGAVLGLKATTTGDTLCAADDPIVLETLFVPEPVI SyneWH780 KERISRLVVLKADDREEVDELRAGDLGAVLGLKATTTGDTLCSAEDPIVLETLFVPEPVI SyneCC960 KERISRLVVLKADDREEVDALRAGDLGAVLGLKNTTTGDTLCTQDDPIVLETLFIPEPVI SyneWH810 KERISRLVVLKADDREEVDALRAGDLGAVLGLKNTTTGDTLCTQDDPIVLETLFIPEPVI SyneCC990 KERISRLVVLKADDREEVDQLQAGDLGAVLGLKNTTTGDTLCSADEPIVLETLFVPEPVI ProMIT931 KERISRLVILKADEREEVDELRAGDLGAVLGLKNTTTGDTLCNTEDPIVLETLFIPEPVI SyneWH570 KERISRLILLKADDREEVDELRAGDLGAVLGLKDTTTGDTLCVESDPIILESLFIPEPVI ProCMP137 KERISRLVVLKADDREEVDQLRAGDLGAVLGLKNTTTGDTLCSTDDPIVLETLFVPEPVI ProMIT921 KERISRLVVLKADDREEVDQLRAGDLGAVLGLKNTTTGDTLCTTDEPIVLETLFIPEPVI ProCMP198 KERISRLVILKADEREEVDELRAGDLGAVLGLKNTTTGDTLCNTDDPIVLETLFIPEPVI ProMIT931 KERISRLVILKADEREEVDELRAGDLGAVLGLKNTTTGDTLCNTEDPIVLETLFIPEPVI PromNATL2 KERISRLIILKADDREEVDELRAGDLGAVLGLKNTTTGDTLCASEEAIVLETLYIPEPVI Staphyloc RERVGRLLQMHANSRQEIDTVYSGDIAAAVGLKDTGTGDTLCGEKNDIILESMEFPEPVI Bsubtilis RERVGRILQMHANSREEISTVYAGDIAAAVGLKDTTTGDTLCDEKDLVILESMEFPEPVI Nostoc712 SVAVEPKTKNDMDKLSKALQSLSEEDPTFRVNVDPETNQTVIAGMGELHLEILVDRMLRE Anaba2941 SVAVEPKTKNDMDKLSKALQSLSEEDPTFRVNVDPETNQTVIAGMGELHLEILVDRMLRE Nosto7310 SVAVEPKTKNDMDKLSKALQSLSEEDPTFRVRVDPETNQTVIAGMGELHLEILVDRMLRE SynPCC680 SVAVEPKTKQDMDKLSKALQSLSEEDPTFRVSVDPETNQTVIAGMGELHLEILVDRMLRE Crocospha SVAVEPKTKQDMEKLSKALQALSDEDPTFKVSIDPETNQTVIAGMGELHLEILVDRMLRE Trichodes SVAVEPKTKQDIDKLSQALQALSDEDPTFRVSVDPETNQTVIAGMGELHLEILVDRMLRE SynPCC794 SVAVEPKTKNDMEKLSKALQALSEEDPTFRVSVDSETNQTVIAGMGELHLEILVDRMLRE SynPCC630 SVAVEPKTKNDMEKLSKALQALSEEDPTFRVSVDSETNQTVIAGMGELHLEILVDRMLRE SynJA22-1 SVAVEPKTKADIDKLSKALQALAKEDPTFRVSVDPETNQTIISGMGELHLEILVDRMLRE SynJA3-3A SVAVEPKTKADIDKLSKALQALAKEDPTFRVSVDPETNQTIISGMGELHLEILVDRMLRE ThermoBP- SVAVEPKTKQDMEKLSKALQALSEEDPTFRVSVDPETNQTVIAGMGELHLEILVDRMQRE Gloeobact SVAIEPKTKADLDKLSKALQSLSEEDPTFRVHVDQETNQTIIAGMGELHLEILVDRMLRE SyneRS991 SVAVEPKTKGDMEKLSKALVALAEEDPTFRVNTDAETGQTVIAGMGELHLEILVDRMLRE SyneWH780 SVAVEPKTKGDMEKLSKALVSLAEEDPTFRVNTDQETGQTVIAGMGELHLEILVDRMLRE SyneCC960 SVAVEPKTKGDMEKLSKALVALAEEDPTFRVNTDSETGQTVIAGMGELHLEILVDRMLRE SyneWH810 SVAVEPKTKGDMEKLSKALVSLAEEDPTFRVNTDSETGQTVIAGMGELHLEILVDRMLRE SyneCC990 SVAVEPKTKGDMEKLSKALVSLAEEDPTFRVRTDQETGQTVIAGMGELHLEILVDRMMRE ProMIT931 SVAVEPKTKGDMEKLSKALTALSEEDPTFRVSTDPETNQTVIAGMGELHLEILVDRMLRE SyneWH570 SVAVEPKTKGDMEKLSKALQSLSEEDPTFRVSTDPETSQTVIAGMGELHLEILVDRMLRE ProCMP137 SVAVEPKTKGDMEKLSKALVSLAEEDPTFRVSTDQETNQTVIAGMGELHLEILVDRMLRE ProMIT921 SVAVEPKTKGDMEKLSKALVSLAEEDPTFRVSTDQETNQTVIAGMGELHLEILVDRMLRE ProCMP198 SVAVEPKTKGDMEKLSKALQALSEEDPTFRVSTDQETNQTVIAGMGELHLEILVDRMLRE ProMIT931 SVAVEPKTKGDMEKLSKALTALSEEDPTFRVSTDPETNQTVIAGMGELHLEILVDRMLRE PromNATL2 SVAVEPKTKSDMEKLGKALTSLSEEDPTFRVSTDQETNQTVIAGMGELHLEILVDRMLRE Staphyloc HLSVEPKSKADQDKMTQALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKE Bsubtilis DVAIEPKSKADQDKMGIALAKLAEEDPTFRTQTNPETGQTIISGMGELHLDIIVDRMKRE Nostoc712 FKVEANVGAPQVAYRETIRKPVTNVEGKFIRQSGGKGQYGHVVINLEPGEPGTGFEFVSK Anaba2941 FKVEANVGAPQVAYRETIRKSVTNVEGKFIRQSGGKGQYGHVVINLEPGEPGTGFEFVSK Nosto7310 FKVEANVGAPQVAYRETIRKAVNKVEGKFIRQSGGKGQYGHVVINLEPGEPGTGFEFVSK SynPCC680 FKVEANVGAPQVAYRETIRKAVQ-AEGKFIRQSGGKGQYGHVVIEVEPTEPGTGFEFVSK Crocospha YKVQASVGKPQVAYRETIRKPSE-AEGKYIRQSGGKGQYGHVVVELEPXEAGSGFEFVSK Trichodes YKVKANVGKPQVAYRETIRQQIQ-AEGKFIRQSGGKGQYGHVVIELEPGDPGSGFEFVSK SynPCC794 YKVEANIGAPQVAYRETVRKAVK-AEGKFVRQSGGKGQYGHVVIELEPAEPGTGFEFVSK SynPCC630 YKVEANIGAPQVAYRETVRKAVK-AEGKFVRQSGGKGQYGHVVIELEPAEPGTGFEFVSK SynJA22-1 FNVEANVGNPQVAYRETIRKPVSRVEGKFVRQSGGRGQYGHVVIDLEPAEPGTGFEFVSK SynJA3-3A FNVEANVGNPQVAYRETIRKPVSRVEGKFIRQTGGRGQYGHVVIDLEPAEPGTGFEFVSK ThermoBP- YKVEANIGQPQVAYRETIRKPVR-AEGKFIRQSGGKGQYGHVVIEVEPAEPGTGFEFVSK Gloeobact FKVEANVGAPQVAYRETIRKAVNNVEGLYKRQTGGKGQYGHVVINLEPGEPGTGFEFVSK SyneRS991 FKVEANIGAPQVSYRETIRASAR-GEGKFSRQTGGKGQYGHVVIEMEPGEPESGFEFVNK SyneWH780 FKVEANIGAPQVSYRETIRASSR-GEGKFSRQTGGKGQYGHVVIEMEPGEPESGFEFVNK SyneCC960 FKVEANIGAPQVSYRETIRGSAG-GEGKFSRQTGGKGQYGHVVIEMEPGEPGSGFEFVNK SyneWH810 FKVEANIGAPQVSYRETIRGSAG-GEGKFSRQTGGKGQYGHVVIEMEPGEPGSGFEFVNK SyneCC990 FKVEANIGAPQVSYRETIRGSSK-GEGKFSRQTGGKGQYGHVVIEMEPGEPESGFVFVNK ProMIT931 FKVEANIGAPQVSYRETIRSSSK-GEGKYARQTGGKGQYGHVIIEMEPAEVGKGFEFVNK SyneWH570 FKVEANIGAPQVSYRETIRARAK-GEGKFARQTGGKGQYGHVVIEMEPGEPGSGFEFVNK ProCMP137 FKVEANIGAPQVSYRETIRSSSK-GEGKYARQTGGKGQYGHVVIEMEPGEPGSGFEFINK ProMIT921 FKVEANIGAPQVSYRETIRSSSK-GEGKFARQTGGKGQYGHVVIEMEPGEPGTGFEFVNK ProCMP198 FKVEANIGAPQVSYRETIRSSSK-GEGKYARQTGGKGQYGHVVIEMEPAEVGKGFEFVNK ProMIT931 FKVEANIGAPQVSYRETIRSSSK-GEGKYARQTGGKGQYGHVIIEMEPAEVGKGFEFVNK PromNATL2 FKVEANIGAPQVSYRETIRASSS-GEGKFARQTGGKGQYGHVVIEVEPGEPGTGFEFVNK Staphyloc FNVECNVGAPMVSYRETFKSSAQ-VQGKFSRQSGGRGQYGDVHIEFTPNETGAGFEFENA Bsubtilis FKVEANVGAPQVAYRETFRTGAK-VEGKFVRQSGGRGQFGHVWIEFEPNEEGAGFEFENA Nostoc712 IVGGVVPKEYIGPAEQGMKESCESGILAGYPLIDVKATLVHGSYHDVDSSEMAFKIAGSM Anaba2941 IVGGVVPKEYIGPAEQGMKESCESGILAGYPLIDVKATLVHGSYHDVDSSEMAFKIAGSM Nosto7310 IAGGTVPKEYVGPAEQGMKESCESGVLAGYPLIDVKATLIDGSYHDVDSSEMAFKIAGSM SynPCC680 IVGGVIPKEYIAPSEQGMKEACASGVLAGYPVIDLKATLVDGSFHDVDSSEMAFKIAGSM Crocospha IVGGVIPKEL-------------------------------------------------- Trichodes IVGGTVPKEFISPAEQGMKEACEAGVLAGYPLIDVKATLVDGSYHDVDSSEMAFKIAGSM SynPCC794 IVGGTVPKEYVGPAEQGMKETCESGVLAGYPLIDIKATLVDGSYHDVDSSEMAFKIAGSM SynPCC630 IVGGTVPKEYVGPAEQGMKETCESGVLAGYPLIDIKATLVDGSYHDVDSSEMAFKIAGSM SynJA22-1 IVGGVVPKEYIGPAEQGIREACESGVLAGYPLIDIRATLVDGSYHEVDSSEMAFKIAGSM SynJA3-3A IVGGVIPKEYIPPAEQGIREACESGVLAGYPLIDIRVTLVDGSYHEVDSSEMAFKIAGSM ThermoBP- IVGGVVPKEYIPPAEQGMKEACESGILAGYPVIDLKVTLVDGSYHEVDSSEMAFKIAGSI Gloeobact IVGGVVPKEYIGPAEQGMKERCESGVIAGYPLIDVKVTMVDGSYHDVDSSEMAFKIAGSL SyneRS991 IVGGVVPKEFIKPSEMGMKETCESGVIAGFPMIDVRVTMVDGSYHDVDSSEMAFKIAGSM SyneWH780 IVGGVVPKEYIKPSEMGMKETCESGVIAGYPLIDVKVTMVDGSYHDVDSSEMAFKIAGSM SyneCC960 IVGGVVPKEYIKPAEQGMKETCESGVIAGYPLIDVKCTLVHGSYHDVDSSEMAFKIAGSM SyneWH810 IVGGIVPKEYIKPAEQGMRETCESGVIAGYPLIDVRCTLVHGSYHDVDSSEMAFKIAGSM SyneCC990 IVGGIVPKEFIKPSEQGMKETCESGVIAGFPLIDVKVSMVDGSYHDVDSSEMAFKIAGSM ProMIT931 IVGGAVPKEYIGPASNGMKETCESGVLAGYPLIDVKVTLVDGSFHDVDSSEMAFKIAGSM SyneWH570 IVGGIVPKEYIGPAENGMKETCQSGVIAGFPMIDIKVTMVDGSYHDVDSSEMAFKIAGSM ProCMP137 IVGGVVPKEYIGPASNGMKETCESGVLAGYPLIDVKVTMVDGSFHDVDSSEMAFKIAGSM ProMIT921 IVGGVVPKEYIGPASNGMKETCESGVLAGYPLIDVKVTMVDGSFHDVDSSEMAFKIAGSM ProCMP198 IVGGTVPKEYIGPASNGMKETCESGVLAGYPLIDVKVTLVDGSFHDVDSSEMAFKIAGSM ProMIT931 IVGGAVPKEYIGPASNGMKETCESGVLAGYPLIDVKVTLVDGSFHDVDSSEMAFKIAGSM PromNATL2 IVGGSVPKEYIKPAESGMRETCESGVIAGYPLIDVKVTLVDGSYHDVDSSEMAFKIAGSM Staphyloc IVGGVVPREYIPSVEAGLKDAMENGVLAGYPLIDVKAKLYDGSYHDVDSSEMAFKIAASL Bsubtilis IVGGVVPREYIPAVQAGLEDALENGVLAGFPLIDIKAKLFDGSYHDVDSNEMAFKVAASM Nostoc712 ALKEAVLKASPVLLEPMMKVEVEVPEDYIGNVIGDLISRRGQIESQSTEQGLAKVASKVP Anaba2941 ALKEAVLKASPVLLEPMMKVEVEVPEDYIGNVIGDLISRRGQIESQSTEQGLAKVASKVP Nosto7310 AMKEAVLKASPVILEPMMKVEVEVPEDYMGNVIGDLNTRRGQIESQSTEKGLAKVTSKVP SynPCC680 AIREAVGQADPVLLEPVMKVEIEVPDDFMGNVIGDLNARRGHIEGQETEQGIAKVAASVP Crocospha ------------------------------------------------------------ Trichodes AIKEGVIKASPVLLEPMMKVEVEVPEDFIGNIIGDLNSRRGQIEGQGLETGMAKVMAKVP SynPCC794 AIKEAVRKADPVLLEPVMKVEVEVPEDFLGSVMGNLISRRGQIEGQATTNGTATVSAKVP SynPCC630 AIKEAVRKADPVLLEPVMKVEVEVPEDFLGSVMGNLISRRGQIEGQATTNGTATVSAKVP SynJA22-1 ALKEAARRASPTLLEPMMKVEVEVPEAFVGDVIGDINARRGQMEGMNTEGGITKVNAKVP SynJA3-3A ALKEAARRANPVLLEPMMKVEVEVPEAFVGDVIGDINARRGQMEGMSTEGGISKVNAKVP ThermoBP- AIKEAVMKANPVLLEPMMKVEVEVPEEFLGTVMGDLIARRGQIEGQTVENGIAKVTAKVP Gloeobact ALREAAQKAQPVLLEPMMKVEVEVSGDFLGDVMGDLNARRGQIESMDNEGGVSKVTSRVP SyneRS991 AFKDAVKKCNPVLLEPMMKVEVEIPEDFLGSIIGDLSSRRGQVEGQAIDDGTSKVSAKVP SyneWH780 AFKDAVKKCNPVLLEPMMKVEVEVPEDFLGSIIGDLSSRRGQVEGQAIDDGTSKVSAKVP SyneCC960 AFKDGVKKCNPVLLEPMMKVEVEAPEDFLGSIIGDLSSRRGQVEGQSVEDGTSKISAKVP SyneWH810 AFKDGVKKCNPVLLEPMMKVEVEVPEDFLGSIIGDLSSRRGQVEGQGVEDGTSKISAKVP SyneCC990 AFKDAVRKCNPVLLEPMMKVEVEVPEDFLGSVIGDLSSRRGQVEGQAIDDGTSKVSAKVP ProMIT931 AFKDGVKKCNPVLLEPMMKVEVESPDDFLGSVIGDLSSRRGQVEGQSVDDGLSKVQAKVP SyneWH570 AFKDGVKKCNPVLLEPMMKVEVEIPEDFLGSVIGDLSSRRGQVEGQSIDNGQSKVQSKVP ProCMP137 AFKDGVKKCNPVLLEPMMKVEVEVPEDFLGSIIGDLSSRRGQVEGQSIDDGISKVQSKVP ProMIT921 AFKDGVKKCNPVLLEPMMKVEVETPEDFLGSIIGDLSSRRGQVEGQSIDDGQSKVQAKVP ProCMP198 AFKDGVKKCNPVLLEPMMKVEVESPDDFLGSVIGDLSSRRGQVEGQSVDDGLSKVQAKVP ProMIT931 AFKDGVKKCNPVLLEPMMKVEVESPDDFLGSVIGDLSSRRGQVEGQSVDDGLSKVQAKVP PromNATL2 AFKDGIKKCNPVLLEPMMKVEVEVPEDFLGSIIGDLSSRRGQVEGQSIEDGQSKVQSKVP Staphyloc ALKEAAKKCDPVILEPMMKVTIEMPEEYMGDIMGDVTSRRGRVDGMEPRGNAQVVNAYVP Bsubtilis ALKNAVSKCNPVLLEPIMKVEVVIPEEYMGDIMGDITSRRGRVEGMEARGNAQVVRAMVP Nostoc712 LATMFGYATDIRSKTQGRGIFTMEFSHYEEVPRSVAETIIAKSKGNA---MTTSQERIIP Anaba2941 LATMFGYATDIRSKTQGRGIFTMEFSHYEEVPRSVAETIIAKSKGNA---MTTSQERIIP Nosto7310 LASMFGYATDIRSKTQGRGTFTMEFSHYEEVPRSVAETIIAKSKGNA---MTTSQERIIP SynPCC680 LAEMFGYATDIRSKTQGRGIFSMEFSHYAEVPRNVAEAIVAKSRGYA---MTDSPDRLIA Crocospha --------------------------------------------------MTIPQDRIIP Trichodes LAEMFGYATDMRSKTQGRGVFSMEFSNYEEVPHNVAETIISKSRGYV---MTTSESRIVP SynPCC794 LAEMFGYATDLRSMTQGRGIFTMEFSQYEEVPRNVAETIIAKNKGNA----MSAQERIIQ SynPCC630 LAEMFGYATDLRSMTQGRGIFTMEFSQYEEVPRNVAETIIAKNKGNA----MSAQERIIQ SynJA22-1 LAEMFGYATDIRSKTQGRGTFTMEFSHYEEVPRSIAEAIIAKNKGNE---DNESTARIIS SynJA3-3A LAEMFGYATDIRSKTQGRGIFTMEFSHYEEVPRSIAEAIIAKSKGSGVTN---MTARIIS ThermoBP- LERMFGYATDIRSNTQGRGIFSMEFSHYEEVPRNVAEAIIAKNKGNA---FAADSSRIIP Gloeobact LAEMFGYATDIRSKTQGRGTFSMEFSHYEEVPRNVAETIIAKNKGNA-------MTTIIP SyneRS991 LAEMFGYATELRSMTQGRGIFSMEFSHYEDVPRNVAEAIISKNQGNS---PGESDDRIIQ SyneWH780 LAEMFGYATELRSMTQGRGIFSMEFSHYEDVPRNVAEAIISKNQGNS---PGDSDDRIIQ SyneCC960 LAEMFGYATELRSMTQGRGIFSMEFDNYAEVPRNVAEAIISKNQGNS---PGDSDDRIIQ SyneWH810 LAEMFGYATELRSMTQGRGIFSMEFDNYAEVPRNVAEAIISKNQGN----PGDSDDRIIQ SyneCC990 LAEMFGYATELRSMTQGRGIFSMEFSHYEDVPRNVAEAIISKNQGNS---PGDSDERIIQ ProMIT931 LAEMFGYATQLRSMTQGRGIFSMEFANYEEVPRNVAEAIITKNQGNS---PGESDDRIIQ SyneWH570 LAEMFGYATQLRSMTQGRGIFSMEFSHYEEVPRNVAEAIIAKNQGNS----GESDGRIIQ ProCMP137 LAEMFGYATQLRSMTQGRGIFSMEFSKYEEVPRNVAEAIISKNQGNS---PDELEDRIIQ ProMIT921 LAEMFGYATQLRSMTQGRGIFSMEFSNYEEVPRNVAEAIISKNQGNS---PGEFEDRIIQ ProCMP198 LAEMFGYATQLRSMTQGRGIFSMEFANYEEVPRNVAEAIISKNQGNS-----MTKEKFTS ProMIT931 LAEMFGYATQLRSMTQGRGIFSMEFANYEEVPRNVAEAIITKNQGNS---PGESDDRIIQ PromNATL2 LAEMFGYATQLRSMTQGRGIFSMEFSTYEEVPRNVAEAIISKNQGNS---PGESDDRIIQ Staphyloc LSEMFGYATSLRSNTQGRGTYTMYFDHYAEVPKSIAEDIIKKNKGE----AELPQSRINE Bsubtilis LAEMFGYATALRSNTQGRGTFTMHMDHYGEVPKSVAEEIIKKNKGE----SEQNTPQVRE Nostoc712 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMHELGLTHDRPFKKCARVV Anaba2941 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMHELGLTHDRPFKKCARVV Nosto7310 IDLRTEMSQSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMHELGLLHDRPFKKCARVV SynPCC680 TDLRNEMSQSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTPDRPFRKCARVV Crocospha TDLSNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTPERPFRKCARVV Trichodes TDLRNEMSQSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMHELGLTPDRPFRKCARVV SynPCC794 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTPDRPFRKCARVV SynPCC630 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTPDRPFRKCARVV SynJA22-1 TDLQREMSQSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMHELGLTADRPFRKCARVV SynJA3-3A TDLQREMAQSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMHELGLTADRPFRKCARVV ThermoBP- TELREEISRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPFRKCARVV Gloeobact TNLRNEMQRSYLEYAMSVIVGRALPDARDGLKPVHRRILFAMHELGLGPDRPYRKCARVV SyneRS991 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV SyneWH780 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV SyneCC960 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV SyneWH810 ADLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV SyneCC990 TDLRNEMSRSYMEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV ProMIT931 TDLRIEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV SyneWH570 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV ProCMP137 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV ProMIT921 TDLRNEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV ProCMP198 ISLQEEMQRSYLEYAMSVIIGRALPDARDGLKPVQRRILFAMHELGLTPDRPFRKCARVV ProMIT931 TDLRIEMSRSYLEYAMSVIVGRALPDARDGLKPVHRRILYAMYELGLTSDRPYRKCARVV PromNATL2 TDLRNEMSRSYLEYAMSVIVGRALPDSRDGLKPVHRRILYAMYELGLTSDRPYRKCARVV Staphyloc RNITSEMRESFLDYAMSVIVARALPDVRDGLKPVHRRILYGLNEQGMTPDKSYKKSARIV Bsubtilis INISQEMRTSFLDYAMSVIVSRALPDVRDGLKPVHRRILYAMNDLGMTSDKPYKKSARIV Nostoc712 GEVLGKYHPHGDTAVYDALVRMAQDFSMRSPLVNGHGNFGSVDNDPPAAMRYTECRLQAL Anaba2941 GEVLGKYHPHGDTAVYDALVRMAQDFSMRSPLVNGHGNFGSVDNDPPAAMRYTECRLQAL Nosto7310 GEVLGKYHPHGDTAVYDALVRMAQDFSMRSPLVNGHGNFGSVDNDPPAAMRYTECRLQAL SynPCC680 GEVLGKYHPHGDTAVYDALVRMAQDFSMREPLIDGHGNFGSVDNDPPAAMRYTESRLRPL Crocospha GEVLGKYHPHGDTAVYDALVRMAQDFSMRSPLIEGHGNFGSVDNDPPAAMRYTECRLQAL Trichodes GEVLGKYHPHGDTAVYDALVRMAQDFSMRSPLIQGHGNFGSVDNDPPAAMRYTECRLQVL SynPCC794 GEVLGKYHPHGDTAVYDALVRMAQDFSMRSPLIDGHGNFGSIDNDPPAAMRYTESRLKPL SynPCC630 GEVLGKYHPHGDTAVYDALVRMAQDFSMRSPLIDGHGNFGSIDNDPPAAMRYTESRLKPL SynJA22-1 GDVIGKYHPHGDQAVYDALVRMAQDFSMRERLVDGHGNFGSVDNDPPAAMRYTECRLTAF SynJA3-3A GDVIGKYHPHGDQAVYEALVRMAQDFSMRERLVDGHGNFGSIDNDPPAAMRYTECRLTAF ThermoBP- GEVLGKYHPHGDSAVYDALVRMAQDFSMRHPLIEGHGNFGSIDNDPPAAMRYTECRLQAL Gloeobact GDVLGKYHPHGDSAVYDALVRLAQDFSTRYLLIDGHGNFGSVDNDPPAAMRYTECRLTPL SyneRS991 GEVLGKYHPHGDTAVYDALVRMAQSFSMSMPLIDGHGNFGSVDNDPPAAMRYTESRLQAL SyneWH780 GEVLGKYHPHGDTAVYDALVRMAQDFSMSMPLIDGHGNFGSVDNDPPAAMRYTESRLKAL SyneCC960 GEVLGKYHPHGDTAVYDALVRMAQDFSMSMPLIDGHGNFGSVDNDPPAAMRYTESRLRAL SyneWH810 GEVLGKYHPHGDTAVYDALVRMAQDFSMSMPLIDGHGNFGSVDNDPPAAMRYTESRLQAL SyneCC990 GEVLGKYHPHGDTAVYDALVRMAQDFSMSMPLIDGHGNFGSVDNDPPAAMRYTESRLRAL ProMIT931 GEVLGKYHPHGDTAVYDALVRMAQDFSMQMPLIDGHGNFGSVDNDPPAAMRYTESRLQSL SyneWH570 GEVLGKYHPHGDTAVYDALVRMAQDFSMRMPLIDGHGNFGSVDNDPPAAMRYTESRLQAL ProCMP137 GEVLGKYHPHGDTAVYDALVRMAQDFSMQMPLIDGHGNFGSIDNDPPAAMRYTESRLRSL ProMIT921 GEVLGKYHPHGDTAVYDALVRMAQDFSMRMPLVDGHGNFGSIDNDPPAAMRYTESRLQSL ProCMP198 GDVLGKYHPHGDQAVYEALVRLVQDFSTKYPTLDGHGNFGSVDNDPPAAMRYTETRLAPI ProMIT931 GEVLGKYHPHGDTAVYDALVRMAQDFSMQMPLIDGHGNFGSVDNDPPAAMRYTESRLQSL PromNATL2 GEVLGKFHPHGDTAVYDALVRMAQDFSMQMPLIDGHGNFGSVDNDPPAAMRYTESRLQSL Staphyloc GDVMGKY
Example Output File
The total number of character(s) is: 1214 Nostoc712 YAVVLTIAMLPFPTCKQFGLVSVERTCCSIYYFPDFYMQDMQINLISVGIAGVLILYVLE Anaba2941 YAVVLTIAMLPFPTCKQFGLVSVERTCCSIYYFPDFYMQDMQINLISVGIAGVLILYVLE Nosto7310 FPVVLTIAMLPFPTCKHFGLVSVERTCCSIYYFPDFYMQDMQINLISVGIAGVLILYVLE SynPCC680 FPVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINLISVGIAGVLILYVLE Crocospha YPVVLTIAMLPFPTCAHFAIVSVERTCCSLYYFPDFYMEDMQINLISVGIADVLILYVLE Trichodes YPVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINLISVGIAGILILYLLE SynPCC794 FPIVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQVNLISVGIAGVLILYLLE SynPCC630 FPIVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQVNLISVGIAGVLILYLLE SynJA22-1 YPVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMEDLQVNLISVGIPDVLILYVLE SynJA3-3A YPVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMEDLQVNLISVGIPDVLILYVLE ThermoBP- YPIVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMEDMQVNLISVGIAGVLILYVLE Gloeobact FPVVLTIAMLPFPTCAHFALVSVEKTCCSIYYFPDFYMQDLQVNLISIGVPGILILYLLE SyneRS991 YAVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINLLSVGIAGILLLYLLE SyneWH780 FPVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINLLSVGIAGILLLHLLE SyneCC960 YAIVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINLLSVGIAGILLLHVLE SyneWH810 YAIVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINLLSVGIAGILLLHVLE SyneCC990 YAIVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDLQINLLSVGIAGILLLHVLE ProMIT931 FPVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINILSVGIAGIMLLYLLE SyneWH570 YAVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDLQINLLSVGIAGILLLYLLE ProCMP137 YPVVLTIAMLPFPTCAHFAIISIERTCCSLYYFPDFYMQNMQINLLSVGIAGVLLLYLLE ProMIT921 FPVVLTIAMLPFPVCAQFALVSIDRTCCSLYYFPDFYMEDMQINLLSVGIAGILLLYLLE ProCMP198 YPIVLTIAMLPFPTCAHFAIVSVDKTCCSLYFFPDFYMQDMQINLISIGIAGILILYLIE ProMIT931 FPVVLTIAMLPFPTCAHFALVSVERTCCSIYYFPDFYMQDMQINILSVGIAGIMLLYLLE PromNATL2 FAIVLTIAMLPFPTCAHFAIISVERTCCSLYFFPDFYMQDMQINLLSIGIAGILILYLLE Staphyloc FPVLWINSVATLKVAAHIALVTIERESNTIFYREEYWSENMSVTLIAVAIPGVLLIYVIL Bsubtilis FPVLWINSVATLKVAAHIALVTVERESNTIFYREEYWSENMSVTLIAVAVPGVLLIYVIL Nostoc712 EYTTVIGVVLGINIPKASVTRAISTYIIMIAFCSVGQSRIRNVQDLALFVNDIYLDYELT Anaba2941 EYTTVIGVVLGINIPKASVTRAISTYIIMIAFCSVGQSRIRNVQDLALFVNDIYLDYELT Nosto7310 EYTTVIGVVLGINIPKASVTRAISTHIIMIAFCSVGQSRIRNVQDLALFVNDIRLDYELT SynPCC680 DYTTVIGIIVGINIPRASVTRAISTHIIMIAFCSVGQSRVRNVQDLPIFVNDIRLDYELT Crocospha EYTTIIGIVVNINIPRATVMQAISTHIIMIAFCSVGQSRVRNVQDLPIFVDDIRLEYEIT Trichodes EYTTVIAVILGINIPRASVTRAISTHIIMIAFCSVGQSRIRNVQDLPIFVDDIRLDYELT SynPCC794 EYTTVVGVILGINIPKASVTRAISTYVIMVAFCSVGQSRVRDVQDVPIFVNDIRMEYGLS SynPCC630 EYTSVVGVILGINIPKASVTRAISTYVIMVAFCSVGQSRVRDVQDVPIFVNDIRMEYGLS SynJA22-1 DFTTVIGVILGIHIPRASVTRAITTYIIMIAFDSVGQSRVRNVQELPLFVDEIRLEYELS SynJA3-3A DFTTVIGVVLGIHIPRASVTRAITTYIIMIAFDSVGQSRVRNVQELPLFVDEIRLEYELS ThermoBP- EFTTVIGVILGIHIPRASVTRAISTHIIMIAFCSVGQSRVRNVQDLPLFVDDIRLDYELT Gloeobact EYTTVIGVIVGIHVPRASVTRAITTHIIMITLCSVGQTRVRNVQDLPLLVNDIRLDYELT SyneRS991 DYTTVVGVILGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLIDDVRLEFGLS SyneWH780 DYTTVIGVVLGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLIDDVRLEFGLS SyneCC960 EYTTVVGVILGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLINDVRLEFGLS SyneWH810 EYTTVVGVILGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLINDIRLEFGLS SyneCC990 EYTTVVGVILGVNIPRASVTRAISTHVIMIAFCAVGQSRVRDVQDLPLLIDDVRLEFGLS ProMIT931 DYTTVVGVVLGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLINDIRLEFGLS SyneWH570 EYTTVIGVILGVHIPRASVTRAISTHIIMVAFCAVGQSRVRNVQDLPLLINDIRLEYGLT ProCMP137 EYTTIIGIILGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLINDIRLEFGLT ProMIT921 DYTTIIGIILGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLINDIRLEFGLT ProCMP198 EYTTVVGVVLGVNVPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLINDIRLEFGLT ProMIT931 DYTTVVGVVLGVNIPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLINDIRLEFGLS PromNATL2 EYTTVIGVVLNVNVPRASVTRAISTHIIMIAFCAVGQSRVRDVQDLPLLIDDVRLEFGLT Staphyloc EYSSVVAIVVGVNVSKMTTMQSTTAHVVLVTLDAQSETTVKNSTDLPLFINDIRLDYEIS Bsubtilis EYSSVIGVIVGVNVSKMTTMQSTTAYVVLVALDAQSETTVKDSTDLALFIDDIRLEYELT Nostoc712 IALGTYAEVPLLAPGVKATKISMKDDVLLGAITVSLVTDLVEFKVAFSKGSKVKLYEYLR Anaba2941 IALGTYAEVPLLAPGVKATKISMKDDVLLGAITVSLVTDLVEFKVAFSKGSKVKLYEYLR Nosto7310 IALGTYAEVPLLAPGVKASKISMKDDVLLGALCVSLVTDLVEFKVAFSKGSKVKLYEYLR SynPCC680 LGLGTYSEVALLSPGVKSTKISLKDEVLLGVLCISLVTDLVEFKVAFSKGSKIKLYEYLR Crocospha IGLGTYSDVPLFSPGVKATKISLKNEVLLGALCISLVTELVEYKVAYSKGSKIKLYEYLR Trichodes IALATYAEVALLAPGVKSTKISLKDEVLLGVICISLVTDLVEYKVAFSKGSKVKLYEYLR SynPCC794 IGLGVLSDIPLFSPGVKATKVSLKDEVLLGVLCISLVTELVEYKIAFSKGSKVKLYEYLR SynPCC630 IGLGVLSDIPLFSPGVKATKVSLKDEVLLGVLCISLVTELVEYKIAFSKGSKVKLYEYLR SynJA22-1 LGLGTYSDIALFSKGVKSTKISLKDDVLLGVLCVSLVTDLIEFNVAFSRGSKVKLYEYLR SynJA3-3A LGLGTYSDIALFSKGVKSTKISLKDDVLLGVLCVSLVTDLIEFNVAFTRGSKIKLYEYLR ThermoBP- IALGTYADIALLAPGVKATKISLKDEVLLGALCVSLVTELVEYKIAFSKGSKVKLYEYLR Gloeobact IALGTYSDIPLLSPGVKASKISLKDEVLLGVLCVSLITDLIEFKVAYTKGSKVKLYEFLR SyneRS991 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISFTKENKVKLYEYLR SyneWH780 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISFTKENKVKLYEYLR SyneCC960 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISFTKGNKVKLYEYLR SyneWH810 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISFTKGNKVKLYEYLR SyneCC990 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISFTKENKVKLYEYLR ProMIT931 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISYTKGNKVKLYEYLR SyneWH570 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCISLVTELVEFKISFTKGNKVKLYEYLR ProCMP137 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISYTKGNKVKLYEYLR ProMIT921 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISFTKGNKVKLYEYLR ProCMP198 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISYTKGNKVKLYEFLR ProMIT931 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISYTKGNKVKLYEYLR PromNATL2 LGIGVYADVPLFSPGMKSTKISLKDEVLLGVLCITLVTELVEFKISFTKGNKVKLYEYLR Staphyloc LAIATYSDVPRFAPVVSSTRVGMHNEIVIAALCISMVSDMIDFNVSFSRGNAVRIFDYMK Bsubtilis LGIGTYADVARFSPVVSSTRVGMHNEIVIAALCVSMISDMIDFKVAFSRGNAVRIFDYMK Nostoc712 FKCVEDLMNPLVFFQLSSSNLIVGGAARERAIIKLNRIEIIKRAVYPNGLVNGLFFRIRL Anaba2941 FKCVEDLMNPLVFFQLSSSNLIVGGAARERAIIKLNRIEIIKRAVYPNGLVNGLFFRIRL Nosto7310 FKCVEDLMNPLVFFQLASSNLIVGGAARERAIIKMNRLEIIKRAVYPNGLVNSLFFRIRL SynPCC680 FRCVEDLMNPLVFFQLASSNLILGGAGRERAIVRLNKIDIIKRAVYPNGLVNGIFFRIRL Crocospha FRCVEDLMNPLVFFQLASSNLILGGGGRERAIIRLNRIDIIKRAVYPNGLVNGIFFRIRL Trichodes FRCVEDLMNPLVFFQLSSSNLILGGAGREKAIVKLNKIEIIKRAVYPNGLVNSLFFRIRL SynPCC794 FRCVEDLMNPLVFFQLASSNPILGGGGKDRAIIRMNRLEIIKRAVYPNSLVDGLFFRIRL SynPCC630 FRCVEDLMNPLVFFQLASSNPILGGGGKDRAIIRMNRLEIIKRAVYPNSLVDGLFFRIRL SynJA22-1 FRCVDDLMNPLVFYQLASSNLVLAGAGRERAIIKLNKLEIIKRAVYPNGILNGLFFRIRL SynJA3-3A FRCVDELMNPLVFYQLASSNLVLAGGGRERAIIKLNKLEIIKRAVYPNGILNGLFFRIRL ThermoBP- FRCVEDLMNPLVFFQLAASNLVLGGGGRERAIIKLNKIELIKRAVYPNGIVNGLFFRIRL Gloeobact YRCVDDLLNPLVFFQLSSAQLVLGGGGRERAVIKLNRIDIIKRAVFPNGLVNGLFFRIRI SyneRS991 YRCVEDLMNPLVFFQLASANLILGGGGRERAIIRMNKLEIVRRAVFPNSLVNGLFFRIRI SyneWH780 YRCVEDLMNPLVYFQLSSANLILGGGGRERAIIRMNKLEIVRRAVYPNSLVNGLFFRIRI SyneCC960 YRCVEDLMNPLVFFQLASANLILGGGGRERAIIRLNKLEIVRRAVFPNSLVNGLFFRIRI SyneWH810 YRCVEDLMNPLVFFQLASANLILGGGGRERAIIRLNKLEIVRRAVFANSLVNGLFFRIRI SyneCC990 YRCVEDLMNPLVYFQLASANLILGGGGRERAIIRMNKLEIVRRAVFPNSLVNGLFFRIRI ProMIT931 YRCVEDLMNPLVYFQLASSNLILGGGGRERAIIRMNKLEIVRRAVFPNSLVNGLFFRVRI SyneWH570 YRCVEDLMNPLVYFQMASSNLILGGGGRERAIIRMNKLDIIRRAVFPNSLVNSLFFRIRI ProCMP137 YRCVEDLMNPLVFFQLASSNLILGGGGRERAIIRMNKLEIVRRSVFPNSLVDGLFFRIKL ProMIT921 YRCVEDLMNPLVFFQLASSNLILGGGGRERAIIRMNKLEIVRRSVFPNSLVDGLFFRIRL ProCMP198 FRCVDELLNPLVYFQLASSNLVLGGGAREKAIIKLNKIDIIKKSVFANGLVDGLFFRIKI ProMIT931 YRCVEDLMNPLVYFQLASSNLILGGGGRERAIIRMNKLEIVRRAVFPNSLVNGLFFRVRI PromNATL2 YRCVEDLMNPLVFFQLASSNLILGGGGRERAIIRMNKLEIVRRSVFPNSLVNGLFFRIKI Staphyloc YKSIDEMMGAMIFYRLAASNLILGAGGRERRVVKLRKIDLIRKAIYPSGLVNGLYHQVRI Bsubtilis YKSIEEMMGAMIYYRMSAAQLIVGAGGREKRIVKLRKIELIRRAIYASGLVDGLYHQIRI Nostoc712 LIASAATIIIFIIFKLASVGTQTSRVSIYVIGVDTSETVMVVVYFRYYKEIEYRISWDLR Anaba2941 LIASAATIIIFIIFKLASVGTQTSRVSIYVIGVDTSETVMVVVYFRYYKEIEYRISWDLR Nosto7310 LIASAATILIFIIFKLASVGTQTTRVSIHVIGVDTSETVMVVVYFRYYKEIEYRVSWDLR SynPCC680 LVASAATILVFIIFKLASISTQTSKVAIYIIGVNTSETVLVVVFFKYYREVEYKVAWDLR Crocospha LVASAATIIVFIIFKLATISTQSTRVSIYIIGVNTSETVMVVVYFKYYKEVDYKVAWDLR Trichodes LIASSATILVFIVFKLASISTQTTRVSIYVLGVDTSETVMVVVYFRYYKEVEFKVAWDLR SynPCC794 LISSSATLLVFIIFKLASISTQSSKVAVYVLGVNTSETVLIVVFFKYFREVEFKVAWDLR SynPCC630 LISSSATLLVFIIFKLASISTQSSKVAVYVLGVNTSETVLIVVFFKYFREVEFKVAWDLR SynJA22-1 LVASAGSLLIFIVFKLATVSTQTSKVSVYILGVNTSVTVLVVVFFRYYKEVDYRVAWELL SynJA3-3A LVASAGSLLIFIVFKLATVSTQTSKVSVYVLGVNTSVTVMVLAFFRYFKEVDYRVAWELL ThermoBP- LVASAASIIVFIIFKLASVGTQTSKVSVYVIGVDTSETVMVLVYFRYYREVEYKVAWDLR Gloeobact LVASAATIIVFIVFKLASVSTQTSRISVHILGIDTSETVLVLAFFRFYREVDFRVAWDLR SyneRS991 LIASAATILVFVIFRLASVSSQSTRVSVHVLGVNTSETVLVVVFFRYYKEVEYKVAWDLR SyneWH780 LIASAATILVFVIFRLASVSSQSTRVSVHVLGINTSETVLVVVFFRYYKEVEYKVAWDLR SyneCC960 LIASAATILVFVIFRLASVSSQSSRVSVHVLGVNTSETVLVVVFFRYYKEVDYKVAWDLR SyneWH810 LIASAATILVFVIFRLASVSSQSTRVSVHVLGINTSETVLVVVFFRYYKEVDYKVAWDLR SyneCC990 LIASAATILVFVIFRLASVSSQSTRVAVHVLGVNTSETVLVVVFFRYYKEVDYKVAWDLR ProMIT931 LIATAATILVFVIFRLASVSSQSSRVSVHVLGINTSETVLVVVFFRYYKEVEYKVAWDLR SyneWH570 LIASSATILVFVIFRLASVSSQSTRVSVHVLGINTSETVLVVVFFRYYREVDYKVAWDLR ProCMP137 LIATAATILVFVIFRVSSVSSQSSRVAVHVLGINTSETVLVVVFFRYYKEVEYKVAWDLR ProMIT921 LIATAATIIVFVIFRVASVGSQTSRVSVHVLGINTSETVLVVVFFRYYKEVEYKVAWDLR ProCMP198 IISSAATIIVFIILKLSTVSSSTSRVSVHILGINTSETVLVVVFFRYYKEVEYKVAWDLR ProMIT931 LIATAATILVFVIFRLASVSSQSSRVSVHVLGINTSETVLVVVFFRYYKEVEYKVAWDLR PromNATL2 LIATSATILVFVIFRLASVSSQSSKVAVHILGVNTSETVLVVVFFRYYKEVEYKVAWDLR Staphyloc IISSAATILVLVILRLATVGTSTSRISIYVINVNMPEVILVLVYYRFYKSVEYKVAYEFR Bsubtilis LVSTAATIIVLIIFRLATVGTQTSKISIYIINVNMPEVIMVLVYYRFYKSVEYKVAYEFR Nostoc712 TAEVVVEDGYLFHIDAQAREVVRYITILITDKYIQLALELRIIIAIFICRSEFNIFPNLV Anaba2941 TAEVVVEDGYLFHIDAQAREVVRYITILITDKYIQLALELRIIIAIFICRSEFNIFPNLV Nosto7310 TAEVVVEDGYLFHIDAQAREVVRFITILITDKYVQLALELRIVIALFICRSDFNIFPNMV SynPCC680 TADVVVEDGFLFNIEAQAREVVKFIFILITDKYIQLALDLRVVIALYICKSDFNIFPNMV Crocospha TAEVVVEDGYLFNIEAQAREVVRFLFIIITDKYIQLALELRIVIALYVCRNELNIFPNMV Trichodes TADVVVEDGYLFNVDAQARDVVKYLFIIITDKYIQMALELKVIISLYICRSDFNIFPNLV SynPCC794 TADIVVEDGYLFHIEAQAREVVRFIFILITDKYIQLALELRIVIALYICRNEFNIFPDMV SynPCC630 TADIVVEDGYLFHIEAQAREVVRFIFILITDKYIQLALELRIVIALYICRNEFNIFPDMV SynJA22-1 TADIVVEQSYLFNIDAQSREVARYLFVLIADKYIQMALELRIILSLYVCRSDLKIFPNLI SynJA3-3A TADIVVEQSYLFNIEAQSREVARYLFVLIADKYIQMALELRIILSLYVCRNELKIFPNLI ThermoBP- TGEIVVEDGYLFHIEAQAREVVRFIFILITDKYVQLALELKIILSLFVCRSDFNIFPNLV Gloeobact TADVVVEDSYLFHIEAQSRDVARFLFILIADKYIQMGLELRIIIALYIQRSELKLFANLV SyneRS991 TAEVVVEDGYLFNIEAQAREVVRYIFILITDRYIQLALDLRVVIALFICKNEFNIFPDMV SyneWH780 TAEVVVEDGYLFNIEAQAREVVRYIFILITDKYIQLALDLRVVIALYICKNEFNIFPDMV SyneCC960 TAEVVVEDGFLFNIEAQAREVVRYIFILITDKYIQLALDLRVVIALYICKNEFNIFPDMV SyneWH810 TAEVVVEDGFLFNIEAQAREVVRYIFILITDKYIQLALDLRVVIALYICKNEFNIFPDMV SyneCC990 TAEVVVEDGFLFNIEAQAREVVRYIFILITDKYIQLALELRVVIAIYICKNEFKIFPDMI ProMIT931 TAEVVVEDGYLFNIEAQAREVVRYIFILITDKYIQLALELRVVIAIYICKNEFNIFPDMV SyneWH570 TADVVVEDGFLFHIEAQAREVVRYIFILITDKYIQLGLELRIVIALYICKNEFNIFPDMV ProCMP137 TADVVVEDGYLFNIEAQAREVVRYIFILITDKYIQLGLDLRVVIALYVCKNEFNIFPDMV ProMIT921 TADVVVEDGYLFNIEAQAREVVRYIFILITDKYIQLALELRVVIALYICKNEFNIFPDMV ProCMP198 TAEVVVEDGYMFNIEAQSKDVVRYIFILITDKYIQLALELRVVIALFICKNEFNIFPNMV ProMIT931 TAEVVVEDGYLFNIEAQAREVVRYIFILITDKYIQLALELRVVIAIYICKNEFNIFPDMV PromNATL2 TADVVVEDGYLFNIEAQAREVVRYIFILITDKYIQLALDLRVVIALYICKNEFNIFPDMV Staphyloc VGEIIHQDSFLENVEGMAKETAKFLTILVARRLIRMATDAKIVIALYVQKNDLKIYANLV Bsubtilis VAEIIHQDSFMENIDGMAKETAKYIFILVARKLVRMATDAKVVIALYVQKNDLKLYADLV Nostoc712 EAILHANRLVYCVLAFVWGTPARTLAINLVYISEYGFVLGVIYYKVVNVDWRGTWDHQTE Anaba2941 EAILHANRLVYCVLAFVWGTPARTLAINLVYISEYGFVLGVIYYKVVNVDWRGTWDHQTE Nosto7310 EAILHANRLVYCVLAFIWGTPARTLAINLVYISEYGFVLGVIYYKVVNVDWRGTWDHQTE SynPCC680 EAILHANKLVYCVLAFIWGTPARTLVLNLVYVSEYGFVLAIIYYKVVTVDWKGTWDHQTE Crocospha EALLHANKMVYCVLAFVWGTPARTLALNLLYVSEYGFVLAIIYYKVVTIDWKGTWDHQTE Trichodes EALLHANRLVYCVLAFVWGTPARTLAINLVYVSEYGFVLGVIYYKVVTVDWKGTWDHQTE SynPCC794 EAILHANKLVYCVLAYVWATPARTLVINLVYISDFGLVLAVIYYKVVNVDWKGTWDHQTE SynPCC630 EAILHANKLVYCVLAYVWATPARTLVINLVYISDFGLVLAVIYYKVVNVDWKGTWDHQTE SynJA22-1 EAIITTNKLVYCVLAYVWGTPARTLVINLVYVSDFGFVLGVIYYEVVTIDWKGTWDHQTE SynJA3-3A EAIITTNKLVYCVLAYVWGTPARTLVINLVYVSEFGFVLGVIYYEVVTIDWKGTWDHQTE ThermoBP- EALLHANKLVYCVLAFVWGTPARTLAINLLYVSEYGFVLGVIYYKVVTVDWKGTWDHQTE Gloeobact EAILHANRLVYCVLAFVWGTPARTLVLNLVHVSEFAFVLGVIYYKVTTVDWKEIWDHQTE SyneRS991 EALLHANKMVFCVLAYIWATKARTLVLNLLYISDFGFVLAIIYYKVVNIDWKGTWDHQTE SyneWH780 EALLHANKLVYCVLAYIWATKARTLVLNLVYISDFGFVLAIIYYKVVNIDWKGTWDHQTE SyneCC960 EALLHANKLVYCVLAYIWATKSRTLALNLVYISDFGFVLAIIYYKVVNIDWKGTWDHQTE SyneWH810 EALLHANKLVYCVLAYIWATKARTLVLNLVYISDFGFVLAIIYYKVVNIDWKGTWDHQTE SyneCC990 EALLHANKLVFCVLAYIWASKARTLALNLVYISDFGFVLAIIYYKIVNIDWKGTWDHQTE ProMIT931 QALLHANKLVFCVLAFIWATKARTLVLNLVYISDFGFVLAIIYYKVVNVDWKGTWEHQTE SyneWH570 EALLHANKLAFCVLAYIWATKARTLVLNLVYIADFGFVLAIIYYKVVNVDWKGTWDHQTE ProCMP137 EALLHANKLVFCVLAYIWATKARTLVLNIVYISDFGFVLAIIYYKVVNIDWKGTWDHQTE ProMIT921 EALLHANKMVFCVLAYIWGTKSRTLVLNLVYISDFGFVLAIIYYKVVNIDWKGTWDHQTE ProCMP198 EAILHANKLVFCVLAYIWGTKSRTLVINIVYISDFGFVLAIIYYKVVNIDWKGTWDHQTE ProMIT931 QALLHANKLVFCVLAFIWATKARTLVLNLVYISDFGFVLAIIYYKVVNVDWKGTWEHQTE PromNATL2 EALLHANKMVYCVLAYVWGSKARTLVINLVYISDFAFVLAIIYYKVVNIDWKGTWDHQTE Staphyloc EGILHAVRMAYTAMCFVRGTPAKSIVIHLVHVADYGFYDAVTIHKITTVNFKEIHDYNSG Bsubtilis EGIIHAVRMAYTAMCYVRGTPAKSIVIHLVHVADFGFYDAITIHKITTVNFKEIHDYNSG Nostoc712 VDASGDVIFGSLMLYFFFFNDRISCAVDQWVILLIVLESQGTVAVLEVVVFARMKGRLVI Anaba2941 VDASGDVIFGSLMLYFFFFNDRISCAVDQWVILLIVLESQGTVAVLEVVVFARMKGRLVI Nosto7310 VDVSGDVIFGSLLLYFFFFNDRISCAVDQWVILLIVIESQGTVAVLEVVVFARMKGRLVI SynPCC680 VDVAGDIIFGNLLLYYFFFNDRISCAVDQWIILLIVIESQGTVAVLPVIVFARLKGRLAV Crocospha IDVSGDIIFGNLLLYFYFFNDRISCAVDQWVILLVVIESQGTVSVLEVIVFARLKGRLAV Trichodes VDVSGDIIFGNIMLYFFYFNDRISCAVDQWVILIIVIEGQGTVAVLEIVVFARLKGRLVV SynPCC794 VDVSGDVIFGNLLLYFFFFNDRISCAVDQWVILLIVLEGQGTVAVLEVVVFARLRGRLVV SynPCC630 VDVSGDVIFGNLLLYFFFFNDRISCAVDQWVILLIVLEGQGTVAVLEVVVFARLRGRLVV SynJA22-1 VDVSGDVIFSNLLLYFYFFNDRISCAVESWVITIVVLEGQGIVAVLEVVVFSRLKGRLAV SynJA3-3A VDVSGDVIFSNLLLYFYFFNDRISCAVESWVITIVVLEGQGIVAVLEVVVFSRLKGRLAV ThermoBP- VDVSGDVIFGNLLLYFFFFNDRISCAVDQWVILLIVLEGQGIVAVLPVVVFSRLKGRLAV Gloeobact VDVAGEVIYGSLLVYFYFFNDRISCAVDQWVILIVVLEGQGTLAVLEVVVFARLKGRLVV SyneRS991 VDVSGDVVYGNLLLWFYFFNDRVSCAVDQWVLLLIVLEGQATLAVMEVVVYARLKGRLVV SyneWH780 VDVSGDVVYGNLLLWFYFFNDRVSCAVDQWVLLLIVLEGQATLSVMEVVVYARLKGRLVV SyneCC960 VDVSGDVVYGNLLLWFFFFNDRVSCAVDQWVLLLIVIEGQATLAVMEVIVYARLKGRLVV SyneWH810 VDVSGDVVYGNLLLWFYFFNDRVSCAVDQWVLLLIVIEGQATLAVMEVIVYARLKGRLVV SyneCC990 VDVSGDVVYGNLLLWFFFFNDRVSCAVDQWVLLLIVLEGQATLAVMEVIVYARLKGRLVV ProMIT931 VDVSGDVVYGNLLLWFYFFSDRVSCSVDQWVLLLIVLDGQATLSVMEVVIYARLKGRLAV SyneWH570 VDVSGDVVYGNLLLWFYFFNDRVSCAVDQWVLLIVILEGQAILAVMEIVVYARLKGRLVV ProCMP137 IDVSGDVVYGNLLLWFYFFSDRVSCSVDQWVLLIIILEGQATLSVMEVVVYARLKGRLVV ProMIT921 IDVSGDVVYGNLLLWFYFFSDRVSCSVDQWVLLLIILDGQATLSVMEVVVYARLKGRLVV ProCMP198 VDVSGDVVYGNILLWYYFFSDRVSCSVDQWIILIIILDGQATLSVMEVVIYARLKGRLVV ProMIT931 VDVSGDVVYGNLLLWFYFFSDRVSCSVDQWVLLLIVLDGQATLSVMEVVIYARLKGRLAV PromNATL2 IDVSGDVVYGNLLLWYFFFNDRVSCAVDQWILLLVIIDGQATLSVMEVVVYARLKGRLVV Staphyloc VVVSSDVIFGNILVYYFYYNEIIEMAIESMVILIIILEGYGTLSILPVIVFSVMKLEIVV Bsubtilis VVVSSEVIFGNLLVYYYFYNEIIEMAIESMVILIIVLEGYGTLSILPVIVFSVLKLEIVV Nostoc712 TEVVTEELDLSFVKVEDEEAPDIEIFLELYKEHLKYEATYQYIEQIDLDRNIGYESKFNL Anaba2941 TEVVTEELDLSFVKVEDEEAPDIEIFLELYKEHLKYEATYQYIEQIDLDRNIGYESKFNL Nosto7310 TEVVTEELDLSFVKVEDEEAPDIEIFLELYKEHLKYEATYQYIEQIDLDRNIGYESKFNL SynPCC680 TEVITEELDLSFVKVEDDESPDIEIFLELYKEHLKYDSSYQYIEQIDLERNIGYESKFNL Crocospha TEVITEELDLAFVKVEDEEAPDIEIFLELYKEHLKYDASYQYIEQIDLDRNIGYESKFNL Trichodes TEVITEDLDLAFVKIEDDDAPDIEIFLELYKEHLKYESSYQYIEQIDLDRNIGYESKFNL SynPCC794 TEVISEDLDLAFVRVEDEEAPDVAIFLELYKEHLKYDATYQYIEQIDLDRNIGYESKFNL SynPCC630 TEVISEDLDLAFVRVEDEEAPDVAIFLELYKEHLKYDATYQYIEQIDLDRNIGYESKFNL SynJA22-1 TEVITEDIDLSFVKVEDEEAPDVEIFFELYKEHIRYEATYQRIDMIELDRNIGYESKFNL SynJA3-3A TEVITEDIDLSFVKVEDEEAPDVEIFFELYKEHIRYEATYQRIDMIELDRNIGYESKFNL ThermoBP- TEVITEELDLAFVRVDDEDAPDVEIFLELYKEHLKYDATYQYIDQIDLDRNIGYESKFNL Gloeobact TQVITEEIDLAFVKVEDEEAPDVEIFLELYKEHLKYEATYQYIEQIELDRNIGYESKFNL SyneRS991 KQVITEDIDLAFVRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL SyneWH780 KQVITEDIDLAFVRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL SyneCC960 KQVITEDIDLAFVRIDEEEAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL SyneWH810 TEVITEDIDLAFVRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL SyneCC990 KQVITEDIDLAFVRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL ProMIT931 KEIITEDIDLAFVRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL SyneWH570 TQVITEDIDLAFVRVDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL ProCMP137 KQIITEDIDLAFVKIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL ProMIT921 KQVITEDIDLAFIKIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL ProCMP198 KEIITEDIDLAFVRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL ProMIT931 KEIITEDIDLAFVRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL PromNATL2 KEVITEDIDLAFIRIDEEDAPDVEVFLELYKEHLRHEATFQYIEQIELERNIGFESKFNL Staphyloc TEIITREIGISYVRVEDDEAGNIEIYLMFFNSELKYESTYPRVEQMDFDTSLSFKTEYKV Bsubtilis TEIITRDIGISYVRVEDDESGNIEIYLMFFNSELKYESTYPRVDQMDFDTSLSFKTEYKV Nostoc712 VIKLLKLEIPYTEELKTGGQSGRLILALIEDVVNLIIRTVDLTSVPKLVAILTFAIPYIA Anaba2941 VIKLLKLEIPYTEELKTGGQSGRLILALIEDVVNLIIRTVDLTSVPKLVAILTFAIPYIA Nosto7310 VIKLLKLEIPYTEELKTGGQSGRLILALIEDVVNLIIRTVDLTSVPKLVAILTFAIPYIA SynPCC680 VIKLLKLEIPFTEELKTGGQSGRLILALIEDVVNLIIRTVELTSVPKLVAIITFAIPHVA Crocospha VIKLLKLEIPFTEELKTGGQSGRLILSLIEDVVNLIIRTVELTSVPKLVAIITFAIPHVA Trichodes VIKLLKLEIPYTEELKTGGQSGRLILALIEDVVNLIIRTVELTSVPKLVAIITFAIPYIA SynPCC794 VIKLLKLEIPYTEDLKTGGQSGRLILALIEDVVNLIIRTVDLSSVPKLVAILTFAIPYIA SynPCC630 VIKLLKLEIPYTEDLKTGGQSGRLILALIEDVVNLIIRTVDLSSVPKLVAILTFAIPYIA SynJA22-1 VIKLLKLEIPFTEELKTGGQSGRLILALIEDVVNLIIRTVELTSVPKLVAILSFAIPYIA SynJA3-3A VIKLLKLEIPFTEELKTGGQSGRLILALIEDVVNLIIRTVELTSVPKLVAILSFAIPYIA ThermoBP- LIKLLKLEIPYTEELKTGGQSGRLILALIEDVVNLIIRTVDLTSVPKLVAILTFAIPYIA Gloeobact VIKLLKLEIPYTEELKTGGQTGRLILALIEDVVNLIIRTVELTSVPKLVAILTFAIPYIA SyneRS991 LVKIMRLDVPYSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA SyneWH780 LVKIMRLDVPYSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA SyneCC960 LVKIMRLDVPFSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA SyneWH810 LVKIMRLDVPFSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA SyneCC990 LVKIMRLDVPFSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA ProMIT931 LVKIMRLDVPYSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAVITFAIPYIA SyneWH570 LVKIMRLDVPYSEDLKSGGQSGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA ProCMP137 LVKIMRLDVPYSEDLKSGGQSGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA ProMIT921 LVKIMRLDVPYSEDLKSGGQSGRLVLSLIEDVVNLIIKTVELTQVPKLVAVITFAIPYIA ProCMP198 LVKIMRLDVPFSEDLKSGGQSGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA ProMIT931 LVKIMRLDVPYSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAVITFAIPYIA PromNATL2 LVKIMRLDVPFSEDLKSGGQTGRLVLSLIEDVVNLIIKTVELTQVPKLVAIITFAIPYIA Staphyloc VIRLLRFEINYTDEIRTNASSAKIIIAFFLGLISMVVRSIDITQIIRVISILTMEVYYII Bsubtilis VIRLLRFEINYTDDIRTNASSAKIIISFFLGLISMVVRSIDITQIIRVISILTMEVYYII Nostoc712 GARYLTFMTRGDIVYQFDVVVVTSHASLRPLQMVVYYDLQKLEQVSSQIVAYPELDYIIK Anaba2941 GARYLTFMTRGDIVYQFDVVVVTSHASLRPLQMVVYYDLQKLEQVSSQIVAYPELDYIIK Nosto7310 GARYLTFMTRGDIVYQFDVVVVTSHASLRPLQMVVYYDLQKLEQVSSQIVAYPELDYIIK SynPCC680 GARYITYLTRGDIVYQFDVVIVTSHASLRPLQMVVYYDLQRLEQVSAQIVAYPELDYIIK Crocospha GARYITYLTRGDIVYQFDVVIVTSHASLRPLQMVVYYDLQRLEQVSAQIVAYPELDYIVK Trichodes GAKYITYMTRGDIVYQFDVVVVTSHASLRPLQMVVYYDLQRLDQVSSQVVAYPELDYIIK SynPCC794 GARYITFMTRGDIIYQFDVLIVTSHASLRPLQMVIYYDLQRIDQVSAQIVAYPELDYIIK SynPCC630 GARYITFMTRGDIIYQFDVLIVTSHASLRPLQMVIYYDLQRIDQVSAQIVAYPELDYIIK SynJA22-1 GARYISYLTRGDIVYQFHVVIVTSHAALRPLQMVVYYDLQRLDQVPAQVVAYPELDFVIK SynJA3-3A GARYISYLTRGDIVYQFHVVIVTSHAALRPLQMVVYYDLQRLDQVPAQVVAYPELDFVIK ThermoBP- GARYITFMTRGDIVYQFDVVIVTSHASLRPLQMVVYYDLQRIDQVSAQIVAYPELEYIIK Gloeobact GARYITYLTRGDIVYQFDVVVVTSHAALRPLQMVIYYDLQRIDQISAQIVAFPELDFIVK SyneRS991 NARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVIAYPELEYVIK SyneWH780 NARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVIAYPELDYVIK SyneCC960 NARYITFLSRGDIIYQFDVLIVTSHASLRALQMVVHYDLQRIDQVSAQVIAYPELDYVIK SyneWH810 NARYITFLSRGDIIYQFDVLIVASHASLRALQMVVHYDLHRIDQVSAQVIAYPELDYVIK SyneCC990 NARYITFLSRGDIIYQFDVLIVTSHASLRALQMVVHYDLQRIDQVSAQVIAYPELDYVIK ProMIT931 NARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVVAYPELDYVIK SyneWH570 GARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVIAYPELDYVIK ProCMP137 NARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVIAYPELDYVIK ProMIT921 NARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVIAYPELDYVIK ProCMP198 NARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVIAYPELDYVIK ProMIT931 NARYITFLSRGDIIYQFDVLIVTSHASLRPLQMVVHYDLQRIDQVSAQVVAYPELDYVIK PromNATL2 NARYITFLSRGDIVYQFDVLIVASHASLRPLQMVVHYDLHRIDQVSAQVIAYPELDYVIK Staphyloc NSRFITYLTVANFVFGNDMVVATANSAMAPMVAAVYFGYQRIDEIPSRVVGFTDIDYIIE Bsubtilis NSKFITYLTVANFIFGNDMVVATANSAMAPMVAAVYFGYQRIDEIPSRVVGFTDIDYIIE Nostoc712 IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRYPIVLWGRPEMEVTVIVLIQFFTLNEALNR Anaba2941 IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRYPIVLWGRPEMEVTVIVLIQFFTLNEALNR Nosto7310 IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRYPIVLWGRPEMEVTIIVLIQFFTLNEALNR SynPCC680 IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRYPIVLWGRPEMEVTIVVLIQFYILNEALNR Crocospha IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRYPIVLWGRPEMEVTVVVLIQFYTLNEALNR Trichodes IQEVEIEDQPPKRDNGRIRFTAMVVAQKIRYPVVLWGRPEMEVTVVVLIQFFTLNEALNR SynPCC794 IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRYPIVLWGRPEMEVTVVVLIQFYILNEALNR SynPCC630 IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRYPIVLWGRPEMEVTVVVLIQFYILNEALNR SynJA22-1 IQEVEVEDQPPRRDNGRVRFTAMVVALKIRYAVVLWARPEMEVTVVVLIQFFILNEALNR SynJA3-3A IQEVEVEDQPPRRDNGRVRFTAMVVALKIRYAVVLWARPEMEVTVVVLIQFFILNEALNR ThermoBP- IQEVEIEDQPPKRDNGRVRFTAMVVAQKIRFPIVLWGRPEMEVTVVVLIQFYILNEALNR Gloeobact IQEVEVEDQPPKRDNGRVRFTAMVVAQKIKYPIVLWGRPEMEVTVVVLIQFFTLNEALNR SyneRS991 IQEVEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR SyneWH780 IQEVEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELSVVVLIQLYTLNEALNR SyneCC960 IQEVEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR SyneWH810 IQEVEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR SyneCC990 IQEVEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR ProMIT931 IQEVEIEDQPPKRDNTRVRYTAMVAAQRIRFPIVMWARPEMELSVVVLIQLYTLNEALNR SyneWH570 IQEVEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR ProCMP137 IQEIEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR ProMIT921 IQEIEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR ProCMP198 IQEIEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR ProMIT931 IQEVEIEDQPPKRDNTRVRYTAMVAAQRIRFPIVMWARPEMELSVVVLIQLYTLNEALNR PromNATL2 IQEVEIEDQPPKRDNTRVRYTAMVVAQRIRYPIVMWARPEMELTVVVLIQLYTLNEALNR Staphyloc SDDVDVVELTARHETGIVKFNVQLVVQKVKYPIILMAHVGADLTIVAMLFFYTIVKTYEK Bsubtilis SDDVDIVELTARHETGIIKFNVQLVVQKVKYPIIMMAHVGADLTVVAMLFFYTIVKTYEK Nostoc712 NVKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI Anaba2941 NVKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI Nosto7310 NVKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SynPCC680 NIKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI Crocospha NVKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI Trichodes NIKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SynPCC794 NIKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SynPCC630 NIKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SynJA22-1 NIKLLFGAAKGFATVSISDLPKAEIAGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SynJA3-3A NIKLLFGAAKGFATVSISDLPKAEIAGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI ThermoBP- NIKLLFGAAKGFATVSISDLPKAEIAGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI Gloeobact NIKLLFGAAKGFATVSISDLPKAEIAGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SyneRS991 NVKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SyneWH780 NVKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SyneCC960 NIKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SyneWH810 NVKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SyneCC990 NIKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI ProMIT931 NIKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI SyneWH570 NIKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI ProCMP137 NVKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI ProMIT921 NVKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI ProCMP198 NVKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI ProMIT931 NIKLLYGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI PromNATL2 NVKLLFGAAKGFATVSISDLPKAEITGITEVRKIDTFLNSVMMASGRGSVRQLGMRGMQI Staphyloc YILPIFKPKDFCIFKDWECGKRDRVTEMGHILPSHIMEEVIFASVVDPTLEKKLLSEEYK Bsubtilis YILPIFKPKDFCIFKDWECGKRDRVTEMGHILPSHIMEEVIFASVVDPTLEKKLLSEEYK Nostoc712 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA Anaba2941 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA Nosto7310 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA SynPCC680 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA Crocospha IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLATDGEAGIAA Trichodes IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA SynPCC794 IITFREGEYSSYRVASGYRLVDVQVHEDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA SynPCC630 IITFREGEYSSYRVASGYRLVDVQVHEDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA SynJA22-1 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA SynJA3-3A IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA ThermoBP- IITFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA Gloeobact IIAFREGEYSSYRVASGYRLVDVQVREDCTLGRVVVRSPLTCSVCCYGWLAHDGEAGIAA SyneRS991 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA SyneWH780 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA SyneCC960 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA SyneWH810 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA SyneCC990 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVIRSPLTCSVCCYGWLAHDGEAGIAA ProMIT931 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA SyneWH570 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA ProCMP137 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVIRSPLTCSVCCYGWLAHDGEAGIAA ProMIT921 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLIVRSPLTCSVCCYGWLAHDGEAGIAA ProCMP198 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVIRSPLTCSVCCYGWLAHDGEAGIAA ProMIT931 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA PromNATL2 IITFREGEYSSYRVASGYRLVDVQVREDCTLGRLVVRSPLTCSVCCYGWLAHDGEAGIAA Staphyloc YFAMGAELLIDLELEGQRAIKRLVERNGNPPVQLIVQNEMLQALIGRRGPVTRLKSSHLK Bsubtilis YFAMGAELLIDLELEGQRAIKRLVERNGNPPVQLIVQNEMLQALIGRRGPVTRLKSSHLK Nostoc712 QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLPPANAERNPVGKE Anaba2941 QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLPPANAERNPVGKE Nosto7310 QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLPPANAERNPVGKE SynPCC680 QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLPPAGAEREPVGRE Crocospha QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLPPAGAEREPVGRE Trichodes QSIEPGTQTMRTFHTGVFTARRTHGGSEEVKVLADQGTGLWLGVYNLPPANAEREPVGKE SynPCC794 QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLLPAGAEREPVGKE SynPCC630 QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLLPAGAEREPVGKE SynJA22-1 QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWMGVYNLPPAGAEREPVGKE SynJA3-3A QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWMGVYNLPPAGAEREPVGKE ThermoBP- QSIEPGTQTMRTFHTGVFTARRTHGGSEEVKVLADQGTGLWLGVYNLPPAGAEREPVGKE Gloeobact QSIEPGTQTMRTFHTGVFTARRTHGGSEEAKVLADQGTGLWLGVYNLPPAGAEREPVGKE SyneRS991 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGRE SyneWH780 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGRE SyneCC960 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE SyneWH810 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE SyneCC990 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE ProMIT931 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGRE SyneWH570 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE ProCMP137 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE ProMIT921 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE ProCMP198 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE ProMIT931 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGRE PromNATL2 QSIEPGTQTMRTFHTGVSTGRRTHGGSQEAKVLADQGTGRWLGVYNLPPAGAEREPIGKE Staphyloc GKQRFRQNLGKRVDYSRSVGKVMELHPEPVTYFDAHVSARLLANILNPKGGLTENVINKK Bsubtilis GKQRFRQNLGKRVDYSRSVGKVMELHPEPVTYFDAHVSARLLANILNPKGGLTENVVNKK Nostoc712 LNDSLVEGVKDIQGQNDILEVVGGGLRPPQRVKLLTDEDRLIEDDIRRDRGYSALLRDLL Anaba2941 LNDSLVEGVKDIQGQNDILEVVGGGLRPPQRVKLLTDEDRLIEDDIRRDRGYSALLRDLL Nosto7310 LNDSLVEGVKDIQGQNDILEVVGGGLRPPQRVKLLTDEDRLIEDDIRRDRGYSALLRDLL SynPCC680 LNDSLVEGVKDIQGQNDILEIIGGGLRPPQRVKLLTDEDRLIEDDIRRDRAYSALLRDLL Crocospha LNDSLVEGVKDIQGQNDILEIIGGGLRPPQRVKLLTDEDRLIEDDIRRDRAYSALLRDLL Trichodes LNDSLVEGVKDIQGQNDILEVIGGGLRPPQRVKLLTDEDRLIEDDIRRDRGYSALLRDLL SynPCC794 LNDSLVEGVKDIQGQNDILEIIGGGLRPPQRIKLLTDEDRLIEDDIRRDRAYSALLRDLL SynPCC630 LNDSLVEGVKDIQGQNDILEIIGGGLRPPQRIKLLTDEDRLIEDDIRRDRAYSALLRDLL SynJA22-1 LNDSLVEGVKDIQGQNDILEIIGSGLRPPQRVKLLTDEDRLIEDDIQRDRAYSALLRDLL SynJA3-3A LNDSLVEGVKDIQGQNDILEIIGSGLRPPQRVKLLTDEDRLIEDDIQRDRAYSALLRDLL ThermoBP- LNDSLVEGVKDIQGQNDILEIIGGGLRPPQRVKLLTDEDRLIEDDIQRDRAYSALLRDLL Gloeobact LNDSLVEGVKDILGQNDILEVIGGGLRPPQRVKLLTDEDRLIEDDIRRDRGYSALLRDLL SyneRS991 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRVKLLTDEDRLIEDDIQRDRGYSSLLRDLL SyneWH780 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRVKLLTDEDRLIEDDIQRDRGYSSLLRDLL SyneCC960 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL SyneWH810 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL SyneCC990 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL ProMIT931 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL SyneWH570 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL ProCMP137 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL ProMIT921 INDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL ProCMP198 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL ProMIT931 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL PromNATL2 LNDSLIEGVKDIQGQNDILEIVGGGLRAPQRIKLLTDEDRLIEDDIQRDRGYSSLLRDLL Staphyloc LGIAEITTLDRMLKGTVGVDIVDGAIGPMRGIISFELDKTRDAEGERTEAGVGAGIPTMR Bsubtilis IGIAEITTLDRMLKGTVGVDIVDGAIGPMRGIISFELDKTRDAEGERTEAGVGAGIPTMR Nostoc712 VERKTGELKLEIVSHVESIIVTILRAANIEESRRAQRFLIDLDRGGVLFNPLELQEKIWT Anaba2941 VERKTGELKLEIVSHVESIIVTILRAANIEESRRAQRFLIDLDRGGVLFNPLELQEKIWT Nosto7310 VERKTGELKLEVVTHVESIIVTILRAANIEESRRAQRFLIDLDRGGVLFNPLELQEKIWT SynPCC680 VERKTGELKLEVITHVESVIVTILRAANIEESRRAQRFIIDFNGGGILYNPLELQEKIWT Crocospha VERKTGELKLEIVTHVQSIIVTVLRAANIEESRRAQRFIIDFDGGGILFNPLELQEKIWT Trichodes VERKTGELKLDVVTHVQSIIVTILRAANIEESRRAQRYLINFDGGGILYNPLELQEKIWT SynPCC794 VERKTGELKLDIVTHVESIIVTILRAANIEESRRAQRFIIDFDGGGVLYNPLELQEKIWT SynPCC630 VERKTGELKLDIVTHVESIIVTILRAANIEESRRAQRFIIDFDGGGVLYNPLELQEKIWT SynJA22-1 VERKTGELKLEIVTHVESVIITVLRAANIEESRRAQRYIIDLDGGGILFNPLELQEQIWT SynJA3-3A VERKTGELKLEIVTHVESVIITVLRAANIEESRRAQRYIIDFDGGGILFNPLELQEQIWT ThermoBP- VERKTGELKLDIVTHVESIIVTILRAANIEESRRAQRYLIDFDGGGILYNPLELQEKIWI Gloeobact VERKTGELKLEVVTHVESVIITVLRAANIEESRRAQRYIINFNGGGVLYNPLELQEQIWT SyneRS991 VERKTGELRLEIVTHVESVIVTVLRAANIEESRRAQRYIIDFDGGGVLFNPLELQEKIWT SyneWH780 VERKTGELRLEIVTHVESVIVTVLRAANIEESRRAQRYIIDFDGGGILFNPLELQEKIWT SyneCC960 VERKTGELRLEIVTHVESVIVTVLRAANIEESRRAQRYIIDFDGGGILFNPLELQEKIWT SyneWH810 VERKTGELRLEIVTHVESVIVTVLRAANIEESRRAQRYIIDFDGGGVLFNPLELQEKIWT SyneCC990 VERKTGELRLDIVTHVESVIVTVLRASNIEESRRAQRYIIDFDGGGILFNPLELQEKIWT ProMIT931 VERKTGELRLEIVTHVESVIVTILRASNIEESRRAQRYIIDFDGGGILFNPLELQEKIWT SyneWH570 VERKTGELRLDIVTHVESVIVTILRAANIEESRRAQRYIIDFDGGGVLFNPLELQEKIWT ProCMP137 VERKTGELRLDIVTHVESVIVTIIRAANIEESRRAQRYIIDFDGGGILFNPLELQEKIWT ProMIT921 VERKTGELRLEIVTHVESVIVTIIRAANIEESRRAQRYIIDFDGGGILFNPLELQEKIWT ProCMP198 VERKTGELRLEIVTHVESVIVTILRAANIEESRRAQRYIIDFDGGGILFNPLELQEKIWT ProMIT931 VERKTGELRLEIVTHVESVIVTILRASNIEESRRAQRYIIDFDGGGILFNPLELQEKIWT PromNATL2 VERKTGELRLDIVTHVESVIVTILRASNIEESRRAQRYIIDFDGGGVLFNPLELQEKIWT Staphyloc THTGVAQFKIDVVTKLEMVVVLIIHRAELDKRLKMNKYIVNLNGASIMYTVKFEWQKLLI Bsubtilis THTGVAQFKIDVITKLEMVVVLVIHRAELDKRLKMNKFIVNLNGASIMYTVKFEWQKLLI Nostoc712 ILVILGVGRSRGMIAYSLAKVNLVAITFGALEKQMLKRQGVGLRERVSVAQVLGTEDYLT Anaba2941 ILVILGVGRSRGMIAYSLAKVNLVAITFGALEKQMLKRQGVGLRERVSVAQVLGTEDYLT Nosto7310 ILVILGVGRSRGMIAYSLAKVNLVAITFGALDRQMLKRQGVGLRERVSVAHVLATEDYLT SynPCC680 LIVILGVGRSRGMIAYALSKVNLVATTFGALDKQMLKKQGIGLRERVSIAHVLGTEDYLT Crocospha LLVILGVGRSKGMIAYALSKVNLVATTFGALDKQMLKRQGIGLKERISIAQILGTEDYLT Trichodes LIVILGVGRSRGMIAYALSKVNLVAITFGALDKQMLRRQGVGLKERISVAHVLATEDYLT SynPCC794 LLVILGVGRSRGMIAYSLAKVNLVAITFGALDKQVLRRQGVGLRERISVAQILASEDYLT SynPCC630 LLVILGVGRSRGMIAYSLAKVNLVAITFGALDKQVLRRQGVGLRERISVAQILASEDYLT SynJA22-1 LLVVLDVGKYRGMISYALAKINLVAITFGALEKQMLKRQGVGLRERVSVAHVLGTDDYIT SynJA3-3A LLVVLDVGKYRGMISYALAKINLVAITFGALEKQMLKRQGVGLRERVSVAHVLGTEDYIT ThermoBP- LLVVLGVGRSRGMIAYSLSKVNLVAITFGALDKQMIKRQGVGLKERVSVAHVLGTEDYLT Gloeobact LLVILDVGRSRGMISYALSRVNLVAITFGALDRQMLKRQGVGLKERVAVAHVMGTDDYLT SyneRS991 LLVILGIGRSKGMIAFALAKVNLVAITFGALDKQMLRRQGVGLRERISVAHVLGSEDHIS SyneWH780 LLVILGIGRSKGMIAFALARVNLVAITFGALDKQMLRRQGVGLRERISVAHVLGSEDHIS SyneCC960 LLVILGIGRSKGMIAFSLARVNLVAITFGALDKQMLRRQGVGLRERISVAHVLGSEDHIS SyneWH810 LLVILGIGRSKGMIAFALAKVNLVAITFGALDKQMLRRQGVGLRERISVAHVLGSDDHIS SyneCC990 LLVILGIGRSKGMIAFALARVNLVAITFGALDKQMIRRQGVGLRERISVAHVLGSEDHIS ProMIT931 ILVILGIGRSKGMIAFALAKINLVAITFGALDKQMLRRQGVGLRERISVAHVLGSEDHIS SyneWH570 LLVILGIGRSKGMIAFALARVNLVAITFGALDKQMLRRQGVGLRERISVAHVLASEDHLT ProCMP137 ILVILGIGRSKGMIAFALARVNLVAITFGALDKQMLRKQGVGLRQKISVSHVLGSEDHIS ProMIT921 ILVILGIGRSKGMIAFALARVNLVAITFGALDKQVLRKQGVGLRERISVAHVLASEDHIT ProCMP198 ILVILGIGRSKGMIAFALAKINLVAITFGALDKQMLKKQSIGLKQKISIAHVLGSEDHIS ProMIT931 ILVILGIGRSKGMIAFALAKINLVAITFGALDKQMLRRQGVGLRERISVAHVLGSEDHIS PromNATL2 LLVILGIGRSKGMIAFALAKVNLVAITFGALDKQMIRKQGVGLRERISVSHVLGSEDHIS Staphyloc LLLVIGIAKYKAIVAYAVAKVYVISIQSQGVEKEMLRRKSIAMRERVAVAHVMGTDEYIS Bsubtilis LLLVIGIAKYKAIVAYAVAKIYVISIQSQGVEKEMLKRKSVAMRERVSVAHVMGTDEHLS Nostoc712 PRYRMRLELIGLAYLPYAIGVNIWQYAYIMTSVLEQSRLSWLSAYRLVCSVELWEVMTAN Anaba2941 PRYRMRLELIGLAYLPYAIGVNIWQYAYIMTSVLEQSRLSWLSAYRLVCSVELWEVMTAN Nosto7310 PRYRMRMELIGLSYLPYAIGVNVWQYAYIMTSVLEQSRLSWLSAYRLVCSVELWEVMTAN SynPCC680 PRYRTKLEHIGLAYLPYAIGINVWQFAYIMTSVLEQSRLSWMSAYRLVCSVELWVVMTGN Crocospha PRYKTRLEHIGLAYLPYAIGINVWQFAYIMTSVLDQSRISWLSAYRLVCSVELWEVMTGN Trichodes PRYKTRMEHIGLAYLPYAIGVNVWQFAYIMTSVLEKSRLSWLSAYRLVCSVELWEVMTGN SynPCC794 PRYRTRLAHIGLAYIPYAIGVNVWQFAYIMTSVLDQSRLSWMSAYRLVCSVELWEVMTAN SynPCC630 PRYRTRLAHIGLAYIPYAIGVNVWQFAYIMTSVLDQSRLSWMSAYRLVCSVELWEVMTAN SynJA22-1 PRYRTKLEHIGLAFLPYALGVNIWQYAFLMTSVLEQSRLSWLSAYRLVCSVELWVVMTAN SynJA3-3A PRYRTKLEHIGLAFLPYALGVNIWQYAFLMTSVLEQSRLSWLSAYRLVCSVELWVVMTAH ThermoBP- PRYKTRLEHIGLAYLPYAIGVNVWQFAYIMTSVLDQSRLSWMSAYRLVCSVELWEVMTAH Gloeobact PRYRTRLEHVGLAFLPYALGVNVWQFPFLMTSVLEQSRLSWLNVHRLVCTVELWVVMTGH SyneRS991 PRYRTRLAHVGLAYLAYSIGINVWQFPFILTSVLEQSRLSWLSAHRLVCSVELWEVMVAN SyneWH780 PRYRTRLAHVGLAYLAYSIGINVWQFPFILTSVLEQSRLSWLSAHRLVCSVELWEVMTAN SyneCC960 PRYRTRLAHVALAYLPYSIGVNVWQFPFILTSVLEQSRLSWLSAHRLVCSVELWEVMTAN SyneWH810 PRYRTRLAHVAIAYLPYSIGVTVWQFPFILTSVLEQSRLSWLSAHRLVCSVELWEVMTAN SyneCC990 PRYRTRLAHVGLAYLPFSIGVTIWQFPFILTSVLEQSRLSWLSAHRLVCSVELWEVMTAN ProMIT931 PRYRTRLAHVGLSYLPFSIGVTVWQFPFILTSVLEQSRISWLNAYSLVCSVELWEVLTAN SyneWH570 PRYRTRLAHIGLAFLPYAIGVTVWQFPFILTSVLEQSRLSWLNAHRLVCSVELWVVMTGH ProCMP137 PRYRTRLAHIGLAYLPYSIAINVWQFPFILTSVLEQSRLSWMSAHRLVCSVELWEVMVAN ProMIT921 PRYRTRLAHIGLAYLPYSIAINVWQFPFILTSVLEQSRLSWMSAHRLVCSVELWEVMVAN ProCMP198 PRYRTKLAHIGLSFLPYSIGINVWQFPFILTSVLEQSRISWLNAHRLVCSVELWEVLTAN ProMIT931 PRYRTRLAHVGLSYLPFSIGVTVWQFPFILTSVLEQSRISWLNAYSLVCSVELWEVLTAN PromNATL2 PRYRTRLALVGLAYLPFSIGINIWQFPFILTSVLEQSRLSWLSAHRLVCSVELWEVMVAN Staphyloc ASARTRMALIGIAYIPYAIGVNVRYFPYIMMGTVEKYKLGYLSVYRIIDTLDFYEIMTAN Bsubtilis ASARTRLELVGLAYIPYAIGVNVRYFAYIMMGTVEKYKLGYLSVYRIIDTLDFYEIMTAH Nostoc712 LDTCVHLGFFVYVPDGKVPVLWVLLDQAYAQLTLFMATHRNSNIAYARRRMILEAKIFFL Anaba2941 LDTCVHLGFFVYVPDGKVPVLWVLLDQAYAQLTLFMATHRNSNIAYARRRMILEAKIFFL Nosto7310 LDTCVHMAFFVYVPDGKVPILWVLLDQAYAQLTLFMATHLGTGIAYARRRMILEAKLFFL SynPCC680 LDTCVHLGFFVYVPDGKVPVLWVLLDQAYAQMTLFMADHRGSNIAYASRRMLLEAKLFFL Crocospha LDTCVHMAFFIYVPDGKVPVLWVLLDQAYAQLTLFMADHRGTNIAYASRRMLLEAKLFFL Trichodes LDTCVHMAFFVYVPDGKVPVLWVLIDQAYAQLTLFMGTHLGSGIAYARRRMLLEAKLFFL SynPCC794 LETCVHMAFFVYIPDGKVPVLWVLLDQAYAQLTLFMADHRNSGIAYARRRMLLAAKLFFL SynPCC630 LETCVHMAFFVYIPDGKVPVLWVLLDQAYAQLTLFMADHRNSGIAYARRRMLLAAKLFFL SynJA22-1 LDSCIHLGFFVYVPDGGVPILWVLLPQAYAQLNMFMGDNRGTGIAYARRRLLLEARLLFL SynJA3-3A LDSCVHLGFFVYVPDGGVPILWVLLPQAYARLTMFMGDNRGSGIAYARRRLLLEARLLFL ThermoBP- LDTCVHMAFFVYVPDGKVPILWVLLDQAYAQLNLFMGDHRNSNIAYARRRLLLEAKLFFL Gloeobact LDTAVHLAYFVHVPEGGVPIYWVLIPRAFARLNMFMATNRGTGIAYARRRLLLEARLFFL SyneRS991 LETCVHMAFFVHVPDGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFVL SyneWH780 LETCVHMAFFVYVPDGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFVL SyneCC960 LETCVHMAFFVYVPDGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFVL SyneWH810 LETCVHMAFFVYVPDGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFVL SyneCC990 LETCVNMAFFVYVPEGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFVL ProMIT931 VDTCVHMAFYIYVPDGKVPILWVLIDQVYARLNLFMGDHRNSGIAFARRKLLLEAKLFFL SyneWH570 LDTCVHMAFFVHVPDGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFVL ProCMP137 LDTCVHMAFFVYVPDGKVPILWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFFM ProMIT921 LDTCVHMAFFVYVPDGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFFM ProCMP198 VDTCVNMAFFIYVPDGKVPILWVLIDQAYARLNLFMGDHRNSGIAFARRKLLLEAKLFFL ProMIT931 VDTCVHMAFYIYVPDGKVPILWVLIDQVYARLNLFMGDHRNSGIAFARRKLLLEAKLFFL PromNATL2 LDTCVHMAFFIYVPDGKVPVLWVLIDRAFAQLTLFMADHRNSGIAFARRRLLLEAKLFFL Staphyloc LDSAVHMAYFVHVGESGAVVYFTMIDQAYEQLTLYFGDHRGSGVSYPRKRLLFEGRLFFL Bsubtilis LDSAIHMAYFVHVGESGAVVYFTMIDRAYEQMTLYFGDHRGSGVSYPRKRLLFEGRLFFL Nostoc712 LLHTQLFKLVGVDK Anaba2941 LLHTQLFKLVGVDK Nosto7310 LLHTQLFKLVGVDK SynPCC680 LLHTQLFKLVGVDK Crocospha LLHTQIFKLVGVDK Trichodes LLHTHLFKLVGVDR SynPCC794 LLHTHLFKLVGVDK SynPCC630 LLHTHLFKLVGVDK SynJA22-1 LLHTQIFKLVGVDK SynJA3-3A LLHTQIFKLVGVDK ThermoBP- LLHTHLFKLVGVDK Gloeobact LLHSQIFPLVGVDK SyneRS991 IMHTHLFKVVGVDR SyneWH780 IMHTHLFKVVGVDR SyneCC960 IMHTHLFKVVGVDR SyneWH810 IMHTHLFKVVGVDR SyneCC990 IMHTHLFKVVGVDR ProMIT931 LMHTHLFKVIDVNR SyneWH570 LLHSHLFKVVGVDR ProCMP137 LMHTHLFKVVGVDR ProMIT921 LMHTHLFKVVGVDR ProCMP198 LMHTHLFKVIDVNR ProMIT931 LMHTHLFKVIDVNR PromNATL2 LMHTHLFKVVGIDR Staphyloc LMFTQIMPLVGIDR Bsubtilis LLFTQIMPLVGINR
Example Output File (Binary)
26 1214 Nostoc712 00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 Anaba2941 00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 Nosto7310 11000000000000001000000000000000000000000000000000000000000000000000000000000000000001000000000000000000000000000100000000000000000000000010000000000001100000000000000000000000000000000000000000001000000000000000001001000000000000001000000000000000100000000000000100001000000000000000000000000010000000000000000000000000000100000000010000000010010000010000001000000000000000001000000000000000000000000000000000000000000000100000000010000000000000000000000010000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000101000000000000000000000000000000000000000000000000000000000000000000000000000001100000000000000000100100000000000010000010000000001000000000000000000000000000000000000000000011000000000001000000000000000000011110000000000000100000000000000000 SynPCC680 11000000000000011010000000000000000000000000000000000000000010000001110000010000000001000000000000010000001100000100000011000010010010000100001001000011110000000000000000001000000001000000000000001000001000100000110010100000000000000100000001000000110000000011000010100100010000010001010010100111000000100000010010100000001101000000000000100110011001010000001000000001000000001000000001100001000000011000000100010000000000110010001010010000000000000010000010000000001010000100001100010000000000000101000000000000000011100000000010000000000000000000001000000000000000000000000000001000000000010000011000001011000000000000100000000000000000001000001000000000000000000000000000000000000000000000000000000000110000011000000001000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000000000000000000000000000000000000100010001000000000000000000000110000000000000000000000000000100000000000000000000111000010000000000000000000100111001010000000000011000000000000001010000001000001000001001000000010100000000000001100100000000000101001000000000000000100000000000010001000000000000000000000000000000001000001001000000100010000100000000000000000 Crocospha 01000000000000011011000000000100000000100000000000010000000000001001011000010101100001000000000000010000001100100101001001000011000110000000001011000001110000100010001000001000000001000000000000001000001001100000010000100000000000000100000001000000010000000111001100000100010000000000010000110111000000000000000010100000000111010000000000000010011100101000001000100001100000000000000000100101000000011000000110010000000010100010001010001000000000000000001010000001000010000100001100010000001000000000000000000000000010100000000000000000000000000000001000000000000000100000000000001000000000010000011000001011000000000000100000000000000000001000001000000000001000000000000000000000000000000000000000000000010000010000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000000000000000000100010001000000000000000000000110000000000000000000000000000100000000000000000000001001000001000000000000000100101001000000000000010000000001000001010000001000001000000001001001010010000000000011000100000000000101001000000001000100000000000000000001000000011001000000000000000000000000001001100000100010000100000000100000000 Trichodes 01000000000000011010000000000000000000000000000000001000010000000010100000010000000001000000000000000000001100100100000000010000010000000100001001000010110000000010000000000000000001000000000000000000001000100100100010000000000000001000000000001000110010000011000100000010000000000000000000101111000000100000000011000001001011010000000100001100111000010000000000100000000000000000000000000001000000000000000100010000000000100010001100000100000000000000010010100000000100000100000100010010001000100110000000000000000001100000000000000000000000000000000000000000000000000000000000001000000000010000000000101010000000000000000000000000000000001010000010000000000000000000000000000100000000000000100000000000010000000000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000100000000000000000000000010000000000000000000000000010000000000000000000000000000000000000000000000001101001000000000000000000001001101001010000000000011000000000000001010000000000001000010000001001000100100000000011010100000000000001001000000000100000000000000000000001000000011000000000000000010000000000010011010000000010000100000001000000001 SynPCC794 11100000000000011010000000000000000000000010000000000000010000000100100000000000000000100100000000010100011100000111010101001111100110000000101001000011110000100010100000000000000001000000000000001000101001111000011001000000000010010000000000101001110000000011001010110010010000011001010110101111000000110000000000100000000101000000000000000010011000100000011000000001000000010010000001000000011010010000000000010000000000100000001010000000000000000000000000100000000000000110000100011010001001000000001100000000000010000000000000000000000000000000000001000000000000000000000000000010000000000000000000001000000001000001100000000000001000001110001000000000000000000000000000000000000000000000000000000000010000011000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000100100010000000000000000000000000110000000001000000000000000000100000000000000000001001000000000000000000000000100101000010000000000010000000000000000000000000000001001010000000001000010110000000001001100000100000001001000000001000000100000000000000000001000011000010000000000000000000000001000010000000010100100000001000000000 SynPCC630 11100000000000011010000000000000000000000010000000000000010000010100100000000000000000100100000000010100011100000111010101001111100110000000101001000011110000100010100000000000000001000000000000001000101001111000011001000000000010010000000000101001110000000011001010110010010000011001010110101111000000110000000000100000000101000000000000000010011000100000011000000001000000010010000001000000011010010000000000010000000000100000001010000000000000000000000000100000000000000110000100011010001001000000001100000000000010000000000000000000000000000000000001000000000000000000000000000010000000000000000000001000000001000001100000000000001000001110001000000000000000000000000000000000000000000000000000000000010000011000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000100100010000000000000000000000000110000000001000000000000000000100000000000000000001001000000000000000000000000100101000010000000000010000000000000000000000000000001001010000000001000010110000000001001100000100000001001000000001000000100000000000000000001000011000010000000000000000000000001000010000000010100100000001000000000 SynJA22-1 01000000000000011010000000000000000000101010000000110000000011000000100010010000000100000000100000010000101000110101000111000011110111000100001000000011100000001001000010000000000001001000000001001000011100100000000011000000000001100000000001000111100010000101000010010110010010010001000000110011010100110001100010000100010011100100000100000001111100011100000100011101000000010000000001000001011000000000100110010000000000100000011010001000000000011000111000100100000000010100001100010011000000000000001000100000011000000101101000000000000000000000001000000000000000000000000000001000000000001000000000001111000000000100100000100000000000001010011010000000110000000100000100000000000000100001100010000000010000001000000001000000000000000000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000100000000100010000000000000000000000000110100000000000000000000001000100000000000000000000001000010101000000000000001100001001000000000100010010100110000101000100000000000000000000000000000100001001000001100100001000010000000011000000000000000000000000010000000101000000000001001000001000000110011101110000000110001110000000100000000 SynJA3-3A 01000000000000011010000000000000000000101010000000110000000011000000000010010000000100000000100000010000101000110101000111000011110111000100001000000011100000001001000110001000000001001100000001001000011101100000000011000000000001100000000001000111100010000101000010010010010010000111000100110011010100110001100010100100010011100100000100000001111100101100000100011101000000010000000001000001001000000000100110010000000000100000011010001000000000011000111000100100000000010100001100010011000000000000001000100000011000000101101000000000000000000000001000000000000000000000000000001000000000001000000000001111000000000100100000100000000000001010011010000000110000000100000100000000000000100001100010000000010000001000000001000000000000000000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000100000000100010000000000000000000000000110100000000000000000000001000100000000000000000000001000010101000000000000001100101001000000000100010010100110000101000100000000000000000000000000000100000001000001100100001000010000000011000000000000000000000000010000100100000000000001001000001000010010011101010000000110001110000000100000000 ThermoBP- 01100000000000011010000000000000000000100010000000000000000001000000100010010000000001000000000000010000001000100100000000000001110000000000001001000001100000100010100000000000000001000000000000001100011001100000000010010000000001000000000001000010010000000000000010010000000000000100000010100111000001010000000000100000000101000000010000001001110100010000000000100001000000000000000000000101000000000000000100010000000000100000001010000000000000000000000000100100001000010100001100010000001001010010001000000000000010000001000000000000000010000000000000000000000000000000000000000000000000000000000000001000000000000000100000000000000000001110001000000001000000000000000000000000000000000010000000000000010000011000000001000000000000000000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000100000000000000000000100010000000000000000000000000110000000000000000000000001000100000000000000000001001000000000000000000000001000101001010000000000110010000000000000010000000000001000100000001000000100000000000011000100000000000001001000000001000000100000000000000000100000011000000000001000000000000100011000000000000110000100000001000000000 Gloeobact 11000000000000011010000010000000000000001010000101101000010000000000110011010000000101000011000001010000001010000100000000000011100010000010001001000011100010001000001100000000010011001001000000000011011001100001000000100000010000000000000101000000010010000001000001011110100000010111001010111011000000100000100000100101010111000100000110000000011010001110100000000000000000000000000001100011001100000000001100011100000000110100100011001000000000000000011000100010000000000100000101010001001000000000001000000000000000000000001000000000000000000000000000000000100000000000000000001000000000000000000000001011000000000000000000100000001000001110101000010000101000000100000000000000000000000100000000000000010000000000000001000000000000000000000001000000000000000000000000000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000100010000000000000000010000000010000000000000000000000000000000000000000000000000101000010101000000000000001101111000010000000100010000100000000101011000000000001100000000001000100101001000000001000110001000010001001111000000000000011100001000010001100010001100100101001100011101010110000101110000000110001100000010101000000 SyneRS991 00000000000000011010000000000000000000000000010000001010010010000100100100010000000001000000010000010100001011101101110111101001000110010100001001000011111000100000110101100000000011000000000000001010001001100000011011001100010010000000000100000000110100100001101100011010010000010001000000100111000000000000000010100000000001000001000000100110010001100000011000100001101000011010100001100100011000011000000010010000000000100001101010101000001000000001000000101010010000100100000111010011001001111010001010000000001100010000001010000100000011011101100101001000100010100000000001001001000000010000000010001001100001000001100000000000000100001110001011000001010000000000000000010001000000010000001010000010010000110000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010101000000100000000000000100000010000000000000000001000000100000000000000100001000010001000000000000001100101000000000000000010000010001000011000000000000001000010000000001000100010011100001001110000010100101001110100000000000000100000000000010001000011000100000000000010101000000001000010010000110000101011001000100001 SyneWH780 11000000000000011010000000000000000000000000010000001010110010000000000100010000000001000000010000010100001011101101110111101001000110010100001001000011111000100000110101100000000011000000000010000010001001100000011011001100000010000000000100000000110100100001101100011010110000010001000000100111000000000000000010100000000001000000000000100110011001100000011000100001000000011010100001100000011000011000000010010000000000100001101010101000001000000001000000101011010000100100000111010011001001111010001010000000001100010000001010000100000011011101100101001000100010100000000001001001000000010000000010001001100001000001100000000000000100001110001011000000010000000000000000010001000000010000001010000011010000110000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010101000000100000000000000100000010000000000000000001000000100000000000000100001000010001000000000000001100101001000000000000010000010001000011001000000000001000010000000001000100010011100001001110000010100101001110100000000000000100000000000000001000011000000000000000010101000000001000010010000110000101011001000100001 SyneCC960 00100000000000011010000000000000000000000000010000001010100000000100100100010000000001000000010000010100001011001101110111101001000110010100001001000011111000100000110100100000000011000000000000001010001001100000010011001100010010000000000100000000110100100001101000011010010000010001000000110111000000000000010010100000000001000000000000100110011001100000011000100001000000011010110000100000011000011000000010010000000000100001101010100000001000000001000010101010010010100100000111010011001001111000001010000000001100010000001010000100000011011101101101001000100010100000000001001001000000010000000010001001100001000001100000000100000100001110001011000000010000000000000000010001000000010000001010000010010000110000000001000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010100000000100000000000000100000010001000000000000001000000100000000000000100001000010001000000000000001100101001000000000000010000010001000010001000000000001000010000000001000100010011100001001111000000100001001110100000000000000100000000000000001000011000000000000000010101000000001000010010000110000101011001000100001 SyneWH810 00100000000000011010000000000000000000000000010000001010100000000100100100010000000001000000010000010100001011000101110111101001000110010100001001000011111000100000110100100000000011000000000000001010001001100000010011001100011010000000000100000000110100100001101100011010110000010001000000110111000000000000010010100000000001000000000000100110011001100000011000100001000000011010100001100000011000011000000010010000000000100001101010101000001000000001000010101010010010100100000100010011001001111010001010000000001100010000001010000100000011011101101101001000100010100000000001001001000000010000000010001001100001000001101000000100000100011110001011000000010000000000000000010001000000010000001010000010010000110000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010100000000100000000000000100000010001000000000000001000000100000000000000100001000010001000000000000001100101000000000000000010000010001000011000000000000001000010000000001000100011011100001001111100000100011001110100000000000000100000000000000001000011000000000000000010101000000001000010010000110000101011001000100001 SyneCC990 00100000000000011010000000000000000000001000010000001010100000000100100100010000000001100000010000010100001011101101110111101001000110010100001001000011111000100000110101100000000011000000000010001010001001100000011011001100010010000000000100000000110100100001101100111010010000010001000000110111000000000000010010100000000001000000000000000110001001100100011100100001001000011011100000100000011000011000010010010000000000100001101010100000001000000001000000101010010010100100000111010011001001111010001010000000001100010000001010000100000011011101101101001000100010100000000001001001000000010000000010001001100001000001100000000100000100001110001011000000010000000000000000010001000000010000001010000010010000110000000001000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000010100000000000000000000000000000000000000000101000000010000000000010000000000100010100000000100000000000000100000010001000000000000001000000100000000000000101001000010001000100000000001100101001000000000000010000010001000011001000000000001000110000000001000100010011100001001110000001100010001110100000000000000100000000000000001000111000000100000000010101000000001000010010000110000101011001000100001 ProMIT931 11000000000000011010000000000000000000000000110000001110010010000100000100010000000001000000010000010100001011000101110111101001000110010100001001000011111000100000111100100000000011000000000010001000001001100000011011001100010010000000010100010000110100100001101000011010110000010001000000100111000000000000000010100000000001000000000000000110001001100000011010100001001000001010100001100000011000011000000000010001000000100001101010101001001001000001000001101011010001100100001110110011001001111010001010000000001100010000001010000100000011011101100101001000100010100000000001001001000000110000000010001001100001000001100000000000000100001110001010000000010000000000000000010001000010010010001010000011010000110000000001000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010101000000100000000000000100000010001000000000000001000000100000000000000100001000010000000100000000001100101001000000000000000000010001000011000100000000001000010000000001000100010011100001001110010001100011001110100000000100010010000000000100010000011011000000001000010010010100011000010010001110000100001001000111011 SyneWH570 00000000000000011010000000000000000000001000010000001010010000000000100110010000000001000100010000010000001011000101010011101001000110010100001001000011110000100000110100100000000011000000000010011000001001100000011011100100010010001000000100001000110100100001101100011010110000010001000010110111000000100000010000100000000001000000000010000010011001100000011000100001011000011010100001100000111000011000000000010000000000100001101010101000001000000001011100101110010100100100000101010011001001011010001010000000001100010000001010000100000011011101100101001000000010100000000001001001000000010000000000001001100001000001100000000000000100001110001011000000010000000000000000010001000000010000001010000010010000110000000001000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010100000000100000000000000100000010001000000000000001000000100000000000000101001000010000000000000000001100101000000000000000010000010001000011001000000000001000010000000001000100110010000001001100001000000011001110100000000000010100000000010001100000011000100000000000010101000000001000010010000110000101000011000100001 ProCMP137 01000000000000011011101000000100000000010000010000000010010000001001100100010000000001000000010000010100001011000101110011101001000110010100001001000011111000100000111100100000000011000000000000001000001001100000011011001101010010010000001000010000110100111001101000111010110000010001000000100111000000100000000010100000000001000000000010100110011101100000011000100001001000011010100001101000011000011000000010010000000010100001101010101001001001000001010100101011010000100100000111110011001000111010001010000000001100010000001010000100000011011101100101001000000010100000000001001001000000010000000010001001100001000001100000000000000100001110001011000000010000010000000000010001000000010000001010000010010000110000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000010100000000000000000000000000000000000000000101000000010000000000010000000000100010100000000100000000000000100000010001000000000000001000000100000000000000101001000010000100000000000001100101001000000000000000000010001000011001000000000001000011000000111001100010011100001001100000000101101001110100000000000100100000000000010000000011000000000001000010101000000001000010010000110000100101001000100001 ProMIT921 11000000000001010010001100000100000000100000010000001010010010001001100100010000000001000000010000010100001011000101110011101001000110010100001001000011111000100000110100100000000011000000000000001000001001100000011011001101010010010000000000010000010100110000100000011010110000010001000000100111000000100000000010100000000001000000000000000110011001100000011000100001101000011000110001100000011000011000000010010000000010100001101010101001001001000001000101101011010000100100000111010011001010111010001010000000001100010000001010000100000011011101100101001000000010100000000001001001000000110000000010001001100001000001100000000000000100001110001011000000010000010000000000010001000000010000001010000010010000110000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000011000000000000000000000000000000000000000000101000000010000000000010000000000100010100010000100000000000000100000010001000000000000001000000100000000000000100001000010000100000000000001100101001000000000000000000010001000011001000000000001001011000000001000100110011000001001100000000101101001110100000000000100100000000000010000000011000000000000000010101000000001000010010000110000100101001000100001 ProCMP198 01100000000000011011000110000101000000000000000100001000011000000100000101010000000001000000010000010100001011000101110011101001000110010100001001000011111000100000111100100000010001001101000010001000011001000100000010100011011000010000001110100000010001001101110000011110110000010001000000100111000000000000001010100111000001000000000000000110010001100000001000000001001000011000110001001000011000011000000010010000000000100001101110111001001001000010010101101011010001100100000110110011001001111010001010000000001100010000001010000100000011011101101101001000000010100000000001001001000000010000000010001001100001000001100000000000000100001110001011000000010000010000000000010001000000010000001010000010010000110000000000000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010100000000000000000000000000000000000000000101000000010000000000010000000000100010100000000100000000000000100000010001000000000000001000000100000000000000100001000010000000000000000001100101001000000000000000000010001000011000100000000001000001011001111010100010011100001101100011000100101001110100000000100010100000000000100010000111001000000001000010000010100011000010010001110000100001001000111011 ProMIT931 11000000000000011010000000000000000000000000110000001110010010000100000100010000000001000000010000010100001011000101110111101001000110010100001001000011111000100000111100100000000011000000000010001000001001100000011011001100010010000000010100010000110100100001101000011010110000010001000000100111000000000000000010100000000001000000000000000110001001100000011010100001001000001010100001100000011000011000000000010001000000100001101010101001001001000001000001101011010001100100001110110011001001111010001010000000001100010000001010000100000011011101100101001000100010100000000001001001000000110000000010001001100001000001100000000000000100001110001010000000010000000000000000010001000010010010001010000011010000110000000001000100000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010101000000100000000000000100000010001000000000000001000000100000000000000100001000010000000100000000001100101001000000000000000000010001000011000100000000001000010000000001000100010011100001001110010001100011001110100000000100010010000000000100010000011011000000001000010010010100011000010010001110000100001001000111011 PromNATL2 10100000000000011011100000000101000000000000010100001000010000000000001101010000000001000000010000010100001011101101110011101001000110010100001001000011111000100000110100100000000011000000000000001000001001100000011011001101010010000000001100011000110100100001101010111110010000010001000000100111000000100000000010100000000001000000000000100110011001100000011000100001100000010001100001000000011100011000000010010000000010100001101010110000001000000011001111101011010000100100000110010011001011111010001010000000001100010000001010000100000011011101101101001000100010100000000001001001000000010000000010001001100000000001101000000000000100011110001011000000010000000000000000010001000000010000001010000010010000110000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000000000000000000000000000000000000101000000010000000000010000000000100010100000000100000000000000100000010001000000000000001000000100000000000000101001000010000000100000000001100101000000000000000010000010001000011000000000000001000111000000001001100010011100001001010000001100100001110100000000000000100000000000010000000011001000000000000010101000000001000010010000110000100001001000100101 Staphyloc 11011111111111111110011001111010111111110111001010100011001100110111010101101111111111111111111111111011001001000100001110110011001100101101110111111101110101011101010010110111101110111110111101101100001011100011100110110110100100000011110110100000111101100100010001000001011101110100101001100111111011011110110111111010111110001111101101111010011111111101100001000010110111100100001111010011110001110111011101111110111101101000001111010110110110111100010100110011101010011001110100110101110101000100110001011111100001001110010101111111111100100110010010110111011101011111111110110101111111000111100111011011011110111010010111111011110011101111110110111110000111101111111111101010111101001100010111111110111111010111111111111011111111111111111110111111111111111111111111111111111111111111111111111111111110111111111011111111111111111111111111111111111111111111111111101111111111111011111011111101110101111111111111111111101011101111111111111111110111011011111111111111011101110111010111011111111111111011111111111111011110111011111111001100011110111110010010111110000100101001101111101011000100100000001111100011110111011001001111111101000000110011100101111110111110000100001111001011101010111011100001100111000101 Bsubtilis 11011111111111111110010001111010111111110111001011100011001100110000110101101111111110111101111111111111000001100101000011100001011110101101110111111101100111011100000010110111101110110110111111110111000011100110100110010100101100010011100101110000011000100100000011000101011101100100101001100111111010011110111110011010111001001110111101111110011111111111110001010010110111110100001111010011111001111111011101111110111101101100001011011010110110111100010000110011101010011101110100110111110101000101110001011111100001001111010101111111111100100110010011110111011101111111111110110101111111000111100111111011011111111010010111111011110011101111110110111110000111101011111111101110111101001100011111111110011111010111111111111011111111111111111110111111111111111111111111111111111111111111111111111111111110111111111011111111111111111111111111111111111111111111111111101111111111111011111011111101010111111111111111111111101011101111111111111111110111011011111111111111011111110111011111011111111110111011111111111111011110111011111111001100111110111110010000110110000000101001110111101000010000100000001111000011110111011001001111111101000100111011100101111110111110100101001111001011101010111011100000100111000111
Example Output File (Single Char)
Nostoc712 4 Anaba2941 0 Nosto7310 7 SynPCC680 9 Crocospha 34 Trichodes 33 SynPCC794 0 SynPCC630 2 SynJA22-1 4 SynJA3-3A 1 ThermoBP- 17 Gloeobact 33 SyneRS991 2 SyneWH780 3 SyneCC960 0 SyneWH810 2 SyneCC990 3 ProMIT931 0 SyneWH570 9 ProCMP137 6 ProMIT921 5 ProCMP198 74 ProMIT931 0 PromNATL2 16 Staphyloc 147 Bsubtilis 83 Total: 494
CLIQUE_TREEVIEWER
This app modifies raw data files made using Clique to clean up and improve the tree displays.
Enter inputs and click "Run Program" to get started.
The tree data file is in the form of {{A, B}, C};
The Clique file is is usally in the form of multiple trees. Use the Tree Number parameter to select which tree to clean up. Default is the first tree.
See the example files for more help on correct input files.
The Clique file is is usally in the form of multiple trees. Use the Tree Number parameter to select which tree to clean up. Default is the first tree.
See the example files for more help on correct input files.
View sample input/output files. All files should be in a plain-text format (.txt, .csv, .xml, etc.).
Download files related to this app.
Program Results
Example Tree Data File
((((((((Mes303099,AgroTuC58),Bruce1130),BToulouse,Aur85-9A1,XanthoPy2), ((RhoBisB18,BrUSDA110),NitrobX14)),((Parvu2503,CauloCB15),Ocean2633)), (((((Ocean2597,SulfEE-36),SiliDSS-3),Rhodo2654),JannaCCS1),ParPD1222)), ((((EhrliJake,AnaplasHZ),WoDrosmel),NeoMiyaya),(RicRML369,RickeCa12)), ((SphiSKA58,ZymomoZM4),(Novo12444,SphRB2256)),(Rhod11170,MagneMS-1), (AcidiJF-5,Gluco621H)),(Campyloba,Candi1002),Helicobac)[0.2500]; ((((((((Bruce1130,Mes303099),AgroTuC58),BToulouse,Aur85-9A1,XanthoPy2), ((RhoBisB18,BrUSDA110),NitrobX14)),((Parvu2503,CauloCB15),Ocean2633)), (((((Ocean2597,SulfEE-36),SiliDSS-3),Rhodo2654),JannaCCS1),ParPD1222)), ((((EhrliJake,AnaplasHZ),WoDrosmel),NeoMiyaya),(RicRML369,RickeCa12)), ((SphiSKA58,ZymomoZM4),(Novo12444,SphRB2256)),(Rhod11170,MagneMS-1), (AcidiJF-5,Gluco621H)),(Campyloba,Candi1002),Helicobac)[0.2500]; ((((((((Mes303099,AgroTuC58),Bruce1130),BToulouse,Aur85-9A1,XanthoPy2), ((RhoBisB18,BrUSDA110),NitrobX14)),((Parvu2503,CauloCB15),Ocean2633)), (((((SiliDSS-3,SulfEE-36),Ocean2597),Rhodo2654),JannaCCS1),ParPD1222)), ((((EhrliJake,AnaplasHZ),WoDrosmel),NeoMiyaya),(RicRML369,RickeCa12)), ((SphiSKA58,ZymomoZM4),(Novo12444,SphRB2256)),(Rhod11170,MagneMS-1), (AcidiJF-5,Gluco621H)),(Campyloba,Candi1002),Helicobac)[0.2500]; ((((((((Bruce1130,Mes303099),AgroTuC58),BToulouse,Aur85-9A1,XanthoPy2), ((RhoBisB18,BrUSDA110),NitrobX14)),((Parvu2503,CauloCB15),Ocean2633)), (((((SiliDSS-3,SulfEE-36),Ocean2597),Rhodo2654),JannaCCS1),ParPD1222)), ((((EhrliJake,AnaplasHZ),WoDrosmel),NeoMiyaya),(RicRML369,RickeCa12)), ((SphiSKA58,ZymomoZM4),(Novo12444,SphRB2256)),(Rhod11170,MagneMS-1), (AcidiJF-5,Gluco621H)),(Campyloba,Candi1002),Helicobac)[0.2500];
Example Tree Clique File
Largest clique program, version 3.66 Largest Cliques ------- ------- Characters: ( 16 17 26 31 42 51 52 53 54 55 56 57 59 61 62 63 67 75 83 90 93 96 97104132135146153160161163167168170192206215218222226227232243 244249250251253255257262268275278279282286287288294309313314316 319323327339344346347349367369377386387388401402407414415423425 435437438442443449455458461463472473480485489491493494497502503 504512514516523524527533534535536541546547550551554564568569571 572573574575578580583597598602606608610615616617618625628629630 631640641644648655663666668672) Tree and characters: 206215583598615663668 59 61 67278279388 83132170218286309319438480516535536602618327369616 16 31104160168244323339349377425437455463473489491493564610641644672 51 93163243249250251253255262275287294314316386423443497503504512546547550554568569571572574575580608648 17 26 42 52 53 54 55 56 57 62 63 75 90 96 97135146153161167192222226227232257268282288313344346347367387401402407414415435442449458461472485494502514523524527533534541551573578597606617625628629630631640655666 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 +-----------------------------------------------------------------------------------------------------------------------------------------------------------------Mes303099 +---------------------------------------------1-+ +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------AgroTuC58 ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Bruce1130 ! +------1-----1-+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------BToulouse ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Aur85-9A1 ! ! +1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------XanthoPy2 ! ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------RhoBisB18 ! ! +------------------------------------------------------------1-+ ! +------------------------------------------------------------------------------------------------------------------------------------------------------1--1-----------------------------------------1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------BrUSDA110 +1--1-+ ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------NitrobX14 ! ! ! ! +-----------------------------------------------------------------------------Parvu2503 ! ! +------------------------------------------------------------------------------------------------------------------------------------------------1-----------------------------------------1-+ ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------1--1--1-+ +-----------------------------------------------------------------------------CauloCB15 ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Ocean2633 +0--0--0-+ ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Ocean2597 ! ! +---------------------------------------------------------------------------1-+ ! ! +---------------------------------------------------------------------------1--1-----1--1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------SulfEE-36 ! ! ! ! ! ! +------1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------SiliDSS-3 ! ! ! ! ! ! +------------------1--1--1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Rhodo2654 ! ! ! ! ! +---------1--1-----1-----1--1--1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------JannaCCS1 ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ParPD1222 ! ! +--------------------------EhrliJake ! +------1--------------------------------------1-----1--1--1-----------------------------1--------------------------------1-----------------1-----1--1-----1-----------1-----1-----1-----1-+ ! +1--1--1--1--1--------1--------------1-----------1--1--1--1--1--1--1--1--1-----1--1--1--1--1--1--1--1-+ +--------------------------AnaplasHZ ! ! ! ! +---------1--1-----1-----1-----1-----1-----1--1--1--1--1--1--1--1--1--1-----1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------WoDrosmel ! ! ! +--0--0--0--0--0--0--0-++------------------------------------------1--1--1--1--1--1+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------NeoMiyaya ! ! ! ! ! ! +-RicRML369 ! ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--1--1--------1--1-----------------------1-----1--1--1--1-----1-----1-----1-----1--------1-----1--1--1-----------1-----1--------1--1--1--1--1--------1-+ ! ! +-RickeCa12 ! ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------SphiSKA58 ! ! +---------------------------------------------------------------------------------------------------------------------------------------------------1-+ ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------ZymomoZM4 ! +---------------------------------------------------------------------------------1-----1-----1-----1--------------------------------1-+ ! ! ! +--Novo12444 ! ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1-----------------------------------------------------------------------------------------------------------------1-+ ! ! +--SphRB2256 ! ! ! ! +-----------------------Rhod11170 ! +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1-----------------------------------------------------------------------------------------1--------------------------------------------------------------------------------------1-+ ! ! +-----------------------MagneMS-1 ! ! ! ! +-----------------------------------------AcidiJF-5 ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--1--1--1--1--1--1--1-----1-----------------------------1--1-----------1-----------------------------------------1-----------------------------------------1-+ ! +-----------------------------------------Gluco621H ! ! +-----Campyloba +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--------------1--------1--------------------------------------------------------------------------------------------------1-+ ! +-----Candi1002 ! +-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Helicobac remember: this is an unrooted tree! Characters: ( 16 17 26 31 42 51 52 53 54 55 56 57 59 61 62 63 67 75 83 90 93 96 97104132146153160161163167168170192206215218222226227232243244 249250251253255257262268275278279282286287288294309313314316319 323327339344346347349367369377386387388401402407414415423425435 437438442443449455458461463472473480485489491493494497502503504 512514516523524527532533534535536541546547550551554564568569571 572573574575578580583597598602606608610615616617618625628629630 631640641644648655663666668672) Tree and characters: 206215583598615663668 59 61 67278279388 83132170218286309319438480516535536602618327369616 16 31104160168244323339349377425437455463473489491493564610641644672 51 93163243249250251253255262275287294314316386423443497503504512546547550554568569571572574575580608648 17 26 42 52 53 54 55 56 57 62 63 75 90 96 97146153161167192222226227232257268282288313344346347367387401402407414415435442449458461472485494502514523524527532533534541551573578597606617625628629630631640655666 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 +--------------------------------------------------Bruce1130 +------------------------------------------------------------------------------------------------------------------------------------------------------------1-+ +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1-+ +--------------------------------------------------Mes303099 ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------AgroTuC58 ! +------1-----1-+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------BToulouse ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Aur85-9A1 ! ! +1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------XanthoPy2 ! ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------RhoBisB18 ! ! +------------------------------------------------------------1-+ ! +------------------------------------------------------------------------------------------------------------------------------------------------------1--1-----------------------------------------1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------BrUSDA110 +1--1-+ ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------NitrobX14 ! ! ! ! +--------------------------------------------------------------------------------Parvu2503 ! ! +---------------------------------------------------------------------------------------------------------------------------------------------1-----------------------------------------1-+ ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------1--1--1-+ +--------------------------------------------------------------------------------CauloCB15 ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Ocean2633 +0--0--0-+ ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Ocean2597 ! ! +---------------------------------------------------------------------------1-+ ! ! +---------------------------------------------------------------------------1--1-----1--1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------SulfEE-36 ! ! ! ! ! ! +------1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------SiliDSS-3 ! ! ! ! ! ! +------------------1--1--1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Rhodo2654 ! ! ! ! ! +---------1--1-----1-----1--1--1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------JannaCCS1 ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ParPD1222 ! ! +--------------------------EhrliJake ! +------1--------------------------------------1--1--1--1-----------------------------1--------------------------------1-----------------1-----1--1-----1--------------1-----1-----1-----1-+ ! +1--1--1--1--1--------1--------------1-----------1--1--1--1--1--1--1--1--1-----1--1--1--1--1--1--1--1-+ +--------------------------AnaplasHZ ! ! ! ! +---------1--1-----1-----1-----1-----1-----1--1--1--1--1--1--1--1--1--1-----1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------WoDrosmel ! ! ! +--0--0--0--0--0--0--0-++------------------------------------------1--1--1--1--1--1+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------NeoMiyaya ! ! ! ! ! ! +-RicRML369 ! ! +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--1--1--------1--1-----------------------1-----1--1--1--1-----1-----1-----1-----1--------1-----1--1-----1-----------1-----1--------1--1--1--1--1--------1-+ ! ! +-RickeCa12 ! ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------SphiSKA58 ! ! +---------------------------------------------------------------------------------------------------------------------------------------------------1-+ ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------ZymomoZM4 ! +---------------------------------------------------------------------------------1-----1-----1-----1--------------------------------1-+ ! ! ! +--Novo12444 ! ! +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--------------------------------------------------------------------------------------------------------------------1-+ ! ! +--SphRB2256 ! ! ! ! +-----------------------Rhod11170 ! +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--------------------------------------------------------------------------------------1-----------------------------------------------------------------------------------------1-+ ! ! +-----------------------MagneMS-1 ! ! ! ! +-----------------------------------------AcidiJF-5 ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--1--1--1--1--1--1--1-----1--------------------------1--1-----------1-----------------------------------------1--------------------------------------------1-+ ! +-----------------------------------------Gluco621H ! ! +-----Campyloba +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1--------------1--------1-----------------------------------------------------------------------------------------------------1-+ ! +-----Candi1002 ! +-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Helicobac remember: this is an unrooted tree! Characters: ( 16 26 31 42 51 52 53 54 55 56 57 59 61 62 63 67 75 83 90 93 96 97 104132135146153160161163167168170192206215218222226227232243244 249250251253255257262268275278279282286287288294309313314316319 323327339344346347349367369376377386387388401402407414415423425 435437438442443449455458461463472473480485489491493494497502503 504512514516523524527533534535536541546547550551554564568569571 572573574575578580583597598602606608610615616617618625628629630 631640641644648655663666668672) Tree and characters: 206215583598615663668 59 61 67278279388 83132170218286309319438480516535536602618327369616 16 31104160168244323339349377425437455463473489491493564610641644672 51 93163243249250251253255262275287294314316386423443497503504512546547550554568569571572574575580608648 26 42 52 53 54 55 56 57 62 63 75 90 96 97135146153161167192222226227232257268282288313344346347367376387401402407414415435442449458461472485494502514523524527533534541551573578597606617625628629630631640655666 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 +--------------------------------------------------------------------------------------------------------------------------------------------------------------------Mes303099 +------------------------------------------1-+ +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------AgroTuC58 ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Bruce1130 ! +------1-----1-+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------BToulouse ! ! ! +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Aur85-9A1 ! ! +1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------XanthoPy2 ! ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------RhoBisB18 ! ! +---------------------------------------------------------1-+ ! +------------------------------------------------------------------------------------------------------------------------------------------------------1--1-----------------------------------------1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------BrUSDA110 +1--1-+ ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------NitrobX14 ! ! ! ! +-----------------------------------------------------------------------------Parvu2503 ! ! +---------------------------------------------------------------------------------------------------------------------------------------------1--------------------------------------------1-+ ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------1--1--1-+ +-----------------------------------------------------------------------------CauloCB15 ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Ocean2633 +0--0--0-+ ! ! +-----------------------------------------------------------------------------------------------------------SiliDSS-3 ! ! +------------------------------------------------------------------------------------------------------------------------------------------------------------------------------1-+ ! ! +---------------------------------------------------------------------------1--1-----1--1-+ +-----------------------------------------------------------------------------------------------------------SulfEE-36 ! ! ! ! ! ! +------1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Ocean2597 ! ! ! ! ! ! +------------------1--1--1-+ +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------Rhodo2654 ! ! ! ! ! +---------1--1-----1-----1--1--1-+ +-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------JannaCCS1 ! ! ! +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ParPD1222 ! ! +--------------------------EhrliJake ! +---1--------------------------------------1-----1--1--1-----------------------------1-----------------------------------1-----------------1-----1--1-----1-----------1-----1-----1-----1-+ !
Example Output File
+-------------- Mes303099 +------1------+ +------1------+ +-------------- AgroTuC58 | | | +---------------------------- Bruce1130 | +------2------+------------------------------------------ BToulouse | | | +------------------------------------------ Aur85-9A1 | | +------1-------+ +------------------------------------------ XanthoPy2 | | | | +-------------- RhoBisB18 | | +------1------+ | +-------------3-------------+ +-------------- BrUSDA110 +------2------+ | | | +---------------------------- NitrobX14 | | | | +-------------- Parvu2503 | | +------2------+ | +--------------------3---------------------+ +-------------- CauloCB15 | | | +---------------------------- Ocean2633 +------3------+ | | +-------------- Ocean2597 | | +------1------+ | | +------4------+ +-------------- SulfEE-36 | | | | | | +------1------+ +---------------------------- SiliDSS-3 | | | | | | +------3-------+ +------------------------------------------ Rhodo2654 | | | | | +------6------+ +-------------------------------------------------------- JannaCCS1 | | | +----------------------------------------------------------------------- ParPD1222 | | +-------------- EhrliJake | +------15-----+ | +------24-----+ +-------------- AnaplasHZ | | | | +------17-----+ +---------------------------- WoDrosmel | | | +------7------+------------------------------------------+ 6 +------------------------------------------ NeoMiyaya | | | | | | +-------------- RicRML369 | | +--------------------26-------------------+ | | +-------------- RickeCa12 | | | | +-------------- SphiSKA58 | | +------1------+ | | | +-------------- ZymomoZM4 | |----------------------------------------------------------------------+ 5 | | | +-------------- Novo12444 | | +------2------+ | | +-------------- SphRB2256 | | | | +-------------- Rhod11170 | |------------------------------------------------------------------------------------+ 3 | | +-------------- MagneMS-1 | | | | +-------------- AcidiJF-5 | +-----------------------------------------14-----------------------------------------+ | +-------------- Gluco621H | | +-------------- Campyloba |--------------------------------------------------------------------------------------------------+ 4 | +-------------- Candi1002 | +----------------------------------------------------------------------------------------------------------------- Helicobac
SEQ_COMPILE
Sequence Compiler by Alex Figueroa
Enter inputs and click "Run Program" to get started.
This app needs documentation.
View sample input/output files. All files should be in a plain-text format (.txt, .csv, .xml, etc.).
Download files related to this app.
Program Results
Example Input File
Organism= testorganism Query= alanyl-tRNA synthetase Length=884 Score E Sequences producing significant alignments: (Bits) Value 01293 1074 0.0 01203 34.3 0.073 00378 31.6 0.48 > 01293 Length=9298 Score = 1074 bits (2778), Expect = 0.0, Method: Compositional matrix adjust. Identities = 567/883 (64%), Positives = 679/883 (77%), Gaps = 5/883 (1%) Frame = -1 Query 3 IPTKFTTSKIRSDFLEFFKNKGHKIVPSAPLVPSNDPTLLFTNSGMVQFKDVFLGAEKRS 62 +P++ T++IRS FL++F++KGH IVPS+ LVP++DPTLLFTNSGMVQFK+VFLG+EK Sbjct 4135 LPSRMKTAEIRSTFLDYFRSKGHTIVPSSSLVPASDPTLLFTNSGMVQFKNVFLGSEKLP 3956 Query 63 EVRVADVQCCLRAGGKHNDLDSVGYTARHHTFFEMLGNWSFGDYFKKEAIMWAWELLTQV 122 VR ADVQ CLRAGGKHNDLDSVGYTARHHTFFEMLGNWSFGDYFK++AI +AWELLT V Sbjct 3955 YVRAADVQRCLRAGGKHNDLDSVGYTARHHTFFEMLGNWSFGDYFKRDAIAYAWELLTDV 3776 Query 123 WELPPERLLVTVYHTDDESYALWRDMVGVPEDRIVRIGDNKGAPFASDNFWQMADTGPCG 182 +LP ++L VTVYHTDDE++ +W VGVP +RIVRIGDNKGAP+ASDNFWQMADTGPCG Sbjct 3775 LKLPKDKLWVTVYHTDDEAFDIWNKEVGVPAERIVRIGDNKGAPYASDNFWQMADTGPCG 3596 Query 183 PCTEIFYDHGEHIpggppgspgEDGDRFIEIWNLVFMQFDRQSDGTLVPLPTPCVDTGMG 242 PCTEIF+DHGE I GGPPGSP EDGDR+IEIWNLVFMQFDR DGTL PLP PCVDTGMG Sbjct 3595 PCTEIFFDHGEEIAGGPPGSPDEDGDRYIEIWNLVFMQFDRAPDGTLSPLPAPCVDTGMG 3416 Query 243 LERLAAILQHVHTNYEIDLFQTLILKAAELTAVADVQNKSLCVIADHSRACAFLIVDGVL 302 LERLAA+LQHVH+NYEIDLF+ LI AA+LT D+ NKSL VIADH RAC+FLIVDGVL Sbjct 3415 LERLAAVLQHVHSNYEIDLFEHLIKVAAQLTHTKDLANKSLRVIADHIRACSFLIVDGVL 3236 Query 303 PSNEGRGYVLRRIIRRALRHGWMLGVRQPFFNNMVPTLIAVMGDAYPKLQAAAESVMRTL 362 PSNEGRGYVLRRIIRRALRHGWMLGVR FF MV L+ MG+AYP+L V L Sbjct 3235 PSNEGRGYVLRRIIRRALRHGWMLGVRGDFFWKMVQPLVEEMGEAYPELAQKQAFVEEAL 3056 Query 363 LAEEERFAETLDVGMKIFNEVAAKVANGVIPGSDAFRLYDTYGFPVDLTADIARERGMRV 422 EE RF ETL+ GM++F+ VAAK +NG IPG+DAFRLYDTYGFPVDLTADIARERG+ V Sbjct 3055 RTEERRFGETLENGMRLFDAVAAK-SNGSIPGADAFRLYDTYGFPVDLTADIARERGLTV 2879 Query 423 DMAGFEAAMTQQRKTARAAGKFGRGVQLSAERAATLSPTVFLGYEQLQADDLRVVALLSD 482 DM GFE +M +Q++ +R GKF Q+ AE A+ L PT FLGY+ L + +VV ++ Sbjct 2878 DMDGFEQSMKEQQERSREGGKFEAKGQMPAELASQLQPTAFLGYDALMSQGSKVVGIVRG 2699 Query 483 GGLTDSASVGDEVIVLTDRTPFYAESGGQVGDIGTLMASDGVRLEVTDTQKLMGQFHGHV 542 G D G+E +V+ DRTPFYAESGGQVGD G L+ + G V DT K+ G F GH Sbjct 2698 GKQYDQLGEGEEALVMLDRTPFYAESGGQVGDTGVLVNTTG-SFAVADTLKMGGVFFGHA 2522 Query 543 ARIV-QGGVKVGDVLSGSVAVARRKMVALNHSATHLLHCALRSVFGTHVVQKGSLVAPDR 601 R + ++VGDV+ +V RR+ + LNHSATHLLH ALR V G HV QKGSLVAP+R Sbjct 2521 GRWSGKQALRVGDVVDANVDGTRRQAIVLNHSATHLLHAALRKVLGEHVTQKGSLVAPER 2342 Query 602 LRFDFSHFEPISAAQMTLIERMVNDEVRANHLVMIEQMGMQAALDAGAMALFGEKYGEHV 661 LRFDFSHF+P+SA ++ IE +VN EVR N + MG A++ GAMALFGEKYG+ V Sbjct 2341 LRFDFSHFKPMSADELRQIEMLVNAEVRRNAAAEVHHMGYNEAIEFGAMALFGEKYGDEV 2162 Query 662 RVVTMGT-SVELCGGTHITRTGDIGLFKIISECGVSSGVRRIEAVTGESALNHVLAEEHR 720 RV+ MG S ELCGGTH+ RTGDIGLFKI+SE GV+SGVRRIEAVTG AL +V EE R Sbjct 2161 RVLRMGEFSTELCGGTHVGRTGDIGLFKIVSEAGVASGVRRIEAVTGAGALAYVADEERR 1982 Query 721 LYEVAGLIGSNANNVVNHIRQLTDRQktlereleklkgklISGTITDLLSMavnvadvkv 780 L E A L+ SN + V +RQL D+QK LERELE L+ K DL S A +VA +KV Sbjct 1981 LGEFAQLLSSNGDEAVEKLRQLFDKQKKLERELESLRAKAAGSATADLASSAKDVAGIKV 1802 Query 781 vaaRLDGLDGKALREALDRLKLQLSDAVIVLAGVTGGKVALVTAVNGPRAMGKVKADTLL 840 +AARL+GLD KALR+++D+LK QL D V++LAG G+V+LV V G +A+G+VKA ++ Sbjct 1801 IAARLEGLDAKALRDSMDQLKQQLGDCVVLLAGAADGRVSLVAGVGG-KALGRVKAGDVV 1625 Query 841 SHVATQINGRGGGRVDFAQGGGEDGPSLRSALDGVATWVKQHL 883 +HVA+QI+G+GGGR D AQGGG D L L+G+A W+ Q L Sbjct 1624 AHVASQIDGKGGGRPDMAQGGGSDTSELPGILEGLADWIGQRL 1496 Query= cell division protein Length=397 Score E Sequences producing significant alignments: (Bits) Value 00621 335 7e-103 01179 102 2e-024 01572 30.0 0.33 01221 28.5 1.6 00073 28.1 1.8 00728 27.3 3.1 00767 26.6 5.7 > 00621 Length=8031 Score = 335 bits (859), Expect = 7e-103, Method: Compositional matrix adjust. Identities = 199/306 (65%), Positives = 243/306 (79%), Gaps = 2/306 (1%) Frame = -1 Query 92 KPNWRERLRGSLFARNINALFSNNPRLDENLLDEIETALITADVGVSTTNAIVDGLRKRM 151 K +WRERL GS FAR++ +LF +P+LD++LLDE+ET LITADVGV + +V+ LRKRM Sbjct 7071 KRSWRERLSGSGFARSLTSLFIRHPKLDDDLLDELETTLITADVGVEASTTLVEDLRKRM 6892 Query 152 KAREFADIQTLLAALRNELITILRPVSKPLIVKRDALPFVFLvvgvngvgktttigKLAK 211 REFAD LLAALR LI +LRPV PL V P+V L VG+NGVGKTTTIGKLA+ Sbjct 6891 HKREFADAGALLAALRQSLIAMLRPVETPLDVS-GLKPYVILTVGINGVGKTTTIGKLAR 6715 Query 212 WFKRDGYSLMLAAGDTFRAAAVAQLQAWGERNSITVIAQKGPNADAASVVYDALQAAKAR 271 ++ +G +MLAAGDTFRAAAV QL+ WGERN + VI+Q G +ADAASV++DALQAA++R Sbjct 6714 RYRDEGRQVMLAAGDTFRAAAVEQLKTWGERNKVPVISQ-GQDADAASVIFDALQAARSR 6538 Query 272 SIEVLIADTAGRLHTQIGLMNELSKIRRVLGKLDSTAPHEVLMVIDGTTGQNALSQLRQF 331 + +VLIADTAGRLHTQ GLM+EL KI RVL K+D+ APHEVLMVIDGTTGQNA+SQ+RQF Sbjct 6537 NADVLIADTAGRLHTQGGLMDELGKIARVLKKIDTAAPHEVLMVIDGTTGQNAVSQVRQF 6358 Query 332 HAAVNVTGIVVTKLDGTAKGGVVFTLAREFGIPIHFISIGEQLEDMHFFDPEAFVDALLP 391 V V+G+VVTKLDGTAKGGVVF LAREFG+PI ++ +GE D+ FD EA+VD LLP Sbjct 6357 RQIVGVSGLVVTKLDGTAKGGVVFALAREFGLPIRYVGLGETATDLRVFDAEAYVDGLLP 6178 Query 392 KTLGNA 397 +LG Sbjct 6177 ASLGQG 6160 Query= chaperonin GroEL Length=547 Score E Sequences producing significant alignments: (Bits) Value 00618 796 0.0 01061 28.9 1.7 > 00618 Length=4924 Score = 796 bits (2056), Expect = 0.0, Method: Compositional matrix adjust. Identities = 428/547 (78%), Positives = 482/547 (88%), Gaps = 0/547 (0%) Frame = -3 Query 1 MAAKEIIFSEKARSRMVHGVNLLANAVKATLGPKGRHVVLDKSFGSPIITKDGVSVAKEI 60 MAAKE+ F E R+RM+ GVN LANAVK TLGPKGR+VV++KSFG+P +TKDGVSVAKEI Sbjct 1802 MAAKEVRFGEDVRARMLKGVNTLANAVKVTLGPKGRNVVIEKSFGAPTVTKDGVSVAKEI 1623 Query 61 ELADKFENMGAQMLKEVASKTNDHAGDGTTTATVLAQALIREGCKAVAAGMNPMDLKRGI 120 ELADK+EN+GAQ++KE ASKT+D AGDGTTTATVLAQA I+EG KAVAAGMNPMDLKRGI Sbjct 1622 ELADKYENIGAQIVKEAASKTSDVAGDGTTTATVLAQAFIQEGLKAVAAGMNPMDLKRGI 1443 Query 121 DKAVIAAVTELKKISKPTSDDKAIAQVATISANSDESIGNIIAEAMKKVGKEGVITIEEG 180 D+AV AAVTELKK+S PT+DDKAIAQV TISANSD +IG+IIA AMKKVGKEGVIT+EEG Sbjct 1442 DQAVNAAVTELKKLSNPTADDKAIAQVGTISANSDANIGDIIATAMKKVGKEGVITVEEG 1263 Query 181 TTLENELDVVEGMQFDRGYSSPYFINNQQSQIVELDNPYILLHDKKISSVRDLLTVLDAV 240 + LENELDVVEGMQFDRGY SPYFINNQQSQ VELD+P+IL+HDKK+S+VR+LL VL+AV Sbjct 1262 SGLENELDVVEGMQFDRGYLSPYFINNQQSQQVELDDPFILIHDKKVSNVRELLPVLEAV 1083 Query 241 AKESKPLLIVAEEVEGEALATLVVNNIRGIIKVCAVKAPGFGDRRKAMLEDMAVLTGGTV 300 AK KPLLIVAEEVEGEALATLVVN IRGI+KV AVKAPGFGDRRKA+LED+A+LT G V Sbjct 1082 AKAGKPLLIVAEEVEGEALATLVVNTIRGIVKVAAVKAPGFGDRRKAILEDIAILTNGVV 903 Query 301 ISEEVGLSLEKATTSHLGKAKKVRVSKENTTIIDGMGDNDAINGRVKQIKTQIEETTSDY 360 ISEEVGL+LEKAT + LG+AK+V ++KENTTIIDG G+ + I R+ QIK QIEET+SDY Sbjct 902 ISEEVGLALEKATITDLGRAKRVVITKENTTIIDGAGEAERIQSRIGQIKAQIEETSSDY 723 Query 361 DREKLQERvaklaggvavikvgaaTEVEMKEKKARVDDALLATRAAVEEGVIPGGGVALI 420 DREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARV+DAL ATRAAVEEGV+PGGGVALI Sbjct 722 DREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDALHATRAAVEEGVVPGGGVALI 543 Query 421 RAITAISNLKGANEDQTHGIQIALRAMEAPLREIVANAGEEPSVILNKVKEGKDNFGYNA 480 R++ A+ LKG N DQ GI I RA+EAPLR IV+NAG+EPSV+LNKVKEG NFGYNA Sbjct 542 RSLKALEGLKGQNTDQDLGIAITRRALEAPLRAIVSNAGDEPSVVLNKVKEGNGNFGYNA 363 Query 481 ATGEFGDMVNLGILDPTKVTRSALQNAASIAGLMITTEAMVAEAPKKDEPTPPAAgggmg 540 A GEFGDM+ GILDPTKVTRSALQ AAS+AG +ITTEA V E PKKDE A GGMG Sbjct 362 ANGEFGDMIAFGILDPTKVTRSALQFAASVAGSIITTEAAVTEVPKKDEGHSHGAPGGMG 183 Query 541 gmggmDF 547 GMGGMDF Sbjct 182 GMGGMDF 162 Query= dimethyladenosine transferase Length=265 Score E Sequences producing significant alignments: (Bits) Value 01768 279 2e-085 01095 35.4 0.005 00969 28.1 1.1 00404 25.8 5.8 00641 25.8 6.5 > 01768 Length=6320 Score = 279 bits (713), Expect = 2e-085, Method: Compositional matrix adjust. Identities = 150/257 (58%), Positives = 184/257 (72%), Gaps = 1/257 (0%) Frame = +1 Query 5 LFNTPAKKAFGQHFLVDRYYIDRIIHAINPQPNDHIVEIGPGQGAITLPLLKCCGSLTAI 64 N KK+FGQHFL ++ YI+RI+ AI+P+ +D +VEIGPG+GA+TLPLL G LTAI Sbjct 2020 FMNARPKKSFGQHFLHEKRYIERIVSAISPRADDFVVEIGPGEGALTLPLLAAAGKLTAI 2199 Query 65 ELDRDLIAPLTAAATPIGKLDIIHRDVLTVDLSILA-KQGNKKLRLVGNLPYNISSPILF 123 ELD DLI L A A +G+L IIH DVL VD + LA + G ++LR+ GNLPY ISSPILF Sbjct 2200 ELDTDLIPGLQARAASVGELSIIHSDVLKVDFTALAHRHGVERLRVAGNLPYYISSPILF 2379 Query 124 HVLQQAAIIADMHFMLQKEVVDRMAAPPGSKVYGRLSVMLQAWCEVTTMFVVPPDAFQPP 183 H ++ AA I DMHFMLQKEVVDRMAA PGSKVYGRLSVMLQ C V +FVVPP AF+PP Sbjct 2380 HCVEHAAAIQDMHFMLQKEVVDRMAAEPGSKVYGRLSVMLQLVCRVEPLFVVPPGAFRPP 2559 Query 184 PKVNSAITRLVPRDPTTIRIADTKRFSDIVRAAFGQRRKTLRNSLADICTPAHFEHAGIR 243 PKV+SA+ RLVP P + AD +R IV+AAF QRRKTL N+L ++ A + Sbjct 2560 PKVDSAVVRLVPLGPDQLPDADPERIHAIVKAAFAQRRKTLSNALKNVMDSNAIMAADVD 2739 Query 244 TNARAEQLEVTEFIALA 260 ARAE L +++ LA Sbjct 2740 PKARAETLSPQDYVRLA 2790 Query= DNA gyrase subunit A Length=893 Score E Sequences producing significant alignments: (Bits) Value 00015 809 0.0 00703 513 5e-175 00049 378 1e-111 00066 50.4 6e-007 00895 29.6 1.5 Repeat-07678 27.7 3.9 01163 28.5 4.0 > 00015 Length=2580 Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust. Identities = 418/611 (68%), Positives = 497/611 (81%), Gaps = 36/611 (6%) Frame = +3 Query 265 YQVNKARLIEKIAELVKEKRIDGISELRDESDKDGMRIYIEVKRGESAEVVLNNLYQQTQ 324 YQVNKARLIEKIAELVKEK+++GISELRDESDKDGMR+ IE++R +VVLNNL+QQTQ Sbjct 3 YQVNKARLIEKIAELVKEKKLEGISELRDESDKDGMRVVIEIRRDSMGDVVLNNLFQQTQ 182 Query 325 MESVFGINMVALVDGRPQLLNLKQILQAFIRHRREVVTRRTIFELRKARARAHVLEGLTV 384 ++ FGINMVAL+DG+P+LLNLK IL+AFIRHRREVVTRRTIF+LRKARARAH+LEGLTV Sbjct 183 LQVTFGINMVALLDGQPRLLNLKDILEAFIRHRREVVTRRTIFDLRKARARAHILEGLTV 362 Query 385 ALANIDEMIHLIKTSPSPQEAKERLLAKTWAPGLVGTLLSASGGEASRPEDLPQGVGLIG 444 ALANIDEMI LI+TS SP EA+ER+LA+ W PG+V TLL A+G EASRPED+ GL Sbjct 363 ALANIDEMIELIRTSASPAEARERMLARKWQPGMVATLLEAAGAEASRPEDMDPREGLKA 542 Query 445 DSYQLTEVQVRQILEMRLHRLTGLEQDKLAEEYQQLLEIIVGLIRILESPDVLLQVIRDE 504 + YQL+EVQ ++IL MRLHRLTGLEQ+KL++EY+Q+LE I GLI ILE P LL VIRDE Sbjct 543 EGYQLSEVQAQEILAMRLHRLTGLEQEKLSDEYRQVLETIRGLIEILEDPARLLTVIRDE 722 Query 505 LLKIREEYGDVRRTEIRHSEEDLDILDLIAPEDVVVTLSHAGYAKRQPVSayraqkrggr 564 L ++EE+GD RRTEI+HS+EDL++LDLIAPEDVVVTLSH GY KRQP S YRAQ+RGG+ Sbjct 723 LEAVKEEFGDKRRTEIQHSQEDLNVLDLIAPEDVVVTLSHTGYVKRQPASTYRAQRRGGK 902 Query 565 graaVTTKEEDFIDHLWLVNTHDTLLTFTSTGKVFWLSVYQLPEAGSNARGRPIINWIPL 624 GR+A K+EDF++ LW+VNTHDTLLTFTS+G+V+WL+VYQLPE+G NARG+P++N +PL Sbjct 903 GRSASALKDEDFVEQLWVVNTHDTLLTFTSSGRVYWLNVYQLPESGPNARGKPMVNLLPL 1082 Query 625 EPGEKVQAVLPVREYAENHYVFFATRQGTVKKTPLSEFAFRLARGKIAINLDQGDALIGV 684 GEKVQAVLPVREY E+ +VFFAT+QGTVKKTPL+EFAF+L +GKIAINLD+GDAL+ V Sbjct 1083 GQGEKVQAVLPVREYTEDQFVFFATKQGTVKKTPLTEFAFQLQKGKIAINLDEGDALVNV 1262 Query 685 ALTDGERDVLLFASNGKTVRFSENTVRSMGRTATGVRGIKLTEGEEVVSLIIAEPATGVD 744 ALT G DVLLFASNGKTVRF E+ VRSMGRTATGVRG+KL EG EVVSLI+A Sbjct 1263 ALTGGNSDVLLFASNGKTVRFDESEVRSMGRTATGVRGMKLGEGAEVVSLIVA------- 1421 Query 745 MLEEAEETADDDIQTANTAESVHIDPTQDDILCILTATENGYGKCTPLAHYPRKGRGTQG 804 A+ D ILTATE GYGK T L +P+KGRGTQG Sbjct 1422 --------AEGD---------------------ILTATERGYGKRTMLDEFPKKGRGTQG 1514 Query 805 VIGIQTTERNGRLVAAVLLGATDEVLLISDGGTLVRTRGSEISRVGRNTQGVTLIRLSNG 864 VIGIQ +ERNG LVAA+ E++LISD GTLVRTR +E+S++GRNTQGVTLIRL + Sbjct 1515 VIGIQCSERNGNLVAAIQATEAHELMLISDQGTLVRTRVAEVSQLGRNTQGVTLIRLPSD 1694 Query 865 EKLQAVERLDA 875 EKL +V RLDA Sbjct 1695 EKLVSVVRLDA 1727 Query= DNA gyrase subunit B [Xy fast 9a5c] Length=814 Score E Sequences producing significant alignments: (Bits) Value 00735 681 0.0 00178 395 2e-130 00603 234 4e-068 00188 171 1e-047 Repeat-08211 51.6 1e-007 00639 30.4 0.96 01321 29.6 1.5 > 00735 Length=3864 Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust. Identities = 369/479 (77%), Positives = 417/479 (87%), Gaps = 0/479 (0%) Frame = -1 Query 16 YDSSKITVLRGLDAVRKRPGMYIGDVHDGTGLHHMVFEVVDNSVDEALAGHADSILVKIH 75 YDSS I VL+GL+AVRKRPGMYIGD DGTGLHHMVFEVVDNS+DEALAG+ D + V I Sbjct 1455 YDSSNIKVLKGLEAVRKRPGMYIGDTDDGTGLHHMVFEVVDNSIDEALAGYCDHVTVTIL 1276 Query 76 VDGSVSVSDNGRGIPVDIHKEEGVSAAEVILTVLHAGGKFDDNSYKvsgglhgvgvsvvN 135 DGSVSVSDNGRGIPVD H EEG S AEV++TVLHAGGKFD NSYKVSGGLHGVGVSVVN Sbjct 1275 DDGSVSVSDNGRGIPVDTHPEEGRSTAEVVMTVLHAGGKFDANSYKVSGGLHGVGVSVVN 1096 Query 136 ALSERLWLDIWRDGYHYQQEYVLGEPQYPLKQLGVSAKRGTTLRFKPAKEIFSDVEFHYE 195 ALS LWL I+R+G YQQEY GEP YP+K +G S KRGTT+RF P+ F++VEFHY+ Sbjct 1095 ALSSHLWLTIYREGKEYQQEYAHGEPLYPIKPVGDSTKRGTTVRFLPSTGTFTNVEFHYD 916 Query 196 NLAKRLRELSFLNSGLQVSLIDERGEGRRDDFHYEGGIRSFVEHLAQLKTPLHSNVISVT 255 LAKRLREL+FLNSG+ + L DERGEGR D F YEGGIRSFV+HLAQLKT LH NVIS++ Sbjct 915 ILAKRLRELAFLNSGVTIDLKDERGEGRSDRFAYEGGIRSFVQHLAQLKTALHPNVISLS 736 Query 256 GEHNGIVVDVALQWTDAYQETMYCFTNNIPQKDGGTHLAGFRAALTRTLGNYIEQNGIAR 315 GI V++A+QWTDAYQETMYCFTNNIPQ+DGGTHL GFRAALTR+L +YIE+ G+A+ Sbjct 735 AMQEGISVELAMQWTDAYQETMYCFTNNIPQRDGGTHLTGFRAALTRSLQSYIEKEGLAK 556 Query 316 QAKITFSGDDMREGMIAVLSVKVPEPSFSSQTKEKLVSSDVKPAVEATFGLRLEEFLQEN 375 AK+T SGDDMREGMIAVLSVKVP+P FSSQTK+KLVSS+VK AVE +L EFL E+ Sbjct 555 NAKVTLSGDDMREGMIAVLSVKVPDPKFSSQTKDKLVSSEVKTAVEQAVNEKLGEFLLEH 376 Query 376 PNEARAIAGKIVDaarareaarkarDLTRRKGVLDIAGLPGKLADCQEKDPAMSELFIVE 435 PNEA+AIA K+VDAARAREAARKAR++TRRKG LDIAGLPGKLADCQEKDPA+ ELF+VE Sbjct 375 PNEAKAIASKVVDAARAREAARKAREMTRRKGALDIAGLPGKLADCQEKDPALCELFLVE 196 Query 436 GDSAGGSAKQGRNRKNQAVLPLRGKILNVERARFDRMLSSAEVgtlitalgtgigKDEY 494 GDSAGGSAKQGRNRK QAVLPL+GKILNVE+ARFD+ML+SAEVGTLITALGTGIGK+EY Sbjct 195 GDSAGGSAKQGRNRKTQAVLPLKGKILNVEKARFDKMLASAEVGTLITALGTGIGKEEY 19 Query= DNA polymerase I Length=923 Score E Sequences producing significant alignments: (Bits) Value 00976 1119 0.0 00408 130 3e-031 > 00976 Length=9530 Score = 1119 bits (2895), Expect = 0.0, Method: Compositional matrix adjust. Identities = 584/934 (63%), Positives = 690/934 (74%), Gaps = 17/934 (2%) Frame = -2 Query 1 MSKLVLIDGSSYLYRAFHALPPLSNAAGEPTGALFGVVNMLRTTLKERPDYAAFVIDAPG 60 M+KL+LIDGSSYLYRAFHALPPL+N+ GEPTGALFGVVNMLR TLK +PDY AFV DAPG Sbjct 7678 MAKLILIDGSSYLYRAFHALPPLTNSQGEPTGALFGVVNMLRATLKAKPDYVAFVSDAPG 7499 Query 61 KTFRDALDSEYKMHRPPMPDDLRVQVEPMCGIVQALGIDILCIDGVEADDVIGTLALAAV 120 TFR+ L +YK +RPPMPDDLR QV+PM IV ALG IL + GVEADDVIGTL A Sbjct 7498 PTFRNQLYDQYKTNRPPMPDDLRAQVDPMLAIVGALGFPILRVAGVEADDVIGTLTEQAH 7319 Query 121 ADGIAVTISTGDKDFAQLVRPGVELVNTMTGSRMDSAMAVINKFGVAPDQIIDFLALMGD 180 A GI V ISTGDKD AQLVRPGV LVNTMT + DSA V++KFGV P QI+DFL+L GD Sbjct 7318 AQGIEVEISTGDKDLAQLVRPGVTLVNTMTNTTTDSA-GVMDKFGVQPSQIVDFLSLTGD 7142 Query 181 AVDNVPGVDKCGPKTAAKWLAEYGSLDAVIANADKIKGKIGDnlrvvlprlllnralITI 240 VDNVPGV KCGPKTAAKWLAEYG+LD +IANADK+ GKIG++LR LP+L L+R L+TI Sbjct 7141 TVDNVPGVPKCGPKTAAKWLAEYGTLDNLIANADKVGGKIGESLRAALPQLPLSRDLVTI 6962 Query 241 KTDVVLEKCPQDLALRERDTEALRGFYQRYGFTQALKELDGGSVAVTVAEQEPGRGRNAG 300 K DV LE+ L RER EALR Y RY F ALK++D + A P R +AG Sbjct 6961 KLDVPLEESVSQLTFRERHDEALRELYTRYEFKAALKDMDAEAGATPPVGAHPVR--DAG 6788 Query 301 FHVAPDASV-------AMEINPA----LSAPGEYETIFTVEQLQDWIVRLRVAGQFALDT 349 V+ +A+ A+ P L+ G+YE + T Q + W+ ++ A + DT Sbjct 6787 GAVSGNATGTASRAQGALPQEPEHAANLAMAGQYELVTTQPQFESWLAKIATAPLVSFDT 6608 Query 350 ETDSLDPMQAVLIGLSFAAEVGCAAYLPFGHDYPGAPVQLDRGQALALLQPLLEDAAVRK 409 ET S+D MQA ++GLS + E G A Y+P HDYPGAP QL R Q LA L+P EDA K Sbjct 6607 ETTSIDSMQADIVGLSLSVEPGHACYVPLAHDYPGAPAQLSREQVLATLKPFFEDATRPK 6428 Query 410 VGQHGKYDMHVLRRYGIVLAGYADDTLLESFVLTSGHARHDMDTLARRHLGYETMKYVDL 469 +GQH KYD+++L YGI L G D++LES++ + RHDMD+LAR++LG ET+KY + Sbjct 6427 MGQHAKYDINILSNYGIHLRGLKHDSMLESYIWNATATRHDMDSLARKYLGVETIKYEQV 6248 Query 470 VGKGAKQIPFSHVGVQEATRYAAEDADITLRLHCVLAPKLLAEPSLERVYREIEMPLVAV 529 GKGAKQIPFS V + A RYAAEDADITLRLH L PKL +EPSL VY IE+PL+ V Sbjct 6247 AGKGAKQIPFSQVDLDTACRYAAEDADITLRLHHALWPKLESEPSLRSVYENIEIPLIPV 6068 Query 530 LERIEANGVQIDAEELYKQSADLSRRMVIAQQKAMDLAGRHFNLDSPKQLQVLLFDELKL 589 L +E GV ID EL QS L +RM+ QQ+A AG FNLDSPKQLQ +LFDEL L Sbjct 6067 LASMEQRGVLIDVSELRMQSQQLGKRMLELQQEAWKGAGHEFNLDSPKQLQAVLFDELGL 5888 Query 590 PVVVKTPKGQPSTNEEALEAIVDQHALPRVILEYRSLAKLRNTYTEKLPEMIHPHTGRVH 649 VKTP GQPSTNEEAL AI D HALPR+IL+YR LAKLR+TYT+KL EM++P TGRVH Sbjct 5887 SAKVKTPTGQPSTNEEALAAIADDHALPRLILDYRGLAKLRSTYTDKLAEMVNPRTGRVH 5708 Query 650 TNYHQAGAATGRLSSSDPNLQNIPIRTDDGRRIRRAFVAPSGRKLIACDYSQIELRIMAH 709 T+YHQ ATGR+SSSDPNLQNIPIRT++GRRIR+AFVAP G ++A DYSQIELRIMAH Sbjct 5707 TSYHQGSVATGRVSSSDPNLQNIPIRTEEGRRIRQAFVAPQGWVVLAADYSQIELRIMAH 5528 Query 710 LSGDAGLVDAFESGVDVHCAIAAEVFGRGIDEVTLNERRAAKAINFGLIYGMSAFGLARQ 769 LSGD GLV AF+ G DVH A AAEVFG ++V+ N+RRAAKAINFGL+YGMSAFGLARQ Sbjct 5527 LSGDEGLVKAFKEGGDVHRATAAEVFGLKPEDVSANQRRAAKAINFGLMYGMSAFGLARQ 5348 Query 770 LGISRGEAQDYIALYFDRFAGVRDFMENTRQQARDRGYVETVFGRRLYLNSIASGNQTQR 829 LG+ RGEA DY+A YF R+ GV FME TRQQA GYVET+FGRRLYL ++ S NQ R Sbjct 5347 LGVDRGEASDYMARYFARYPGVHAFMEATRQQAHRDGYVETLFGRRLYLENLQSRNQALR 5168 Query 830 AGAERAAINAPMQGTAADIIKRAMVAVDGWLGVGGHRERALMILQVHDELVFEADEGFVP 889 AGAERAA+NAPMQGTAADIIKRAM+AV WL R+ A M++QVHDELVFE + V Sbjct 5167 AGAERAAVNAPMQGTAADIIKRAMLAVHDWLQP---RDDAHMLMQVHDELVFEVRQDVVE 4997 Query 890 TLVHEVSARMAAAAQLRVPLVVDVGVGHHWDEAH 923 + V ARM+ AA+L VPL+V+ GVG +WDEAH Sbjct 4996 EVRAGVVARMSGAAELSVPLLVEAGVGKNWDEAH 4895 Query= DNA-directed RNA polymerase subunit beta\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
Example Output File
>alanyl-tRNA synthetase [testorganism] LPSRMKTAEIRSTFLDYFRSKGHTIVPSSSLVPASDPTLLFTNSGMVQFKNVFLGSEKLP YVRAADVQRCLRAGGKHNDLDSVGYTARHHTFFEMLGNWSFGDYFKRDAIAYAWELLTDV LKLPKDKLWVTVYHTDDEAFDIWNKEVGVPAERIVRIGDNKGAPYASDNFWQMADTGPCG PCTEIFFDHGEEIAGGPPGSPDEDGDRYIEIWNLVFMQFDRAPDGTLSPLPAPCVDTGMG LERLAAVLQHVHSNYEIDLFEHLIKVAAQLTHTKDLANKSLRVIADHIRACSFLIVDGVL PSNEGRGYVLRRIIRRALRHGWMLGVRGDFFWKMVQPLVEEMGEAYPELAQKQAFVEEAL RTEERRFGETLENGMRLFDAVAAK-SNGSIPGADAFRLYDTYGFPVDLTADIARERGLTV DMDGFEQSMKEQQERSREGGKFEAKGQMPAELASQLQPTAFLGYDALMSQGSKVVGIVRG GKQYDQLGEGEEALVMLDRTPFYAESGGQVGDTGVLVNTTG-SFAVADTLKMGGVFFGHA GRWSGKQALRVGDVVDANVDGTRRQAIVLNHSATHLLHAALRKVLGEHVTQKGSLVAPER LRFDFSHFKPMSADELRQIEMLVNAEVRRNAAAEVHHMGYNEAIEFGAMALFGEKYGDEV RVLRMGEFSTELCGGTHVGRTGDIGLFKIVSEAGVASGVRRIEAVTGAGALAYVADEERR LGEFAQLLSSNGDEAVEKLRQLFDKQKKLERELESLRAKAAGSATADLASSAKDVAGIKV IAARLEGLDAKALRDSMDQLKQQLGDCVVLLAGAADGRVSLVAGVGG-KALGRVKAGDVV AHVASQIDGKGGGRPDMAQGGGSDTSELPGILEGLADWIGQRL >cell division protein [testorganism] >chaperonin GroEL [testorganism] MAAKEVRFGEDVRARMLKGVNTLANAVKVTLGPKGRNVVIEKSFGAPTVTKDGVSVAKEI ELADKYENIGAQIVKEAASKTSDVAGDGTTTATVLAQAFIQEGLKAVAAGMNPMDLKRGI DQAVNAAVTELKKLSNPTADDKAIAQVGTISANSDANIGDIIATAMKKVGKEGVITVEEG SGLENELDVVEGMQFDRGYLSPYFINNQQSQQVELDDPFILIHDKKVSNVRELLPVLEAV AKAGKPLLIVAEEVEGEALATLVVNTIRGIVKVAAVKAPGFGDRRKAILEDIAILTNGVV ISEEVGLALEKATITDLGRAKRVVITKENTTIIDGAGEAERIQSRIGQIKAQIEETSSDY DREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDALHATRAAVEEGVVPGGGVALI RSLKALEGLKGQNTDQDLGIAITRRALEAPLRAIVSNAGDEPSVVLNKVKEGNGNFGYNA ANGEFGDMIAFGILDPTKVTRSALQFAASVAGSIITTEAAVTEVPKKDEGHSHGAPGGMG GMGGMDF >dimethyladenosine transferase [testorganism] >DNA gyrase subunit A [testorganism] YQVNKARLIEKIAELVKEKKLEGISELRDESDKDGMRVVIEIRRDSMGDVVLNNLFQQTQ LQVTFGINMVALLDGQPRLLNLKDILEAFIRHRREVVTRRTIFDLRKARARAHILEGLTV ALANIDEMIELIRTSASPAEARERMLARKWQPGMVATLLEAAGAEASRPEDMDPREGLKA EGYQLSEVQAQEILAMRLHRLTGLEQEKLSDEYRQVLETIRGLIEILEDPARLLTVIRDE LEAVKEEFGDKRRTEIQHSQEDLNVLDLIAPEDVVVTLSHTGYVKRQPASTYRAQRRGGK GRSASALKDEDFVEQLWVVNTHDTLLTFTSSGRVYWLNVYQLPESGPNARGKPMVNLLPL GQGEKVQAVLPVREYTEDQFVFFATKQGTVKKTPLTEFAFQLQKGKIAINLDEGDALVNV ALTGGNSDVLLFASNGKTVRFDESEVRSMGRTATGVRGMKLGEGAEVVSLIVA------- --------AEGD---------------------ILTATERGYGKRTMLDEFPKKGRGTQG VIGIQCSERNGNLVAAIQATEAHELMLISDQGTLVRTRVAEVSQLGRNTQGVTLIRLPSD EKLVSVVRLDA >DNA gyrase subunit B [Xy fast 9a5c] [testorganism] >DNA polymerase I [testorganism] MAKLILIDGSSYLYRAFHALPPLTNSQGEPTGALFGVVNMLRATLKAKPDYVAFVSDAPG PTFRNQLYDQYKTNRPPMPDDLRAQVDPMLAIVGALGFPILRVAGVEADDVIGTLTEQAH AQGIEVEISTGDKDLAQLVRPGVTLVNTMTNTTTDSA-GVMDKFGVQPSQIVDFLSLTGD TVDNVPGVPKCGPKTAAKWLAEYGTLDNLIANADKVGGKIGESLRAALPQLPLSRDLVTI KLDVPLEESVSQLTFRERHDEALRELYTRYEFKAALKDMDAEAGATPPVGAHPVR--DAG GAVSGNATGTASRAQGALPQEPEHAANLAMAGQYELVTTQPQFESWLAKIATAPLVSFDT ETTSIDSMQADIVGLSLSVEPGHACYVPLAHDYPGAPAQLSREQVLATLKPFFEDATRPK MGQHAKYDINILSNYGIHLRGLKHDSMLESYIWNATATRHDMDSLARKYLGVETIKYEQV AGKGAKQIPFSQVDLDTACRYAAEDADITLRLHHALWPKLESEPSLRSVYENIEIPLIPV LASMEQRGVLIDVSELRMQSQQLGKRMLELQQEAWKGAGHEFNLDSPKQLQAVLFDELGL SAKVKTPTGQPSTNEEALAAIADDHALPRLILDYRGLAKLRSTYTDKLAEMVNPRTGRVH TSYHQGSVATGRVSSSDPNLQNIPIRTEEGRRIRQAFVAPQGWVVLAADYSQIELRIMAH LSGDEGLVKAFKEGGDVHRATAAEVFGLKPEDVSANQRRAAKAINFGLMYGMSAFGLARQ LGVDRGEASDYMARYFARYPGVHAFMEATRQQAHRDGYVETLFGRRLYLENLQSRNQALR AGAERAAVNAPMQGTAADIIKRAMLAVHDWLQP---RDDAHMLMQVHDELVFEVRQDVVE EVRAGVVARMSGAAELSVPLLVEAGVGKNWDEAH >DNA-directed RNA polymerase subunit beta\\\\\\
SEQ_ORDER
Modifies CLUSTAL files to have the same order.
Input files must have the same number of species, and the species names
must be identical. If they are abbreviated, they must have the same
abbreviation.
Input files must have the same number of species, and the species names
must be identical. If they are abbreviated, they must have the same
abbreviation.
Enter inputs and click "Run Program" to get started.
Select one of the input files to be the "Template File." The other files will be ordered to match the chosen template.
All species names must be identical between files. Try using an abbreviation program to match up names before using this app.
If a file is missing a species from the template file, or has an extra species, it will be skipped. The program output will note which files were skipped for what reasons (Click "View Result" after program completes).
The files will be slightly modified to clean up some of the CLUSTAL information.
All species names must be identical between files. Try using an abbreviation program to match up names before using this app.
If a file is missing a species from the template file, or has an extra species, it will be skipped. The program output will note which files were skipped for what reasons (Click "View Result" after program completes).
The files will be slightly modified to clean up some of the CLUSTAL information.
View sample input/output files. All files should be in a plain-text format (.txt, .csv, .xml, etc.).
Download files related to this app.
Program Results
Example Input File
CLUSTAL X (1.83) multiple sequence alignment Gordonibacterpamelae ------------------------------------------------------------ Eggerthella ------------------------------------------------------------ Cryptobacterium ------------------------------------------------------------ SlackiaexiguaATCC700 ------------------------------------------------------------ Slackiahel ------------------------------------------------------------ AtopobiumvaginaeDSM1 ------------------------------------------------------------ Olsenellauli ------------------------------------------------------------ AtopobiumrimaeATCC49 ------------------------------------------------------------ Atopobiumparvulum ------------------------------------------------------------ Olsenellasp.oraltaxo ------------------------------------------------------------ Collinsellaintestina ------------------------------------------------------------ Collinsellastercoris ------------------------------------------------------------ CollinsellatanakaeiY ------------------------------------------------------------ Collinsellaaerofacie ------------------------------------------------------------ Coriobacteriumglomer ------------------------------------------------------------ Bacillussubtilis ------------------------------------------------------------ Rubrobacter ------------------------------------------------------------ Conexibacter ------------------------------------------------------------ Acidimicrobium ------------------------------------------------------------ Clostridium MIVGNLEGIRKTILKKLESIYDFKVDRQSIANEQIIQIISEITGDINKEISVAIDRKGNI Gordonibacterpamelae ------------------------------------------------------------ Eggerthella ------------------------------------------------------------ Cryptobacterium ------------------------------------------------------------ SlackiaexiguaATCC700 ------------------------------------------------------------ Slackiahel ------------------------------------------------------------ AtopobiumvaginaeDSM1 ------------------------------------------------------------ Olsenellauli ------------------------------------------------------------ AtopobiumrimaeATCC49 ------------------------------------------------------------ Atopobiumparvulum ------------------------------------------------------------ Olsenellasp.oraltaxo ------------------------------------------------------------ Collinsellaintestina ------------------------------------------------------------ Collinsellastercoris ------------------------------------------------------------ CollinsellatanakaeiY ------------------------------------------------------------ Collinsellaaerofacie -----------------------------------------------------MQFEVPV Coriobacteriumglomer ------------------------------------------------------------ Bacillussubtilis ------------------------------------------------------------ Rubrobacter ----------------------------------------------------------ML Conexibacter ------------------------------------------------------------ Acidimicrobium ------------------------------------------------------------ Clostridium LSVAVGDSSTVEMPIIDIKSKKLSGVRIIHTHPNGNSRLSSLDVSALIALKLDCMVAVAV Gordonibacterpamelae -----MSLSIGIVGLPN------------VGKSTLFTALTNKGGLAANYPFATIEPNVG- Eggerthella -----MSLSIGIVGLPN------------VGKSTLFTALTNKGGLAANYPFATIEPNVG- Cryptobacterium -----MSLSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIEPNVG- SlackiaexiguaATCC700 -----MSLSIGIVGLPN------------VGKSSLFTALTKKGGLAANYPFATIDPNVG- Slackiahel -----MSLSIGIVGLPN------------VGKSSLFTALTKKTGLAANYPFATIDPNVG- AtopobiumvaginaeDSM1 -----MSLSIGIVGLPN------------VGKSTLFTALTKQTGLAANYPFATIDPNVG- Olsenellauli -----MSLSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- AtopobiumrimaeATCC49 -----MSLSIGIVGLPN------------VGKSTLFTALTRKGGLAANYPFATIDPNVG- Atopobiumparvulum -----MSLSIGIVGLPN------------VGKSTLFTALTRKGGLAANYPFATIDPNVG- Olsenellasp.oraltaxo -----MALSIGIVGLPN------------VGKSTLFTALTRKGGLAANYPFATIDPNVG- Collinsellaintestina -----MALSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- Collinsellastercoris -----MALSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- CollinsellatanakaeiY -----MALSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- Collinsellaaerofacie SKGAVVSLSIGIVGLPN------------VGKSTLFTALTKKTGLAANYPFATIDPNVG- Coriobacteriumglomer -----MSLSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- Bacillussubtilis -----MALTAGIVGLPN------------VGKSTLFNAITQAGAESANYPFCTIDPNVG- Rubrobacter PSDTIPPMKVGIVGLPN------------VGKSTLFNALTRAGAEAQNYPFTTVDPNVG- Conexibacter -------MKVGIVGMPN------------AGKSSLFNALTRAGAEAANYPFTTIEPNVA- Acidimicrobium ------MDKLGLVGLAN------------AGKSTLFNALTGLDTPVAPHPFTTTDTTIA- Clostridium EDGKCKDVTVGFCGIDNNTLIAEVAPNLPLDKALNINILNVVKNIEINLSSNEVEEDKGE . *: *: * .*: :. :. . : . Gordonibacterpamelae ----IVPVPDARLDALAEIDHPARIVPAT-------------------VEFVDIAGLVAG Eggerthella ----VVPVPDARLDALANIDHPTRIVPAT-------------------IEFVDIAGLVAG Cryptobacterium ----VVPVPDARLNRLAEIDHPARIVPAT-------------------VEFVDIAGLVAG SlackiaexiguaATCC700 ----IVAVPDARLDALAAIDHPAKIVPAT-------------------VEFVDIAGLVAG Slackiahel ----MVPVPDTRLEELAKIDHPAKIIPAT-------------------VEFVDIAGLVAG AtopobiumvaginaeDSM1 ----IVQVPDTRLNQLAQIVHPAQIVPAT-------------------VEFVDIAGLVRG Olsenellauli ----VVDVPDERLQRLAEMANPAKVVPAT-------------------VEFVDIAGLVKG AtopobiumrimaeATCC49 ----IVDVPDARLQKLADIVHPGRIVPAT-------------------VEFVDIAGLVKG Atopobiumparvulum ----IVDVPDARLQKLAEIVNPGRIMPAT-------------------VEFVDIAGLVKG Olsenellasp.oraltaxo ----IVDVPDARLQRLADIVHPARIVPAT-------------------VEFVDIAGLVKG Collinsellaintestina ----IVDVPDDRLNALAKIVNPARILPAT-------------------VEFVDIAGLVKG Collinsellastercoris ----IVDVPDDRLQALADIVHPGRIVPAT-------------------VEFVDIAGLVKG CollinsellatanakaeiY ----IVDVPDARLQQLADIVHPGRIVPAT-------------------VEFVDIAGLVKG Collinsellaaerofacie ----IVDVPDSRLQKLADIVNPGRIVPAT-------------------VEFVDIAGLVKG Coriobacteriumglomer ----VVDVPDDRLDRLAKIARPGRIVPAT-------------------VEFVDIAGLVKG Bacillussubtilis ----IVEVPDDRLQKLTELVNPKKTVPTA-------------------FEFTDIAGIVKG Rubrobacter ----VAAVPDGRLQRLAEAVGGVRAVPAT-------------------VEFVDIAGLVRG Conexibacter ----VVPVEDERIDALAELLGASEIVADS-------------------IDFHDIAGLVRG Acidimicrobium ----EAVVPDERVDALAAIHHSRKLVYAH-------------------MQLADIAGLTAG Clostridium RAVLVGIENEESLDELCELAKACNVVTVDRVMQRRVKIDTAYFIGEGKVEELSMVRQASN : :: * . : .: .:. . . Gordonibacterpamelae ---------------ASQGEGLGNQFLAN--------------------IRETDAICEVV Eggerthella ---------------ASQGEGLGNQFLAN--------------------IRETDAICEVV Cryptobacterium ---------------ASQGEGLGNQFLAN--------------------IRETDAICEVV SlackiaexiguaATCC700 ---------------ASQGEGLGNKFLAN--------------------IRETDAICEVV Slackiahel ---------------ASQGEGLGNKFLAN--------------------IRETDAICEVV AtopobiumvaginaeDSM1 ---------------ANNGEGLGNQFLAN--------------------IRQCDAICEVV Olsenellauli ---------------ANAGEGLGNQFLAN--------------------IRNCDAICEVV AtopobiumrimaeATCC49 ---------------ANEGEGLGNQFLAN--------------------IRQTDAICEVV Atopobiumparvulum ---------------ANEGEGLGNQFLAN--------------------IRNTDAICEVV Olsenellasp.oraltaxo ---------------ANEGEGLGNQFLSN--------------------IRNTDAICEVV Collinsellaintestina A--------------ASEGAGLGNQFLAN--------------------IRECDAICQVV Collinsellastercoris A--------------AAEGAGLGNQFLAN--------------------IRECDAICQVV CollinsellatanakaeiY A--------------AAEGAGLGNQFLAN--------------------IRECDAICEVV Collinsellaaerofacie ---------------ANEGEGLGNQFLAN--------------------IRETDAICEVV Coriobacteriumglomer ---------------ANEGEGLGNQFLAN--------------------IRETDAICEVV Bacillussubtilis ---------------ASKGEGLGNKFLSH--------------------IRQVDAICHVV Rubrobacter ---------------ASRGEGLGNRFLAH--------------------IRECDAVAHVV Conexibacter ---------------AHEGEGLGNQFLAN--------------------IRETDAIIHVV Acidimicrobium ---------------SSQGAGLGNRFLGQ--------------------LREADAILYVL Clostridium ANLIIFDDELSASQIRNLESATGTKVIDRTTLILEIFARRAKSKEAKIQVELAQLKYRLP . *.:.: . :. : : Gordonibacterpamelae RFFGDPDVVHVAGK----VDPLSDVDTIKTELILADMATLEKALPRLEKEAK-RDKAGAA Eggerthella RFFGDPDVVHVAGR----VDPSSDVDTIKTELMLADMATIEKALPRLEKEAK-RDKDGAK Cryptobacterium RFFSDPDVVHVAGK----VDPQSDVDTIKTELILADVATLEKALPRLEKEAK-RDKSLAT SlackiaexiguaATCC700 RFFSDPNVEHVAHK----VDPRSDVDTIKTELILADLGTLERAIPRLEKEAR-RDKAGAF Slackiahel RFFSDPNVEHVSKK----VDPQSDVDTIKTELILADMATLEKAIPRLEKEAK-RDKDGVF AtopobiumvaginaeDSM1 RYFSDPDVIHVDGQ----VDPESDVDTIQTELVLADLGTLERAIPKLEKEAK-RDKDQKS Olsenellauli RYFKDPDVVHVDGR----VDPASDADTIITELILADLGSLERSIPKLEKEAK-RDKDNLP AtopobiumrimaeATCC49 RYFSDPDVVHVEGR----VDPDQDVDIIQTELILADLGTLERALPKLEKEAK-RDKDLQP Atopobiumparvulum RYFKDPDVIHVEGR----VNPDEDVDIIQTELILADLGTIERALPKLEKEAK-RDKDLQP Olsenellasp.oraltaxo RYFSDPDVVHVDGR----VDPGSDADTIQTELVLADLGTLERAIPKLEKEAK-RDKEKAP Collinsellaintestina RYFKDPDVMREANHTGEFVDPASDAETIMTELILADIQTLEKQLPKLEKEAK-RDAELAP Collinsellastercoris RYFKDPDVMREVNHTGAFVDPASDAETIMTELILADIQTLEKQLPKLEKEAK-RDKELMP CollinsellatanakaeiY RYFKDPDVMREVNHTGDFVDPASDAETIMTELILADIQTLEKQLPKLEKEAK-RDKTLVP Collinsellaaerofacie RYFKDPNVMREVGRTGEFVDPAGDADTIMTELILADMGTLEKQLPKLEKEAK-RDKELMP Coriobacteriumglomer RYFKDDDVVHVDGR----VDPAADAETIMCELILADIGTLERQLPKLEKESR-RDREVAA Bacillussubtilis RAFSDDNITHVSGK----VDPIDDIETINLELILADMETVEKRITRVSKLAKQKDKDAVF Rubrobacter RCFEDENVAHVHGG----IDPVGDAETVNAELLLADLATVERRLEQASRAAKSGDPKRRA Conexibacter RAHHDDNVIHPEGR----VDPLSDIDTIETELIFADLEQAERRHARVVRAARGGDKVAIA Acidimicrobium RAFHDDRVPGD-------DDPIANLDALELELTLADLASVESALERRRKVAR-SDPSARP Clostridium RLIGMGAVLSRTGAGIGTRGPGEKKLEIDKRHIRERIYDLNKELAKIKKNRQVQREKRSK * : .* . : . : : : : : Gordonibacterpamelae KVAAAQKVFEGLNEGHR---------TRTLDLSDDERAALHDLHLLTMKPMLYIANVDED Eggerthella KVEVAKKVLAGLDEGHR---------ARTLGLDEDEQAAIYDLHLLTMKPMLYIANVDED Cryptobacterium RVEVAQKVMAGLNEGHR---------ARTLGLSHEETAAIYDLHLLTMKPMLYIANVDED SlackiaexiguaATCC700 KLEVAKKVAEGLNEGHR---------ARTLGLSVEEAAAVKDLCLLTMKPMMYIANVDEG Slackiahel KLETAKKVYAGLEEEHR---------ALTLGLSDDEKAAIKDLCLLTMKPMLYIANVDED AtopobiumvaginaeDSM1 RLDIAVRLQAWLNDGKR---------AADMDMTLDEKADAHDLFLLTMKPLLYVANCDED Olsenellauli RLQIARRLQSWLDEGNR---------AATLDMTDDERAAAHDLFLLTMKPILYVANCDED AtopobiumrimaeATCC49 RLSVAKRLQEWLNEGKR---------AADMDMTDDERAAAHDLFLLTMKPILYVANVDED Atopobiumparvulum KLNVAKRLQEWLNEGNR---------AADMEMTDEERLAARDLFLLTMKPILYVANVDED Olsenellasp.oraltaxo NLAIAKRLQDWLNEGKR---------AAELDMSDDERAAAHDLFLLTMKPMLYVANVDED Collinsellaintestina KLAVAKRLVEWLNEGNR---------AITMEMTDEERAAARGLFLLTMKPMLYVANVDED Collinsellastercoris KLEIAKRLVAWLNEGNR---------AITLEMTDEERAAAKGLFLLTMKPMLYVANVDED CollinsellatanakaeiY KLEIAKRLVDWLNEGNR---------AAALGMTDEERAAAKGLFLLTMKPMLYVANVDED Collinsellaaerofacie KFEVAKRLLAWLNEGKR---------AASMEMTDEERAAAKGLFLLTMKPILYVANVDED Coriobacteriumglomer RLDLAKRLIAWLDEGHR---------AAEMQMTDEERERARELFLLTMKPMLYVANIDED Bacillussubtilis EFEILSKLKEAFESEKP---------ARSVEFTEEQQKLVKQLHLLTSKPILYVANVSED Rubrobacter EAAALERLRDHLAKGGQ---------ARTFPEPGEVSEVLRTL--LTAKPTLYVANVDEG Conexibacter EEAWLRELVAALQAGRP---------ARTVEPPADAPNAIRELQPLTAKPVLFVANVDEG Acidimicrobium EVAALEVVQAVLADGIP---------LYRATLETDVLELVRSAFLITTKPAIAVINADEG Clostridium DNVPKISLVGYTNAGKSTLRNKLCEIASPKDVVDKETVFEADMLFATLDVTTRALVLPDN : . * . :. Gordonibacterpamelae QLDAD--------------LPEIDGCAPVPISAKVEADLAELDPAEAREYLEAMGLEESG Eggerthella QLDAD--------------LPEIDGCRPVPISAKVEADLAELEPAEAKEYLEAMGLEESG Cryptobacterium ALSAD--------------LPEIDGTVPVPISAQVEADLADLEPEEAQEYLAELGLSESG SlackiaexiguaATCC700 SVNVE--------------LPEIDGQRPVPISAKIEADLSELDAEDAQMFMEELGIEEGG Slackiahel KMDME--------------LPEIDGQIPVPISAKIEADLSELEPDEAAMFMEELGIKESG AtopobiumvaginaeDSM1 QLQD---------------KPVIHGMPALPICAQVEAELSELDDAEAAEYLQSLGLQHSG Olsenellauli QLGD---------------RPQIDGTPAIPICAKVEAELSELEPEEASEYLESLGLERSG AtopobiumrimaeATCC49 ALSKP--------------APLVGGVEAIPVCAEVEAELAELDPKEAAEYLESLGLERSG Atopobiumparvulum MLTEP--------------APVINGVQSIPVCAGVEAELADLDPEEAAEYLESLGLEHSG Olsenellasp.oraltaxo KVSEV--------------PAPINGQTPIPICAEVEAELAELDAEEAAEYLESLGLEHSG Collinsellaintestina MLTED--------------LDAIEGVKPIPICAKVEAELSELDPEEAAEFLADLGLERSG Collinsellastercoris MLAED--------------LDPIDGVKPIPICAKVEAELSELDPEEAQMFLADLGLERSG CollinsellatanakaeiY ELTSE--------------FEPIDGVTPIPICAKVEAELSELEPEEAAEYLADLGLEHSG Collinsellaaerofacie MLNED--------------LAPIDGVKPLPICAKIEAELSELDPEDAADYLESLGLEQPG Coriobacteriumglomer MLDEK--------------PAPIAGAVPIPVCAKVEAELSELEPDEAREYLDELGLERSG Bacillussubtilis EVADPSGNE---NVAKIREYAAGENAEVIVVCAKIESEIAELEGEEKQMFLEELGIQESG Rubrobacter SLAAGNAYS-----AAVEELAGREGAGAVRLCARLAAELAELPAEEAREYLGVLGVEESG Conexibacter ----TDEVP-----AAIAEHAATHAAKAVAISSRIEAELSELDDEEAAVMREELGIAESG Acidimicrobium AEADVAVED---TVR----GRLGDHATVVTAPLALEAELARLEPAERAEMMEALAIGSSA Clostridium RLVTLTDTVGFIRKLPHDLVEAFKSTLEEVVNSDLLLHVVDSSSKDAYKQIEAVNFVLEE : .: : : . Gordonibacterpamelae LARLVREAYKLLG---------LQSYFTSGE-TETRAWTVPIGAKAPQAAGVIHTDFERG Eggerthella LARLVREAYKLLG---------LQSYFTTGE-QETRAWTIPVGAKAPQAAGVIHTDFERG Cryptobacterium LARLVREAYKLLG---------LQSYFTSGE-TETRAWTIPVGAKAPEAAGVIHSDFERG SlackiaexiguaATCC700 LARLIRAAYGLLG---------LQSYFTSGE-TETRAWTIPVGAKAPQAAGVIHSDFERG Slackiahel LSRLIHEAYRLLG---------LQSYFTSGP-DETRAWTIPVGAKAPQAASVIHTDFERG AtopobiumvaginaeDSM1 LEILAQAAYRLLG---------LQSFFTAGP-KEVRAWTVKIGAKAPQAAGVIHSDFERG Olsenellauli LETLAQAAYRLLG---------LQSFFTAGP-KEVRAWTVRIGAKAPEAAGVIHSDFERG AtopobiumrimaeATCC49 LATLAQAAYKLLG---------LQSYFTAGE-MEVKAWTVRIGAKAPEAAGVIHSDFERG Atopobiumparvulum LETLAQAAYKLLG---------LQSYFTAGP-MEVRAWTVRIGAKAPEAAGVIHSDFERG Olsenellasp.oraltaxo LETLAQAAYHLLG---------LQSYFTAGE-KEVKAWTVHIGAKAPEAAGVIHSDFERG Collinsellaintestina LEVLAQAAYKLLG---------LQSFFTAGE-VEVRAWTVRQGATAPQAAGVIHTDFERG Collinsellastercoris LEVLAQAAYKLLG---------LQSFFTAGE-MEVRAWTVRQGATAPQAAGVIHTDFERG CollinsellatanakaeiY LETLAQAAYRLLG---------LQSFFTAGE-MEVKAWTVRQGATAPQAAGVIHTDFERG Collinsellaaerofacie LDVLAQAAYKLLG---------LQSFFTAGE-MEVKAWTVRRGATAPQAAGVIHTDFERG Coriobacteriumglomer LEALAQAAYALLG---------LQSFFTAGE-MEVRAWTVHRGATAPQAAGVIHSDFERG Bacillussubtilis LDQLIKASYSLLG---------LATYFTAGE-QEVRAWTFKKGMKAPECAGIIHSDFERG Rubrobacter FEEFVRAAYRLLG---------LITFFTFNE-RECRAWTVREGATAREAAGRIHTDMERG Conexibacter LQRIVRGAFDLLN---------LNAFFTVGSGVRAQSWHLRRGLTAWHAAGQIHSDIQRG Acidimicrobium LERIARAAFETLE---------RWTFFTSGD-KDTHAWTFRRGSNAQTCAGIIHSDLARG Clostridium LESINKPMILLLNKIDKADKEQLEGLKEKFNNLKVLEISAKDNLNLDTLLNDICTALPNP : : : * . . . * : : . Gordonibacterpamelae FIKAETAA-FEDYVGLGGEK--GCRDAGKLRQEGKEYVVQDGDVMHFKFNV--- Eggerthella FIKAETAS-YEDYVGLGGEK--GCRDAGKLRQEGKEYVVQDGDVMHFKFNV--- Cryptobacterium FIKAETAS-FADYSELGGEA--GCRAAGKLRQEGKDYVVQDGDVMHFKFNV--- SlackiaexiguaATCC700 FIKAETAS-FDDYVSLGGEA--GCRAAGRLRQEGKDYVVQDGDVMHFKFNV--- Slackiahel FIKAETAS-YEDYVRLGGEK--GCRDAGRLRQEGKEYVVQDGDIMHFKFNV--- AtopobiumvaginaeDSM1 FIKAETIS-FDDYIELGGES--GARDAGKLRMEGKDYVVQDGDVMVFRFNV--- Olsenellauli FIKAETIG-YDDYVSLGGEQ--GAKEAGRLRMEGKDYVVQDGDVMVFRFNV--- AtopobiumrimaeATCC49 FIKAEVAS-YTDYVELGGEA--GCKAAGKLRMEGKEYVVQDGDVMHFRFNV--- Atopobiumparvulum FIKAEVAS-YNDYVELGGEA--GCKAAGKLRIEGKDYVVEDGDVMHFRFNV--- Olsenellasp.oraltaxo FIKAKTIS-YEDYVELGGEA--GAREAGRLRMEGKDYVVQDGDVMEFMFNV--- Collinsellaintestina FIKAETIA-FEDYVALGGEK--GAKEAGRLRMEGKDYIMQDGDVVHFRFNV--- Collinsellastercoris FIKAETIA-FDDYIELGGEQ--GAKAAGRLRMEGKDYVMHDGDVVHFRFNV--- CollinsellatanakaeiY FIKAETIA-FDDYVELGGEQ--GAKAAGRLRMEGKDYVMHDGDVVHFRFNV--- Collinsellaaerofacie FIKAEVIG-YDDYIELGGEQ--GAKAAGKLRIEGKEYVMADGDVVHFRFNV--- Coriobacteriumglomer FIKAEVIA-YSDYIEYGGEQ--GARSVGRLRMEGKEYVMADGDVVHFRFNV--- Bacillussubtilis FIRAETVA-YEDLLAGGGMA--GAKEAGKVRLEGKEYVVQDGDVIHFRFNV--- Rubrobacter FVAAEVGR-WEDIVAAGSWA--RAREEAKVRREGRDYVMRDGDVLLVRFNA--- Conexibacter FVRAEVIG-WRELIDAGGYN--AARERGTLRVEGRDYVMQDGDVITVKFTP--- Acidimicrobium FIRAEVAS-WRDVVEAGSWT--RAKAQNKVRLEGRDYLVADGDVLEIRFNV--- Clostridium LKKVEFLIPYSDSASVAMLHRNGKVLEEEYKDNGTRIIAMVDDKIYNKCEKYVI : .: : : . : :* : .* :
Example Output File
Gordonibacterpamelae ------------------------------------------------------------ Eggerthella ------------------------------------------------------------ Cryptobacterium ------------------------------------------------------------ SlackiaexiguaATCC700 ------------------------------------------------------------ Slackiahel ------------------------------------------------------------ Olsenellasp.oraltaxo ------------------------------------------------------------ Olsenellauli ------------------------------------------------------------ AtopobiumvaginaeDSM1 ------------------------------------------------------------ AtopobiumrimaeATCC49 ------------------------------------------------------------ Atopobiumparvulum ------------------------------------------------------------ Collinsellaintestina ------------------------------------------------------------ Collinsellastercoris ------------------------------------------------------------ CollinsellatanakaeiY ------------------------------------------------------------ Collinsellaaerofacie ------------------------------------------------------------ Coriobacteriumglomer ------------------------------------------------------------ Bacillussubtilis ------------------------------------------------------------ Clostridium MIVGNLEGIRKTILKKLESIYDFKVDRQSIANEQIIQIISEITGDINKEISVAIDRKGNI Conexibacter ------------------------------------------------------------ Rubrobacter ------------------------------------------------------------ Acidimicrobium ------------------------------------------------------------ Gordonibacterpamelae ------------------------------------------------------------ Eggerthella ------------------------------------------------------------ Cryptobacterium ------------------------------------------------------------ SlackiaexiguaATCC700 ------------------------------------------------------------ Slackiahel ------------------------------------------------------------ Olsenellasp.oraltaxo ------------------------------------------------------------ Olsenellauli ------------------------------------------------------------ AtopobiumvaginaeDSM1 ------------------------------------------------------------ AtopobiumrimaeATCC49 ------------------------------------------------------------ Atopobiumparvulum ------------------------------------------------------------ Collinsellaintestina ------------------------------------------------------------ Collinsellastercoris ------------------------------------------------------------ CollinsellatanakaeiY ------------------------------------------------------------ Collinsellaaerofacie -----------------------------------------------------MQFEVPV Coriobacteriumglomer ------------------------------------------------------------ Bacillussubtilis ------------------------------------------------------------ Clostridium LSVAVGDSSTVEMPIIDIKSKKLSGVRIIHTHPNGNSRLSSLDVSALIALKLDCMVAVAV Conexibacter ------------------------------------------------------------ Rubrobacter ----------------------------------------------------------ML Acidimicrobium ------------------------------------------------------------ Gordonibacterpamelae -----MSLSIGIVGLPN------------VGKSTLFTALTNKGGLAANYPFATIEPNVG- Eggerthella -----MSLSIGIVGLPN------------VGKSTLFTALTNKGGLAANYPFATIEPNVG- Cryptobacterium -----MSLSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIEPNVG- SlackiaexiguaATCC700 -----MSLSIGIVGLPN------------VGKSSLFTALTKKGGLAANYPFATIDPNVG- Slackiahel -----MSLSIGIVGLPN------------VGKSSLFTALTKKTGLAANYPFATIDPNVG- Olsenellasp.oraltaxo -----MALSIGIVGLPN------------VGKSTLFTALTRKGGLAANYPFATIDPNVG- Olsenellauli -----MSLSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- AtopobiumvaginaeDSM1 -----MSLSIGIVGLPN------------VGKSTLFTALTKQTGLAANYPFATIDPNVG- AtopobiumrimaeATCC49 -----MSLSIGIVGLPN------------VGKSTLFTALTRKGGLAANYPFATIDPNVG- Atopobiumparvulum -----MSLSIGIVGLPN------------VGKSTLFTALTRKGGLAANYPFATIDPNVG- Collinsellaintestina -----MALSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- Collinsellastercoris -----MALSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- CollinsellatanakaeiY -----MALSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- Collinsellaaerofacie SKGAVVSLSIGIVGLPN------------VGKSTLFTALTKKTGLAANYPFATIDPNVG- Coriobacteriumglomer -----MSLSIGIVGLPN------------VGKSTLFTALTKKGGLAANYPFATIDPNVG- Bacillussubtilis -----MALTAGIVGLPN------------VGKSTLFNAITQAGAESANYPFCTIDPNVG- Clostridium EDGKCKDVTVGFCGIDNNTLIAEVAPNLPLDKALNINILNVVKNIEINLSSNEVEEDKGE Conexibacter -------MKVGIVGMPN------------AGKSSLFNALTRAGAEAANYPFTTIEPNVA- Rubrobacter PSDTIPPMKVGIVGLPN------------VGKSTLFNALTRAGAEAQNYPFTTVDPNVG- Acidimicrobium ------MDKLGLVGLAN------------AGKSTLFNALTGLDTPVAPHPFTTTDTTIA- Gordonibacterpamelae ----IVPVPDARLDALAEIDHPARIVPAT-------------------VEFVDIAGLVAG Eggerthella ----VVPVPDARLDALANIDHPTRIVPAT-------------------IEFVDIAGLVAG Cryptobacterium ----VVPVPDARLNRLAEIDHPARIVPAT-------------------VEFVDIAGLVAG SlackiaexiguaATCC700 ----IVAVPDARLDALAAIDHPAKIVPAT-------------------VEFVDIAGLVAG Slackiahel ----MVPVPDTRLEELAKIDHPAKIIPAT-------------------VEFVDIAGLVAG Olsenellasp.oraltaxo ----IVDVPDARLQRLADIVHPARIVPAT-------------------VEFVDIAGLVKG Olsenellauli ----VVDVPDERLQRLAEMANPAKVVPAT-------------------VEFVDIAGLVKG AtopobiumvaginaeDSM1 ----IVQVPDTRLNQLAQIVHPAQIVPAT-------------------VEFVDIAGLVRG AtopobiumrimaeATCC49 ----IVDVPDARLQKLADIVHPGRIVPAT-------------------VEFVDIAGLVKG Atopobiumparvulum ----IVDVPDARLQKLAEIVNPGRIMPAT-------------------VEFVDIAGLVKG Collinsellaintestina ----IVDVPDDRLNALAKIVNPARILPAT-------------------VEFVDIAGLVKG Collinsellastercoris ----IVDVPDDRLQALADIVHPGRIVPAT-------------------VEFVDIAGLVKG CollinsellatanakaeiY ----IVDVPDARLQQLADIVHPGRIVPAT-------------------VEFVDIAGLVKG Collinsellaaerofacie ----IVDVPDSRLQKLADIVNPGRIVPAT-------------------VEFVDIAGLVKG Coriobacteriumglomer ----VVDVPDDRLDRLAKIARPGRIVPAT-------------------VEFVDIAGLVKG Bacillussubtilis ----IVEVPDDRLQKLTELVNPKKTVPTA-------------------FEFTDIAGIVKG Clostridium RAVLVGIENEESLDELCELAKACNVVTVDRVMQRRVKIDTAYFIGEGKVEELSMVRQASN Conexibacter ----VVPVEDERIDALAELLGASEIVADS-------------------IDFHDIAGLVRG Rubrobacter ----VAAVPDGRLQRLAEAVGGVRAVPAT-------------------VEFVDIAGLVRG Acidimicrobium ----EAVVPDERVDALAAIHHSRKLVYAH-------------------MQLADIAGLTAG Gordonibacterpamelae ---------------ASQGEGLGNQFLAN--------------------IRETDAICEVV Eggerthella ---------------ASQGEGLGNQFLAN--------------------IRETDAICEVV Cryptobacterium ---------------ASQGEGLGNQFLAN--------------------IRETDAICEVV SlackiaexiguaATCC700 ---------------ASQGEGLGNKFLAN--------------------IRETDAICEVV Slackiahel ---------------ASQGEGLGNKFLAN--------------------IRETDAICEVV Olsenellasp.oraltaxo ---------------ANEGEGLGNQFLSN--------------------IRNTDAICEVV Olsenellauli ---------------ANAGEGLGNQFLAN--------------------IRNCDAICEVV AtopobiumvaginaeDSM1 ---------------ANNGEGLGNQFLAN--------------------IRQCDAICEVV AtopobiumrimaeATCC49 ---------------ANEGEGLGNQFLAN--------------------IRQTDAICEVV Atopobiumparvulum ---------------ANEGEGLGNQFLAN--------------------IRNTDAICEVV Collinsellaintestina A--------------ASEGAGLGNQFLAN--------------------IRECDAICQVV Collinsellastercoris A--------------AAEGAGLGNQFLAN--------------------IRECDAICQVV CollinsellatanakaeiY A--------------AAEGAGLGNQFLAN--------------------IRECDAICEVV Collinsellaaerofacie ---------------ANEGEGLGNQFLAN--------------------IRETDAICEVV Coriobacteriumglomer ---------------ANEGEGLGNQFLAN--------------------IRETDAICEVV Bacillussubtilis ---------------ASKGEGLGNKFLSH--------------------IRQVDAICHVV Clostridium ANLIIFDDELSASQIRNLESATGTKVIDRTTLILEIFARRAKSKEAKIQVELAQLKYRLP Conexibacter ---------------AHEGEGLGNQFLAN--------------------IRETDAIIHVV Rubrobacter ---------------ASRGEGLGNRFLAH--------------------IRECDAVAHVV Acidimicrobium ---------------SSQGAGLGNRFLGQ--------------------LREADAILYVL Gordonibacterpamelae RFFGDPDVVHVAGK----VDPLSDVDTIKTELILADMATLEKALPRLEKEAK-RDKAGAA Eggerthella RFFGDPDVVHVAGR----VDPSSDVDTIKTELMLADMATIEKALPRLEKEAK-RDKDGAK Cryptobacterium RFFSDPDVVHVAGK----VDPQSDVDTIKTELILADVATLEKALPRLEKEAK-RDKSLAT SlackiaexiguaATCC700 RFFSDPNVEHVAHK----VDPRSDVDTIKTELILADLGTLERAIPRLEKEAR-RDKAGAF Slackiahel RFFSDPNVEHVSKK----VDPQSDVDTIKTELILADMATLEKAIPRLEKEAK-RDKDGVF Olsenellasp.oraltaxo RYFSDPDVVHVDGR----VDPGSDADTIQTELVLADLGTLERAIPKLEKEAK-RDKEKAP Olsenellauli RYFKDPDVVHVDGR----VDPASDADTIITELILADLGSLERSIPKLEKEAK-RDKDNLP AtopobiumvaginaeDSM1 RYFSDPDVIHVDGQ----VDPESDVDTIQTELVLADLGTLERAIPKLEKEAK-RDKDQKS AtopobiumrimaeATCC49 RYFSDPDVVHVEGR----VDPDQDVDIIQTELILADLGTLERALPKLEKEAK-RDKDLQP Atopobiumparvulum RYFKDPDVIHVEGR----VNPDEDVDIIQTELILADLGTIERALPKLEKEAK-RDKDLQP Collinsellaintestina RYFKDPDVMREANHTGEFVDPASDAETIMTELILADIQTLEKQLPKLEKEAK-RDAELAP Collinsellastercoris RYFKDPDVMREVNHTGAFVDPASDAETIMTELILADIQTLEKQLPKLEKEAK-RDKELMP CollinsellatanakaeiY RYFKDPDVMREVNHTGDFVDPASDAETIMTELILADIQTLEKQLPKLEKEAK-RDKTLVP Collinsellaaerofacie RYFKDPNVMREVGRTGEFVDPAGDADTIMTELILADMGTLEKQLPKLEKEAK-RDKELMP Coriobacteriumglomer RYFKDDDVVHVDGR----VDPAADAETIMCELILADIGTLERQLPKLEKESR-RDREVAA Bacillussubtilis RAFSDDNITHVSGK----VDPIDDIETINLELILADMETVEKRITRVSKLAKQKDKDAVF Clostridium RLIGMGAVLSRTGAGIGTRGPGEKKLEIDKRHIRERIYDLNKELAKIKKNRQVQREKRSK Conexibacter RAHHDDNVIHPEGR----VDPLSDIDTIETELIFADLEQAERRHARVVRAARGGDKVAIA Rubrobacter RCFEDENVAHVHGG----IDPVGDAETVNAELLLADLATVERRLEQASRAAKSGDPKRRA Acidimicrobium RAFHDDRVPGD-------DDPIANLDALELELTLADLASVESALERRRKVAR-SDPSARP Gordonibacterpamelae KVAAAQKVFEGLNEGHR---------TRTLDLSDDERAALHDLHLLTMKPMLYIANVDED Eggerthella KVEVAKKVLAGLDEGHR---------ARTLGLDEDEQAAIYDLHLLTMKPMLYIANVDED Cryptobacterium RVEVAQKVMAGLNEGHR---------ARTLGLSHEETAAIYDLHLLTMKPMLYIANVDED SlackiaexiguaATCC700 KLEVAKKVAEGLNEGHR---------ARTLGLSVEEAAAVKDLCLLTMKPMMYIANVDEG Slackiahel KLETAKKVYAGLEEEHR---------ALTLGLSDDEKAAIKDLCLLTMKPMLYIANVDED Olsenellasp.oraltaxo NLAIAKRLQDWLNEGKR---------AAELDMSDDERAAAHDLFLLTMKPMLYVANVDED Olsenellauli RLQIARRLQSWLDEGNR---------AATLDMTDDERAAAHDLFLLTMKPILYVANCDED AtopobiumvaginaeDSM1 RLDIAVRLQAWLNDGKR---------AADMDMTLDEKADAHDLFLLTMKPLLYVANCDED AtopobiumrimaeATCC49 RLSVAKRLQEWLNEGKR---------AADMDMTDDERAAAHDLFLLTMKPILYVANVDED Atopobiumparvulum KLNVAKRLQEWLNEGNR---------AADMEMTDEERLAARDLFLLTMKPILYVANVDED Collinsellaintestina KLAVAKRLVEWLNEGNR---------AITMEMTDEERAAARGLFLLTMKPMLYVANVDED Collinsellastercoris KLEIAKRLVAWLNEGNR---------AITLEMTDEERAAAKGLFLLTMKPMLYVANVDED CollinsellatanakaeiY KLEIAKRLVDWLNEGNR---------AAALGMTDEERAAAKGLFLLTMKPMLYVANVDED Collinsellaaerofacie KFEVAKRLLAWLNEGKR---------AASMEMTDEERAAAKGLFLLTMKPILYVANVDED Coriobacteriumglomer RLDLAKRLIAWLDEGHR---------AAEMQMTDEERERARELFLLTMKPMLYVANIDED Bacillussubtilis EFEILSKLKEAFESEKP---------ARSVEFTEEQQKLVKQLHLLTSKPILYVANVSED Clostridium DNVPKISLVGYTNAGKSTLRNKLCEIASPKDVVDKETVFEADMLFATLDVTTRALVLPDN Conexibacter EEAWLRELVAALQAGRP---------ARTVEPPADAPNAIRELQPLTAKPVLFVANVDEG Rubrobacter EAAALERLRDHLAKGGQ---------ARTFPEPGEVSEVLRTL--LTAKPTLYVANVDEG Acidimicrobium EVAALEVVQAVLADGIP---------LYRATLETDVLELVRSAFLITTKPAIAVINADEG Gordonibacterpamelae QLDAD--------------LPEIDGCAPVPISAKVEADLAELDPAEAREYLEAMGLEESG Eggerthella QLDAD--------------LPEIDGCRPVPISAKVEADLAELEPAEAKEYLEAMGLEESG Cryptobacterium ALSAD--------------LPEIDGTVPVPISAQVEADLADLEPEEAQEYLAELGLSESG SlackiaexiguaATCC700 SVNVE--------------LPEIDGQRPVPISAKIEADLSELDAEDAQMFMEELGIEEGG Slackiahel KMDME--------------LPEIDGQIPVPISAKIEADLSELEPDEAAMFMEELGIKESG Olsenellasp.oraltaxo KVSEV--------------PAPINGQTPIPICAEVEAELAELDAEEAAEYLESLGLEHSG Olsenellauli QLGD---------------RPQIDGTPAIPICAKVEAELSELEPEEASEYLESLGLERSG AtopobiumvaginaeDSM1 QLQD---------------KPVIHGMPALPICAQVEAELSELDDAEAAEYLQSLGLQHSG AtopobiumrimaeATCC49 ALSKP--------------APLVGGVEAIPVCAEVEAELAELDPKEAAEYLESLGLERSG Atopobiumparvulum MLTEP--------------APVINGVQSIPVCAGVEAELADLDPEEAAEYLESLGLEHSG Collinsellaintestina MLTED--------------LDAIEGVKPIPICAKVEAELSELDPEEAAEFLADLGLERSG Collinsellastercoris MLAED--------------LDPIDGVKPIPICAKVEAELSELDPEEAQMFLADLGLERSG CollinsellatanakaeiY ELTSE--------------FEPIDGVTPIPICAKVEAELSELEPEEAAEYLADLGLEHSG Collinsellaaerofacie MLNED--------------LAPIDGVKPLPICAKIEAELSELDPEDAADYLESLGLEQPG Coriobacteriumglomer MLDEK--------------PAPIAGAVPIPVCAKVEAELSELEPDEAREYLDELGLERSG Bacillussubtilis EVADPSGNE---NVAKIREYAAGENAEVIVVCAKIESEIAELEGEEKQMFLEELGIQESG Clostridium RLVTLTDTVGFIRKLPHDLVEAFKSTLEEVVNSDLLLHVVDSSSKDAYKQIEAVNFVLEE Conexibacter ----TDEVP-----AAIAEHAATHAAKAVAISSRIEAELSELDDEEAAVMREELGIAESG Rubrobacter SLAAGNAYS-----AAVEELAGREGAGAVRLCARLAAELAELPAEEAREYLGVLGVEESG Acidimicrobium AEADVAVED---TVR----GRLGDHATVVTAPLALEAELARLEPAERAEMMEALAIGSSA Gordonibacterpamelae LARLVREAYKLLG---------LQSYFTSGE-TETRAWTVPIGAKAPQAAGVIHTDFERG Eggerthella LARLVREAYKLLG---------LQSYFTTGE-QETRAWTIPVGAKAPQAAGVIHTDFERG Cryptobacterium LARLVREAYKLLG---------LQSYFTSGE-TETRAWTIPVGAKAPEAAGVIHSDFERG SlackiaexiguaATCC700 LARLIRAAYGLLG---------LQSYFTSGE-TETRAWTIPVGAKAPQAAGVIHSDFERG Slackiahel LSRLIHEAYRLLG---------LQSYFTSGP-DETRAWTIPVGAKAPQAASVIHTDFERG Olsenellasp.oraltaxo LETLAQAAYHLLG---------LQSYFTAGE-KEVKAWTVHIGAKAPEAAGVIHSDFERG Olsenellauli LETLAQAAYRLLG---------LQSFFTAGP-KEVRAWTVRIGAKAPEAAGVIHSDFERG AtopobiumvaginaeDSM1 LEILAQAAYRLLG---------LQSFFTAGP-KEVRAWTVKIGAKAPQAAGVIHSDFERG AtopobiumrimaeATCC49 LATLAQAAYKLLG---------LQSYFTAGE-MEVKAWTVRIGAKAPEAAGVIHSDFERG Atopobiumparvulum LETLAQAAYKLLG---------LQSYFTAGP-MEVRAWTVRIGAKAPEAAGVIHSDFERG Collinsellaintestina LEVLAQAAYKLLG---------LQSFFTAGE-VEVRAWTVRQGATAPQAAGVIHTDFERG Collinsellastercoris LEVLAQAAYKLLG---------LQSFFTAGE-MEVRAWTVRQGATAPQAAGVIHTDFERG CollinsellatanakaeiY LETLAQAAYRLLG---------LQSFFTAGE-MEVKAWTVRQGATAPQAAGVIHTDFERG Collinsellaaerofacie LDVLAQAAYKLLG---------LQSFFTAGE-MEVKAWTVRRGATAPQAAGVIHTDFERG Coriobacteriumglomer LEALAQAAYALLG---------LQSFFTAGE-MEVRAWTVHRGATAPQAAGVIHSDFERG Bacillussubtilis LDQLIKASYSLLG---------LATYFTAGE-QEVRAWTFKKGMKAPECAGIIHSDFERG Clostridium LESINKPMILLLNKIDKADKEQLEGLKEKFNNLKVLEISAKDNLNLDTLLNDICTALPNP Conexibacter LQRIVRGAFDLLN---------LNAFFTVGSGVRAQSWHLRRGLTAWHAAGQIHSDIQRG Rubrobacter FEEFVRAAYRLLG---------LITFFTFNE-RECRAWTVREGATAREAAGRIHTDMERG Acidimicrobium LERIARAAFETLE---------RWTFFTSGD-KDTHAWTFRRGSNAQTCAGIIHSDLARG Gordonibacterpamelae FIKAETAA-FEDYVGLGGEK--GCRDAGKLRQEGKEYVVQDGDVMHFKFNV--- Eggerthella FIKAETAS-YEDYVGLGGEK--GCRDAGKLRQEGKEYVVQDGDVMHFKFNV--- Cryptobacterium FIKAETAS-FADYSELGGEA--GCRAAGKLRQEGKDYVVQDGDVMHFKFNV--- SlackiaexiguaATCC700 FIKAETAS-FDDYVSLGGEA--GCRAAGRLRQEGKDYVVQDGDVMHFKFNV--- Slackiahel FIKAETAS-YEDYVRLGGEK--GCRDAGRLRQEGKEYVVQDGDIMHFKFNV--- Olsenellasp.oraltaxo FIKAKTIS-YEDYVELGGEA--GAREAGRLRMEGKDYVVQDGDVMEFMFNV--- Olsenellauli FIKAETIG-YDDYVSLGGEQ--GAKEAGRLRMEGKDYVVQDGDVMVFRFNV--- AtopobiumvaginaeDSM1 FIKAETIS-FDDYIELGGES--GARDAGKLRMEGKDYVVQDGDVMVFRFNV--- AtopobiumrimaeATCC49 FIKAEVAS-YTDYVELGGEA--GCKAAGKLRMEGKEYVVQDGDVMHFRFNV--- Atopobiumparvulum FIKAEVAS-YNDYVELGGEA--GCKAAGKLRIEGKDYVVEDGDVMHFRFNV--- Collinsellaintestina FIKAETIA-FEDYVALGGEK--GAKEAGRLRMEGKDYIMQDGDVVHFRFNV--- Collinsellastercoris FIKAETIA-FDDYIELGGEQ--GAKAAGRLRMEGKDYVMHDGDVVHFRFNV--- CollinsellatanakaeiY FIKAETIA-FDDYVELGGEQ--GAKAAGRLRMEGKDYVMHDGDVVHFRFNV--- Collinsellaaerofacie FIKAEVIG-YDDYIELGGEQ--GAKAAGKLRIEGKEYVMADGDVVHFRFNV--- Coriobacteriumglomer FIKAEVIA-YSDYIEYGGEQ--GARSVGRLRMEGKEYVMADGDVVHFRFNV--- Bacillussubtilis FIRAETVA-YEDLLAGGGMA--GAKEAGKVRLEGKEYVVQDGDVIHFRFNV--- Clostridium LKKVEFLIPYSDSASVAMLHRNGKVLEEEYKDNGTRIIAMVDDKIYNKCEKYVI Conexibacter FVRAEVIG-WRELIDAGGYN--AARERGTLRVEGRDYVMQDGDVITVKFTP--- Rubrobacter FVAAEVGR-WEDIVAAGSWA--RAREEAKVRREGRDYVMRDGDVLLVRFNA--- Acidimicrobium FIRAEVAS-WRDVVEAGSWT--RAKAQNKVRLEGRDYLVADGDVLEIRFNV---