ALIGN_DUALSTATE

This program takes an aligned CLUSTAL sequence file and finds the columns that contain exactly two distinct characters. It then generates three output files based on the results, described below.
Enter inputs and click "Run Program" to get started.
The program first finds all columns that contain exactly two distinct characters, where each of those two characters appears at least twice. These columns are saved as the first output file, with the total number of character columns listed at the top.
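The column filter described above can be sketched in Python. This is only an illustration, not the program's actual code: the function name dual_state_columns is hypothetical, the input is assumed to be a list of equal-length aligned strings already extracted from the CLUSTAL file, and gap symbols such as "-" are assumed to count as ordinary characters.

```python
from collections import Counter


def dual_state_columns(rows):
    """Return indices of columns with exactly two distinct characters,
    each of which appears at least twice (hypothetical helper)."""
    if not rows:
        return []
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")

    selected = []
    for col in range(width):
        counts = Counter(row[col] for row in rows)
        # Assumption: gaps ("-") are treated like any other character.
        if len(counts) == 2 and min(counts.values()) >= 2:
            selected.append(col)
    return selected


rows = ["ACGT-A", "ACGTTA", "ATGA-C", "ATCA-C"]
cols = dual_state_columns(rows)
print(cols)  # [1, 3, 5]
print(["".join(row[c] for c in cols) for row in rows])
# ['CTA', 'CTA', 'TAC', 'TAC']
```

In this toy alignment, column 2 (G, G, G, C) and column 4 (-, T, -, -) are rejected because one of their two characters appears only once.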

The second output file converts the first output into a binary format: in each column, the majority character is converted to a 0 and the minority character is converted to a 1. The number of rows, followed by the number of character columns, is listed at the top of the file.
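A minimal Python sketch of this majority/minority encoding follows. The function name to_binary is hypothetical, and the input is assumed to be rows in which every column already holds exactly two distinct characters. The description does not say how ties are handled, so this sketch assumes that when both characters are equally common, the one seen first (topmost row) is treated as the majority.

```python
from collections import Counter


def to_binary(rows):
    """Encode each column as 0 (majority character) or 1 (minority character).
    Hypothetical helper; tie handling is an assumption, see the note above."""
    width = len(rows[0]) if rows else 0
    encoded = [[] for _ in rows]
    for col in range(width):
        column = [row[col] for row in rows]
        counts = Counter(column)
        # max() returns the first character with the highest count, so a tie
        # resolves to the character that appears first in the column.
        majority = max(counts, key=counts.get)
        for i, ch in enumerate(column):
            encoded[i].append("0" if ch == majority else "1")
    return ["".join(bits) for bits in encoded]


filtered = ["CTA", "CTA", "CAA", "TAC", "TAC"]
print(len(filtered), len(filtered[0]))  # 5 3  (rows, then character columns)
print(to_binary(filtered))
# ['010', '010', '000', '101', '101']
```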

The third output file is slightly different. For each row, it lists the number of columns in which that row's character is the only one of its kind, that is, the number of columns where the row's character occurs exactly once.
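The per-row count can be illustrated with a short Python sketch. The function name singleton_counts is hypothetical, and the count is assumed to run over every column of the original alignment (the filtered columns of the first output cannot contain a character that occurs only once). Gap symbols are again assumed to count as ordinary characters.

```python
from collections import Counter


def singleton_counts(rows):
    """For each row, count the columns where that row's character occurs
    exactly once in the column (hypothetical helper)."""
    totals = [0] * len(rows)
    width = len(rows[0]) if rows else 0
    for col in range(width):
        column = [row[col] for row in rows]
        counts = Counter(column)
        for i, ch in enumerate(column):
            if counts[ch] == 1:
                totals[i] += 1
    return totals


rows = ["ACGT-A", "ACGTTA", "ATGA-C", "ATCA-C"]
print(singleton_counts(rows))  # [0, 1, 0, 1]
```

Here the second row is credited for the lone T in column 4 and the fourth row for the lone C in column 2.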

View sample input/output files. All files should be in a plain-text format (.txt, .csv, .xml, etc.).