A novel method of multiple sequence alignment of biopolymers (program H-Align of the GenBee package)
DOI:
https://doi.org/10.7124/bc.000049
Abstract
Summary. Constructing multiple alignments is central to the entire field of biological sequence analysis. The alignment algorithm of the H-Align program, incorporated in the GenBee package, develops the local similarity search principle. It has two stages: 1) detection of all conserved regions (these need not be present in all of the sequences being aligned); 2) optimal arrangement of these regions under two criteria: maximization of the total power of the conserved regions and minimization of the total number of gaps. The algorithm has at least two advantages over traditional algorithms such as that of Needleman and Wunsch: it uses no penalty for insertions or deletions, and it requires no successive pairwise alignment procedure. The efficiency of the algorithm is demonstrated on a model example.
References
Needleman SB, Wunsch CD. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970;48(3):443-53.
Gotoh O. Alignment of three biological sequences with an efficient traceback procedure. J Theor Biol. 1986;121(3):327-37.
Sobel E, Martinez HM. A multiple sequence alignment program. Nucleic Acids Res. 1986;14(1):363-74.
Bacon DJ, Anderson WF. Multiple sequence alignment. J Mol Biol. 1986;191(2):153-61.
Leontovich AM, Brodsky LI, Gorbalenya AE. Compile of a complete map of local similarity for two biopolymers (DotHelix PROGRAM of the GenBee package). Biopolym Cell. 1990;6(6):14-21.
Dayhoff MO, Barker WC, Hunt LT. Establishing homologies in protein sequences. Methods Enzymol. 1983;91:524-45.