Biopolym. Cell. 1990; 6(6):42-48.
A method for the search of structural motifs in amino acid sequences: the program SITE of the GenBee package
Koonin E. V. (1, 2), Chumakov K. M. (2), Gorbalenya A. E. (2)
  1. Institute of Microbiology, Academy of Sciences of the USSR
    Moscow, USSR
  2. Institute of Poliomyelitis and Viral Encephalitides, Academy of Medical Sciences of the USSR
    Moscow Region, USSR

Abstract

A method is proposed for finding structural motifs in amino acid sequences by scanning them with frequency profiles generated from aligned sequence segments. The program «SITE», which implements the proposed algorithm, is discussed and illustrated by a search of an amino acid sequence database for the motif typical of a vast class of purine NTP-binding proteins. The advantages of the proposed approach over standard pattern-searching routines are demonstrated with respect to selectivity and completeness of retrieval of relevant sequences.
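
The general idea of scanning a sequence with a frequency profile can be sketched as follows. This is a minimal illustration only, not the SITE program itself: the abstract does not specify the scoring scheme, so the log-odds score against a uniform background, the pseudocount, the function names (`build_profile`, `scan`, `best_hit`) and the toy aligned segments are all assumptions made for the example.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(segments, pseudocount=1.0):
    """Return per-column residue frequencies for equal-length aligned segments.

    The pseudocount (an assumption of this sketch) keeps unseen residues
    from getting a zero frequency.
    """
    length = len(segments[0])
    if any(len(seg) != length for seg in segments):
        raise ValueError("aligned segments must have equal length")
    profile = []
    for col in range(length):
        counts = {aa: pseudocount for aa in AMINO_ACIDS}
        for seg in segments:
            if seg[col] in counts:
                counts[seg[col]] += 1.0
        total = sum(counts.values())
        profile.append({aa: c / total for aa, c in counts.items()})
    return profile


def scan(sequence, profile, background=1.0 / len(AMINO_ACIDS)):
    """Score every window of the sequence; return a list of (start, score).

    The score is a sum of log-odds against a uniform background
    (an assumed scheme, chosen for simplicity).
    """
    width = len(profile)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = 0.0
        for column, aa in zip(profile, window):
            # Residues outside the 20-letter alphabet contribute zero.
            freq = column.get(aa, background)
            score += math.log(freq / background)
        hits.append((start, score))
    return hits


def best_hit(sequence, profile):
    """Return the (start, score) pair of the highest-scoring window, or None."""
    hits = scan(sequence, profile)
    return max(hits, key=lambda hit: hit[1]) if hits else None


if __name__ == "__main__":
    # Toy aligned segments, hypothetical input chosen for illustration.
    segments = ["GPPGSGKS", "GESGSGKS", "GHVDHGKT", "GAGGVGKS"]
    profile = build_profile(segments)
    print(best_hit("MTEYKLVVVGAGGVGKSALTIQ", profile))
```

Unlike a fixed pattern, which either matches or does not, a profile assigns every window a graded score, so candidate sites can be ranked and a threshold chosen to trade selectivity against completeness.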
