Methods



Next: Acknowledgment Up: Discussion Previous: Future directions
Methods
Two programs have been created to achieve the desired color
annotation. The first one is called bp-annotate. This
program, annotates the PostScript structure plots that are created
by the mfold package, or the PostScript
secondary structure plots produced by the XRNA [Weiser & Noller, 1995]
program. Either the bases are colored or the bases are replaced by
colored dots. Both linear and double logarithmic scaling can be
used. Two input files are needed; the unannotated PostScript file
and an annotation file that we call the ann file because it
ends with the suffix '.ann'. This annotation file can contain either
P-num, H-num or probability information. A number of small auxiliary
programs were created to generate this from different types of input
information. The one most worth mentioning is called the
boxct2ann
program. It combines information from a
probability boxplot produced, for example, by the Vienna package,
and the ct file for the secondary structure to be
annotated. The boxplot contains numbers, pij,
for every possible base pair. These are the probabilities of
designated base pairs. The ct file, or connection
table file, is a common format for describing the bases and
base pairs in an RNA secondary structure. The resulting ann
file is a vector of probabilities, pi, where
i ranges over all bases of the sequence. For base pairs
i.j, pi = pj = pij. If
i is single stranded, then pi is the
probability that the base is single-stranded. In this case,
pi = 1 -
Σj<i
pji -
Σj>i
pij.
he second program, ss-annotate annotates the ss files that are used as input to the XRNA program. The figures can then be manipulated interactively by running the XRNA program. This is useful both in creating high quality plots, and for extracting detailed information about individual structural features from the annotated plots. The latter feature is particularly helpful in the analysis of predicted structure of very large sequences. The same color schemes and linear versus double logarithmic scaling are used in both annotation programs.
re comparison annotation is achieved through a program called ss-compare. It requires the ss files that are used as input to the program XRNA. In addition to an ss file, the program also requires the ct file for another reference structure on the same RNA. The base pairs in the ss file that are conserved in the reference structure are represented by thick lines that stand out from the non-conserved base pairs.
The programs bp-annotate
and boxct2ann
were written in Fortran. The programs ss-compare
and
ss-annotate
were written in the C programming
language. All of the software can be obtained from the first author
and is available at
http://mfold.rna.albany.edu/?q=mfold/download-mfold. Energy dot
plots and annotated structure plots are also available for all
structure predictions that are created with the mfold web server at
http://mfold.rna.albany.edu/?q=mfold.
Next: Acknowledgment Up: Discussion Previous: Future
directions
![]() | Michael Zuker Institute for Biomedical Computing Washington University in St. Louis August 21 1998. |