NAME

hybrid-select.pl — fold or hybridize RNA or DNA sequences with selected base pairs

SYNOPSIS

hybrid-select.pl [OPTION]... FILE1 [FILE2]

DESCRIPTION

hybrid-select.pl performs two folding or hybridization calculations in sequence, with the output of the first calculation used to create a list of valid abse pairs for the second calculation.

Both the initial and final calculations may be either energy minimization or partition function; the default is an initial partition function calculation followed by an energy minimization.

When the initial calculation is a partition function, only base pairs having a specified probability of forming are retained for the second run. If the filtering stage is an energy minimization, only base pairs appearing in structures with energies within a specified percentage of the minimum energy are retained.

If the final calculation is an energy minimization, the --mfold option may be used; likewise, --tracebacks is acceptable if the final calculation is a partition function.

OPTIONS

Most of the options below are passed to the subprograms to which they apply, but --cutoff, --initial and --final affect the behavior of hybrid-select.pl directly.

-n, --NA=RNA2|RNA3|RNA|DNA
set nucleic acid type to RNA or DNA. Default is RNA.
-t, --t=temp
set temperature to temp °C. Default is 37.
-N, --sodium=REAL
set Sodium ion concentration to REAL molar. Default is 1.
-M, --magnesium=REAL
set Magnesium ion concentration to REAL molar. Default is 0.
-p, --polymer
use salt corrections for polymers instead of oligomers (the default).
-s, --suffix=STRING
use energy rules with the suffix STRING. Overrides --t, --sodium, --magnesium and --polymer.
-f, --force=i,j[,k]
force all basepairs in the helix from i,j to i+k-1,j-k+1. If j is 0, forces bases i to i+k-1 to be double-stranded; if i is 0, forces bases j to j-k+1 to be double-stranded. k defaults to 1 if not specified.
-r, --prohibit=i,j[,k]
prohibit all basepairs in the helix from i,j to i+k-1,j-k+1. If j is 0, prohibit bases i to i+k-1 from pairing at all; if i is 0, prohibit bases j to j-k+1 from pairing at all. k defaults to 1 if not specified.
-R, --prohibit-range=i,j,k,l
prohibit any base from i to j from pairing with any base from k to l.
-I, --noisolate
prohibit all isolated basepairs. Isolated basepairs are helices of length 1; that is, they do not stack on another basepair on either side. (See also the --prefilter and --nopostfilter options below.)
-k, --tracebacks=NUMBER
compute NUMBER stochastic tracebacks and write them to prefix.temperature.ct. Stochastic tracebacks are computed according to the Boltzmann probability distribution so that the probability of a structure is its Boltzmann factor divided by the partition function.
-m, --maxbp=NUMBER
bases farther apart than NUMBER cannot form. Default is no limit.
-c, --constraints=FILE
read a list of constraints from FILE. Constraints must be in the form "F i j k", "P i j k" or "R i-j k-l". These are equivalent to specifying "--force=i,j,k", "--prohibit=i,j,k" or "--prohibit-range=i,j,k,l", respectively. If FILE is not specified, it defaults to prefix.aux.
-C, --cutoff=REAL
set the cutoff for basepairs to be included in the second run, as a probability or a percent of the minimum energy. Default is 0.001 (partition function) or 5% (energy minimization).
--initial=STRING
specify the program to run in the first phase. May be the name of a program like hybrid-ss or one of the codes "PF" and "EM". PF chooses hybrid-ss or hybrid, while EM chooses hybrid-ss-min or hybrid-min. Default is PF.
--final=STRING
Specifies the program to run in the second phase, using the same choices as --initial. Default is EM.
--circular
treat sequence as circular rather than linear.
--intra
hybridize sequences allowing intramolecular basepairs.

OBSCURE OPTIONS

--allpairs
allow basepairs to form between any two nucleotides. When --allpairs is not specified, only Watson-Crick and wobble basepairs are allowed.
--maxloop=NUMBER
set the maximum size of bulge/interior loops to NUMBER. Default is 30.
--maxas=NUMBER
set the maximum asymmetry of bulge/interior loops to NUMBER. Default is 30.
--nodangle
remove single-base stacking from consideration.
--simple
make the penalty for multibranch loops constant rather than affine.
--prefilter=value1[,value2]
set the prefilter to filter out all basepairs except those in groups of value2 adjacent basepairs of which value1 can form. value2 is the same as value1 if unspecified. Default is 2 of 2. (See also the --noisolate option above.)
--nopostfilter
disable the postfilter. The postfilter, which is enabled by default, removes from consideration all structures that consist of only one basepair.

ENVIRONMENT

UNAFOLDDAT
an alternate location from which to read the energy rules. The default energy rules can be overridden with files in the current directory or in the directory pointed to by UNAFOLDDAT. hybrid-select.pl looks for each file first in the current directory, then in the directory specified by UNAFOLDDAT and last in /usr/local/share/unafold (or wherever the energy rules were installed).

REFERENCES

Markham, N. R. and Zuker, M. (2008) UNAFold: software for nucleic acid folding and hybridization. In Keith, J. M., editor, Bioinformatics, Volume II. Structure, Functions and Applications, number 453 in Methods in Molecular Biology, chapter 1, pages 3-31. Humana Press, Totowa, NJ. ISBN 978-1-60327-428-9.

Other references which may be useful may be found at http://www.unafold.org/Dinamelt/dinamelt-references.php

SEE ALSO

hybrid-ss(1), hybrid-ss-min(1), hybrid(1), hybrid-min(1)

AUTHORS

Nick Markham <markham@alum.rpi.edu> and Michael Zuker <zukerm@alum.mit.edu>

COPYRIGHT

Copyright (c) 2006-2021, Rensselaer Polytechnic Institute.

AVAILABILITY

Both commercial and non-commercial use of UNAFold require a license from RPI; see https://ipo.rpi.edu/invention/unafold-version-40.