NAME
miranda - Finds potential target sites for miRNAs in genomic sequence
SYNOPSIS
Basic Usage:
miranda file1 file2 [ options ... ]
Advanced:
miranda file1 file2 [-sc score] [-en energy] [-scale scale] [-strict] [-go X] [-ge Y] [-out fileout] [-quiet] [-trim T] [-noenergy] [-restrict file]
DESCRIPTION
miRanda
is an algorithm for the detection of potential microRNA target
sites in genomic sequences. miRanda
reads RNA sequences (such as microRNAs) from file1 and genomic
DNA/RNA sequences from file2. Both of these files should be in
FASTA format. This is an example of a FASTA formatted sequence:
>gi|29565487|emb|AJ550546.1| Drosophila melanogaster microRNA miR-bantam
GTGAGATCATTTTGAAAGCTG
One or more miRNA sequences from file1 are scanned
against all sequences in file2 and potential target sites are
reported. Potential target sites are identified using a two-step strategy.
First a dynamic programming local alignment is carried out between the
query miRNA sequence and the reference sequence. This alignment procedure
scores based on sequence complementarity and not on sequence identity.
In other words we look for A:U and G:C matches instead of A:A, G:G, etc.
The G:U wobble bair is also permitted, but generally scores less than
the more optimal matches. Here is an example alignment:
Query: 3' gtCGAAAGTTTTACTAGAGTg 5' (eg. miRNA)
|:||||| |||||||||:
Ref: 5' taGTTTTCACAATGATCTCGg 3' (eg. 3'UTR)
The second phase of the algorithm takes high-scoring alignments
(Those above a score threshold, defined by -sc)
detected from phase 1 and estimates the thermodynamic stability
of RNA duplexes based on these alignments.
This second phase of the method utilizes folding routines from
the RNAlib library, which is part of the ViennaRNA package written
by Ivo Hofacker. At this stage we generate a constrained fictional
single-stranded RNA composed of the query sequence, a linker
and the reference sequence (reversed). This structure then folded
using RNAlib and the minimum free energy (DG kcal/mol) is
calculated for that structure.
Finally, detected targets with energies less than an energy threshold
(defined by -en) are selected as potential target sites. Target
site alignments passing both thresholds and other information is produced
as output.
OPTIONS
- --help -h
-
Displays help, usage information and command-line options.
- --version -v --license
-
Display version and license information.
- -sc score
-
Set the alignment score threshold to score. Only alignments with
scores >= score will be used for further analysis.
- -en energy
-
Set the energy threshold to energy. Only alignments with
energies <= energy will be used for further analysis. A negative
value is required for filtering to occur.
- -scale scale
-
Set the scaling parameter to scale. This scaling is applied
to match / mismatch scores in the critical 7bp region near the 5' end
of the microRNA. Many known examples of miRNA:Target duplexes
are highly complementary in this region. This parameter can be thought
of as a contrast function to more effectively detect alignments of this
type.
- -strict
-
Require strict alignment in the seed region (offset positions 2-8).
This option prevents the detection of target sites which contain gaps
or non-cannonical base pairing in this region.
- -go X
-
Set the gap-opening penalty to X for alignments. This
value must be negative.
- -ge Y
-
Set the gap-extend penalty to Y for alignments. This
value must be negative.
- -out fileout
-
Print results to an output file called fileout.
- -quiet
-
Quiet mode, omit notices of when scans are starting and when
sequences have been loaded from input files.
- -trim T
-
Trim reference sequences to T nucleotides. Useful when
using noisy predicted 3'UTRs as reference sequences.
- -noenergy
-
Turn off thermodynamic calculations from RNAlib. If this is
used, only the alignment score threshold will be used. the -en setting
will be ignored.
- -restrict file
-
Restrict scans to those between specific miRNAs and UTRs.
file should contain lines of tab separated pairs
of sequence identifiers: miRNA_id <tab> target_id.
BUGS
The option -strict prevents the output of target predictions for
sites which lack perfect complementarity in the seed region. It does not
yet insure that all possible sites with perfect complementarity in the
seed region are found. In unusual circumstances, a perfect seed site may
overlap with an imperfect seed site in such a way that the imperfect seed
site has superior alignment score to the perfect seed site. If that happens,
the optimal local alignment detected will be the imperfect seed site. The
impact of this bug has been mitigated in this release, but artificial cases
can still be constructed in which perfect seed sites are not reported.
REFERENCES
If you use this program for your research then please include the following
citation:
A.J. Enright, B. John, U. Gaul, T. Tuschl, C. Sander, D.S. Marks; (2003)
MicroRNA targets in Drosophila; Genome Biology 5(1):R1.
RNAlib Citations:
I.L. Hofacker, W. Fontana, P.F. Stadler, S. Bonhoeffer, M. Tacker, P. Schuster
(1994)
Fast Folding and Comparison of RNA Secondary Structures.
Monatshefte f. Chemie 125: 167-188
M. Zuker, P. Stiegler (1981) Optimal computer folding of large RNA
sequences using thermodynamic and auxiliary information, Nucl Acid Res
9: 133-148
J.S. McCaskill (1990) The equilibrium partition function and base pair
binding probabilities for RNA secondary structures, Biopolymers 29: 1105-1119
VERSION
This man page documents version 3.3a of the miRanda package.
AUTHORS
Anton Enright
Comments and bug-reports should be sent to
.