man/msaClustalOmega.Rd
dafeef0b
 \name{msaClustalOmega}
 \alias{msaClustalOmega}
 \title{Multiple Sequence Alignment with ClustalOmega}
 \description{
   This function calls the multiple sequence alignment
   algorithm ClustalOmega.
 }
 \usage{
     msaClustalOmega(inputSeqs, cluster="default",
                     gapOpening="default", gapExtension="default",
                     maxiters="default",  substitutionMatrix="default",
b5f31f05
                     type="default", order=c("aligned", "input"),
                     verbose=FALSE, help=FALSE, ...) 
dafeef0b
 }
 \arguments{
   \item{inputSeqs}{input sequences; see \code{\link{msa}}.
     In the original ClustalOmega implementation, this
b5f31f05
     parameter is called \code{infile}.}
dafeef0b
   \item{cluster}{The cluster size which should be used. The default is 100.  
     In the original ClustalOmega implementation, this parameter is called 
b5f31f05
     \code{cluster-size}.}
   \item{gapOpening,gapExtension}{ClustalOmega currently does
     not allow to adjust gap penalties; these arguments are only for
     future extensions and consistency with the other algorithms
     and \code{\link{msa}}. However, setting these parameters to values
     other than \code{"default"} will result in a warning.}
dafeef0b
   \item{maxiters}{maximum number of iterations; the default value is 0
     (no limitation). In the original ClustalOmega implementation, this
b5f31f05
     parameter is called \code{iterations}.}
12f5100c
   \item{substitutionMatrix}{name of substitution matrix for scoring matches and
     mismatches; can be one of the choices
b5f31f05
     \code{"BLOSUM30"}, \code{"BLOSUM40"}, \code{"BLOSUM50"},
12f5100c
     \code{"BLOSUM65"}, \code{"BLOSUM80"}, and \code{"Gonnet"}.
     This parameter is a new feature - the original ClustalOmega
dafeef0b
     implementation does not allow for using a custom substitution matrix.}
   \item{type}{type of the input sequences \code{inputSeqs};
     see \code{\link{msa}}.}
b5f31f05
   \item{order}{how the sequences should be ordered in the output object
     (see \code{\link{msa}}); in the original ClustalW implementation, this
     parameter is called \code{output-order}.}
dafeef0b
   \item{verbose}{if \code{TRUE}, the algorithm displays detailed
     information and progress messages.}
   \item{help}{if \code{TRUE}, information about algorithm-specific
     parameters is displayed. In this case, no multiple sequence
     alignment is performed and the function quits after displaying
     the additional help information.}
   \item{...}{further parameters specific to ClustalOmega;
     An overview of parameters that are available in this interface
     is shown when calling \code{msaClustalOmega} with \code{help=TRUE}.
     For more details, see also the documentation of ClustalOmega.}
 }
 \details{This is a function providing the ClustalOmega multiple alignment
   algorithm as an R function. It can be used for various types of
   sequence data (see \code{inputSeqs} argument above). Parameters that
   are common to all multiple sequences alignments provided by the
   \pkg{msa} package are explicitly provided by the function and named
   in the same for all algorithms. Most other parameters that are
   specific to ClustalOmega can be passed to ClustalOmega via additional
   arguments (see argument \code{help} above).
 
12f5100c
   Since ClustalOmega only allows for using built-in amino acid
   substitution matrices, it is hardly useful for multiple alignments
   of nucleotide sequences.
 
dafeef0b
   For a note on the order of output sequences and direct reading from
   FASTA files, see \code{\link{msa}}.
 }
 \value{
    Depending on the type of sequences for which it was called,
    \code{msaClustalOmega} returns a
    \code{\linkS4class{MsaAAMultipleAlignment}}, 
    \code{\linkS4class{MsaDNAMultipleAlignment}}, or
    \code{\linkS4class{MsaRNAMultipleAlignment}} object.
    If called with \code{help=TRUE}, \code{msaClustalOmega} returns
    an invisible \code{NULL}.
 }
 \author{Enrico Bonatesta and Christoph Horejs-Kainrath
   <msa@bioinf.jku.at>
 }
 \references{
   \url{http://www.bioinf.jku.at/software/msa}
12f5100c
   
   U. Bodenhofer, E. Bonatesta, C. Horejs-Kainrath, and S. Hochreiter
   (2015). msa: an R package for multiple sequence alignment. 
fd1f660c
   \emph{Bioinformatics} \bold{31}(24):3997-3999. DOI:
12f5100c
   \href{http://dx.doi.org/10.1093/bioinformatics/btv494}{10.1093/bioinformatics/btv494}.
dafeef0b
 
   \url{http://www.clustal.org/omega/README}
   
   Sievers, F., Wilm, A., Dineen, D., Gibson, T. J., Karplus, K., Li, W.,
   Lopez, R., McWilliam, H., Remmert, M., Soeding, J., Thompson, J. D.,
   and Higgins, D. G. (2011) Fast, scalable generation of high-quality
   protein multiple sequence alignments using Clustal Omega.
   \emph{Mol. Syst. Biol.} \bold{7}:539. DOI:
   \href{http://dx.doi.org/10.1038/msb.2011.75}{10.1038/msb.2011.75}.
 }
 \seealso{\code{\link{msa}}, \code{\linkS4class{MsaAAMultipleAlignment}},
   \code{\linkS4class{MsaDNAMultipleAlignment}},
   \code{\linkS4class{MsaRNAMultipleAlignment}},
   \code{\linkS4class{MsaMetaData}}
 }
 \examples{
 ## read sequences
 filepath <- system.file("examples", "exampleAA.fasta", package="msa")
 mySeqs <- readAAStringSet(filepath)
 
 ## call msaClustalOmega with default values
 msaClustalOmega(mySeqs)
 
 ## call msaClustalOmega with custom parameters
b5f31f05
 msaClustalOmega(mySeqs, auto=FALSE, cluster=120, dealign=FALSE,
                 useKimura=FALSE, order="input", verbose=FALSE)
dafeef0b
 }
 \keyword{manip}