Name Mode Size
.github 040000
R 040000
data 040000
dev 040000
inst 040000
man 040000
tests 040000
vignettes 040000
.Rbuildignore 100644 0 kb
.gitignore 100644 0 kb
DESCRIPTION 100644 2 kb
NAMESPACE 100644 1 kb 100644 0 kb
README.Rmd 100644 4 kb 100644 5 kb
codecov.yml 100644 0 kb
<!-- This file is generated from README.Rmd. Please edit that file -->

# doubletrouble <img src="man/figures/logo.png" align="right" height="139" />

The major goal of `doubletrouble` is to identify duplicated genes from whole-genome protein sequences and classify them based on their modes of duplication. The simplest classification scheme has two duplication modes:

1. Whole-genome duplication (WGD);
2. Small-scale duplication (SSD).

For a more detailed view of the duplication modes, users can also choose to split SSD into subcategories, so the available duplication modes will be:

1. Whole-genome duplication (WGD);
2. Tandem duplication (TD);
3. Proximal duplication (PD);
4. Transposed duplication (TRD);
5. Dispersed duplication (DD).

Besides classifying gene pairs, users can also classify genes, so that each gene is assigned a unique mode of duplication.

Users can also calculate nonsynonymous and synonymous substitution rates per site (i.e., Ka and Ks) for duplicate pairs, find peaks in Ks distributions with Gaussian Mixture Models (GMMs), and classify gene pairs into age groups based on Ks peaks.

## Installation instructions

Get the latest stable `R` release from CRAN. Then install `doubletrouble` from Bioconductor using the following code:

``` r
if (!requireNamespace("BiocManager", quietly = TRUE)) {
    install.packages("BiocManager")
}

BiocManager::install("doubletrouble")
```

Install the development version from GitHub with:

``` r
BiocManager::install("almeidasilvaf/doubletrouble")
```

## Citation

Below is the citation output from using `citation('doubletrouble')` in R. Please run this yourself to check for any updates on how to cite **doubletrouble**.

``` r
print(citation('doubletrouble'), bibtex = TRUE)
#>
#> To cite package 'doubletrouble' in publications use:
#>
#>   Almeida-Silva F, Van de Peer Y (2022). _doubletrouble: Identification
#>   and classification of duplicated genes_. R package version 0.99.1,
#>   <>.
#>
#> A BibTeX entry for LaTeX users is
#>
#>   @Manual{,
#>     title = {doubletrouble: Identification and classification of duplicated genes},
#>     author = {Fabrício Almeida-Silva and Yves {Van de Peer}},
#>     year = {2022},
#>     note = {R package version 0.99.1},
#>     url = {},
#>   }
```

Please note that `doubletrouble` was only made possible thanks to many other R and bioinformatics software authors, who are cited in the vignettes and/or the paper(s) describing this package.

## Code of Conduct

Please note that the `doubletrouble` project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

## Development tools

- Continuous code testing is possible thanks to GitHub actions through *usethis*, *remotes*, and *rcmdcheck*, customized to use Bioconductor’s docker containers and *BiocCheck*.
- Code coverage assessment is possible thanks to codecov and *covr*.
- The documentation website is automatically updated thanks to *pkgdown*.
- The code is styled automatically thanks to *styler*.
- The documentation is formatted thanks to *devtools* and *roxygen2*.

For more details, check the `dev` directory.

This package was developed using *biocthis*.