Bioconductor Code: TargetDecoy

Name	Mode	Size
.github	040000
R	040000
data	040000
man	040000
tests	040000
vignettes	040000
.Rbuildignore	100644	0 kb
.gitignore	100644	0 kb
DESCRIPTION	100644	3 kb
NAMESPACE	100644	0 kb
NEWS.md	100644	1 kb
README.Rmd	100644	4 kb
README.md	100644	4 kb
codecov.yml	100644	0 kb

README.md

# TargetDecoy

[![Lifecycle: stable](https://img.shields.io/badge/lifecycle-stable-brightgreen.svg)](https://lifecycle.r-lib.org/articles/stages.html#stable)
[![R-CMD-check-bioc](https://github.com/statOmics/TargetDecoy/workflows/R-CMD-check-bioc/badge.svg)](https://github.com/statOmics/TargetDecoy/actions)
[![Codecov test coverage](https://codecov.io/gh/statOmics/TargetDecoy/branch/master/graph/badge.svg)](https://codecov.io/gh/statOmics/TargetDecoy?branch=master)
[![BioC status release](http://www.bioconductor.org/shields/build/release/bioc/TargetDecoy.svg)](https://bioconductor.org/checkResults/release/bioc-LATEST/TargetDecoy)
[![BioC status devel](http://www.bioconductor.org/shields/build/devel/bioc/TargetDecoy.svg)](https://bioconductor.org/checkResults/devel/bioc-LATEST/TargetDecoy)

The goal of **TargetDecoy** is to generate diagnostic plots to evaluate the quality of the target decoy approach (TDA).

A first step in the analysis of mass spectrometry (MS) based proteomics data is to identify peptides and proteins. To do so, the large number of experimental mass spectra typically has to be assigned to theoretical peptides derived from a sequence database. Search engines are used for this purpose. These tools compare each observed spectrum to all candidate theoretical spectra derived from the sequence database and calculate a score for each comparison. The observed spectrum is then assigned to the theoretical peptide with the best score; this pairing is referred to as the peptide-to-spectrum match (PSM).

It is crucial for the downstream analysis to evaluate the quality of these matches. Therefore, false discovery rate (FDR) control is used to return a reliable list of PSMs. Estimating the FDR, however, requires a good characterisation of the score distribution of PSMs that are matched to the wrong peptide (bad target hits). In proteomics, the target decoy approach (TDA) is typically used for this purpose.
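
The assignment step described above, where each spectrum keeps only its best-scoring candidate peptide, can be sketched in base R. This is only an illustration of the idea, not how any particular search engine works: the spectrum identifiers, peptide sequences and scores are invented, and the sketch assumes that a higher score means a better match.

``` r
# Toy candidate matches: each row is one spectrum/peptide comparison.
# All identifiers and scores are invented for illustration.
candidates <- data.frame(
    spectrum = c("s1", "s1", "s1", "s2", "s2", "s3"),
    peptide  = c("PEPTIDEK", "AAGLK", "VLSEER", "AAGLK", "TTWQR", "VLSEER"),
    score    = c(12.4, 3.1, 5.8, 2.2, 9.7, 4.0),
    stringsAsFactors = FALSE
)

# Sort so that, within each spectrum, the best score comes first
# (assumption: higher is better), then keep the first row per spectrum.
# The retained row is the PSM for that spectrum.
ordered <- candidates[order(candidates$spectrum, -candidates$score), ]
psms <- ordered[!duplicated(ordered$spectrum), ]
rownames(psms) <- NULL

print(psms)
```

For engines that report scores where lower is better (for example E-values), the sort direction would have to be flipped.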

The TDA method matches the spectra to a database of real peptides (targets) and nonsense peptides (decoys). A popular approach to generate these decoys is to reverse the sequences in the target database. Hence, all PSMs that match a decoy are known to be bad hits, and the distribution of their scores is used to estimate the score distribution of the bad target PSMs. A crucial assumption of the TDA is that decoy PSM hits have similar properties to bad target hits, so that the decoy PSM scores are a good simulation of the bad target PSM scores. Users, however, typically do not evaluate these assumptions. To this end, we developed TargetDecoy to generate diagnostic plots to evaluate the quality of the target decoy method.

## Installation

You can install *[TargetDecoy](https://bioconductor.org/packages/3.15/TargetDecoy)* from [*Bioconductor*](http://bioconductor.org/) using the following code:

``` r
if (!requireNamespace("BiocManager", quietly = TRUE)) {
    install.packages("BiocManager")
}

BiocManager::install("TargetDecoy")
```

The latest development version of **TargetDecoy** can also be installed from [GitHub](https://github.com/statOmics/TargetDecoy) with:

``` r
BiocManager::install("statOmics/TargetDecoy")
```

## Getting started

Check the [vignette](https://bioconductor.org/packages/release/bioc/vignettes/TargetDecoy/inst/doc/TargetDecoy.html).
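
For a feel of what the TDA assumption means in practice, the following base R sketch simulates PSM scores, computes a decoy-based FDR estimate at a score threshold, and draws a simple P-P style comparison of the decoy and target score distributions. It deliberately does not use the TargetDecoy API and should not be read as the package's implementation. The simulated distributions, the helper `estimate_fdr()` and the threshold of 4 are all hypothetical choices, and the sketch assumes that higher scores are better and that targets and decoys were searched together.

``` r
set.seed(1)

# Simulated PSM scores (assumption: higher is better).
# Decoys and bad target hits are drawn from the same distribution, so the
# TDA assumption holds by construction in this toy example.
n_decoy <- 1000
n_bad   <- 1000
n_good  <- 1500
psms <- data.frame(
    score = c(rnorm(n_decoy, mean = 2, sd = 1),   # decoy hits
              rnorm(n_bad,   mean = 2, sd = 1),   # bad target hits
              rnorm(n_good,  mean = 7, sd = 1)),  # good target hits
    decoy = rep(c(TRUE, FALSE), times = c(n_decoy, n_bad + n_good))
)

# Hypothetical helper: decoy-based FDR estimate at a score threshold,
# i.e. the number of decoys at or above the threshold divided by the
# number of targets at or above it.
estimate_fdr <- function(scores, is_decoy, threshold) {
    n_targets <- sum(scores >= threshold & !is_decoy)
    n_decoys  <- sum(scores >= threshold & is_decoy)
    if (n_targets == 0) return(NA_real_)
    min(1, n_decoys / n_targets)
}

fdr_at_4 <- estimate_fdr(psms$score, psms$decoy, threshold = 4)

# P-P style diagnostic: evaluate both empirical CDFs on a common score grid.
grid     <- sort(psms$score)
F_decoy  <- ecdf(psms$score[psms$decoy])(grid)
F_target <- ecdf(psms$score[!psms$decoy])(grid)

# Ratio of decoys to targets, used as the slope of the reference line.
pi0_hat <- sum(psms$decoy) / sum(!psms$decoy)

# If decoys mimic bad target hits, the curve should follow the dashed
# reference line at low scores, where good target hits are rare.
plot(F_decoy, F_target, type = "l",
     xlab = "Decoy ECDF", ylab = "Target ECDF")
abline(a = 0, b = pi0_hat, lty = 2)
```

Changing the mean or spread of the simulated decoy scores relative to the bad target hits makes the curve leave the reference line at low scores, which is the kind of violation the diagnostic plots are meant to reveal.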