Name Mode Size
R 040000
data 040000
man 040000
tests 040000
vignettes 040000
.Rbuildignore 100644 0 kb
.gitignore 100644 0 kb
DESCRIPTION 100644 1 kb
NAMESPACE 100644 0 kb
NEWS 100644 0 kb 100644 2 kb
# RAREsim The functions within the RAREsim R package are used to estimate the expected number of rare variants per minor allele count (MAC) bin via the *Number of Variants* and *AFS* functions. Users have the flexibility to estimate the number of variants with default parameters, user-defined parameters, or parameters estimated to match user-provided target data. The default parameters are provided for four ancestries and are expected to perform well for sample sizes up to 125,000. The RAREsim R package is used to calculate the estimated number of variants per MAC bin. Throught the _Number of Variants_ and _AFS_ functions. The output of the RAREsim functions can be directly input into the pruning package for RAREsim, implemented within python: The pruning process requires the expected number of variants per rare MAC bin and over-simulated haplotypes created from simulating from HAPGEN2 with information at all sequencing bases - including monomorphic bases. A complete example with all steps can be found on the Example page: # Installation The RAREsim R package can be downloaded using devtools. ```{r} library(devtools) install_github('meganmichelle/RAREsim') ``` RAREsim also requires [HAPGEN2]( The pruning process is done with the [RAREsim python package]( # More Information RAREsim is a flexible and accurate rare variant simulation algorithm that emulates real data. Using parameters and haplotypes derived from real sequencing data, RAREsim efficiently simulates the expected variant distribution and enables real variant annotations. See the full article here: # Beta RAREsim is currently in beta.