Name Mode Size
R 040000
data 040000
inst 040000
man 040000
tests 040000
vignettes 040000
.Rbuildignore 100644 0 kb
.gitignore 100644 0 kb
DESCRIPTION 100644 1 kb
LICENSE 100644 1 kb
NAMESPACE 100644 1 kb
README.md 100644 4 kb
README.md
# DAMEfinder <img src="inst/figures/damefindersticker.png" width="200" align="right" /> [![platforms](http://bioconductor.org/shields/availability/3.11/DAMEfinder.svg)](https://bioconductor.org/packages/devel/bioc/html/DAMEfinder.html#archives)&nbsp; [![build](http://bioconductor.org/shields/build/devel/bioc/DAMEfinder.svg)](http://bioconductor.org/checkResults/devel/bioc-LATEST/DAMEfinder/) **DAMEfinder** (**D**ifferential **A**llele-specific **ME**thylation **finder**) is an R-package that detects allele-specific methylation (ASM) in a cohort of samples, and detects regions of differential ASM within groups of interest, based on **Bisulfite-sequencing** files. DAMEfinder runs in two modes: **SNP-based** (exhaustive-mode) and **tuple-based** (fast-mode), which converge when calculating differential methylation. <img src="inst/figures/DAMEfinder_workflow.png" width="520"> Please refer to the vignette for more details on running the pipeline. ### News: * Check `NEWS.Rd` [here](https://github.com/markrobinsonuzh/DAMEfinder/blob/master/inst/NEWS.Rd) for latest changes. * Preprint out [here](https://www.biorxiv.org/content/10.1101/800383v1) * Published version [here](https://epigeneticsandchromatin.biomedcentral.com/articles/10.1186/s13072-020-00346-8) * Available on [Bioconductor](https://bioconductor.org/packages/release/bioc/html/DAMEfinder.html) --- ## How do I install it? A stable version of DAMEfinder is available on Bioconductor. To install use: ```{r} if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager") BiocManager::install("DAMEfinder") ``` Or install directly from the repo: ```{r} BiocManager::install("markrobinsonuzh/DAMEfinder") ``` --- ## What mode should I choose? It depends on what you want to do and how much time you have. Either way you have to align your reads with [Bismark](https://github.com/FelixKrueger/Bismark) (apologies to other-aligner users). ### SNP-based To run the **SNP-based** mode you need processed `bam` files *AND* a VCF file for each of your samples with heterozygous SNPs. I know this is typically not the case, so you could alternatively extract heterozygous SNPs using [BisSNP](https://github.com/dnaase/Bis-tools/tree/master/Bis-SNP) (which I have used), or [biscuit](https://github.com/zwdzwd/biscuit). I call this the "exhaustive-mode" because it extracts an ASM score for every CpG site in the reads containing each SNP from the VCF file. Based on this score DAMEs are detected. From a biological point of view, you might want to run this mode if you are interested in loss or gain of allele-specificity linked to somatic heterozygous SNPs. More specifically, you could detect genes that exhibit loss of imprinting (e.g. [in colorectal cancer](http://cancerres.aacrjournals.org/content/62/22/6442.long)). ### tuple-based To run the **tuple-based** mode you have to run [methtuple](https://github.com/PeteHaitch/methtuple) first. The methtuple output is the only thing needed for this mode. I call this the fast-mode because you don't need SNP information. The assumption is that intermediate levels of methylation represent ASM along the genome. For example, we have shown that the ASM score can distinguish females from males in the X chromosome. Using SNP information this wouldn't be possible. --- ### What does that look like? 1. **SNP-based** ASM is sequence dependent, therefore to draw ASM I split the reads (horizontal lines) by allele, a bit like this: <img src="inst/figures/DAME_snp_sampledreads.png" width="550"> This is a (not so great) example of differential SNP-ASM. There is gain of ASM in CR-cancer tissue (CRC2_cimp). The reads shown are a random subset of the total. You could plot all the reads if you like. The actual ASM scores in this region, as well as the methylation level per allele for several samples looks like this: <img src="inst/figures/DAME_snp_allsamps.png" width="550"> or summarized by means: <img src="inst/figures/DAME_snp_allsamps_median.png" width="550"> 2. **tuple-based** ASM only relies on the reads. We can look at the tuple ASM scores in the same region as above, as well as the overall methylation (beta-values) for several samples: <img src="inst/figures/DAME_tuple_allsamps_median.png" width="550"> --- ## Feedback If you have problems installing, running, or just have questions regarding the package or theory behind it, please open an issue and I will answer as soon as possible.