# mbkmeans: Mini-batch k-means clustering for single-cell RNA-seq
[data:image/s3,"s3://crabby-images/b9e94/b9e94dcc624ed28e6925071a6c445a2c4f831a0e" alt="Project Status: Active - The project has reached a stable, usable state and is being actively developed."](http://www.repostatus.org/#active)
[data:image/s3,"s3://crabby-images/9d21f/9d21ff2c35080bc7e65b9f9275322623f2c5103a" alt="BioC release"](https://bioconductor.org/checkResults/release/bioc-LATEST/mbkmeans)
[data:image/s3,"s3://crabby-images/9d21f/9d21ff2c35080bc7e65b9f9275322623f2c5103a" alt="BioC devel"](https://bioconductor.org/checkResults/devel/bioc-LATEST/mbkmeans)
[data:image/s3,"s3://crabby-images/7d0d3/7d0d3b9701e56a5b81ce952d64dce66c554b8123" alt="R-CMD-check"](https://github.com/drisso/mbkmeans/actions)
This package implements the mini-batch k-means algorithm for large datasets,
including support for on-disk data representation.
The method is described in details in the paper:
[S. Hicks, R. Liu, Y. Ni, E. Purdom, D. Risso (2021).
mbkmeans: Fast clustering for single cell data using mini-batch k-means. PLOS Computational Biology.](https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008625)
## Installation
In virtually all cases, installing from Bioconductor is recommended.
```{r}
if (!requireNamespace("BiocManager", quietly=TRUE))
install.packages("BiocManager")
BiocManager::install("mbkmeans")
```
In the rare event you need the development version from GitHub, use the following.
```{r}
library(devtools)
BiocManager::install("drisso/mbkmeans")
```