Name Mode Size
R 040000
inst 040000
man 040000
tests 040000
vignettes 040000
.Rbuildignore 100644 0 kb
.gitattributes 100644 0 kb
.gitignore 100644 0 kb
DESCRIPTION 100644 1 kb 100644 34 kb
NAMESPACE 100644 0 kb 100644 2 kb
BERT <img src="" width="120px" align="right" /> =========== > BERT (Batch-Effect Removal with Trees) offers flexible and efficient batch effect correction of omics data, while providing maximum tolerance to missing values. Tested on multiple datasets from proteomic analyses, BERT offered a typical 5-10x runtime improvement over existing methods, while retaining more numeric values and preserving batch effect reduction quality. As such, BERT is a valuable preprocessing tool for data analysis workflows, in particular for proteomic data. By providing BERT via Bioconductor, we make this tool available to a wider research community. An accompanying research paper is currently under preparation and will be made public soon. BERT addresses the same fundamental data integration challenges than the [HarmonizR]( package, which has been released on Bioconductor in November 2023. However, various algorithmic modications and optimizations of BERT provide better execution time and better data coverage than HarmonizR. Moreover, BERT offers a more user-friendly design and a less error-prone input format. **Please note that our package _BERT_ is neither affiliated with nor related to _Bidirectional Encoder Representations from Transformers_ as published by Google.** ## Usage For details on how to use BERT, please refer to the [vignette]( ## Issues Please report any issues in the GitHub forum or contact [the authors](, directly. ## License This code is published under the GPLv3.0 License. ## Reference Please cite our manuscript, if you use BERT for your research: > Yannis Schumann, Simon Schlumbohm et al., BERT - Batch Effect Reduction Trees for Large-Scale Data Integration with Tolerance for Missing Values and Imbalanced Data, 2024 (in preparation)