Name Mode Size
.github 040000
R 040000
inst 040000
man 040000
tests 040000
vignettes 040000
www 040000
.Rbuildignore 100644 0 kb
.gitattributes 100644 0 kb
.gitignore 100644 0 kb
DESCRIPTION 100644 1 kb
LICENSE 100644 0 kb
LICENSE.md 100644 1 kb
NAMESPACE 100644 1 kb
NEWS.md 100644 2 kb
README.md 100644 3 kb
README.md
# immReferent An Interface for Immune Receptor and HLA Gene Reference Data <!-- badges: start --> [![R-CMD-check](https://github.com/BorchLab/immReferent/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/BorchLab/immReferent/actions/workflows/R-CMD-check.yaml) [![Codecov test coverage](https://codecov.io/gh/BorchLab/immReferent/graph/badge.svg)](https://app.codecov.io/gh/BorchLab/immReferent) <!-- badges: end --> ## Introduction <img align="right" src="https://github.com/BorchLab/immReferent/blob/main/www/immreferent_hex.png" width="305" height="352"> immReferent is an R package designed to provide a stable, reproducible, and lightweight interface to IMGT immune receptor (TCR/BCR) and HLA sequences and the AIRR-C's OGRDB. It serves as the backbone for computational immunology workflows by ensuring a consistent source of high-quality nucleotide and protein sequences. ### Interactions * [IMGT](https://www.imgt.org/) * [IPD-IMGT/HLA](https://www.ebi.ac.uk/ipd/imgt/hla/) * [OGRDB](https://ogrdb.airr-community.org/) Please read more on these amazing resources and cite them in your work! ### This package enables: * Downloading IMGT/OGRDB data (receptor and HLA sequences). * Caching for offline reproducibility. * Querying metadata (allele, gene, species). * Interoperability with Bioconductor objects (DNAStringSet, AAStringSet). ### Why immReferent? As immune repertoire analysis expands, a centralized sequence reference layer is critical for: * Integrating across tools like [scRepertoire](https://github.com/BorchLab/scRepertoire) and [immApex](https://github.com/BorchLab/immApex) * Reducing redundancy by avoiding repeated hard-coded IMGT/OGRDB downloads. * Guaranteeing reproducibility with cached versions. # Installation ``` devtools::install_github("BorchLab/immReferent") ``` Or via Bioconductor (once accepted) ``` if (!require("BiocManager", quietly = TRUE)) install.packages("BiocManager") BiocManager::install("immReferent") ``` # Getting Started ## IMGT usage IMGT is used as a reference for gene names and sequence information can be accessed via `getIMGT()`. Data from IMGT is under a CC BY-NC-ND 4.0 license. Please be aware that attribution is required for usage and it is the intent of IMGT to not allow derivative or commercial usage. ``` library(immReferent) # Check if IMGT is online is_imgt_available() # Download human IGHV sequences ighv <- getIMGT("IGHV") # Load cached data cached <- loadIMGT("IGHV") # List available datasets listIMGT() # Refresh cache refreshIMGT("IGHV") ``` ## OGRDB usage ``` library(immReferent) # Check if OGRDB is online is_ogrdb_available() # Download human IGHV sequences igk_airr <- getOGRDB(species = "human", locus = "IGK", type = "NUC", format = "AIRR") # Load cached data cached <- loadIMGT(species = "human", locus = "IGK", type = "NUC", format = "AIRR") # List available datasets listOGRDB() # Refresh cache refreshOGRB(species = "human", locus = "IGK", type = "NUC", format = "AIRR") ``` ## Bug Reports/New Features #### If you run into any issues or bugs please submit a [GitHub issue](https://github.com/BorchLab/immReferent/issues) with details of the issue. If possible please include a [reproducible example](https://reprex.tidyverse.org/). #### Any requests for new features or enhancements can also be submitted as [GitHub issues](https://github.com/BorchLab/immReferent/issues). #### [Pull Requests](https://github.com/BorchLab/immReferent/pulls) are welcome for bug fixes, new features, or enhancements. ## Citation In Progress