# Modstrings: implementation of Biostrings to work with nucleotide sequences containing modified nucleotides
<img src="https://github.com/FelixErnst/BiocStickers/blob/Modstrings/Modstrings/Modstrings.png" height="300" align="right">
RNA are usually in some form post-transcriptionally modified. Most prominent
examples are of course rRNA and tRNA, but in recent years mRNA was also
discovered to be post-transcriptionally modified.
In many resources, like the tRNAdb ([Juehling et al. 2009](#Literature)) or the
modomics database ([Boccaletto et al. 2018](#Literature)), a dictionary for
modified nucleotides was published. However, in the Bioconductor universe these
information were not directly accessible ore representable, since they rely
extensively on special characters in the RNA modification alphabet.
Therefore, the`Modstrings` package implements the `ModRNAString` class by
extending the `BString` class from the `Biostrings` ([Pages et
al.](#Literature)) package. It can store RNA sequences containing special
characters of the RNA modification alphabet and thus can store location and
identity of modifications. Functions for conversion to a tabular format are
implemented as well.
The implemented classes inherit most of the functions from the parental
`BString` class and it derivatives, which allows them to behave like the
normal `XString` classes within the bioconductor universe.
# Installation
The current version of the `Modstrings` package is available from GitHub.
```{r}
remotes::install_github("FelixErnst/Modstrings")
# Load and attach the package
library("Modstrings")
```
A submission to Bioconductor is planned.
# Literature
- Jühling, Frank; Mörl, Mario; Hartmann, Roland K.; Sprinzl, Mathias; Stadler,
Peter F.; Pütz, Joern (2009): "TRNAdb 2009: Compilation of tRNA Sequences and
tRNA Genes." Nucleic Acids Research 37 (suppl_1): D159–D162.
doi:[10.1093/nar/gkn772](https://doi.org/10.1093/nar/gkn772).
- Boccaletto, Pietro; Machnicka, Magdalena A.; Purta, Elzbieta; Piatkowski,
Pawel; Baginski, Blazej; Wirecki, Tomasz K. et al. (2018): "MODOMICS: a database
of RNA modification pathways. 2017 update." Nucleic Acids Res. 46 (D1),
D303-D307. doi:[10.1093/nar/gkx1030](https://doi.org/10.1093/nar/gkx1030).
- Pagès, H.; Aboyoun, P.; Gentleman, R.; DebRoy, S. (2018). "Biostrings:
Efficient manipulation of biological strings." R package version 2.50.1.