Name Mode Size
R 040000
data 040000
inst 040000
man 040000
tests 040000
vignettes 040000
.Rbuildignore 100644 0 kb
.gitignore 100644 1 kb
DESCRIPTION 100644 2 kb
NAMESPACE 100644 5 kb
NEWS.md 100644 2 kb
README.md 100644 6 kb
README.md
<!-- README.md is generated from README.Rmd. Please edit that file --> # GenomAutomorphism [<img src="man/figures/logo.png" align="right" />](https://genomaths.github.io/genomautomorphism) Robersy Sanchez Department of Biology. Eberly College of Science. Pennsylvania State University, University Park, PA 16802 <genomicmath@gmail.com> [ORCID: orcid.org/0000-0002-5246-1453](https://orcid.org/0000-0002-5246-1453) ## Overview This is a R package to compute the automorphisms between pairwise aligned DNA sequences represented as elements from a Genomic Abelian group as described in the paper [Genomic Abelian Finite Groups](https://www.biorxiv.org/content/10.1101/2021.06.01.446543v2). In a general scenario, whole chromosomes or genomic regions from a population (from any species or close related species) can be algebraically represented as a direct sum of cyclic groups or more specifically Abelian *p*-groups. Basically, we propose the representation of multiple sequence alignments (MSA) of length *N* as a finite Abelian group created by the direct sum of homocyclic Abelian group of *prime-power order*:    *G* = (ℤ<sub>*p*<sub>1</sub><sup>*α*<sub>1</sub></sup></sub>)<sup>*n*<sub>1</sub></sup> ⊕ (ℤ<sub>*p*<sub>1</sub><sup>*α*<sub>2</sub></sup></sub>)<sup>*n*<sub>2</sub></sup> ⊕ … ⊕ (ℤ<sub>*p*<sub>*k*</sub><sup>*α*<sub>*k*</sub></sup></sub>)<sup>*n*<sub>*k*</sub></sup> Where, the *p*<sub>*i*</sub>’s are prime numbers, *α*<sub>*i*</sub> ∈ ℕ and ℤ<sub>*p*<sub>*i*</sub><sup>*α*<sub>*i*</sub></sup></sub> is the group of integer modulo *p*<sub>*i*</sub><sup>*α*<sub>*i*</sub></sup>. For the purpose of automorphism between two aligned DNA sequences, *p*<sub>*i*</sub><sup>*α*<sub>*i*</sub></sup> ∈ {5, 2<sup>6</sup>, 5<sup>3</sup>}. ------------------------------------------------------------------------ ## Status This application is currently available in Bioconductor (version 3.18) <https://doi.org/doi:10.18129/B9.bioc.GenomAutomorphism>. Watch this repo or check for updates. ------------------------------------------------------------------------ ## Tutorials There are several tutorials on how to use the package at [GenomAutomorphism](https://genomaths.github.io/genomautomorphism) website [<img src="man/figures/logo.png" align="middle" width="32" height="32" />](https://genomaths.github.io/genomautomorphism) - <a href="https://is.gd/zwRaUw" target="_blank" rel="noopener">Get started-with GenomAutomorphism</a> - <a href="https://is.gd/A03Fkl" target="_blank" rel="noopener">Analysis of Automorphisms on a DNA Multiple Sequence Alignment</a> - <a href="https://is.gd/gsZcuj" target="_blank" rel="noopener">Analysis of Automorphisms on a MSA of Primate BRCA1 Gene</a> - <a href="https://is.gd/87wlbL" target="_blank" rel="noopener">A Short Introduction to Algebraic Taxonomy on Genes Regions</a> - <a href="https://is.gd/836uas" target="_blank" rel="noopener">Automorphism analysis on COVID-19 data</a> - <a href="https://is.gd/wlyzhr" target="_blank" rel="noopener">Modular Matrix Operations of Mutational Events</a> ## Dependences This package depends, so far, from: *Biostrings*, *GenomicRanges*, *numbers*, and *S4Vectors*. ------------------------------------------------------------------------ ## Installation of R dependencies: if (!requireNamespace("BiocManager")) install.packages("BiocManager") BiocManager::install(c("Biostrings", "GenomicRanges", "S4Vectors", "BiocParallel", "GenomeInfoDb", "BiocGenerics", "numbers", "devtools", "doParallel", "data.table", "foreach","parallel"), dependencies = TRUE) ------------------------------------------------------------------------ ## You can install **GenomAutomorphism** package from GitHub BiocManager::install('genomaths/GenomAutomorphism') ------------------------------------------------------------------------ # References 1. Sanchez R, Morgado E, Grau R. Gene algebra from a genetic code algebraic structure. J Math Biol. 2005 Oct;51(4):431-57. doi: 10.1007/s00285-005-0332-8. Epub 2005 Jul 13. PMID: 16012800. ( [PDF](https://arxiv.org/pdf/q-bio/0412033.pdf)). 2. Sanchez R, Grau R, Morgado E. A novel Lie algebra of the genetic code over the Galois field of four DNA bases. Math Biosci. 2006;202: 156–174. <doi:10.1016/j.mbs.2006.03.017> 3. Sanchez R, Grau R. An algebraic hypothesis about the primeval genetic code architecture. Math Biosci. 2009/07/18. 2009;221: 60–76. [doi:10.1016/j.mbs.2009.07.001](https://doi.org/10.1016/j.mbs.2009.07.001) 4. Robersy Sanchez, Jesús Barreto (2021) Genomic Abelian Finite Groups. [doi: 10.1101/2021.06.01.446543](https://doi.org/10.1101/2021.06.01.446543). 5. M. V José, E.R. Morgado, R. Sánchez, T. Govezensky, The 24 possible algebraic representations of the standard genetic code in six or in three dimensions, Adv. Stud. Biol. 4 (2012) 119–152.[PDF](https://is.gd/na9eap). 6. R. Sanchez. Symmetric Group of the Genetic–Code Cubes. Effect of the Genetic–Code Architecture on the Evolutionary Process MATCH Commun. Math. Comput. Chem. 79 (2018) 527-560. [PDF](https://bit.ly/2Z9mjM7). 7. Sanchez, R., 2014. Evolutionary Analysis of DNA-protein-coding regions based on a genetic code cube metric. Current Topics in Medicinal Chemistry, 14(3), pp.407-417. https://doi.org/10.2174/1568026613666131204110022. ## See also [Symmetric Group of the Genetic-Code Cubes](https://github.com/genomaths/GenomeAlgebra_SymmetricGroup)