1
|
1
|
new file mode 100644
|
...
|
...
|
@@ -0,0 +1,43 @@
|
|
1
|
+---
|
|
2
|
+title: "MotifDb"
|
|
3
|
+output:
|
|
4
|
+ html_document:
|
|
5
|
+ toc: true
|
|
6
|
+ toc_float: false
|
|
7
|
+vignette: >
|
|
8
|
+ %\VignetteIndexEntry{"A collection of PWMs"}
|
|
9
|
+ %\VignetteEngine{knitr::knitr}
|
|
10
|
+ %\VignetteEncoding{UTF-8}
|
|
11
|
+---
|
|
12
|
+# Introduction
|
|
13
|
+
|
|
14
|
+Many kinds of biological activity are regulated by the binding of proteins to their cognate
|
|
15
|
+substrates. Of particular interest is the sequence-specific binding of transcription factors to
|
|
16
|
+DNA, often in regulatory regions just upstream of the transcription start site of a gene. These
|
|
17
|
+binding events play a pivotal role in regulating gene expression. Sequence specificity among
|
|
18
|
+closely related binding sites is nearly always incomplete: some variety in the DNA sequence is
|
|
19
|
+routinely observed. For this reason, these inexact binding sequence patterns are commonly described
|
|
20
|
+as **motifs**, represented numerically as frequency matrices, and visualized as sequence logos.
|
|
21
|
+
|
|
22
|
+Despite their importance in current research, there has been until now no single, annotated,
|
|
23
|
+comprehensive collection of publicly available motifs. The current package attempts to provide such
|
|
24
|
+a collection, offering more than ten thousand annotated matrices from multiple organisms, within the
|
|
25
|
+context of the Bioconductor project. The matrices can be filtered and selected on the basis of
|
|
26
|
+their metadata, used with other Bioconductor packages (for instance, seqLogo can be used for for
|
|
27
|
+visualization) or easily exported for use with standard software and websites such as those provided
|
|
28
|
+by the [MEME Suite](http://meme.sdsc.edu/meme/doc/meme.html).
|
|
29
|
+
|
|
30
|
+Transcription factor binding sites (TFBS) can only be imperfectly predicted from sequence matching
|
|
31
|
+of motif to DNA sequence. When using MotifDb, please keep in mind that actual and functional TF
|
|
32
|
+binding occurs under the influence of many factors:
|
|
33
|
+
|
|
34
|
+ * Chromatin accessibility
|
|
35
|
+ * An appropriate balance of activating and repressing epigenetic influences
|
|
36
|
+ * The presence of the the transcription factor protein in adequate numbers
|
|
37
|
+ * "DNA shape" to complement or substitute for DNA sequence match
|
|
38
|
+ * Cooperativity with other DNA-binding factors, or co-factor interaction
|
|
39
|
+
|
|
40
|
+Still, motif-matching to DNA sequence plays an important role in identifying gene regulatory events.
|
|
41
|
+
|
|
42
|
+# Quick Start
|
|
43
|
+
|