<b dir="9EcwFk"></b>
Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The . gov means it’s official. Federal government websites often end in . gov or VSports app下载. mil. Before sharing sensitive information, make sure you’re on a federal government site. .

Https

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely. V体育官网.

. 2011 Aug 18;12(8):R79.
doi: 10.1186/gb-2011-12-8-r79.

PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data

Affiliations

PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data (VSports最新版本)

David L Corcoran et al. Genome Biol. .

Abstract

Crosslinking and immunoprecipitation (CLIP) protocols have made it possible to identify transcriptome-wide RNA-protein interaction sites. In particular, PAR-CLIP utilizes a photoactivatable nucleoside for more efficient crosslinking. We present an approach, centered on the novel PARalyzer tool, for mapping high-confidence sites from PAR-CLIP deep-sequencing data. We show that PARalyzer delineates sites with a high signal-to-noise ratio. Motif finding identifies the sequence preferences of RNA-binding proteins, as well as seed-matches for highly expressed microRNAs when profiling Argonaute proteins. Our study describes tailored analytical methods and provides guidelines for future efforts to utilize high-throughput sequencing in RNA biology. PARalyzer is available at http://www VSports手机版. genome. duke. edu/labs/ohler/research/PARalyzer/. .

PubMed Disclaimer

"VSports手机版" Figures

Figure 1
Figure 1
Example of PARalyzer interaction site identification. The entire genomic region corresponds to a single read-group from the Pumilio2 library. The orange region represents the nucleotides where the signal kernel density estimate is above background. The light pink locations are the full interaction sites extended by up to 5 nucleotides. A light gold box highlights the sequences that match the known Pumilio2 binding motif.
Figure 2
Figure 2
Nucleotide composition and RNA crosslinking likelihood centered on AGO1-4, QKI, PUM2, and IGF2BP1 interaction sites. The interaction site analysis is from all of the datasets: Quaking (QKI), Pumilio2 (PUM2), Insulin-like growth factor 2 binding protein 1 (IGF2BP1), and Argonaute 1 to 4 (AGO1 to -4). Heatmap: nucleotide composition, relative to a uniform background, of each individual binding site found in the respective genic regions. Barplot: likelihood of a T = > C conversion given that there is a 'T' at the given position. Unlike the heatmap, the barplot is not normalized by the number of reads mapping to an individual binding site. The red dotted line indicates the background conversion probability for all 'T's within the respective genic regions for each respective dataset. (a) Non-redundant seed-matches in 3' UTRs for the top 20 expressed miRNAs in the Argonaute dataset. 8 mer-m1 is a seed-match between the mRNA and nucleotides 1 to 8 of the miRNA seed sequence, 8 mer-A1 matches nucleotides 2 to 8 of the seed sequence paired with an A at position 1. 7 mer-1 m and 7 mer-A1 are similarly defined for nucleotides 1 to 7; 7 mer-m8 is a match utilizing nucleotides 2 to 8 of the seed sequence. 6 mer2-7 is a match utilizing nucleotides 2 to 7 of the seed sequence, and 6 mer3-8 utilizes nucleotides 3 to 8 of the sequence. (b) Motif matches for the two Quaking motifs in 3' UTRs, 5' UTRs, coding regions and introns. (c) Motif matches for the Pumilio 2 dataset in 3' UTRs, 5' UTRs, coding regions and introns. (d) Motif matches for the IGF2BP1 dataset in 3' UTRs, 5' UTRs, coding regions and introns.
Figure 3
Figure 3
Genomic location of PARalyzer generated interaction sites for four RNA-binding proteins. Locations of interaction sites that contained at least two T = > C conversions were compared to transcript sequences as annotated in ENSEMBL (release 57) [42]. The different repeat region classes were identified by RepeatMasker [44]. The following repeat types were collected for this analysis: low complexity repeat family (low complexity), long interspersed nuclear elements (LINE), short interspersed nuclear elements (SINE), DNA transposons (DNA), RNA repeat families (RNA), satellite repeat family (Satellite), rolling circle (RC), unknown repeat family (Unknown), long terminal repeats (LTR) and other repeats (Other).
Figure 4
Figure 4
Properties of Argonaute interaction site generation and their comparison to crosslink-centered regions. (a) Distribution of interaction site sizes for the Argonaute dataset for sites that fall within 3' UTRs and contain two or more T = > C conversion locations. The vertical red line represents the 41-nucleotide size of the Hafner et al. [7] crosslink-centered regions (CCRs). (b) Distribution of interaction site locations across a normalized 3' UTR for all clusters that have two or more T = > C conversion locations. (c) The signal-to-noise for the top 20 expressed miRNAs in the Argonaute dataset for both PARalyzer generated interaction sites and the Hafner et al. [7] CCRs located in 3' UTRs. (d) Average log2 signal-to-noise ratio of window size 21 across all 361 miRNAs reported expressed in Hafner et al. in the order of their expression rank.

"VSports" References

    1. Keene JD. RNA regulons: coordination of post-transcriptional events. Nat Rev Genet. 2007;8:533–543. doi: 10.1038/nrg2111. - "V体育ios版" DOI - PubMed
    1. Keene JD, Komisarow JM, Friedersdorf MB. RIP-Chip: the isolation and identification of mRNAs, microRNAs and protein components of ribonucleoprotein complexes from cell extracts. Nat Protoc. 2006;1:302–307. doi: 10.1038/nprot.2006.47. - DOI - PubMed
    1. Sephton CF, Cenik C, Kucukural A, Dammer EB, Cenik B, Han Y, Dewey CM, Roth FP, Herz J, Peng J, Moore MJ, Yu G. Identification of neuronal RNA targets of TDP-43-containing ribonucleoprotein complexes. J Biol Chem. 2011;286:1204–1215. doi: 10.1074/jbc.M110.190884. - DOI - PMC - PubMed
    1. Zhao J, Ohsumi TK, Kung JT, Ogawa Y, Grau DJ, Sarma K, Song JJ, Kingston RE, Borowsky M, Lee JT. Genome-wide identification of polycomb-associated RNAs by RIP-seq. Mol Cell. 2010;40:939–953. doi: 10.1016/j.molcel.2010.12.011. - DOI - PMC - PubMed
    1. Ule J, Jensen KB, Ruggiu M, Mele A, Ule A, Darnell RB. CLIP identifies Nova-regulated RNA networks in the brain. Science. 2003;302:1212–1215. doi: 10.1126/science.1090095. - "VSports手机版" DOI - PubMed

VSports最新版本 - Publication types

"V体育ios版" MeSH terms

LinkOut - more resources