27876680.txt
22.3 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
Methods
Indeed , other putative targets identified by MAPS still need validation .
Surprisingly , in Lalaouna et al. [ 1 ] , unusual binding partners were also co-purified with MS2-sRNA , such as transcribed spacer sequences of polycistronic tRNA transcripts ( glyW-cysT-leuZ and metZ-metW-metV ) .
For a long time , these sequences were considered as non-functional `` junk RNA '' .
Nevertheless , we demonstrated that , during glyW-cysT-leuZ pre-tRNA processing , the 30external transcribed spacer sequence ( 30ETSleuZ ) is excised and acts as a sRNAs sponge ( RyhB and RybB ) preventing them from binding their real targets [ 1 ] .
Thus , 30ETSleuZ represents the first functional tRNA-derived RNA fragment ( tRF ) in bacteria ( see reviews [ 6 -- 8 ] for more details on tRFs ) .
Evidence also showed that RybB sRNA interacts with another kind of tRFs : the internal transcribed spacers ( ITS ) of the metZ-metW-metV pre-tRNA [ 1,9 ] .
We first noticed a strong enrichment of both ITSmetZ-metW/ITSmetW-metV with MS2-RybB [ 1 ] .
Then , we performed the reverse experiment using each ITS as bait and were able to co-purify RybB sRNA confirming that ITSmetZ-metW/ITSmetW-metV and RybB interact in vivo [ 9 ] .
However , the related mechanism of regulation is still under investigation .
Taken together , these results demonstrated that , unlike other methods commonly used to identify targets ( e.g. microarray , Northern blot , RNAseq ) , MAPS can depict the whole interactome of a specific RNA as it identifies all kinds of interacting partners .
In this report , we concisely describe adjustments made to MAPS to study various kind of RNA : RNA interactions .
We also bring important modifications to MAPS protocol to be suitable for patho-genic or Gram-positive strains .
Finally , we discuss the future of this technology
Recently , we developed an in vivo technology , called MS2-affinity purification coupled with RNA sequencing ( MAPS ) , to tackle the problem of the identification of RNA : RNA interactions [ 1 ] .
Briefly , the technique is based on the strong affinity of the MS2 protein for the MS2 RNA aptamer used to tag an RNA of interest ( see review [ 2 ] for more details on the history of RNA tagging ) .
The MAPS technology was first used in two recent reports studying sRNA-mRNA interactions [ 1,3 ] .
Regulatory small RNAs ( sRNAs ) are involved in the adaptation of bacteria facing environmental fluctuations by post-transcriptionally regulating a subset of mRNAs [ 4,5 ] .
Using MAPS , we already drew the interacting map of RyhB , RybB and DsrA , three well-characterized sRNAs in Escherichia coli .
Here , we confirmed previously known targets but also revealed new ones [ 1,3 ] .
For example , we characterized grxD mRNA , involved in iron-sulfur cluster biosynthesis , as negatively regulated by RyhB sRNA [ 1 ] .
In the same report , we validated yifE , encoding a conserved protein of unknown function , as a RybB positively regulated target .
Moreover , we characterized a new atypical mRNA target of DsrA , coding for the ribose pyranase ( RbsD ) [ 3 ] .
Contrary to the canonical mechanism of repression , DsrA inhibits translation of rbsD mRNA by interacting in the coding sequence and induces mRNA decay by an unknown ribonuclease ( RNase ) .
However , we only touched the tip of the iceberg in these reports .
⇑ Corresponding author at : Department of Biochemistry , Université de Sherbrooke , 3201 Jean Mignault Street , Sherbrooke , QC J1E 4K8 , Canada .
E-mail address : eric.masse@usherbrooke.ca ( E. Massé ) .
2. Materials and methods
2.1. MS2 tagging
We usually fused the 50-end of the RNA fragment of interest with the MS2 RNA aptamer .
We recommend 50-end MS2 tagging in order to keep the endogenous terminator and to minimize secondary structure changes .
To obtain the corresponding chimeric DNA sequence cloned in a plasmid under the control of a pBAD promoter ( arabinose-inducible promoter ) , we have first inserted MS2 stemloops into a pNM12 vector using MscI and EcoRI restriction enzymes ( NEB ) .
Then , the DNA sequence corresponding to the RNA of interest is added using EcoRI and SphI ( Fig. 1A ) .
The secondary structure of the chimeric RNA is verified using mfold software ( http://unafold.rna.albany.edu/?q=mfold ) .
In the case of 50UTR-mRNA ( untranslated region of mRNA ) , we added MS2 stemloops at the 30end ( commonly at the beginning of the coding sequence ) followed by a T7 transcriptional terminator ( Fig. 1B ) .
The MS2 aptamer and the T7 terminator are added to the 50UTR of mRNA using nested PCR .
A stop codon is inserted in frame , just before the MS2 aptamer sequence to interrupt protein translation .
In all cases , we used the untagged RNA as a control .
We performed Northern blot analysis to verify that the MS2-RNA construct is expressed at a level similar to the untagged RNA .
For sRNA , we also tested the activity of the MS2-sRNA construct on known mRNA targets .
This activity has to be equivalent to the control .
2.2. Strains
Typically , we performed MS2-sRNA MAPS in a DsRNA rne131 ( RNA degradosome assembly mutant ) background .
Both mutations maximize mRNA targets enrichment by eliminating interaction with the endogenous untagged sRNA and slowing down mRNA targets decay .
Indeed , the use of a wild-type background reduces the enrichment ratio of negatively regulated targets ( data not shown ) .
In the case of MS2-tRFs and 50UTR-mRNA-MS2 , we did not delete the endogenous gene to avoid potential disruptions of cellular metabolism .
2.3. MS2-MBP protein purification
The plasmid pHMM , first published in Batey and Kieft [ 10 ] , encodes the DNA sequence required to produce the 59 kDa polypeptide His6-MS2-MBP .
This hybrid protein is composed of an N-terminal hexahistidine-tag , a maltose-binding protein ( MBP ) domain , and a C-terminal MS2 coat protein .
The MS2-MBP protein purification was performed as described by Batey and Kieft [ 10 ] with major modifications .
BL21/pLysS E. coli cells containing pHMM plasmid were first grown in 1 L of rich medium ( Luria Bertani medium , LB ) supplemented with 30 lg/mL chloramphenicol and 10 lg/mL kanamycin at 37 C to an OD600nm of 0.7 .
At this point , the construct was expressed by addition of 0.5 mM IPTG .
After 3 h of induction , cells were harvested and centrifuged ( 3800g for 15 min at 4 C ) .
Cells pellets were resuspended in 50 mL of Lysis buffer ( 50 mM NaH2PO4 pH 8.0 , 300 mM NaCl , 0.5 % Tween 20 , 10 mM imidazole and 10 % glycerol ) supplemented with 50 ll of protease inhibitor cocktail ( 1 mg/mL ; Sigma-Aldrich ) and 5 mL of lysozyme ( 10 mg/mL ) .
Then , cells were lysed by sonication ( 25 % for 4 min 5 s sonication , 5 s pause on ice ; Branson Digital Sonifier ) and centrifuged ( 17,000 g for 45 min ) .
A second step of sonication is required if the supernatant remains too viscous .
After the lysis step , the His6-MS2-MBP protein was purified using a Ni-NTA agarose column ( 4 mL of Ni-NTA agarose resin ( Qiagen ) on a 25 mL Econo-Pac chromatography column ( Biorad ) ) equilibrated with 25 mL of lysis buffer .
The supernatant was loaded onto the column and the column was washed with 25 mL of lysis buffer .
Samples were eluted by addition of 4 mL of Lysis buffer supplemented with 250 mM imidazole .
After a dialyze step in 1 L of Dialysis buffer ( 25 mM Na-MES pH 6.0 , 25 mM NaCl ) , samples were loaded on a amylose resin column ( 3 mL of amylose resin ( NEB ) on a 25 mL Econo-Pac chromatography column ) equilibrated with 25 mL of Column buffer ( 20 mM Tris-HCl pH 7.4 , 200 mM NaCl , 0.5 mM EDTA ) .
The column was wash with 36 mL of Column buffer .
Samples were eluted with 6 mL of Column buffer supplemented with 10 mM maltose .
The output was dialyzed in 1 L of buffer A ( 20 mM Tris-HCl pH 8.0 , 150 mM KCl , 1 mM MgCl2 ) supplemented with 10 % glycerol .
Finally , the protein concentration was calculated using OD280nm and a molar extinction coefficient of 83.310 M 1 cm 1 .
2.4. MS2-affinity purification coupled with RNA sequencing
A schematic representation of MAPS technology is available in Fig. 2 .
Cells were grown in LB supplemented with 50 lg/mL ampicillin ( diluted 1/1000 from an overnight culture grown ) .
Note that MAPS was successfully performed in other media ( e.g. M63 minimal medium ) ( data not shown ) .
Cells were harvested in ( A ) exponential ( OD600nm = 0.5 ; 100 mL ) and/or ( B ) stationary phase of growth ( OD600nm > 1 ; 100 mL ) after induction of MS2 construct ( or control ) with 0.1 % arabinose for 10 min and then chilled on ice for 10 min .
For input samples ( collected before affinity purification ) , RNA was extracted from 600 lL of culture using the hot-phenol procedure [ 11 ] .
The remaining cells were washed with 1 mL of buffer A ( supplemented with 1 mM DTT and 1 mM PMSF ) and centrifuged .
Cells were resuspended in ( A ) 2 or ( B ) 3 mL of buffer A and lysed using a French Press Cell Disrupter ( Thermo electron corporation ) with the following parameters : 430 psi , ( A ) three or ( B ) four times .
Next , the lysat was cleared by centrifugation ( 17,000 g for 30 min at 4 C ) .
For protein samples ( input ) , 20 lL of the soluble fraction were collected and mixed with 20 lL of protein sample buffer ( 125 mM Tris-HCl pH 6.8 , 1 % SDS , 20 % glycerol , 0.02 % bromophenol blue and 100 mM DTT ) .
The remaining supernatant was subjected to affinity chromatography ( all the following steps were performed at 4 C ) .
The column was prepared by adding ( A ) 75 or ( B ) 100 lL of amylose resin ( New England Biolabs ) to Bio-Spin disposable chromatography columns ( Bio-Rad ) .
We washed the column with 3 mL of buffer A. Next , ( A ) 100 or ( B ) 200 pmol of His6-MS2-MBP protein were immobilized on the amylose resin .
Again , we washed the column with 2 mL of buffer A. Here , the supernatant was loaded onto the column , before being washed with ( A ) 5 or ( B ) 8 mL of buffer A .
This crucial step requires some adaptation and adjustment in function of cell density .
Bound RNA was eluted using 1 mL of buffer A supplemented with 15 mM maltose .
Eluted RNA was extracted with phenol-chloroform ( V/V ) and precipitated by the addition of ethanol ( 2 vol ) and 20 mg of glycogen .
At the same time , the organic phase was subjected to acetone precipitation to recover proteins .
RNA samples were then analyzed by Northern blot and protein samples by Western blot as described in Lalaouna et al. [ 1 ]
After all these steps , RNA samples ( output ) were treated with TURBOTMDNase ( 4 units ; Ambion ) for 30 min at 37 C to eliminate remaining genomic DNA .
TURBO DNase was then removed using phenol-chloroform extraction and RNA was again precipitated .
RNA quality and quantity were analyzed on Agilent Nano Chip on the bioanalyzer 2100 .
cDNA libraries were prepared with ScriptSeqTM v2 RNA-Seq Library Preparation Kit ( Illumina ) for MS2-sRNA samples and with NEBNext Small RNA Library Prep set E7330S kit for MS2-ITS , MS2-ETS and 50UTR-mRNA-MS2 .
Libraries were sequenced using Illumina MiSeq .
In the case of MS2-30ETSleuZ ( Table 1 ) , we extracted RNA from exponential ( OD600nm = 0.5 ; 100 mL ) and stationary phase ( OD600nm = 1.3 ; 100 mL ) cultures ( WT background ) .
We then followed the ( B ) procedure as described above .
GEO accession numbers of MAPS data are : MS2-30ETSleuZ ( GSE79278 ) , MS2-ITSmetZW/MS2-ITSmetWV ( GSE66517 ) , MS2-RyhB ( GSE66519 ) , MS2-RybB ( GSE66518 ) , MS2-DsrA ( GSE67605 ) and MS2 control ( GSE67606 ) .
2.5. MAPS with pathogenic and/or Gram positive strains
Recently , we adapted MAPS technology to pathogenic strains manipulation .
Interestingly , described modifications are also effective for Gram-positive strains ( data not shown ) .
Here , the medium used for cell growth will depend on the studied strain ( e.g. LB med-ium for Salmonella enterica serovar Typhimurium , brain-heart infusion ( BHI ) broth for Staphylococcus aureus ) and is supplemented by the appropriate antibiotic in function of the overexpression system used .
Generally , the extraction is performed on larger volume ( 300 mL ) .
The lysis is a crucial step and requires adaptations for safety purposes .
Here , the Precellys 24 bead beater was used to break cells instead of French press .
After washing , harvested cells were resuspended in 1 mL of buffer A .
The cell suspension was then transferred to screw cap tubes containing 200 lL of glass beads ( 0.1 mm in diameter ; BioSpec ) .
Cells were disrupted using the bead beater ( three cycles : 6500 rpm for 20 s , followed by 1 min in ice ) .
The lysate was then centrifuged at 17,000 g for 30 min at 4 C and the supernatant was apply to the affinity purification column .
Following steps are identical to those described in Section 2.3 .
For input samples , two different procedures were used depending on bacterial classification .
For Gram-negative bacteria , RNA was extracted from 1 mL of culture using hot-phenol procedure [ 11 ] .
For Gram-positive bacteria , 1 mL of culture was centrifuged and resuspended in 500 lL of lysis solution ( 0.5 % SDS , 1 mM EDTA , 20 mM sodium acetate ) and transferred into a screw cap tube containing 200 lL of glass beads and 500 lL of phenol ( pH 4 ) .
To break the cell wall , cells were vortexed using the bead beater ( three cycles : 6500 rpm for 20 s , followed by 1 min in ice ) .
The sample was then centrifuged and RNA was then precipitated as described in Section 2.3 .
2.6. Data processing
Data processing procedure is described in details in Fig. 3 .
We used bioinformatics tools freely available on Galaxy Project platform ( https://galaxyproject.org/ ) [ 13 ] .
The first workflow enables to align reads to the corresponding genome ( e.g. E. coli K12 ) and to visualize them using the UCSC Gen-ome Browser ( Fig. 3A ) .
First , we used FASTQ Groomer ( Galaxy version 1.0.4 ) to convert FASTQ files .
Second , the quality of raw sequences data was assessed using FastQC ( Galaxy version 0.52 ) .
Third , checked sequences were aligned to the corresponding genome assembly using Map with Bowtie for Illumina ( Galaxy version 1.1.2 ) .
To finish , we used Create a BedGraph of genome coverage ( Galaxy version 0.1.0 ) to obtain data files in a format compatible with UCSC Microbial GenomeBrowser ( http : / / microbes.ucsc.edu / ) .
The second workflow is useful to assign reads to gene .
All required steps are indicated in Fig. 3B .
Briefly , we compare mapped regions with gene positions ( extracted from NCBI/GenBank ) .
For this purpose , we formatted a gene bank file as indicated by the following example : chr 190 255 þ thrL
After all , read counts were normalized by coverage .
Data are presented in a tab-delimited text file which includes normalized reads and MS2-RNA/RNA ( Control ) ratio .
2.7. Primer extension
The cleavage site that releases 30ETSleuZ was determined using primer extension ( Fig. 4 ) .
This result was required to remove RNase E cleavage site from the MS2-30ETSleuZ construct and avoid the loss of the MS2 aptamer .
Briefly , 20 lg of total RNA were incubated with 0.5 pmol of 32P-radiolabelled oligonucleotide ( EM3204 , CCGAAGGTGGTTT-CACGACAC ) and 1 mM dNTPs .
After 5 min of incubation at 65 C , followed by 1 min on ice , 5X reaction buffer , 0.1 M DTT , RNase Inhibitor Murine ( 40 units , NEB ) and ProtoScript II Reverse Transcriptase ( 200 units , NEB ) were added to the reaction .
Reverse transcription was performed for 60 min at 42 C before the enzyme was inactivated at 90 C for 10 min .
Samples were then precipitated and migrated on a denaturing 8 % polyacrylamide gel .
As a control , we performed the same experiment without reverse transcriptase .
The sequencing ladder was obtained with a DNA template ( PCR with oligonucleotides EM3205 ( CTCCGGGTACCATGG-GAAAG ) and EM3206 ( CCTATCTTACATGCCGGTCCG ) ) and the same radiolabelled primer ( EM3204 ) .
Here , we added ddNTP to stop the reaction performed by the Vent DNA polymerase ( 2 units , NEB ) as described below .
For the G lane , we used 0.36 mM ddGTP , 0.037 mM dGTP , 0.03 mM dATP , 0.1 mM dCTP and 0.1 mM dTTP .
For the A lane , we used 0.9 mM ddATP , 0.03 mM dATP , 0.1 mM dGTP , 0.1 mM dCTP and 0.1 mM dTTP .
For the T lane , we used 0.72 mM ddTTP , 0.033 mM dTTP , 0.1 mM dGTP , 0.03 mM dATP and 0.1 mM dCTP .
For the C lane , we used 0.42 mM ddCTP , 0.041 mM dCTP , 0.1 mM dGTP , 0.03 mM dATP and 0.1 mM dTTP .
We also added 0.1 % Triton to the reaction .
Then , we performed the following PCR steps : 1 min at 95 C , 1 min at 52 C and 1 min at 72 C ( 25 cycles ) .
The reaction was stopped by addition of formamide loading dye and migrated next to primer extension samples
3.1. Evolution of MAPS technology
Over the years , we modified and improved key parameters of the MS2-affinity purification [ 1,3,9,14 ] .
We exhaustively expose these modifications in Section 2 .
To perform MAPS with higher cellular concentration ( exponential and stationary phases or stationary phase only ) , we modified the lysis step and increased the loading capacity of the column .
We also adapted MAPS procedure to pathogenic strains manipulation by modifying cell breakage .
We demonstrated that the same protocol can be used for Gram-positive bacteria .
The adjustment of MAPS technology to pathogenic and/or Gram-positive bacteria will certainly facilitate the targets identification in strains remarkably difficult to genetically manipulate .
Deep characterization of the interactome of sRNA involved in virulence in pathogenic strains will also bring to light targets with a therapeutic potential .
To study sRNA interacting with a 50UTR-mRNA or a tRF , we also used another library preparation kit to specifically enrich small RNA fragments .
This approach has already been successfully used with both MS2-ITSmetZW/MS2-ITSmetWV [ 9 ] and MS2-30ETSleuZ ( Fig. 4A ) .
Thus , we confirmed results previously obtained with MS2-RyhB and MS2-RybB [ 1 ] and hence the efficiency of MAPS technology to determine binding partners of tRNA-derived RNA fragments .
In this case , one of the major limitations is the presence of enzymatic cleavage sites that naturally enable the maturation of pre-tRNAs and , as a consequence , the release of tRFs .
To counter this problem and avoid loss of the MS2 aptamer , we added an additional step : we first identify the +1 of each tRF using primer extension ( see Section 2 ) .
Next , we fuse the MS2 aptamer to the determined 50-end .
This procedure has been validated with 30ETSleuZ ( Fig. 4B ) .
Here , the cleavage site occurs 15 nt after the 30CCA end of leuZ gene , releasing a 53 nt-long RNA fragment ( Fig. 4C ) .
MAPS technology was performed with pBAD-MS2 and pNM12 ( Ctrl ) .
Cells were harvested in exponential ( OD600nm = 0.5 ; 100 mL ) and in stationary phase ( OD600nm = 1 ; 100 mL ) ( WT background ) .
After that , the ( B ) procedure described in Section 2 was followed .
The 50 most enriched genes are represented ( with more than 50 reads ) .
The GEO accession number is GSE67606 .
3.2. Removal of false positives
As previously mentioned in the Introduction , a lot of putative targets were revealed by MAPS but not verified [ 1,3 ] .
To facilitate the analysis , we performed MAPS using MS2 aptamer only as bait to discriminate the subpopulation of genes that interact with the MS2 tag rather than the gene of interest [ 3 ] .
The Top 50 most enriched genes are represented in Table 2 .
The complete list is also available ( GEO accession number GSE67606 ) .
This list is useful to eliminate false positives and reduce the number of candidates .
For example , ompW gene is often co-purified with MS2-sRNA without being regulated by them ( data not shown ) .
3.3. Perspectives
In bacteria , the MS2-affinity purification was successfully used to co-purify the RNA chaperone protein Hfq with sRNAs , 50UTR-mRNA and tRFs [ 1,3,15 ] .
Others proteins are known to be involved in sRNA-mediated regulation like various ribonucleases [ 16 ] .
Due to transitory interaction , MS2-pulldown assay failed to pick up RNase E with a sRNA : mRNA complex reproducibly ( data not shown ) .
To improve RNase recovery , an additional step of UV crosslink should be performed before cell lysis .
Especially , application of the UV crosslink to MS2-affinity purification coupled with mass spectrometry should enable the identification of still unknown ribonucleases as in the case of rbsD mRNA [ 3 ] or the characterization of sRNA sequestering proteins by mimicking their recognition site .
Recently , McaS sRNA , first described as regulator of multiple mRNAs through base pairing , was also shown to directly interact with the pleiotropic regulatory protein CsrA and alleviate its activity [ 17 ] .
Hence , we can easily assume that other well-characterized sRNAs could have a dual function .
Certainly , UV crosslink MS2-affinity purification coupled with mass spectrometry will be complementary to recently published method where authors opted for the opposite approach by using protein as bait [ 18 -- 20 ] .
Recently , Wade 's group adapted ribosome profiling technology to the identification of sRNA targets [ 21 ] .
Using RNA sequencing , they compared ribosome-protected mRNA fragments profiles in presence or absence of sRNA .
Thus , ribosome profiling could be performed in parallel to MAPS .
The combination of these two methods will represent a perfect screen for sRNA targetome characterization .
Finally , in Desnoyers and Masse [ 14 ] we demonstrated that MS2-affinity purification can be used to perform co-variational mutagenesis instead of using classical experiments ( e.g. in vitro probing or b-galactosidase ) to prove direct interaction .
Mutations were introduced at either the sRNA pairing site or the Hfq pairing site in the mRNA sequence , which allowed us to characterize the Hfq-mRNA ribonucleoprotein complex .
During the last few years , we performed MAPS with various kind of RNA : sRNA [ 1,3 ] , 50UTR-mRNA [ 3 ] and tRFs [ 1,9 ] .
We overcame technical limitations of MAPS to apply it to all kind of RNA : RNA interaction regardless of the studied organism .
Especially , extension of MAPS technology to pathogenic strains will help to explore the targetome of sRNAs involved in bacterial virulence and , therefore , increase the reservoir of potential molecules for antibacterial design .
Moreover , the combination of MAPS with other high-throughput technologies will pave the way for an easier sRNA functional screening .
This work has been supported by an operating grant from the Canadian Institutes of Health Research ( CIHR ) to EM .