useful.out 42.6 KB
29091192	 Objectives: Polymyxins remain one of the last-resort drugs to treat infections caused by MDR Gram-negative pathogens. Here, we determined the mechanisms by which chromosomally encoded resistance to colistin and polymyxin B can arise in the MDR uropathogenic Escherichia coli ST131 reference strain EC958. Methods: Two complementary approaches, saturated transposon mutagenesis and spontaneous mutation induction with high concentrations of colistin and polymyxin B, were employed to select for mutations associated with resistance to polymyxins. Mutants were identified using transposon-directed insertion-site sequencing or Illumina WGS. A resistance phenotype was confirmed by MIC and further investigated using RT-PCR. Competitive growth assays were used to measure fitness cost. Results: A transposon insertion at nucleotide 41 of the pmrB gene (EC958pmrB41-Tn5) enhanced its transcript level, resulting in a 64- and 32-fold increased MIC of colistin and polymyxin B, respectively. Three spontaneous mutations, also located within the pmrB gene, conferred resistance to both colistin and polymyxin B with a corresponding increase in transcription of the pmrCAB genes. All three mutations incurred a fitness cost in the absence of colistin and polymyxin B. Conclusions: This study identified the pmrB gene as the main chromosomal target for induction of colistin and polymyxin B resistance in E. coli. 
29066548	 Uropathogenic Escherichia coli (UPEC) is a major cause of urinary tract and bloodstream infections and possesses an array of virulence factors for colonization, survival, and persistence. One such factor is the polysaccharide K  capsule. Among the different K capsule types, the K1 serotype is strongly associated with UPEC infection. In this study, we completely sequenced the K1 UPEC urosepsis strain PA45B and employed a novel combination of a lytic K1 capsule-specific phage, saturated Tn5 transposon mutagenesis, and high-throughput transposon-directed insertion site sequencing (TraDIS) to identify the complement of genes required for capsule production. Our analysis identified known genes involved in capsule biosynthesis, as well as two additional regulatory genes (mprA and lrhA) that we characterized at the molecular level. Mutation of mprA resulted in protection against K1 phage-mediated killing, a phenotype restored by complementation. We also identified a significantly increased unidirectional Tn5  insertion frequency upstream of the lrhA gene and showed that strong expression of LrhA induced by a constitutive Pcl promoter led to loss of capsule production. Further analysis revealed loss of MprA or overexpression of LrhA affected the transcription of capsule biosynthesis genes in PA45B and increased sensitivity to killing in whole blood. Similar phenotypes were also observed in UPEC strains UTI89 (K1) and CFT073 (K2), demonstrating that the effects were neither strain nor capsule type specific. Overall, this study defined the genome of a UPEC urosepsis isolate and identified and characterized two new regulatory factors that affect UPEC capsule production.IMPORTANCE Urinary tract infections (UTIs) are among the most common bacterial infections in humans and are primarily caused by uropathogenic Escherichia coli (UPEC). Many UPEC strains express a polysaccharide K capsule that provides protection against host innate immune factors and contributes to survival and persistence during infection. The K1 serotype is one example of a polysaccharide capsule type and is strongly associated with UPEC strains that cause UTIs, bloodstream infections, and meningitis. The number of UTIs caused by antibiotic-resistant UPEC is steadily increasing, highlighting the need to better understand factors (e.g., the capsule) that contribute to UPEC pathogenesis. This study describes the original  and novel application of lytic capsule-specific phage killing, saturated Tn5 transposon mutagenesis, and high-throughput transposon-directed insertion site sequencing to define the entire complement of genes required for capsule production in UPEC. Our comprehensive approach uncovered new genes involved in the regulation of this key virulence determinant. 
28911122	 ChIP-exo/nexus experiments rely on innovative modifications of the commonly used  ChIP-seq protocol for high resolution mapping of transcription factor binding sites. Although many aspects of the ChIP-exo data analysis are similar to those of ChIP-seq, these high throughput experiments pose a number of unique quality control and analysis challenges. We develop a novel statistical quality control pipeline and accompanying R/Bioconductor package, ChIPexoQual, to enable exploration and analysis of ChIP-exo and related experiments. ChIPexoQual evaluates a number of key issues including strand imbalance, library complexity,  and signal enrichment of data. Assessment of these features are facilitated through diagnostic plots and summary statistics computed over regions of the genome with varying levels of coverage. We evaluated our QC pipeline with both large collections of public ChIP-exo/nexus data and multiple, new ChIP-exo datasets from Escherichia coli. ChIPexoQual analysis of these datasets resulted in guidelines for using these QC metrics across a wide range of sequencing depths and provided further insights for modelling ChIP-exo data. 
28439033	 Upon oxygen limitation, the Bacillus subtilis ResE sensor kinase and its cognate  ResD response regulator play primary roles in the transcriptional activation of genes functioning in anaerobic respiration. The nitric oxide (NO)-sensitive NsrR  repressor controls transcription to support nitrate respiration. In addition, the ferric uptake repressor (Fur) can modulate transcription under anaerobic conditions. However, whether these controls are direct or indirect has been investigated only in a gene-specific manner. To gain a genomic view of anaerobic  gene regulation, we determined the genome-wide in vivo DNA binding of ResD, NsrR, and Fur transcription factors (TFs) using in situ DNase I footprinting combined with chromatin affinity precipitation sequencing (ChAP-seq; genome footprinting by high-throughput sequencing [GeF-seq]). A significant number of sites were targets of ResD and NsrR, and a majority of them were also bound by Fur. The binding of multiple TFs to overlapping targets affected each individual TF's binding, which led to combinatorial transcriptional control. ResD bound to both the promoters and the coding regions of genes under its positive control. Other genes showing enrichment of ResD at only the promoter regions are targets of direct ResD-dependent repression or antirepression. The results support previous  findings of ResD as an RNA polymerase (RNAP)-binding protein and indicated that ResD can associate with the transcription elongation complex. The data set allowed us to reexamine consensus sequence motifs of Fur, ResD, and NsrR and uncovered evidence that multiple TGW (where W is A or T) sequences surrounded by  an A- and T-rich sequence are often found at sites where all three TFs competitively bind.IMPORTANCE Bacteria encounter oxygen fluctuation in their natural environment as well as in host organisms. Hence, understanding how bacteria respond to oxygen limitation will impact environmental and human health. ResD, NsrR, and Fur control transcription under anaerobic conditions. This work using in situ DNase I footprinting uncovered the genome-wide binding profile of the three transcription factors (TFs). Binding of the TFs is often competitive or cooperative depending on the promoters and the presence of other TFs, indicating  that transcriptional regulation by multiple TFs is much more complex than we originally thought. The results from this study provide a more complete picture of anaerobic gene regulation governed by ResD, NsrR, and Fur and contribute to our further understanding of anaerobic physiology. 
28039131	 Human enteric pathogens, such as Salmonella spp. and verotoxigenic Escherichia coli, are increasingly recognized as causes of gastroenteritis outbreaks associated with the consumption of fruits and vegetables. Persistence in plants represents an important part of the life cycle of these pathogens. The identification of the full complement of Salmonella genes involved in the colonization of the model plant (tomato) was carried out using transposon insertion sequencing analysis. With this approach, 230,000 transposon insertions  were screened in tomato pericarps to identify loci with reduction in fitness, followed by validation of the screen results using competition assays of the isogenic mutants against the wild type. A comparison with studies in animals revealed a distinct plant-associated set of genes, which only partially overlaps  with the genes required to elicit disease in animals. De novo biosynthesis of amino acids was critical to persistence within tomatoes, while amino acid scavenging was prevalent in animal infections. Fitness reduction of the Salmonella amino acid synthesis mutants was generally more severe in the tomato rin mutant, which hyperaccumulates certain amino acids, suggesting that these nutrients remain unavailable to Salmonella spp. within plants. Salmonella lipopolysaccharide (LPS) was required for persistence in both animals and plants, exemplifying some shared pathogenesis-related mechanisms in animal and plant hosts. Similarly to phytopathogens, Salmonella spp. required biosynthesis of amino acids, LPS, and nucleotides to colonize tomatoes. Overall, however, it appears that while Salmonella shares some strategies with phytopathogens and taps into its animal virulence-related functions, colonization of tomatoes represents  a distinct strategy, highlighting this pathogen's flexible metabolism.IMPORTANCE  Outbreaks of gastroenteritis caused by human pathogens have been increasingly associated with foods of plant origin, with tomatoes being one of the common culprits. Recent studies also suggest that these human pathogens can use plants as alternate hosts as a part of their life cycle. While dual (animal/plant) lifestyles of other members of the Enterobacteriaceae family are well known, the  strategies with which Salmonella colonizes plants are only partially understood.  Therefore, we undertook a high-throughput characterization of the functions required for Salmonella persistence within tomatoes. The results of this study were compared with what is known about genes required for Salmonella virulence in animals and interactions of plant pathogens with their hosts to determine whether Salmonella repurposes its virulence repertoire inside plants or whether it behaves more as a phytopathogen during plant colonization. Even though Salmonella utilized some of its virulence-related genes in tomatoes, plant colonization required a distinct set of functions. 
27492287	 DNA of viral origin represents a ubiquitous element of bacterial genomes. Its integration into host regulatory circuits is a pivotal driver of microbial evolution but requires the stringent regulation of phage gene activity. In this study, we describe the nucleoid-associated protein CgpS, which represents an essential protein functioning as a xenogeneic silencer in the Gram-positive Corynebacterium glutamicum CgpS is encoded by the cryptic prophage CGP3 of the C. glutamicum strain ATCC 13032 and was first identified by DNA affinity chromatography using an early phage promoter of CGP3. Genome-wide profiling of CgpS binding using chromatin affinity purification and sequencing (ChAP-Seq) revealed its association with AT-rich DNA elements, including the entire CGP3 prophage region (187 kbp), as well as several other elements acquired by horizontal gene transfer. Countersilencing of CgpS resulted in a significantly increased induction frequency of the CGP3 prophage. In contrast, a strain lacking the CGP3 prophage was not affected and displayed stable growth. In a bioinformatics approach, cgpS orthologs were identified primarily in actinobacterial genomes as well as several phage and prophage genomes. Sequence analysis of 618 orthologous proteins revealed a strong conservation of the secondary structure, supporting an ancient function of these xenogeneic silencers in phage-host interaction. 
26862720	 The DNA adenine methyltransferase identification (DamID) assay is a powerful method to detect protein-DNA interactions both locally and genome-wide. It is an  alternative approach to chromatin immunoprecipitation (ChIP). An expressed fusion protein consisting of the protein of interest and the E. coli DNA adenine methyltransferase can methylate the adenine base in GATC motifs near the sites of protein-DNA interactions. Adenine-methylated DNA fragments can then be specifically amplified and detected. The original DamID assay detects the genomic locations of methylated DNA fragments by hybridization to DNA microarrays, which  is limited by the availability of microarrays and the density of predetermined probes. In this paper, we report the detailed protocol of integrating high throughput DNA sequencing into DamID (DamID-seq). The large number of short reads generated from DamID-seq enables detecting and localizing protein-DNA interactions genome-wide with high precision and sensitivity. We have used the DamID-seq assay to study genome-nuclear lamina (NL) interactions in mammalian cells, and have noticed that DamID-seq provides a high resolution and a wide dynamic range in detecting genome-NL interactions. The DamID-seq approach enables probing NL associations within gene structures and allows comparing genome-NL interaction maps with other functional genomic data, such as ChIP-seq and RNA-seq. 
26537891	 BACKGROUND: FNR homologues constitute an important class of transcription factors that control a wide range of anaerobic physiological functions in a number of bacterial species. Since FNR homologues are some of the most pervasive transcription factors, an understanding of their involvement in regulating anaerobic gene expression in different species sheds light on evolutionary similarity and differences. To address this question, we used a combination of high throughput RNA-Seq and ChIP-Seq analysis to define the extent of the FnrL regulon in Rhodobacter capsulatus and related our results to that of FnrL in Rhodobacter sphaeroides and FNR in Escherichia coli. RESULTS: Our RNA-seq results show that FnrL affects the expression of 807 genes,  which accounts for over 20 % of the Rba. capsulatus genome. ChIP-seq results indicate that 42 of these genes are directly regulated by FnrL. Importantly, this includes genes involved in the synthesis of the anoxygenic photosystem. Similarly, FnrL in Rba. sphaeroides affects 24 % of its genome, however, only 171 genes are differentially expressed in common between two Rhodobacter species, suggesting significant divergence in regulation. CONCLUSIONS: We show that FnrL in Rba. capsulatus activates photosynthesis while  in Rba. sphaeroides FnrL regulation reported to involve repression of the photosystem. This analysis highlights important differences in transcriptional control of photosynthetic events and other metabolic processes controlled by FnrL orthologues in closely related Rhodobacter species. Furthermore, we also show that the E. coli FNR regulon has limited transcriptional overlap with the FnrL regulons from either Rhodobacter species. 
26483520	 An ability to sense and respond to changes in extracellular phosphate is critical for the survival of most bacteria. For Caulobacter crescentus, which typically lives in phosphate-limited environments, this process is especially crucial. Like many bacteria, Caulobacter responds to phosphate limitation through a conserved two-component signaling pathway called PhoR-PhoB, but the direct regulon of PhoB  in this organism is unknown. Here we used chromatin immunoprecipitation-DNA sequencing (ChIP-Seq) to map the global binding patterns of the phosphate-responsive transcriptional regulator PhoB under phosphate-limited and -replete conditions. Combined with genome-wide expression profiling, our work demonstrates that PhoB is induced to regulate nearly 50 genes under phosphate-starved conditions. The PhoB regulon is comprised primarily of genes known or predicted to help Caulobacter scavenge for and import inorganic phosphate, including 15 different membrane transporters. We also investigated the regulatory role of PhoU, a widely conserved protein proposed to coordinate phosphate import with expression of the PhoB regulon by directly modulating the histidine kinase PhoR. However, our studies show that it likely does not play such a role in Caulobacter, as PhoU depletion has no significant effect on PhoB-dependent gene expression. Instead, cells lacking PhoU exhibit striking accumulation of large polyphosphate granules, suggesting that PhoU participates in controlling intracellular phosphate metabolism.IMPORTANCE: The transcription factor PhoB is widely conserved throughout the bacterial kingdom, where it helps  organisms respond to phosphate limitation by driving the expression of a battery  of genes. Most of what is known about PhoB and its target genes is derived from studies of Escherichia coli. Our work documents the PhoB regulon in Caulobacter crescentus, and comparison to the regulon in E. coli reveals significant differences, highlighting the evolutionary plasticity of transcriptional responses driven by highly conserved transcription factors. We also demonstrated  that the conserved protein PhoU, which is implicated in bacterial persistence, does not regulate PhoB activity, as previously suggested. Instead, our results favor a model in which PhoU affects intracellular phosphate accumulation, possibly through the high-affinity phosphate transporter. 
28348816	 Uropathogenic Escherchia coli (UPEC) is the causative agent of urinary tract infections. Nitric oxide (NO) is a toxic water-soluble gas that is encountered by UPEC in the urinary tract. Therefore, UPEC probably requires mechanisms to detoxify NO in the host environment. Thus far, flavohaemoglobin (Hmp), an NO denitrosylase, is the only demonstrated NO detoxification system in UPEC. Here we show that, in E. coli strain CFT073, the NADH-dependent NO reductase flavorubredoxin (FlRd) also plays a major role in NO scavenging. We generated a mutant that lacks all known and candidate NO detoxification pathways (Hmp, FlRd and the respiratory nitrite reductase, NrfA). When grown and assayed anaerobically, this mutant expresses an NO-inducible NO scavenging activity, pointing to the existence of a novel detoxification mechanism. Expression of this activity is inducible by both NO and nitrate, and the enzyme is membrane-associated. Genome-wide transcriptional profiling of UPEC grown under anaerobic conditions in the presence of nitrate (as a source of NO) highlighted various aspects of the response of the pathogen to nitrate and NO. Several virulence-associated genes are upregulated, suggesting that host-derived NO is a  potential regulator of UPEC virulence. Chromatin immunoprecipitation and sequencing was used to evaluate the NsrR regulon in CFT073. We identified 49 NsrR binding sites in promoter regions in the CFT073 genome, 29 of which were not previously identified in E. coli K-12. NsrR may regulate some CFT073 genes that do not have homologues in E. coli K-12. 
26389830	 The alternative sigma factor σE functions to maintain bacterial homeostasis and membrane integrity in response to extracytoplasmic stress by regulating thousands of genes both directly and indirectly. The transcriptional regulatory network governed by σE in Salmonella and E. coli has been examined using microarray, however a genome-wide analysis of σE-binding sites in Salmonella has not yet been reported. We infected macrophages with Salmonella Typhimurium over a select time  course. Using chromatin immunoprecipitation followed by high-throughput DNA sequencing (ChIP-seq), 31 σE-binding sites were identified. Seventeen sites were  new, which included outer membrane proteins, a quorum-sensing protein, a cell division factor, and a signal transduction modulator. The consensus sequence identified for σE in vivo binding was similar to the one previously reported, except for a conserved G and A between the -35 and -10 regions. One third of the  σE-binding sites did not contain the consensus sequence, suggesting there may be  alternative mechanisms by which σE modulates transcription. By dissecting direct  and indirect modes of σE-mediated regulation, we found that σE activates gene expression through recognition of both canonical and reversed consensus sequence. New σE regulated genes (greA, luxS, ompA and ompX) are shown to be involved in heat shock and oxidative stress responses. 
26070154	 In bacteria the concurrence of DNA replication and transcription leads to potentially deleterious encounters between the two machineries, which can occur in either the head-on (lagging strand genes) or co-directional (leading strand genes) orientations. These conflicts lead to replication fork stalling and can destabilize the genome. Both eukaryotic and prokaryotic cells possess resolution  factors that reduce the severity of these encounters. Though Escherichia coli accessory helicases have been implicated in the mitigation of head-on conflicts,  direct evidence of these proteins mitigating co-directional conflicts is lacking. Furthermore, the endogenous chromosomal regions where these helicases act, and the mechanism of recruitment, have not been identified. We show that the essential Bacillus subtilis accessory helicase PcrA aids replication progression  through protein coding genes of both head-on and co-directional orientations, as  well as rRNA and tRNA genes. ChIP-Seq experiments show that co-directional conflicts at highly transcribed rRNA, tRNA, and head-on protein coding genes are  major targets of PcrA activity on the chromosome. Partial depletion of PcrA renders cells extremely sensitive to head-on conflicts, linking the essential function of PcrA to conflict resolution. Furthermore, ablating PcrA's ATPase/helicase activity simultaneously increases its association with conflict regions, while incapacitating its ability to mitigate conflicts, and leads to cell death. In contrast, disruption of PcrA's C-terminal RNA polymerase interaction domain does not impact its ability to mitigate conflicts between replication and transcription, its association with conflict regions, or cell survival. Altogether, this work establishes PcrA as an essential factor involved  in mitigating transcription-replication conflicts and identifies chromosomal regions where it routinely acts. As both conflicts and accessory helicases are found in all domains of life, these results are broadly relevant. 
25089258	 CarD is an essential mycobacterial protein that binds the RNA polymerase (RNAP) and affects the transcriptional profile of Mycobacterium smegmatis and Mycobacterium tuberculosis (6). We predicted that CarD was directly regulating RNAP function but our prior experiments had not determined at what stage of transcription CarD was functioning and at which genes CarD interacted with the RNAP. To begin to address these open questions, we performed Chromatin Immunoprecipitation sequencing (ChIP-seq) to survey the distribution of CarD throughout the M. smegmatis chromosome. The distribution of RNAP subunits β and σA were also profiled. We expected that RNAP β would be present throughout transcribed regions and RNAP σA would be predominantly enriched at promoters based on work in Escherichia coli (3), however this had yet to be determined in mycobacteria. The ChIP-seq analyses revealed that CarD was never present on the genome in the absence of RNAP, was primarily associated with promoter regions, and was highly correlated with the distribution of RNAP σA. The colocalization of σA and CarD led us to propose that in vivo, CarD associates with RNAP initiation  complexes at most promoters and is therefore a global regulator of transcription  initiation. Here we describe in detail the data from the ChIP-seq experiments associated with the study published by Srivastava and colleagues in the Proceedings of the National Academy of Science in 2013 (5) as well as discuss the findings from this dataset in relation to both CarD and mycobacterial transcription as a whole. The ChIP-seq data have been deposited in the Gene Expression Omnibus (GEO) database, www.ncbi.nlm.nih.gov/geo (accession no. GSE48164). 
24650566	 Inferring gene regulatory networks from gene expression data at whole genome level is still an arduous challenge, especially in higher organisms where the number of genes is large but the number of experimental samples is small. It is reported that the accuracy of current methods at genome scale significantly drops from Escherichia coli to Saccharomyces cerevisiae due to the increase in number of genes. This limits the applicability of current methods to more complex genomes, like human and mouse. Least absolute shrinkage and selection operator (LASSO) is widely used for gene regulatory network inference from gene expression profiles. However, the accuracy of LASSO on large genomes is not satisfactory. In this study, we apply two extended models of LASSO, L0 and L1/2 regularization models to infer gene regulatory network from both high-throughput gene expression data and transcription factor binding data in mouse embryonic stem cells (mESCs). We find that both the L0 and L1/2 regularization models significantly outperform  LASSO in network inference. Incorporating interactions between transcription factors and their targets remarkably improved the prediction accuracy. Current study demonstrates the efficiency and applicability of these two models for gene  regulatory network inference from integrative omics data in large genomes. The applications of the two models will facilitate biologists to study the gene regulation of higher model organisms in a genome-wide scale. 
24565265	 BACKGROUND: Chromatin immunoprecipitation (ChIP) experiments are now the most comprehensive experimental approaches for mapping the binding of transcription factors (TFs) to their target genes. However, ChIP data alone is insufficient for identifying functional binding target genes of TFs for two reasons. First, there  is an inherent high false positive/negative rate in ChIP-chip or ChIP-seq experiments. Second, binding signals in the ChIP data do not necessarily imply functionality. METHODS: It is known that ChIP-chip data and TF knockout (TFKO) data reveal complementary information on gene regulation. While ChIP-chip data can provide TF-gene binding pairs, TFKO data can provide TF-gene regulation pairs. Therefore, we propose a novel network approach for identifying functional TF-gene binding pairs by integrating the ChIP-chip data with the TFKO data. In our method, a TF-gene binding pair from the ChIP-chip data is regarded to be functional if it also has high confident curated TFKO TF-gene regulatory relation or deduced hypostatic TF-gene regulatory relation. RESULTS AND CONCLUSIONS: We first validated our method on a gathered ground truth set. Then we applied our method to the ChIP-chip data to identify functional TF-gene binding pairs. The biological significance of our identified functional TF-gene binding pairs was shown by assessing their functional enrichment, the prevalence of protein-protein interaction, and expression coherence. Our results  outperformed the results of three existing methods across all measures. And our identified functional targets of TFs also showed statistical significance over the randomly assigned TF-gene pairs. We also showed that our method is dataset independent and can apply to ChIP-seq data and the E. coli genome. Finally, we provided an example showing the biological applicability of our notion. 
24053571	 BACKGROUND: Studies of protein association with DNA on a genome wide scale are possible through methods like ChIP-Chip or ChIP-Seq. Massive problems with false  positive signals in our own experiments motivated us to revise the standard ChIP-Chip protocol. Analysis of chromosome wide binding of the alternative sigma  factor σ³² in Escherichia coli with this new protocol resulted in detection of only a subset of binding sites found in a previous study by Wade and colleagues.  We suggested that the remainder of binding sites detected in the previous study are likely to be false positives. In a recent article the Wade group claimed that our conclusion is wrong and that the disputed sites are genuine σ³² binding sites. They further claimed that the non-detection of these sites in our study was due to low data quality. RESULTS/DISCUSSION: We respond to the criticism of Wade and colleagues and discuss some general questions of ChIP-based studies. We outline why the quality  of our data is sufficient to derive meaningful results. Specific points are: (i)  the modifications we introduced into the standard ChIP-Chip protocol do not necessarily result in a low dynamic range, (ii) correlation between ChIP-Chip replicates should not be calculated based on the whole data set as done in transcript analysis, (iii) control experiments are essential for identifying false positives. Suggestions are made how ChIP-based methods could be further optimized and which alternative approaches can be used to strengthen conclusions. CONCLUSION: We appreciate the ongoing discussion about the ChIP-Chip method and hope that it helps other scientist to analyze and interpret their results. The modifications we introduced into the ChIP-Chip protocol are a first step towards  reducing false positive signals but there is certainly potential for further optimization. The discussion about the σ³² binding sites in question highlights the need for alternative approaches and further investigation of appropriate methods for verification. 
23717649	 Fis, one of the most important nucleoid-associated proteins, functions as a global regulator of transcription in bacteria that has been comprehensively studied in Escherichia coli K12. Fis also influences the virulence of Salmonella  enterica and pathogenic E. coli by regulating their virulence genes, however, the relevant mechanism is unclear. In this report, using combined RNA-seq and chromatin immunoprecipitation (ChIP)-seq technologies, we first identified 1646 Fis-regulated genes and 885 Fis-binding targets in the S. enterica serovar Typhimurium, and found a Fis regulon different from that in E. coli. Fis has been reported to contribute to the invasion ability of S. enterica. By using cell infection assays, we found it also enhances the intracellular replication ability of S. enterica within macrophage cell, which is of central importance for the pathogenesis of infections. Salmonella pathogenicity islands (SPI)-1 and SPI-2 are crucial for the invasion and survival of S. enterica in host cells. Using mutation and overexpression experiments, real-time PCR analysis, and electrophoretic mobility shift assays, we demonstrated that Fis regulates 63 of the 94 Salmonella pathogenicity island (SPI)-1 and SPI-2 genes, by three regulatory modes: i) binds to SPI regulators in the gene body or in upstream regions; ii) binds to SPI genes directly to mediate transcriptional activation of themselves and downstream genes; iii) binds to gene encoding OmpR which affects SPI gene expression by controlling SPI regulators SsrA and HilD. Our results provide new insights into the impact of Fis on SPI genes and the pathogenicity of S. enterica. 
23580539	 Accurate identification of the DNA-binding sites of transcription factors and other DNA-binding proteins on the genome is crucial to understanding their molecular interactions with DNA. Here, we describe a new method: Genome Footprinting by high-throughput sequencing (GeF-seq), which combines in vivo DNase I digestion of genomic DNA with ChIP coupled with high-throughput sequencing. We have determined the in vivo binding sites of a Bacillus subtilis global regulator, AbrB, using GeF-seq. This method shows that exact DNA-binding sequences, which were protected from in vivo DNase I digestion, were resolved at  a comparable resolution to that achieved by in vitro DNase I footprinting, and this was simply attained without the necessity of prediction by peak-calling programs. Moreover, DNase I digestion of the bacterial nucleoid resolved the closely positioned AbrB-binding sites, which had previously appeared as one peak  in ChAP-chip and ChAP-seq experiments. The high-resolution determination of AbrB-binding sites using GeF-seq enabled us to identify bipartite TGGNA motifs in 96% of the AbrB-binding sites. Interestingly, in a thousand binding sites with very low-binding intensities, single TGGNA motifs were also identified. Thus, GeF-seq is a powerful method to elucidate the molecular mechanism of target protein binding to its cognate DNA sequences. 
23511241	 As the first, and usually rate-limiting, step of transcription initiation, bacterial RNA polymerase (RNAP) binds to double stranded DNA (dsDNA) and subsequently opens the two strands of DNA (the open complex formation). The rate  determining step in the open complex formation is opening of a short (6 bp) DNA called the -10 region, which interacts with RNAP in both dsDNA and single stranded (ssDNA) forms. Accordingly, formation of the open complex depends on (physically independent) domains of RNAP that interact with ssDNA and dsDNA, as well as on parameters of DNA melting and sequences of -10 regions. We here aim to understand how these different interactions are mutually related to ensure efficient open complex formation. To achieve this, we use a recently developed biophysical model of transcription initiation, which allows the calculation of the kinetic parameters of transcription initiation on the scale of whole genome.  We consequently investigate kinetic properties of sequences derived from all E. coli intergenic regions, and from more than 300 experimentally confirmed E. coli  σ(70) promoters. We find that interaction specificities of σ(70) DNA binding domains reduce the number of sequences where RNAP binds strongly, but forms the open complex too slowly to achieve functional transcription (so-called poised promoters). However, we find that, despite this reduction, there is still a significant number of such poised promoters in the intergenic regions, which may  provide a major source of false positives in genome-wide searches of transcription start sites. Furthermore, we surprisingly find that sequences of -10 regions of the functional promoters increase the extent of RNAP poising, which we interpret in terms of an extension of a recently proposed model of promoter recognition ('mix-and-match model') to kinetic parameters. Overall, our  results allow better understanding of the design of σ(70) DNA binding domains and promoter sequences, and place a fundamental limit on accuracy of methods for promoter detection that are based on strong RNAP binding (e.g. ChIP-chip). 
23470992	 Salmonella Typhi and Typhimurium diverged only ∼50 000 years ago, yet have very different host ranges and pathogenicity. Despite the availability of multiple whole-genome sequences, the genetic differences that have driven these changes in phenotype are only beginning to be understood. In this study, we use transposon-directed insertion-site sequencing to probe differences in gene requirements for competitive growth in rich media between these two closely related serovars. We identify a conserved core of 281 genes that are required for growth in both serovars, 228 of which are essential in Escherichia coli. We are able to identify active prophage elements through the requirement for their repressors. We also find distinct differences in requirements for genes involved  in cell surface structure biogenesis and iron utilization. Finally, we demonstrate that transposon-directed insertion-site sequencing is not only applicable to the protein-coding content of the cell but also has sufficient resolution to generate hypotheses regarding the functions of non-coding RNAs (ncRNAs) as well. We are able to assign probable functions to a number of cis-regulatory ncRNA elements, as well as to infer likely differences in trans-acting ncRNA regulatory networks. 
23275538	 Nanobodies® are single-domain antibody fragments derived from camelid heavy-chain antibodies. Because of their small size, straightforward production in Escherichia coli, easy tailoring, high affinity, specificity, stability and solubility, nanobodies® have been exploited in various biotechnological applications. A major challenge in the post-genomics and post-proteomics era is the identification of regulatory networks involving nucleic acid-protein and protein-protein interactions. Here, we apply a nanobody® in chromatin immunoprecipitation followed by DNA microarray hybridization (ChIP-chip) for genome-wide identification of DNA-protein interactions. The Lrp-like regulator Ss-LrpB, arguably one of the best-studied specific transcription factors of the hyperthermophilic archaeon Sulfolobus solfataricus, was chosen for this proof-of-principle nanobody®-assisted ChIP. Three distinct Ss-LrpB-specific nanobodies®, each interacting with a different epitope, were generated for ChIP.  Genome-wide ChIP-chip with one of these nanobodies® identified the well-established Ss-LrpB binding sites and revealed several unknown target sequences. Furthermore, these ChIP-chip profiles revealed auxiliary operator sites in the open reading frame of Ss-lrpB. Our work introduces nanobodies® as a  novel class of affinity reagents for ChIP. Taking into account the unique characteristics of nanobodies®, in particular, their short generation time, nanobody®-based ChIP is expected to further streamline ChIP-chip and ChIP-Seq experiments, especially in organisms with no (or limited) possibility of genetic  manipulation. 
23232715	 Cyclic AMP receptor protein (Crp) is a transcription regulator controlling diverse cellular processes in many bacteria. In Streptomyces coelicolor, it is well established that Crp plays a critical role in spore germination and colony development. Here, we demonstrate that Crp is a key regulator of secondary metabolism and antibiotic production in S. coelicolor and show that it may additionally coordinate precursor flux from primary to secondary metabolism. We found that crp deletion adversely affected the synthesis of three well-characterized antibiotics in S. coelicolor: actinorhodin (Act), undecylprodigiosin (Red), and calcium-dependent antibiotic (CDA). Using chromatin immunoprecipitation-microarray (ChIP-chip) assays, we determined that eight (out  of 22) secondary metabolic clusters encoded by S. coelicolor contained Crp-associated sites. We followed the effect of Crp induction using transcription profiling analyses and found secondary metabolic genes to be significantly affected: included in this Crp-dependent group were genes from six of the clusters identified in the ChIP-chip experiments. Overexpressing Crp in a panel of Streptomyces species led to enhanced antibiotic synthesis and new metabolite production, suggesting that Crp control over secondary metabolism is broadly conserved in the streptomycetes and that Crp overexpression could serve as a powerful tool for unlocking the chemical potential of these organisms. IMPORTANCE Streptomyces produces a remarkably diverse array of secondary metabolites, including many antibiotics. In recent years, genome sequencing has revealed that  these products represent only a small proportion of the total secondary metabolite potential of Streptomyces. There is, therefore, considerable interest  in discovering ways to stimulate the production of new metabolites. Here, we show that Crp (the classical regulator of carbon catabolite repression in Escherichia  coli) is a master regulator of secondary metabolism in Streptomyces. It binds to  eight of 22 secondary metabolic gene clusters in the Streptomyces coelicolor genome and directly affects the expression of six of these. Deletion of crp in S. coelicolor leads to dramatic reductions in antibiotic levels, while Crp overexpression enhances antibiotic production. We find that the antibiotic-stimulatory capacity of Crp extends to other streptomycetes, where its overexpression activates the production of "cryptic" metabolites that are not otherwise seen in the corresponding wild-type strain. 
22207717	 Gene expression is tightly regulated by transcription factors and cofactors that  function by directly or indirectly interacting with DNA of the genome. Understanding how and where these proteins bind provides essential information to uncover genetic regulatory mechanisms. We have developed a new method to study DNA-protein interaction in vivo called DNA adenine methyltransferase (Dam)IP, which is based on fusing a protein of interest to a mutant form of Dam from Escherichia coli. We showed previously that DamIP can efficiently identify in vivo binding sites of Dam-tethered human estrogen receptor (hER)α. In current study, we present the cistrome of hERα determined by DamIP and high throughput sequencing (DamIP-seq). The DamIP-seq-defined hERα cistrome identifies many new binding regions and overlaps with those determined by chromatin immunoprecipitation (ChIP)-chip or ChIP-seq. Elements uniquely identified by DamIP-seq include a unique class of elements that show low, but persistent, hERα  binding when reexamined by conventional ChIP. In contrast, DamIP-seq fails to detect some elements with very transient hERα binding. The methyl-adenine modifications introduced by Dam are stable and do not decrease over 12 d. In summary, the current study provides both an alternate view of the hERα cistrome to further understand the mechanism of hERα-mediated transcription and a new tool to explore other transcriptional factors and cofactors that is very different from conventional ChIP. 
19843227	 StpA is a paralogue of the nucleoid-associated protein H-NS that is conserved in  a range of enteric bacteria and had no known function in Salmonella Typhimurium.  We show that 5% of the Salmonella genome is regulated by StpA, which contrasts with the situation in Escherichia coli where deletion of stpA only had minor effects on gene expression. The StpA-dependent genes of S. Typhimurium are a specific subset of the H-NS regulon that are predominantly under the positive control of sigma(38) (RpoS), CRP-cAMP and PhoP. Regulation by StpA varied with growth phase; StpA controlled sigma(38) levels at mid-exponential phase by preventing inappropriate activation of sigma(38) during rapid bacterial growth. In contrast, StpA only activated the CRP-cAMP regulon during late exponential phase. ChIP-chip analysis revealed that StpA binds to PhoP-dependent genes but not to most genes of the CRP-cAMP and sigma(38) regulons. In fact, StpA indirectly regulates sigma(38)-dependent genes by enhancing sigma(38) turnover by repressing the anti-adaptor protein rssC. We discovered that StpA is essential for the dynamic regulation of sigma(38) in response to increased glucose levels.  Our findings identify StpA as a novel growth phase-specific regulator that plays  an important physiological role by linking sigma(38) levels to nutrient availability.