not_useful_abstracts.txt
140 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
1145
1146
1147
1148
1149
1150
1151
1152
1153
1154
1155
1156
1157
1158
1159
1160
1161
1162
1163
1164
1165
1166
1167
1168
1169
1170
1171
1172
1173
1174
1175
1176
1177
1178
1179
1180
1181
1182
1183
1184
1185
1186
1187
1188
1189
1190
1191
1192
1193
1194
1195
1196
1197
1198
1199
1200
1201
1202
1203
1204
1205
1206
1207
1208
1209
1210
1211
1212
1213
1214
1215
1216
1217
1218
1219
1220
1221
1222
1223
1224
1225
1226
1227
1228
1229
1230
1231
1232
1233
1234
1235
1236
1237
1238
1239
1240
1241
1242
1243
1244
1245
1246
1247
1248
1249
1250
1251
1252
1253
1254
1255
1256
1257
1258
1259
1260
1261
1262
1263
1264
1265
1266
1267
1268
1269
1270
1271
1272
1273
1274
1275
1276
1277
1278
1279
1280
1281
1282
1283
1284
1285
1286
1287
1288
1289
1290
1291
1292
1293
1294
1295
1296
1297
1298
1299
1300
1301
1302
1303
1304
1305
1306
1307
1308
1309
1310
1311
1312
1313
1314
1315
1316
1317
1318
1319
1320
1321
1322
1323
1324
1325
1326
1327
1328
1329
1330
1331
1332
1333
1334
1335
1336
1337
1338
1339
1340
1341
1342
1343
1344
1345
1346
1347
1348
1349
1350
1351
1352
1353
1354
1355
1356
1357
1358
1359
1360
1361
1362
1363
1364
1365
1366
1367
1368
1369
1370
1371
1372
1373
1374
1375
1376
1377
1378
1379
1380
1381
1382
1383
1384
1385
1386
1387
1388
1389
1390
1391
1392
1393
1394
1395
1396
1397
1398
1399
1400
1401
1402
1403
1404
1405
1406
1407
1408
1409
1410
1411
1412
1413
1414
1415
1416
1417
1418
1419
1420
1421
1422
1423
1424
1425
1426
1427
1428
1429
1430
1431
1432
1433
1434
1435
1436
1437
1438
1439
1440
1441
1442
1443
1444
1445
1446
1447
1448
1449
1450
1451
1452
1453
1454
1455
1456
1457
1458
1459
1460
1461
1462
1463
1464
1465
1466
1467
1468
1469
1470
1471
1472
1473
1474
1475
1476
1477
1478
1479
1480
1481
1482
1483
1484
1485
1486
1487
1488
1489
1490
1491
1492
1493
1494
1495
1496
1497
1498
1499
1500
1501
1502
1503
1504
1505
1506
1507
1508
1509
1510
1511
1512
1513
1514
1515
1516
1517
1518
1519
1520
1521
1522
1523
1524
1525
1526
1527
1528
1529
1530
1531
1532
1533
1534
1535
1536
1537
1538
1539
1540
1541
1542
1543
1544
1545
1546
1547
1548
1549
1550
1551
1552
1553
1554
1555
1556
1557
1558
1559
1560
1561
1562
1563
1564
1565
1566
1567
1568
1569
1570
1571
1572
1573
1574
1575
1576
1577
1578
1579
1580
1581
1582
1583
1584
1585
1586
1587
1588
1589
1590
1591
1592
1593
1594
1595
1596
1597
1598
1599
1600
1601
1602
1603
1604
1605
1606
1607
1608
1609
1610
1611
1612
1613
1614
1615
1616
1617
1618
1619
1620
1621
1622
1623
1624
1625
1626
1627
1628
1629
1630
1631
1632
1633
1634
1635
1636
1637
1638
1639
1640
1641
1642
1643
1644
1645
1646
1647
1648
1649
1650
1651
1652
1653
1654
1655
1656
1657
1658
1659
1660
1661
1662
1663
1664
1665
1666
1667
1668
1669
1670
1671
1672
1673
1674
1675
1676
1677
1678
1679
1680
1681
1682
1683
1684
1685
1686
1687
1688
1689
1690
1691
1692
1693
1694
1695
1696
1697
1698
1699
1700
1701
1702
1703
1704
1705
1706
1707
1708
1709
1710
1711
1712
1713
1714
1715
1716
1717
1718
1719
1720
1721
1722
1723
1724
1725
1726
1727
1728
1729
1730
1731
1732
1733
1734
1735
1736
1737
1738
1739
1740
1741
1742
1743
1744
1745
1746
1747
1748
1749
1750
1751
1752
1753
1754
1755
1756
1757
1758
1759
1760
1761
1762
1763
1764
1765
1766
1767
1768
1769
1770
1771
1772
1773
1774
1775
1776
1777
1778
1779
1780
1781
1782
1783
1784
1785
1786
1787
1788
1789
1790
1791
1792
1793
1794
1795
1796
1797
1798
1799
1800
1801
1802
1803
1804
1805
1806
1807
1808
1809
1810
1811
1812
1813
1814
1815
1816
1817
1818
1819
1820
1821
1822
1823
1824
1825
1826
1827
1828
1829
1830
1831
1832
1833
1834
1835
1836
1837
1838
1839
1840
1841
1842
1843
1844
1845
1846
1847
1848
1849
1850
1851
1852
1853
1854
1855
1856
1857
1858
1859
1860
1861
1862
1863
1864
1865
1866
1867
1868
1869
1870
1871
1872
1873
1874
1875
1876
1877
1878
1879
1880
1881
1882
1883
1884
1885
1886
1887
1888
1889
1890
1891
1892
1893
1894
1895
1896
1897
1898
1899
1900
1901
1902
1903
1904
1905
1906
1907
1908
1909
1910
1911
1912
1913
1914
1915
1916
1917
1918
1919
1920
1921
1922
1923
1924
1925
1926
1927
1928
1929
1930
1931
1932
1933
1934
1935
1936
1937
1938
1939
1940
1941
1942
1943
1944
1945
1946
1947
1948
1949
1950
1951
1952
1953
1954
1955
1956
1957
1958
1959
1960
1961
1962
1963
1964
1965
1966
1967
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
2027
2028
2029
2030
2031
2032
2033
2034
2035
2036
2037
2038
2039
2040
2041
2042
2043
2044
2045
2046
2047
2048
2049
2050
2051
2052
2053
2054
2055
2056
2057
2058
2059
2060
2061
2062
2063
2064
2065
2066
2067
2068
2069
2070
2071
2072
2073
2074
2075
2076
2077
2078
2079
2080
2081
2082
2083
2084
2085
2086
2087
2088
2089
2090
2091
2092
2093
2094
2095
2096
2097
2098
2099
2100
2101
2102
2103
2104
2105
2106
2107
2108
2109
2110
2111
2112
2113
2114
2115
2116
2117
2118
2119
2120
2121
2122
2123
2124
2125
2126
2127
2128
2129
2130
2131
2132
2133
2134
2135
2136
2137
2138
2139
2140
2141
2142
2143
2144
2145
2146
2147
2148
2149
2150
2151
2152
2153
2154
2155
2156
2157
2158
2159
2160
2161
2162
2163
2164
2165
2166
2167
2168
2169
2170
2171
2172
2173
2174
2175
2176
2177
2178
2179
2180
2181
2182
2183
2184
2185
2186
2187
2188
2189
2190
2191
2192
2193
2194
2195
2196
2197
2198
2199
2200
2201
2202
2203
2204
2205
2206
2207
2208
2209
2210
2211
2212
2213
2214
2215
2216
2217
2218
2219
2220
2221
2222
2223
2224
2225
2226
2227
2228
2229
2230
2231
2232
2233
2234
2235
2236
2237
2238
2239
2240
2241
2242
2243
2244
2245
2246
2247
2248
2249
2250
2251
2252
2253
2254
2255
2256
2257
2258
2259
2260
2261
2262
2263
2264
2265
2266
2267
2268
2269
2270
2271
2272
2273
2274
2275
2276
2277
2278
2279
2280
2281
2282
2283
2284
2285
2286
2287
2288
2289
2290
2291
2292
2293
2294
2295
2296
2297
2298
2299
2300
2301
2302
2303
2304
2305
2306
2307
2308
2309
2310
2311
2312
2313
2314
2315
2316
2317
2318
2319
2320
2321
2322
2323
2324
2325
2326
2327
2328
2329
2330
2331
2332
2333
2334
2335
2336
2337
2338
2339
2340
2341
2342
2343
2344
2345
2346
2347
2348
2349
2350
2351
2352
2353
2354
2355
2356
2357
2358
2359
2360
2361
2362
2363
2364
2365
2366
2367
2368
2369
2370
2371
2372
2373
2374
2375
2376
2377
2378
2379
2380
2381
2382
2383
2384
2385
2386
2387
2388
2389
2390
2391
2392
2393
2394
2395
2396
2397
2398
2399
2400
2401
2402
2403
2404
2405
2406
2407
2408
2409
2410
2411
2412
2413
2414
2415
2416
2417
2418
2419
2420
2421
2422
2423
2424
2425
2426
2427
2428
2429
2430
2431
2432
2433
2434
2435
2436
2437
2438
2439
2440
2441
2442
2443
2444
2445
2446
2447
2448
2449
2450
2451
2452
2453
2454
2455
2456
2457
2458
2459
2460
2461
2462
2463
2464
2465
2466
2467
2468
2469
2470
2471
2472
2473
2474
2475
2476
2477
2478
2479
2480
2481
2482
2483
2484
2485
2486
2487
2488
2489
2490
2491
2492
2493
2494
2495
2496
2497
2498
2499
2500
2501
2502
2503
2504
2505
2506
2507
2508
2509
2510
2511
2512
2513
2514
2515
2516
2517
2518
2519
2520
2521
2522
2523
2524
2525
2526
2527
2528
2529
2530
2531
2532
2533
2534
2535
2536
2537
2538
2539
2540
2541
2542
2543
2544
2545
2546
2547
2548
2549
2550
2551
2552
2553
2554
2555
2556
2557
2558
2559
2560
2561
2562
2563
2564
2565
2566
2567
2568
2569
2570
2571
2572
2573
2574
2575
2576
2577
2578
2579
2580
2581
2582
2583
2584
2585
2586
2587
2588
2589
2590
2591
2592
2593
2594
2595
2596
2597
2598
2599
2600
2601
2602
2603
2604
2605
2606
2607
2608
2609
2610
2611
2612
2613
2614
2615
2616
2617
2618
2619
2620
2621
1. J Bacteriol. 2018 Mar 12;200(7). pii: e00698-17. doi: 10.1128/JB.00698-17. Print
2018 Apr 1.
Genome-Wide Identification by Transposon Insertion Sequencing of Escherichia coli
K1 Genes Essential for In Vitro Growth, Gastrointestinal Colonizing Capacity, and
Survival in Serum.
McCarthy AJ(1), Stabler RA(2), Taylor PW(3).
Author information:
(1)School of Pharmacy, University College London, London, United Kingdom.
(2)London School of Hygiene and Tropical Medicine, London, United Kingdom.
(3)School of Pharmacy, University College London, London, United Kingdom
peter.taylor@ucl.ac.uk.
Escherichia coli K1 strains are major causative agents of invasive disease of
newborn infants. The age dependency of infection can be reproduced in neonatal
rats. Colonization of the small intestine following oral administration of K1
bacteria leads rapidly to invasion of the blood circulation; bacteria that avoid
capture by the mesenteric lymphatic system and evade antibacterial mechanisms in
the blood may disseminate to cause organ-specific infections such as meningitis.
Some E. coli K1 surface constituents, in particular the polysialic acid capsule,
are known to contribute to invasive potential, but a comprehensive picture of the
factors that determine the fully virulent phenotype has not emerged so far. We
constructed a library and constituent sublibraries of ∼775,000 Tn5 transposon
mutants of E. coli K1 strain A192PP and employed transposon-directed insertion
site sequencing (TraDIS) to identify genes required for fitness for infection of
2-day-old rats. Transposon insertions were lacking in 357 genes following
recovery on selective agar; these genes were considered essential for growth in
nutrient-replete medium. Colonization of the midsection of the small intestine
was facilitated by 167 E. coli K1 gene products. Restricted bacterial
translocation across epithelial barriers precluded TraDIS analysis of
gut-to-blood and blood-to-brain transits; 97 genes were required for survival in
human serum. This study revealed that a large number of bacterial genes, many of
which were not previously associated with systemic E. coli K1 infection, are
required to realize full invasive potential.IMPORTANCEEscherichia coli K1 strains
cause life-threatening infections in newborn infants. They are acquired from the
mother at birth and colonize the small intestine, from where they invade the
blood and central nervous system. It is difficult to obtain information from
acutely ill patients that sheds light on physiological and bacterial factors
determining invasive disease. Key aspects of naturally occurring age-dependent
human infection can be reproduced in neonatal rats. Here, we employ
transposon-directed insertion site sequencing to identify genes essential for the
in vitro growth of E. coli K1 and genes that contribute to the colonization of
susceptible rats. The presence of bottlenecks to invasion of the blood and
cerebrospinal compartments precluded insertion site sequencing analysis, but we
identified genes for survival in serum.
Copyright © 2018 McCarthy et al.
DOI: 10.1128/JB.00698-17
PMID: 29339415
2. J Antimicrob Chemother. 2017 Oct 1;72(10):2729-2736. doi: 10.1093/jac/dkx204.
Modifications in the pmrB gene are the primary mechanism for the development of
chromosomally encoded resistance to polymyxins in uropathogenic Escherichia coli.
Phan MD(1)(2), Nhu NTK(1)(2), Achard MES(1)(2), Forde BM(1)(2)(3), Hong KW(4),
Chong TM(4), Yin WF(4), Chan KG(4), West NP(1)(2), Walker MJ(1)(2), Paterson
DL(2)(5), Beatson SA(1)(2)(3), Schembri MA(1)(2).
Author information:
(1)School of Chemistry & Molecular Biosciences, The University of Queensland,
Brisbane, Queensland, Australia.
(2)Australian Infectious Diseases Research Centre, The University of Queensland,
Brisbane, Queensland, Australia.
(3)Australian Centre for Ecogenomics, The University of Queensland, Brisbane,
Queensland, Australia.
(4)Division of Genetics and Molecular Biology, Institute of Biological Sciences,
Faculty of Science, University of Malaya, 50603 Kuala, Lumpur, Malaysia.
(5)The University of Queensland Centre for Clinical Research, The University of
Queensland, Brisbane, Queensland, Australia.
Objectives: Polymyxins remain one of the last-resort drugs to treat infections
caused by MDR Gram-negative pathogens. Here, we determined the mechanisms by
which chromosomally encoded resistance to colistin and polymyxin B can arise in
the MDR uropathogenic Escherichia coli ST131 reference strain EC958.
Methods: Two complementary approaches, saturated transposon mutagenesis and
spontaneous mutation induction with high concentrations of colistin and polymyxin
B, were employed to select for mutations associated with resistance to
polymyxins. Mutants were identified using transposon-directed insertion-site
sequencing or Illumina WGS. A resistance phenotype was confirmed by MIC and
further investigated using RT-PCR. Competitive growth assays were used to measure
fitness cost.
Results: A transposon insertion at nucleotide 41 of the pmrB gene
(EC958pmrB41-Tn5) enhanced its transcript level, resulting in a 64- and 32-fold
increased MIC of colistin and polymyxin B, respectively. Three spontaneous
mutations, also located within the pmrB gene, conferred resistance to both
colistin and polymyxin B with a corresponding increase in transcription of the
pmrCAB genes. All three mutations incurred a fitness cost in the absence of
colistin and polymyxin B.
Conclusions: This study identified the pmrB gene as the main chromosomal target
for induction of colistin and polymyxin B resistance in E. coli.
© The Author 2017. Published by Oxford University Press on behalf of the British
Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions,
please email: journals.permissions@oup.com.
DOI: 10.1093/jac/dkx204
PMID: 29091192
3. MBio. 2017 Oct 24;8(5). pii: e01558-17. doi: 10.1128/mBio.01558-17.
Genome-Wide Discovery of Genes Required for Capsule Production by Uropathogenic
Escherichia coli.
Goh KGK(1)(2), Phan MD(1)(2), Forde BM(1)(2)(3), Chong TM(4), Yin WF(4), Chan
KG(4), Ulett GC(5), Sweet MJ(2)(6), Beatson SA(1)(2)(3), Schembri MA(7)(2).
Author information:
(1)School of Chemistry and Molecular Biosciences, University of Queensland,
Brisbane, Queensland, Australia.
(2)Australian Infectious Diseases Research Centre, University of Queensland,
Brisbane, Queensland, Australia.
(3)Australian Centre for Ecogenomics, University of Queensland, Brisbane,
Queensland, Australia.
(4)Division of Genetics and Molecular Biology, Institute of Biological Sciences,
Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia.
(5)School of Medical Science and Menzies Health Institute Queensland, Griffith
University, Gold Coast, Australia.
(6)Institute for Molecular Bioscience, University of Queensland, Brisbane,
Queensland, Australia.
(7)School of Chemistry and Molecular Biosciences, University of Queensland,
Brisbane, Queensland, Australia m.schembri@uq.edu.au.
Uropathogenic Escherichia coli (UPEC) is a major cause of urinary tract and
bloodstream infections and possesses an array of virulence factors for
colonization, survival, and persistence. One such factor is the polysaccharide K
capsule. Among the different K capsule types, the K1 serotype is strongly
associated with UPEC infection. In this study, we completely sequenced the K1
UPEC urosepsis strain PA45B and employed a novel combination of a lytic K1
capsule-specific phage, saturated Tn5 transposon mutagenesis, and high-throughput
transposon-directed insertion site sequencing (TraDIS) to identify the complement
of genes required for capsule production. Our analysis identified known genes
involved in capsule biosynthesis, as well as two additional regulatory genes
(mprA and lrhA) that we characterized at the molecular level. Mutation of mprA
resulted in protection against K1 phage-mediated killing, a phenotype restored by
complementation. We also identified a significantly increased unidirectional Tn5
insertion frequency upstream of the lrhA gene and showed that strong expression
of LrhA induced by a constitutive Pcl promoter led to loss of capsule production.
Further analysis revealed loss of MprA or overexpression of LrhA affected the
transcription of capsule biosynthesis genes in PA45B and increased sensitivity to
killing in whole blood. Similar phenotypes were also observed in UPEC strains
UTI89 (K1) and CFT073 (K2), demonstrating that the effects were neither strain
nor capsule type specific. Overall, this study defined the genome of a UPEC
urosepsis isolate and identified and characterized two new regulatory factors
that affect UPEC capsule production.IMPORTANCE Urinary tract infections (UTIs)
are among the most common bacterial infections in humans and are primarily caused
by uropathogenic Escherichia coli (UPEC). Many UPEC strains express a
polysaccharide K capsule that provides protection against host innate immune
factors and contributes to survival and persistence during infection. The K1
serotype is one example of a polysaccharide capsule type and is strongly
associated with UPEC strains that cause UTIs, bloodstream infections, and
meningitis. The number of UTIs caused by antibiotic-resistant UPEC is steadily
increasing, highlighting the need to better understand factors (e.g., the
capsule) that contribute to UPEC pathogenesis. This study describes the original
and novel application of lytic capsule-specific phage killing, saturated Tn5
transposon mutagenesis, and high-throughput transposon-directed insertion site
sequencing to define the entire complement of genes required for capsule
production in UPEC. Our comprehensive approach uncovered new genes involved in
the regulation of this key virulence determinant.
Copyright © 2017 Goh et al.
DOI: 10.1128/mBio.01558-17
PMCID: PMC5654933
PMID: 29066548
4. Nucleic Acids Res. 2017 Sep 6;45(15):e145. doi: 10.1093/nar/gkx594.
Data exploration, quality control and statistical analysis of ChIP-exo/nexus
experiments.
Welch R(1), Chung D(2), Grass J(3)(4), Landick R(3)(4)(5), Keles S(1)(6).
Author information:
(1)Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706,
USA.
(2)Department of Public Health Sciences, Medical University of South Carolina, SC
29425, USA.
(3)Great Lakes Bioenergy Research Center, University of Wisconsin-Madison,
Madison, WI 53726, USA.
(4)Department of Biochemistry, University of Wisconsin-Madison, Madison, WI
53706, USA.
(5)Department of Bacteriology, University of Wisconsin-Madison, Madison, WI
53706, USA.
(6)Department of Biostatistics and Medical Informatics, University of
Wisconsin-Madison, Madison, WI 53792, USA.
ChIP-exo/nexus experiments rely on innovative modifications of the commonly used
ChIP-seq protocol for high resolution mapping of transcription factor binding
sites. Although many aspects of the ChIP-exo data analysis are similar to those
of ChIP-seq, these high throughput experiments pose a number of unique quality
control and analysis challenges. We develop a novel statistical quality control
pipeline and accompanying R/Bioconductor package, ChIPexoQual, to enable
exploration and analysis of ChIP-exo and related experiments. ChIPexoQual
evaluates a number of key issues including strand imbalance, library complexity,
and signal enrichment of data. Assessment of these features are facilitated
through diagnostic plots and summary statistics computed over regions of the
genome with varying levels of coverage. We evaluated our QC pipeline with both
large collections of public ChIP-exo/nexus data and multiple, new ChIP-exo
datasets from Escherichia coli. ChIPexoQual analysis of these datasets resulted
in guidelines for using these QC metrics across a wide range of sequencing depths
and provided further insights for modelling ChIP-exo data.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic
Acids Research.
DOI: 10.1093/nar/gkx594
PMCID: PMC5587812
PMID: 28911122 [Indexed for MEDLINE]
5. Int J Genomics. 2017;2017:6489383. doi: 10.1155/2017/6489383. Epub 2017 Jul 16.
Differential MicroRNA Analyses of Burkholderia pseudomallei- and Francisella
tularensis-Exposed hPBMCs Reveal Potential Biomarkers.
Cer RZ(1)(2), Herrera-Galeano JE(1)(2)(3), Frey KG(1)(2), Schully KL(1)(2), Luu
TV(1)(2), Pesce J(1)(2)(4), Mokashi VP(1)(5), Keane-Myers AM(1)(2)(6),
Bishop-Lilly KA(1)(2).
Author information:
(1)Genomics and Bioinformatics Department, Biological Defense Research
Directorate, Naval Medical Research Center, Frederick, MD, USA.
(2)Henry M. Jackson Foundation for the Advancement of Military Medicine,
Bethesda, MD, USA.
(3)KCE Services and Consulting LLC, Columbia, MD, USA.
(4)Division of Microbiology and Infectious Diseases, National Institute of
Allergy and Infectious Diseases, Bethesda, MD, USA.
(5)Navy Drug Screening Laboratory, Jacksonville, FL, USA.
(6)Immunology, National Institute of Health, Bethesda, MD, USA.
Increasing evidence that microRNAs (miRNAs) play important roles in the immune
response against infectious agents suggests that miRNA might be exploitable as
signatures of exposure to specific infectious agents. In order to identify
potential early miRNA biomarkers of bacterial infections, human peripheral blood
mononuclear cells (hPBMCs) were exposed to two select agents, Burkholderia
pseudomallei K96243 and Francisella tularensis SHU S4, as well as to the
nonpathogenic control Escherichia coli DH5α. RNA samples were harvested at three
early time points, 30, 60, and 120 minutes postexposure, then sequenced. RNAseq
analyses identified 87 miRNAs to be differentially expressed (DE) in a linear
fashion. Of these, 31 miRNAs were tested using the miScript miRNA qPCR assay.
Through RNAseq identification and qPCR validation, we identified differentially
expressed miRNA species that may be involved in the early response to bacterial
infections. Based upon its upregulation at early time points postexposure in two
different individuals, hsa-mir-30c-5p is a miRNA species that could be studied
further as a potential biomarker for exposure to these gram-negative
intracellular pathogens. Gene ontology functional analyses demonstrated that
programmed cell death is the first ranking biological process associated with
miRNAs that are upregulated in F. tularensis-exposed hPBMCs.
DOI: 10.1155/2017/6489383
PMCID: PMC5534298
PMID: 28791299
6. NPJ Syst Biol Appl. 2017 Jun 22;3:17. doi: 10.1038/s41540-017-0019-y. eCollection
2017.
Reverse engineering highlights potential principles of large gene regulatory
network design and learning.
Carré C(1)(2), Mas A(1), Krouk G(2).
Author information:
(1)Institut Montpelliérain Alexander Grothendieck, Université de Montpellier,
Montpellier, France.
(2)Laboratoire de Biochimie et Physiologie Moléculaire des Plantes, Institut de
Biologie Intégrative des Plantes 'Claude Grignon', UMR5004 CNRS, INRA, SupAgro,
UM, Place Pierre Viala, Montpellier, 34060 France.
Inferring transcriptional gene regulatory networks from transcriptomic datasets
is a key challenge of systems biology, with potential impacts ranging from
medicine to agronomy. There are several techniques used presently to
experimentally assay transcription factors to target relationships, defining
important information about real gene regulatory networks connections. These
techniques include classical ChIP-seq, yeast one-hybrid, or more recently,
DAP-seq or target technologies. These techniques are usually used to validate
algorithm predictions. Here, we developed a reverse engineering approach based on
mathematical and computer simulation to evaluate the impact that this prior
knowledge on gene regulatory networks may have on training machine learning
algorithms. First, we developed a gene regulatory networks-simulating engine
called FRANK (Fast Randomizing Algorithm for Network Knowledge) that is able to
simulate large gene regulatory networks (containing 104 genes) with
characteristics of gene regulatory networks observed in vivo. FRANK also
generates stable or oscillatory gene expression directly produced by the
simulated gene regulatory networks. The development of FRANK leads to important
general conclusions concerning the design of large and stable gene regulatory
networks harboring scale free properties (built ex nihilo). In combination with
supervised (accepting prior knowledge) support vector machine algorithm we (i)
address biologically oriented questions concerning our capacity to accurately
reconstruct gene regulatory networks and in particular we demonstrate that
prior-knowledge structure is crucial for accurate learning, and (ii) draw
conclusions to inform experimental design to performed learning able to solve
gene regulatory networks in the future. By demonstrating that our predictions
concerning the influence of the prior-knowledge structure on support vector
machine learning capacity holds true on real data (Escherichia coli K14 network
reconstruction using network and transcriptomic data), we show that the formalism
used to build FRANK can to some extent be a reasonable model for gene regulatory
networks in real cells.
DOI: 10.1038/s41540-017-0019-y
PMCID: PMC5481436
PMID: 28649444
7. PLoS One. 2017 Jun 14;12(6):e0178966. doi: 10.1371/journal.pone.0178966.
eCollection 2017.
Intestinal organoids model human responses to infection by commensal and Shiga
toxin producing Escherichia coli.
Karve SS(1), Pradhan S(1), Ward DV(2), Weiss AA(1).
Author information:
(1)Department of Molecular Genetics, Biochemistry, and Microbiology, University
of Cincinnati, Cincinnati, Ohio, United States of America.
(2)Center for Microbiome Research and Department of Microbiology and
Physiological Systems, University of Massachusetts Medical School, Worcester,
Massachusetts, United States of America.
Infection with Shiga toxin (Stx) producing Escherichia coli O157:H7 can cause the
potentially fatal complication hemolytic uremic syndrome, and currently only
supportive therapy is available. Lack of suitable animal models has hindered
study of this disease. Induced human intestinal organoids (iHIOs), generated by
in vitro differentiation of pluripotent stem cells, represent differentiated
human intestinal tissue. We show that iHIOs with addition of human neutrophils
can model E. coli intestinal infection and innate cellular responses. Commensal
and O157:H7 introduced into the iHIO lumen replicated rapidly achieving high
numbers. Commensal E. coli did not cause damage, and were completely contained
within the lumen, suggesting defenses, such as mucus production, can constrain
non-pathogenic strains. Some O157:H7 initially co-localized with cellular actin.
Loss of actin and epithelial integrity was observed after 4 hours. O157:H7 grew
as filaments, consistent with activation of the bacterial SOS stress response.
SOS is induced by reactive oxygen species (ROS), and O157:H7 infection increased
ROS production. Transcriptional profiling (RNAseq) demonstrated that both
commensal and O157:H7 upregulated genes associated with gastrointestinal
maturation, while infection with O157:H7 upregulated inflammatory responses,
including interleukin 8 (IL-8). IL-8 is associated with neutrophil recruitment,
and infection with O157:H7 resulted in recruitment of human neutrophils into the
iHIO tissue.
DOI: 10.1371/journal.pone.0178966
PMCID: PMC5470682
PMID: 28614372 [Indexed for MEDLINE]
8. J Bacteriol. 2017 Jun 13;199(13). pii: e00086-17. doi: 10.1128/JB.00086-17. Print
2017 Jul 1.
Genome-Wide Analysis of ResD, NsrR, and Fur Binding in Bacillus subtilis during
Anaerobic Fermentative Growth by In Vivo Footprinting.
Chumsakul O(1), Anantsri DP(2), Quirke T(3), Oshima T(1), Nakamura K(4), Ishikawa
S(5)(6), Nakano MM(7).
Author information:
(1)Graduate School of Biological Sciences, Nara Institute of Science and
Technology, Nara, Japan.
(2)Institute of Environmental Health, Oregon Health & Science University,
Portland, Oregon, USA.
(3)Sam Barlow High School, Gresham, Oregon, USA.
(4)Department of Life Science and Informatics, Maebashi Institute of Technology,
Gunma, Japan.
(5)Graduate School of Biological Sciences, Nara Institute of Science and
Technology, Nara, Japan shu@people.kobe-u.ac.jp nakanom@ohsu.edu.
(6)Graduate School of Science, Technology & Innovation, Kobe University, Kobe,
Japan.
(7)Institute of Environmental Health, Oregon Health & Science University,
Portland, Oregon, USA shu@people.kobe-u.ac.jp nakanom@ohsu.edu.
Upon oxygen limitation, the Bacillus subtilis ResE sensor kinase and its cognate
ResD response regulator play primary roles in the transcriptional activation of
genes functioning in anaerobic respiration. The nitric oxide (NO)-sensitive NsrR
repressor controls transcription to support nitrate respiration. In addition, the
ferric uptake repressor (Fur) can modulate transcription under anaerobic
conditions. However, whether these controls are direct or indirect has been
investigated only in a gene-specific manner. To gain a genomic view of anaerobic
gene regulation, we determined the genome-wide in vivo DNA binding of ResD, NsrR,
and Fur transcription factors (TFs) using in situ DNase I footprinting combined
with chromatin affinity precipitation sequencing (ChAP-seq; genome footprinting
by high-throughput sequencing [GeF-seq]). A significant number of sites were
targets of ResD and NsrR, and a majority of them were also bound by Fur. The
binding of multiple TFs to overlapping targets affected each individual TF's
binding, which led to combinatorial transcriptional control. ResD bound to both
the promoters and the coding regions of genes under its positive control. Other
genes showing enrichment of ResD at only the promoter regions are targets of
direct ResD-dependent repression or antirepression. The results support previous
findings of ResD as an RNA polymerase (RNAP)-binding protein and indicated that
ResD can associate with the transcription elongation complex. The data set
allowed us to reexamine consensus sequence motifs of Fur, ResD, and NsrR and
uncovered evidence that multiple TGW (where W is A or T) sequences surrounded by
an A- and T-rich sequence are often found at sites where all three TFs
competitively bind.IMPORTANCE Bacteria encounter oxygen fluctuation in their
natural environment as well as in host organisms. Hence, understanding how
bacteria respond to oxygen limitation will impact environmental and human health.
ResD, NsrR, and Fur control transcription under anaerobic conditions. This work
using in situ DNase I footprinting uncovered the genome-wide binding profile of
the three transcription factors (TFs). Binding of the TFs is often competitive or
cooperative depending on the promoters and the presence of other TFs, indicating
that transcriptional regulation by multiple TFs is much more complex than we
originally thought. The results from this study provide a more complete picture
of anaerobic gene regulation governed by ResD, NsrR, and Fur and contribute to
our further understanding of anaerobic physiology.
Copyright © 2017 American Society for Microbiology.
DOI: 10.1128/JB.00086-17
PMCID: PMC5472814
PMID: 28439033 [Indexed for MEDLINE]
9. BMC Genet. 2017 Mar 7;18(1):21. doi: 10.1186/s12863-017-0488-4.
Deciphering alternative splicing and nonsense-mediated decay modulate expression
in primary lymphoid tissues of birds infected with avian pathogenic E. coli
(APEC).
Sun H(1).
Author information:
(1)College of Animal Science and Technology, Yangzhou University, Yangzhou,
Jiangsu, 225009, China. hongyans2392@163.com.
BACKGROUND: Avian pathogenic E. coli (APEC) can lead to a loss in millions of
dollars in poultry annually because of mortality and produce contamination.
Studies have verified that many immune-related genes undergo changes in
alternative splicing (AS), along with nonsense mediated decay (NMD), to regulate
the immune system under different conditions. Therefore, the splicing profiles of
primary lymphoid tissues with systemic APEC infection need to be comprehensively
examined.
RESULTS: Gene expression in RNAseq data were obtained for three different immune
tissues (bone marrow, thymus, and bursa) from three phenotype birds
(non-challenged, resistant, and susceptible birds) at two time points.
Alternative 5' splice sites and exon skipping/inclusion were identified as the
major alternative splicing events in avian primary immune organs under systemic
APEC infection. In this study, we detected hundreds of
differentially-expressed-transcript-containing genes (DETs) between different
phenotype birds at 5 days post-infection (dpi). DETs, PSAP and STT3A, with NMD
have important functions under systemic APEC infection. DETs, CDC45, CDK1, RAG2,
POLR1B, PSAP, and DNASE1L3, from the same transcription start sites (TSS)
indicate that cell death, cell cycle, cellular function, and maintenance were
predominant in host under systemic APEC.
CONCLUSIONS: With the use of RNAseq technology and bioinformatics tools, this
study provides a portrait of the AS event and NMD in primary lymphoid tissues,
which play critical roles in host homeostasis under systemic APEC infection.
According to this study, AS plays a pivotal regulatory role in the immune
response in chicken under systemic APEC infection via either NMD or alternative
TSSs. This study elucidates the regulatory role of AS for the immune complex
under systemic APEC infection.
DOI: 10.1186/s12863-017-0488-4
PMCID: PMC5341183
PMID: 28270101 [Indexed for MEDLINE]
10. PLoS Negl Trop Dis. 2017 Jan 6;11(1):e0005273. doi: 10.1371/journal.pntd.0005273.
eCollection 2017 Jan.
Transcriptome Sequencing Reveals Large-Scale Changes in Axenic Aedes aegypti
Larvae.
Vogel KJ(1), Valzania L(1), Coon KL(1), Brown MR(1), Strand MR(1).
Author information:
(1)Department of Entomology, The University of Georgia, Athens, Georgia, United
States of America.
Mosquitoes host communities of microbes in their digestive tract that consist
primarily of bacteria. We previously reported that Aedes aegypti larvae colonized
by a native community of bacteria and gnotobiotic larvae colonized by only
Escherichia coli develop very similarly into adults, whereas axenic larvae never
molt and die as first instars. In this study, we extended these findings by first
comparing the growth and abundance of bacteria in conventional, gnotobiotic, and
axenic larvae during the first instar. Results showed that conventional and
gnotobiotic larvae exhibited no differences in growth, timing of molting, or
number of bacteria in their digestive tract. Axenic larvae in contrast grew
minimally and never achieved the critical size associated with molting by
conventional and gnotobiotic larvae. In the second part of the study we compared
patterns of gene expression in conventional, gnotobiotic and axenic larvae by
conducting an RNAseq analysis of gut and nongut tissues (carcass) at 22 h
post-hatching. Approximately 12% of Ae. aegypti transcripts were differentially
expressed in axenic versus conventional or gnotobiotic larvae. However, this
profile consisted primarily of transcripts in seven categories that included the
down-regulation of select peptidases in the gut and up-regulation of several
genes in the gut and carcass with roles in amino acid transport, hormonal
signaling, and metabolism. Overall, our results indicate that axenic larvae
exhibit alterations in gene expression consistent with defects in acquisition and
assimilation of nutrients required for growth.
DOI: 10.1371/journal.pntd.0005273
PMCID: PMC5245907
PMID: 28060822 [Indexed for MEDLINE]
Conflict of interest statement: The authors have declared that no competing
interests exist.
11. Appl Environ Microbiol. 2017 Feb 15;83(5). pii: e03028-16. doi:
10.1128/AEM.03028-16. Print 2017 Mar 1.
Salmonella Persistence in Tomatoes Requires a Distinct Set of Metabolic Functions
Identified by Transposon Insertion Sequencing.
de Moraes MH(1), Desai P(2), Porwollik S(2), Canals R(2), Perez DR(3), Chu W(2),
McClelland M(2), Teplitski M(3).
Author information:
(1)Soil and Water Science Department, Genetics Institute, University of
Florida-IFAS, Gainesville, Florida, USA mhdmoraes@gmail.com.
(2)Department of Microbiology and Molecular Genetics, University of California,
Irvine, Irvine, California, USA.
(3)Soil and Water Science Department, Genetics Institute, University of
Florida-IFAS, Gainesville, Florida, USA.
Human enteric pathogens, such as Salmonella spp. and verotoxigenic Escherichia
coli, are increasingly recognized as causes of gastroenteritis outbreaks
associated with the consumption of fruits and vegetables. Persistence in plants
represents an important part of the life cycle of these pathogens. The
identification of the full complement of Salmonella genes involved in the
colonization of the model plant (tomato) was carried out using transposon
insertion sequencing analysis. With this approach, 230,000 transposon insertions
were screened in tomato pericarps to identify loci with reduction in fitness,
followed by validation of the screen results using competition assays of the
isogenic mutants against the wild type. A comparison with studies in animals
revealed a distinct plant-associated set of genes, which only partially overlaps
with the genes required to elicit disease in animals. De novo biosynthesis of
amino acids was critical to persistence within tomatoes, while amino acid
scavenging was prevalent in animal infections. Fitness reduction of the
Salmonella amino acid synthesis mutants was generally more severe in the tomato
rin mutant, which hyperaccumulates certain amino acids, suggesting that these
nutrients remain unavailable to Salmonella spp. within plants. Salmonella
lipopolysaccharide (LPS) was required for persistence in both animals and plants,
exemplifying some shared pathogenesis-related mechanisms in animal and plant
hosts. Similarly to phytopathogens, Salmonella spp. required biosynthesis of
amino acids, LPS, and nucleotides to colonize tomatoes. Overall, however, it
appears that while Salmonella shares some strategies with phytopathogens and taps
into its animal virulence-related functions, colonization of tomatoes represents
a distinct strategy, highlighting this pathogen's flexible metabolism.IMPORTANCE
Outbreaks of gastroenteritis caused by human pathogens have been increasingly
associated with foods of plant origin, with tomatoes being one of the common
culprits. Recent studies also suggest that these human pathogens can use plants
as alternate hosts as a part of their life cycle. While dual (animal/plant)
lifestyles of other members of the Enterobacteriaceae family are well known, the
strategies with which Salmonella colonizes plants are only partially understood.
Therefore, we undertook a high-throughput characterization of the functions
required for Salmonella persistence within tomatoes. The results of this study
were compared with what is known about genes required for Salmonella virulence in
animals and interactions of plant pathogens with their hosts to determine whether
Salmonella repurposes its virulence repertoire inside plants or whether it
behaves more as a phytopathogen during plant colonization. Even though Salmonella
utilized some of its virulence-related genes in tomatoes, plant colonization
required a distinct set of functions.
Copyright © 2017 American Society for Microbiology.
DOI: 10.1128/AEM.03028-16
PMCID: PMC5311394
PMID: 28039131 [Indexed for MEDLINE]
12. Antimicrob Agents Chemother. 2017 Jan 24;61(2). pii: e01740-16. doi:
10.1128/AAC.01740-16. Print 2017 Feb.
Identification of IncA/C Plasmid Replication and Maintenance Genes and
Development of a Plasmid Multilocus Sequence Typing Scheme.
Hancock SJ(1)(2), Phan MD(3)(2), Peters KM(1)(2), Forde BM(1)(2), Chong TM(4),
Yin WF(4), Chan KG(4), Paterson DL(5), Walsh TR(6), Beatson SA(1)(2), Schembri
MA(3)(2).
Author information:
(1)Australian Infectious Diseases Research Centre, University of Queensland,
Brisbane, Australia.
(2)School of Chemistry and Molecular Biosciences, University of Queensland,
Brisbane, Australia.
(3)Australian Infectious Diseases Research Centre, University of Queensland,
Brisbane, Australia m.phan1@uq.edu.au m.schembri@uq.edu.au.
(4)Faculty of Science, Division of Genetics and Molecular Biology, Institute of
Biological Sciences, University of Malaya, Kuala Lumpur, Malaysia.
(5)University of Queensland Centre for Clinical Research, Brisbane, Australia.
(6)Department of Medical Microbiology and Infectious Disease, Cardiff University,
Cardiff, United Kingdom.
Plasmids of incompatibility group A/C (IncA/C) are becoming increasingly
prevalent within pathogenic Enterobacteriaceae They are associated with the
dissemination of multiple clinically relevant resistance genes, including blaCMY
and blaNDM Current typing methods for IncA/C plasmids offer limited resolution.
In this study, we present the complete sequence of a blaNDM-1-positive IncA/C
plasmid, pMS6198A, isolated from a multidrug-resistant uropathogenic Escherichia
coli strain. Hypersaturated transposon mutagenesis, coupled with
transposon-directed insertion site sequencing (TraDIS), was employed to identify
conserved genetic elements required for replication and maintenance of pMS6198A.
Our analysis of TraDIS data identified roles for the replicon, including repA, a
toxin-antitoxin system; two putative partitioning genes, parAB; and a putative
gene, 053 Construction of mini-IncA/C plasmids and examination of their stability
within E. coli confirmed that the region encompassing 053 contributes to the
stable maintenance of IncA/C plasmids. Subsequently, the four major maintenance
genes (repA, parAB, and 053) were used to construct a new plasmid multilocus
sequence typing (PMLST) scheme for IncA/C plasmids. Application of this scheme to
a database of 82 IncA/C plasmids identified 11 unique sequence types (STs), with
two dominant STs. The majority of blaNDM-positive plasmids examined (15/17; 88%)
fall into ST1, suggesting acquisition and subsequent expansion of this
blaNDM-containing plasmid lineage. The IncA/C PMLST scheme represents a
standardized tool to identify, track, and analyze the dissemination of important
IncA/C plasmid lineages, particularly in the context of epidemiological studies.
Copyright © 2017 American Society for Microbiology.
DOI: 10.1128/AAC.01740-16
PMCID: PMC5278728
PMID: 27872077 [Indexed for MEDLINE]
13. EMBO J. 2017 Feb 1;36(3):374-387. doi: 10.15252/embj.201694639. Epub 2016 Nov 11.
Small RNA interactome of pathogenic E. coli revealed through crosslinking of
RNase E.
Waters SA(1), McAteer SP(2), Kudla G(3), Pang I(1)(4), Deshpande NP(1)(4), Amos
TG(1), Leong KW(5), Wilkins MR(1)(4), Strugnell R(5), Gally DL(2), Tollervey
D(6), Tree JJ(7).
Author information:
(1)School of Biotechnology and Biomolecular Sciences, University of New South
Wales, Sydney, NSW, Australia.
(2)The Roslin Institute, University of Edinburgh, Edinburgh, UK.
(3)MRC Human Genetic Unit, University of Edinburgh, Edinburgh, UK.
(4)Systems Biology Initiative, University of New South Wales, Sydney, NSW,
Australia.
(5)Peter Doherty Institute, University of Melbourne, Melbourne, Victoria,
Australia.
(6)Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, UK
j.tree@unsw.edu.au d.tollervey@ed.ac.uk.
(7)School of Biotechnology and Biomolecular Sciences, University of New South
Wales, Sydney, NSW, Australia j.tree@unsw.edu.au d.tollervey@ed.ac.uk.
RNA sequencing studies have identified hundreds of non-coding RNAs in bacteria,
including regulatory small RNA (sRNA). However, our understanding of sRNA
function has lagged behind their identification due to a lack of tools for the
high-throughput analysis of RNA-RNA interactions in bacteria. Here we demonstrate
that in vivo sRNA-mRNA duplexes can be recovered using UV-crosslinking, ligation
and sequencing of hybrids (CLASH). Many sRNAs recruit the endoribonuclease, RNase
E, to facilitate processing of mRNAs. We were able to recover base-paired
sRNA-mRNA duplexes in association with RNase E, allowing proximity-dependent
ligation and sequencing of cognate sRNA-mRNA pairs as chimeric reads. We verified
that this approach captures bona fide sRNA-mRNA interactions. Clustering analyses
identified novel sRNA seed regions and sets of potentially co-regulated target
mRNAs. We identified multiple mRNA targets for the pathotype-specific sRNA Esr41,
which was shown to regulate colicin sensitivity and iron transport in E. coli
Numerous sRNA interactions were also identified with non-coding RNAs, including
sRNAs and tRNAs, demonstrating the high complexity of the sRNA interactome.
© 2016 The Authors. Published under the terms of the CC BY 4.0 license.
DOI: 10.15252/embj.201694639
PMCID: PMC5286369
PMID: 27836995 [Indexed for MEDLINE]
14. Nucleic Acids Res. 2016 Dec 1;44(21):10117-10131. Epub 2016 Aug 4.
Silencing of cryptic prophages in Corynebacterium glutamicum.
Pfeifer E(1), Hünnefeld M(1), Popa O(2), Polen T(1), Kohlheyer D(1), Baumgart
M(1), Frunzke J(3).
Author information:
(1)Institute of Bio- und Geosciences, IBG-1: Biotechnology, Forschungszentrum
Jülich, 52425 Jülich, Germany.
(2)Quantitative and Theoretical Biology, Heinrich-Heine-Universität Düsseldorf,
40225, Düsseldorf, Germany.
(3)Institute of Bio- und Geosciences, IBG-1: Biotechnology, Forschungszentrum
Jülich, 52425 Jülich, Germany j.frunzke@fz-juelich.de.
DNA of viral origin represents a ubiquitous element of bacterial genomes. Its
integration into host regulatory circuits is a pivotal driver of microbial
evolution but requires the stringent regulation of phage gene activity. In this
study, we describe the nucleoid-associated protein CgpS, which represents an
essential protein functioning as a xenogeneic silencer in the Gram-positive
Corynebacterium glutamicum CgpS is encoded by the cryptic prophage CGP3 of the C.
glutamicum strain ATCC 13032 and was first identified by DNA affinity
chromatography using an early phage promoter of CGP3. Genome-wide profiling of
CgpS binding using chromatin affinity purification and sequencing (ChAP-Seq)
revealed its association with AT-rich DNA elements, including the entire CGP3
prophage region (187 kbp), as well as several other elements acquired by
horizontal gene transfer. Countersilencing of CgpS resulted in a significantly
increased induction frequency of the CGP3 prophage. In contrast, a strain lacking
the CGP3 prophage was not affected and displayed stable growth. In a
bioinformatics approach, cgpS orthologs were identified primarily in
actinobacterial genomes as well as several phage and prophage genomes. Sequence
analysis of 618 orthologous proteins revealed a strong conservation of the
secondary structure, supporting an ancient function of these xenogeneic silencers
in phage-host interaction.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic
Acids Research.
DOI: 10.1093/nar/gkw692
PMCID: PMC5137423
PMID: 27492287 [Indexed for MEDLINE]
15. Poult Sci. 2016 Dec 1;95(12):2803-2814. Epub 2016 Jul 27.
Thymus transcriptome reveals novel pathways in response to avian pathogenic
Escherichia coli infection.
Sun H(1)(2), Liu P(3), Nolan LK(4), Lamont SJ(5).
Author information:
(1)College of Animal Science and Technology, Yangzhou University, Yangzhou,
Jiangsu, China, 225009.
(2)Department of Animal Science, Iowa State University, Ames 50011.
(3)Department of Statistics, Iowa State University, Ames 50011.
(4)Department of Veterinary Microbiology and Preventive Medicine, Iowa State
University, Ames 50011.
(5)Department of Animal Science, Iowa State University, Ames 50011
sjlamont@iastate.edu.
Avian pathogenic Escherichia coli (APEC) can cause significant morbidity in
chickens. The thymus provides the essential environment for T cell development;
however, the thymus transcriptome has not been examined for gene expression in
response to APEC infection. An improved understanding of the host genomic
response to APEC infection could inform future breeding programs for disease
resistance and APEC control. We therefore analyzed the transcriptome of the
thymus of birds challenged with APEC, contrasting susceptible and resistant
phenotypes. Thousands of genes were differentially expressed in birds of the
5-day post infection (dpi) challenged-susceptible group vs. 5 dpi non-challenged,
in 5 dpi challenged-susceptible vs. 5 dpi challenged-resistant birds, as well as
in 5 dpi vs. one dpi challenged-susceptible birds. The Toll-like receptor
signaling pathway was the major innate immune response for birds to respond to
APEC infection. Moreover, lysosome and cell adhesion molecules pathways were
common mechanisms for chicken response to APEC infection. The T-cell receptor
signaling pathway, cell cycle, and p53 signaling pathways were significantly
activated in resistant birds to resist APEC infection. These results provide a
comprehensive assessment of global gene networks and biological functionalities
of differentially expressed genes in the thymus under APEC infection. These
findings provide novel insights into key molecular genetic mechanisms that
differentiate host resistance from susceptibility in this primary lymphoid
tissue, the thymus.
© The Author 2016. Published by Oxford University Press on behalf of Poultry
Science Association.
DOI: 10.3382/ps/pew202
PMCID: PMC5144662
PMID: 27466434 [Indexed for MEDLINE]
16. Sci Rep. 2016 Jul 18;6:30025. doi: 10.1038/srep30025.
Systematic analysis of an evolved Thermobifida fusca muC producing malic acid on
organic and inorganic nitrogen sources.
Deng Y(1)(2), Lin J(3), Mao Y(1)(2), Zhang X(4).
Author information:
(1)National Engineering Laboratory for Cereal Fermentation Technology (NELCF),
Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu 214122, China.
(2)The Key Laboratory of Industrial Biotechnology, Ministry of Education,
Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu 214122, China.
(3)College of Life Science, North China University of Science and Technology,
Tangshan 063000, China.
(4)School of pharmaceutical science, Jiangnan University, 1800 Lihu Road, Wuxi,
Jiangsu 214122, China.
Thermobifida fusca is a thermophilic actinobacterium. T. fusca muC obtained by
adaptive evolution preferred yeast extract to ammonium sulfate for accumulating
malic acid and ammonium sulfate for cell growth. We did transcriptome analysis of
T. fusca muC on Avicel and cellobiose with addition of ammonium sulfate or yeast
extract, respectively by RNAseq. The transcriptional results indicate that
ammonium sulfate induced the transcriptions of the genes related to carbohydrate
metabolisms significantly more than yeast extract. Importantly, Tfu_2487,
encoding histidine-containing protein (HPr), didn't transcribe on yeast extract
at all, while it transcribed highly on ammonium sulfate. In order to understand
the impact of HPr on malate production and cell growth of the muC strain, we
deleted Tfu_2487 to get a mutant strain: muCΔ2487, which had
1.33 mole/mole-glucose equivalent malate yield, much higher than that on yeast
extract. We then developed an E. coli-T. fusca shuttle plasmid for
over-expressing HPr in muCΔ2487, a strain without HPr background, forming the
muCΔ2487S strain. The muCΔ2487S strain had a much lower malate yield but faster
cell growth than the muC strain. The results of both mutant strains confirmed
that HPr was the key regulatory protein for T. fusca's metabolisms on nitrogen
sources.
DOI: 10.1038/srep30025
PMCID: PMC4948018
PMID: 27424527
17. PLoS One. 2016 Jun 23;11(6):e0157480. doi: 10.1371/journal.pone.0157480.
eCollection 2016.
The Impact of Intramammary Escherichia coli Challenge on Liver and Mammary
Transcriptome and Cross-Talk in Dairy Cows during Early Lactation Using RNAseq.
Moyes KM(1), Sørensen P(2), Bionaz M(3).
Author information:
(1)Department of Animal and Avian Sciences, University of Maryland, College Park,
Maryland, United States of America.
(2)Center for Quantitative Genetics and Genomics, Department of Molecular Biology
and Genetics, Aarhus University, 8830 Tjele, Denmark.
(3)Department of Animal and Rangeland Sciences, Oregon State University,
Corvallis, Oregon, United States of America.
Our objective was to identify the biological response and the cross-talk between
liver and mammary tissue after intramammary infection (IMI) with Escherichia coli
(E. coli) using RNAseq technology. Sixteen cows were inoculated with live E. coli
into one mammary quarter at ~4-6 weeks in lactation. For all cows, biopsies were
performed at -144, 12 and 24 h relative to IMI in liver and at 24 h post-IMI in
infected and non-infected (control) mammary quarters. For a subset of cows (n =
6), RNA was extracted from both liver and mammary tissue and sequenced using a
100 bp paired-end approach. Ingenuity Pathway Analysis and the Dynamic Impact
Approach analysis of differentially expressed genes (overall effect False
Discovery Rate≤0.05) indicated that IMI induced an overall activation of
inflammation at 12 h post-IMI and a strong inhibition of metabolism, especially
related to lipid, glucose, and xenobiotics at 24 h post-IMI in liver. The data
indicated in mammary tissue an overall induction of inflammatory response with
little effect on metabolism at 24 h post-IMI. We identified a large number of
up-stream regulators potentially involved in the response to IMI in both tissues
but a relatively small core network of transcription factors controlling the
response to IMI for liver whereas a large network in mammary tissue.
Transcriptomic results in liver and mammary tissue were supported by changes in
inflammatory and metabolic mediators in blood and milk. The analysis of potential
cross-talk between the two tissues during IMI uncovered a large communication
from the mammary tissue to the liver to coordinate the inflammatory response but
a relatively small communication from the liver to the mammary tissue. Our
results indicate a strong induction of the inflammatory response in mammary
tissue and impairment of liver metabolism 24h post-IMI partly driven by the
signaling from infected mammary tissue.
DOI: 10.1371/journal.pone.0157480
PMCID: PMC4919052
PMID: 27336699 [Indexed for MEDLINE]
18. Genes Dev. 2016 Jun 1;30(11):1327-38. doi: 10.1101/gad.280834.116.
S1-DRIP-seq identifies high expression and polyA tracts as major contributors to
R-loop formation.
Wahba L(1), Costantino L(1), Tan FJ(2), Zimmer A(1), Koshland D(1).
Author information:
(1)Department of Cell and Molecular Biology, University of California at
Berkeley, Berkeley, California 94720, USA;
(2)Department of Embryology, Carnegie Institution for Science, Baltimore,
Maryland 21218, USA.
R loops form when transcripts hybridize to homologous DNA on chromosomes,
yielding a DNA:RNA hybrid and a displaced DNA single strand. R loops impact the
genome of many organisms, regulating chromosome stability, gene expression, and
DNA repair. Understanding the parameters dictating R-loop formation in vivo has
been hampered by the limited quantitative and spatial resolution of current
genomic strategies for mapping R loops. We report a novel whole-genome method,
S1-DRIP-seq (S1 nuclease DNA:RNA immunoprecipitation with deep sequencing), for
mapping hybrid-prone regions in budding yeast Saccharomyces cerevisiae Using this
methodology, we identified ∼800 hybrid-prone regions covering 8% of the genome.
Given the pervasive transcription of the yeast genome, this result suggests that
R-loop formation is dictated by characteristics of the DNA, RNA, and/or
chromatin. We successfully identified two features highly predictive of hybrid
formation: high transcription and long homopolymeric dA:dT tracts. These
accounted for >60% of the hybrid regions found in the genome. We demonstrated
that these two factors play a causal role in hybrid formation by genetic
manipulation. Thus, the hybrid map generated by S1-DRIP-seq led to the
identification of the first global genomic features causal for R-loop formation
in yeast.
© 2016 Wahba et al.; Published by Cold Spring Harbor Laboratory Press.
DOI: 10.1101/gad.280834.116
PMCID: PMC4911931
PMID: 27298336 [Indexed for MEDLINE]
19. BMC Genomics. 2016 Mar 22;17:253. doi: 10.1186/s12864-016-2387-x.
FabR regulates Salmonella biofilm formation via its direct target FabB.
Hermans K(1), Roberfroid S(1), Thijs IM(1), Kint G(1), De Coster D(1), Marchal
K(1), Vanderleyden J(1), De Keersmaecker SC(1)(2), Steenackers HP(3).
Author information:
(1)Department of Microbial and Molecular Systems, Centre of Microbial and Plant
Genetics, Katholieke Universiteit Leuven, Kasteelpark Arenberg 20, 3001, Leuven,
Belgium.
(2)Platform Biotechnology and Molecular Biology, Scientific Institute of Public
Health (WIV-ISP), Brussels, Belgium.
(3)Department of Microbial and Molecular Systems, Centre of Microbial and Plant
Genetics, Katholieke Universiteit Leuven, Kasteelpark Arenberg 20, 3001, Leuven,
Belgium. Hans.Steenackers@biw.kuleuven.be.
BACKGROUND: Biofilm formation is an important survival strategy of Salmonella in
all environments. By mutant screening, we showed a knock-out mutant of fabR,
encoding a repressor of unsaturated fatty acid biosynthesis (UFA), to have
impaired biofilm formation. In order to unravel how this regulator impinges on
Salmonella biofilm formation, we aimed at elucidating the S. Typhimurium FabR
regulon. Hereto, we applied a combinatorial high-throughput approach, combining
ChIP-chip with transcriptomics.
RESULTS: All the previously identified E. coli FabR transcriptional target genes
(fabA, fabB and yqfA) were shown to be direct S. Typhimurium FabR targets as
well. As we found a fabB overexpressing strain to partly mimic the biofilm defect
of the fabR mutant, the effect of FabR on biofilms can be attributed at least
partly to FabB, which plays a key role in UFA biosynthesis. Additionally,
ChIP-chip identified a number of novel direct FabR targets (the intergenic
regions between hpaR/hpaG and ddg/ydfZ) and yet putative direct targets (i.a.
genes involved in tRNA metabolism, ribosome synthesis and translation). Next to
UFA biosynthesis, a number of these direct targets and other indirect targets
identified by transcriptomics (e.g. ribosomal genes, ompA, ompC, ompX, osmB,
osmC, sseI), could possibly contribute to the effect of FabR on biofilm
formation.
CONCLUSION: Overall, our results point at the importance of FabR and UFA
biosynthesis in Salmonella biofilm formation and their role as potential targets
for biofilm inhibitory strategies.
DOI: 10.1186/s12864-016-2387-x
PMCID: PMC4804515
PMID: 27004424 [Indexed for MEDLINE]
20. J Vis Exp. 2016 Jan 27;(107):e53620. doi: 10.3791/53620.
DamID-seq: Genome-wide Mapping of Protein-DNA Interactions by High Throughput
Sequencing of Adenine-methylated DNA Fragments.
Wu F(1), Olson BG(1), Yao J(2).
Author information:
(1)Department of Cell Biology, Yale University School of Medicine.
(2)Department of Cell Biology, Yale University School of Medicine;
yao.j@yale.edu.
The DNA adenine methyltransferase identification (DamID) assay is a powerful
method to detect protein-DNA interactions both locally and genome-wide. It is an
alternative approach to chromatin immunoprecipitation (ChIP). An expressed fusion
protein consisting of the protein of interest and the E. coli DNA adenine
methyltransferase can methylate the adenine base in GATC motifs near the sites of
protein-DNA interactions. Adenine-methylated DNA fragments can then be
specifically amplified and detected. The original DamID assay detects the genomic
locations of methylated DNA fragments by hybridization to DNA microarrays, which
is limited by the availability of microarrays and the density of predetermined
probes. In this paper, we report the detailed protocol of integrating high
throughput DNA sequencing into DamID (DamID-seq). The large number of short reads
generated from DamID-seq enables detecting and localizing protein-DNA
interactions genome-wide with high precision and sensitivity. We have used the
DamID-seq assay to study genome-nuclear lamina (NL) interactions in mammalian
cells, and have noticed that DamID-seq provides a high resolution and a wide
dynamic range in detecting genome-NL interactions. The DamID-seq approach enables
probing NL associations within gene structures and allows comparing genome-NL
interaction maps with other functional genomic data, such as ChIP-seq and
RNA-seq.
DOI: 10.3791/53620
PMCID: PMC4781701
PMID: 26862720 [Indexed for MEDLINE]
21. Semin Cell Dev Biol. 2016 May;53:2-9. doi: 10.1016/j.semcdb.2015.11.012. Epub
2015 Dec 17.
Connecting the dots of the bacterial cell cycle: Coordinating chromosome
replication and segregation with cell division.
Hajduk IV(1), Rodrigues CD(2), Harry EJ(3).
Author information:
(1)The ithree institute, University of Technology Sydney, Sydney 2007, NSW,
Australia.
(2)Department of Microbiology and Immunobiology, Harvard Medical School, Boston,
MA 02115, USA.
(3)The ithree institute, University of Technology Sydney, Sydney 2007, NSW,
Australia. Electronic address: liz.harry@uts.edu.au.
Proper division site selection is crucial for the survival of all organisms. What
still eludes us is how bacteria position their division site with high precision,
and in tight coordination with chromosome replication and segregation. Until
recently, the general belief, at least in the model organisms Bacillus subtilis
and Escherichia coli, was that spatial regulation of division comes about by the
combined negative regulatory mechanisms of the Min system and nucleoid occlusion.
However, as we review here, these two systems cannot be solely responsible for
division site selection and we highlight additional regulatory mechanisms that
are at play. In this review, we put forward evidence of how chromosome
replication and segregation may have direct links with cell division in these
bacteria and the benefit of recent advances in chromosome conformation capture
techniques in providing important information about how these three processes
mechanistically work together to achieve accurate generation of progenitor cells.
Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
DOI: 10.1016/j.semcdb.2015.11.012
PMID: 26706151 [Indexed for MEDLINE]
22. BMC Genomics. 2015 Nov 4;16:895. doi: 10.1186/s12864-015-2162-4.
Analysis of the FnrL regulon in Rhodobacter capsulatus reveals limited regulon
overlap with orthologues from Rhodobacter sphaeroides and Escherichia coli.
Kumka JE(1), Bauer CE(2).
Author information:
(1)Molecular and Cellular Biochemistry Department, Indiana University, Simon Hall
MSB, 212 S. Hawthorne Dr, Bloomington, IN, 47405-7003, USA.
(2)Molecular and Cellular Biochemistry Department, Indiana University, Simon Hall
MSB, 212 S. Hawthorne Dr, Bloomington, IN, 47405-7003, USA. bauer@indiana.edu.
BACKGROUND: FNR homologues constitute an important class of transcription factors
that control a wide range of anaerobic physiological functions in a number of
bacterial species. Since FNR homologues are some of the most pervasive
transcription factors, an understanding of their involvement in regulating
anaerobic gene expression in different species sheds light on evolutionary
similarity and differences. To address this question, we used a combination of
high throughput RNA-Seq and ChIP-Seq analysis to define the extent of the FnrL
regulon in Rhodobacter capsulatus and related our results to that of FnrL in
Rhodobacter sphaeroides and FNR in Escherichia coli.
RESULTS: Our RNA-seq results show that FnrL affects the expression of 807 genes,
which accounts for over 20 % of the Rba. capsulatus genome. ChIP-seq results
indicate that 42 of these genes are directly regulated by FnrL. Importantly, this
includes genes involved in the synthesis of the anoxygenic photosystem.
Similarly, FnrL in Rba. sphaeroides affects 24 % of its genome, however, only 171
genes are differentially expressed in common between two Rhodobacter species,
suggesting significant divergence in regulation.
CONCLUSIONS: We show that FnrL in Rba. capsulatus activates photosynthesis while
in Rba. sphaeroides FnrL regulation reported to involve repression of the
photosystem. This analysis highlights important differences in transcriptional
control of photosynthetic events and other metabolic processes controlled by FnrL
orthologues in closely related Rhodobacter species. Furthermore, we also show
that the E. coli FNR regulon has limited transcriptional overlap with the FnrL
regulons from either Rhodobacter species.
DOI: 10.1186/s12864-015-2162-4
PMCID: PMC4634722
PMID: 26537891 [Indexed for MEDLINE]
23. J Bacteriol. 2015 Oct 19;198(1):187-200. doi: 10.1128/JB.00658-15. Print 2016 Jan
1.
Identification of the PhoB Regulon and Role of PhoU in the Phosphate Starvation
Response of Caulobacter crescentus.
Lubin EA(1), Henry JT(2), Fiebig A(2), Crosson S(2), Laub MT(3).
Author information:
(1)Department of Biology, Massachusetts Institute of Technology, Cambridge,
Massachusetts, USA.
(2)Department of Biochemistry and Molecular Biology, University of Chicago,
Chicago, Illinois, USA.
(3)Department of Biology, Massachusetts Institute of Technology, Cambridge,
Massachusetts, USA Howard Hughes Medical Institute, Massachusetts Institute of
Technology, Cambridge, Massachusetts, USA laub@mit.edu.
An ability to sense and respond to changes in extracellular phosphate is critical
for the survival of most bacteria. For Caulobacter crescentus, which typically
lives in phosphate-limited environments, this process is especially crucial. Like
many bacteria, Caulobacter responds to phosphate limitation through a conserved
two-component signaling pathway called PhoR-PhoB, but the direct regulon of PhoB
in this organism is unknown. Here we used chromatin immunoprecipitation-DNA
sequencing (ChIP-Seq) to map the global binding patterns of the
phosphate-responsive transcriptional regulator PhoB under phosphate-limited and
-replete conditions. Combined with genome-wide expression profiling, our work
demonstrates that PhoB is induced to regulate nearly 50 genes under
phosphate-starved conditions. The PhoB regulon is comprised primarily of genes
known or predicted to help Caulobacter scavenge for and import inorganic
phosphate, including 15 different membrane transporters. We also investigated the
regulatory role of PhoU, a widely conserved protein proposed to coordinate
phosphate import with expression of the PhoB regulon by directly modulating the
histidine kinase PhoR. However, our studies show that it likely does not play
such a role in Caulobacter, as PhoU depletion has no significant effect on
PhoB-dependent gene expression. Instead, cells lacking PhoU exhibit striking
accumulation of large polyphosphate granules, suggesting that PhoU participates
in controlling intracellular phosphate metabolism.IMPORTANCE: The transcription
factor PhoB is widely conserved throughout the bacterial kingdom, where it helps
organisms respond to phosphate limitation by driving the expression of a battery
of genes. Most of what is known about PhoB and its target genes is derived from
studies of Escherichia coli. Our work documents the PhoB regulon in Caulobacter
crescentus, and comparison to the regulon in E. coli reveals significant
differences, highlighting the evolutionary plasticity of transcriptional
responses driven by highly conserved transcription factors. We also demonstrated
that the conserved protein PhoU, which is implicated in bacterial persistence,
does not regulate PhoB activity, as previously suggested. Instead, our results
favor a model in which PhoU affects intracellular phosphate accumulation,
possibly through the high-affinity phosphate transporter.
Copyright © 2015, American Society for Microbiology. All Rights Reserved.
DOI: 10.1128/JB.00658-15
PMCID: PMC4686198
PMID: 26483520 [Indexed for MEDLINE]
24. Microb Genom. 2015 Oct 13;1(4):e000031. doi: 10.1099/mgen.0.000031. eCollection
2015 Oct.
Genome-wide analysis of the response to nitric oxide in uropathogenic Escherichia
coli CFT073.
Mehta HH(1), Liu Y(2), Zhang MQ(2), Spiro S(1).
Author information:
(1)Department of Biological Sciences, University of Texas at Dallas, 800 W
Campbell Road, Richardson, TX 75080, USA.
(2)Department of Biological Sciences, University of Texas at Dallas, 800 W
Campbell Road, Richardson, TX 75080, USA; Center for Systems Biology, University
of Texas at Dallas, 800 W Campbell Road, Richardson, TX 75080, USA.
Uropathogenic Escherchia coli (UPEC) is the causative agent of urinary tract
infections. Nitric oxide (NO) is a toxic water-soluble gas that is encountered by
UPEC in the urinary tract. Therefore, UPEC probably requires mechanisms to
detoxify NO in the host environment. Thus far, flavohaemoglobin (Hmp), an NO
denitrosylase, is the only demonstrated NO detoxification system in UPEC. Here we
show that, in E. coli strain CFT073, the NADH-dependent NO reductase
flavorubredoxin (FlRd) also plays a major role in NO scavenging. We generated a
mutant that lacks all known and candidate NO detoxification pathways (Hmp, FlRd
and the respiratory nitrite reductase, NrfA). When grown and assayed
anaerobically, this mutant expresses an NO-inducible NO scavenging activity,
pointing to the existence of a novel detoxification mechanism. Expression of this
activity is inducible by both NO and nitrate, and the enzyme is
membrane-associated. Genome-wide transcriptional profiling of UPEC grown under
anaerobic conditions in the presence of nitrate (as a source of NO) highlighted
various aspects of the response of the pathogen to nitrate and NO. Several
virulence-associated genes are upregulated, suggesting that host-derived NO is a
potential regulator of UPEC virulence. Chromatin immunoprecipitation and
sequencing was used to evaluate the NsrR regulon in CFT073. We identified 49 NsrR
binding sites in promoter regions in the CFT073 genome, 29 of which were not
previously identified in E. coli K-12. NsrR may regulate some CFT073 genes that
do not have homologues in E. coli K-12.
DOI: 10.1099/mgen.0.000031
PMCID: PMC5320621
PMID: 28348816
25. PLoS One. 2015 Sep 21;10(9):e0138466. doi: 10.1371/journal.pone.0138466.
eCollection 2015.
ChIP-Seq Analysis of the σE Regulon of Salmonella enterica Serovar Typhimurium
Reveals New Genes Implicated in Heat Shock and Oxidative Stress Response.
Li J(1), Overall CC(2), Johnson RC(1), Jones MB(3), McDermott JE(2), Heffron
F(1), Adkins JN(2), Cambronne ED(1).
Author information:
(1)Department of Molecular Microbiology and Immunology, Oregon Health & Science
University, Portland, Oregon, United States of America.
(2)Biological Sciences Division, Pacific Northwest National Laboratory, Richland,
Washington, United States of America.
(3)Department of Infectious Diseases, J. Craig Venter Institute, Rockville,
Maryland, United States of America.
The alternative sigma factor σE functions to maintain bacterial homeostasis and
membrane integrity in response to extracytoplasmic stress by regulating thousands
of genes both directly and indirectly. The transcriptional regulatory network
governed by σE in Salmonella and E. coli has been examined using microarray,
however a genome-wide analysis of σE-binding sites in Salmonella has not yet been
reported. We infected macrophages with Salmonella Typhimurium over a select time
course. Using chromatin immunoprecipitation followed by high-throughput DNA
sequencing (ChIP-seq), 31 σE-binding sites were identified. Seventeen sites were
new, which included outer membrane proteins, a quorum-sensing protein, a cell
division factor, and a signal transduction modulator. The consensus sequence
identified for σE in vivo binding was similar to the one previously reported,
except for a conserved G and A between the -35 and -10 regions. One third of the
σE-binding sites did not contain the consensus sequence, suggesting there may be
alternative mechanisms by which σE modulates transcription. By dissecting direct
and indirect modes of σE-mediated regulation, we found that σE activates gene
expression through recognition of both canonical and reversed consensus sequence.
New σE regulated genes (greA, luxS, ompA and ompX) are shown to be involved in
heat shock and oxidative stress responses.
DOI: 10.1371/journal.pone.0138466
PMCID: PMC4577112
PMID: 26389830 [Indexed for MEDLINE]
26. Pathogens. 2015 Jun 26;4(3):422-30. doi: 10.3390/pathogens4030422.
Molecular Characterization of the Multidrug Resistant Escherichia coli ST131
Clone.
Schembri MA(1)(2), Zakour NL(3)(4), Phan MD(5)(6), Forde BM(7)(8), Stanton-Cook
M(9)(10), Beatson SA(11)(12).
Author information:
(1)School of Chemistry & Molecular Biosciences, the University of Queensland,
Brisbane, Queensland 4072, Australia. m.schembri@uq.edu.au.
(2)Australian Infectious Diseases Research Centre, the University of Queensland,
Brisbane, Queensland 4072, Australia. m.schembri@uq.edu.au.
(3)School of Chemistry & Molecular Biosciences, the University of Queensland,
Brisbane, Queensland 4072, Australia. n.benzakour@uq.edu.au.
(4)Australian Infectious Diseases Research Centre, the University of Queensland,
Brisbane, Queensland 4072, Australia. n.benzakour@uq.edu.au.
(5)School of Chemistry & Molecular Biosciences, the University of Queensland,
Brisbane, Queensland 4072, Australia. m.phan1@uq.edu.au.
(6)Australian Infectious Diseases Research Centre, the University of Queensland,
Brisbane, Queensland 4072, Australia. m.phan1@uq.edu.au.
(7)School of Chemistry & Molecular Biosciences, the University of Queensland,
Brisbane, Queensland 4072, Australia. b.forde@uq.edu.au.
(8)Australian Infectious Diseases Research Centre, the University of Queensland,
Brisbane, Queensland 4072, Australia. b.forde@uq.edu.au.
(9)School of Chemistry & Molecular Biosciences, the University of Queensland,
Brisbane, Queensland 4072, Australia. m.stantoncook@uq.edu.au.
(10)Australian Infectious Diseases Research Centre, the University of Queensland,
Brisbane, Queensland 4072, Australia. m.stantoncook@uq.edu.au.
(11)School of Chemistry & Molecular Biosciences, the University of Queensland,
Brisbane, Queensland 4072, Australia. s.beatson@uq.edu.au.
(12)Australian Infectious Diseases Research Centre, the University of Queensland,
Brisbane, Queensland 4072, Australia. s.beatson@uq.edu.au.
Escherichia coli ST131 is a recently emerged and globally disseminated multidrug
resistant clone associated with urinary tract and bloodstream infections in both
community and clinical settings. The most common group of ST131 strains are
defined by resistance to fluoroquinolones and possession of the type 1 fimbriae
fimH30 allele. Here we provide an update on our recent work describing the
globally epidemiology of ST131. We review the phylogeny of ST131 based on whole
genome sequence data and highlight the important role of recombination in the
evolution of this clonal lineage. We also summarize our findings on the virulence
of the ST131 reference strain EC958, and highlight the use of transposon directed
insertion-site sequencing to define genes associated with serum resistance and
essential features of its large antibiotic resistance plasmid pEC958.
DOI: 10.3390/pathogens4030422
PMCID: PMC4584265
PMID: 26131613
27. PLoS Genet. 2015 Jun 12;11(6):e1005289. doi: 10.1371/journal.pgen.1005289.
eCollection 2015 Jun.
The B. subtilis Accessory Helicase PcrA Facilitates DNA Replication through
Transcription Units.
Merrikh CN(1), Brewer BJ(2), Merrikh H(3).
Author information:
(1)Department of Microbiology, University of Washington, Seattle, Washington,
United States of America.
(2)Department of Genome Sciences, University of Washington, Seattle, Washington,
United States of America.
(3)Department of Microbiology, University of Washington, Seattle, Washington,
United States of America; Department of Genome Sciences, University of
Washington, Seattle, Washington, United States of America.
In bacteria the concurrence of DNA replication and transcription leads to
potentially deleterious encounters between the two machineries, which can occur
in either the head-on (lagging strand genes) or co-directional (leading strand
genes) orientations. These conflicts lead to replication fork stalling and can
destabilize the genome. Both eukaryotic and prokaryotic cells possess resolution
factors that reduce the severity of these encounters. Though Escherichia coli
accessory helicases have been implicated in the mitigation of head-on conflicts,
direct evidence of these proteins mitigating co-directional conflicts is lacking.
Furthermore, the endogenous chromosomal regions where these helicases act, and
the mechanism of recruitment, have not been identified. We show that the
essential Bacillus subtilis accessory helicase PcrA aids replication progression
through protein coding genes of both head-on and co-directional orientations, as
well as rRNA and tRNA genes. ChIP-Seq experiments show that co-directional
conflicts at highly transcribed rRNA, tRNA, and head-on protein coding genes are
major targets of PcrA activity on the chromosome. Partial depletion of PcrA
renders cells extremely sensitive to head-on conflicts, linking the essential
function of PcrA to conflict resolution. Furthermore, ablating PcrA's
ATPase/helicase activity simultaneously increases its association with conflict
regions, while incapacitating its ability to mitigate conflicts, and leads to
cell death. In contrast, disruption of PcrA's C-terminal RNA polymerase
interaction domain does not impact its ability to mitigate conflicts between
replication and transcription, its association with conflict regions, or cell
survival. Altogether, this work establishes PcrA as an essential factor involved
in mitigating transcription-replication conflicts and identifies chromosomal
regions where it routinely acts. As both conflicts and accessory helicases are
found in all domains of life, these results are broadly relevant.
DOI: 10.1371/journal.pgen.1005289
PMCID: PMC4466434
PMID: 26070154 [Indexed for MEDLINE]
28. PLoS One. 2015 Apr 15;10(4):e0122369. doi: 10.1371/journal.pone.0122369.
eCollection 2015.
Molecular characterization of a multidrug resistance IncF plasmid from the
globally disseminated Escherichia coli ST131 clone.
Phan MD(1), Forde BM(1), Peters KM(1), Sarkar S(1), Hancock S(1), Stanton-Cook
M(1), Ben Zakour NL(1), Upton M(2), Beatson SA(1), Schembri MA(1).
Author information:
(1)Australian Infectious Diseases Research Centre, School of Chemistry and
Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072,
Australia.
(2)Plymouth University Peninsula Schools of Medicine and Dentistry, Plymouth,
United Kingdom.
Escherichia coli sequence type 131 (E. coli ST131) is a recently emerged and
globally disseminated multidrug resistant clone associated with urinary tract and
bloodstream infections. Plasmids represent a major vehicle for the carriage of
antibiotic resistance genes in E. coli ST131. In this study, we determined the
complete sequence and performed a comprehensive annotation of pEC958, an IncF
plasmid from the E. coli ST131 reference strain EC958. Plasmid pEC958 is 135.6 kb
in size, harbours two replicons (RepFIA and RepFII) and contains 12 antibiotic
resistance genes (including the blaCTX-M-15 gene). We also carried out
hyper-saturated transposon mutagenesis and multiplexed transposon directed
insertion-site sequencing (TraDIS) to investigate the biology of pEC958. TraDIS
data showed that while only the RepFII replicon was required for pEC958
replication, the RepFIA replicon contains genes essential for its partitioning.
Thus, our data provides direct evidence that the RepFIA and RepFII replicons in
pEC958 cooperate to ensure their stable inheritance. The gene encoding the
antitoxin component (ccdA) of the post-segregational killing system CcdAB was
also protected from mutagenesis, demonstrating this system is active. Sequence
comparison with a global collection of ST131 strains suggest that IncF represents
the most common type of plasmid in this clone, and underscores the need to
understand its evolution and contribution to the spread of antibiotic resistance
genes in E. coli ST131.
DOI: 10.1371/journal.pone.0122369
PMCID: PMC4398462
PMID: 25875675 [Indexed for MEDLINE]
29. Nucleic Acids Res. 2015 Jul 27;43(13):e87. doi: 10.1093/nar/gkv300. Epub 2015 Apr
14.
cMonkey2: Automated, systematic, integrated detection of co-regulated gene
modules for any organism.
Reiss DJ(1), Plaisier CL(2), Wu WJ(2), Baliga NS(3).
Author information:
(1)Institute for Systems Biology, 401 Terry Ave N, Seattle, WA 98109, USA
dreiss@systemsbiology.org.
(2)Institute for Systems Biology, 401 Terry Ave N, Seattle, WA 98109, USA.
(3)Institute for Systems Biology, 401 Terry Ave N, Seattle, WA 98109, USA
Department of Microbiology, University of Washington, Seattle, WA 98103, USA
nbaliga@systemsbiology.org.
The cMonkey integrated biclustering algorithm identifies conditionally
co-regulated modules of genes (biclusters). cMonkey integrates various orthogonal
pieces of information which support evidence of gene co-regulation, and optimizes
biclusters to be supported simultaneously by one or more of these prior
constraints. The algorithm served as the cornerstone for constructing the first
global, predictive Environmental Gene Regulatory Influence Network (EGRIN) model
for a free-living cell, and has now been applied to many more organisms. However,
due to its computational inefficiencies, long run-time and complexity of various
input data types, cMonkey was not readily usable by the wider community. To
address these primary concerns, we have significantly updated the cMonkey
algorithm and refactored its implementation, improving its usability and
extendibility. These improvements provide a fully functioning and user-friendly
platform for building co-regulated gene modules and the tools necessary for their
exploration and interpretation. We show, via three separate analyses of data for
E. coli, M. tuberculosis and H. sapiens, that the updated algorithm and inclusion
of novel scoring functions for new data types (e.g. ChIP-seq and transcription
factor over-expression [TFOE]) improve discovery of biologically informative
co-regulated modules. The complete cMonkey2 software package, including source
code, is available at https://github.com/baliga-lab/cmonkey2.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic
Acids Research.
DOI: 10.1093/nar/gkv300
PMCID: PMC4513845
PMID: 25873626 [Indexed for MEDLINE]
30. Methods Mol Biol. 2015;1284:3-25. doi: 10.1007/978-1-4939-2444-8_1.
Epigenome profiling of specific plant cell types using a streamlined INTACT
protocol and ChIP-seq.
Wang D(1), Deal RB.
Author information:
(1)Department of Biology, O. Wayne Rollins Research Center, Emory University,
1510 Clifton Road NE, Atlanta, GA, 30322, USA.
Plants consist of many functionally specialized cell types, each with its own
unique epigenome, transcriptome, and proteome. Characterization of these cell
type-specific properties is essential to understanding cell fate specification
and the responses of individual cell types to the environment. In this chapter we
describe an approach to map chromatin features in specific cell types of
Arabidopsis thaliana using nuclei purification from individual cell types with
the INTACT method (isolation of nuclei tagged in specific cell types) followed by
chromatin immunoprecipitation and high-throughput sequencing (ChIP-seq). The
INTACT system employs two transgenes to generate affinity-labeled nuclei in the
cell type of interest, and these tagged nuclei can then be selectively purified
from tissue homogenates. The primary transgene encodes the nuclear tagging fusion
protein (NTF), which consists of a nuclear envelope-targeting domain, the green
fluorescent protein, and a biotin ligase recognition peptide, while the second
transgene encodes the E. coli biotin ligase (BirA), which selectively
biotinylates NTF. Expression of NTF and BirA in a specific cell type thus yields
nuclei that are coated with biotin and can be purified by virtue of their
affinity for streptavidin-coated magnetic beads. Compared with the original
INTACT nuclei purification protocol, the procedure presented here is greatly
simplified and shortened. After nuclei purification, we provide detailed
instructions for chromatin isolation, shearing, and immunoprecipitation. Finally,
we present a low input ChIP-seq library preparation protocol based on the
nano-ChIP-seq method of Adli and Bernstein, and we describe multiplex Illumina
sequencing of these libraries to produce high quality, cell type-specific
epigenome profiles at a relatively low cost. The procedures given here are
optimized for Arabidopsis but should be easily adaptable to other plant species.
DOI: 10.1007/978-1-4939-2444-8_1
PMID: 25757765 [Indexed for MEDLINE]
31. Genom Data. 2014 Dec;2:110-113.
Genome-Wide Mapping of the Distribution of CarD, RNAP σA, and RNAP β on the
Mycobacterium smegmatis Chromosome using Chromatin Immunoprecipitation
Sequencing.
Landick R(1), Krek A(2), Glickman MS(3), Socci ND(2), Stallings CL(4).
Author information:
(1)Departments of Biochemistry and Bacteriology, University of Wisconsin,
Madison, WI, 53706, USA.
(2)Bioinformatics Core, Memorial Sloan-Kettering Cancer Center, New York, NY,
10065, USA.
(3)Immunology Program, Sloan-Kettering Institute, and Division of Infectious
Diseases, Memorial Sloan-Kettering Cancer Center, New York, NY, 10065, USA.
(4)Department of Molecular Microbiology, Washington University School of
Medicine, St. Louis, MO 63110, USA.
CarD is an essential mycobacterial protein that binds the RNA polymerase (RNAP)
and affects the transcriptional profile of Mycobacterium smegmatis and
Mycobacterium tuberculosis (6). We predicted that CarD was directly regulating
RNAP function but our prior experiments had not determined at what stage of
transcription CarD was functioning and at which genes CarD interacted with the
RNAP. To begin to address these open questions, we performed Chromatin
Immunoprecipitation sequencing (ChIP-seq) to survey the distribution of CarD
throughout the M. smegmatis chromosome. The distribution of RNAP subunits β and
σA were also profiled. We expected that RNAP β would be present throughout
transcribed regions and RNAP σA would be predominantly enriched at promoters
based on work in Escherichia coli (3), however this had yet to be determined in
mycobacteria. The ChIP-seq analyses revealed that CarD was never present on the
genome in the absence of RNAP, was primarily associated with promoter regions,
and was highly correlated with the distribution of RNAP σA. The colocalization of
σA and CarD led us to propose that in vivo, CarD associates with RNAP initiation
complexes at most promoters and is therefore a global regulator of transcription
initiation. Here we describe in detail the data from the ChIP-seq experiments
associated with the study published by Srivastava and colleagues in the
Proceedings of the National Academy of Science in 2013 (5) as well as discuss the
findings from this dataset in relation to both CarD and mycobacterial
transcription as a whole. The ChIP-seq data have been deposited in the Gene
Expression Omnibus (GEO) database, www.ncbi.nlm.nih.gov/geo (accession no.
GSE48164).
DOI: 10.1016/j.gdata.2014.05.012
PMCID: PMC4115788
PMID: 25089258
32. BMC Microbiol. 2014 Aug 1;14:206. doi: 10.1186/s12866-014-0206-6.
Characterization and analysis of the Burkholderia pseudomallei BsaN virulence
regulon.
Chen Y, Schröder I, French CT, Jaroszewicz A, Yee XJ, Teh BE, Toesca IJ, Miller
JF, Gan YH(1).
Author information:
(1)Department of Biochemistry, Yong Loo Lin School of Medicine, National
University of Singapore, Singapore 117597, Singapore. yunn_hwen_gan@nuhs.edu.sg.
BACKGROUND: Burkholderia pseudomallei is a facultative intracellular pathogen and
the causative agent of melioidosis. A conserved type III secretion system (T3SS3)
and type VI secretion system (T6SS1) are critical for intracellular survival and
growth. The T3SS3 and T6SS1 genes are coordinately and hierarchically regulated
by a TetR-type regulator, BspR. A central transcriptional regulator of the BspR
regulatory cascade, BsaN, activates a subset of T3SS3 and T6SS1 loci.
RESULTS: To elucidate the scope of the BsaN regulon, we used RNAseq analysis to
compare the transcriptomes of wild-type B. pseudomallei KHW and a bsaN deletion
mutant. The 60 genes positively-regulated by BsaN include those that we had
previously identified in addition to a polyketide biosynthesis locus and genes
involved in amino acid biosynthesis. BsaN was also found to repress the
transcription of 51 genes including flagellar motility loci and those encoding
components of the T3SS3 apparatus. Using a promoter-lacZ fusion assay in E. coli,
we show that BsaN together with the chaperone BicA directly control the
expression of the T3SS3 translocon, effector and associated regulatory genes that
are organized into at least five operons (BPSS1516-BPSS1552). Using a mutagenesis
approach, a consensus regulatory motif in the promoter regions of BsaN-regulated
genes was shown to be essential for transcriptional activation.
CONCLUSIONS: BsaN/BicA functions as a central regulator of key virulence clusters
in B. pseudomallei within a more extensive network of genetic regulation. We
propose that BsaN/BicA controls a gene expression program that facilitates the
adaption and intracellular survival of the pathogen within eukaryotic hosts.
DOI: 10.1186/s12866-014-0206-6
PMCID: PMC4236580
PMID: 25085508 [Indexed for MEDLINE]
33. PLoS Genet. 2014 Apr 17;10(4):e1004288. doi: 10.1371/journal.pgen.1004288.
eCollection 2014 Apr.
Genome-wide profiling of yeast DNA:RNA hybrid prone sites with DRIP-chip.
Chan YA(1), Aristizabal MJ(2), Lu PY(2), Luo Z(3), Hamza A(1), Kobor MS(4),
Stirling PC(5), Hieter P(6).
Author information:
(1)Michael Smith Laboratories, University of British Columbia, Vancouver, Canada.
(2)Centre for Molecular Medicine and Therapeutics, Child and Family Research
Institute, Vancouver, Canada.
(3)Wine Research Centre, University of British Columbia, Vancouver, Canada.
(4)Centre for Molecular Medicine and Therapeutics, Child and Family Research
Institute, Vancouver, Canada; Department of Medical Genetics, University of
British Columbia, Vancouver, Canada.
(5)Department of Medical Genetics, University of British Columbia, Vancouver,
Canada; Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, Canada.
(6)Michael Smith Laboratories, University of British Columbia, Vancouver, Canada;
Department of Medical Genetics, University of British Columbia, Vancouver,
Canada.
DNA:RNA hybrid formation is emerging as a significant cause of genome instability
in biological systems ranging from bacteria to mammals. Here we describe the
genome-wide distribution of DNA:RNA hybrid prone loci in Saccharomyces cerevisiae
by DNA:RNA immunoprecipitation (DRIP) followed by hybridization on tiling
microarray. These profiles show that DNA:RNA hybrids preferentially accumulated
at rDNA, Ty1 and Ty2 transposons, telomeric repeat regions and a subset of open
reading frames (ORFs). The latter are generally highly transcribed and have high
GC content. Interestingly, significant DNA:RNA hybrid enrichment was also
detected at genes associated with antisense transcripts. The expression of
antisense-associated genes was also significantly altered upon overexpression of
RNase H, which degrades the RNA in hybrids. Finally, we uncover mutant-specific
differences in the DRIP profiles of a Sen1 helicase mutant, RNase H deletion
mutant and Hpr1 THO complex mutant compared to wild type, suggesting different
roles for these proteins in DNA:RNA hybrid biology. Our profiles of DNA:RNA
hybrid prone loci provide a resource for understanding the properties of
hybrid-forming regions in vivo, extend our knowledge of hybrid-mitigating
enzymes, and contribute to models of antisense-mediated gene regulation. A
summary of this paper was presented at the 26th International Conference on Yeast
Genetics and Molecular Biology, August 2013.
DOI: 10.1371/journal.pgen.1004288
PMCID: PMC3990523
PMID: 24743342 [Indexed for MEDLINE]
34. Methods. 2014 Jun 1;67(3):294-303. doi: 10.1016/j.ymeth.2014.03.006. Epub 2014
Mar 17.
Inferring gene regulatory networks by integrating ChIP-seq/chip and transcriptome
data via LASSO-type regularization methods.
Qin J(1), Hu Y(2), Xu F(1), Yalamanchili HK(1), Wang J(3).
Author information:
(1)Department of Biochemistry, LKS Faculty of Medicine, The University of Hong
Kong, Hong Kong Special Administrative Region, China; Shenzhen Institute of
Research & Innovation, The University of Hong Kong, Shenzhen, China.
(2)Department of Mathematics, Zhejiang University, Hangzhou, China; Department of
Applied Mathematics, The Hong Kong Polytechnic University, Hung Hom, Kowloon,
Hong Kong Special Administrative Region, China.
(3)Department of Biochemistry, LKS Faculty of Medicine, The University of Hong
Kong, Hong Kong Special Administrative Region, China; Shenzhen Institute of
Research & Innovation, The University of Hong Kong, Shenzhen, China; Centre for
Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong
Special Administrative Region, China. Electronic address: junwen@hku.hk.
Inferring gene regulatory networks from gene expression data at whole genome
level is still an arduous challenge, especially in higher organisms where the
number of genes is large but the number of experimental samples is small. It is
reported that the accuracy of current methods at genome scale significantly drops
from Escherichia coli to Saccharomyces cerevisiae due to the increase in number
of genes. This limits the applicability of current methods to more complex
genomes, like human and mouse. Least absolute shrinkage and selection operator
(LASSO) is widely used for gene regulatory network inference from gene expression
profiles. However, the accuracy of LASSO on large genomes is not satisfactory. In
this study, we apply two extended models of LASSO, L0 and L1/2 regularization
models to infer gene regulatory network from both high-throughput gene expression
data and transcription factor binding data in mouse embryonic stem cells (mESCs).
We find that both the L0 and L1/2 regularization models significantly outperform
LASSO in network inference. Incorporating interactions between transcription
factors and their targets remarkably improved the prediction accuracy. Current
study demonstrates the efficiency and applicability of these two models for gene
regulatory network inference from integrative omics data in large genomes. The
applications of the two models will facilitate biologists to study the gene
regulation of higher model organisms in a genome-wide scale.
Copyright © 2014 Elsevier Inc. All rights reserved.
DOI: 10.1016/j.ymeth.2014.03.006
PMID: 24650566 [Indexed for MEDLINE]
35. BMC Syst Biol. 2013;7 Suppl 6:S13. doi: 10.1186/1752-0509-7-S6-S13. Epub 2013 Dec
13.
Inferring functional transcription factor-gene binding pairs by integrating
transcription factor binding data with transcription factor knockout data.
Yang TH, Wu WS.
BACKGROUND: Chromatin immunoprecipitation (ChIP) experiments are now the most
comprehensive experimental approaches for mapping the binding of transcription
factors (TFs) to their target genes. However, ChIP data alone is insufficient for
identifying functional binding target genes of TFs for two reasons. First, there
is an inherent high false positive/negative rate in ChIP-chip or ChIP-seq
experiments. Second, binding signals in the ChIP data do not necessarily imply
functionality.
METHODS: It is known that ChIP-chip data and TF knockout (TFKO) data reveal
complementary information on gene regulation. While ChIP-chip data can provide
TF-gene binding pairs, TFKO data can provide TF-gene regulation pairs. Therefore,
we propose a novel network approach for identifying functional TF-gene binding
pairs by integrating the ChIP-chip data with the TFKO data. In our method, a
TF-gene binding pair from the ChIP-chip data is regarded to be functional if it
also has high confident curated TFKO TF-gene regulatory relation or deduced
hypostatic TF-gene regulatory relation.
RESULTS AND CONCLUSIONS: We first validated our method on a gathered ground truth
set. Then we applied our method to the ChIP-chip data to identify functional
TF-gene binding pairs. The biological significance of our identified functional
TF-gene binding pairs was shown by assessing their functional enrichment, the
prevalence of protein-protein interaction, and expression coherence. Our results
outperformed the results of three existing methods across all measures. And our
identified functional targets of TFs also showed statistical significance over
the randomly assigned TF-gene pairs. We also showed that our method is dataset
independent and can apply to ChIP-seq data and the E. coli genome. Finally, we
provided an example showing the biological applicability of our notion.
DOI: 10.1186/1752-0509-7-S6-S13
PMCID: PMC4029220
PMID: 24565265 [Indexed for MEDLINE]
36. PLoS Genet. 2013;9(10):e1003834. doi: 10.1371/journal.pgen.1003834. Epub 2013 Oct
3.
The serum resistome of a globally disseminated multidrug resistant uropathogenic
Escherichia coli clone.
Phan MD(1), Peters KM, Sarkar S, Lukowski SW, Allsopp LP, Gomes Moriel D, Achard
ME, Totsika M, Marshall VM, Upton M, Beatson SA, Schembri MA.
Author information:
(1)Australian Infectious Diseases Research Centre, The University of Queensland,
Brisbane, Queensland, Australia ; School of Chemistry and Molecular Biosciences,
The University of Queensland, Brisbane, Queensland, Australia.
Escherichia coli ST131 is a globally disseminated, multidrug resistant clone
responsible for a high proportion of urinary tract and bloodstream infections.
The rapid emergence and successful spread of E. coli ST131 is strongly associated
with antibiotic resistance; however, this phenotype alone is unlikely to explain
its dominance amongst multidrug resistant uropathogens circulating worldwide in
hospitals and the community. Thus, a greater understanding of the molecular
mechanisms that underpin the fitness of E. coli ST131 is required. In this study,
we employed hyper-saturated transposon mutagenesis in combination with
multiplexed transposon directed insertion-site sequencing to define the essential
genes required for in vitro growth and the serum resistome (i.e. genes required
for resistance to human serum) of E. coli EC958, a representative of the
predominant E. coli ST131 clonal lineage. We identified 315 essential genes in E.
coli EC958, 231 (73%) of which were also essential in E. coli K-12. The serum
resistome comprised 56 genes, the majority of which encode membrane proteins or
factors involved in lipopolysaccharide (LPS) biosynthesis. Targeted mutagenesis
confirmed a role in serum resistance for 46 (82%) of these genes. The murein
lipoprotein Lpp, along with two lipid A-core biosynthesis enzymes WaaP and WaaG,
were most strongly associated with serum resistance. While LPS was the main
resistance mechanism defined for E. coli EC958 in serum, the enterobacterial
common antigen and colanic acid also impacted on this phenotype. Our analysis
also identified a novel function for two genes, hyxA and hyxR, as minor
regulators of O-antigen chain length. This study offers novel insight into the
genetic make-up of E. coli ST131, and provides a framework for future research on
E. coli and other Gram-negative pathogens to define their essential gene
repertoire and to dissect the molecular mechanisms that enable them to survive in
the bloodstream and cause disease.
DOI: 10.1371/journal.pgen.1003834
PMCID: PMC3789825
PMID: 24098145 [Indexed for MEDLINE]
Conflict of interest statement: The authors have declared that no competing
interests exist.
37. BMC Genomics. 2013 Sep 22;14:638. doi: 10.1186/1471-2164-14-638.
"Non-canonical protein-DNA interactions identified by ChIP are not artifacts":
response.
Schindler D(1), Waldminghaus T.
Author information:
(1)LOEWE-Center for Synthetic Microbiology (SYNMIKRO), Philipps-Universität
Marburg, Hans-Meerwein-Str, 6, D-35043, Marburg, Germany.
Torsten.Waldminghaus@synmikro.uni-marburg.de.
BACKGROUND: Studies of protein association with DNA on a genome wide scale are
possible through methods like ChIP-Chip or ChIP-Seq. Massive problems with false
positive signals in our own experiments motivated us to revise the standard
ChIP-Chip protocol. Analysis of chromosome wide binding of the alternative sigma
factor σ³² in Escherichia coli with this new protocol resulted in detection of
only a subset of binding sites found in a previous study by Wade and colleagues.
We suggested that the remainder of binding sites detected in the previous study
are likely to be false positives. In a recent article the Wade group claimed that
our conclusion is wrong and that the disputed sites are genuine σ³² binding
sites. They further claimed that the non-detection of these sites in our study
was due to low data quality.
RESULTS/DISCUSSION: We respond to the criticism of Wade and colleagues and
discuss some general questions of ChIP-based studies. We outline why the quality
of our data is sufficient to derive meaningful results. Specific points are: (i)
the modifications we introduced into the standard ChIP-Chip protocol do not
necessarily result in a low dynamic range, (ii) correlation between ChIP-Chip
replicates should not be calculated based on the whole data set as done in
transcript analysis, (iii) control experiments are essential for identifying
false positives. Suggestions are made how ChIP-based methods could be further
optimized and which alternative approaches can be used to strengthen conclusions.
CONCLUSION: We appreciate the ongoing discussion about the ChIP-Chip method and
hope that it helps other scientist to analyze and interpret their results. The
modifications we introduced into the ChIP-Chip protocol are a first step towards
reducing false positive signals but there is certainly potential for further
optimization. The discussion about the σ³² binding sites in question highlights
the need for alternative approaches and further investigation of appropriate
methods for verification.
DOI: 10.1186/1471-2164-14-638
PMCID: PMC3870955
PMID: 24053571 [Indexed for MEDLINE]
38. BMC Bioinformatics. 2013 Jul 17;14:227. doi: 10.1186/1471-2105-14-227.
A fast weak motif-finding algorithm based on community detection in graphs.
Jia C(1), Carson MB, Yu J.
Author information:
(1)School of Computer and Information Technology, Beijing Jiaotong University,
Beijing 100044, China. cyjia@bjtu.edu.cn
BACKGROUND: Identification of transcription factor binding sites (also called
'motif discovery') in DNA sequences is a basic step in understanding genetic
regulation. Although many successful programs have been developed, the problem is
far from being solved on account of diversity in gene expression/regulation and
the low specificity of binding sites. State-of-the-art algorithms have their own
constraints (e.g., high time or space complexity for finding long motifs, low
precision in identification of weak motifs, or the OOPS constraint: one
occurrence of the motif instance per sequence) which limit their scope of
application.
RESULTS: In this paper, we present a novel and fast algorithm we call TFBSGroup.
It is based on community detection from a graph and is used to discover long and
weak (l,d) motifs under the ZOMOPS constraint (zero, one or multiple
occurrence(s) of the motif instance(s) per sequence), where l is the length of a
motif and d is the maximum number of mutations between a motif instance and the
motif itself. Firstly, TFBSGroup transforms the (l, d) motif search in sequences
to focus on the discovery of dense subgraphs within a graph. It identifies these
subgraphs using a fast community detection method for obtaining coarse-grained
candidate motifs. Next, it greedily refines these candidate motifs towards the
true motif within their own communities. Empirical studies on synthetic (l, d)
samples have shown that TFBSGroup is very efficient (e.g., it can find true (18,
6), (24, 8) motifs within 30 seconds). More importantly, the algorithm has
succeeded in rapidly identifying motifs in a large data set of prokaryotic
promoters generated from the Escherichia coli database RegulonDB. The algorithm
has also accurately identified motifs in ChIP-seq data sets for 12 mouse
transcription factors involved in ES cell pluripotency and self-renewal.
CONCLUSIONS: Our novel heuristic algorithm, TFBSGroup, is able to quickly
identify nearly exact matches for long and weak (l, d) motifs in DNA sequences
under the ZOMOPS constraint. It is also capable of finding motifs in real
applications. The source code for TFBSGroup can be obtained from
http://bioinformatics.bioengr.uic.edu/TFBSGroup/.
DOI: 10.1186/1471-2105-14-227
PMCID: PMC3726413
PMID: 23865838 [Indexed for MEDLINE]
39. PLoS One. 2013 May 23;8(5):e64688. doi: 10.1371/journal.pone.0064688. Print 2013.
Genome-wide analysis of the salmonella Fis regulon and its regulatory mechanism
on pathogenicity islands.
Wang H(1), Liu B, Wang Q, Wang L.
Author information:
(1)TEDA School of Biological Sciences and Biotechnology, Nankai University, TEDA,
Tianjin, PR China.
Fis, one of the most important nucleoid-associated proteins, functions as a
global regulator of transcription in bacteria that has been comprehensively
studied in Escherichia coli K12. Fis also influences the virulence of Salmonella
enterica and pathogenic E. coli by regulating their virulence genes, however, the
relevant mechanism is unclear. In this report, using combined RNA-seq and
chromatin immunoprecipitation (ChIP)-seq technologies, we first identified 1646
Fis-regulated genes and 885 Fis-binding targets in the S. enterica serovar
Typhimurium, and found a Fis regulon different from that in E. coli. Fis has been
reported to contribute to the invasion ability of S. enterica. By using cell
infection assays, we found it also enhances the intracellular replication ability
of S. enterica within macrophage cell, which is of central importance for the
pathogenesis of infections. Salmonella pathogenicity islands (SPI)-1 and SPI-2
are crucial for the invasion and survival of S. enterica in host cells. Using
mutation and overexpression experiments, real-time PCR analysis, and
electrophoretic mobility shift assays, we demonstrated that Fis regulates 63 of
the 94 Salmonella pathogenicity island (SPI)-1 and SPI-2 genes, by three
regulatory modes: i) binds to SPI regulators in the gene body or in upstream
regions; ii) binds to SPI genes directly to mediate transcriptional activation of
themselves and downstream genes; iii) binds to gene encoding OmpR which affects
SPI gene expression by controlling SPI regulators SsrA and HilD. Our results
provide new insights into the impact of Fis on SPI genes and the pathogenicity of
S. enterica.
DOI: 10.1371/journal.pone.0064688
PMCID: PMC3662779
PMID: 23717649 [Indexed for MEDLINE]
40. DNA Res. 2013 Aug;20(4):325-38. doi: 10.1093/dnares/dst013. Epub 2013 Apr 11.
High-resolution mapping of in vivo genomic transcription factor binding sites
using in situ DNase I footprinting and ChIP-seq.
Chumsakul O(1), Nakamura K, Kurata T, Sakamoto T, Hobman JL, Ogasawara N, Oshima
T, Ishikawa S.
Author information:
(1)Graduate School of Biological Sciences, Nara Institute of Science and
Technology, 8916-5, Takayama, Ikoma, Nara 630-0192, Japan.
Accurate identification of the DNA-binding sites of transcription factors and
other DNA-binding proteins on the genome is crucial to understanding their
molecular interactions with DNA. Here, we describe a new method: Genome
Footprinting by high-throughput sequencing (GeF-seq), which combines in vivo
DNase I digestion of genomic DNA with ChIP coupled with high-throughput
sequencing. We have determined the in vivo binding sites of a Bacillus subtilis
global regulator, AbrB, using GeF-seq. This method shows that exact DNA-binding
sequences, which were protected from in vivo DNase I digestion, were resolved at
a comparable resolution to that achieved by in vitro DNase I footprinting, and
this was simply attained without the necessity of prediction by peak-calling
programs. Moreover, DNase I digestion of the bacterial nucleoid resolved the
closely positioned AbrB-binding sites, which had previously appeared as one peak
in ChAP-chip and ChAP-seq experiments. The high-resolution determination of
AbrB-binding sites using GeF-seq enabled us to identify bipartite TGGNA motifs in
96% of the AbrB-binding sites. Interestingly, in a thousand binding sites with
very low-binding intensities, single TGGNA motifs were also identified. Thus,
GeF-seq is a powerful method to elucidate the molecular mechanism of target
protein binding to its cognate DNA sequences.
DOI: 10.1093/dnares/dst013
PMCID: PMC3738160
PMID: 23580539 [Indexed for MEDLINE]
41. Integr Biol (Camb). 2013 May;5(5):796-806. doi: 10.1039/c3ib20221f.
Efficient transcription initiation in bacteria: an interplay of protein-DNA
interaction parameters.
Djordjevic M(1).
Author information:
(1)Institute of Physiology and Biochemistry, Faculty of Biology, University of
Belgrade, Studentski trg 16, 11000 Belgrade, Serbia. dmarko@bio.bg.ac.rs
As the first, and usually rate-limiting, step of transcription initiation,
bacterial RNA polymerase (RNAP) binds to double stranded DNA (dsDNA) and
subsequently opens the two strands of DNA (the open complex formation). The rate
determining step in the open complex formation is opening of a short (6 bp) DNA
called the -10 region, which interacts with RNAP in both dsDNA and single
stranded (ssDNA) forms. Accordingly, formation of the open complex depends on
(physically independent) domains of RNAP that interact with ssDNA and dsDNA, as
well as on parameters of DNA melting and sequences of -10 regions. We here aim to
understand how these different interactions are mutually related to ensure
efficient open complex formation. To achieve this, we use a recently developed
biophysical model of transcription initiation, which allows the calculation of
the kinetic parameters of transcription initiation on the scale of whole genome.
We consequently investigate kinetic properties of sequences derived from all E.
coli intergenic regions, and from more than 300 experimentally confirmed E. coli
σ(70) promoters. We find that interaction specificities of σ(70) DNA binding
domains reduce the number of sequences where RNAP binds strongly, but forms the
open complex too slowly to achieve functional transcription (so-called poised
promoters). However, we find that, despite this reduction, there is still a
significant number of such poised promoters in the intergenic regions, which may
provide a major source of false positives in genome-wide searches of
transcription start sites. Furthermore, we surprisingly find that sequences of
-10 regions of the functional promoters increase the extent of RNAP poising,
which we interpret in terms of an extension of a recently proposed model of
promoter recognition ('mix-and-match model') to kinetic parameters. Overall, our
results allow better understanding of the design of σ(70) DNA binding domains and
promoter sequences, and place a fundamental limit on accuracy of methods for
promoter detection that are based on strong RNAP binding (e.g. ChIP-chip).
DOI: 10.1039/c3ib20221f
PMID: 23511241 [Indexed for MEDLINE]
42. Nucleic Acids Res. 2013 Apr;41(8):4549-64. doi: 10.1093/nar/gkt148. Epub 2013 Mar
6.
A comparison of dense transposon insertion libraries in the Salmonella serovars
Typhi and Typhimurium.
Barquist L(1), Langridge GC, Turner DJ, Phan MD, Turner AK, Bateman A, Parkhill
J, Wain J, Gardner PP.
Author information:
(1)Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK.
lb14@sanger.ac.uk
Salmonella Typhi and Typhimurium diverged only ∼50 000 years ago, yet have very
different host ranges and pathogenicity. Despite the availability of multiple
whole-genome sequences, the genetic differences that have driven these changes in
phenotype are only beginning to be understood. In this study, we use
transposon-directed insertion-site sequencing to probe differences in gene
requirements for competitive growth in rich media between these two closely
related serovars. We identify a conserved core of 281 genes that are required for
growth in both serovars, 228 of which are essential in Escherichia coli. We are
able to identify active prophage elements through the requirement for their
repressors. We also find distinct differences in requirements for genes involved
in cell surface structure biogenesis and iron utilization. Finally, we
demonstrate that transposon-directed insertion-site sequencing is not only
applicable to the protein-coding content of the cell but also has sufficient
resolution to generate hypotheses regarding the functions of non-coding RNAs
(ncRNAs) as well. We are able to assign probable functions to a number of
cis-regulatory ncRNA elements, as well as to infer likely differences in
trans-acting ncRNA regulatory networks.
DOI: 10.1093/nar/gkt148
PMCID: PMC3632133
PMID: 23470992 [Indexed for MEDLINE]
43. Nucleic Acids Res. 2013 Mar 1;41(5):e59. doi: 10.1093/nar/gks1342. Epub 2012 Dec
28.
Nanobody(R)-based chromatin immunoprecipitation/micro-array analysis for
genome-wide identification of transcription factor DNA binding sites.
Nguyen-Duc T(1), Peeters E, Muyldermans S, Charlier D, Hassanzadeh-Ghassabeh G.
Author information:
(1)Research group of Cellular and Molecular Immunology, Vrije Universiteit
Brussel, Pleinlaan 2, B-1050 Brussel, Belgium.
Nanobodies® are single-domain antibody fragments derived from camelid heavy-chain
antibodies. Because of their small size, straightforward production in
Escherichia coli, easy tailoring, high affinity, specificity, stability and
solubility, nanobodies® have been exploited in various biotechnological
applications. A major challenge in the post-genomics and post-proteomics era is
the identification of regulatory networks involving nucleic acid-protein and
protein-protein interactions. Here, we apply a nanobody® in chromatin
immunoprecipitation followed by DNA microarray hybridization (ChIP-chip) for
genome-wide identification of DNA-protein interactions. The Lrp-like regulator
Ss-LrpB, arguably one of the best-studied specific transcription factors of the
hyperthermophilic archaeon Sulfolobus solfataricus, was chosen for this
proof-of-principle nanobody®-assisted ChIP. Three distinct Ss-LrpB-specific
nanobodies®, each interacting with a different epitope, were generated for ChIP.
Genome-wide ChIP-chip with one of these nanobodies® identified the
well-established Ss-LrpB binding sites and revealed several unknown target
sequences. Furthermore, these ChIP-chip profiles revealed auxiliary operator
sites in the open reading frame of Ss-lrpB. Our work introduces nanobodies® as a
novel class of affinity reagents for ChIP. Taking into account the unique
characteristics of nanobodies®, in particular, their short generation time,
nanobody®-based ChIP is expected to further streamline ChIP-chip and ChIP-Seq
experiments, especially in organisms with no (or limited) possibility of genetic
manipulation.
DOI: 10.1093/nar/gks1342
PMCID: PMC3597646
PMID: 23275538 [Indexed for MEDLINE]
44. MBio. 2012 Dec 11;3(6). pii: e00407-12. doi: 10.1128/mBio.00407-12.
Crp is a global regulator of antibiotic production in streptomyces.
Gao C(1), Hindra, Mulder D, Yin C, Elliot MA.
Author information:
(1)Department of Biology and Institute for Infectious Disease Research, McMaster
University, Hamilton, Ontario, Canada.
Cyclic AMP receptor protein (Crp) is a transcription regulator controlling
diverse cellular processes in many bacteria. In Streptomyces coelicolor, it is
well established that Crp plays a critical role in spore germination and colony
development. Here, we demonstrate that Crp is a key regulator of secondary
metabolism and antibiotic production in S. coelicolor and show that it may
additionally coordinate precursor flux from primary to secondary metabolism. We
found that crp deletion adversely affected the synthesis of three
well-characterized antibiotics in S. coelicolor: actinorhodin (Act),
undecylprodigiosin (Red), and calcium-dependent antibiotic (CDA). Using chromatin
immunoprecipitation-microarray (ChIP-chip) assays, we determined that eight (out
of 22) secondary metabolic clusters encoded by S. coelicolor contained
Crp-associated sites. We followed the effect of Crp induction using transcription
profiling analyses and found secondary metabolic genes to be significantly
affected: included in this Crp-dependent group were genes from six of the
clusters identified in the ChIP-chip experiments. Overexpressing Crp in a panel
of Streptomyces species led to enhanced antibiotic synthesis and new metabolite
production, suggesting that Crp control over secondary metabolism is broadly
conserved in the streptomycetes and that Crp overexpression could serve as a
powerful tool for unlocking the chemical potential of these organisms. IMPORTANCE
Streptomyces produces a remarkably diverse array of secondary metabolites,
including many antibiotics. In recent years, genome sequencing has revealed that
these products represent only a small proportion of the total secondary
metabolite potential of Streptomyces. There is, therefore, considerable interest
in discovering ways to stimulate the production of new metabolites. Here, we show
that Crp (the classical regulator of carbon catabolite repression in Escherichia
coli) is a master regulator of secondary metabolism in Streptomyces. It binds to
eight of 22 secondary metabolic gene clusters in the Streptomyces coelicolor
genome and directly affects the expression of six of these. Deletion of crp in S.
coelicolor leads to dramatic reductions in antibiotic levels, while Crp
overexpression enhances antibiotic production. We find that the
antibiotic-stimulatory capacity of Crp extends to other streptomycetes, where its
overexpression activates the production of "cryptic" metabolites that are not
otherwise seen in the corresponding wild-type strain.
DOI: 10.1128/mBio.00407-12
PMCID: PMC3520106
PMID: 23232715 [Indexed for MEDLINE]
45. Mol Microbiol. 2013 Feb;87(3):526-38. doi: 10.1111/mmi.12111. Epub 2012 Dec 19.
ChIP-seq and transcriptome analysis of the OmpR regulon of Salmonella enterica
serovars Typhi and Typhimurium reveals accessory genes implicated in host
colonization.
Perkins TT(1), Davies MR, Klemm EJ, Rowley G, Wileman T, James K, Keane T,
Maskell D, Hinton JC, Dougan G, Kingsley RA.
Author information:
(1)The Wellcome Trust Sanger Institute, The Wellcome Trust Genome Campus,
Hinxton, Cambridge CB10 1SA, UK.
OmpR is a multifunctional DNA binding regulator with orthologues in many enteric
bacteria that exhibits classical regulator activity as well as
nucleoid-associated protein-like characteristics. In the enteric pathogen
Salmonella enterica, using chromatin immunoprecipitation of OmpR:FLAG and
nucleotide sequencing, 43 putative OmpR binding sites were identified in
S. enterica serovar Typhi, 22 of which were associated with OmpR-regulated genes.
Mutation of a sequence motif (TGTWACAW) that was associated with the putative
OmpR binding sites abrogated binding of OmpR:6×His to the tviA upstream region. A
core set of 31 orthologous genes were found to exhibit OmpR-dependent expression
in both S. Typhi and S. Typhimurium. S. Typhimurium-encoded orthologues of two
divergently transcribed OmpR-regulated operons (SL1068-71 and SL1066-67) had a
putative OmpR binding site in the inter-operon region in S. Typhi, and were
characterized using in vitro and in vivo assays. These operons are widely
distributed within S. enterica but absent from the closely related Escherichia
coli. SL1066 and SL1067 were required for growth on N-acetylmuramic acid as a
sole carbon source. SL1068-71 exhibited sequence similarity to sialic acid uptake
systems and contributed to colonization of the ileum and caecum in the
streptomycin-pretreated mouse model of colitis.
© 2012 Blackwell Publishing Ltd.
DOI: 10.1111/mmi.12111
PMCID: PMC3586657
PMID: 23190111 [Indexed for MEDLINE]
46. Nucleic Acids Res. 2012 Dec;40(22):e175. doi: 10.1093/nar/gks771. Epub 2012 Aug
25.
Improved predictions of transcription factor binding sites using physicochemical
features of DNA.
Maienschein-Cline M(1), Dinner AR, Hlavacek WS, Mu F.
Author information:
(1)Department of Chemistry, University of Chicago, Chicago, IL 60637, USA.
Typical approaches for predicting transcription factor binding sites (TFBSs)
involve use of a position-specific weight matrix (PWM) to statistically
characterize the sequences of the known sites. Recently, an alternative
physicochemical approach, called SiteSleuth, was proposed. In this approach, a
linear support vector machine (SVM) classifier is trained to distinguish TFBSs
from background sequences based on local chemical and structural features of DNA.
SiteSleuth appears to generally perform better than PWM-based methods. Here, we
improve the SiteSleuth approach by considering both new physicochemical features
and algorithmic modifications. New features are derived from Gibbs energies of
amino acid-DNA interactions and hydroxyl radical cleavage profiles of DNA.
Algorithmic modifications consist of inclusion of a feature selection step, use
of a nonlinear kernel in the SVM classifier, and use of a consensus-based
post-processing step for predictions. We also considered SVM classification based
on letter features alone to distinguish performance gains from use of SVM-based
models versus use of physicochemical features. The accuracy of each of the
variant methods considered was assessed by cross validation using data available
in the RegulonDB database for 54 Escherichia coli TFs, as well as by experimental
validation using published ChIP-chip data available for Fis and Lrp.
DOI: 10.1093/nar/gks771
PMCID: PMC3526315
PMID: 22923524 [Indexed for MEDLINE]
47. Gene. 2012 Oct 25;508(2):221-8. doi: 10.1016/j.gene.2012.07.064. Epub 2012 Aug 4.
Under-representation of intrinsic terminators across bacterial genomic islands:
Rho as a principal regulator of xenogenic DNA expression.
Mitra A(1), Nagaraja V.
Author information:
(1)Department of Microbiology and Cell Biology, Indian Institute of Science,
Bangalore, India.
Two transcription termination mechanisms - intrinsic and Rho-dependent - have
evolved in bacteria. The Rho factor occurs in most bacterial lineages, and has
been hypothesized to play a global regulatory role. Genome-wide studies using
microarray, 2D-gel electrophoresis and ChIP-chip provided evidence that Rho
serves to silence transcription from horizontally acquired genes and prophages in
Escherichia coli K-12, implicating the factor to be a part of the "cellular
immune mechanism" protecting against deleterious phages and aberrant gene
expression from acquired xenogenic DNA. We have investigated this model by
adopting an alternate in silico approach and have extended the study to other
species. Our analysis shows that several genomic islands across diverse phyla
have under-representation of intrinsic terminators, similar to that
experimentally observed in E. coli K-12. This implies that Rho-dependent
termination is the predominant process operational in these islands and that
silencing of foreign DNA is a conserved function of Rho. From the present
analysis, it is evident that horizontally acquired islands have lost intrinsic
terminators to facilitate Rho-dependent termination. These results underscore the
importance of Rho as a conserved, genome-wide sentinel that regulates potentially
toxic xenogenic DNA.
Copyright © 2012 Elsevier B.V. All rights reserved.
DOI: 10.1016/j.gene.2012.07.064
PMID: 22890136 [Indexed for MEDLINE]
48. Gut Microbes. 2012 Mar-Apr;3(2):93-103. doi: 10.4161/gmic.19578. Epub 2012 Mar 1.
Signature tagged mutagenesis in the functional genetic analysis of
gastrointestinal pathogens.
Cummins J(1), Gahan CG.
Author information:
(1)Alimentary Pharmabiotic Centre, University College Cork, Cork, Ireland.
Signature tagged mutagenesis is a genetic approach that was developed to identify
novel bacterial virulence factors. It is a negative selection method in which
unique identification tags allow analysis of pools of mutants in mixed
populations. The approach is particularly well suited to functional genetic
analysis of the gastrointestinal phase of infection in foodborne pathogens and
has the capacity to guide the development of novel vaccines and therapeutics. In
this review we outline the technical principles underpinning signature-tagged
mutagenesis as well as novel sequencing-based approaches for transposon mutant
identification such as TraDIS (transposon directed insertion-site sequencing). We
also provide an analysis of screens that have been performed in gastrointestinal
pathogens which are a global health concern (Escherichia coli, Listeria
monocytogenes, Helicobacter pylori, Vibrio cholerae and Salmonella enterica). The
identification of key virulence loci through the use of signature tagged
mutagenesis in mice and relevant larger animal models is discussed.
DOI: 10.4161/gmic.19578
PMCID: PMC3370953
PMID: 22555467 [Indexed for MEDLINE]
49. Mol Endocrinol. 2012 Feb;26(2):349-57. doi: 10.1210/me.2011-1080. Epub 2011 Dec
29.
Research resource: The estrogen receptor α cistrome defined by DamIP.
Xiao R(1), Sun D, Ayers S, Xi Y, Li W, Baxter JD, Moore DD.
Author information:
(1)Department of Molecular and Cellular Biology, Baylor College of Medicine,
Houston, Texas 77030, USA.
Gene expression is tightly regulated by transcription factors and cofactors that
function by directly or indirectly interacting with DNA of the genome.
Understanding how and where these proteins bind provides essential information to
uncover genetic regulatory mechanisms. We have developed a new method to study
DNA-protein interaction in vivo called DNA adenine methyltransferase (Dam)IP,
which is based on fusing a protein of interest to a mutant form of Dam from
Escherichia coli. We showed previously that DamIP can efficiently identify in
vivo binding sites of Dam-tethered human estrogen receptor (hER)α. In current
study, we present the cistrome of hERα determined by DamIP and high throughput
sequencing (DamIP-seq). The DamIP-seq-defined hERα cistrome identifies many new
binding regions and overlaps with those determined by chromatin
immunoprecipitation (ChIP)-chip or ChIP-seq. Elements uniquely identified by
DamIP-seq include a unique class of elements that show low, but persistent, hERα
binding when reexamined by conventional ChIP. In contrast, DamIP-seq fails to
detect some elements with very transient hERα binding. The methyl-adenine
modifications introduced by Dam are stable and do not decrease over 12 d. In
summary, the current study provides both an alternate view of the hERα cistrome
to further understand the mechanism of hERα-mediated transcription and a new tool
to explore other transcriptional factors and cofactors that is very different
from conventional ChIP.
DOI: 10.1210/me.2011-1080
PMCID: PMC3384088
PMID: 22207717 [Indexed for MEDLINE]
50. J Bacteriol. 2011 Jun;193(12):3090-9. doi: 10.1128/JB.00086-11. Epub 2011 Apr 22.
Transcription factor GreA contributes to resolving promoter-proximal pausing of
RNA polymerase in Bacillus subtilis cells.
Kusuya Y(1), Kurokawa K, Ishikawa S, Ogasawara N, Oshima T.
Author information:
(1)Graduate School of Information Science, Nara Institute of Science and
Technology, 8916-5, Takayama, Ikoma, Nara 630-0192, Japan. taku@bs.naist.jp
Bacterial Gre factors associate with RNA polymerase (RNAP) and stimulate
intrinsic cleavage of the nascent transcript at the active site of RNAP.
Biochemical and genetic studies to date have shown that Escherichia coli Gre
factors prevent transcriptional arrest during elongation and enhance
transcription fidelity. Furthermore, Gre factors participate in the stimulation
of promoter escape and the suppression of promoter-proximal pausing during the
beginning of RNA synthesis in E. coli. Although Gre factors are conserved in
general bacteria, limited functional studies have been performed in bacteria
other than E. coli. In this investigation, ChAP-chip analysis (chromatin affinity
precipitation coupled with DNA microarray) was conducted to visualize the
distribution of Bacillus subtilis GreA on the chromosome and to determine the
effects of GreA inactivation on core RNAP trafficking. Our data show that GreA is
uniformly distributed in the transcribed region from the promoter to coding
region with core RNAP, and its inactivation induces RNAP accumulation at many
promoter or promoter-proximal regions. Based on these findings, we propose that
GreA would constantly associate with core RNAP during transcriptional initiation
and elongation and resolves its stalling at promoter or promoter-proximal
regions, thus contributing to the even distribution of RNAP along the promoter
and coding regions in B. subtilis cells.
DOI: 10.1128/JB.00086-11
PMCID: PMC3133182
PMID: 21515770 [Indexed for MEDLINE]
51. J Bacteriol. 2011 Apr;193(7):1771-6. doi: 10.1128/JB.01292-10. Epub 2011 Jan 28.
Retrospective application of transposon-directed insertion site sequencing to a
library of signature-tagged mini-Tn5Km2 mutants of Escherichia coli O157:H7
screened in cattle.
Eckert SE(1), Dziva F, Chaudhuri RR, Langridge GC, Turner DJ, Pickard DJ, Maskell
DJ, Thomson NR, Stevens MP.
Author information:
(1)Roslin Institute and Royal (Dick) School of Veterinary Studies, University of
Edinburgh, Bush Farm Road, Roslin, Midlothian EH25 9RG, United Kingdom.
Massively parallel sequencing of transposon-flanking regions assigned the
genotype and fitness score to 91% of Escherichia coli O157:H7 mutants previously
screened in cattle by signature-tagged mutagenesis (STM). The method obviates the
limitations of STM and markedly extended the functional annotation of the
prototype E. coli O157:H7 genome without further animal use.
DOI: 10.1128/JB.01292-10
PMCID: PMC3067669
PMID: 21278291 [Indexed for MEDLINE]
52. PLoS Comput Biol. 2010 Nov 18;6(11):e1001007. doi: 10.1371/journal.pcbi.1001007.
Using sequence-specific chemical and structural properties of DNA to predict
transcription factor binding sites.
Bauer AL(1), Hlavacek WS, Unkefer PJ, Mu F.
Author information:
(1)Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos
National Laboratory, Los Alamos, New Mexico, United States of America.
An important step in understanding gene regulation is to identify the DNA binding
sites recognized by each transcription factor (TF). Conventional approaches to
prediction of TF binding sites involve the definition of consensus sequences or
position-specific weight matrices and rely on statistical analysis of DNA
sequences of known binding sites. Here, we present a method called SiteSleuth in
which DNA structure prediction, computational chemistry, and machine learning are
applied to develop models for TF binding sites. In this approach, binary
classifiers are trained to discriminate between true and false binding sites
based on the sequence-specific chemical and structural features of DNA. These
features are determined via molecular dynamics calculations in which we consider
each base in different local neighborhoods. For each of 54 TFs in Escherichia
coli, for which at least five DNA binding sites are documented in RegulonDB, the
TF binding sites and portions of the non-coding genome sequence are mapped to
feature vectors and used in training. According to cross-validation analysis and
a comparison of computational predictions against ChIP-chip data available for
the TF Fis, SiteSleuth outperforms three conventional approaches: Match, MATRIX
SEARCH, and the method of Berg and von Hippel. SiteSleuth also outperforms
QPMEME, a method similar to SiteSleuth in that it involves a learning algorithm.
The main advantage of SiteSleuth is a lower false positive rate.
DOI: 10.1371/journal.pcbi.1001007
PMCID: PMC2987836
PMID: 21124945 [Indexed for MEDLINE]
53. Nucleic Acids Res. 2011 Mar;39(5):1656-65. doi: 10.1093/nar/gkq848. Epub 2010 Nov
4.
Sensitive and accurate identification of protein-DNA binding events in ChIP-chip
assays using higher order derivative analysis.
Barrett CL(1), Cho BK, Palsson BO.
Author information:
(1)Department of Bioengineering, University of California, San Diego, La Jolla,
CA 92093, USA. cbarrett@ucsd.edu
Immuno-precipitation of protein-DNA complexes followed by microarray
hybridization is a powerful and cost-effective technology for discovering
protein-DNA binding events at the genome scale. It is still an unresolved
challenge to comprehensively, accurately and sensitively extract binding event
information from the produced data. We have developed a novel strategy composed
of an information-preserving signal-smoothing procedure, higher order derivative
analysis and application of the principle of maximum entropy to address this
challenge. Importantly, our method does not require any input parameters to be
specified by the user. Using genome-scale binding data of two Escherichia coli
global transcription regulators for which a relatively large number of
experimentally supported sites are known, we show that ∼90% of known sites were
resolved to within four probes, or ∼88 bp. Over half of the sites were resolved
to within two probes, or ∼38 bp. Furthermore, we demonstrate that our strategy
delivers significant quantitative and qualitative performance gains over
available methods. Such accurate and sensitive binding site resolution has
important consequences for accurately reconstructing transcriptional regulatory
networks, for motif discovery, for furthering our understanding of local and
non-local factors in protein-DNA interactions and for extending the usefulness
horizon of the ChIP-chip platform.
DOI: 10.1093/nar/gkq848
PMCID: PMC3061075
PMID: 21051353 [Indexed for MEDLINE]
54. J Bacteriol. 2010 Nov;192(21):5778-87. doi: 10.1128/JB.00489-10. Epub 2010 Sep 3.
RNA polymerase trafficking in Bacillus subtilis cells.
Ishikawa S(1), Oshima T, Kurokawa K, Kusuya Y, Ogasawara N.
Author information:
(1)Graduate School of Information Science, Nara Institute of Science and
Technology, 8916-5, Takayama, Ikoma, Nara 630-0192, Japan.
To obtain insight into the in vivo dynamics of RNA polymerase (RNAP) on the
Bacillus subtilis genome, we analyzed the distribution of the σ(A) and β'
subunits of RNAP and the NusA elongation factor on the genome in exponentially
growing cells using chromatin affinity precipitation coupled with gene chip
mapping (ChAP-chip). In contrast to Escherichia coli RNAP, which often
accumulates at the promoter-proximal region, B. subtilis RΝΑP is evenly
distributed from the promoter to the coding sequences. This finding suggests
that, in general, B. subtilis RNAP recruited to the promoter promptly
translocates away from the promoter to form the elongation complex and proceeds
without intragenic transcription attenuation. We detected RNAP accumulation in
the promoter-proximal regions of some genes, most of which can be identified as
transcription attenuation systems in the leader region. Our findings suggest that
the differences in RNAP behavior between E. coli and B. subtilis during
initiation and elongation steps might result in distinct strategies for
postinitiation control of transcription. The E. coli mechanism involves trapping
at the promoter and promoter-proximal pausing of RNAP in addition to
transcription attenuation, whereas transcription attenuation in leader sequences
is mainly employed in B. subtilis.
DOI: 10.1128/JB.00489-10
PMCID: PMC2953687
PMID: 20817769 [Indexed for MEDLINE]
55. J Bacteriol. 2010 Sep;192(18):4720-31. doi: 10.1128/JB.00591-10. Epub 2010 Jul
16.
Pmr, a histone-like protein H1 (H-NS) family protein encoded by the IncP-7
plasmid pCAR1, is a key global regulator that alters host function.
Yun CS(1), Suzuki C, Naito K, Takeda T, Takahashi Y, Sai F, Terabayashi T,
Miyakoshi M, Shintani M, Nishida H, Yamane H, Nojiri H.
Author information:
(1)Biotechnology Research Center and Agricultural Bioinformatics Research Unit,
Graduate School of Agricultural and Life Sciences, the University of Tokyo, 1-1-1
Yayoi, Bunkyo-ku, Tokyo 113-8657, Japan.
Histone-like protein H1 (H-NS) family proteins are nucleoid-associated proteins
(NAPs) conserved among many bacterial species. The IncP-7 plasmid pCAR1 is
transmissible among various Pseudomonas strains and carries a gene encoding the
H-NS family protein, Pmr. Pseudomonas putida KT2440 is a host of pCAR1, which
harbors five genes encoding the H-NS family proteins PP_1366 (TurA), PP_3765
(TurB), PP_0017 (TurC), PP_3693 (TurD), and PP_2947 (TurE). Quantitative reverse
transcription-PCR (qRT-PCR) demonstrated that the presence of pCAR1 does not
affect the transcription of these five genes and that only pmr, turA, and turB
were primarily transcribed in KT2440(pCAR1). In vitro pull-down assays revealed
that Pmr strongly interacted with itself and with TurA, TurB, and TurE.
Transcriptome comparisons of the pmr disruptant, KT2440, and KT2440(pCAR1)
strains indicated that pmr disruption had greater effects on the host
transcriptome than did pCAR1 carriage. The transcriptional levels of some genes
that increased with pCAR1 carriage, such as the mexEF-oprN efflux pump genes and
parI, reverted with pmr disruption to levels in pCAR1-free KT2440.
Transcriptional levels of putative horizontally acquired host genes were not
altered by pCAR1 carriage but were altered by pmr disruption. Identification of
genome-wide Pmr binding sites by ChAP-chip (chromatin affinity purification
coupled with high-density tiling chip) analysis demonstrated that Pmr
preferentially binds to horizontally acquired DNA regions. The Pmr binding sites
overlapped well with the location of the genes differentially transcribed
following pmr disruption on both the plasmid and the chromosome. Our findings
indicate that Pmr is a key factor in optimizing gene transcription on pCAR1 and
the host chromosome.
DOI: 10.1128/JB.00591-10
PMCID: PMC2937398
PMID: 20639326 [Indexed for MEDLINE]
56. Nucleic Acids Res. 2010 Sep;38(17):5735-45. doi: 10.1093/nar/gkq363. Epub 2010
May 11.
Identification of {beta}-catenin binding regions in colon cancer cells using
ChIP-Seq.
Bottomly D(1), Kyler SL, McWeeney SK, Yochum GS.
Author information:
(1)Oregon Clinical and Translational Research Institute, Oregon Health and
Science University, Portland, OR, USA.
Deregulation of the Wnt/β-catenin signaling pathway is a hallmark of colon
cancer. Mutations in the adenomatous polyposis coli (APC) gene occur in the vast
majority of colorectal cancers and are an initiating event in cellular
transformation. Cells harboring mutant APC contain elevated levels of the
β-catenin transcription coactivator in the nucleus which leads to abnormal
expression of genes controlled by β-catenin/T-cell factor 4 (TCF4) complexes.
Here, we use chromatin immunoprecipitation coupled with massively parallel
sequencing (ChIP-Seq) to identify β-catenin binding regions in HCT116 human colon
cancer cells. We localized 2168 β-catenin enriched regions using a concordance
approach for integrating the output from multiple peak alignment algorithms.
Motif discovery algorithms found a core TCF4 motif (T/A-T/A-C-A-A-A-G), an
extended TCF4 motif (A/T/G-C/G-T/A-T/A-C-A-A-A-G) and an AP-1 motif
(T-G-A-C/T-T-C-A) to be significantly represented in β-catenin enriched regions.
Furthermore, 417 regions contained both TCF4 and AP-1 motifs. Genes associated
with TCF4 and AP-1 motifs bound β-catenin, TCF4 and c-Jun in vivo and were
activated by Wnt signaling and serum growth factors. Our work provides evidence
that Wnt/β-catenin and mitogen signaling pathways intersect directly to regulate
a defined set of target genes.
DOI: 10.1093/nar/gkq363
PMCID: PMC2943592
PMID: 20460455 [Indexed for MEDLINE]
57. Mol Microbiol. 2009 Dec;74(5):1169-86. doi: 10.1111/j.1365-2958.2009.06929.x.
Epub 2009 Oct 19.
The H-NS-like protein StpA represses the RpoS (sigma 38) regulon during
exponential growth of Salmonella Typhimurium.
Lucchini S(1), McDermott P, Thompson A, Hinton JC.
Author information:
(1)Institute of Food Research, Colney Lane, Norwich, NR4 7UA, UK.
sacha.lucchini@bbsrc.ac.uk
StpA is a paralogue of the nucleoid-associated protein H-NS that is conserved in
a range of enteric bacteria and had no known function in Salmonella Typhimurium.
We show that 5% of the Salmonella genome is regulated by StpA, which contrasts
with the situation in Escherichia coli where deletion of stpA only had minor
effects on gene expression. The StpA-dependent genes of S. Typhimurium are a
specific subset of the H-NS regulon that are predominantly under the positive
control of sigma(38) (RpoS), CRP-cAMP and PhoP. Regulation by StpA varied with
growth phase; StpA controlled sigma(38) levels at mid-exponential phase by
preventing inappropriate activation of sigma(38) during rapid bacterial growth.
In contrast, StpA only activated the CRP-cAMP regulon during late exponential
phase. ChIP-chip analysis revealed that StpA binds to PhoP-dependent genes but
not to most genes of the CRP-cAMP and sigma(38) regulons. In fact, StpA
indirectly regulates sigma(38)-dependent genes by enhancing sigma(38) turnover by
repressing the anti-adaptor protein rssC. We discovered that StpA is essential
for the dynamic regulation of sigma(38) in response to increased glucose levels.
Our findings identify StpA as a novel growth phase-specific regulator that plays
an important physiological role by linking sigma(38) levels to nutrient
availability.
DOI: 10.1111/j.1365-2958.2009.06929.x
PMID: 19843227 [Indexed for MEDLINE]
58. Nucleic Acids Res. 2009 Jan;37(Database issue):D464-70. doi: 10.1093/nar/gkn751.
Epub 2008 Oct 30.
EcoCyc: a comprehensive view of Escherichia coli biology.
Keseler IM(1), Bonavides-Martínez C, Collado-Vides J, Gama-Castro S, Gunsalus RP,
Johnson DA, Krummenacker M, Nolan LM, Paley S, Paulsen IT, Peralta-Gil M,
Santos-Zavaleta A, Shearer AG, Karp PD.
Author information:
(1)SRI International, 333 Ravenswood Ave., Menlo Park, CA 94025, USA.
EcoCyc (http://EcoCyc.org) provides a comprehensive encyclopedia of Escherichia
coli biology. EcoCyc integrates information about the genome, genes and gene
products; the metabolic network; and the regulatory network of E. coli. Recent
EcoCyc developments include a new initiative to represent and curate all types of
E. coli regulatory processes such as attenuation and regulation by small RNAs.
EcoCyc has started to curate Gene Ontology (GO) terms for E. coli and has made a
dataset of E. coli GO terms available through the GO Web site. The curation and
visualization of electron transfer processes has been significantly improved.
Other software and Web site enhancements include the addition of tracks to the
EcoCyc genome browser, in particular a type of track designed for the display of
ChIP-chip datasets, and the development of a comparative genome browser. A new
Genome Omics Viewer enables users to paint omics datasets onto the full E. coli
genome for analysis. A new advanced query page guides users in interactively
constructing complex database queries against EcoCyc. A Macintosh version of
EcoCyc is now available. A series of Webinars is available to instruct users in
the use of EcoCyc.
DOI: 10.1093/nar/gkn751
PMCID: PMC2686493
PMID: 18974181 [Indexed for MEDLINE]
59. Bioinformatics. 2008 Oct 15;24(20):2288-95. doi: 10.1093/bioinformatics/btn420.
Epub 2008 Aug 12.
MotifVoter: a novel ensemble method for fine-grained integration of generic motif
finders.
Wijaya E(1), Yiu SM, Son NT, Kanagasabai R, Sung WK.
Author information:
(1)School of Computing, National University of Singapore, Singapore.
MOTIVATION: Locating transcription factor binding sites (motifs) is a key step in
understanding gene regulation. Based on Tompa's benchmark study, the performance
of current de novo motif finders is far from satisfactory (with sensitivity
<or=0.222 and precision <or=0.307). The same study also shows that no motif
finder performs consistently well over all datasets. Hence, it is not clear which
finder one should use for a given dataset. To address this issue, a class of
algorithms called ensemble methods have been proposed. Though the existing
ensemble methods overall perform better than stand-alone motif finders, the
improvement gained is not substantial. Our study reveals that these methods do
not fully exploit the information obtained from the results of individual
finders, resulting in minor improvement in sensitivity and poor precision.
RESULTS: In this article, we identify several key observations on how to utilize
the results from individual finders and design a novel ensemble method,
MotifVoter, to predict the motifs and binding sites. Evaluations on 186 datasets
show that MotifVoter can locate more than 95% of the binding sites found by its
component motif finders. In terms of sensitivity and precision, MotifVoter
outperforms stand-alone motif finders and ensemble methods significantly on
Tompa's benchmark, Escherichia coli, and ChIP-Chip datasets. MotifVoter is
available online via a web server with several biologist-friendly features.
DOI: 10.1093/bioinformatics/btn420
PMID: 18697768 [Indexed for MEDLINE]
60. BMC Bioinformatics. 2008 May 6;9:228. doi: 10.1186/1471-2105-9-228.
Inferring the role of transcription factors in regulatory networks.
Veber P(1), Guziolowski C, Le Borgne M, Radulescu O, Siegel A.
Author information:
(1)Centre INRIA Rennes Bretagne Atlantique, IRISA, Rennes, France.
philippe.veber@googlemail.com
BACKGROUND: Expression profiles obtained from multiple perturbation experiments
are increasingly used to reconstruct transcriptional regulatory networks, from
well studied, simple organisms up to higher eukaryotes. Admittedly, a key
ingredient in developing a reconstruction method is its ability to integrate
heterogeneous sources of information, as well as to comply with practical
observability issues: measurements can be scarce or noisy. In this work, we show
how to combine a network of genetic regulations with a set of expression
profiles, in order to infer the functional effect of the regulations, as inducer
or repressor. Our approach is based on a consistency rule between a network and
the signs of variation given by expression arrays.
RESULTS: We evaluate our approach in several settings of increasing complexity.
First, we generate artificial expression data on a transcriptional network of E.
coli extracted from the literature (1529 nodes and 3802 edges), and we estimate
that 30% of the regulations can be annotated with about 30 profiles. We
additionally prove that at most 40.8% of the network can be inferred using our
approach. Second, we use this network in order to validate the predictions
obtained with a compendium of real expression profiles. We describe a filtering
algorithm that generates particularly reliable predictions. Finally, we apply our
inference approach to S. cerevisiae transcriptional network (2419 nodes and 4344
interactions), by combining ChIP-chip data and 15 expression profiles. We are
able to detect and isolate inconsistencies between the expression profiles and a
significant portion of the model (15% of all the interactions). In addition, we
report predictions for 14.5% of all interactions.
CONCLUSION: Our approach does not require accurate expression levels nor times
series. Nevertheless, we show on both data, real and artificial, that a
relatively small number of perturbation experiments are enough to determine a
significant portion of regulatory effects. This is a key practical asset compared
to statistical methods for network reconstruction. We demonstrate that our
approach is able to provide accurate predictions, even when the network is
incomplete and the data is noisy.
DOI: 10.1186/1471-2105-9-228
PMCID: PMC2422845
PMID: 18460200 [Indexed for MEDLINE]
61. J Bioinform Comput Biol. 2007 Apr;5(2B):549-60.
Intragenic promotor-like sites in the genome of Escherichia coli discovery and
functional implication.
Tutukina MN(1), Shavkunov KS, Masulis IS, Ozoline ON.
Author information:
(1)Institute of Cell Biophysics Russian Academy of Sciences, Pushchino, Moscow
region, 142290, Russia. maria@icb.psn.ru
Mapping of putative promoters within the entire genome of Escherichia coli (E.
coli) by means of pattern-recognition software PlatProm revealed several thousand
of sites having high probability to perform promoter function. Along with the
expected promoters located upstream of coding sequences, PlatProm identified more
than a thousand potential promoters for antisense transcription and several
hundred very similar signals within coding sequences having the same direction
with the genes. Since recently developed ChIP-chip technology also testified the
presence of intragenic RNA polymerase binding sites, such distribution of
putative promoters is likely to be a general biological phenomenon reflecting yet
undiscovered regulatory events. Here, we provide experimental evidences that two
internal promoters are recognized by bacterial RNA polymerase. One of them is
located within the hns coding sequence and may initiate synthesis of RNA from the
antisense strand. Another one is found within the overlapping genes htgA/yaaW and
may control the production of a shortened mRNA or an RNA-product complementary to
mRNA of yaaW. Both RNA-products can form secondary structures with free energies
of folding close to those of small regulatory RNAs (sRNAs) of the same length.
Folding propensity of known sRNAs was further compared with that of antisense
RNAs (aRNAs), predicted in E. coli as well as in Salmonella typhimurium (S.
typhimurium). Slightly lower stability observed for aRNAs assumes that their
structural compactness may be less significant for biological function.
PMID: 17636861 [Indexed for MEDLINE]