Améliorateur | Groupe d'isolants Jiaxing

Nature Genetics volume 54, pages 1919-1932 (2022)Citer cet article

20 000 accès

26 citations

55 Altmétrique

Détails des métriques

On ne sait toujours pas pourquoi l'épuisement aigu du CTCF (facteur de liaison CCCTC) et de la cohésine n'affecte que marginalement l'expression de la plupart des gènes malgré une perturbation substantielle du repliement du génome tridimensionnel (3D) au niveau des domaines et des boucles structurelles. Pour résoudre cette énigme, nous avons utilisé le profilage Micro-C haute résolution et les transcriptions naissantes dans les cellules souches embryonnaires de souris. Nous constatons que les interactions amplificateur-promoteur (E – P) sont largement insensibles à la déplétion aiguë (3 heures) du CTCF, de la cohésine ou du WAPL. YY1 a été proposé comme régulateur structurel des boucles E – P, mais l'épuisement aigu de YY1 a également eu des effets minimes sur les boucles E – P, la transcription et le repliement du génome 3D. De manière frappante, l’imagerie d’une seule molécule de cellules vivantes a révélé que l’épuisement de la cohésine réduisait la liaison du facteur de transcription (TF) à la chromatine. Ainsi, bien que CTCF, cohésine, WAPL ou YY1 ne soient pas nécessaires au maintien à court terme de la plupart des interactions E – P et de l'expression des gènes, nos résultats suggèrent que la cohésine pourrait permettre aux TF de rechercher et de lier leurs cibles plus efficacement.

Les tests basés sur la capture de conformation chromosomique à haut débit (Hi-C) ont transformé notre compréhension du repliement du génome 3D1,2. Sur la base de ces études, nous pouvons distinguer au moins trois niveaux de repliement du génome 3D. Premièrement, le génome est séparé en compartiments A et B, qui correspondent en grande partie aux segments de chromatine actifs et inactifs, respectivement, et apparaissent sous la forme d'un motif en forme de plaid dans les cartes de contact Hi-C3. Deuxièmement, les protéines CTCF et cohésine aident à replier le génome en domaines d’association topologique (TAD)4,5 et en boucles structurelles de chromatine6, probablement par extrusion de boucles d’ADN7,8. Troisièmement, à une échelle beaucoup plus fine, les éléments transcriptionnels s'engagent dans des interactions chromatiniennes à longue portée telles que les interactions E – P et promoteur-promoteur (P – P) pour former des domaines locaux 9,10,11.

Des expériences élégantes combinant une déplétion protéique aiguë du CTCF, de la cohésine et des protéines régulatrices de la cohésine avec des approches Hi-C ou d'imagerie ont révélé le rôle du CTCF et de la cohésine dans la régulation des deux premiers niveaux : TAD et compartiments12,13,14,15,16. Cependant, Hi-C est inefficace pour capturer le troisième niveau de repliement du génome 3D : les interactions E – P / P – P à petite échelle importantes sur le plan transcriptionnel9,17,18. Notre compréhension du rôle du CTCF et de la cohésine dans la régulation de l'expression des gènes provient principalement d'expériences génétiques axées sur quelques loci de développement19,20,21. Ainsi, il reste difficile de savoir si, quand, où et comment le CTCF/cohésine régule les interactions E-P/P-P et l’expression des gènes.

Nous avons récemment signalé que Micro-C pouvait résoudre efficacement le repliement ultra-fin du génome 3D à la résolution des nucléosomes , y compris les interactions E – P / P – P 9, 17. Dans la présente étude, nous avons utilisé Micro-C, le séquençage par immunoprécipitation de la chromatine (ChIP-seq), le séquençage de l'ARN total (RNA-seq) et l'ARN-seq24 naissant pour étudier systématiquement l'épuisement aigu du CTCF, du RAD21 (sous-unité de cohésine), du WAPL ( déchargeur de cohésine) ou YY1 (une protéine structurelle putative25) affecte les interactions chromatiniennes régulatrices des gènes et la transcription dans les cellules souches embryonnaires de souris (MESC). Enfin, en se concentrant sur la dynamique de YY1, nous avons découvert un rôle inattendu de la cohésine dans la facilitation de la liaison du TF.

Notre étude précédente utilisait Micro-C pour révéler que la structure du génome 3D à petite échelle est bien corrélée à l'activité transcriptionnelle, formant des « points » ou des « boucles » (voir Méthodes pour la terminologie) aux intersections E – P et P – P9. Dans la présente étude, nous avons identifié plus de 75 000 boucles statistiquement significatives dans les MESC à l'aide du nouvel appelant de boucle Moustache26 (Fig. 1a) ou Chromosight27 (Extended Data Fig. 1a), soit environ 2,5 fois plus que dans notre rapport précédent9,26 et environ 4. × plus que Hi-C26,28 (données étendues, Fig. 1b). Grâce à l'analyse de l'état local de la chromatine au niveau des ancres de boucle (données étendues, Fig. 1c, d), nous avons sous-classé ces boucles en boucles de cohésine (~ 13 735), boucles E – P (~ 20 369), boucles P – P (~ 7 433) et polycomb. -contacts associés (~ 700) (Fig. 1a, b), avec une taille médiane d'environ 160 Ko pour les boucles de cohésine et d'environ 100 Ko pour les boucles E – P / P – P (Données étendues, Fig. 1e).

75,190 chromatin dots/loops, subclassified into four primary types (Mustache loop caller26; see Methods and Supplementary Note). b, Probability distribution of loop strength for cohesin, E–P, P–P and random loops. Chromatin loop numbers are shown on the left. The box plot indicates the quartiles for the loop strength score distribution (min. = lower end of line, Q1 = lower bound of box, Q2 = line in box, Q3 = higher bound of box and max. = higher end of line). Genome-wide averaged contact signals (aggregate peak analysis (APA)) are plotted on the right. The contact map was normalized by matrix balancing and distance (Obs/Exp), with positive enrichment in red and negative signal in blue, shown as the diverging color map with the gradient of normalized contact enrichment in log10. The ratio of contact enrichment for the center pixels is annotated within each plot. This color scheme and normalization method are used for normalized matrices throughout the manuscript unless otherwise mentioned. Loop anchors are annotated as ‘C’ for CTCF/cohesin, ‘P’ for promoter and ‘E’ for enhancer. Asterisks denote a P < 10−16 using two-sided Wilcoxon’s signed-rank test. The data are presented in the same format and color scheme throughout the manuscript unless otherwise indicated (n = 37 biological replicates)9. c, Genome-wide averaged transcript counts for nascent transcript profiling. Genes are grouped into high, medium and low expression levels based on nascent RNA-seq data (gene body) and rescaled to the same length from TSS (transcription start site) to poly(adenylation) cleavage site (PAS) or TES (transcription end site) on the x axis. d, Rank-ordered distribution of loop strength against gene expression for cohesin, E–P and P–P loops. Gene expression levels for the corresponding chromatin loop were calculated by averaging the genes with TSSs located ±5 kb around the loop anchors. Loop strength was obtained from the same analysis shown in b. The distribution for each loop type was fitted and smoothed by LOESS (locally estimated scatterplot smoothing) regression. Error bands indicate fitted curve ± s.e.m. with 95% confidence interval (CI). e, APAs are plotted by paired E–P/P–P loops and sorted by the level of nascent transcription into high, mid and low levels./p>90% of CTCF peaks and 60% of cohesin peaks are significantly decreased on loss of CTCF (Padj < 0.05; Fig. 3e and Extended Data Fig. 3g). Despite the substantial loss of cohesin peaks, biochemical fractionation experiments show that the fraction of RAD21 associated with chromatin remains fairly constant 3 h after CTCF degradation (Extended Data Fig. 2f, green box). Thus, our results are in line with the widely accepted conclusion that CTCF positions cohesin43. On the other hand, loss of cohesin affects a subset of CTCF binding (Fig. 3c,d)13, resulting in ~20% reduction in the number of CTCF peaks (Fig. 3e) and a slight decrease in its global chromatin association (Extended Data Fig. 2f, blue box)./p> 0.1 µm2 s−1), which can be separated further into slow (Dslow ~0.1–2 µm2 s−1) and fast moving (Dfast > 2 µm2 s−1). Scale bar, 1 μm. f, Aggregate likelihood of diffusive YY1 molecules. Top, bar graph showing fractions of YY1 binned into bound, slow- and fast-diffusing subpopulations. Bottom, YY1 diffusion coefficient estimation by regular Brownian motion with marginalized localization errors. g, Western blots of cytoplasmic (Cyt) and nuclear proteins dissociating from chromatin at increasing salt concentrations (Extended Data Fig. 2b). A subpopulation (~30%) of YY1 stays on chromatin, resisting 1 M washes. Ins, insoluble pellet after sonication; Son, sonicated, solubilized chromatin. Percentage of total shows the signal intensity of the indicated fractions divided by the total signal intensity. Anti-histone 2B controls for chromatin integrity during fractionation. h, FRAP analysis of YY1 bleached with a square spot. Error bars are fitted curve ± s.e.m. with 95% CI. i, Slow-SPT measuring YY1 residence time. Individual molecules were tracked at 100-ms exposure time to blur fast-moving molecules into the background and capture stable binding. The unbinding rate is obtained by fitting a model to the molecules’ survival curve. Each datapoint indicates the unbinding rate of YY1 molecules in a single cell. The box plot shows quartiles of data. Error bars are mean ± s.d. j. Slow-SPT measures YY1’s residence time at multiple exposure times./p>90% depletion after 3 h of IAA treatment (Fig. 7a and Extended Data Fig. 9a). Despite the high degradation efficiency, neither YY1’s nuclear distribution nor its clustering was strongly affected after acute loss of CTCF and cohesin in either live or fixed cells (Fig. 7b,c and Extended Data Fig. 9b). This suggests that the maintenance of YY1 hubs is independent of CTCF and cohesin./p>82% of these loci were associated with promoter regions (Fig. 7f and Extended Data Fig. 9d,e). In contrast, both CTCF and WAPL depletion had a negligible effect on YY1 occupancy (Fig. 7f and Extended Data Fig. 9d,e). In biochemical fractionation analysis, we also observed a similar, though less pronounced, reduction in YY1 chromatin association after RAD21 depletion (Extended Data Fig. 9f). To test whether cohesin facilitates the target search of TFs in general, we performed spaSPT on additional TFs. We thus generated RAD21–AID cell lines stably expressing either HaloTag-conjugated SOX2 or KLF4 and found that the bound fraction of both TFs was reduced by ~20% after 3-h cohesin degradation (Extended Data Fig. 9g). These results suggest that cohesin probably facilitates chromatin binding of TFs in general./p>20% of E–P/P–P loops can cross TAD boundaries and retain high contact probability and transcriptional activity (Fig. 2)18,35; (2) only a very small handful of genes showed altered expression levels after CTCF, cohesin or WAPL depletion (Fig. 3)12,13,14,15,16; (3) CTCF and cohesin loops are both rare (~5% of the time) and dynamic (median lifetime ~10–30 min)34; (4) most of the E–P/P–P loops persist after depletion of these structural proteins (Fig. 4)39,63; (5) CTCF/cohesin generally does not colocalize with transcription loci67; and (6) E–P loops and transcription can be established before CTCF/cohesin interactions on mitotic exit71, in some cases even with no CTCF/cohesin expression36,65,66. Second, YY1 was proposed to be a master structural regulator of E–P interactions25 (Fig. 8, Model 2). However, our Micro-C data are inconsistent with this model, because acute YY1 depletion has little effect on E–P/P–P interactions or gene expression. It is still possible that YY1 specifically connects development-related chromatin loops during neural lineage commitment47, but is less important in the pluripotent state. In summary, we conclude that, in mESCs, CTCF, cohesin, WAPL or YY1 is not generally required for the short-term maintenance of most E–P interactions and the subsequent expression of most genes after acute depletion and loss of function./p>

2. Full lists of DEGs are available in Supplementary Table 11./p>2). Full lists of DEGs are available in Supplementary Table 12./p> 100 & intensity > 100 & sigma < 220 & uncertainty_xy < 50; (2) merge: Max distance = 10 & Max frame off = 1 & Max frames = 0; and (3) remove duplicates enabled. This setting combines the blinking molecules into one and removes the multiple localizations in a frame./p>

20 kb). b. Micro-C reproducibility tests. Top: pairwise similarity scores measured by GenomeDisco between UT vs. IAA and UT vs. UT samples using 10-kb resolution of Micro-C matrices. Bottom: similarity scores measured by QuASAR between replicates (light lines) or comparing the UT and IAA-treated samples (dark lines) using Micro-C matrices at 250-kb, 50-kb, 25-kb, and 10-kb resolutions. c. Genome-wide contact decaying P(s) analysis (bottom) and slope distributions of the P(s) curves (top) for UT cells. d. Micro-C contact maps at specific regions or at genome-wide scale across multiple resolutions in the UT and IAA-treated cells. Left to right: examples of Pearson’s correlation matrices showing plaid-like chromosome compartments; saddle plots showing overall compartment strength (A-A: bottom-right; B-B: top left); differential saddle plots showing changes in compartment strength; contact matrices showing TADs along the diagonal; ADA showing all TADs; differential ADA showing TAD strength changes. e. Slope distribution of P(s) curves for UT and IAA-treated cells. Dashed lines highlight the range of genome distances affected by CTCF, RAD21, or WAPL depletion. CTCF depletion had minimal impact on overall interactions across the genome. RAD21 depletion reduced contact frequencies in the range of 10–200 kb but increased interactions at 300 kb – 5 Mb. WAPL depletion showed the opposite trend, with increased contacts at 70–700 kb but reduced contacts at 1–5 Mb. f. Scatter plot of cohesin loops scores in UT and IAA-treated cells. The overlaid heatmap indicates dot density (red: highest, blue: lowest). Dashed lines along the diagonal delimit unchanged loops. g. Loop numbers called by Mustache for UT and IAA-treated cells. The additional loops (n = 5764) identified after WAPL depletion show longer lengths, with a 570-kb median. h. APA for loops across multiple ranges of genomic distance in UT and IAA-treated cells./p> 10), suggesting that while CTCF and cohesin are required for the transcriptional maintenance of only a small subset of genes, those genes tend to require the presence of both factors. Statistical test: Fisher’s exact test. g. Snapshots of Micro-C maps comparing chromatin interactions in the UT (top-right) and IAA-treated (bottom-left) cells surrounding Klf4 locus. Contact maps are annotated with gene boxes and 1D chromatin tracks showing the ChIP-seq signal enrichment in the same region./p>20 kb) interactions. j. Genome-wide contact decaying P(s) analysis (bottom) and slope distributions of the P(s) curves (top) for UT cells. k. MA plot of total RNA-seq and nascent RNA-seq for YY1 degron 3 to 24 hours after IAA treatment. l. Scatter plots of loop scores (quantified using 2-kb-resolution Micro-C data) plotted for E-P or P-P loops in UT and IAA-treated cells. APA for YY1, E-P, or P-P anchored loops plotted for the ΔYY1 degron cell line in UT and IAA-treated cells. m. Micro-C maps comparing chromatin interactions in UT and IAA-treated ΔYY1 cells surrounding Nes gene./p>