565 Publications

Promoter and Gene-Body RNA-Polymerase II co-exist in partial demixed condensates

Arya Changiarath , Jasper J. Michels, S. Hanson

In cells, transcription is tightly regulated on multiple layers. The condensation of the transcription machinery into distinct phases is hypothesized to spatio-temporally fine tune RNA polymerase II behaviour during two key stages, transcription initiation and the elongation of the nascent RNA transcripts. However, it has remained unclear whether these phases would mix when present at the same time or remain distinct chemical environments; either as multi-phase condensates or by forming entirely separate condensates. Here we combine particle-based multi-scale simulations and experiments in the model organism C. elegans to characterise the biophysical properties of RNA polymerase II condensates. Both simulations and the in vivo work describe a lower critical solution temperature (LCST) behaviour of RNA Polymerase II, with condensates dissolving at lower temperatures whereas higher temperatures promote condensate stability, which highlights that these condensates are physio-chemically distinct from heterochromatin condensates. The LCST behavior of CTD correlates with gradual shifts in the transcription program but is largely uncoupled from the classical stress response. Expanding the simulations we model how the degree of phosphorylation of the disordered C-terminal domain of RNA polymerase II (CTD), which is characteristic for each step of transcription, controls the existence and morphology of multi-phasic condensates. We show that the two phases putatively underpinning the initiation of transcription and transcription elongation constitute distinct chemical environments and are in agreement with RNA polymerase II condensates observed in C. elegans embryos by super resolution microscopy. Our analysis shows how depending on its post transcriptional modifications and its interaction partner a single protein can form multiple partially engulfed condensates, potentially promoting the selective recruitment of additional factors to these two phases.

Show Abstract
March 27, 2024

Supercharged coiled-coil protein with N-terminal decahistidine tag boosts siRNA complexation and delivery efficiency of a lipoproteoplex

Jonathan W. Sun, Joseph S. Thomas, D. Renfrew, et al.

Short interfering RNA (siRNA) therapeutics have soared in popularity due to their highly selective and potent targeting of faulty genes, providing a non-palliative approach to address diseases. Despite their potential, effective transfection of siRNA into cells requires the assistance of an accompanying vector. Vectors constructed from non-viral materials, while offering safer and non-cytotoxic profiles, often grapple with lackluster loading and delivery efficiencies, necessitating substantial milligram quantities of expensive siRNA to confer the desired downstream effects. We detail the recombinant synthesis of a diverse series of coiled-coil supercharged protein (CSP) biomaterials systematically designed to investigate the impact of two arginine point mutations (Q39R and N61R) and decahistidine tags on liposomal siRNA delivery. The most efficacious variant, N8, exhibits a twofold increase in its affinity to siRNA and achieves a twofold enhancement in transfection activity with minimal cytotoxicity in vitro. Subsequent analysis unveils the destabilizing effect of the Q39R and N61R supercharging mutations and the incorporation of C-terminal decahistidine tags on α-helical secondary structure. Cross-correlational regression analyses reveal that the amount of helical character in these mutants is key in N8's enhanced siRNA complexation and downstream delivery efficiency.

Show Abstract

Good rates from bad coordinates: the exponential average time-dependent rate approach

Nicodemo Mazzaferro, Subarna Sasmal, P. Cossio, Glen M. Hocky

Our ability to calculate rates of biochemical processes using molecular dynamics simulations is severely limited by the fact that the time scales for reactions, or changes in conformational state, scale exponentially with the relevant free-energy barriers. In this work, we improve upon a recently proposed rate estimator that allows us to predict transition times with molecular dynamics simulations biased to rapidly explore one or several collective variables. This approach relies on the idea that not all bias goes into promoting transitions, and along with the rate, it estimates a concomitant scale factor for the bias termed the collective variable biasing efficiency γ. First, we demonstrate mathematically that our new formulation allows us to derive the commonly used Infrequent Metadynamics (iMetaD) estimator when using a perfect collective variable, γ=1. After testing it on a model potential, we then study the unfolding behavior of a previously well characterized coarse-grained protein, which is sufficiently complex that we can choose many different collective variables to bias, but which is sufficiently simple that we are able to compute the unbiased rate dire ctly. For this system, we demonstrate that our new Exponential Average Time-Dependent Rate (EATR) estimator converges to the true rate more rapidly as a function of bias deposition time than does the previous iMetaD approach, even for bias deposition times that are short. We also show that the γ parameter can serve as a good metric for assessing the quality of the biasing coordinate. Finally, we demonstrate that the approach works when combining multiple less-than-optimal bias coordinates.

Show Abstract
March 15, 2024

Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins

Moritz Ertelt, V. Mulligan, et al.

Post-translational modifications (PTMs) of proteins play a vital role in their function and stability. These modifications influence protein folding, signaling, protein-protein interactions, enzyme activity, binding affinity, aggregation, degradation, and much more. To date, over 400 types of PTMs have been described, representing chemical diversity well beyond the genetically encoded amino acids. Such modifications pose a challenge to the successful design of proteins, but also represent a major opportunity to diversify the protein engineering toolbox. To this end, we first trained artificial neural networks (ANNs) to predict eighteen of the most abundant PTMs, including protein glycosylation, phosphorylation, methylation, and deamidation. In a second step, these models were implemented inside the computational protein modeling suite Rosetta, which allows flexible combination with existing protocols to model the modified sites and understand their impact on protein stability as well as function. Lastly, we developed a new design protocol that either maximizes or minimizes the predicted probability of a particular site being modified. We find that this combination of ANN prediction and structure-based design can enable the modification of existing, as well as the introduction of novel, PTMs. The potential applications of our work include, but are not limited to, glycan masking of epitopes, strengthening protein-protein interactions through phosphorylation, as well as protecting proteins from deamidation liabilities. These applications are especially important for the design of new protein therapeutics where PTMs can drastically change the therapeutic properties of a protein. Our work adds novel tools to Rosetta’s protein engineering toolbox that allow for the rational design of PTMs.

Show Abstract

Ensemble Detection of DNA Engineering Signatures

Aaron Adler, Joel S. Bader, A. Persikov

Synthetic biology is creating genetically engineered organisms at an increasing rate for many potentially valuable applications, but this potential comes with the risk of misuse or accidental release. To begin to address this issue, we have developed a system called GUARDIAN that can automatically detect signatures of engineering in DNA sequencing data, and we have conducted a blinded test of this system using a curated Test and Evaluation (T&E) data set. GUARDIAN uses an ensemble approach based on the guiding principle that no single approach is likely to be able to detect engineering with perfect accuracy. Critically, ensembling enables GUARDIAN to detect sequence inserts in 13 target organisms with a high degree of specificity that requires no subject matter expert (SME) review.

Show Abstract

Computational Design of Phosphotriesterase Improves V-Agent Degradation Efficiency

Jacob Kronenberg, Stanley Chu, D. Renfrew, et al.

Organophosphates (OPs) are a class of neurotoxic acetylcholinesterase inhibitors including widely used pesticides as well as nerve agents such as VX and VR. Current treatment of these toxins relies on reactivating acetylcholinesterase, which remains ineffective. Enzymatic scavengers are of interest for their ability to degrade OPs systemically before they reach their target. Here we describe a library of computationally designed variants of phosphotriesterase (PTE), an enzyme that is known to break down OPs. The mutations G208D, F104A, K77A, A80V, H254G, and I274N broadly improve catalytic efficiency of VX and VR hydrolysis without impacting the structure of the enzyme. The mutation I106 A improves catalysis of VR and L271E abolishes activity, likely due to disruptions of PTE's structure. This study elucidates the importance of these residues and contributes to the design of enzymatic OP scavengers with improved efficiency.

Show Abstract

Precision Medicine in Nephrology: An Integrative Framework of Multidimensional Data in the Kidney Precision Medicine Project

Tarek M. El-Achkar, Michael T. Eadon, R. Sealfon

Chronic kidney disease (CKD) and acute kidney injury (AKI) are heterogeneous syndromes defined clinically by serial measures of kidney function. Each condition possesses strong histopathologic associations, including glomerular obsolescence or acute tubular necrosis, respectively. Despite such characterization, there remains wide variation in patient outcomes and treatment responses. Precision medicine efforts, as exemplified by the Kidney Precision Medicine Project (KPMP), have begun to establish evolving, spatially anchored, cellular and molecular atlases of the cell types, states, and niches of the kidney in health and disease. The KPMP atlas provides molecular context for CKD and AKI disease drivers and will help define subtypes of disease that are not readily apparent from canonical functional or histopathologic characterization but instead are appreciable through advanced clinical phenotyping, pathomic, transcriptomic, proteomic, epigenomic, and metabolomic interrogation of kidney biopsy samples. This perspective outlines the structure of the KPMP, its approach to the integration of these diverse datasets, and its major outputs relevant to future patient care.

Show Abstract

Specificity, synergy, and mechanisms of splice-modifying drugs

Yuma Ishigami, Mandy S. Wong, S. Hanson, et al.

Drugs that target pre-mRNA splicing hold great therapeutic potential, but the quantitative understanding of how these drugs work is limited. Here we introduce mechanistically interpretable quantitative models for the sequence-specific and concentration-dependent behavior of splice-modifying drugs. Using massively parallel splicing assays, RNA-seq experiments, and precision dose-response curves, we obtain quantitative models for two small-molecule drugs, risdiplam and branaplam, developed for treating spinal muscular atrophy. The results quantitatively characterize the specificities of risdiplam and branaplam for 5’ splice site sequences, suggest that branaplam recognizes 5’ splice sites via two distinct interaction modes, and contradict the prevailing two-site hypothesis for risdiplam activity at SMN2 exon 7. The results also show that anomalous single-drug cooperativity, as well as multi-drug synergy, are widespread among small-molecule drugs and antisense-oligonucleotide drugs that promote exon inclusion. Our quantitative models thus clarify the mechanisms of existing treatments and provide a basis for the rational development of new therapies.

Show Abstract

Training self-learning circuits for power-efficient solutions

Menachem Stern , Douglas J. Durian, Andrea J. Liu, et al.

As the size and ubiquity of artificial intelligence and computational machine learning models grow, the energy required to train and use them is rapidly becoming economically and environmentally unsustainable. Recent laboratory prototypes of self-learning electronic circuits, such as “physical learning machines,” open the door to analog hardware that directly employs physics to learn desired functions from examples at a low energy cost. In this work, we show that this hardware platform allows for an even further reduction in energy consumption by using good initial conditions and a new learning algorithm. Using analytical calculations, simulations, and experiments, we show that a trade-off emerges when learning dynamics attempt to minimize both the error and the power consumption of the solution—greater power reductions can be achieved at the cost of decreasing solution accuracy. Finally, we demonstrate a practical procedure to weigh the relative importance of error and power minimization, improving the power efficiency given a specific tolerance to error.

Show Abstract

Nuclear instance segmentation and tracking for preimplantation mouse embryos

H. Nunley , Binglun Shao, Prateek Grover, A. Watters, S. Shvartsman, L. M. Brown, et al.

For investigations into fate specification and cell rearrangements in live images of preimplantation embryos, automated and accurate 3D instance segmentation of nuclei is invaluable; however, the performance of segmentation methods is limited by the images' low signal-to-noise ratio and high voxel anisotropy and the nuclei's dense packing and variable shapes. Supervised machine learning approaches have the potential to radically improve segmentation accuracy but are hampered by a lack of fully annotated 3D data. In this work, we first establish a novel mouse line expressing near-infrared nuclear reporter H2B-miRFP720. H2B-miRFP720 is the longest wavelength nuclear reporter in mice and can be imaged simultaneously with other reporters with minimal overlap. We then generate a dataset, which we call BlastoSPIM, of 3D microscopy images of H2B-miRFP720-expressing embryos with ground truth for nuclear instance segmentation. Using BlastoSPIM, we benchmark the performance of five convolutional neural networks and identify Stardist-3D as the most accurate instance segmentation method across preimplantation development. Stardist-3D, trained on BlastoSPIM, performs robustly up to the end of preimplantation development (> 100 nuclei) and enables studies of fate patterning in the late blastocyst. We, then, demonstrate BlastoSPIM's usefulness as pre-train data for related problems. BlastoSPIM and its corresponding Stardist-3D models are available at: blastospim.flatironinstitute.org.

Show Abstract
February 23, 2024
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates