565 Publications

A hydraulic instability drives the cell death decision in the nematode germline

N. T. Chartier, A. Mukherjee, Sebastian Fürthauer, et al.

Oocytes are large cells that develop into an embryo upon fertilization1. As interconnected germ cells mature into oocytes, some of them grow—typically at the expense of others that undergo cell death. We present evidence that in the nematode Caenorhabditis elegans, this cell-fate decision is mechanical and related to tissue hydraulics. An analysis of germ cell volumes and material fluxes identifies a hydraulic instability that amplifies volume differences and causes some germ cells to grow and others to shrink, a phenomenon that is related to the two-balloon instability. Shrinking germ cells are extruded and they die, as we demonstrate by artificially reducing germ cell volumes via thermoviscous pumping. Our work reveals a hydraulic symmetry-breaking transition central to the decision between life and death in the nematode germline.

Show Abstract

SynNotch-CAR T cells overcome challenges of specificity, heterogeneity, and persistence in treating glioblastoma

J. Choe, P. Watchmaker, M. Simic , O. Troyanskaya, et al.

Two major hurdles in chimeric antigen receptor (CAR) T cell therapy for solid tumors are ensuring specificity to tumor cells without affecting healthy cells and avoiding tumor escape due to antigen loss. To address these challenges, Hyrenius-Wittsten et al. and Choe et al. developed synthetic notch (synNotch)–CAR T cells targeting solid tumor antigens and used them to treat mouse models of mesothelioma, ovarian cancer, and glioblastoma. In both studies, the authors demonstrated that synNotch-CAR T cells were better at controlling tumors than traditional CAR T cells and did not result in toxicity or damage to healthy tissue. These results suggest that synNotch-CAR T cells may be an effective treatment strategy for solid tumors.

Show Abstract

MHCEpitopeEnergy, a Flexible Rosetta-Based Biotherapeutic Deimmunization Platform

B. Yachnin, V. Mulligan, S. Khare, C. Bailey-Kellogg

As non-“self” macromolecules, biotherapeutics can trigger an immune response that can reduce drug efficacy, require patients to be taken off therapy, or even cause life-threatening reactions. To enable the flexible and facile design of protein biotherapeutics while reducing the prevalence of T-cell epitopes that drive immune recognition, we have integrated into the Rosetta protein design suite a new scoring term that allows design protocols to account for predicted or experimentally identified epitopes in the optimized objective function. This flexible scoring term can be used in any Rosetta design trajectory, can be targeted to specific regions of a protein, and can be readily extended to work with a variety of epitope predictors. By performing extensive design runs with varied design parameter choices for three case study proteins as well as a larger diverse benchmark, we show that the incorporation of this scoring term enables the effective exploration of an alternative, deimmunized sequence space to discover diverse proteins that are potentially highly deimmunized while retaining physical and chemical qualities similar to those yielded by equivalent nondeimmunizing sequence design protocols.

Show Abstract

Tissue-specific enhancer functional networks for associating distal regulatory regions to disease

X. Chen, J. Zhou, R. Zhang, A. Wong, C. Park, C. Theesfeld, O. Troyanskaya

Systematic study of tissue-specific function of enhancers and their disease associations is a major challenge. We present an integrative machine-learning framework, FENRIR, that integrates thousands of disparate epigenetic and functional genomics datasets to infer tissue-specific functional relationships between enhancers for 140 diverse human tissues and cell types, providing a regulatory-region-centric approach to systematically identify disease-associated enhancers. We demonstrated its power to accurately prioritize enhancers associated with 25 complex diseases. In a case study on autism, FENRIR-prioritized enhancers showed a significant proband-specific de novo mutation enrichment in a large, sibling-controlled cohort, indicating pathogenic signal. We experimentally validated transcriptional regulatory activities of eight enhancers, including enhancers not previously reported with autism, and demonstrated their differential regulatory potential between proband and sibling alleles. Thus, FENRIR is an accurate and effective framework for the study of tissue-specific enhancers and their role in disease. FENRIR can be accessed at fenrir.flatironinstitute.org/.

Show Abstract

Modeling transcriptional regulation of model species with deep learning

E. Cofer, A. Wong, O. Troyanskaya, et al.

To enable large-scale analyses of regulatory logic in model species, we developed DeepArk, a set of deep learning models of the cis-regulatory codes of four widely-studied species: Caenorhabditis elegans, Danio rerio, Drosophila melanogaster, and Mus musculus DeepArk accurately predicts the presence of thousands of different context-specific regulatory features, including chromatin states, histone marks, and transcription factors. In vivo studies show that DeepArk can predict the regulatory impact of any genomic variant (including rare or not previously observed), and enables the regulatory annotation of understudied model species.

Show Abstract
April 19, 2021

Molecular mechanisms underlying cellular effects of human MEK1 mutations

R. Marmion, L. Yang, Y. Goyal, G. Jindal, J. Wetzel, M. Singh, T. Schüpbach, S. Shvartsman

Terminal regions of Drosophila embryos are patterned by signaling through ERK, which is genetically deregulated in multiple human diseases. Quantitative studies of terminal patterning have been recently used to investigate gain-of-function variants of human MEK1, encoding the MEK kinase that directly activates ERK by dual phosphorylation. Unexpectedly, several mutations reduced ERK activation by extracellular signals, possibly through a negative feedback triggered by signal-independent activity of the mutant variants. Here we present experimental evidence supporting this model. Using a MEK variant that combines a mutation within the negative regulatory region with alanine substitutions in the activation loop, we prove that pathogenic variants indeed acquire signal-independent kinase activity. We also demonstrate that signal-dependent activation of these variants is independent of kinase suppressor of Ras, a conserved adaptor that is indispensable for activation of normal MEK. Finally, we show that attenuation of ERK activation by extracellular signals stems from transcriptional induction of Mkp3, a dual specificity phosphatase that deactivates ERK by dephosphorylation. These findings in the Drosophila embryo highlight its power for investigating diverse effects of human disease mutations.

Show Abstract

ChIP-BIT2: a software tool to detect weak binding events using a Bayesian integration approach

X. Chen, A. Neuwald, L. Hilakivi-Clarke, R. Clarke, J. Xuan

Background
ChIP-seq combines chromatin immunoprecipitation assays with sequencing and identifies genome-wide binding sites for DNA binding proteins. While many binding sites have strong ChIP-seq ‘peak’ observations and are well captured, there are still regions bound by proteins weakly, with a relatively low ChIP-seq signal enrichment. These weak binding sites, especially those at promoters and enhancers, are functionally important because they also regulate nearby gene expression. Yet, it remains a challenge to accurately identify weak binding sites in ChIP-seq data due to the ambiguity in differentiating these weak binding sites from the amplified background DNAs.

Results
ChIP-BIT2 (http://sourceforge.net/projects/chipbitc/) is a software package for ChIP-seq peak detection. ChIP-BIT2 employs a mixture model integrating protein and control ChIP-seq data and predicts strong or weak protein binding sites at promoters, enhancers, or other genomic locations. For binding sites at gene promoters, ChIP-BIT2 simultaneously predicts their target genes. ChIP-BIT2 has been validated on benchmark regions and tested using large-scale ENCODE ChIP-seq data, demonstrating its high accuracy and wide applicability.

Conclusion
ChIP-BIT2 is an efficient ChIP-seq peak caller. It provides a better lens to examine weak binding sites and can refine or extend the existing binding site collection, providing additional regulatory regions for decoding the mechanism of gene expression regulation.

Show Abstract
April 15, 2021

Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks

J. Koehler, S. Lyskov, S. Lewis, J. Adolf-Bryfogle, R. Alford, K. Barlow, Z. Ben-Aharon, D. Farrell , J. Fell, W. Hansen, A. Harmalkar, J. Jeliazkov, G. Kuenze, J. Krys, A. Ljubetič, A. Loshbaugh, J. Maguire, R. Moretti, V. Mulligan, R. Bonneau, et al

Each year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.

Show Abstract

Constrained non-negative matrix factorization enabling real-time insights of in situ and high-throughput experiments

M. Maffettone, A. Daly, D. Olds

Non-negative Matrix Factorization (NMF) methods offer an appealing unsupervised learning method for real-time analysis of streaming spectral data in time-sensitive data collection, such as in situ characterization of materials. However, canonical NMF methods are optimized to reconstruct a full dataset as closely as possible, with no underlying requirement that the reconstruction produces components or weights representative of the true physical processes. In this work, we demonstrate how constraining NMF weights or components, provided as known or assumed priors, can provide significant improvement in revealing true underlying phenomena. We present a PyTorch based method for efficiently applying constrained NMF and demonstrate this on several synthetic examples. When applied to streaming experimentally measured spectral data, an expert researcher-in-the-loop can provide and dynamically adjust the constraints. This set of interactive priors to the NMF model can, for example, contain known or identified independent components, as well as functional expectations about the mixing of components. We demonstrate this application on measured X-ray diffraction and pair distribution function data from in situ beamline experiments. Details of the method are described, and general guidance provided to employ constrained NMF in extraction of critical information and insights during in situ and high-throughput experiments.

Show Abstract
April 2, 2021

Comparison of explicit and mean-field models of cytoskeletal filaments with crosslinking motors

A. Lamson, J. Moore, F. Fang, M. Glaser, M. Shelley, M. Betterton

In cells, cytoskeletal filament networks are responsible for cell movement, growth, and division. Filaments in the cytoskeleton are driven and organized by crosslinking molecular motors. In reconstituted cytoskeletal systems, motor activity is responsible for far-from-equilibrium phenomena such as active stress, self-organized flow, and spontaneous nematic defect generation. How microscopic interactions between motors and filaments lead to larger-scale dynamics remains incompletely understood. To build from motor–filament interactions to predict bulk behavior of cytoskeletal systems, more computationally efficient techniques for modeling motor–filament interactions are needed. Here, we derive a coarse-graining hierarchy of explicit and continuum models for crosslinking motors that bind to and walk on filament pairs. We compare the steady-state motor distribution and motor-induced filament motion for the different models and analyze their computational cost. All three models agree well in the limit of fast motor binding kinetics. Evolving a truncated moment expansion of motor density speeds the computation by 103–106 compared to the explicit or continuous-density simulations, suggesting an approach for more efficient simulation of large networks. These tools facilitate further study of motor–filament networks on micrometer to millimeter length scales.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates