linkedin reddit search_black sharethis
619 Publications

ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements

X. Chen, Andrew F. Neuwald, Leena Hilakivi-Clarke, Robert Clarke, Jianhua Xuan

Transcription factors (TFs) often function as a module including both master factors and mediators binding at cis-regulatory regions to modulate nearby gene transcription. ChIP-seq profiling of multiple TFs makes it feasible to infer functional TF modules. However, when inferring TF modules based on co-localization of ChIP-seq peaks, often many weak binding events are missed, especially for mediators, resulting in incomplete identification of modules. To address this problem, we develop a ChIP-seq data-driven Gibbs Sampler to infer Modules (ChIP-GSM) using a Bayesian framework that integrates ChIP-seq profiles of multiple TFs. ChIP-GSM samples read counts of module TFs iteratively to estimate the binding potential of a module to each region and, across all regions, estimates the module abundance. Using inferred module-region probabilistic bindings as feature units, ChIP-GSM then employs logistic regression to predict active regulatory elements. Validation of ChIP-GSM predicted regulatory regions on multiple independent datasets sharing the same context confirms the advantage of using TF modules for predicting regulatory activity. In a case study of K562 cells, we demonstrate that the ChIP-GSM inferred modules form as groups, activate gene expression at different time points, and mediate diverse functional cellular processes. Hence, ChIP-GSM infers biologically meaningful TF modules and improves the prediction accuracy of regulatory region activities.

Show Abstract

Machine learning methods to model multicellular complexity and tissue specificity

Experimental approaches to study tissue specificity enable insight into the nature and organization of the cell types and tissues that constitute complex multicellular organisms. Machine learning provides a powerful tool to investigate and interpret tissue-specific experimental data. In this Review, we first provide a brief introduction to key single-cell and whole-tissue approaches that allow investigation of tissue specificity and then highlight two classes of machine-learning-based methods, which can be applied to analyse, model and interpret these experimental data. Deep learning methods can predict tissue-dependent effects of individual mutations on gene expression, alternative splicing and disease phenotypes. Network-based approaches can capture relationships between biomolecules, integrate large heterogeneous data compendia to model molecular circuits and identify tissue-specific functional relationships and regulatory connections. We conclude with an outlook to future possibilities in examining multicellular complexity by combining high-resolution, large-scale multiomics data sets and interpretable machine learning models.

Show Abstract

CROTON: an automated and variant-aware deep learning framework for predicting CRISPR/Cas9 editing outcomes

Victoria R Li, O. Troyanskaya, Z. Zhang

CRISPR/Cas9 is a revolutionary gene-editing technology that has been widely utilized in biology, biotechnology and medicine. CRISPR/Cas9 editing outcomes depend on local DNA sequences at the target site and are thus predictable. However, existing prediction methods are dependent on both feature and model engineering, which restricts their performance to existing knowledge about CRISPR/Cas9 editing

Show Abstract
July 12, 2021

A fast Chebyshev method for the Bingham closure with application to active nematic suspensions

Scott Weady, D. Stein, M. Shelley

Continuum kinetic theories provide an important tool for the analysis and simulation of particle suspensions. When those particles are anisotropic, the addition of a particle orientation vector to the kinetic description yields a 2d−1 dimensional theory which becomes intractable to simulate, especially in three dimensions or near states where the particles are highly aligned. Coarse-grained theories that track only moments of the particle distribution functions provide a more efficient simulation framework, but require closure assumptions. For the particular case where the particles are apolar, the Bingham closure has been found to agree well with the underlying kinetic theory; yet the closure is non-trivial to compute, requiring the solution of an often nearly-singular nonlinear equation at every spatial discretization point at every timestep. In this paper, we present a robust, accurate, and efficient numerical scheme for evaluating the Bingham closure, with a controllable error/efficiency tradeoff. To demonstrate the utility of the method, we carry out high-resolution simulations of a coarse-grained continuum model for a suspension of active particles in parameter regimes inaccessible to kinetic theories. Analysis of these simulations reveals that inaccurately computing the closure can act to effectively limit spatial resolution in the coarse-grained fields. Pushing these simulations to the high spatial resolutions enabled by our method reveals a coupling between vorticity and topological defects in the suspension director field, as well as signatures of energy transfer between scales in this active fluid model.

Show Abstract
arXiv e-prints
June 28, 2021

Experiences and lessons learned from two virtual, hands-on microbiome bioinformatics workshops

Matthew R Dillon, Evan Bolyen, Anja Adamov, J. Morton, R. Bonneau, et al.

In October of 2020, in response to the Coronavirus Disease 2019 (COVID-19) pandemic, our team hosted our first fully online workshop teaching the QIIME 2 microbiome bioinformatics platform. We had 75 enrolled participants who joined from at least 25 different countries on 6 continents, and we had 22 instructors on 4 continents. In the 5-day workshop, participants worked hands-on with a cloud-based shared compute cluster that we deployed for this course. The event was well received, and participants provided feedback and suggestions in a postworkshop questionnaire. In January of 2021, we followed this workshop with a second fully online workshop, incorporating lessons from the first. Here, we present details on the technology and protocols that we used to run these workshops, focusing on the first workshop and then introducing changes made for the second workshop. We discuss what worked well, what didn’t work well, and what we plan to do differently in future workshops.

Show Abstract

How crosslink numbers shape the large-scale physics of cytoskeletal materials

S. Fürthauer, M. Shelley

Cytoskeletal networks are the main actuators of cellular mechanics, and a foundational example for active matter physics. In cytoskeletal networks, motion is generated on small scales by filaments that push and pull on each other via molecular-scale motors. These local actuations give rise to large scale stresses and motion. To understand how microscopic processes can give rise to self-organized behavior on larger scales it is important to consider what mechanisms mediate long-ranged mechanical interactions in the systems. Two scenarios have been considered in the recent literature. The first are systems which are relatively sparse, in which most of the large scale momentum transfer is mediated by the solvent in which cytoskeletal filaments are suspended. The second, are systems in which filaments are coupled via crosslink molecules throughout. Here, we review the differences and commonalities between the physics of these two regimes. We also survey the literature for the numbers that allow us to place a material within either of these two classes.

Show Abstract
June 24, 2021

Microtubule reorganization during female meiosis in C. elegans

Ina Lantzsch, S. Fürthauer

Most female meiotic spindles undergo striking morphological changes while transitioning from metaphase to anaphase. The ultra-structure of meiotic spindles, and how changes to this structure correlate with such dramatic spindle rearrangements remains largely unknown. To address this, we applied light microscopy, large-scale electron tomography and mathematical modeling of female meiotic Caenorhabditis elegans spindles. Combining these approaches, we find that meiotic spindles are dynamic arrays of short microtubules that turn over within seconds. The results show that the metaphase to anaphase transition correlates with an increase in microtubule numbers and a decrease in their average length. Detailed analysis of the tomographic data revealed that the microtubule length changes significantly during the metaphase-to-anaphase transition. This effect is most pronounced for microtubules located within 150 nm of the chromosome surface. To understand the mechanisms that drive this transition, we developed a mathematical model for the microtubule length distribution that considers microtubule growth, catastrophe, and severing. Using Bayesian inference to compare model predictions and data, we find that microtubule turn-over is the major driver of the spindle reorganizations. Our data suggest that in metaphase only a minor fraction of microtubules, those closest to the chromosomes, are severed. The large majority of microtubules, which are not in close contact with chromosomes, do not undergo severing. Instead, their length distribution is fully explained by growth and catastrophe. This suggests that the most prominent drivers of spindle rearrangements are changes in nucleation and catastrophe rate. In addition, we provide evidence that microtubule severing is dependent on katanin.

Show Abstract
June 11, 2021

Metallic Microswimmers Driven up the Wall by Gravity

Q. Brosseau, F. Balboa Usabiaga, E. Lushi, Y. Wu, L. Ristroph, M. D. Ward, M. Shelley, J. Zhang

Experiments on autophoretic bimetallic nanorods propelling within a fuel of hydrogen peroxide show that tail-heavy swimmers preferentially orient upwards and ascend along inclined planes. We show that such gravitaxis is strongly facilitated by interactions with solid boundaries, allowing even ultraheavy microswimmers to climb nearly vertical surfaces. Theory and simulations show that the buoyancy or gravitational torque that tends to align the rods is reinforced by a fore-aft drag asymmetry induced by hydrodynamic interactions with the wall.

Show Abstract
June 11, 2021

Temporal integration of inductive cues on the way to gastrulation

Sarah McFann, Sayantan Dutta, Jared E. Toettcher, S. Shvartsman

In early development, cells commit to a single germ fate despite receiving multiple, conflicting inductive cues. Here, we examine how cells in the Drosophila embryo integrate promesodermal and proendodermal signals. We find that proendoderm signals repress transcriptional determinants of mesodermal cell movements during a critical time window in the early embryo. Based on precise optogenetic perturbations, live imaging, and computational modeling, our work provides a framework for quantitative understanding of combinatorial control of gastrulation dynamics. All study data are included in the article and/or supporting information.

Show Abstract

Anchor extension: a structure-guided approach to design cyclic peptides targeting enzyme active sites

Parisa Hosseinzadeh, Paris R. Watson, Timothy W. Craven, V. Mulligan, et al.

Despite recent success in computational design of structured cyclic peptides, de novo design of cyclic peptides that bind to any protein functional site remains difficult. To address this challenge, we develop a computational “anchor extension” methodology for targeting protein interfaces by extending a peptide chain around a non-canonical amino acid residue anchor. To test our approach using a well characterized model system, we design cyclic peptides that inhibit histone deacetylases 2 and 6 (HDAC2 and HDAC6) with enhanced potency compared to the original anchor (IC50 values of 9.1 and 4.4 nM for the best binders compared to 5.4 and 0.6 µM for the anchor, respectively). The HDAC6 inhibitor is among the most potent reported so far. These results highlight the potential for de novo design of high-affinity protein-peptide interfaces, as well as the challenges that remain..

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates