563 Publications

A sequence-based global map of regulatory activity for deciphering human genetics

Kathleen Chen, A. Wong, O. Troyanskaya, Jian Zhou

Epigenomic profiling has enabled large-scale identification of regulatory elements, yet we still lack a systematic mapping from any sequence or variant to regulatory activities. We address this challenge with Sei, a framework for integrating human genetics data with sequence information to discover the regulatory basis of traits and diseases. Sei learns a vocabulary of regulatory activities, called sequence classes, using a deep learning model that predicts 21,907 chromatin profiles across >1,300 cell lines and tissues. Sequence classes provide a global classification and quantification of sequence and variant effects based on diverse regulatory activities, such as cell type-specific enhancer functions. These predictions are supported by tissue-specific expression, expression quantitative trait loci and evolutionary constraint data. Furthermore, sequence classes enable characterization of the tissue-specific, regulatory architecture of complex traits and generate mechanistic hypotheses for individual regulatory pathogenic mutations. We provide Sei as a resource to elucidate the regulatory basis of human health and disease.

Show Abstract

Weakly nonlinear analysis of pattern formation in active suspensions

Laurel Ohm, M. Shelley

We consider the Saintillan--Shelley kinetic model of active rodlike particles in Stokes flow (Saintillan & Shelley 2008a,b), for which the uniform, isotropic suspension of pusher particles is known to be unstable in certain settings. Through weakly nonlinear analysis accompanied by numerical simulations, we determine exactly how the isotropic steady state loses stability in different parameter regimes. We study each of the various types of bifurcations admitted by the system, including both subcritical and supercritical Hopf and pitchfork bifurcations. Elucidating this system's behavior near these bifurcations provides a theoretical means of comparing this model with other physical systems which transition to turbulence, and makes predictions about the nature of bifurcations in active suspensions that can be explored experimentally.

Show Abstract

Self-organized flows in phase-synchronizing active fluids

B. Chakrabarti, M. Shelley, Sebastian Fürthauer

Many active biological particles, such as swimming microorganisms or motor-proteins, do work on their environment by going though a periodic sequence of shapes. Interactions between particles can lead to the phase-synchronization of their duty cycles. Here we consider collective dynamics in a suspension of such active particles coupled through hydrodynamics. We demonstrate that the emergent non-equilibrium states feature stationary patterned flows and robust unidirectional pumping states under confinement. Moreover the phase-synchronized state of the suspension exhibits spatially robust chimera patterns in which synchronized and phase-isotropic regions coexist within the same system. These findings demonstrate a new route to pattern formation and could guide the design of new active materials.

Show Abstract
June 24, 2022

Stability selection enables robust learning of differential equations from limited noisy data

S. Maddu, Bevan L. Cheeseman , Ivo F. Sbalzarini, C. Müller

We present a statistical learning framework for robust identification of differential equations from noisy spatio-temporal data. We address two issues that have so far limited the application of such methods, namely their robustness against noise and the need for manual parameter tuning, by proposing stability-based model selection to determine the level of regularization required for reproducible inference. This avoids manual parameter tuning and improves robustness against noise in the data. Our stability selection approach, termed PDE-STRIDE, can be combined with any sparsity-promoting regression method and provides an interpretable criterion for model component importance. We show that the particular combination of stability selection with the iterative hard-thresholding algorithm from compressed sensing provides a fast and robust framework for equation inference that outperforms previous approaches with respect to accuracy, amount of data required, and robustness. We illustrate the performance of PDE-STRIDE on a range of simulated benchmark problems, and we demonstrate the applicability of PDE-STRIDE on real-world data by considering purely data-driven inference of the protein interaction network for embryonic polarization in Caenorhabditis elegans. Using fluorescence microscopy images of C. elegans zygotes as input data, PDE-STRIDE is able to learn the molecular interactions of the proteins.

Show Abstract

Parallel Discrete Convolutions on Adaptive Particle Representations of Images

Joel Jonsson, S. Maddu, et al.

We present data structures and algorithms for native implementations of discrete convolution operators over Adaptive Particle Representations (APR) of images on parallel computer architectures. The APR is a content-adaptive image representation that locally adapts the sampling resolution to the image signal. It has been developed as an alternative to pixel representations for large, sparse images as they typically occur in fluorescence microscopy. It has been shown to reduce the memory and runtime costs of storing, visualizing, and processing such images. This, however, requires that image processing natively operates on APRs, without intermediately reverting to pixels. Designing efficient and scalable APR-native image processing primitives, however, is complicated by the APR’s irregular memory structure. Here, we provide the algorithmic building blocks required to efficiently and natively process APR images using a wide range of algorithms that can be formulated in terms of discrete convolutions. We show that APR convolution naturally leads to scale-adaptive algorithms that efficiently parallelize on multi-core CPU and GPU architectures. We quantify the speedups in comparison to pixel-based algorithms and convolutions on evenly sampled data. We achieve pixel-equivalent throughputs of up to 1TB/s on a single Nvidia GeForce RTX 2080 gaming GPU, requiring up to two orders of magnitude less memory than a pixel-based implementation.

Show Abstract

A reference tissue atlas for the human kidney

Jens Hansen, R. Sealfon, O. Troyanskaya, et al.

Kidney Precision Medicine Project (KPMP) is building a spatially specified human kidney tissue atlas in health and disease with single-cell resolution. Here, we describe the construction of an integrated reference map of cells, pathways, and genes using unaffected regions of nephrectomy tissues and undiseased human biopsies from 56 adult subjects. We use single-cell/nucleus transcriptomics, subsegmental laser microdissection transcriptomics and proteomics, near-single-cell proteomics, 3D and CODEX imaging, and spatial metabolomics to hierarchically identify genes, pathways, and cells. Integrated data from these different technologies coherently identify cell types/subtypes within different nephron segments and the interstitium. These profiles describe cell-level functional organization of the kidney following its physiological functions and link cell subtypes to genes, proteins, metabolites, and pathways. They further show that messenger RNA levels along the nephron are congruent with the subsegmental physiological activity. This reference atlas provides a framework for the classification of kidney disease when multiple molecular mechanisms underlie convergent clinical phenotypes.

Show Abstract

Molecular Characterization of Membranous Nephropathy

R. Sealfon, Laura Mariani, J. Funk, A. Wong, O. Troyanskaya

Although membranous nephropathy (MN) is one of the most common causes of nephrotic syndrome, the molecular characteristics of the kidney damage in MN remain poorly defined. In this study, the authors applied a machine-learning framework to predict diagnosis on the basis of gene expression in microdissected kidney tissue from patients with glomerulonephropathies. They found that MN has a glomerular transcriptional signature that distinguishes it from other glomerulonephropathies and identified a set of MN-specific genes differentially expressed across two independent cohorts and robustly recovered in an additional validation cohort. They also found the MN-specific genes are enriched in targets of transcription factor NF-κB and are predominantly expressed in podocytes. This work provides a molecular snapshot of MN and elucidates transcriptional alterations specific to this disease.

Show Abstract

Towards the cellular-scale simulation of motor-driven cytoskeletal assemblies

W. Yan, Saad Ansari, A. Lamson, Matthew A. Glaser, Meredith Betterton, M. Shelley

The cytoskeleton -- a collection of polymeric filaments, molecular motors, and crosslinkers -- is a foundational example of active matter, and in the cell assembles into organelles that guide basic biological functions. Simulation of cytoskeletal assemblies is an important tool for modeling cellular processes and understanding their surprising material properties. Here we present aLENS, a novel computational framework to surmount the limits of conventional simulation methods. We model molecular motors with crosslinking kinetics that adhere to a thermodynamic energy landscape, and integrate the system dynamics while efficiently and stably enforcing hard-body repulsion between filaments -- molecular potentials are entirely avoided in imposing steric constraints. Utilizing parallel computing, we simulate different mixtures of tens to hundreds of thousands of cytoskeletal filaments and crosslinking motors, recapitulating self-emergent phenomena such as bundle formation and buckling, and elucidating how motor type, thermal fluctuations, internal stresses, and confinement determine the evolution of active matter aggregates.

Show Abstract
May 26, 2022

Sex-specific topological differences in germline cell lineage trees

Rocky Diegmiller, J. I. Alsous, S. Shvartsman

A conserved phase of gametogenesis is the development of oocytes and sperm within cell clusters (germline cysts) that arise through serial divisions of a founder cell. The resulting cell lineage trees (CLTs) exhibit diverse topologies across animals and can give rise to numerous emergent behaviors. Despite their centrality, sex-specific differences underlying the evolution and patterning of these cell trees are unknown. In Drosophila melanogaster, oocytes develop within a highly invariant and maximally branched 16-cell tree whose topology is constrained by the fusome – a branched membranous organelle critical for proper mitosis in females; the same division pattern and topology are widely thought to occur during spermatogenesis. Using highly-resolved three-dimensional reconstructions based on a supervised learning algorithm, we show that cell divisions in male cysts can deviate from the maximally branched pattern, leading to greater topological variability. Furthermore, in contrast to females, fusome fragmentation is common, suggesting germ cell mitoses can occur in its absence. These findings thus add to the repertoire of CLT formation strategies, highlighting the diversity of mechanisms employed during gametogenesis in the animal kingdom.

Show Abstract

Exploring the Adjugate Matrix Approach to Quaternion Pose Extraction

Andrew J. Hanson, S. Hanson

Quaternions are important for a wide variety of rotation-related problems in computer graphics, machine vision, and robotics. We study the nontrivial geometry of the relationship between quaternions and rotation matrices by exploiting the adjugate matrix of the characteristic equation of a related eigenvalue problem to obtain the manifold of the space of a quaternion eigenvector. We argue that quaternions parameterized by their corresponding rotation matrices cannot be expressed, for example, in machine learning tasks, as single-valued functions: the quaternion solution must instead be treated as a manifold, with different algebraic solutions for each of several single-valued sectors represented by the adjugate matrix. We conclude with novel constructions exploiting the quaternion adjugate variables to revisit several classic pose estimation applications: 2D point-cloud matching, 2D point-cloud-to-projection matching, 3D point-cloud matching, 3D orthographic point-cloud-to-projection matching, and 3D perspective point-cloud-to-projection matching. We find an exact solution to the 3D orthographic least squares pose extraction problem, and apply it successfully also to the perspective pose extraction problem with results that improve on existing methods.

Show Abstract
May 17, 2022
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates