563 Publications

Lower airway dysbiosis affects lung cancer progression

J Tsay,, B Wu, I Sulaiman, K Gershner , R Schluger, Y Li, T Yie, P Meyn, E Olsen, L Perez, B Franca, J Carpenito, T Iizumi, M El-Ashmawy, M Badri, J. Morton, N Shen, L He, G Michaud, S Rafeq, J Bessich, R L Smith, H Sauthoff, K Felner, R Pillai, A-M Zavitsanou, S B Koralov, V Mezzano, C A Loomis, A L Moreira, W Moore, A Tsirigos, A Heguy, W N Rom, D H Sterman, H I Pass , J C Clemente, H Li, R. Bonneau, K-K Wong, T Papagiannakopoulos, L N Degal

In lung cancer, enrichment of the lower airway microbiota with oral commensals commonly occurs and ex vivo models support that some of these bacteria can trigger host transcriptomic signatures associated with carcinogenesis. Here, we show that this lower airway dysbiotic signature was more prevalent in group IIIB-IV TNM stage lung cancer and is associated with poor prognosis, as shown by decreased survival among subjects with early stage disease (I-IIIA) and worse tumor progression as measured by RECIST scores among subjects with IIIB-IV stage disease. In addition, this lower airway microbiota signature was associated with upregulation of IL-17, PI3K, MAPK and ERK pathways in airway transcriptome, and we identified Veillonella parvula as the most abundant taxon driving this association. In a KP lung cancer model, lower airway dysbiosis with V. parvula led to decreased survival, increased tumor burden, IL-17 inflammatory phenotype and activation of checkpoint inhibitor markers.

Show Abstract

Auto-deconvolution and molecular networking of gas chromatography–mass spectrometry data

A. Aksenov, I. Laponogov, Z. Zhang, ..., J. Morton, et. al.

We engineered a machine learning approach, MSHub, to enable auto-deconvolution of gas chromatography–mass spectrometry (GC–MS) data. We then designed workflows to enable the community to store, process, share, annotate, compare and perform molecular networking of GC–MS data within the Global Natural Product Social (GNPS) Molecular Networking analysis platform. MSHub/GNPS performs auto-deconvolution of compound fragmentation patterns via unsupervised non-negative matrix factorization and quantifies the reproducibility of fragmentation patterns across samples.

Show Abstract

Shear-induced dispersion in peristaltic flow

B. Chakrabarti, D. Saintillan

The effective diffusivity of a Brownian tracer in unidirectional flow is well known to be enhanced due to shear by the classic phenomenon of Taylor dispersion. At long times, the average concentration of the tracer follows a simplified advection–diffusion equation with an effective shear-dependent dispersivity. In this work, we make use of the generalized Taylor dispersion theory for periodic domains to analyze tracer dispersion by peristaltic pumping. In channels with small aspect ratios, asymptotic expansions in the lubrication limit are employed to obtain analytical expressions for the dispersion coefficient at both small and high Péclet numbers. Channels of arbitrary aspect ratios are also considered using a boundary integral formulation for the fluid flow coupled to a conservation equation for the effective dispersivity, which is solved using the finite-volume method. Our theoretical calculations, which compare well with results from Brownian dynamics simulations, elucidate the effects of channel geometry and pumping strength on shear-induced dispersion. We further discuss the connection between the present problem and dispersion due to Taylor’s swimming sheet and interpret our results in the purely diffusive regime in the context of Fick–Jacobs theory. Our results provide the theoretical basis for understanding passive scalar transport in peristaltic flow, for instance, in the ureter or in microfluidic peristaltic pumps.

Show Abstract

Protein Structural Alignments From Sequence

J. Morton, C. E.M. Strauss, R. Blackwell, D. Berenberg, V. Gligorijevic, R. Bonneau

Computing sequence similarity is a fundamental task in biology, with alignment forming the basis for the annotation of genes and genomes and providing the core data structures for evolutionary analysis. Standard approaches are a mainstay of modern molecular biology and rely on variations of edit distance to obtain explicit alignments between pairs of biological sequences. However, sequence alignment algorithms struggle with remote homology tasks and cannot identify similarities between many pairs of proteins with similar structures and likely homology. Recent work suggests that using machine learning language models can improve remote homology detection. To this end, we introduce DeepBLAST, that obtains explicit alignments from residue embeddings learned from a protein language model integrated into an end-to-end differentiable alignment framework. This approach can be accelerated on the GPU architectures and outperforms conventional sequence alignment techniques in terms of both speed and accuracy when identifying structurally similar proteins.

Show Abstract
November 4, 2020

Kernel Aggregated Fast Multipole Method: Efficient summation of Laplace and Stokes kernel functions

Many different simulation methods for Stokes flow problems involve a common computationally intense task—the summation of a kernel function over O(N2) pairs of points. One popular technique
is the Kernel Independent Fast Multipole Method (KIFMM), which constructs a spatial adaptive octree and places a small number of equivalent multipole and local points around each octree box, and completes the kernel sum with O(N) performance. However, the KIFMM cannot be used directly with nonlinear kernels, can be inefficient for complicated linear kernels, and in general is difficult to implement compared to less-efficient alternatives such as Ewald-type methods. Here we present the Kernel Aggregated Fast Multipole Method (KAFMM), which overcomes these drawbacks by allowing different kernel functions to be used for specific stages of octree traversal. In many cases a simpler linear kernel suffices during the most extensive stage of octree traversal, even for nonlinear kernel summation problems. The KAFMM thereby improves computational efficiency in general and also allows efficient evaluation of some nonlinear kernel functions such as the regularized Stokeslet. We have implemented our method as an open-source software library STKFMM with support for Laplace kernels, the Stokeslet, regularized Stokeslet, Rotne-Prager-Yamakawa (RPY) tensor, and the Stokes double-layer and traction operators. Open and periodic boundary conditions are supported for all kernels, and the no-slip wall boundary condition is supported for the Stokeslet and RPY tensor.
The package is designed to be ready-to-use as well as being readily extensible to additional kernels. Massive parallelism is supported with mixed OpenMP and MPI.

Show Abstract
October 28, 2020

Specificities of modeling membrane proteins using multi-template homology modeling

J. Koehler, R. Bonneau

Structures of membrane proteins are challenging to determine experimentally and currently represent only about 2% of the structures in the ProteinDataBank. Because of this disparity, methods for modeling membrane proteins are fewer and of lower quality than those for modeling soluble proteins. However, better expression, crystallization, and cryo-EM techniques have prompted a recent increase in experimental structures of membrane proteins, which can act as templates to predict the structure of closely related proteins through homology modeling. Because homology modeling relies on a structural template, it is easier and more accurate than fold recognition methods or de novo modeling, which are used when the sequence similarity between the query sequence and the sequence of related proteins in structural databases is below 25%. In homology modeling, a query sequence is mapped onto the coordinates of a single template and refined. With the increase in available templates, several templates often cover overlapping segments of the query sequence. Multi-template modeling can be used to identify the best template for local segments and join them into a single model. Here we provide a protocol for modeling membrane proteins from multiple templates in the Rosetta software suite. This approach takes advantage of several integrated frameworks, namely RosettaScripts, RosettaCM, and RosettaMP with the membrane scoring function.

Show Abstract
October 23, 2020

Apical stress fibers enable a scaling between cell mechanical response and area in epithelial tissue

J. López-Gay, H. Nunley , M. Spencer, F. di Pietro, B. Guriao, F. Bosveld

Tissue development, homeostasis, and repair require cells to sense mechanical forces. Although many molecular actors implicated in cell mechanosensitivity have been extensively studied, the basis by which cells adapt their mechanical responses to their geometry remains poorly defined. López-Gay et al. now identify how two fundamental epithelial structures—stress fibers and tricellular junctions—endow Drosophila cells with an internal ruler to scale their mechanical response with their area. This work explains how cells of different sizes within an epithelial tissue collectively adapt their mechanical response to control tissue shape and proliferation. Scaling of biological properties with size is a core property of other biological systems.

Show Abstract
Science, 370:eabb2169
October 16, 2020

Computational design of mixed chirality peptide macrocycles with internal symmetry

V. Mulligan, C Kang, M Sawaya, S Rettie, X Li, I Antselovich, T Craven, A Watkins, J Labonte, F DiMaio, T Yeates, D Baker

Cyclic symmetry is frequent in protein and peptide homo‐oligomers, but extremely rare within a single chain, as it is not compatible with free N‐ and C‐termini. Here we describe the computational design of mixed‐chirality peptide macrocycles with rigid structures that feature internal cyclic symmetries or improper rotational symmetries inaccessible to natural proteins. Crystal structures of three C2‐ and C3‐symmetric macrocycles, and of six diverse S2‐symmetric macrocycles, match the computationally‐designed models with backbone heavy‐atom RMSD values of 1 Å or better. Crystal structures of an S4‐symmetric macrocycle (consisting of a sequence and structure segment mirrored at each of three successive repeats) designed to bind zinc reveal a large‐scale zinc‐driven conformational change from an S4‐symmetric apo‐state to a nearly inverted S4‐symmetric holo‐state almost identical to the design model. These symmetric structures provide promising starting points for applications ranging from design of cyclic peptide based metal organic frameworks to creation of high affinity binders of symmetric protein homo‐oligomers. More generally, this work demonstrates the power of computational design for exploring symmetries and structures not found in nature, and for creating synthetic switchable systems.

Show Abstract

SARS-CoV-2 receptor networks in diabetic and COVID-19 associated kidney disease

R Menon, E Otto, R. Sealfon, V Nair, A. Wong, C Theesfeld, X. Chen, Y Wang, A Boppana, J Luo, Y Yang, P Kasson, J Schaub, C Berthier, S Eddy , C Lienczewski , B Godfrey, S Dagenais , R Sohaney, J Hartman, D Fermin, L Subramanian, H Looker , J Harder, L Mariani, J Hodgin, J Sexton, C Wobus , A Naik, R Nelson, O. Troyanskaya, M Kretzler

COVID-19 morbidity and mortality are increased via unknown mechanisms in patients with diabetes and kidney disease. SARS-CoV-2 uses angiotensin-converting enzyme 2 (ACE2) for entry into host cells. Because ACE2 is a susceptibility factor for infection, we investigated how diabetic kidney disease and medications alter ACE2 receptor expression in kidneys. Single cell RNA profiling of kidney biopsies from healthy living donors and patients with diabetic kidney disease revealed ACE2 expression primarily in proximal tubular epithelial cells. This cell-specific localization was confirmed by in situ hybridization. ACE2 expression levels were unaltered by exposures to renin-angiotensin-aldosterone system inhibitors in diabetic kidney disease. Bayesian integrative analysis of a large compendium of public -omics datasets identified molecular network modules induced in ACE2-expressing proximal tubular epithelial cells in diabetic kidney disease (searchable at hb.flatironinstitute.org/covid-kidney) that were linked to viral entry, immune activation, endomembrane reorganization, and RNA processing. The diabetic kidney disease ACE2-positive proximal tubular epithelial cell module overlapped with expression patterns seen in SARS-CoV-2–infected cells. Similar cellular programs were seen in ACE2-positive proximal tubular epithelial cells obtained from urine samples of 13 hospitalized patients with COVID-19, suggesting a consistent ACE2-coregulated proximal tubular epithelial cell expression program that may interact with the SARS-CoV-2 infection processes. Thus SARS-CoV-2 receptor networks can seed further research into risk stratification and therapeutic strategies for COVID-19–related kidney damage.

Show Abstract

Identification of new therapeutic targets in CRLF2-overexpressing B-ALL through discovery of TF-gene regulatory interactions

S. Badri, B. Carella, P. Lhoumaud, D. Castro, C. Skok Gibbs, R. Raviram, S. Narang, N. Evensen, A. Watters, W. Carroll, R. Bonneau, J. Skok

Although genetic alterations are initial drivers of disease, aberrantly activated transcriptional regulatory programs are often responsible for the maintenance and progression of cancer. CRLF2-overexpression in B-ALL patients leads to activation of JAK-STAT, PI3K and ERK/MAPK signaling pathways and is associated with poor outcome. Although inhibitors of these pathways are available, there remains the issue of treatment-associated toxicities, thus it is important to identify new therapeutic targets. Using a network inference approach, we reconstructed a B-ALL specific transcriptional regulatory network to evaluate the impact of CRLF2-overexpression on downstream regulatory interactions.

Comparing RNA-seq from CRLF2-High and other B-ALL patients (CRLF2-Low), we defined a CRLF2-High gene signature. Patient-specific chromatin accessibility was interrogated to identify altered putative regulatory elements that could be linked to transcriptional changes. To delineate these regulatory interactions, a B-ALL cancer-specific regulatory network was inferred using 868 B-ALL patient samples from the NCI TARGET database coupled with priors generated from ATAC-seq peak TF-motif analysis. CRISPRi, siRNA knockdown and ChIP-seq of nine TFs involved in the inferred network were analyzed to validate predicted TF-gene regulatory interactions.

In this study, a B-ALL specific regulatory network was constructed using ATAC-seq derived priors. Inferred interactions were used to identify differential patient-specific transcription factor activities predicted to control CRLF2-High deregulated genes, thereby enabling identification of new potential therapeutic targets.

Show Abstract
September 26, 2020
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates