PLOS Computational Biology
Journal Information
ISSN / EISSN: 1553734X / 15537358
Published by:
Public Library of Science (PLoS)
Total articles ≅ 10,201
Latest articles in this journal
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011174
Abstract:
The entry of influenza virus into the host cell requires fusion of its lipid envelope with the host membrane. It is catalysed by viral hemagglutinin protein, whose fragments called fusion peptides become inserted into the target bilayer and initiate its merging with the viral membrane. Isolated fusion peptides are already capable of inducing lipid mixing between liposomes. Years of studies indicate that upon membrane binding they form bend helical structure whose degree of opening fluctuates between tightly closed hairpin and an extended boomerang. The actual way in which they initiate fusion remains elusive. In this work we employ atomistic simulations of wild type and fusion inactive W14A mutant of influenza fusion peptides confined between two closely apposed lipid bilayers. We characterise peptide induced membrane perturbation and determine the potential of mean force for the formation of the first fusion intermediate, an interbilayer lipid bridge called stalk. Our results demonstrate two routes through which the peptides can lower free energy barrier towards fusion. The first one assumes peptides capability to adopt transmembrane configuration which subsequently promotes the creation of a stalk-hole complex. The second involves surface bound peptide configuration and proceeds owing to its ability to stabilise stalk by fitting into the region of extreme negative membrane curvature resulting from its formation. In both cases, the active peptide conformation corresponds to tight helical hairpin, whereas extended boomerang geometry appears to be unable to provide favourable thermodynamic effect. The latter observation offers plausible explanation for long known inactivity of boomerang-stabilising W14A mutation.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1010263
Abstract:
PNCK, or CAMK1b, is an understudied kinase of the calcium-calmodulin dependent kinase family which recently has been identified as a marker of cancer progression and survival in several large-scale multi-omics studies. The biology of PNCK and its relation to oncogenesis has also begun to be elucidated, with data suggesting various roles in DNA damage response, cell cycle control, apoptosis and HIF-1-alpha related pathways. To further explore PNCK as a clinical target, potent small-molecule molecular probes must be developed. Currently, there are no targeted small molecule inhibitors in pre-clinical or clinical studies for the CAMK family. Additionally, there exists no experimentally derived crystal structure for PNCK. We herein report a three-pronged chemical probe discovery campaign which utilized homology modeling, machine learning, virtual screening and molecular dynamics to identify small molecules with low-micromolar potency against PNCK activity from commercially available compound libraries. We report the discovery of a hit-series for the first targeted effort towards discovering PNCK inhibitors that will serve as the starting point for future medicinal chemistry efforts for hit-to-lead optimization of potent chemical probes.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011175
Abstract:
Machine learning tools have proven useful across biological disciplines, allowing researchers to draw conclusions from large datasets, and opening up new opportunities for interpreting complex and heterogeneous biological data. Alongside the rapid growth of machine learning, there have also been growing pains: some models that appear to perform well have later been revealed to rely on features of the data that are artifactual or biased; this feeds into the general criticism that machine learning models are designed to optimize model performance over the creation of new biological insights. A natural question arises: how do we develop machine learning models that are inherently interpretable or explainable? In this manuscript, we describe the SWIF(r) reliability score (SRS), a method building on the SWIF(r) generative framework that reflects the trustworthiness of the classification of a specific instance. The concept of the reliability score has the potential to generalize to other machine learning methods. We demonstrate the utility of the SRS when faced with common challenges in machine learning including: 1) an unknown class present in testing data that was not present in training data, 2) systemic mismatch between training and testing data, and 3) instances of testing data that have missing values for some attributes. We explore these applications of the SRS using a range of biological datasets, from agricultural data on seed morphology, to 22 quantitative traits in the UK Biobank, and population genetic simulations and 1000 Genomes Project data. With each of these examples, we demonstrate how the SRS can allow researchers to interrogate their data and training approach thoroughly, and to pair their domain-specific knowledge with powerful machine-learning frameworks. We also compare the SRS to related tools for outlier and novelty detection, and find that it has comparable performance, with the advantage of being able to operate when some data are missing. The SRS, and the broader discussion of interpretable scientific machine learning, will aid researchers in the biological machine learning space as they seek to harness the power of machine learning without sacrificing rigor and biological insight.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011072
Abstract:
To address ongoing academic achievement gap, there is a need for more school-university partnerships promoting early access to STEM education. During summer 2020, members of our institute initiated QBio-EDGE (Quantitative Biology—Empowering Diversity and Growth in Education), an outreach program for high schools in Los Angeles. In the hope of contributing to increasing diversity in academia, QBio-EDGE aims to make STEM education more accessible for students from historically excluded communities by exposing them to scientific research and diverse scientist role models. This program is led by early career researchers (ECRs), i.e., undergraduate, graduate, and postdoctoral researchers. In our first year, the outreach activities took place during virtual learning, presenting challenges and opportunities within the program development. Here, we provide a practical guide outlining our outreach efforts, key factors we considered in the program development, and hurdles we overcame. Specifically, we describe how we assembled our diverse team, how we established trusting partnerships with participating schools, and how we designed engaging student-centered, problem-based classroom modules on quantitative biology and computational methods applications to understand living systems. We also discuss the importance of increased institutional support. We hope that this may inspire researchers at all career stages to engage with local schools by participating in science outreach, specifically in quantitative and computational fields. We challenge institutions to actively strengthen these efforts.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011122
Abstract:
Lung adenocarcinoma (LUAD) is a deadly tumor with dynamic evolutionary process. Although much endeavors have been made in identifying the temporal patterns of cancer progression, it remains challenging to infer and interpret the molecular alterations associated with cancer development and progression. To this end, we developed a computational approach to infer the progression trajectory based on cross-sectional transcriptomic data. Analysis of the LUAD data using our approach revealed a linear trajectory with three different branches for malignant progression, and the results showed consistency in three independent cohorts. We used the progression model to elucidate the potential molecular events in LUAD progression. Further analysis showed that overexpression of BUB1B, BUB1 and BUB3 promoted tumor cell proliferation and metastases by disturbing the spindle assembly checkpoint (SAC) in the mitosis. Aberrant mitotic spindle checkpoint signaling appeared to be one of the key factors promoting LUAD progression. We found the inferred cancer trajectory allows to identify LUAD susceptibility genetic variations using genome-wide association analysis. This result shows the opportunity for combining analysis of candidate genetic factors with disease progression. Furthermore, the trajectory showed clear evident mutation accumulation and clonal expansion along with the LUAD progression. Understanding how tumors evolve and identifying mutated genes will help guide cancer management. We investigated the clonal architectures and identified distinct clones and subclones in different LUAD branches. Validation of the model in multiple independent data sets and correlation analysis with clinical results demonstrate that our method is effective and unbiased.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011184
Abstract:
Peripheral nerve stimulation is being investigated as a therapeutic tool in several clinical scenarios. However, the adopted devices have restricted ability to obtain desired outcomes with tolerable off-target effects. Recent promising solutions are not yet employed in clinical practice due to complex required surgeries, lack of long-term stability, and implant invasiveness. Here, we aimed to design a neural interface to address these issues, specifically dimensioned for pudendal and sacral nerves to potentially target sexual, bladder, or bowel dysfunctions. We designed the adaptable intrafascicular radial electrode (AIR) through realistic computational models. They account for detailed human anatomy, inhomogeneous anisotropic conductance, following the trajectories of axons along curving and branching fascicles, and detailed biophysics of axons. The model was validated against available experimental data. Thanks to computationally efficient geometry-based selectivity estimations we informed the electrode design, optimizing its dimensions to obtain the highest selectivity while maintaining low invasiveness. We then compared the AIR with state-of-the-art electrodes, namely InterStim leads, multipolar cuffs and transversal intrafascicular multichannel electrodes (TIME). AIR, comprising a flexible substrate, surface active sites, and radially inserted intrafascicular needles, is designed to be implanted in a few standard steps, potentially enabling fast implants. It holds potential for repeatable stimulation outcomes thanks to its radial structural symmetry. When compared in-silico, AIR consistently outperformed cuff electrodes and InterStim leads in terms of recruitment threshold and stimulation selectivity. AIR performed similarly or better than a TIME, with quantified less invasiveness. Finally, we showed how AIR can adapt to different nerve sizes and varying shapes while maintaining high selectivity. The AIR electrode shows the potential to fill a clinical need for an effective peripheral nerve interface. Its high predicted performance in all the identified requirements was enabled by a model-based approach, readily applicable for the optimization of electrode parameters in any peripheral nerve stimulation scenario.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011105
Abstract:
Single-pulse electrical stimulation in the nervous system, often called cortico-cortical evoked potential (CCEP) measurement, is an important technique to understand how brain regions interact with one another. Voltages are measured from implanted electrodes in one brain area while stimulating another with brief current impulses separated by several seconds. Historically, researchers have tried to understand the significance of evoked voltage polyphasic deflections by visual inspection, but no general-purpose tool has emerged to understand their shapes or describe them mathematically. We describe and illustrate a new technique to parameterize brain stimulation data, where voltage response traces are projected into one another using a semi-normalized dot product. The length of timepoints from stimulation included in the dot product is varied to obtain a temporal profile of structural significance, and the peak of the profile uniquely identifies the duration of the response. Using linear kernel PCA, a canonical response shape is obtained over this duration, and then single-trial traces are parameterized as a projection of this canonical shape with a residual term. Such parameterization allows for dissimilar trace shapes from different brain areas to be directly compared by quantifying cross-projection magnitudes, response duration, canonical shape projection amplitudes, signal-to-noise ratios, explained variance, and statistical significance. Artifactual trials are automatically identified by outliers in sub-distributions of cross-projection magnitude, and rejected. This technique, which we call “Canonical Response Parameterization” (CRP) dramatically simplifies the study of CCEP shapes, and may also be applied in a wide range of other settings involving event-triggered data.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011146
Abstract:
Current agricultural practices facilitate emergence and spread of plant diseases through the wide use of monocultures. Host mixtures are a promising alternative for sustainable plant disease control. Their effectiveness can be partly explained by priming-induced cross-protection among plants. Priming occurs when plants are challenged with non-infective pathogen genotypes, resulting in increased resistance to subsequent infections by infective pathogen genotypes. We developed an epidemiological model to explore how mixing two distinct resistant varieties can reduce disease prevalence. We considered a pathogen population composed of three genotypes infecting either one or both varieties. We found that host mixtures should not contain an equal proportion of resistant plants, but a biased ratio (e.g. 80 : 20) to minimize disease prevalence. Counter-intuitively, the optimal ratio of resistant varieties should contain a lower proportion of the costliest resistance for the pathogen to break. This benefit is amplified by priming. This strategy also prevents the invasion of pathogens breaking all resistances.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011142
Abstract:
Chromosomes are arranged in distinct territories within the nucleus of animal cells. Recent experiments have shown that these territories overlap at their edges, suggesting partial mixing during interphase. Experiments that knock-down of condensin II proteins during interphase indicate increased chromosome mixing, which demonstrates control of the mixing. In this study, we use a generic polymer simulation to quantify the dynamics of chromosome mixing over time. We introduce the chromosome mixing index, which quantifies the mixing of distinct chromosomes in the nucleus. We find that the chromosome mixing index in a small confinement volume (as a model of the nucleus), increases as a power-law of the time, with the scaling exponent varying non-monotonically with self-interaction and volume fraction. By comparing the chromosome mixing index with both monomer subdiffusion due to (non-topological) intermingling of chromosomes as well as even slower reptation, we show that for relatively large volume fractions, the scaling exponent of the chromosome mixing index is related to Rouse dynamics for relatively weak chromosome attractions and to reptation for strong attractions. In addition, we extend our model to more realistically account for the situation of the Drosophila chromosome by including the heterogeneity of the polymers and their lengths to account for microphase separation of euchromatin and heterochromatin and their interactions with the nuclear lamina. We find that the interaction with the lamina further impedes chromosome mixing.
PLOS Computational Biology, Volume 19; https://doi.org/10.1371/journal.pcbi.1011168
Abstract:
Random walks on networks are widely used to model stochastic processes such as search strategies, transportation problems or disease propagation. A prominent example of such process is the dynamics of naive T cells within the lymph node while they are scanning for antigens. The observed T cells trajectories in small sub-volumes of the lymph node are well modeled as a random walk and they have been shown to follow the lymphatic conduit network as substrate for migration. One can then ask how does the connectivity patterns of the lymph node conduit network affect the T cells collective exploration behavior. In particular, does the network display properties that are uniform across the whole volume of the lymph node or can we distinguish some heterogeneities? We propose a workflow to accurately and efficiently define and compute these quantities on large networks, which enables us to characterize heterogeneities within a very large published dataset of Lymph Node Conduit Network. To establish the significance of our results, we compared the results obtained on the lymph node to null models of varying complexity. We identified significantly heterogeneous regions characterized as “remote regions” at the poles and next to the medulla, while a large portion of the network promotes uniform exploration by T cells.