No Arabic abstract
Single-molecule force spectroscopy has proven to be a powerful tool for studying the kinetic behavior of biomolecules. Through application of an external force, conformational states with small or transient populations can be stabilized, allowing them to be characterized and the statistics of individual trajectories studied to provide insight into biomolecular folding and function. Because the observed quantity (force or extension) is not necessarily an ideal reaction coordinate, individual observations cannot be uniquely associated with kinetically distinct conformations. While maximum-likelihood schemes such as hidden Markov models have solved this problem for other classes of single-molecule experiments by using temporal information to aid in the inference of a sequence of distinct conformational states, these methods do not give a clear picture of how precisely the model parameters are determined by the data due to instrument noise and finite-sample statistics, both significant problems in force spectroscopy. We solve this problem through a Bayesian extension that allows the experimental uncertainties to be directly quantified, and build in detailed balance to further reduce uncertainty through physical constraints. We illustrate the utility of this approach in characterizing the three-state kinetic behavior of an RNA hairpin in a stationary optical trap.
Loops are essential secondary structure elements in folded DNA and RNA molecules and proliferate close to the melting transition. Using a theory for nucleic acid secondary structures that accounts for the logarithmic entropy c ln m for a loop of length m, we study homopolymeric single-stranded nucleic acid chains under external force and varying temperature. In the thermodynamic limit of a long strand, the chain displays a phase transition between a low temperature / low force compact (folded) structure and a high temperature / high force molten (unfolded) structure. The influence of c on phase diagrams, critical exponents, melting, and force extension curves is derived analytically. For vanishing pulling force, only for the limited range of loop exponents 2 < c < 2.479 a melting transition is possible; for c <= 2 the chain is always in the folded phase and for 2.479 < c always in the unfolded phase. A force induced melting transition with singular behavior is possible for all loop exponents c < 2.479 and can be observed experimentally by single molecule force spectroscopy. These findings have implications for the hybridization or denaturation of double stranded nucleic acids. The Poland-Scheraga model for nucleic acid duplex melting does not allow base pairing between nucleotides on the same strand in denatured regions of the double strand. If the sequence allows these intra-strand base pairs, we show that for a realistic loop exponent c ~ 2.1 pronounced secondary structures appear inside the single strands. This leads to a lower melting temperature of the duplex than predicted by the Poland-Scheraga model. Further, these secondary structures renormalize the effective loop exponent c^, which characterizes the weight of a denatured region of the double strand, and thus affect universal aspects of the duplex melting transition.
Many proteins in cells are capable of sensing and responding to piconewton scale forces, a regime in which conformational changes are small but significant for biological processes. In order to efficiently and effectively sample the response of these proteins to small forces, enhanced sampling techniques will be required. In this work, we derive, implement, and evaluate an efficient method to simultaneously sample the result of applying any constant pulling force within a specified range to a molecular system of interest. We start from Simulated Tempering in Force, whereby force is applied as a linear bias on a collective variable to the systems Hamiltonian, and the coefficient is taken as a continuous auxiliary degree of freedom. We derive a formula for an average collective-variable-dependent force, which depends on a set of weights, learned on-the-fly throughout a simulation, that reflect the limit where force varies infinitely quickly. These weights can then be used to retroactively compute averages of any observable at any force within the specified range. This technique is based on recent work deriving similar equations for Infinite Switch Simulated Tempering in Temperature, that showed the infinite switch limit is the most efficient for sampling. Here, we demonstrate that our method accurately and simultaneously samples molecular systems at all forces within a user defined force range, and show how it can serve as an enhanced sampling tool for cases where the pulling direction destabilizes states of low free-energy at zero-force. This method is implemented in, and will be freely-distributed with, the PLUMED open-source sampling library, and hence can be readily applied to problems using a wide range of molecular dynamics software packages.
We study a phase transition in parameter learning of Hidden Markov Models (HMMs). We do this by generating sequences of observed symbols from given discrete HMMs with uniformly distributed transition probabilities and a noise level encoded in the output probabilities. By using the Baum-Welch (BW) algorithm, an Expectation-Maximization algorithm from the field of Machine Learning, we then try to estimate the parameters of each investigated realization of an HMM. We study HMMs with n=4, 8 and 16 states. By changing the amount of accessible learning data and the noise level, we observe a phase-transition-like change in the performance of the learning algorithm. For bigger HMMs and more learning data, the learning behavior improves tremendously below a certain threshold in the noise strength. For a noise level above the threshold, learning is not possible. Furthermore, we use an overlap parameter applied to the results of a maximum-a-posteriori (Viterbi) algorithm to investigate the accuracy of the hidden state estimation around the phase transition.
In combining DNA nanotechnology and high-bandwidth single-molecule detection in nanopipettes, we demonstrate an all-electric, label-free hybridisation sensor for short DNA sequences (< 100 nt). Such short fragments are known to occur as circulating cell-free DNA in various bodily fluids, such as blood plasma and saliva, and have been identified as disease markers for cancer and infectious diseases. To this end, we use as a model system a 88-mer target from the RV1910c gene in Mycobacterium tuberculosis that is associated with antibiotic (isoniazid) resistance in TB. Upon binding to short probes attached to long carrier DNA, we show that resistive pulse sensing in nanopipettes is capable of identifying rather subtle structural differences, such as the hybridisation state of the probes, in a statistically robust manner. With significant potential towards multiplexing and high-throughput analysis, our study points towards a new, single-molecule DNA assay technology that is fast, easy to use and compatible with point of care environments.
Diffusion processes in biological membranes are of interest to understand the macromolecular organisation and function of several molecules. Fluorescence Recovery After Photobleaching (FRAP) has been widely used as a method to analyse this processes using classical Brownian diffusion model. In the first part of this work, the analytical expression of the fluorescence recovery as a function of time has been established for anomalous diffusion due to long waiting times. Then, experimental fluorescence recoveries recorded in living cells on a membrane-bound protein have been analysed using three different models : normal Brownian diffusion, Brownian diffusion with an immobile fraction and anomalous diffusion due to long waiting times.