# NUMEV Seminar #10 : « Data based modeling and time series prediction using the generalized Langevin equation » – ROLAND NETZ, Free University Berlin

The next NUMEV seminar will take place on Friday, May 12 at 11am at Amphi Moreau, Campus Saint Priest.

**What : « Data based modeling and time series prediction using the generalized Langevin equation » – Roland Netz**, Department of Physics, Free University Berlin, Arnimallee 14, 14195 Berlin, Germany

**When : **11am on Friday, May 12 + Lunch

**Where :** Amphi Moreau, Bât. 2 (LMGC), Campus Saint Priest, 860 rue de Saint Priest, Montpellier

Open to all researchers from all disciplines. Registration is free but mandatory.

**Abstract**

Most systems of scientific interest are interacting many-body systems. One typically describes their kinetics in terms of a low-dimensional reaction coordinate, which in general is influenced by the entire system. The dynamics of such a reaction coordinate is governed by the generalized Langevin equation (GLE), an integro-differential stochastic equation, and involves a memory function, which describes how the dynamics depends on previous values of the reaction coordinate. The GLE is thus an systematic non-Markovian description of the dynamics of a system in terms of coarse-grained variables. We have recently introduced a novel hybrid projection scheme that allows to extract the GLE parameters from time series data in a form that is convenient for analytic and numerical treatments [1]. In the talk I discuss a few examples where the GLE can be used to interpret and model data in different fields of science.

Protein-folding kinetics is typically described as Markovian (i.e., memoryless) diffusion in a one-dimensional free energy landscape, governed by an instantaneous friction coefficient that is fitted to reproduce experimental or simulated folding times. According to this view, the folding time is dominated by the exponential of the folding free energy barrier, the Arrhenius factor, where the friction coefficient only sets the pre-exponential time scale and plays a subordinate role. By analysis of large-scale molecular-dynamics simulation trajectories of fast-folding proteins from the Shaw group using the special-purpose computer ANTON, it is demonstrated that the friction characterizing protein folding exhibits significant memory with a decay time that is of the same order as the folding and unfolding times [2,3]. Non-Markovian modeling not only reproduces simulations accurately but also demonstrates that memory friction effects lead to anomalous and drastically modified protein kinetics. For the set of proteins for which simulations are available, it is shown that the folding and unfolding times are not dominated by the free-energy barrier but rather by the non-Markovian friction.

Memory effects are also present for non-equilibrium systems. Using an appropriate non-equilibrium formulation of the GLE, it is demonstrated that the motion of living organisms is characterized by memory friction, which allows to characterize internal feedback loops of such organisms and to classify and sort individual organisms [4]. The GLE can be even used to predict complex phenomena such as weather data.

[1] Generalized Langevin equation with a nonlinear potential of mean force and nonlinear memory friction from a hybrid projection scheme

Cihan Ayaz , Laura Scalfi , Benjamin A. Dalton, and Roland R. Netz, PHYSICAL REVIEW E 105, 054138 (2022)

[2] Non-Markovian modeling of protein folding

Cihan Ayaza, Lucas Tepper, Florian N. Brünig, Julian Kappler, Jan O. Daldrop, Roland R. Netz, Proc. Natl Acad. Sci. 118, e2023856118 (2021)

[3] Fast protein folding is governed by memory-dependent friction

Benjamin A. Dalton, Cihan Ayaz, Lucas Tepper, and Roland R. Netz (available on Arxiv)

[4] Non-Markovian data-driven modeling of single-cell motility

Bernhard G. Mitterwallner , Christoph Schreiber, Jan O. Daldrop, Joachim O. Rädler, and Roland R. Netz, PHYSICAL REVIEW E 101, 032408 (2020)