There is a wave of interest in using unsupervised neural networks for solving differential equations. The existing methods are based on feed-forward networks, while recurrent neural network differential equation solvers have not yet been reported. We introduce an unsupervised reservoir computing (RC) approach, based on an echo-state recurrent neural network, capable of discovering approximate solutions that satisfy ordinary differential equations (ODEs). We suggest an approach to calculate time derivatives of recurrent neural network outputs without using backpropagation. The internal weights of an RC are fixed, while only a linear output layer is trained, yielding efficient training. However, RC performance strongly depends on finding the optimal hyper-parameters, which is a computationally expensive process. We use Bayesian optimization to efficiently discover optimal sets in a high-dimensional hyper-parameter space and numerically show that one set is robust and can be used to solve an ODE for different initial conditions and time ranges. A closed-form formula for the optimal output weights is derived to solve first-order linear equations in a backpropagation-free learning process. We extend the RC approach to solving nonlinear systems of ODEs using a hybrid optimization method consisting of gradient descent and Bayesian optimization. Evaluation of linear and nonlinear systems of equations demonstrates the efficiency of the RC ODE solver.
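Below is a minimal NumPy sketch of the closed-form readout described above, for a first-order linear ODE. The leaky echo-state dynamics, reservoir size, spectral radius, and the initial-condition penalty weight are illustrative assumptions, not the paper's tuned hyper-parameters (which it finds via Bayesian optimization).

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical hyper-parameters; the paper tunes these with Bayesian optimization.
N, tau, rho = 200, 0.1, 0.9
W = rng.normal(size=(N, N))
W *= rho / np.max(np.abs(np.linalg.eigvals(W)))   # set spectral radius
w_in = rng.uniform(-1.0, 1.0, size=N)             # input weights (driven by time t)
b = rng.uniform(-1.0, 1.0, size=N)                # biases

def reservoir_states(ts):
    """Integrate the leaky-ESN dynamics h' = (-h + tanh(W h + w_in t + b)) / tau.
    The right-hand side also gives h'(t) in closed form, so the time derivative
    of the network output W_out h(t) needs no backpropagation."""
    h = rng.uniform(-0.5, 0.5, size=N)
    H, dH = [], []
    for i, t in enumerate(ts):
        dh = (-h + np.tanh(W @ h + w_in * t + b)) / tau
        H.append(h.copy())
        dH.append(dh)
        if i + 1 < len(ts):
            h = h + (ts[i + 1] - t) * dh
    return np.array(H), np.array(dH)

# Solve u' + lam*u = f(t), u(0) = u0, with u(t) = H(t) @ w_out; the residual is
# linear in w_out, so the optimal readout is a closed-form least-squares fit.
lam, u0 = 2.0, 1.0
f = lambda t: np.sin(t)
ts = np.linspace(0.0, 5.0, 500)
H, dH = reservoir_states(ts)

A = np.vstack([dH + lam * H, 100.0 * H[0]])       # last row enforces u(0) = u0
rhs = np.append(f(ts), 100.0 * u0)
w_out, *_ = np.linalg.lstsq(A, rhs, rcond=None)
u = H @ w_out                                     # approximate solution on the grid
```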
The Reynolds-averaged Navier-Stokes (RANS) equations are widely used in turbulence applications. They require accurately modeling the anisotropic Reynolds stress tensor, for which traditional Reynolds stress closure models only yield reliable results in some flow configurations. In the last few years, there has been a surge of work aiming at using data-driven approaches to tackle this problem. The majority of previous work has focused on the development of fully-connected networks for modeling the anisotropic Reynolds stress tensor. In this paper, we expand upon recent work for turbulent channel flow and develop new convolutional neural network (CNN) models that are able to accurately predict the normalized anisotropic Reynolds stress tensor. We apply the new CNN model to a number of one-dimensional turbulent flows. Additionally, we present interpretability techniques that help drive the model design and provide guidance on the model behavior in relation to the underlying physics.
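As a rough illustration of the kind of model described above, the following PyTorch sketch maps one-dimensional mean-flow profiles to components of the normalized anisotropy tensor using 1-D convolutions; the channel counts, depth, and inputs are assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn

class AnisotropyCNN(nn.Module):
    """Maps wall-normal profiles of mean-flow inputs (n_in channels) to
    components of the normalized anisotropy tensor (n_out channels),
    pointwise along a 1-D grid."""
    def __init__(self, n_in=2, n_out=4, width=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(n_in, width, kernel_size=5, padding=2), nn.ReLU(),
            nn.Conv1d(width, width, kernel_size=5, padding=2), nn.ReLU(),
            nn.Conv1d(width, n_out, kernel_size=1),  # 1x1 conv output layer
        )
    def forward(self, x):                # x: (batch, n_in, n_y)
        return self.net(x)

# Training-step sketch: fit against reference (e.g. DNS) anisotropy profiles.
model = AnisotropyCNN()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.randn(8, 2, 128)               # stand-in mean-flow input profiles
b_ref = torch.randn(8, 4, 128)            # stand-in reference targets
loss = nn.functional.mse_loss(model(x), b_ref)
loss.backward(); opt.step()
```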
Characterizing the fundamental parameters of stars from observations is crucial for studying the stars themselves, their planets, and the galaxy as a whole. Stellar evolution theory, which predicts the properties of stars as a function of stellar age and mass, enables translating observables into physical stellar parameters by fitting the observed data to synthetic isochrones. However, the complexity of overlapping evolutionary tracks often makes this task numerically challenging, with a precision that varies strongly across the parameter space in which the observation lies. This work presents StelNet, a Deep Neural Network trained on stellar evolutionary tracks that quickly and accurately predicts mass and age from absolute luminosity and effective temperature for stars with close to solar metallicity. The underlying model makes no assumption on the evolutionary stage and includes the pre-main-sequence phase. We use bootstrapping, training many models to quantify the uncertainty of the model. To break the model's intrinsic degeneracy resulting from overlapping evolutionary paths, we also built a hierarchical model that retrieves realistic posterior probability distributions of the stellar mass and age. We further train and test StelNet on a sample of stars with well-determined masses and ages from the literature.
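A minimal sketch of the bootstrapped-ensemble idea: train many small regressors from (luminosity, temperature) to (mass, age) on resampled training data and read the ensemble spread as uncertainty. The network shape and the stand-in data are assumptions; StelNet's actual architecture and hierarchical model are described in the paper.

```python
import torch
import torch.nn as nn

def make_mlp():
    # Small regressor: (log L, T_eff) -> (mass, age); depth/width are guesses.
    return nn.Sequential(nn.Linear(2, 64), nn.ReLU(),
                         nn.Linear(64, 64), nn.ReLU(),
                         nn.Linear(64, 2))

def fit(model, X, Y, epochs=200):
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    for _ in range(epochs):
        opt.zero_grad()
        loss = nn.functional.mse_loss(model(X), Y)
        loss.backward(); opt.step()
    return model

# Bootstrap: resample the training set with replacement and train one model
# per resample; the spread of the ensemble quantifies model uncertainty.
X = torch.randn(1000, 2); Y = torch.randn(1000, 2)   # stand-in track data
ensemble = []
for _ in range(10):
    idx = torch.randint(0, len(X), (len(X),))
    ensemble.append(fit(make_mlp(), X[idx], Y[idx]))

with torch.no_grad():
    preds = torch.stack([m(X[:5]) for m in ensemble])
mean, std = preds.mean(0), preds.std(0)   # point estimate and uncertainty
```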
In certain situations, Neural Networks (NNs) are trained on data that obey underlying physical symmetries. However, an NN is not guaranteed to obey a symmetry unless that symmetry is embedded in its structure. In this work, we explore a special kind of symmetry in which functions are invariant, up to a parity $p = \pm 1$, under involutory linear/affine transformations. We develop mathematical theorems and propose NN architectures that ensure invariance and universal approximation properties. Numerical experiments indicate that the proposed models outperform baseline networks while respecting the imposed symmetry. We also propose an adaptation of our technique to convolutional NN classification tasks for datasets with inherent horizontal/vertical reflection symmetry.
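One standard way to realize such invariance, consistent with the setting above, is to symmetrize an arbitrary network g over the involution: with T squared equal to the identity and p squared equal to 1, f(x) = (g(x) + p g(Tx))/2 satisfies f(Tx) = p f(x) exactly. The PyTorch sketch below is illustrative; the paper's proposed architectures may differ.

```python
import torch
import torch.nn as nn

class ParitySymmetrized(nn.Module):
    """Wraps an arbitrary network g so that f(T x) = p * f(x) exactly, for an
    involutory linear map T (T @ T = I) and parity p = +/-1:
        f(x) = (g(x) + p * g(T x)) / 2
    => f(T x) = (g(T x) + p * g(x)) / 2 = p * f(x), using p**2 == 1."""
    def __init__(self, g, T, p=1.0):
        super().__init__()
        self.g, self.p = g, p
        self.register_buffer("T", T)
    def forward(self, x):
        return 0.5 * (self.g(x) + self.p * self.g(x @ self.T.T))

# Example: reflection of the first coordinate, even parity.
T = torch.diag(torch.tensor([-1.0, 1.0]))
g = nn.Sequential(nn.Linear(2, 32), nn.Tanh(), nn.Linear(32, 1))
f = ParitySymmetrized(g, T, p=1.0)

x = torch.randn(4, 2)
print(torch.allclose(f(x @ T.T), f(x)))   # True: invariance by construction
```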
The activation function plays a fundamental role in the learning process of an artificial neural network. However, there is no obvious, problem-independent choice or procedure for determining the best activation function. This study proposes a new artificial neuron, named the global-local neuron, with a trainable activation function composed of two components, a global one and a local one. The term 'global component' refers here to a mathematical function describing a general feature present across the whole problem domain. The 'local component' is a function that can represent localized behavior, such as a transient or a perturbation. The new neuron learns the relative importance of each activation component during training: depending on the problem, it yields a purely global, a purely local, or a mixed global-local activation function after the training phase. Here, the trigonometric sine function was employed for the global component and the hyperbolic tangent for the local component. The proposed neuron was tested on problems where the target was a purely global function, a purely local function, or a composition of global and local functions. Two classes of test problems were investigated: regression problems and the solution of differential equations. The experiments demonstrated the superior performance of global-local neuron networks compared with simple neural networks using sine or hyperbolic tangent activation functions, and with a hybrid network combining these two simple networks.
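A minimal sketch of a global-local neuron as described: a linear map followed by a trainable mixture of a sine (global) and a hyperbolic tangent (local) component. The exact parameterization below, with per-neuron mixing weights and no normalization, is an assumption.

```python
import torch
import torch.nn as nn

class GlobalLocalNeuron(nn.Module):
    """Linear layer with a trainable mixed activation
        phi(z) = alpha * sin(z) + beta * tanh(z),
    where alpha (global, periodic) and beta (local) are learned, so training
    can settle on a purely global, purely local, or mixed activation."""
    def __init__(self, n_in, n_out):
        super().__init__()
        self.linear = nn.Linear(n_in, n_out)
        self.alpha = nn.Parameter(torch.ones(n_out))   # weight of sin component
        self.beta = nn.Parameter(torch.ones(n_out))    # weight of tanh component
    def forward(self, x):
        z = self.linear(x)
        return self.alpha * torch.sin(z) + self.beta * torch.tanh(z)

# Drop-in layer: e.g. a small regression network.
net = nn.Sequential(GlobalLocalNeuron(1, 32), nn.Linear(32, 1))
```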
Machine learning models have emerged as powerful tools in physics and engineering. Although these models are flexible, a fundamental challenge remains: how to connect them with known physics. In this work, we present an autoencoder with latent space penalization, which discovers finite-dimensional manifolds underlying the partial differential equations of physics. We test this method on the Kuramoto-Sivashinsky (K-S), Korteweg-de Vries (KdV), and damped KdV equations. We show that the resulting optimal latent space of the K-S equation is consistent with the dimension of the inertial manifold. The results for the KdV equation imply that there is no reduced latent space, which is consistent with the truly infinite-dimensional dynamics of the KdV equation. In the case of the damped KdV equation, we find that the number of active dimensions decreases with increasing damping coefficient. We then uncover a nonlinear basis representing the manifold of the latent space for the K-S equation.
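As an illustrative sketch: an autoencoder whose latent coordinates carry a sparsity penalty will drive dimensions the dynamics do not need toward zero variance, so counting the surviving "active" dimensions estimates the manifold dimension. The L1 penalty, weights, and threshold below are assumptions, not necessarily the paper's exact penalization scheme.

```python
import torch
import torch.nn as nn

class PenalizedAE(nn.Module):
    def __init__(self, n_state=64, n_latent=16):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(n_state, 128), nn.ReLU(),
                                 nn.Linear(128, n_latent))
        self.dec = nn.Sequential(nn.Linear(n_latent, 128), nn.ReLU(),
                                 nn.Linear(128, n_state))
    def forward(self, x):
        z = self.enc(x)
        return self.dec(z), z

model = PenalizedAE()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.randn(256, 64)                 # stand-in for PDE snapshots on a grid
recon, z = model(x)
# Reconstruction loss plus a sparsity penalty on the latent coordinates;
# dimensions the dynamics do not need are pushed toward zero variance.
loss = nn.functional.mse_loss(recon, x) + 1e-3 * z.abs().mean()
loss.backward(); opt.step()

# "Active" latent dimensions: those whose variance survives the penalty.
active = (z.var(dim=0) > 1e-4).sum()
```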
Eigenvalue problems are critical to several fields of science and engineering. We present a novel unsupervised neural network for discovering eigenfunctions and eigenvalues for differential eigenvalue problems with solutions that identically satisfy the boundary conditions. A scanning mechanism is embedded allowing the method to find an arbitrary number of solutions. The network optimization is data-free and depends solely on the predictions. The unsupervised method is used to solve the quantum infinite well and quantum oscillator eigenvalue problems.
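A minimal sketch for the infinite square well: the ansatz x(L - x) N(x) satisfies the boundary conditions identically, and the residual of the eigenvalue equation is minimized over both the network and a trainable eigenvalue E. The scanning mechanism for finding multiple eigenpairs is omitted here, and the normalization penalty is one common way to avoid the trivial zero solution.

```python
import torch
import torch.nn as nn

# Infinite square well on [0, L]: -psi'' = E psi, psi(0) = psi(L) = 0.
L = 1.0
net = nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, 1))
E = nn.Parameter(torch.tensor(5.0))          # trainable eigenvalue
opt = torch.optim.Adam(list(net.parameters()) + [E], lr=1e-3)

def psi(x):
    # The factor x*(L - x) makes the boundary conditions hold identically.
    return x * (L - x) * net(x)

for step in range(5000):
    x = torch.rand(128, 1) * L
    x.requires_grad_(True)
    p = psi(x)
    dp = torch.autograd.grad(p.sum(), x, create_graph=True)[0]
    d2p = torch.autograd.grad(dp.sum(), x, create_graph=True)[0]
    residual = (-d2p - E * p).pow(2).mean()      # data-free, prediction-only loss
    norm = (p.pow(2).mean() - 1.0).pow(2)        # keep psi away from zero
    loss = residual + norm
    opt.zero_grad(); loss.backward(); opt.step()

# E typically converges to the eigenvalue nearest its initialization
# (pi**2 for the ground state with L = 1 in these units).
```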
Clustering is a fundamental task in unsupervised learning that depends heavily on the data representation that is used. Deep generative models have appeared as a promising tool to learn informative low-dimensional data representations. We propose Matching Priors and Conditionals for Clustering (MPCC), a GAN-based model with an encoder to infer latent variables and cluster categories from data, and a flexible decoder to generate samples from a conditional latent space. With MPCC we demonstrate that a deep generative model can be competitive with, or superior to, discriminative methods on clustering tasks, surpassing the state of the art over a diverse set of benchmark datasets. Our experiments show that adding a learnable prior and increasing the number of encoder updates improve the quality of the generated samples, obtaining an inception score of 9.49 $\pm$ 0.15 and improving the Fréchet inception distance over the state of the art by 46.9% on CIFAR10.
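The following sketch illustrates only the learnable-prior idea: a per-cluster Gaussian whose parameters train jointly with the rest of the model via the reparameterization trick. MPCC's full generator/encoder/discriminator setup is in the paper; the class name and sizes here are assumptions.

```python
import torch
import torch.nn as nn

class LearnablePrior(nn.Module):
    """Conditional latent prior p(z | y): one learnable Gaussian per cluster.
    Sampling draws a cluster y, then z = mu_y + sigma_y * eps, so gradients
    flow into the prior parameters during training."""
    def __init__(self, n_clusters=10, n_latent=64):
        super().__init__()
        self.mu = nn.Parameter(torch.randn(n_clusters, n_latent))
        self.log_sigma = nn.Parameter(torch.zeros(n_clusters, n_latent))
    def forward(self, batch):
        y = torch.randint(0, self.mu.shape[0], (batch,))
        eps = torch.randn(batch, self.mu.shape[1])
        z = self.mu[y] + self.log_sigma[y].exp() * eps
        return y, z

prior = LearnablePrior()
y, z = prior(32)   # cluster labels and latents to feed a generator
# A generator maps z -> x; an encoder maps x back to (z, y), and matching
# these distributions yields the cluster assignment at inference time.
```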
Solutions to differential equations are of significant scientific and engineering relevance. Recently, there has been a growing interest in solving differential equations with neural networks. This work develops a novel method for solving differential equations with unsupervised neural networks that applies Generative Adversarial Networks (GANs) to learn the loss function for optimizing the neural network. We present empirical results showing that our method, which we call Differential Equation GAN (DEQGAN), can obtain mean squared errors multiple orders of magnitude lower than an alternative unsupervised neural network method based on (squared) $L_2$, $L_1$, and Huber loss functions. Moreover, we show that DEQGAN achieves solution accuracy that is competitive with traditional numerical methods. Finally, we analyze the stability of our approach and find it to be sensitive to the selection of hyperparameters, which we provide in the appendix. Code is available at https://github.com/dylanrandle/denn. Please address any electronic correspondence to [email protected].
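A minimal sketch of the adversarial idea for an ODE u' = f(t, u): the equation residual plays the role of "fake" samples, zeros play the role of "real" samples, and the discriminator becomes a learned loss driving the residual toward zero. How DEQGAN actually defines the real/fake pairs and its training schedule follow the paper; this is an assumption-laden toy version.

```python
import torch
import torch.nn as nn

# Generator approximates u(t); discriminator is the learned loss that tries
# to tell equation residuals apart from zeros.
G = nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, 1))
D = nn.Sequential(nn.Linear(1, 32), nn.LeakyReLU(0.2), nn.Linear(32, 1))
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

f = lambda t, u: -u                       # example ODE: u' = -u, u(0) = 1

for step in range(2000):
    t = torch.rand(64, 1, requires_grad=True)
    u = 1.0 + t * G(t)                    # ansatz hard-enforces u(0) = 1
    du = torch.autograd.grad(u.sum(), t, create_graph=True)[0]
    residual = du - f(t, u)

    # Discriminator: residuals are "fake", zeros are "real".
    d_loss = bce(D(residual.detach()), torch.zeros(64, 1)) + \
             bce(D(torch.zeros(64, 1)), torch.ones(64, 1))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Generator: make residuals indistinguishable from "real" zeros.
    g_loss = bce(D(residual), torch.ones(64, 1))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```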
The time evolution of dynamical systems is frequently described by ordinary differential equations (ODEs), which must be solved for given initial conditions. Most standard approaches numerically integrate an ODE, producing a single solution whose values are computed at discrete times. When many solutions of the ODE with different initial conditions are required, the computational cost can become significant. We propose that a neural network be used as a solution bundle: a collection of solutions to an ODE for various initial states and system parameters. The neural network solution bundle is trained with an unsupervised loss that does not require any prior knowledge of the sought solutions, and the resulting object is differentiable in the initial conditions and system parameters. The solution bundle exhibits fast, parallelizable evaluation of the system state, facilitating the use of Bayesian inference for parameter estimation in real dynamical systems.
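A minimal sketch of a solution bundle for a one-parameter family u' = -theta*u: the network takes (t, u0, theta) as input, an ansatz enforces the initial condition exactly, and the unsupervised residual loss requires no precomputed solutions. The specific ODE family, sampling ranges, and ansatz are illustrative assumptions.

```python
import torch
import torch.nn as nn

# Solution bundle: one network maps (t, u0, theta) -> u(t; u0, theta).
net = nn.Sequential(nn.Linear(3, 64), nn.Tanh(), nn.Linear(64, 64), nn.Tanh(),
                    nn.Linear(64, 1))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

f = lambda t, u, theta: -theta * u        # example family: u' = -theta*u

for step in range(3000):
    t = (torch.rand(256, 1) * 5.0).requires_grad_(True)   # time range
    u0 = torch.rand(256, 1) * 2.0 - 1.0                   # initial conditions
    theta = torch.rand(256, 1) * 2.0                      # system parameters
    # The ansatz u = u0 + t * N(...) satisfies u(0) = u0 for every member.
    u = u0 + t * net(torch.cat([t, u0, theta], dim=1))
    du = torch.autograd.grad(u.sum(), t, create_graph=True)[0]
    loss = (du - f(t, u, theta)).pow(2).mean()            # unsupervised residual
    opt.zero_grad(); loss.backward(); opt.step()

# After training, the bundle evaluates in parallel for any (t, u0, theta) and
# is differentiable in u0 and theta, which Bayesian parameter inference needs.
```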