Abstract
The main aim of this paper is to conduct the convergence analysis of the gradient descent for two-layer physics-informed neural networks (PINNs). Here, the loss function involves derivatives of neural network outputs with respect to its inputs, so the interaction between the trainable parameters is more complicated compared with simple regression and classification tasks. We first develop the positive definiteness of Gram matrices and prove that the gradient flow finds the global optima of the empirical loss under over-parameterization. Then, we demonstrate that the standard gradient descent converges to the global optima of the loss with proper choices of learning rates. The framework of our analysis works for various categories of PDEs (e.g., linear second-order PDEs) and common types of network initialization (LecunUniform etc.). Our theoretical results do not need a very strict hypothesis for training samples and have a looser requirement on the network width compared with some previous works.
Original language | English |
---|---|
Title of host publication | Proceedings of the 40th International Conference on Machine Learning |
Editors | Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett |
Publisher | ML Research Press |
Pages | 10676-10707 |
Number of pages | 32 |
Publication status | Published - Jul 2023 |
Event | 40th International Conference on Machine Learning, ICML 2023 - Honolulu, United States Duration: 23 Jul 2023 → 29 Jul 2023 https://icml.cc/Conferences/2023 https://proceedings.mlr.press/v202/ https://openreview.net/group?id=ICML.cc/2023/Conference |
Publication series
Name | Proceedings of Machine Learning Research |
---|---|
Publisher | ML Research Press |
Volume | 202 |
ISSN (Print) | 2640-3498 |
Conference
Conference | 40th International Conference on Machine Learning, ICML 2023 |
---|---|
Country/Territory | United States |
City | Honolulu |
Period | 23/07/23 → 29/07/23 |
Internet address |
Scopus Subject Areas
- Artificial Intelligence
- Software
- Control and Systems Engineering
- Statistics and Probability