A proof of convergence for the gradient descent optimization method with random initializations in the training of neural networks with ReLU activation for piecewise linear target functions

Jentzen, Arnulf; Riekert, Adrian

Research article (journal) | Peer reviewed

Details about the publication

Journal: Journal of Machine Learning Research
Volume: 23
Issue: 260
Page range: 1-50
Status: Published
Release year: 2022
Language in which the publication is written: English
Link to the full text: https://jmlr.org/papers/v23/21-0962.html
Keywords: Gradient descent; Artificial neural networks; Non-convex optimization

Authors from the University of Münster

Jentzen, Arnulf
Institute for Analysis and Numerics
Riekert, Adrian
Mathematical Institute