Convergence to good non-optimal critical points in the training of neural networks: Gradient descent optimization with one random initialization overcomes all bad non-global local minima with high probability

Ibragimov, S.; Jentzen, A.; Riekert, A.

Research article in digital collection | Preprint | Peer reviewed

Details about the publication

Name of the
Article number: 2212.13111
Status: submitted / under review
Release year: 2022
Link to the full text

Authors from the University of Münster

Ibragimov, Shokhrukh: Professorship for applied mathematics (Prof. Jentzen)
Jentzen, Arnulf: Institute for Analysis and Numerics
Riekert, Adrian: Mathematical Institute