Convergence to good non-optimal critical points in the training of neural networks: Gradient descent optimization with one random initialization overcomes all bad non-global local minima with high probability

Ibragimov, S.; Jentzen, A.; Riekert, A.

Research article in digital collection | Preprint

Details about the publication

Name of the repositoryarXiv.org
Article number2212.13111
Statussubmitted / under review
Release year2022
Link to the full texthttps://arxiv.org/abs/2212.13111

Authors from the University of Münster

Ibragimov, Shokhrukh
Jentzen, Arnulf
Riekert, Adrian