Convergence to good non-optimal critical points in the training of neural networks: Gradient descent optimization with one random initialization overcomes all bad non-global local minima with high probability

Ibragimov, S.; Jentzen, A.; Riekert, A.

Research article in digital collection | Preprint

Details about the publication

Name of the repository: arXiv.org

Article number: 2212.13111

Status: submitted / under review

Release year: 2022

Ibragimov, Shokhrukh	Professorship for applied mathematics (Prof. Jentzen)
Jentzen, Arnulf	Institute for Analysis and Numerics
Riekert, Adrian	Mathematical Institute