Deep Learning - Loss and Optimization Part 3

12 - Deep Learning - Loss and Optimization Part 3/ClipID:14187 vorhergehender Clip nächster Clip

Schlüsselworte: Optimization loss artificial intelligence deep learning machine learning pattern recognition Feedforward Networks Gradient descent

Die automatischen Untertitel, die mit Whisper Open AI in diesem Video-Player (und im Multistream-Video-Player) generiert werden, dienen der Bequemlichkeit und Barrierefreiheit. Es ist jedoch zu beachten, dass die Genauigkeit und Interpretation variieren können. Für mehr Informationen lesen Sie bitte die FAQs (Absatz 14)

Aufnahme Datum 2020-04-26

Video CC Herunterladen Clip RSS Feeds

Kurs-Verknüpfung

Deep Learning

Lehrende(r)

Prof. Dr. Andreas Maier

Zugang

Frei

Sprache

Englisch

Einrichtung

Lehrstuhl für Informatik 5 (Mustererkennung)

Produzent

Lehrstuhl für Informatik 5 (Mustererkennung)

Format

Screencapture

Typ

universitäre Vorlesung

Deep Learning - Loss and Optimization Part 3

This video discusses details on optimization and different options in gradient descent procedure such as momentum and ADAM.

Video References:
Lex Fridman's Channel

References

[1] Christopher M. Bishop. Pattern Recognition and Machine Learning (Information Science and Statistics). Secaucus, NJ, USA: Springer-Verlag New York, Inc., 2006.
[2] Anna Choromanska, Mikael Henaff, Michael Mathieu, et al. “The Loss Surfaces of Multilayer Networks.” In: AISTATS. 2015.
[3] Yann N Dauphin, Razvan Pascanu, Caglar Gulcehre, et al. “Identifying and attacking the saddle point problem in high-dimensional non-convex optimization”. In: Advances in neural information processing systems. 2014, pp. 2933–2941.
[4] Yichuan Tang. “Deep learning using linear support vector machines”. In: arXiv preprint arXiv:1306.0239 (2013).
[5] Sashank J. Reddi, Satyen Kale, and Sanjiv Kumar. “On the Convergence of Adam and Beyond”. In: International Conference on Learning Representations. 2018.
[6] Katarzyna Janocha and Wojciech Marian Czarnecki. “On Loss Functions for Deep Neural Networks in Classification”. In: arXiv preprint arXiv:1702.05659 (2017).
[7] Jeffrey Dean, Greg Corrado, Rajat Monga, et al. “Large scale distributed deep networks”. In: Advances in neural information processing systems. 2012, pp. 1223–1231.
[8] Maren Mahsereci and Philipp Hennig. “Probabilistic line searches for stochastic optimization”. In: Advances In Neural Information Processing Systems. 2015, pp. 181–189.
[9] Jason Weston, Chris Watkins, et al. “Support vector machines for multi-class pattern recognition.” In: ESANN. Vol. 99. 1999, pp. 219–224.
[10] Chiyuan Zhang, Samy Bengio, Moritz Hardt, et al. “Understanding deep learning requires rethinking generalization”. In: arXiv preprint arXiv:1611.03530 (2016).

Further Reading:
A gentle Introduction to Deep Learning

Nächstes Video

13 - Deep Learning - Activations, Convolutions, and Pooling Part 1

Prof. Dr. Andreas Maier

2020-04-27

Frei

14 - Deep Learning - Activations, Convolutions, and Pooling Part 2

Prof. Dr. Andreas Maier

2020-04-28

Frei

15 - Deep Learning - Activations, Convolutions, and Pooling Part 3

Prof. Dr. Andreas Maier

2020-05-01

Frei

16 - Deep Learning - Activations, Convolutions, and Pooling Part 4

Prof. Dr. Andreas Maier

2020-05-01

IdM-Anmeldung

17 - Deep Learning - Regularization Part 1

Prof. Dr. Andreas Maier

2020-05-07

Frei

Mehr Videos aus der Kategorie "Technische Fakultät"

1 - Organisatorisches

2024-03-31

IdM-Anmeldung

geschützte Daten

3 - Tutorial Fundamentals WS 23-24 - Teil 1

2024-03-28

IdM-Anmeldung

geschützte Daten

4 - Tutorial Fundamentals WS 23-24 - Teil 2

2024-03-28

IdM-Anmeldung

geschützte Daten

2 - Generative Künstliche Intelligenz

2024-03-13

Frei

freie Daten

5 - Podiumsdiskussion

2024-03-14

Frei

freie Daten

11 - VE7: Gieß- und bearbeitungsgerechte Konstruktion sowie Prozessauslegung in Dauerformverfahren

2024-02-12

IdM-Anmeldung

geschützte Daten