The deep linear network
Kyushu Probability Seminar
Date
2023.6.12 (Mon)
16:40 ~ 18:00
Venue
IMI Auditorium (W1-D-413)
Speaker
Govind Menon (Brown University)
Abstract
The deep linear network is a matrix model introduced by computer scientists Arora, Cohen and Hazan to capture the effect of overparametrization in deep learning. This talk describes its mathematical structure, which involves an interplay between geometry, dynamics and some probability.
This is joint work with Nadav Cohen (Tel Aviv) and Zsolt Veraszto (Brown).