【信通论坛】Understanding the Loss Surface of Neural Networks for Binary Classification

文:Ruoyu Sun,University of Illinois at Urbana-Champaign (UIUC)|图:信通学院| 发布时间: 2018-08-23 00:00:00|

讲座时间:2018-08-28 14:30

讲座地点:科研楼B302

讲座题目:Understanding the Loss Surface of Neural Networks for Binary Classification

主讲人:Ruoyu Sun,University of Illinois at Urbana-Champaign (UIUC)

  内容简介:

  One of the major challenges of training neural networks is the non-convexity of the loss function, which can lead to many local minima. Due to the recent success of deep learning, it is widely conjectured that the local minima of neural networks may lead to similar training performance and thus not a big issue. In this talk, we discuss the loss surface of neural networks for binary classification. We provide a collection of necessary and sufficient conditions under which the neural network problem has no bad local minima. On the positive side, we prove that no bad local minima exist under a few conditions on the neuron types, the neural-network structure (e.g. skip-like connection), the loss function and the dataset. While there seem to be quite a few conditions, on the negative side, we provide dozens of counterexamples which show that bad local minima exist when these conditions do not hold. For example, ReLU neurons lead to bad local minima while increasing and strictly convex neurons (e.g. smooth versions of ReLUs) can eliminate bad local minima.


  主讲人简介:

  944289ec70a3450b522b895ca9b25bbe.jpg

Ruoyu Sun is an assistant professor in the Department of Industrial and Enterprise Systems Engineering Department (ISE) and Coordinate Science Lab (CSL), University of Illinois at Urbana-Champaign. Before joining UIUC, he was a visiting research scientist at Facebook AI Research, and was a postdoctoral researcher at Stanford University. He obtained PhD in electrical engineering from University of Minnesota, and B.S. in mathematics from Peking University. He has won the second place of INFORMS George Nicholson student paper competition, and honorable mention of INFORMS optimization society student paper competition. His research interests lie on optimization, machine learning and signal processing, especially large-scale optimization and non-convex optimization for machine learning.


清水河校区地址:成都市高新区(西区)西源大道2006号 电子科技大学清水河校区科研楼B区

邮编:611731 Email: xintong@uestc.edu.cn

电话:028-61830156 传真:028-61831665

学院官微

分享