mathematical optimization

On the resolution of a theoretical question related to the nature of local training in federated learning

Peter Richtarik, Professor, Computer Science

Sep 13, 15:30 - 17:00

B1 L3 R3119

machine learning mathematical optimization communications algorithms

In this talk, I will explain the problem, its solution, and some subsequent work generalizing, extending and improving the ProxSkip method in various ways. We study distributed optimization methods based on the local training (LT) paradigm - achieving improved communication efficiency by performing richer local gradient-based training on the clients before parameter averaging - which is of key importance in federated learning. Looking back at the progress of the field in the last decade, we identify 5 generations of LT methods: 1) heuristic, 2) homogeneous, 3) sublinear, 4) linear, and 5) accelerated. The 5th generation, initiated by the ProxSkip method of Mishchenko et al (2022) and its analysis, is characterized by the first theoretical confirmation that LT is a communication acceleration mechanism.

Optimization for Deep Learning

Dr. Konstantin Mischenko, Samsung AI Center, Cambridge, UK

May 7, 16:00 - 17:00

B5 L5 R5209

Deep learning mathematical optimization

Abstract The field of optimization for machine learning has undergone significant changes in recent years with deep learning models increasing in scale and fine-tuning taking a more prominent role. In this presentation, I will share a perspective on the direction of changes in the field and highlight interesting research directions. I will provide real-world examples of what practitioners want from optimization methods to train deep networks at scale. I will then present my recent work on adaptive methods, such as Adam and Adagrad, and explain how we can estimate the learning rate for these

Jongho Park

Research Scientist, Applied Mathematics and Computational Science

Computational mathematics numerical analysis mathematical optimization machine learning Scientific Machine Learning computational imaging

Jongho Park is a Research Scientist at King Abdullah University of Science and Technology (KAUST) where his research focuses on the design and analysis of efficient numerical methods for variational problems.