Marc Deisenroth

Google DeepMind Chair of Machine Learning and Artificial Intelligence

University College London

Biography

Professor Marc Deisenroth is the Google DeepMind Chair of Machine Learning and Artificial Intelligence at University College London and part of the UNESCO Chair on Artificial Intelligence at UCL. He also holds a visiting faculty position at the University of Johannesburg. Marc co-leads the Sustainability and Machine Learning Group at UCL. His research interests center around data-efficient machine learning, probabilistic modeling and autonomous decision making with applications in climate/weather science, nuclear fusion, and robotics.

Marc was Program Chair of EWRL 2012, Workshops Chair of RSS 2013, EXPO Chair at ICML 2020, Tutorials Chair at NeurIPS 2021, and Program Chair at ICLR 2022. He is an elected member of the ICML Board and serves on the Scientific Advisory Boards of the National Oceanography Centre as well as the United Nations University Global AI Network. He received Paper Awards at ICRA 2014, ICCAS 2016, ICML 2020, AISTATS 2021, and FAccT 2023. In 2019, Marc co-organized the Machine Learning Summer School in London.

In 2018, Marc received The President’s Award for Outstanding Early Career Researcher at Imperial College. He is a recipient of a Google Faculty Research Award and a Microsoft PhD Grant.

In 2018, Marc spent four months at the African Institute for Mathematical Sciences (Rwanda), where he taught a course on Foundations of Machine Learning as part of the African Masters in Machine Intelligence. He is co-author of the book Mathematics for Machine Learning, published by Cambridge University Press.

Research Expertise

Machine Learning: Data-efficient machine learning, Gaussian processes, reinforcement learning, Bayesian optimization, approximate inference, deep probabilistic models, geo-spatial models

Environment and Sustainability: Data assimilation, data-driven forecasting models, renewables

Robotics and Control: Robot learning, legged locomotion, planning under uncertainty, imitation learning, adaptive control, robust control, learning control, optimal control

Signal Processing: Nonlinear state estimation, Kalman filtering, time-series modeling, dynamical systems, system identification, stochastic information processing

Key Publications

Vignesh Gopakumar, Ander Gray, Lorenzo Zanisi, Timothy Nunn, Daniel Giles, Matt Kusner, Stanislas Pamela, Marc Peter Deisenroth

2025-07-13 Proceedings of the International Conference on Machine Learning (ICML)

Calibrated Physics-Informed Uncertainty Quantification

Neural PDEs have emerged as inexpensive surrogate models for numerical PDE solvers. While they offer efficient approximations, they often lack robust uncertainty quantification (UQ), limiting their practical utility. Existing UQ methods for these models typically have high computational demands and lack guarantees. We introduce a novel framework for calibrated physics-informed uncertainty quantification to address these limitations. Our approach leverages physics residual errors as a nonconformity score within a conformal prediction (CP) framework. This enables data-free, model-agnostic, and statistically guaranteed uncertainty estimates. Our framework utilises convolutional layers as finite difference stencils for gradient estimation, our framework provides inexpensive coverage bounds for the violation of conservation laws within model predictions. In our experiments, we utilise CP to obtain marginal coverage for each cell and joint coverage over the entire prediction domain of various PDEs.

Denis Hadjivelichkov, Sicelukwanda N. T. Zwane, Marc P. Deisenroth, Lourdes Agapito, Dimitrios Kanoulas

2025-05-19 Proceedings of the International Conference on Robotics and Automation (ICRA)

Semantic Cross-Pose Correspondence from a Single Example

This article focuses on predicting how an object can be transformed to a semantically meaningful pose relative to another object, given only one or few examples. Current pose correspondence methods rely on vast 3D object datasets and do not actively consider semantic information, which limits the objects to which they can be applied. We present a novel method for learning cross-object pose correspondence. The proposed method detects interacting object parts, performs one-shot part correspondence, and uses geometric and visual-semantic features. Given one example of two objects posed relative to each other, the model can learn how to transfer the demonstrated relations to unseen object instances.

Joel Oskarsson, Tomas Landelius, Marc P. Deisenroth, Fredrik Lindsten

2024-12-11 Advances in Neural Information Processing Systems (NeurIPS)

Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks

In recent years, machine learning has established itself as a powerful tool for high-resolution weather forecasting. While most current machine learning models focus on deterministic forecasts, accurately capturing the uncertainty in the chaotic weather system calls for probabilistic modeling. We propose a probabilistic weather forecasting model called Graph-EFM, combining a flexible latent-variable formulation with the successful graph-based forecasting framework. The use of a hierarchical graph construction allows for efficient sampling of spatially coherent forecasts. Requiring only a single forward pass per time step, Graph-EFM allows for fast generation of arbitrarily large ensembles. We experiment with the model on both global and limited area forecasting. Ensemble forecasts from Graph-EFM achieve equivalent or lower errors than comparable deterministic models, with the added benefit of accurately capturing forecast uncertainty.

Jake Cunningham, Giorgio Giannone, Mingtian Zhang, Marc P. Deisenroth

2024-12-11 Advances in Neural Information Processing Systems (NeurIPS)

Reparameterized Multi-Resolution Convolutions for Long Sequence Modelling

Global convolutions have shown increasing promise as powerful general-purpose sequence models. However, training long convolutions is challenging, and kernel parameterizations must be able to learn long-range dependencies without overfitting. This work introduces reparameterized multi-resolution convolutions (MRConv), a novel approach to parameterizing global convolutional kernels for long-sequence modelling. By leveraging multi-resolution convolutions, incorporating structural reparameterization and introducing learnable kernel decay, MRConv learns expressive long-range kernels that perform well across various data modalities. Our experiments demonstrate state-of-the-art performance on the Long Range Arena, Sequential CIFAR, and Speech Commands tasks among convolution models and linear-time transformers. Moreover, we report improved performance on ImageNet classification by replacing 2D convolutions with 1D MRConv layers.

Sicelukwanda Zwane, Daniel G. Cheney, Curtis C. Johnson, Yicheng Luo, Yasemin Bekiroglu, Marc Killpack, Marc P. Deisenroth

2024-10-14 Proceedings of the International Conference on Intelligent Robots and Systems (IROS)

Learning Dynamic Tasks on a Large-scale Soft Robot in a Handful of Trials

Unlike traditional rigid robots, soft robots offer more flexibility, compliance, and adaptability. They are also typically cheaper to manufacture and are lighter than their rigid counterparts. However, due to modeling difficulties, real-world applications for soft robots are still limited. This is especially true for applications that would require dynamic or fast motion. In addition, their operating principles and compliance make integrating effective proprioceptive sensors difficult. As such, state estimation and predictions of how the state evolves in time are challenging modeling tasks. Large-scale ($≈$ two meters in length), particularly fluid-driven, soft robots have greater modeling complexity due to increased inertia and related effects of gravity. Few approaches to soft robot control (learned or model-based) have enabled dynamic motion such as throwing or hammering since most methods require limiting assumptions about the kinematics, dynamics, or actuation models to make the control problem tractable or performant. To address this issue, we propose using Bayesian optimization to learn policies for dynamic tasks on a large-scale soft robot. This approach optimizes the task objective function directly from commanded pressures, without requiring approximate kinematics or dynamics as an intermediate step. We also present simulated and real-world experiments to illustrate the efficacy of the proposed approach.

See all publications