30 November 2025 to 5 December 2025
Building 40
Australia/Sydney timezone
AIP Summer Meeting 2025 - University of Wollongong

Accelerating physical systems with quantized utility agents

3 Dec 2025, 12:30
15m
Hope Theatre (Building 40)

Hope Theatre

Building 40

University of Wollongong Northfields Avenue Wollongong NSW 2522
Contributed Oral Industrial Applied Physics Industrial Applied Physics

Speaker

Arindam Saha (The Australian National University)

Description

The automation of complex experimental systems has been a long-standing goal with numerous real-world applications. While reinforcement learning (RL) has achieved breakthroughs across many domains, prior approaches to physical system control required crafting realistic simulations, complex reward engineering, and extensive training times, limiting practical deployment.

Saha et.al. (2025) introduce a “quantized utility agent” (AQUA), a novel framework that aims to simplify RL for autonomous control in complex physical systems. It declutters the learning process, allowing operational control agents to be pre-trained offline using existing datasets, boosting sample efficiency significantly.

The framework employs generative "world models" that extract meaningful representations from noisy experimental data, encoding inputs and outputs into a compressed feature space that captures process states without explicit temporal ordering. During deployment, AQUA executes efficient searches in this compressed space to imagine outcomes and plan optimal actions through population refinement techniques.

Demonstrated performance across complex optical [1] and quantum systems, AQUA performs optimally out-of-the-box, when deployed on physical systems, adapts to changing conditions, fine-tunes online and surpasses human-level performance.

Emerging quantum computing, communications, sensing and space technologies are carefully assembled, collection of inter-dependent components. As the complexity of these systems grows, maintaining optimal performance manually becomes a major bottleneck, especially for remotely deployed systems. AQUA’s task-agnostic, pre-trainable and adaptable design could effectively streamline these processes, allowing them to scale.

Besides saving hours of human labour, the method’s implied ability to learn features from raw experimental data could, in principle, be used to study the underlying properties of the physical system that are hard to describe in theory, potentially bridging the gap [2].

References:

  1. A. Saha, et. al., Automating experimental optics with
    sample-efficient machine learning methods, Optica 12, 1304-1310
    (2025)
  2. S.J. Wetzel, et.al., Interpretable Machine Learning in Physics: A
    Review, arXiv:2503.23616 (2025)

Author

Arindam Saha (The Australian National University)

Presentation materials

There are no materials yet.