25th IEEE Real Time Conference - La Biodola, Elba, Italy

Name: 25th IEEE Real Time Conference - La Biodola, Elba, Italy
Start: 2026-05-25T01:00:00+02:00
End: 2026-05-29T23:59:00+02:00
Location: La Biodola - Isola d'Elba (Italy)

25–29 May 2026

Europe/Rome timezone

NB: The submission deadline for the Student Paper Awards is Monday, 11 May.

General Inquiries

rt2026@pi.infn.it

52 Design-Space Exploration and Integer Quantization of Graph Neural Networks for Real-Time FPGA Track Finding

27 May 2026, 10:22

Maria Luisa Room (Hotel Hermitage)

Mini Oral | AI, Machine Learning, Real Time Simulation, Intelligent Signal Processing | Mini Orals

Speaker: Andrea Cardini (Universidad de Oviedo)

Real-time track finding for displaced-muon signatures in the CMS Level-1 trigger must operate under strict fixed-latency constraints (12.5~$\mu$s) while processing high-throughput detector data. Graph neural networks (GNNs) provide a natural representation of sparse, irregular detector geometries; however, mapping message-passing models to FPGAs requires careful co-optimization of numerical formats, architectural parameters, and high-level synthesis (HLS) microarchitecture.

We present an end-to-end workflow bridging GNN training and FPGA prototyping for a GraphSAGE-based model targeting real-time inference. The pipeline integrates: (i) automated design-space exploration across model dimensions, fixed-point precision, and HLS parameters to expose accuracy--latency--resource trade-offs; (ii) an integer-only INT8 implementation with data-driven bit-width optimization, reducing accumulator and scaling widths while preserving numerical correctness; and (iii) modular C++ kernels synthesized with Vitis HLS and validated through bit-exact C-simulation against Python integer references.
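As an illustration of items (ii) and (iii), the following is a minimal sketch of what a Python integer reference for one GraphSAGE-style layer might look like. It is not the authors' code: the function names, the sum aggregator, the multiplier-and-shift rescaling, and the toy graph are all assumptions made for this example. The accumulator width is derived from the values actually observed, which is one plausible reading of "data-driven bit-width optimization".

```python
# Hypothetical integer-only reference for one GraphSAGE-style layer.
# All names, shapes, and the sum aggregator are illustrative assumptions.

def signed_bits(lo, hi):
    """Smallest two's-complement width that holds every value in [lo, hi]."""
    bits = 1
    while not (-(1 << (bits - 1)) <= lo and hi <= (1 << (bits - 1)) - 1):
        bits += 1
    return bits

def quantize_multiplier(real_scale, shift=15):
    """Approximate a real scale in (0, 1) by an integer multiplier and a right shift."""
    return round(real_scale * (1 << shift)), shift

def requantize(acc, mult, shift):
    """Rescale an accumulator to INT8 with round-half-up and saturation."""
    val = (acc * mult + (1 << (shift - 1))) >> shift  # arithmetic shift on Python ints
    return max(-128, min(127, val))

def sage_layer_int(x, neighbors, w_self, w_neigh, mult, shift):
    """out[v] = requant(W_self x[v] + W_neigh * sum of neighbor features), integers only."""
    out, acc_min, acc_max = [], 0, 0
    for v, nbrs in enumerate(neighbors):
        # Sum-aggregate neighbor features per input channel.
        agg = [sum(x[u][k] for u in nbrs) for k in range(len(x[v]))]
        row = []
        for j in range(len(w_self)):
            acc = sum(ws * xv for ws, xv in zip(w_self[j], x[v]))
            acc += sum(wn * a for wn, a in zip(w_neigh[j], agg))
            # Track the observed accumulator range to size the hardware accumulator.
            acc_min, acc_max = min(acc_min, acc), max(acc_max, acc)
            row.append(requantize(acc, mult, shift))
        out.append(row)
    return out, signed_bits(acc_min, acc_max)

# Toy 3-node graph with 2 input and 2 output channels (made-up values).
x = [[10, -20], [30, 40], [-50, 60]]
neighbors = [[1, 2], [0], [0, 1]]
w_self = [[2, -1], [1, 3]]
w_neigh = [[1, 1], [-2, 1]]
mult, shift = quantize_multiplier(0.25)
out, acc_bits = sage_layer_int(x, neighbors, w_self, w_neigh, mult, shift)
print(out, acc_bits)
```

Because every operation is an integer multiply, add, shift, or clamp, a C++ kernel using the same arithmetic can be compared against such a reference value by value, which is the sense of "bit-exact" used above. The observed accumulator range is typically much narrower than the worst-case bound for INT8 products, which is where the width savings would come from.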

Preliminary validation on the Cora benchmark demonstrates that post-training quantization preserves model accuracy within 0.1\% of the floating-point baseline, while enabling substantial reductions in memory footprint and arithmetic complexity. Bit-exact agreement between software and hardware models is achieved using optimized fixed-point scaling. Quantization-aware training and physics-driven datasets for displaced-muon reconstruction are currently under development.
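To make the post-training quantization step concrete, here is a minimal sketch of symmetric per-tensor INT8 quantization. The abstract does not specify the calibration scheme, so the max-absolute-value scale, the helper names, and the weight values below are assumptions chosen for illustration; the sketch does not reproduce the reported Cora result.

```python
# Hypothetical symmetric per-tensor post-training quantization (illustrative only).

def calibrate_scale(values, n_bits=8):
    """Choose a scale that maps the largest |value| to the largest positive code."""
    qmax = (1 << (n_bits - 1)) - 1
    max_abs = max(abs(v) for v in values)
    return max_abs / qmax if max_abs > 0 else 1.0

def quantize(values, scale, n_bits=8):
    """Round to the nearest integer code and saturate to the signed range."""
    qmax = (1 << (n_bits - 1)) - 1
    return [max(-qmax - 1, min(qmax, round(v / scale))) for v in values]

def dequantize(codes, scale):
    """Map integer codes back to approximate real values."""
    return [c * scale for c in codes]

weights = [0.5, -1.27, 0.031, 0.9, -0.64]  # made-up float weights
scale = calibrate_scale(weights)
codes = quantize(weights, scale)
recon = dequantize(codes, scale)
max_err = max(abs(w - r) for w, r in zip(weights, recon))
print(codes, max_err)
```

With round-to-nearest and no saturation, the reconstruction error per value is bounded by half the scale. If the floating-point baseline stores 32-bit values (an assumption here), 8-bit codes reduce parameter storage by a factor of four.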

This work establishes a reproducible methodology for deploying message-passing GNNs on FPGAs under strict real-time constraints, providing a concrete path toward fixed-latency GNN-based track reconstruction in the CMS trigger system.

Minioral	Yes
IEEE Member	No
Are you a student?	No

Authors: Andrea Cardini (Universidad de Oviedo); Mr Pelayo Leguina (Universidad de Oviedo (ES)); Santiago Folgueras (Universidad de Oviedo (ES))
