24–28 Aug 2026
Leiden University
Europe/Zurich timezone

Toward Robust LSST Cosmology: Data-Augmented Redshift Calibration and Efficient Marginalisation

Not scheduled
20m
Gorlaeus gebouw (Leiden University)

Gorlaeus gebouw

Leiden University

Einsteinweg 55, 2333 CC Leiden
Talk Methods / Statistical Inference / Machine Learning

Speaker

Yun-Hao Zhang (Leiden University)

Description

The unprecedented statistical power and imaging depth of the Vera C. Rubin Observatory's LSST will enable transformative tests of ΛCDM and the nature of dark matter and dark energy. Realising this potential hinges on accurately estimating ensemble redshift distributions and efficiently marginalising over their uncertainties.
I will present a new framework for photometric redshift calibration based on Self-Organising Maps (SOMs), which project the high-dimensional galaxy colour space onto a two-dimensional representation. This identifies regions poorly sampled by spectroscopy, which we then augment with synthetic galaxy catalogues to construct representative training sets. The method substantially reduces systematic biases for high-redshift galaxies, achieving sub-percent accuracy on the mean redshift for both Year 1 and Year 10 LSST configurations, while forward-modelling the dominant sources of systematic error — photometric noise, point-estimation bias, tomographic binning, and spectroscopic selection effects. Preliminary validation on early LSST DP1 data shows consistent performance, and a formal calibration analysis is in preparation for the upcoming DP2 release to support LSST anlyses on galaxy-halo connections.
Using these simulated n(z) ensembles, I show that conventional parametrisations — typically simple shifts along the redshift axis — can underestimate statistical uncertainties by up to an order of magnitude for galaxy clustering. To address this, I develop a data-driven approach based on a Variational Autoencoder (VAE) for non-linear dimensionality reduction, which outperforms standard PCA by compressing thousands of n(z) realisations into a Gaussianised latent space of substantially lower dimension. Resampling this latent space under Gaussian priors enables faithful reconstruction of the full n(z) covariance and accurate marginalisation within the cosmological likelihood.
Cosmological forecasts on synthetic LSST data show that weak-lensing constraints are robust across marginalisation choices, whereas the combination with large-scale structure reveals that conventional methods overestimate the dark energy Figure-of-Merit by 20% — underscoring the necessity of this improved framework for unbiased precision cosmology. The pipeline is being integrated into the LSST-DESC inference infrastructure for collaboration-wide application.

Authors

Dr Irene Moskowitz (Rutgers University) Yun-Hao Zhang (Leiden University)

Co-authors

Prof. Catherine Heymans (University of Edinburgh) Prof. Elisa Chisari (Utrecht University) Prof. Eric Gawiser (Rutgers University) Prof. Henk Hoekstra (Leiden University) Dr Jaime Ruiz-Zapatero (University College London) Prof. Joe Zuntz (University of Edinburgh) Prof. Konrad Kuijken (Leiden University) Prof. Marika Asgari (Newcastle University) Dr Shun-Sheng Li (Stanford University) Dr Tianqing Zhang (University of Pittsburgh) Dr Ziang Yan (Nagoya University)

Presentation materials

There are no materials yet.