1-Dimensional
Machine Learning
Secondary anisotropies
Galaxy formation
Intrinsic alignments
DESI / SphereX
Euclid / LSST
SO / CMB-S4
Ligo / Einstein
xAstrophysics
HERA / CHIME
SAGA / MANGA
Galaxy formation
Hosts
Reionization
Cosmic Microwave Background
Galaxies / Dwarfs
21 cm
Galaxy Surveys
Gravitational Lensing
Gravitational Waves
AGN Feedback/Supernovae
"Better inference methods = Better Data"
"Cosmology needs Astrophysics
Astrophysics needs Cosmology"
GANS
Deep Belief Networks
2006
VAEs
Normalising Flows
BigGAN
Diffusion Models
2014
2017
2019
2022
A folk music band of anthropomorphic autumn leaves playing bluegrass instruments
Contrastive Learning
2023
2026
"Write a C compiler"
AGI?
["Genie 2: A large-scale foundation model" Parker-Holder et al (2024)]
Probabilistic ML has made high dimensional inference tractable
1024x1024xTime
["Genie 3: A new frontier for world models" Parker-Holder et al (2025)]
Goal: Estimate unknown p(x1) from samples
Base
Target
Transport Map
Base
Data
"Creating noise from data is easy; creating data from noise is generative modeling."
(Yang Song)
Neural Network
Transport Map
Continuity Equation
Interpolant
Base
Data
Neural Network
1) Training
2) Inference
Estimated from samples
(Implicit Likelihood)
A digital twin of our Universe
Observed Galaxy Distribution
Simulated Galaxy Distribution
Field Level Inference
Forward Model
(= no Cosmic Variance)
Optimal constraints
Counts-in-cell
Do we really need to infer 10^9 parameters to constrain ~10?
Compression
Marginal Likelihood
Initial Conditions
["Simulation-Based Emulators for Galaxy Clustering in the Era of Stage-IV Surveys: I. Two-Point Statistics and Beyond" Paillas et al (include CCL) 2026]
Reconstructing ALL latent variables:
Dark Matter distribution
Entire formation history
Peculiar velocities
Predictive Cross Validation:
Cross-Correlation with other probes without Cosmic Variance
[Image Credit: Yuuki Omori]
Constraining Inflation:
Inferring primordial non-gaussianity
Data-driven Subgrid models / Data-driven Systematics
"Joint cosmological parameter inference and initial condition reconstruction with Stochastic Interpolants"
Cuesta-Lazaro, Bayer, Albergo et al
NeurIPs ML4PS 2024 Particle Mesh
Dark Matter Only
Gaussian Likelihood
1) Likelihood not necessarily Gaussian
2) Forward model no need differentiable
3) Amortized
Generative Model: Marginalizing over ICs
Generative Model: Fixing ICs
HMC: Marginalizing over ICs
True
Reconstructed
SBI
HMC
Cross Correlation Coefficient
Scaling up in volume
DESI Y1 LRG Effective volumes already larger than our sims!
Small Scale Galaxy Bias
Selection
Fibre collisions
Forward Modelling the Survey Systematics
EFT
Adapted from arXiv:1804.03097
Symmetries
Connected to Underlying Physics
Hydro sims
Empirical
Halo Occupation Distribution (HOD)
EFT bias expansion
Matter Density
Galaxy Distribution
Large Scale
True
Reconstructed
Power Spectrum
Cross Correlation
Peculiar Velocities
True
Reconstructed
Matter Density
Galaxy Distribution
Dimensions + Symmetries
Rotational invariance
(+ Galilean inv)
Equivalence Principle
"Large Scale Galaxy Bias"
Desjacques, Jeong, SchmidtSimulated Galaxies
EFT Field Level Fit
Fit:
?
["Full-shape analysis with simulation-based priors: Constraints on single field inflation from BOSS" Ivanov, Cuesta-Lazaro et al 2025]
40% Improvement!
x2 survey volume
BOSS + Conservative Priors
BOSS + Simulation Based Priors
Simulation Based Priors
Adapted from arXiv:1804.03097
Symmetries
Connected to Underlying Physics
Hydro sims
Empirical
Halo Occupation Distribution (HOD)
EFT bias expansion
Matter Density
Galaxy Distribution
Self-Consistent Predictions across observables
X-Ray
Cluster gas mass fractions
Cluster gas density profiles
Sunyaev-Zeldovich
Galaxy Properties
Thermal Integrated electron pressure (hot electrons / big objects)
Star formation + histories
Stellar mass / halo mass relation
FRBs
Integrated electron density
Kinetic Integrated electron density x peculiar velocity
["BaryonBridge: Interpolants models for fast hydrodynamical simulations" Horowitz, Cuesta-Lazaro, Yehia ML4Astro workshop 2025]Particle Mesh for Gravity
CAMELS Volumes
1000 boxes with varying cosmology and feedback models
Gas Properties
Current model optimised for Lyman Alpha forest
7 GPU minutes for a 50 Mpc simulation
130 million CPU core hours for TNG50
Density
Temperature
Galaxy Distribution
["BaryonBridge: Interpolants models for fast hydrodynamical simulations" Horowitz, Cuesta-Lazaro, Yehia ML4Astro workshop 2025]Variations in Subgrid Physics
Volume Upscaling
[Video credit: Francisco Villaescusa-Navarro]
Gas density
Gas temperature
Subgrid model 1
Subgrid model 2
Subgrid model 3
Subgrid model 4
Gas
Galaxies
Dark Matter
Baryonic fields
Marginalize over a broader set of subgrid physics
Interpolate between simulators
Mingshau Liu
(Ming)
Constrain z via multi-wavelength observations
Trained on:
TNG, SIMBA, Astrid, EAGLE
Encoder
1) Encoder
Gas
Galaxies
Dark Matter
Baryonic fields
2) Probabilistic Decoder
Dark Matter
Baryonic fields
(Test suite)
Gas Density
Temperature
Astrid
EAGLE
https://parti.research.google
A portrait photo of a kangaroo wearing an orange hoodie and blue sunglasses standing on the grass in front of the Sydney Opera House holding a sign on the chest that says Welcome Friends!
Bag of verifiable tasks
Policy (LLM)
Verifiable Reward
Expected Returns
["DeepSeek-R1" Guo et al 2025 arXiv:2501.12948]Coding competitions
Should Academia give up on training LLMs?
Should we design our own RL environments?
Should we think about the most ambitious projects we could tackle with a "country of geniuses in a data center"?
A radical change to how we work or just highlighting what was obviously wrong?
["DESI 2024 VI: Cosmological Constraints from the Measurements of Baryon Acoustic Oscillations" arXiv:2404.03002]
Dark Energy is constant over time
["An LLM-driven framework for cosmological
model-building and exploration" Mudur, Cuesta-Lazaro, Toomey ]
Propose a model for Dark Energy
Implement it in a Cosmology simulation code: CLASS
Test fit to DESI Observations
Iterate to improve fit
Quintessence, DE/DM interactions....
Must pass a set of general tests for "reasonable" models
Ideally, compare evidence to LCDM.
For now, Bayesian Information Criteria (BIC)
1
2
Nayantara Mudur (Harvard)
Thawing Quintessence
Axion-like Early Dark Energy
Ultra-light scalar field that temporarily acts as dark energy in the early universe
Implementation Challenge:
Dynamic dark energy model: scalar field transitions from "frozen" (cosmological constant-like) to evolving as the universe expands.
Oscillatory behaviour
Can take advantage of existing scalar field implementations in CLASS
+ 43,000 lines of C code
+ 10,000 lines of numerical files
CLASS Challenge:
1) Code compiles + passes unit tests (reasonable observables, numerical convergence...)
2) Implementation agrees with target repository
3) Goodness of fit for DESI + Supernovae
4) H0 tension metrics
Curated
1 page long description of model to be implemented, CLASS tips + very explicit units
Paper
Directly from a full paper
If fails, get feedback from another LLM