The development of comprehensive benchmarks to assess the performance of algorithms on causal tasks is an important, emerging area. The introduction of two physical ‘causal chamber’ systems serves as a firm step towards future, more reliable benchmarks in the field.
Competing interests
The author declares no competing interests.
Cite this article
Zeitler, J. Physical benchmarks for testing algorithms. Nat Mach Intell 7, 166–167 (2025). https://doi.org/10.1038/s42256-025-00999-8
DOI: https://doi.org/10.1038/s42256-025-00999-8