The lack of generalizability and reproducibility of machine learning models in medical applications is increasingly recognized as a substantial barrier to implementing such approaches in real-world clinical settings. Highlighting this issue, Jie Cao et al. aim to reproduce a recent acute kidney injury prediction model, and find persistent discrepancies in model performance in different subgroups.
- Jie Cao
- Xiaosong Zhang
- Karandeep Singh