Fig. 9
From: Large-scale evaluation of k-fold cross-validation ensembles for uncertainty estimation

Overview of the ensemble sizes at which saturation was reached for each dataset, shown as box-whisker plots for predictive performance (a) and UQ performance (b). The box-whisker plots are sorted by descending median of the predictive performance.