ZFIN Image: Segebarth et al., 2020, Figure 5

Image

Figure Caption

Figure 5—figure supplement 1. (A) Schematic overview depicting three initialization variants for creating consensus ensembles on new datasets. Data annotation by multiple human experts and subsequent ground truth estimation are required for all three initialization variants. In the from scratch variant, a U-Net model with random initialized weights is trained on pairs of microscopy images and estimated ground truth annotations. This variant was used to create consensus ensembles for the initial Lab-Wue1 dataset. Alternatively, the same training dataset can be used to adapt a U-Net model with pretrained weights by means of transfer-learning (fine-tuned). In both variants, models are evaluated and selected on base of a validation set after model training. In a third variant, U-Net models with pretrained weights can be evaluated directly on a validation dataset, without further training (frozen). In all three variants, consensus ensembles of the respective models are then used for bioimage analysis. (B) Overall reliability of bioimage analysis results of each variant assessed as variation per effect. In all three strategies, consensus ensembles (orange) showed lower standard deviations than consensus models (blue). The frozen results need to be considered with caution as they are based on models that did not meet the selection criterion (see Figure 5—source data 3). N_{pairwise comparisons} = 6; N_{consensus models} = 15, and N_{consensus ensembles} = 3 for each variant. (C–E) Detailed comparison of the two external datasets with highest (Lab-Mue) and lowest (Lab-Wue2) similarity to Lab-Wue1. (C) Representative microscopy images. Orange: representative annotations of a lab-specific from scratch consensus ensemble. PVT: para-ventricular nucleus of thalamus, eRet: early retrieval, lRet: late retrieval, HB: hindbrain, wt: wildtype, kd: gad1b knock-down. Scale bars: Lab-Mue 100 µm and Lab-Wue2 6 µm. (D) Mean M_{F1 score} of from scratch (solid line) and fine-tuned (dashed line) consensus models on the validation dataset over the course of training (iterations). Mean M_{F1 score} of frozen consensus models are indicated with arrows. Box plots show the M_{F1 score} among the annotations of human experts as reference and the mean M_{F1 score} of selected consensus models. Two dotted horizontal lines mark the whisker ends of the M_{F1 score} among the human expert annotations. (E) Effect sizes of all individual bioimage analyses (black: manual experts, blue: consensus models, orange: consensus ensembles). Three horizontal lines separate the significance intervals (n.s.: not significant, *: 0.05≥ p>0.01, **0.01≥ p>0.001, ***p ≤ 0.001 with Mann-Whitney-U tests). Lab-Mue: N_{consensus ensembles} = 3 for all initialization variants; N_{from scratch/fine-tuned consensus models} = 12 (for each ensemble, 4/5 trained models per ensemble met the selection criterion), N_{frozen consensus models} = 12 (for each ensemble, 4/4 models per ensemble did not meet the selection criterion). N_eRet = 4, N_lRet = 4; n_eRet = 12, n_lRet = 11. Lab-Wue2: N_{consensus ensembles} = 3 for each initialization variant; N_{from scratch/fine-tuned consensus models} = 15 (for each ensemble, 5/5 trained models per ensemble met the selection criterion), N_{frozen consensus models} = 12 (for each ensemble, 4/4 models per ensemble did not meet the selection criterion). N_wt = 5, N_kd = 4, n_wt = 20, n_kd = 15. Source files of all statistical analyses (including Figure 5—figure supplement 2 and Figure 5—figure supplement 1) are available in Figure 5—source data 1. Information on all bioimage datasets (e.g. the number of images, image resolution, imaging techniques, etc.) are available in Figure 5—source data 2. Source files on model performance and selection are available in (Figure 5—source data 3).

Acknowledgments

This image is the copyrighted work of the attributed author or publisher, and ZFIN has permission only to display this image to its users. Additional permissions should be obtained from the applicable author or publisher of the image. Full text @ Elife