PUBLICATION

A deep learning approach for staging embryonic tissue isolates with small data

Authors

Pond, A.J.R., Hwang, S., Verd, B., Steventon, B.

ID

ZDB-PUB-210109-27

Date

2021

Source

PLoS One 16: e0244151 (Journal)

Registered Authors

Keywords

none

MeSH Terms

Animals
Deep Learning*
Embryo, Nonmammalian/metabolism*
Embryo, Nonmammalian/pathology
Gene Expression
Image Processing, Computer-Assisted
Microscopy, Confocal
Tail/metabolism
Tail/pathology
Zebrafish/growth & development
Zebrafish/metabolism*

PubMed

33417603 Full text @ PLoS One

Abstract

Machine learning approaches are becoming increasingly widespread and are now present in most areas of research. Their recent surge can be explained in part due to our ability to generate and store enormous amounts of data with which to train these models. The requirement for large training sets is also responsible for limiting further potential applications of machine learning, particularly in fields where data tend to be scarce such as developmental biology. However, recent research seems to indicate that machine learning and Big Data can sometimes be decoupled to train models with modest amounts of data. In this work we set out to train a CNN-based classifier to stage zebrafish tail buds at four different stages of development using small information-rich data sets. Our results show that two and three dimensional convolutional neural networks can be trained to stage developing zebrafish tail buds based on both morphological and gene expression confocal microscopy images, achieving in each case up to 100% test accuracy scores. Importantly, we show that high accuracy can be achieved with data set sizes of under 100 images, much smaller than the typical training set size for a convolutional neural net. Furthermore, our classifier shows that it is possible to stage isolated embryonic structures without the need to refer to classic developmental landmarks in the whole embryo, which will be particularly useful to stage 3D culture in vitro systems such as organoids. We hope that this work will provide a proof of principle that will help dispel the myth that large data set sizes are always required to train CNNs, and encourage researchers in fields where data are scarce to also apply ML approaches.

Genes / Markers

Figures

Show all Figures

Expression

Phenotype

Mutations / Transgenics

Human Disease / Model

Sequence Targeting Reagents

Fish

Antibodies

Orthology

Engineered Foreign Genes

Mapping

Pond et al., 2021 ZDB-PUB-210109-27 PMID:33417603

A deep learning approach for staging embryonic tissue isolates with small data

Pond et al., 2021

ZDB-PUB-210109-27
PMID:33417603