FIGURE SUMMARY
Title

Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology

Authors
Green, A.J., Mohlenkamp, M.J., Das, J., Chaudhari, M., Truong, L., Tanguay, R.L., Reif, D.M.
Source
Full text @ PLoS Comput. Biol.

Regression generator diagram.

Schematic representation of Go-ZT architecture showing chemical structural input represented as weights (wi) and views (vi) matrices passed through two fully connected neural networks to produce a predicted toxicity matrix. Darker matrix shading indicates higher toxicity values.

Schematic representation of GAN-ZT architecture showing chemical structural input represented as weights (wi) and views (vi) matrices passed through two fully connected neural networks to produce a predicted toxicity matrix. Chemical features, along with predicted or empirical toxicity matrices, are then passed to a discriminator comprising a fully connected neural network. Darker matrix shading indicates higher toxicity values.

Principal component analysis displayed against the background of over 800,000 chemicals in the Integrated Chemical Environment database. The analysis compares physicochemical properties between the training and test sets.

Schematic representation of the experimental approach for screening developmental and neurotoxicity of chemicals in larval zebrafish.

Diagram showing the vectorization of methyl isothiocyanate.

Atom information from the PDB file (shown in grey) is converted into the views and weights matrices. Columns one and two of the views space (vi) identify the chemical species and correspond to an atom's position on the periodic table, indicating its period and group, respectively, while the last three columns show the relative position of each atom. The weight space (wi) values correspond to each of the views space matrices. In the first view, C1 is set at the center, while in the second view C2 is set at the center. This molecule has nine views, which can be reduced to three views if preference is given only to carbon.
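The view construction described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name build_views, the PERIOD_GROUP lookup, and the toy coordinates are all hypothetical, and the weights matrices are omitted because the caption does not say how they are computed.

```python
import numpy as np

# Period and group for a few elements (standard periodic-table positions).
PERIOD_GROUP = {"H": (1, 1), "C": (2, 14), "N": (2, 15), "O": (2, 16), "S": (3, 16)}

def build_views(atoms, center_elements=None):
    """Return one (n_atoms x 5) view matrix per chosen center atom.

    Columns: period, group, then x, y, z relative to the center atom.
    If center_elements is given, only atoms of those elements act as centers.
    """
    species = np.array([PERIOD_GROUP[el] for el, *_ in atoms], dtype=float)
    coords = np.array([xyz for _, *xyz in atoms], dtype=float)
    views = []
    for i, (el, *_) in enumerate(atoms):
        if center_elements is not None and el not in center_elements:
            continue
        relative = coords - coords[i]  # the center atom lands at the origin
        views.append(np.hstack([species, relative]))
    return views

# Made-up coordinates for a toy three-atom fragment, for illustration only.
toy_atoms = [("C", 0.0, 0.0, 0.0), ("N", 1.5, 0.0, 0.0), ("H", -0.5, 0.9, 0.0)]
all_views = build_views(toy_atoms)                          # one view per atom
carbon_views = build_views(toy_atoms, center_elements={"C"})  # carbon-centered only
```

Restricting the centers to carbon mirrors the reduction in the number of views mentioned in the caption: fewer center atoms means fewer view matrices for the same molecule.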

Go-ZT and GAN-ZT loss functions during training.

Changes in the loss functions during training of (A) Go-ZT and (B) GAN-ZT.

Test dataset confusion matrices.

Evaluation of the classification of chemicals in the test data set as either active or inactive using real versus generated toxicity matrices by (A) Go-ZT or (B) GAN-ZT. Color scale represents percent of total chemicals.
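A confusion matrix expressed as percent of total chemicals, as in the color scale described above, could be computed along the following lines. This is a sketch only: the function confusion_percent and the labels are hypothetical, and the step that turns a toxicity matrix into an active or inactive call is assumed to have happened already.

```python
import numpy as np

def confusion_percent(true_active, predicted_active):
    """2x2 confusion matrix as percent of all chemicals.

    Rows: true inactive, true active.
    Columns: predicted inactive, predicted active.
    """
    t = np.asarray(true_active, dtype=bool)
    p = np.asarray(predicted_active, dtype=bool)
    counts = np.zeros((2, 2), dtype=float)
    for ti, pi in zip(t, p):
        counts[int(ti), int(pi)] += 1
    return 100.0 * counts / counts.sum()

# Hypothetical labels for five chemicals, not data from the paper.
true_labels = [True, False, False, True, False]
pred_labels = [True, False, True, False, False]
matrix = confusion_percent(true_labels, pred_labels)
```

Because every cell is divided by the total number of chemicals, the four cells always sum to 100.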

Model consensus on chemical activity.

(A) Venn diagram showing the overlap between true active chemicals and chemicals predicted to be active by either Go-ZT or GAN-ZT. (B) A confusion matrix showing the performance of the combined Go-ZT and GAN-ZT models using the test dataset.
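The Venn regions in panel A map directly onto set operations. The sketch below uses hypothetical chemical identifiers, and the rule for combining the two models (a chemical is called active if either model predicts it active) is an assumption made for illustration; the caption does not state how the combined model was formed.

```python
# Hypothetical chemical identifiers, not data from the paper.
true_active = {"chem_a", "chem_b", "chem_c"}
go_zt_active = {"chem_a", "chem_d"}
gan_zt_active = {"chem_a", "chem_b", "chem_e"}

# Venn regions.
all_three = true_active & go_zt_active & gan_zt_active     # true actives found by both models
both_models = go_zt_active & gan_zt_active                 # predicted active by both models
missed_by_both = true_active - (go_zt_active | gan_zt_active)

# Assumed combination rule: active if either model predicts active.
combined_active = go_zt_active | gan_zt_active
true_positives = combined_active & true_active
false_positives = combined_active - true_active
```

A stricter rule that requires both models to agree would replace the union with an intersection, trading recall for fewer false positives.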

Acknowledgments
This image is the copyrighted work of the attributed author or publisher, and ZFIN has permission only to display this image to its users. Additional permissions should be obtained from the applicable author or publisher of the image.