Page 309 - Artificial Intelligence in the Age of Neural Networks and Brain Computing
P. 309

302    CHAPTER 15 Evolving Deep Neural Networks













































                         FIGURE 15.3
                         The best-performing LSTM variant after 25 generations of neuroevolution. It includes a
                         novel skip connection between the two memory cells, resulting in 5% improvement over
                         the vanilla LSTM baseline. Such improvements are difficult to discover by hand;
                         CoDeepNEAT with LSTM-specific mutation searches for them automatically.


                         5. APPLICATION CASE STUDY: IMAGE CAPTIONING FOR THE
                            BLIND
                         In a real-world case study, the vision and language capabilities of CoDeepNEAT
                         were combined to build a real-time online image captioning system. In this applica-
                         tion, CoDeepNEAT searches for architectures that learn to integrate image and text
                         representations to produce captions that blind users can access through existing
                         screen readers. This application was implemented for a major online magazine web-
                         site. Evolved networks were trained with the open source MSCOCO image
                         captioning dataset [46], along with a new dataset collected for this website.
   304   305   306   307   308   309   310   311   312   313   314