[1910.09399] A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image Synthesis

Text-to-image synthesis refers to computational methods that translate human-written textual descriptions, in the form of keywords or sentences, into images with similar semantic meaning to the text. In earlier research, image synthesis relied mainly on word-to-image correlation analysis combined with supervised methods to find the best alignment of visual content matching the text. Recent progress in deep learning (DL) has brought a new set of unsupervised deep learning methods, particularly deep generative models, which are able to generate realistic visual images using suitably trained neural network models. In this paper, we review the most recent developments in the text-to-image synthesis research domain. Our survey first introduces image synthesis and its challenges, and then reviews key concepts such as generative adversarial networks (GANs) and deep convolutional encoder-decoder neural networks (DCNN). After that, we propose a taxonomy to summarize GAN-based text-to-image sy

5 mentions: @arxiv_cs_cv_pr
Date: 2019/10/26 03:49

Related Entries

Survey of Scientific Publication Analysis by NLP and CV
1 user, 4 mentions 2019/08/15 08:16
[1911.12116] Analysis of Explainers of Black Box Deep Neural Networks for Computer Vision: A Survey
0 users, 4 mentions 2019/11/28 23:20
[1809.02165] Deep Learning for Generic Object Detection: A Survey
0 users, 0 mentions 2018/09/10 03:23
A Survey on Cross-Modal Embedding
1 user, 1 mention 2019/08/15 09:46