[1612.03242] StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

Synthesizing high-quality images from text descriptions is a challenging problem in computer vision and has many practical applications. Samples generated by existing text-to-image approaches can roughly reflect the meaning of the given descriptions, but they fail to contain necessary details and vivid object parts. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) to generate 256x256 photo-realistic images conditioned on text descriptions. We decompose the hard problem into more manageable sub-problems through a sketch-refinement process. The Stage-I GAN sketches the primitive shape and colors of the object based on the given text description, yielding Stage-I low-resolution images. The Stage-II GAN takes Stage-I results and text descriptions as inputs, and generates high-resolution images with photo-realistic details. It is able to rectify defects in Stage-I results and add compelling details with the refinement process. To improve the diversity of the synt...

1 mentions: @shion_honda
Date: 2019/04/23 12:47

Referring Tweets

@shion_honda StackGAN [Zhang+, 2017, ICCV] 説明文から256*256pxの対応画像を生成するStackGANを提案。2段構成で、1段目では粗い画像を生成し、2段目では画像の精細化を担う。データ拡張手法として、説明文の埋め込みに摂動を加えるConditioning Augmentationも提案。 https://t.co/iz5khvxagH #NowReading https://t.co/m69u8yaX5E

Bookmark Comments

id:stealthinu StackGANすごい件の論文。普通にGANで出したもやっとした画像と生成に使った文言とでさらに2段めのGAN通してぱきっとしたきれいな画像だす。すげえ。これいろんな応用思いうかぶよね。
id:masatoi 噂のStackGANの論文。あとで読む。
id:iR3 凄い!文章から画像を作成できるのか!

Related Entries

Read more Goodfellow先生おすすめのGAN論文6つを紹介
Read more Deep Learningを用いた教師なし画像検査の論文調査 GAN/SVM/Autoencoderとか .pdf
Read more GAN(と強化学習との関係)
Read more 敵対的生成ネットワーク(GAN)
Read more Icml2018読み会_overview&GANs