Bibliography#

DAL22

DALL·E 2 Research Preview Update. May 2022. URL: https://openai.com/blog/dall-e-2-update/ (visited on 2022-06-25).

DPC+21

Boris Dayma, Suraj Patil, Pedro Cuenca, Khalid Saifullah, Tanishq Abraham, Phúc Lê Kha\u ć, Luke Melas, and Ritobrata Ghosh. DALL·E Mini. July 2021. URL: https://github.com/borisdayma/dalle-mini, doi:10.5281/zenodo.5146400.

DN21

Prafulla Dhariwal and Alex Nichol. Diffusion Models Beat GANs on Image Synthesis. June 2021. URL: http://arxiv.org/abs/2105.05233 (visited on 2022-06-07), arXiv:2105.05233.

HJA20

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising Diffusion Probabilistic Models. December 2020. URL: http://arxiv.org/abs/2006.11239 (visited on 2022-06-07), arXiv:2006.11239.

HS21

Jonathan Ho and Tim Salimans. Classifier-Free Diffusion Guidance. In NeurIPS 2021 Workshop on Deep Generative Models and Downstream Applications. November 2021. URL: https://openreview.net/forum?id=qw8AKxfYbI (visited on 2022-06-26).

MAB+22

Pamela Mishkin, Lama Ahmad, Miles Brundage, Gretchen Krueger, and Girish Sastry. DALL·E 2 Preview - Risks and Limitations. 2022. URL: https://github.com/openai/dalle-2-preview/blob/main/system-card.md.

NDR+22

Alex Nichol, Prafulla Dhariwal, Aditya Ramesh, Pranav Shyam, Pamela Mishkin, Bob McGrew, Ilya Sutskever, and Mark Chen. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models. March 2022. URL: http://arxiv.org/abs/2112.10741 (visited on 2022-06-07), arXiv:2112.10741.

RKH+21

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. Learning Transferable Visual Models From Natural Language Supervision. February 2021. URL: http://arxiv.org/abs/2103.00020 (visited on 2022-06-07), arXiv:2103.00020.

Ram

Aditya Ramesh. How DALL·E 2 Works. URL: http://adityaramesh.com/posts/dalle2/dalle2.html (visited on 2022-06-21).

RDN+22

Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, and Mark Chen. Hierarchical Text-Conditional Image Generation with CLIP Latents. April 2022. URL: http://arxiv.org/abs/2204.06125 (visited on 2022-06-09), arXiv:2204.06125, doi:10.48550/arXiv.2204.06125.

RPG+21

Aditya Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, and Ilya Sutskever. Zero-Shot Text-to-Image Generation. February 2021. URL: http://arxiv.org/abs/2102.12092 (visited on 2022-06-12), arXiv:2102.12092, doi:10.48550/arXiv.2102.12092.

SCS+22

Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J. Fleet, and Mohammad Norouzi. Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding. May 2022. URL: http://arxiv.org/abs/2205.11487 (visited on 2022-06-12), arXiv:2205.11487, doi:10.48550/arXiv.2205.11487.