StyleGAN, developed by NVIDIA Research, is a groundbreaking architecture for generating ultra-realistic synthetic images, especially of human faces. Its ability to control “styles” across image layers set a new standard in AI image generation and led to viral real World applications like This Person Does Not Exist. Today, it’s used across art, games, fashion, and media, with both exciting and troubling implications.
The lead authors are Tero Karras, Samuli Laine and Timo Aila. Unlike earlier GANs, which often had limited control over image attributes, StyleGAN introduced a “style-based” architecture that revolutionized image synthesis. Images are generated in a multi-scale, layered way : High-level (pose, identity, …), mid-level (features like eye shape, …), low-level (color, texture, …). A latent input vector is transformed into an intermediate latent space (W space).
The following list presents the timescale of the launch of different StyleGAN versions :
🔹 StyleGAN1 (2018)
- Introduced style-based generation
- Produced realistic but occasionally distorted faces
🔹 StyleGAN2 (2019–2020)
- Major quality improvement
- Fixed artifacts and strange features in faces (e.g., weird teeth or asymmetry)
- Used in “This Person Does Not Exist”
🔹 StyleGAN3 (2021)
- Introduced equivariance, making it better at handling rotation and translation
- Improved realism and temporal coherence (useful for video)