Architectural Facade Generation Machine and Controller — Original Paper Explanation

date

type

status

slug

summary

category

icon

password

URL

July 26, 2021 • 2 min read

by Simon's Daydream, mp.weixin.qq.com • See original

In this year’s CDRF 2021 (Tongji Digital Futures 2021 Conference), I presented my paper titled "Architectural Facade Perpetual Motion Machine — Exploring Architectural Design in High-Dimensional Latent Space" (translated title, casually translated to make it sound impressive). This Chinese explanation video explores an AI method capable of generating high-confidence architectural facades and visualizing images in high-dimensional latent space based on their latent vector coordinates, along with performing semantic operations on image content using principal component analysis.

Some advanced mathematics and machine learning concepts will be discussed; feel free to ask me questions if you don’t understand (though I might not be able to explain it any clearer!). I am not from a computer science background, so if there are any mistakes, please feel free to point them out gently!

Demo

Explanation

Link to paper

1 Introduction

With the emerging of Generative Adversarial Network (GAN) based image generation methodin recent years, many attempts havebeen made to apply GAN into architectural images and drawing generation research(Goodfellowet al. 2014). However, for therealistic building façade images generation task, most attempts faced different challenges, such as quality and controllability ofgenerated image, and interpretabilityof model.

These challenges were due to various limitations, such as performance ofthe selected GAN model, the size of training dataset, the understanding oflatent space, etc. In this paper, by training the state-of-the-art GAN basedimage generation model, StyleGAN2 (Karras et al. 2020), with high-resolution building façade image dataset, andexploring its latent space by applying PCA and GANSpace analysis, we couldovercome above challenges in different extend (Härkönen et al. 2020).

In summary,the main functions and contributions of this paper are:

1. A StyleGAN2 model instance which could generate plausible buildingfaçade images without conditional input.

2. Introduce GANSpace and image embedding method to visualize the correlationbetween the generated building façade images and their corresponding latentvectors, which achievedunsupervised classification and high-level propertiescontrol of both generated and novel images.