Text2City: One-Stage Text-Driven Urban Layout Regeneration

Authors

  • Yiming Qin Shanghai Jiao Tong University City University of Hong Kong
  • Nanxuan Zhao Adobe Research
  • Bin Sheng Shanghai Jiao Tong University
  • Rynson W.H. Lau City University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v38i5.28257

Keywords:

CV: Applications

Abstract

Regenerating urban layout is an essential process for urban regeneration. In this paper, we propose a new task called text-driven urban layout regeneration, which provides an intuitive input modal - text - for users to specify the regeneration, instead of designing complex rules. Given the target region to be regenerated, we propose a one-stage text-driven urban layout regeneration model, Text2City, to jointly and progressively regenerate the urban layout (i.e., road and building layouts) based on textual layout descriptions and surrounding context (i.e., urban layouts and functions of the surrounding regions). Text2City first extracts road and building attributes from the textual layout description to guide the regeneration. It includes a novel one-stage joint regenerator network based on the conditioned denoising diffusion probabilistic models (DDPMs) and prior knowledge exchange. To harmonize the regenerated layouts through joint optimization, we propose the interactive & enhanced guidance module for self-enhancement and prior knowledge exchange between road and building layouts during the regeneration. We also design a series of constraints from attribute-, geometry- and pixel-levels to ensure rational urban layout generation. To train our model, we build a large-scale dataset containing urban layouts and layout descriptions, covering 147K regions. Qualitative and quantitative evaluations show that our proposed method outperforms the baseline methods in regenerating desirable urban layouts that meet the textual descriptions.

Published

2024-03-24

How to Cite

Qin, Y., Zhao, N., Sheng, B., & Lau, R. W. (2024). Text2City: One-Stage Text-Driven Urban Layout Regeneration. Proceedings of the AAAI Conference on Artificial Intelligence, 38(5), 4578-4586. https://doi.org/10.1609/aaai.v38i5.28257

Issue

Section

AAAI Technical Track on Computer Vision IV