Journal article icon

Journal article

Revisiting U-Net: a foundational backbone for modern generative AI

Abstract:
This survey explores the evolution and application of U-Net in generative AI, highlighting its success across various modalities, including image, text, audio, video, 3D, and pose/action generation. Initially designed for biomedical segmentation, U-Net has been adapted and enhanced with architectural innovations such as normalization techniques, self and cross-attention mechanisms, and residual connections. These advancements have made U-Net a powerful backbone for modern generative models in diffusion-based frameworks, GANs, and autoregressive architectures. The survey comprehensively reviews U-Net’s modality-specific applications, from high-resolution image synthesis and text-to-image generation to speech enhancement, video generation, 3D reconstruction, and pose/action generation. Despite its widespread success, U-Net faces challenges in computational efficiency, contextual understanding, and scalability for multimodal tasks. Future directions focus on optimizing U-Net for lightweight and real-time applications, enhancing its contextual awareness, and improving its integration with emerging architectures like transformers and diffusion models.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1007/s10462-025-11450-0

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Sub department:
Computer Science
Role:
Author


Publisher:
Springer
Journal:
Artificial Intelligence Review More from this journal
Volume:
59
Issue:
2
Article number:
45
Publication date:
2025-11-24
Acceptance date:
2025-11-11
DOI:
EISSN:
1573-7462
ISSN:
0269-2821


Language:
English
Keywords:
UUID:
uuid_acc434c6-60f6-4369-a94a-a38f142df9c3
Source identifiers:
3638841
Deposit date:
2026-01-07
ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP