JoPano: Unified Panorama Generation via Joint Modeling

Wancheng Feng1,3*   Chen An1,2*   Zhenliang He1✉
Meina Kan1,2   Shiguang Shan1,2   Lukun Wang3
1State Key Lab of AI Safety, Institute of Computing Technology, CAS, China
2University of Chinese Academy of Sciences (CAS), China
3Shandong University of Science and Technology, China

* Equal Contribution

Abstract

Panorama generation has recently attracted growing interest in the research community, centered on two core tasks: text-to-panorama and view-to-panorama generation. However, existing methods still face two major challenges: their U-Net-based architectures constrain the visual quality of the generated panoramas, and they usually treat the two core tasks independently, which leads to modeling redundancy and inefficiency. To overcome these challenges, we propose a joint-face panorama (JoPano) generation approach that unifies the two core tasks within a single model based on a Diffusion Transformer (DiT). To transfer the rich generative capabilities that existing DiT backbones have learned from natural images to the panorama domain, we propose a Joint-Face Adapter built on the cubemap representation of panoramas, which enables a pretrained DiT to jointly model and generate the different views of a panorama. We further apply Poisson Blending to reduce the seam inconsistencies that often appear at the boundaries between cube faces. Correspondingly, we introduce the Seam-SSIM and Seam-Sobel metrics to quantitatively evaluate seam consistency. Moreover, we propose a condition switching mechanism that unifies the text-to-panorama and view-to-panorama tasks within a single model. Comprehensive experiments show that JoPano generates high-quality panoramas for both tasks, achieving state-of-the-art performance on the FID, CLIP-FID, IS, and CLIP-Score metrics.
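For readers unfamiliar with the cubemap representation mentioned in the abstract, the sketch below shows the standard geometric idea: each viewing direction on the sphere falls onto exactly one of six 90° cube faces, and the seams discussed above lie where two faces meet. This is only an illustration, not code from JoPano. The function names, the face labels, and the axis convention (+z front, +x right, +y up) are assumptions chosen for the example, and the in-face coordinates are left unoriented for brevity.

```python
import math


def direction_from_equirect(u, v):
    """Map normalized equirectangular coords (u, v in [0, 1]) to a unit vector.

    Assumed convention (not from the paper): u = 0.5 looks along +z (front),
    v = 0 is straight up (+y), v = 1 is straight down (-y).
    """
    lon = (u - 0.5) * 2.0 * math.pi  # longitude in [-pi, pi]
    lat = (0.5 - v) * math.pi        # latitude in [-pi/2, pi/2]
    x = math.cos(lat) * math.sin(lon)
    y = math.sin(lat)
    z = math.cos(lat) * math.cos(lon)
    return x, y, z


def cube_face(x, y, z):
    """Return (face, a, b) for a direction vector.

    The face is picked by the dominant axis; (a, b) are the two remaining
    components divided by the dominant magnitude, so both lie in [-1, 1].
    Face labels are hypothetical names used only in this sketch.
    """
    ax, ay, az = abs(x), abs(y), abs(z)
    if az >= ax and az >= ay:
        return ("front" if z > 0 else "back", x / az, y / az)
    if ax >= ay:
        return ("right" if x > 0 else "left", z / ax, y / ax)
    return ("up" if y > 0 else "down", x / ay, z / ay)


if __name__ == "__main__":
    # Sample a few equirectangular positions and report the face they hit.
    for u, v in [(0.5, 0.5), (0.75, 0.5), (0.25, 0.5), (0.5, 0.0)]:
        face, a, b = cube_face(*direction_from_equirect(u, v))
        print(f"u={u:.2f} v={v:.2f} -> {face} (a={a:+.2f}, b={b:+.2f})")
```

Directions for which two components are nearly equal in magnitude sit close to a face boundary, which is where seam inconsistencies between independently rendered faces would show up.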

Overview of JoPano panorama generation

Results

Text-to-Panorama

View-to-Panorama

BibTeX

@article{JoPano2025,
  title={JoPano: Unified Panorama Generation via Joint Modeling},
  author={Wancheng Feng and Chen An and Zhenliang He and Meina Kan and Shiguang Shan and Lukun Wang},
  journal={arXiv preprint arXiv:2512.06885},
  year={2025}
}