GeoDiffusion

A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation

🎨 IEEE/CVF International Conference on Computer Vision (ICCV) 2025

Overview

Diffusion models generate stunning images β€” but controlling their 3D geometry usually requires costly retraining. What if you could get precise geometric control without any training at all?

GeoDiffusion is a training-free framework that injects accurate 3D geometric conditioning into image generation. It enables precise control over scene geometry while maintaining high image quality β€” no fine-tuning, no additional training, just plug and play.

Key Contributions

  • πŸ†“ Training-free β€” no fine-tuning or retraining needed to add geometric control
  • πŸ“ Accurate 3D conditioning β€” precise control over the 3D geometry of generated scenes
  • 🎨 High image quality β€” maintains generation fidelity while enforcing geometric constraints
  • ⚑ Efficient and accessible β€” easily integrable into existing diffusion pipelines

Why It Matters

Geometric consistency is critical for applications like autonomous driving simulation and robotics. GeoDiffusion makes precise 3D-aware image generation accessible to everyone β€” without the cost of retraining large diffusion models.

(Mueller et al., 2025)

References

2025

  1. GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation
    Phillip Mueller, Talip Uenlue, Sebastian Schmidt, and 4 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2025