| Frame | With SH | Without SH | Description |
|---|---|---|---|
| 16 | ![]() |
![]() |
Instead of a single shade, the shade of this chair with spherical harmonics varies smoothly across the surface. |
| 17 | ![]() |
![]() |
The shade changes depends on the angle as well. |
All images trained for 2000 iterations
Without guidance, the optimization struggles to produce recognizable objects, while with guidance the objects match the text prompts much more accurately.
This extension implements view-dependent text conditioning. By conditioning the diffusion model on viewing direction (front, side, back, overhead), the optimization produces more 3D-consistent results.
Overall, view-dependent text conditioning is crucial for creating convincing 3D assets from text descriptions. While it may add complexity, the improvement in 3D consistency far outweighs the drawbacks.