
Submission: In your webpage, include the following details:
- Learning rates that you used for each parameter. If you had experimented with multiple sets of learning rates, just mention the set that obtains the best performance in the next question.
- Number of iterations that you trained the model for.
- The PSNR and SSIM.
- Both the GIFs output by
train.py.

Submission: In your webpage, include the following details:
- Attach the GIF you obtained using
render.pyfor questions 1.3.1 (this question) and 1.1.5 (older question).- Attach 2 or 3 side by side RGB image comparisons of the renderings obtained from both the cases. The images that are being compared should correspond to the same view/frame.
- For each of the side by side comparisons that are attached, provide some explanation of differences (if any) that you notice.

without spherical harmonics
with spherical harmonics

The details and shades are slightly more refined with spherical harmonics.
without spherical harmonics
with spherical harmonics

The render with spherical harmonics have more realistic shading and finer details, whereas the one without retains the same shading pattern similar to the previous image, despite having different view angles.
without spherical harmonics
with spherical harmonics

Both cases appear quite blurry and show little noticeable difference.
"a hamburger"
Without guidance (400 iterations) With guidance (700 iterations; guidance scale=15)
"a standing corgi dog"
Without guidance (2000 iterations) With guidance (1600 iterations; guidance scale=10)
"a flying cucumber"
Without guidance (2000 iterations) With guidance (1600 iterations; guidance scale=10)
"a renaissance painting"
Without guidance (2000 iterations) With guidance (2000 iterations; guidance scale=100)
"a black and white cow" (guidance=15) "a cow with zebra stripes" (guidance=100)
RGB depth
RGB depth
RGB depth
Before incorporating view dependence, the corgi has two faces, which is a common artifact caused by the Stable Diffusion SDS loss lacking sufficient context to distinguish between front and side views. Introducing view dependence resolved this problem.
- rgb
![]()
![]()
"Without view-dependent" "With view-dependent
"Without view-dependent" "With view-dependent
In this case, incorporating view dependence didn’t alleviate the multi-face artifact. The monitor still resembles a lamp, because none of the viewpoints sufficiently capture its flat surface.
- rgb
![]()
![]()
"Without view-dependent" "With view-dependent
"Without view-dependent" "With view-dependent