16-825 Assignment 2: Single View to 3D

2

2.1

2.2

2.3

2.4

Voxel Grid - Avg F1@0.05: 34.675
Point Cloud - Avg F1@0.05: 81.987  
Mesh - Avg F1@0.05: 70.931
                

Point clouds achieve highest F1 scores due to their ability to capture fine geometric details. Voxel grids suffer from low resolution limitations and blocky representations that poorly approximate curved surfaces. Meshes provide intermediate performance by maintaining surface topology but face challenges with complex geometry reconstruction from single views, and require more training iteration than point clouds. We use same iteration number for all three's training process
                

2.5

n_points = 2000 vs n_points = 5000

Higher point density (5000 points) significantly improves reconstruction quality by capturing finer geometric details and reducing sparsity artifacts. However, the computational cost increases quadratically with point count, making 2000 points much faster, which can be used for early testing.

2.6

Side-by-side overlay comparison with synchronized rotation: Left (Red) - Predicted point cloud, Right (Blue) - Ground truth point cloud. This visualization enables direct geometric comparison and reveals reconstruction accuracy patterns across different viewpoints.
                

3

3.3

Single Class Training (n_points=2000) - Avg F1@0.05: 81.987
Three Class Training (n_points=5000) - Avg F1@0.05: 89.876

Training on extended dataset (chair, car, plane) with higher point density improves average F1 score by 7.889 points. I am not sure whether the improvement is due to increased data diversity across multiple object categories. But it also shows that higher point resolution (5000 vs 2000 points) can result in better performance.
                

Assignment 2

1

1.1

1.2

1.3

2

2.1

2.2

2.3

2.4

2.5

2.6

3

3.3