16-825 Assignment 2: Single View to 3D
1. Exploring Loss Functions
1.1. Fitting a Voxel Grid (5 points)
| Ground Truth |
Optimized |
 |
 |
1.2. Fitting a Point Cloud (10 points)
| Ground Truth |
Optimized |
 |
 |
1.3. Fitting a Mesh (5 points)
| Ground Truth |
Optimized |
 |
 |
2. Reconstructing 3D from Single View
2.1. Image to Voxel Grid (15 points)
| Input RGB |
Ground Truth Voxel |
Predicted Voxel |
 |
 |
 |
 |
 |
 |
 |
 |
 |
2.2. Image to Point Cloud (15 points)
| Input RGB |
Ground Truth Mesh |
Ground Truth Cloud |
Predicted Cloud |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
2.3. Image to Mesh (15 points)
| Input RGB |
Ground Truth Mesh |
Predicted Mesh |
 |
 |
 |
 |
 |
 |
 |
 |
 |
2.4. Quantitative Comparisons (10 points)
Average test F1 score at 0.05 threshold for voxelgrid, pointcloud and the mesh network.
| Type |
Voxel |
Point Cloud |
Mesh |
| Avg F1@0.05 |
64.32% |
74.86% |
66.81% |
| Voxel F1 Curve |
Point Cloud F1 Curve |
Mesh F1 Curve |
 |
 |
 |
2.5. Analyze Effects of Hyperparameter Variations (10 points)
Analysis of hyperparameter effects on model performance.
| n points |
500 |
1000 |
2000 |
| Point Cloud Examples |
 |
 |
 |
| Mesh Examples |
 |
 |
 |
| Point Cloud Evaluation Curves |
 |
 |
 |
| Mesh Evaluation Curves |
 |
 |
 |
| f1@0.05 |
Point: 58.59% Mesh: 59.71% |
Point: 74.86% Mesh: 66.81% |
Point: 50.47% Mesh: 72.47% |
Analysis: Here we obsever that in case of point n_points = 1000 have the highest F1@0.05 score.
The reason although I am not sure might be because higher number of points might lead to model confused which points belongs where
creating some adversarial atttack in the training. However any increase in number of points should increase the F1@0.05 score.
since more points means better estimation of the geometry. This is the case for meshes where this is very evident to notice.
2.6. Interpret Your Model (15 points)
Simply seeing final predictions and numerical evaluations is not always insightful. Can you create some visualizations that help highlight what your learned model does? Be creative and think of what visualizations would help you gain insights.
We model the failure cases of the model by visualizing the ground truth and prediction for each type of model.
Example 1
| Label |
Voxel Example |
Mesh Example |
Point Cloud Example |
| Target |
 |
 |
 |
| Prediction |
 |
 |
 |
Example 2
| Label |
Voxel Example |
Mesh Example |
Point Cloud Example |
| Target |
 |
 |
 |
| Prediction |
 |
 |
 |
Example 3
| Label |
Voxel Example |
Mesh Example |
Point Cloud Example |
| Target |
 |
 |
 |
| Prediction |
 |
 |
 |
In all the example tthe voxels performs better as in it s able to capture the detailed structure of the object.
The mesh has the most outliers and visually looks the workst since the outliers can be attibuted dur to the
low weights of lalcaian smoother loss. However despite loooking worse it does seem to be able to capture the overall shape of the object.
Point cloud have the higher f1 score than the mesh and voxel since it is able to capture the overall shape of the object in more relaxed manner.
In example 1 we can notice that only point cloud is able to capture the legs of the object.
Similarly in example 2 we can notice that only voxel is able to capture the legs of the object.
On the other hand meshes fails to capture details and gets lower F1 score by minimizing the loss by having
outliers
3. Exploring Some Recent Architectures
3.2. Parametric Surface Reconstruction
| Input RGB |
Ground Truth Parametric |
Predicted Parametric |
 |
 |
 |
 |
 |
 |
 |
 |
 |
3.2. Parametric Evaluation
| Parametric F1 Curve |
F1@0.05 Score |
 |
64.33% |