NeRF: Neural Radiance Fields for 3D Scene Reconstruction

Overview

This project implements Neural Radiance Fields (NeRF), a deep learning approach for synthesizing novel views of complex 3D scenes from a sparse set of 2D images. The implementation uses PyTorch and demonstrates how implicit neural representations combined with volume rendering can achieve photorealistic view synthesis.

Key Features:

Fully-connected MLP architecture with positional encoding for high-frequency detail capture
Differentiable volume rendering pipeline with stratified ray sampling
Training and evaluation on the Blender synthetic dataset
Quantitative evaluation using PSNR, SSIM, and LPIPS metrics
Modular, research-oriented codebase with Hydra configuration management

Method

NeRF represents a 3D scene as a continuous 5D function that maps spatial coordinates (x, y, z) and viewing direction (θ, φ) to color (RGB) and volume density (σ). The core components include:
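Written out in the notation of Mildenhall et al. (the symbols F_Θ, c, d, o, t_n, and t_f come from the paper, not necessarily from this repository's code), the scene function and the color it produces along a camera ray r(t) = o + t d are:

```latex
F_\Theta : (\mathbf{x}, \mathbf{d}) \mapsto (\mathbf{c}, \sigma),
\qquad \mathbf{x} = (x, y, z), \quad \mathbf{d} = \mathbf{d}(\theta, \phi)

C(\mathbf{r}) = \int_{t_n}^{t_f} T(t)\, \sigma(\mathbf{r}(t))\, \mathbf{c}(\mathbf{r}(t), \mathbf{d})\, dt,
\qquad
T(t) = \exp\!\left( -\int_{t_n}^{t} \sigma(\mathbf{r}(s))\, ds \right)
```

Here T(t) is the accumulated transmittance, the probability that the ray travels from the near bound t_n to t without being absorbed. In the paper's formulation, density σ depends only on position, while color c also depends on the viewing direction.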

Architecture

MLP Network: Multi-layer perceptron with skip connections mapping encoded positions and directions to density and RGB values
Positional Encoding: Sinusoidal encoding of input coordinates to enable learning of high-frequency scene details
Volume Rendering: Numerical integration along camera rays to compute pixel colors
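As an illustration of the positional encoding step, here is a minimal sketch in plain Python (no PyTorch, so it runs anywhere). The function name `positional_encoding` and the `include_input` flag are assumptions made for this example and may not match the code in signal_encoder/. The frequencies follow the paper's formula sin(2^k π p), cos(2^k π p); some implementations drop the factor of π.

```python
import math
from typing import List, Sequence


def positional_encoding(coords: Sequence[float], num_freqs: int,
                        include_input: bool = True) -> List[float]:
    """Map each coordinate p to sin(2^k * pi * p) and cos(2^k * pi * p) for k < num_freqs."""
    # Optionally keep the raw coordinates in front of the sinusoidal features.
    encoded: List[float] = list(coords) if include_input else []
    for k in range(num_freqs):
        freq = (2.0 ** k) * math.pi
        encoded.extend(math.sin(freq * p) for p in coords)
        encoded.extend(math.cos(freq * p) for p in coords)
    return encoded


# L=10 for positions and L=4 for directions, as listed under Implementation Details.
position_features = positional_encoding([0.1, -0.4, 0.7], num_freqs=10)
direction_features = positional_encoding([0.0, 0.0, 1.0], num_freqs=4)
print(len(position_features), len(direction_features))  # 63 27
```

With the raw input kept, a 3D vector becomes 3 + 3 × 2 × L features, which gives 63 for L=10 and 27 for L=4.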

Training Pipeline

Generate camera rays from known camera poses and intrinsics
Sample 3D points along each ray using stratified sampling
Query the MLP network to predict density and color at each point
Integrate along rays using classical volume rendering equations
Compute photometric loss against ground-truth images
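The steps above can be sketched for a single ray in plain Python. This is a toy walk-through under stated assumptions, not the repository's renderer: `toy_field` is a hypothetical stand-in for the MLP query, and all function names are invented for this example. The compositing follows the standard quadrature, where each sample contributes T_i · (1 − exp(−σ_i δ_i)) · c_i and T_i is the product of (1 − α_j) over earlier samples.

```python
import math
import random
from typing import List, Sequence, Tuple

Color = Tuple[float, float, float]


def stratified_samples(near: float, far: float, num_samples: int,
                       rng: random.Random) -> List[float]:
    """Draw one uniformly random depth from each of num_samples equal-width bins."""
    bin_width = (far - near) / num_samples
    return [near + (i + rng.random()) * bin_width for i in range(num_samples)]


def toy_field(depth: float) -> Tuple[float, Color]:
    """Hypothetical stand-in for the MLP: a dense red slab between depths 3 and 4."""
    if 3.0 <= depth <= 4.0:
        return 50.0, (1.0, 0.0, 0.0)
    return 0.0, (0.0, 0.0, 0.0)


def composite(depths: Sequence[float], densities: Sequence[float],
              colors: Sequence[Color]) -> Tuple[List[float], List[float]]:
    """Alpha-composite per-sample densities and colors into one pixel color."""
    pixel = [0.0, 0.0, 0.0]
    weights: List[float] = []
    transmittance = 1.0  # fraction of light that has survived so far
    for i, (sigma, rgb) in enumerate(zip(densities, colors)):
        # Distance to the next sample; the last interval is treated as unbounded.
        delta = depths[i + 1] - depths[i] if i + 1 < len(depths) else 1e10
        alpha = 1.0 - math.exp(-sigma * delta)
        weight = transmittance * alpha
        weights.append(weight)
        pixel = [p + weight * c for p, c in zip(pixel, rgb)]
        transmittance *= 1.0 - alpha
    return pixel, weights


def photometric_loss(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between a rendered and a ground-truth pixel."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


rng = random.Random(0)
depths = stratified_samples(near=2.0, far=6.0, num_samples=64, rng=rng)
densities, colors = zip(*(toy_field(t) for t in depths))
pixel, weights = composite(depths, densities, colors)
loss = photometric_loss(pixel, (1.0, 0.0, 0.0))
print(pixel, loss)
```

Because the slab is nearly opaque, the rendered pixel comes out almost pure red and the loss against a red target is close to zero. In actual training the densities and colors come from the network, and the loss is averaged over a batch of rays and backpropagated through the compositing step.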

Results

Trained and evaluated on the Lego scene of the Blender synthetic dataset:

Metric	Value
PSNR ↑	TBD
SSIM ↑	TBD
LPIPS ↓	TBD

Note: These values are placeholders; run the evaluation script (see Evaluation) to populate them.

The model successfully reconstructs 3D geometry and appearance, enabling photorealistic novel view synthesis as shown in the visualization above.

Repository Structure

torch_nerf/
├── configs/              # Hydra configuration files
├── runners/              # Training, rendering, and evaluation scripts
└── src/
    ├── cameras/          # Camera models and ray generation
    ├── network/          # NeRF MLP implementation
    ├── renderer/         # Ray sampling and volume rendering
    ├── scene/            # Scene representation and dataset loaders
    ├── signal_encoder/   # Positional encoding implementations
    └── utils/            # Utility functions and helpers

Setup

Environment

Create a conda environment with Python 3.8:

conda create --name nerf python=3.8
conda activate nerf

Dependencies

Install PyTorch with CUDA support:

pip install torch==1.10.1+cu113 torchvision==0.11.2+cu113 torchaudio==0.10.1 -f https://download.pytorch.org/whl/cu113/torch_stable.html

Install additional requirements:

pip install -r requirements.txt
pip install "torchmetrics[image]"
pip install tensorboard

Set the Python path:

export PYTHONPATH=.

Dataset

Download the NeRF Blender synthetic dataset:

# Download and extract to data/nerf_synthetic/
wget http://cseweb.ucsd.edu/~viscomp/projects/LF/papers/ECCV20/nerf/nerf_synthetic.zip
unzip nerf_synthetic.zip -d data/

The directory structure should look like:

data/nerf_synthetic/
└── lego/
    ├── train/
    ├── val/
    ├── test/
    └── transforms_*.json
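Each transforms_*.json file in the released dataset stores a horizontal field of view (`camera_angle_x`) and, per frame, an image path and a 4×4 camera-to-world matrix (`transform_matrix`). The sketch below shows how a focal length and a camera ray can be derived from those fields. It is an illustration only: the embedded JSON is a hand-written sample with an identity rotation, the 800×800 image size is assumed from the released renders, and `focal_length` and `pixel_ray` are hypothetical helpers, not functions from cameras/.

```python
import json
import math

# Hand-written sample in the layout of transforms_train.json (values are illustrative).
SAMPLE_META = json.loads("""
{
  "camera_angle_x": 0.6911112070083618,
  "frames": [
    {"file_path": "./train/r_0",
     "transform_matrix": [[1.0, 0.0, 0.0, 0.0],
                          [0.0, 1.0, 0.0, 0.0],
                          [0.0, 0.0, 1.0, 4.0],
                          [0.0, 0.0, 0.0, 1.0]]}
  ]
}
""")


def focal_length(camera_angle_x: float, image_width: int) -> float:
    """Pinhole focal length in pixels from the horizontal field of view in radians."""
    return 0.5 * image_width / math.tan(0.5 * camera_angle_x)


def pixel_ray(col, row, width, height, focal, cam_to_world):
    """Return (origin, direction) in world space; the camera looks down its -z axis."""
    d_cam = ((col - 0.5 * width) / focal, -(row - 0.5 * height) / focal, -1.0)
    # Rotate the camera-space direction by the upper-left 3x3 block.
    direction = tuple(sum(cam_to_world[r][c] * d_cam[c] for c in range(3))
                      for r in range(3))
    # The camera position is the translation column.
    origin = tuple(cam_to_world[r][3] for r in range(3))
    return origin, direction


width = height = 800  # assumed image size
focal = focal_length(SAMPLE_META["camera_angle_x"], width)
pose = SAMPLE_META["frames"][0]["transform_matrix"]
origin, direction = pixel_ray(400, 400, width, height, focal, pose)
print(focal, origin, direction)
```

The direction is left unnormalized here; whether to normalize before encoding is an implementation choice.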

Usage

Training

Train NeRF on the lego scene:

python torch_nerf/runners/train.py

Training outputs (checkpoints, logs, visualizations) are saved under outputs/, where Hydra creates a separate directory for each run. Monitor training progress with TensorBoard:

tensorboard --logdir outputs/

Configuration: Modify training parameters in configs/ or override via command line:

python torch_nerf/runners/train.py data.scene=lego train.batch_size=1024 train.lr=5e-4

Rendering

Render novel views from a trained model:

# Render spiral path
python torch_nerf/runners/render.py +log_dir=outputs/<experiment_dir> +render_test_views=False

# Render test set views
python torch_nerf/runners/render.py +log_dir=outputs/<experiment_dir> +render_test_views=True

Evaluation

Compute quantitative metrics on the test set:

python torch_nerf/runners/evaluate.py <rendered_test_dir> data/nerf_synthetic/lego/test

This outputs PSNR, SSIM, and LPIPS scores comparing rendered images to ground truth.
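For reference, PSNR is a direct function of the mean squared error between a rendered image and its ground truth. The snippet below is a definition-level sketch in plain Python over flattened pixel values in [0, 1]; the evaluation script itself may compute the metric differently (for example through torchmetrics), and the function name `psnr` is an assumption for this example.

```python
import math
from typing import Sequence


def psnr(pred: Sequence[float], target: Sequence[float], max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB for two equally sized pixel sequences."""
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)
    if mse == 0.0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# A uniform error of 0.1 per pixel gives an MSE of 0.01, i.e. about 20 dB.
print(psnr([0.5, 0.5, 0.5, 0.5], [0.6, 0.6, 0.6, 0.6]))
```

Higher PSNR means lower pixel-wise error. SSIM and LPIPS measure structural and learned perceptual similarity respectively, and have no comparably short closed form.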

Implementation Details

Key technical components implemented:

MLP Architecture: 8-layer fully-connected network with a skip connection at layer 4
Positional Encoding: Frequency encoding with L=10 for positions, L=4 for directions
Stratified Sampling: 64 coarse samples per ray, one drawn uniformly at random from each of 64 evenly spaced depth bins
Volume Rendering: Numerical quadrature using alpha compositing
Hierarchical Sampling: Two-stage coarse-to-fine strategy that adds 128 fine samples per ray, drawn by importance sampling from the compositing weights of the coarse pass
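The fine stage of the two-stage sampling can be illustrated with inverse-transform sampling over a piecewise-constant distribution. The sketch below is a simplified plain-Python version under stated assumptions: `sample_pdf` is a name chosen for this example, the bins are given directly as edges, and the weights are made-up numbers rather than output from a coarse pass.

```python
import bisect
import random
from typing import List, Sequence


def sample_pdf(bin_edges: Sequence[float], weights: Sequence[float],
               num_samples: int, rng: random.Random) -> List[float]:
    """Draw depths from the piecewise-constant PDF that `weights` defines over the bins."""
    eps = 1e-5  # keeps every bin's probability positive, even if all weights are zero
    total = sum(w + eps for w in weights)
    cdf = [0.0]
    for w in weights:
        cdf.append(cdf[-1] + (w + eps) / total)
    samples = []
    for _ in range(num_samples):
        u = rng.random()
        # Locate the bin whose CDF interval contains u (clamped for rounding error).
        idx = min(bisect.bisect_right(cdf, u) - 1, len(weights) - 1)
        # Interpolate linearly inside that bin.
        t = min((u - cdf[idx]) / (cdf[idx + 1] - cdf[idx]), 1.0)
        samples.append(bin_edges[idx] + t * (bin_edges[idx + 1] - bin_edges[idx]))
    return sorted(samples)


# Four depth bins over [2, 6]; most of the coarse weight sits in the bin [3, 4).
edges = [2.0, 3.0, 4.0, 5.0, 6.0]
coarse_weights = [0.0, 0.9, 0.1, 0.0]
fine_depths = sample_pdf(edges, coarse_weights, num_samples=128, rng=random.Random(0))
in_heavy_bin = sum(1 for t in fine_depths if 3.0 <= t < 4.0)
print(len(fine_depths), in_heavy_bin)
```

Most fine samples land where the coarse pass found visible content, so network capacity is spent near surfaces rather than in empty space. In the original paper, the fine network is then evaluated on the union of the coarse and fine samples.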

References

Mildenhall et al., "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis," ECCV 2020
Original NeRF Paper
NeRF Project Page

Acknowledgments

This implementation was developed as part of exploring 3D machine learning and neural rendering techniques. The codebase structure follows modern research practices with modular components and reproducible experiment management.

License

MIT License - feel free to use this code for research and educational purposes.
