Demo video: output.mp4
This project was done for OpenCV under the guidance of Gary Bradski and Reza Amayeh, supported by Google Summer of Code (GSoC) 2025. It presents a methodological approach to designing, implementing, and documenting a Simultaneous Localization and Mapping (SLAM) framework from scratch in Python. The primary objective is to build a fully functional, OpenCV-based SLAM system with clear documentation to facilitate reproducibility, future extensions, and adoption by the broader computer vision community. Traditional SLAM systems that rely on hand-crafted features (e.g., ORB, SIFT) often exhibit reduced robustness under challenging conditions such as viewpoint shifts, illumination changes, or motion blur. To address these limitations, we integrate modern learned features, namely ALIKED keypoints and descriptors (Zhao et al., 2022) combined with the LightGlue matcher (Lindenberger et al., 2023), into a streamlined SLAM pipeline. This integration improves tracking stability and relocalization performance, especially in environments with significant photometric and geometric variations.
In addition, the system extends feature extraction and matching using LightGlue (Lindenberger et al., 2023) and leverages PyCeres for non-linear optimization, enabling efficient bundle adjustment and graph optimization within the pipeline.
This repository contains two Python scripts that demonstrate basic Structure-from-Motion (SfM) pipelines:
- sfm.py
- A more classical approach that uses standard OpenCV feature detectors (e.g., SIFT, ORB, AKAZE) and BFMatcher or FLANN to match keypoints between images.
- Performs pose estimation (essential matrix or PnP) and triangulation to build a sparse map of 3D points.
- Uses optional non-linear refinement via scipy’s least squares to improve the estimated camera pose.
- sfm_lightglue_aliked.py
- An enhanced pipeline that integrates neural-network-based feature extraction (ALIKED) and feature matching (LightGlue); a minimal usage sketch appears after this overview.
- Demonstrates how modern, learned feature detectors and matchers can improve keypoint reliability and reduce drift.
- Also includes the same core SfM steps (pose estimation, triangulation, optional non-linear refinement).
- Tracks a simple Absolute Trajectory Error (ATE) and accumulates a cumulative translation error for quick performance checks.
Both scripts are prototypes designed primarily for concept validation and experimentation. For real-time, production-grade use, it is recommended to offload optimization and other heavy lifting (e.g., bundle adjustment via Ceres Solver) to a C++ back end running in a more performant environment.
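To make the learned front end more concrete, the sketch below shows roughly how ALIKED and LightGlue are typically wired together with the lightglue package. It is a minimal illustration under assumed settings (the frame paths, the 2048-keypoint budget, and the device handling are placeholders), not the actual code of sfm_lightglue_aliked.py.

```python
import torch
from lightglue import ALIKED, LightGlue
from lightglue.utils import load_image, rbd

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# ALIKED extractor plus a LightGlue matcher configured for ALIKED features
extractor = ALIKED(max_num_keypoints=2048).eval().to(device)  # keypoint budget is illustrative
matcher = LightGlue(features="aliked").eval().to(device)

# Placeholder frame paths; the script obtains frames from its dataset loader
image0 = load_image("frame_000000.png").to(device)
image1 = load_image("frame_000001.png").to(device)

with torch.no_grad():
    feats0 = extractor.extract(image0)            # keypoints, descriptors, scores
    feats1 = extractor.extract(image1)
    matches01 = matcher({"image0": feats0, "image1": feats1})

# Drop the batch dimension and gather matched keypoint coordinates
feats0, feats1, matches01 = [rbd(x) for x in (feats0, feats1, matches01)]
matches = matches01["matches"]                    # (K, 2) indices into the two keypoint sets
pts0 = feats0["keypoints"][matches[:, 0]].cpu().numpy()
pts1 = feats1["keypoints"][matches[:, 1]].cpu().numpy()
```

The matched pixel coordinates pts0 and pts1 can then be fed into the same essential-matrix / PnP machinery used in the classical script.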
- Dataset Loading (KITTI, Malaga, or custom folder with images).
- Camera Calibration for loading intrinsic/extrinsic parameters.
- Feature Extraction
- sfm.py: classical (SIFT, ORB, AKAZE)
- sfm_lightglue_aliked.py: ALIKED (learned keypoints + descriptors)
- Feature Matching
- sfm.py: BFMatcher or FLANN
- sfm_lightglue_aliked.py: LightGlue (neural network-based matching)
- Motion Estimation
- 2D-2D with essential matrix.
- 2D-3D with PnP (once 3D map points are available).
- Triangulation
- Convert 2D matches into 3D points (the core steps above are sketched after this list).
- Non-linear Refinement
- Uses scipy’s Levenberg-Marquardt (least_squares) to minimize reprojection error.
- Basic Stereo Handling (KITTI, Malaga)
- Combine left and right images for better scale recovery if stereo calibration is present.
- Trajectory Evaluation
- ATE (Absolute Trajectory Error) if ground truth is available.
- A simple “cumulative translation error” measure.
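The classical steps listed above (detection, matching, 2D-2D motion estimation, triangulation) map onto standard OpenCV calls roughly as in the following sketch. The intrinsic matrix K, the frame pair, and the 0.75 ratio-test threshold are illustrative assumptions, not the repository's exact code.

```python
import cv2
import numpy as np

def relative_pose_and_points(img0, img1, K):
    """Sketch: SIFT detection, ratio-test matching, essential matrix, triangulation."""
    sift = cv2.SIFT_create()
    kp0, des0 = sift.detectAndCompute(img0, None)
    kp1, des1 = sift.detectAndCompute(img1, None)

    # Brute-force matching with Lowe's ratio test
    bf = cv2.BFMatcher(cv2.NORM_L2)
    good = []
    for pair in bf.knnMatch(des0, des1, k=2):
        if len(pair) == 2 and pair[0].distance < 0.75 * pair[1].distance:
            good.append(pair[0])
    pts0 = np.float32([kp0[m.queryIdx].pt for m in good])
    pts1 = np.float32([kp1[m.trainIdx].pt for m in good])

    # 2D-2D motion estimation: essential matrix with RANSAC, then cheirality check
    E, mask = cv2.findEssentialMat(pts0, pts1, K, method=cv2.RANSAC, threshold=1.0)
    _, R, t, mask_pose = cv2.recoverPose(E, pts0, pts1, K, mask=mask)

    # Triangulate inlier correspondences into 3D points (up to scale for monocular input)
    P0 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
    P1 = K @ np.hstack([R, t])
    inliers = mask_pose.ravel() > 0
    pts4d = cv2.triangulatePoints(P0, P1, pts0[inliers].T, pts1[inliers].T)
    pts3d = (pts4d[:3] / pts4d[3]).T
    return R, t, pts3d
```

Once a 3D map exists, subsequent frames can switch to 2D-3D estimation with cv2.solvePnPRansac instead of the essential matrix.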
- Python 3.7+
- OpenCV (>= 4.x recommended)
- NumPy
- Matplotlib (for visualization)
- scipy (for non-linear refinement)
- tqdm (for progress bars)
- PyTorch (only required for sfm_lightglue_aliked.py, if using LightGlue + ALIKED)
- lightglue (the Python package for the LightGlue matching framework)
- Clone this repository:
git clone https://github.com/your-organization/your-repo.git
- Install dependencies:
pip install -r requirements.txt
Or individually:
pip install opencv-python numpy matplotlib scipy tqdm torch # plus LightGlue if not already installed
python sfm.py --dataset kitti --data_path ./Dataset/kitti
- Arguments:
- --dataset: name of the dataset (kitti, malaga, or custom).
- --data_path: path to the dataset folder.
- Behavior:
- Loads images, performs feature detection + matching (SIFT, ORB, AKAZE), estimates camera motion, triangulates points.
- Optionally runs non-linear refinement on the pose.
- Plots or logs the results (trajectory, errors).
(Adjust arguments to match your own script’s CLI if needed.)
python sfm_lightglue_aliked.py --dataset kitti --data_path ./Dataset/kitti --use_lightglue True
- Arguments:
- --dataset: name of the dataset (kitti, malaga, or custom).
- --data_path: path to the dataset folder.
- --use_lightglue: enable or disable the ALIKED + LightGlue pipeline.
- Behavior:
- Loads images, runs ALIKED for feature extraction, and LightGlue for matching (GPU if available).
- Estimates camera motion, triangulates points, performs non-linear refinement if configured.
- Computes:
- ATE (Absolute Trajectory Error)
- A “cumulative translation error” measure
- Optionally displays debug visualizations (keypoints, matches, trajectory).
- Matplotlib windows may pop up showing:
- Keypoints and matches for consecutive frames.
- The evolution of the 3D point cloud (if any).
- The camera’s estimated trajectory vs. ground truth, if available (a minimal plotting sketch follows).
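As an illustration of the trajectory view, the sketch below compares an estimated path against ground truth with Matplotlib; the .npy file paths and the x-z plotting convention are placeholder assumptions, since the scripts plot directly from their in-memory pose histories.

```python
import matplotlib.pyplot as plt
import numpy as np

# Placeholder inputs: (N, 3) arrays of camera centers for estimate and ground truth
est_traj = np.load("estimated_trajectory.npy")
gt_traj = np.load("ground_truth_trajectory.npy")

plt.figure(figsize=(6, 6))
plt.plot(est_traj[:, 0], est_traj[:, 2], label="estimated")        # x-z plane (KITTI-style)
plt.plot(gt_traj[:, 0], gt_traj[:, 2], "--", label="ground truth")
plt.xlabel("x [m]")
plt.ylabel("z [m]")
plt.axis("equal")
plt.legend()
plt.title("Estimated trajectory vs. ground truth")
plt.show()
```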
- Modify THUMPUP_POS_THRESHOLD and THUMPUP_ROT_THRESHOLD for keyframe selection.
- Tweak the maximum keypoints or confidence in ALIKED or LightGlue for performance vs. accuracy trade-offs.
- Adjust RANSAC thresholds or non-linear refinement parameters (in the refine() method) for more robust or faster solutions; a sketch of such a refinement step follows.
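The refinement step can be written as a small reprojection-error minimization with scipy.optimize.least_squares, roughly as below. The function name refine_pose and the Rodrigues-vector parameterization are illustrative assumptions and may differ from the scripts' actual refine() implementation.

```python
import cv2
import numpy as np
from scipy.optimize import least_squares

def refine_pose(rvec, tvec, pts3d, pts2d, K):
    """Pose-only refinement sketch: minimize reprojection error over (rvec, tvec).

    pts3d: (N, 3) triangulated points, pts2d: (N, 2) observed pixels, K: 3x3 intrinsics.
    """
    def residuals(params):
        r, t = params[:3], params[3:].reshape(3, 1)
        proj, _ = cv2.projectPoints(pts3d, r, t, K, None)
        return (proj.reshape(-1, 2) - pts2d).ravel()

    x0 = np.hstack([rvec.ravel(), tvec.ravel()])
    # method="lm" is plain Levenberg-Marquardt; "trf" with a robust loss is an alternative
    result = least_squares(residuals, x0, method="lm")
    return result.x[:3], result.x[3:].reshape(3, 1)
```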
- sfm.py
- Uses OpenCV for feature detection (SIFT, ORB, or AKAZE).
- BFMatcher or FLANN for matching.
- Essential matrix / PnP for pose.
- Minimal keyframe selection logic.
- sfm_lightglue_aliked.py
- ALIKED for learned keypoints + descriptors, LightGlue for matching.
- Similar pose estimation logic (PnP, essential matrix).
- Triangulation + refinement steps are nearly the same.
- Typically yields more reliable matches and lower drift.
- Stereo logic (KITTI, Malaga) uses left/right cameras for absolute scale.
- Monocular operation is scale-ambiguous, so the recovered trajectory is defined only up to an arbitrary scale.
- Error Metrics (a minimal sketch follows this list):
- ATE: norm of the translation difference from ground truth.
- Cumulative translation error: summation of frame-by-frame translation offsets.
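Under one plausible reading of these definitions (camera centers stored as aligned (N, 3) NumPy arrays), both metrics reduce to a few lines of NumPy:

```python
import numpy as np

def ate(est_traj, gt_traj):
    """Absolute Trajectory Error: per-frame norm of the translation difference, averaged."""
    return float(np.mean(np.linalg.norm(est_traj - gt_traj, axis=1)))

def cumulative_translation_error(est_traj, gt_traj):
    """Sum of frame-by-frame translation offsets between consecutive-frame motions."""
    est_step = np.diff(est_traj, axis=0)
    gt_step = np.diff(gt_traj, axis=0)
    return float(np.sum(np.linalg.norm(est_step - gt_step, axis=1)))
```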
- Python Prototyping: Great for algorithmic experimentation but can be slower for large-scale or real-time tasks.
- Production-Grade: Offload heavy steps (bundle adjustment, large-scale optimization) to C++.
- Loop Closure & Full SLAM: These scripts focus on Visual Odometry. Future expansions may include place recognition, pose graph optimization, etc.