Perception Box

Perception Box Overview

The Perception Box is an integrated framework for real-time visual-inertial SLAM, 3D semantic mapping, and indoor navigation — designed to make advanced perception algorithms as easy to use as a Python API call.

Complex spatial understanding tasks often require robotics developers and researchers to manually install, build, and configure multiple libraries and algorithms from scratch. The Perception Box removes this barrier by packaging 3D mapping, semantic segmentation, and navigation algorithms into an accessible, modular system with a simple, consistent interface.

This project lowers the entry barrier for deploying modern robotic perception capabilities on embedded systems like the NVIDIA Jetson, as well as standard Ubuntu desktops. By exposing a clear XML-RPC and Python API, it allows anyone to run live SLAM, stream labeled 3D maps, and control mapping tasks remotely — without the overhead of deep low-level integration.

Acknowledgments

We extend our gratitude to our mentors and advisors for their guidance and support:

Professor Kris Hauser
João Marques

This project is built as a fork of the stella-vslam-examples repository, with contributions from members of the IML Research Team:

Aaditya Voruganti (GitHub)
Vallabh Nadgir (GitHub)

Features

Multi-Camera Support: Monocular, and RGB-D cameras.
Flexible Input: Supports video files and live camera feeds.
Pangolin Viewer: OpenGL-based viewer for real-time visualization.
Optimized for Embedded Systems: Suitable for NVIDIA Jetson platforms.

Prerequisites

Operating System: Ubuntu 18.04 or 20.04 (for NVIDIA Jetson)
Compiler: GCC with C++11 support
CMake: Version 3.5 or higher
Git

Installation

1. Install Dependencies

Open a terminal and execute the following commands:

sudo apt update
sudo apt upgrade -y --no-install-recommends

# Basic dependencies
sudo apt install -y build-essential pkg-config cmake git wget curl unzip

# g2o dependencies
sudo apt install -y libatlas-base-dev libsuitesparse-dev

# OpenCV dependencies
sudo apt install -y libgtk-3-dev ffmpeg libavcodec-dev libavformat-dev \
libavutil-dev libswscale-dev libavresample-dev libtbb-dev

# Eigen dependencies
sudo apt install -y gfortran

# backward-cpp dependencies (optional but recommended)
sudo apt install -y binutils-dev

# Other dependencies
sudo apt install -y libyaml-cpp-dev libgflags-dev sqlite3 libsqlite3-dev

# Pangolin dependencies
sudo apt install -y libglew-dev

2. Install Required Libraries

Install Eigen

cd ~/Downloads
wget -q https://gitlab.com/libeigen/eigen/-/archive/3.3.7/eigen-3.3.7.tar.bz2
tar xf eigen-3.3.7.tar.bz2
cd eigen-3.3.7
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=/usr/local ..
make -j$(nproc)
sudo make install

Install OpenCV

cd ~/Downloads
# Download OpenCV source
wget -q https://github.com/opencv/opencv/archive/4.5.5.zip
unzip -q 4.5.5.zip
cd opencv-4.5.5
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_INSTALL_PREFIX=/usr/local \
      -DBUILD_DOCS=OFF \
      -DBUILD_EXAMPLES=OFF \
      -DBUILD_TESTS=OFF \
      -DBUILD_PERF_TESTS=OFF \
      -DBUILD_opencv_python_bindings_generator=OFF \
      -DWITH_TBB=ON \
      -DWITH_OPENMP=ON \
      -DWITH_FFMPEG=ON \
      -DWITH_GTK=ON \
      -DWITH_V4L=ON \
      -DWITH_OPENGL=ON \
      -DWITH_GSTREAMER=ON \
      -DENABLE_FAST_MATH=ON \
      -DWITH_CUDA=ON \
      ..
make -j$(nproc)
sudo make install

Install FBoW

cd ~/Downloads
git clone https://github.com/stella-cv/FBoW.git
cd FBoW
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=/usr/local ..
make -j$(nproc)
sudo make install

Install g2o

cd ~/Downloads
git clone https://github.com/RainerKuemmerle/g2o.git
cd g2o
git checkout 20230223_git
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_INSTALL_PREFIX=/usr/local \
      -DBUILD_SHARED_LIBS=ON \
      -DBUILD_UNITTESTS=OFF \
      -DG2O_USE_CHOLMOD=OFF \
      -DG2O_USE_CSPARSE=ON \
      -DG2O_USE_OPENGL=OFF \
      -DG2O_USE_OPENMP=OFF \
      -DG2O_BUILD_APPS=OFF \
      -DG2O_BUILD_EXAMPLES=OFF \
      -DG2O_BUILD_LINKED_APPS=OFF \
      ..
make -j$(nproc)
sudo make install

Install backward-cpp (Optional but Recommended)

cd ~/Downloads
git clone https://github.com/bombela/backward-cpp.git
cd backward-cpp
git checkout 5ffb2c879ebdbea3bdb8477c671e32b1c984beaa
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=/usr/local ..
make -j$(nproc)
sudo make install

Install Pangolin

cd ~/Downloads
git clone https://github.com/stevenlovegrove/Pangolin.git
cd Pangolin
git checkout eab3d3449a33a042b1ee7225e1b8b593b1b21e3e
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_INSTALL_PREFIX=/usr/local \
      -DBUILD_EXAMPLES=OFF \
      -DBUILD_PANGOLIN_PYTHON=OFF \
      ..
make -j$(nproc)
sudo make install

3. Build and Install Stella-VSLAM

mkdir -p ~/stella_ws/src
cd ~/stella_ws/src
git clone --recursive https://github.com/stella-cv/stella_vslam.git
cd stella_vslam
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=RelWithDebInfo ..
make -j$(nproc)
sudo make install

4. Build and Install Pangolin Viewer

cd ~/stella_ws/src
git clone --recursive https://github.com/stella-cv/pangolin_viewer.git
cd pangolin_viewer
mkdir build && cd build
cmake -DCMAKE_BUILD_TYPE=RelWithDebInfo ..
make -j$(nproc)
sudo make install

5. Build Perception Box

git clone https://github.com/uiuc-iml/Perception-Box.git
cd Perception-Box
mkdir build
cd build
cmake ..

Download the Vocab file from: Vocab file

Open3D: Jetson fork

Bash Installation

cd ~/Downloads
git clone "https://github.com/uiuc-iml/Perception-Box.git"
chmod +x install_stella_vslam.sh
./install_stella_vslam.sh

Configurator

In python helpers, there is a configurator for the zed and realsense camera - Usage

Run the Flask App

Access the Web App Open your browser and go to http://127.0.0.1:5000.

Configure Camera Settings

Select the camera type (ZED or RealSense).
Choose resolution, frame rate, and other camera options.
Set the socket address and port.

Generate YAML

Click the Generate YAML button.
Download the generated camera_config.yaml file.

Install requirements for the mapping module

cd Perception-Box
pip install -r requirements.txt

Run Mapping

On the client device, create an XML-RPC client using the perception box's local network IP. Enter the mapping directory of the Perception-Box folder and run the mapping server on the perception box:

python testserver.py

Next, run the perception box using the script given below. Then, use the available APIs to start, pause, and end mapping from the client side. Use the get_metric_map and get_semantic_map APIs to get the map over the XML-RPC interface. Look at the documentation for more information. See example_client.py for inspiration. Note: Be sure to change the IP to your perception box's IP.

Run Perception Box

Example:

./run_camera_slam --vocab /home/perception/lib/stella_vslam_examples/build/orb_vocab.fbow --config /home/perception/lib/stella_vslam_examples/build/realsense.yaml --number 4 --viewer pangolin_viewer

Note: Adjust the paths based on the local configuration of your system

Example sequence of commands

First run the mapping box (We could make a bash script that does this on start everytime). Then, use the above bash script to start SLAM (We could make it so that it executes on start_task()). Then, call start_mapping() to start mapping (from the client). Finally, use the APIs to get maps and call stop_mapping() and stop_task() when finished.

Name		Name	Last commit message	Last commit date
Latest commit History 134 Commits
.github/workflows		.github/workflows
3rd		3rd
Kimera-VIO		Kimera-VIO
Yaml-files		Yaml-files
build		build
deps		deps
mapping		mapping
onnx_model_transfer		onnx_model_transfer
python_helpers		python_helpers
src		src
vocab		vocab
.clang-format		.clang-format
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
camera_config.yaml		camera_config.yaml
example_client.py		example_client.py
install_stella_vslam.sh		install_stella_vslam.sh
installation.txt		installation.txt
perception_box.py		perception_box.py
requirements.txt		requirements.txt
socket_test.py		socket_test.py
testopen3d.py		testopen3d.py
zedpy.py		zedpy.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Perception Box

Perception Box Overview

Table of Contents

Ver 1

Ver 2

Acknowledgments

Features

Prerequisites

Installation

1. Install Dependencies

2. Install Required Libraries

3. Build and Install Stella-VSLAM

4. Build and Install Pangolin Viewer

5. Build Perception Box

Bash Installation

Configurator

Install requirements for the mapping module

Run Mapping

Run Perception Box

Example sequence of commands

About

Uh oh!

Releases

Packages

Languages

License

uiuc-iml/Robotic-Perception-Box

Folders and files

Latest commit

History

Repository files navigation

Perception Box

Perception Box Overview

Table of Contents

Ver 1

Ver 2

Acknowledgments

Features

Prerequisites

Installation

1. Install Dependencies

2. Install Required Libraries

3. Build and Install Stella-VSLAM

4. Build and Install Pangolin Viewer

5. Build Perception Box

Bash Installation

Configurator

Install requirements for the mapping module

Run Mapping

Run Perception Box

Example sequence of commands

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages