Burn Deep Learning Ecosystem #2893

salimmghari · 2025-03-11T02:31:01Z

Feature description & motivation

(Previously posted but removed for better visibility) Rust Deep Learning has Burn, but Burn lacks in its ecosystem these four: Vision, Audio, Text, and 3D. What I suggest for us developers is to build 4 crates that will grow the Burn's ecosystem: burn-vision, burn-audio, burn-text, and finally burn-3d. This is very broad and what we should do is focus only on one by one, for now we can build the entire burn-vision crate as opposed to torchvision in PyTorch.

Feature technical details

burn-vision would provide the essential tools for deep learning in vision from the following list:

Image Transforms & Preprocessing:

Pretrained Vision Models:

Datasets & Data Loaders:

Object Detection & Segmentation Utilities:

Bounding Box Utilities – Resize, convert, visualize bounding boxes.
IoU (Intersection over Union) – Compute overlap for object detection evaluation.
Mask Transformations – Convert segmentation masks to tensors.
Keypoint Detection – Process landmark-based annotations (e.g., face keypoints).

Image I/O & Visualization:

Image Loading & Saving – Support PNG, JPEG, BMP, TIFF, etc.
Show Image Tensors – Convert tensors to displayable images.
Grid Visualization – Display multiple images in a grid format.
Draw Bounding Boxes / Masks – Overlay bounding boxes and segmentation masks.

Video Processing & Streaming Support:

Read Video Frames – Load frames from video files.
Stream Processing – Process live video frames for real-time AI applications.
Optical Flow Estimation – Track motion between frames.
Frame Extraction & Augmentation – Manipulate frames like static images.

Efficient Training Utilities:

Mixed Precision Training – Use FP16 for faster model training.
AutoML / Hyperparameter Optimization – Automated tuning for vision models.
Model Quantization – Reduce model size for deployment.
Model Pruning – Remove unnecessary connections for efficiency.

Feature Solution

Leveraging existing stuff from torchvision if allowed can be a helpful solution to complete one of the four crates: burn-vision.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Burn Deep Learning Ecosystem #2893

Burn Deep Learning Ecosystem #2893

salimmghari commented Mar 11, 2025

Burn Deep Learning Ecosystem #2893

Burn Deep Learning Ecosystem #2893

Comments

salimmghari commented Mar 11, 2025

Feature description & motivation

Feature technical details

Feature Solution