[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
-
Updated
Apr 30, 2025 - Python
[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
The PyVisionAI Official Repo
A hands-on collection of computer vision projects for everyone.
This is an official repository for "Harnessing Vision Models for Time Series Analysis: A Survey".
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and performance.
we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI
Enhance your skills in prompt engineering for vision models. Learn to effectively prompt, fine-tune, and track experiments for models like SAM, OWL-ViT, and Stable Diffusion 2.0 to achieve precise image generation, segmentation, and object detection.
Implementation of Midas from [Towards Robust Monocular Depth Estimation] in Pytorch and Zeta
An implementation of gated MLPs in tinygrad, as an alternative to transformers.
DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid architecture that combines diffusion-based and autoregressive approaches for text generation.
PoC Code for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models
In This repo i FineTuned a Pretrained ResNet18 model from PyTorch library
A framework to compute threshold sensitivity of deep networks to visual stimuli.
An awesome list of "small but mighty" models and resources.
These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.
Diffusion Models crash course with Pytorch from DeepLearningAI
Testing the Moondream tiny vision model
Vision-based swarms in the Presence of Occlusions
Add a description, image, and links to the vision-models topic page so that developers can more easily learn about it.
To associate your repository with the vision-models topic, visit your repo's landing page and select "manage topics."