LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
Augmented-Waste-Classifier-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into waste categories using the SiglipForImageClassification architecture.
Facial-Emotion-Detection-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify facial images into emotion categories using the SiglipForImageClassification architecture.
Age-Classification-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to predict the age group of a person from an image using the SiglipForImageClassification architecture.
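Checkpoints like this one share a common inference recipe: load the fine-tuned weights with SiglipForImageClassification and the matching image processor, take the argmax over the logits, and map the index back to a label. A minimal sketch with Hugging Face transformers (the repo ID below is a placeholder standing in for whichever checkpoint you want to run):

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, SiglipForImageClassification

# Placeholder repo ID: replace with the actual Hub checkpoint you want to run.
model_id = "your-org/Age-Classification-SigLIP2"

processor = AutoImageProcessor.from_pretrained(model_id)
model = SiglipForImageClassification.from_pretrained(model_id)
model.eval()

# Preprocess a single image and run a forward pass without gradients.
image = Image.open("photo.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

# Map the highest-scoring class index back to its label string.
predicted_id = logits.argmax(-1).item()
print(model.config.id2label[predicted_id])
```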
A SigLIP2-based image classification model designed to classify handwritten digits (0-9).
Fashion-Mnist-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into Fashion-MNIST categories using the SiglipForImageClassification architecture.
Traffic-Density-Classification is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into traffic density categories using the SiglipForImageClassification architecture.
Multisource-121-DomainNet is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into 121 domain categories using the SiglipForImageClassification architecture.
Gender-Classifier-Mini is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images based on gender using the SiglipForImageClassification architecture.
Deepfake vs Real is a dataset designed for image classification, distinguishing between deepfake and real images.
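A dataset like this can be pulled straight from the Hub with the datasets library for fine-tuning or evaluation. A small sketch, assuming the dataset is hosted on the Hugging Face Hub with an image column and an integer label column (the repo ID and split name are placeholders):

```python
from datasets import load_dataset

# Placeholder repo ID and split; substitute the actual Hub dataset.
dataset = load_dataset("your-org/deepfake-vs-real", split="train")

# Each example is assumed to hold a PIL image and an integer label (deepfake vs. real).
example = dataset[0]
print(example["image"].size, example["label"])
```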
Clipart-126-DomainNet is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify clipart images into 126 domain categories using the SiglipForImageClassification architecture.
Human-vs-NonHuman-Detection is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images as either human or non-human using the SiglipForImageClassification architecture.
Bird-Species-Classifier-526 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify bird images into 526 species categories using the SiglipForImageClassification architecture.
Painting-126-DomainNet is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify paintings into 126 domain categories using the SiglipForImageClassification architecture.
Fire-Detection-Siglip2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to detect fire, smoke, or normal conditions using the SiglipForImageClassification architecture.
Hand-Gesture-2-Robot is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to recognize hand gestures and map them to specific robot commands using the SiglipForImageClassification architecture.
Gym-Workout-Classifier-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into gym workout exercise categories using the SiglipForImageClassification architecture.
Alphabet-Sign-Language-Detection is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images into sign language alphabet categories using the SiglipForImageClassification architecture.
SAT-Landforms-Classifier is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify satellite images into different landform categories using the SiglipForImageClassification architecture.
Food-101-93M is a fine-tuned image classification model built on top of google/siglip2-base-patch16-224 using the SiglipForImageClassification architecture. It is trained to classify food images into one of 101 popular dishes, derived from the Food-101 dataset.
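For quick experiments, a checkpoint like this can also be driven through the transformers image-classification pipeline, which handles preprocessing and label mapping internally. A short sketch (again, the repo ID is a placeholder, not a confirmed Hub name):

```python
from transformers import pipeline

# Placeholder repo ID: point this at the actual Food-101 checkpoint on the Hub.
classifier = pipeline("image-classification", model="your-org/Food-101-93M")

# Prints the top three predicted dishes with their confidence scores.
for prediction in classifier("meal.jpg", top_k=3):
    print(f"{prediction['label']}: {prediction['score']:.3f}")
```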