Classification of 101 food classes using ResNet, YOLO and Falcon 2

Main idea and focus was to compare CNN architectures such as ResNet and YOLO vs a vision-language model (VLM) such as Falcon 2.

Workflow:

ResNet with ImageNet weights was trained via transfer learning.
YOLOv8 model was trained.
Automatic pipeline was built for Falcon 2 to classify images via zero-shot prompting.

Models

ResNet-50

YOLOv8

Falcon 2

Results:

Model	Accuracy	Loss	F1
ResNet
YOLO
Falcon2

Links to notebooks:

Dataset: food101

This dataset consists of 101 food categories, with 101'000 images. For each class, 250 manually reviewed test images are provided as well as 750 training images. On purpose, the training images were not cleaned, and thus still contain some amount of noise. This comes mostly in the form of intense colors and sometimes wrong labels. All images were rescaled to have a maximum side length of 512 pixels.

CLASSES:

['apple_pie', 'baby_back_ribs', 'baklava', 'beef_carpaccio', 'beef_tartare', 'beet_salad', 'beignets', 'bibimbap', 'bread_pudding', 'breakfast_burrito', 'bruschetta', 'caesar_salad', 'cannoli', 'caprese_salad', 'carrot_cake', 'ceviche', 'cheesecake', 'cheese_plate', 'chicken_curry', 'chicken_quesadilla', 'chicken_wings', 'chocolate_cake', 'chocolate_mousse', 'churros', 'clam_chowder', 'club_sandwich', 'crab_cakes', 'creme_brulee', 'croque_madame', 'cup_cakes', 'deviled_eggs', 'donuts', 'dumplings', 'edamame', 'eggs_benedict', 'escargots', 'falafel', 'filet_mignon', 'fish_and_chips', 'foie_gras', 'french_fries', 'french_onion_soup', 'french_toast', 'fried_calamari', 'fried_rice', 'frozen_yogurt', 'garlic_bread', 'gnocchi', 'greek_salad', 'grilled_cheese_sandwich', 'grilled_salmon', 'guacamole', 'gyoza', 'hamburger', 'hot_and_sour_soup', 'hot_dog', 'huevos_rancheros', 'hummus', 'ice_cream', 'lasagna', 'lobster_bisque', 'lobster_roll_sandwich', 'macaroni_and_cheese', 'macarons', 'miso_soup', 'mussels', 'nachos', 'omelette', 'onion_rings', 'oysters', 'pad_thai', 'paella', 'pancakes', 'panna_cotta', 'peking_duck', 'pho', 'pizza', 'pork_chop', 'poutine', 'prime_rib', 'pulled_pork_sandwich', 'ramen', 'ravioli', 'red_velvet_cake', 'risotto', 'samosa', 'sashimi', 'scallops', 'seaweed_salad', 'shrimp_and_grits', 'spaghetti_bolognese', 'spaghetti_carbonara', 'spring_rolls', 'steak', 'strawberry_shortcake', 'sushi', 'tacos', 'takoyaki', 'tiramisu', 'tuna_tartare', 'waffles']

DATASET SIZE: 4.77 GiB images

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
assets		assets
logs		logs
mlruns/1/fbe123a065124ff39e2068480c3ead8e/artifacts		mlruns/1/fbe123a065124ff39e2068480c3ead8e/artifacts
.gitignore		.gitignore
Food-Vision.ipynb		Food-Vision.ipynb
README.md		README.md
consts.py		consts.py
mlflow.db		mlflow.db
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Classification of 101 food classes using ResNet, YOLO and Falcon 2

Workflow:

Models

ResNet-50

YOLOv8

Falcon 2

Results:

Links to notebooks:

Dataset: food101

About

Releases

Packages

Languages

akvachan/Food-Vision

Folders and files

Latest commit

History

Repository files navigation

Classification of 101 food classes using ResNet, YOLO and Falcon 2

Workflow:

Models

ResNet-50

YOLOv8

Falcon 2

Results:

Links to notebooks:

Dataset: food101

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages