Skip to content

marc-lehner/Transformers_for_images

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

89 Commits
 
 
 
 
 
 

Repository files navigation

KNIME Nodes for Vision Transformers

License: GPL v3

This repository provides a KNIME extension for fine-tuning and predicting with Vision Transformer (ViT) models. The nodes are fully developed in Python using PyTorch and HuggingFace Transformers and can be integrated into your KNIME workflows via the KNIME Analytics Platform.

Installation

KNIME Analytics Platform

The extension can be installed via the KNIME Hub by dragging and doping or installed like any other KNIME extension via the KNIME Extension Manager.

Usage

Here is an example of workflow that uses the extension.

Features

  • ViT Classification Learner Node

    • Train transformer models on image classification tasks.
    • Supports ViT, Swin Transformer, and Pyramid Transformer architectures.
    • Accepts training and validation image sets in PNG format.
    • Configurable epochs, batch size, learning rate, and model type.
  • ViT Classification Predictor Node

    • Predict labels and class probabilities on new image data.
    • Auto-decodes predictions to original label strings.
    • Customizable output column names and probability formatting.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%