A Prefect server for orchestrating machine learning training and inference runs. Supports YOLO (training and inference), SegGPT (inference), ONNX models (inference), and the IFCB flow metric (training). After setting up the system with Docker, users can run and monitor workflows from the browser-based Prefect UI.
Copy the example environment file and fill in your values:
```bash
cp .env.example .env
```

Edit `.env` with your specific values:
- `POSTGRES_USERNAME`: Your PostgreSQL username
- `POSTGRES_PASSWORD`: Your PostgreSQL password
- `EXTERNAL_HOST_NAME`: External hostname of your machine
- `PROVENANCE_STORE_URL`: URL for the provenance store
- `MEDIASTORE_URL`: URL for your media store
- `MEDIASTORE_TOKEN`: Authentication token for the media store
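A filled-in `.env` might look like the following (all values are illustrative):

```bash
POSTGRES_USERNAME=prefect
POSTGRES_PASSWORD=change-me
EXTERNAL_HOST_NAME=ml-server.example.org
PROVENANCE_STORE_URL=https://provenance.example.org
MEDIASTORE_URL=https://media.example.org
MEDIASTORE_TOKEN=your-token-here
```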
Use Docker Compose to start the PostgreSQL container:
```bash
docker compose up -d postgres
```
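You can confirm the container is up before moving on:

```bash
docker compose ps postgres
```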
Create a virtual environment and install dependencies:
```bash
python3 -m venv .venv
source .venv/bin/activate
pip install -r src/requirements.txt
```
Load environment variables and configure Prefect:
```bash
# Load environment variables
source .env

# Set Prefect configuration
prefect config set PREFECT_SERVER_API_HOST="$EXTERNAL_HOST_NAME"
prefect config set PREFECT_API_DATABASE_CONNECTION_URL="postgresql+asyncpg://$POSTGRES_USERNAME:$POSTGRES_PASSWORD@localhost:5432/prefect"

# Start Prefect server
prefect server start
```
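If the server fails to start, you can verify that both settings were applied with:

```bash
prefect config view
```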
In separate terminal windows, deploy the workflows you want to use:
For ONNX Inference:
```bash
source .venv/bin/activate
source .env
python src/flows/onnx_inference.py
```
For YOLO Inference:
```bash
source .venv/bin/activate
source .env
python src/flows/yolo_inference.py
```
For IFCB Flow Metric Training:
```bash
source .venv/bin/activate
source .env
python src/flows/ifcb_training.py
```
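Each script blocks its terminal while it serves the flow to the Prefect server, which is why they run in separate windows. A minimal sketch of the pattern (illustrative names only; the real flows live in `src/flows/` and may register their deployments differently):

```python
from prefect import flow


@flow(name="onnx-inference")  # hypothetical flow name
def onnx_inference(model: str, input_dir: str, output_dir: str) -> None:
    """Run ONNX inference (body omitted in this sketch)."""
    ...


if __name__ == "__main__":
    # serve() registers the flow as a deployment and polls for runs
    onnx_inference.serve(name="onnx-inference")
```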
Navigate to the Prefect UI in your browser at `http://{EXTERNAL_HOST_NAME}:4200`.
The ONNX inference workflow requires the following parameters in the Prefect UI:
`ONNXInferenceParams`:

- `model`: Path to the ONNX model file
- `input_dir`: Directory containing input data
- `output_dir`: Directory where results will be saved
- `batch` (optional): Batch size for inference
- `classes` (optional): Specific classes to process
- `outfile` (optional): Custom output filename
- `force_notorch` (optional): Force non-PyTorch backend
- `cuda_visible_devices`: GPU devices to use (default: "0,1,2,3")
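Runs can also be triggered from the command line instead of the UI. Assuming a deployment named as in the sketch above (parameter names must match the deployment's actual schema):

```bash
prefect deployment run 'onnx-inference/onnx-inference' \
  --param model=/models/detector.onnx \
  --param input_dir=/data/input \
  --param output_dir=/data/output
```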
The YOLO inference workflow requires two parameter sets:
`YOLOInferenceParams`:

- `data_dir`: Directory containing input images/videos
- `output_dir`: Directory where results will be saved
- `model_weights_path`: Path to YOLO model weights (.pt file)
- `device`: Compute device for inference (e.g., "0" for GPU 0, "cpu")
- `agnostic_nms`: Class-agnostic Non-Maximum Suppression (default: true)
- `iou`: IoU threshold for NMS to eliminate overlapping boxes (default: 0.5)
- `conf`: Minimum confidence threshold for detections (default: 0.1)
- `imgsz`: Image size for inference (default: 1280)
- `batch`: Batch size for processing multiple inputs (default: 16)
- `half`: Half-precision (FP16) inference for speed (default: false)
- `max_det`: Maximum detections allowed per image (default: 300)
- `vid_stride`: Frame stride for video processing (default: 1)
- `stream_buffer`: Queue frames vs. drop old frames (default: false)
- `visualize`: Visualize model features during inference (default: false)
- `augment`: Test-time augmentation for improved robustness (default: false)
- `classes` (optional): Filter predictions to specific class IDs
- `retina_masks`: High-resolution segmentation masks (default: false)
- `embed` (optional): Extract feature vectors from specified layers
- `name` (optional): Name for the prediction run subdirectory
- `verbose`: Display detailed inference logs (default: true)
`YOLOVisualizationParams`:

- `show`: Display annotated images/videos in a window (default: false)
- `save`: Save annotated images/videos to file (default: false)
- `save_frames`: Save individual video frames as images (default: false)
- `save_txt`: Save detection results in text format (default: false)
- `save_conf`: Include confidence scores in saved text files (default: false)
- `save_crop`: Save cropped images of detections (default: false)
- `show_labels`: Display labels for each detection (default: true)
- `show_conf`: Display confidence scores alongside labels (default: true)
- `show_boxes`: Draw bounding boxes around detected objects (default: true)
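These names mirror the Ultralytics predict arguments, so the flow presumably forwards them to a call along these lines (a sketch with placeholder paths, not the actual flow code):

```python
from ultralytics import YOLO

# Placeholder values standing in for the UI parameters above
model = YOLO("/models/best.pt")      # model_weights_path
results = model.predict(
    source="/data/images",           # data_dir
    device="0",
    conf=0.1,                        # defaults from YOLOInferenceParams
    iou=0.5,
    imgsz=1280,
    agnostic_nms=True,
    max_det=300,
    save=False,                      # fields from YOLOVisualizationParams
    save_txt=False,
    show_labels=True,
)
```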
The IFCB flow metric training workflow requires the following parameters in the Prefect UI:
`IFCBTrainingParams`:

- `data_dir`: Directory containing IFCB point cloud data
- `output_dir`: Directory where the trained model will be saved
- `id_file` (optional): File containing the list of IDs to load (one PID per line)
- `n_jobs`: Number of parallel jobs for the load/extraction phase (-1 uses all CPUs; default: -1)
- `contamination`: Expected fraction of anomalous distributions (default: 0.1)
- `aspect_ratio`: Camera frame aspect ratio (width/height; default: 1.36)
- `chunk_size`: Number of PIDs to process in each chunk (default: 100)
- `model_filename`: Filename for the trained model (default: "classifier.pkl")
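The `.pkl` extension suggests the trained model is serialized with pickle, in which case it can be loaded back in Python after a successful run (the path below uses the defaults listed above; adjust to your configured `output_dir`):

```python
import pickle

# Load the trained flow-metric model from the configured output directory
with open("output_dir/classifier.pkl", "rb") as f:
    model = pickle.load(f)

print(type(model))
```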
For YOLO training, the data directory should contain a `dataset.yaml` file:
```yaml
path: /data          # dataset root dir
train: images/train  # train images (relative to 'path')
val: images/val      # val images (relative to 'path')
test: images/test    # test images (optional)

names:
  0: person
  1: bicycle
  2: car
```
Ensure your directory structure matches:
```
data_dir/
├── dataset.yaml
├── images/
│   ├── train/
│   ├── val/
│   └── test/ (optional)
└── labels/
    ├── train/
    ├── val/
    └── test/ (optional)
```
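Each file under `labels/` pairs with the image of the same name and follows the standard YOLO annotation format: one object per line, a class ID followed by normalized center coordinates and box dimensions. For example (illustrative values):

```
0 0.481 0.634 0.122 0.250
2 0.207 0.312 0.090 0.115
```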