An AI-powered clothing visualization system that uses the FitDiT deep learning model to generate photorealistic renderings of garments on person images. It handles segmentation, pose estimation, and image synthesis to create a seamless try-on experience.
┌─────────────────────────────────────────────────────────────────────┐
│                         Client Applications                         │
│ (Web Interface, Mobile App, E-commerce Integration, Python Client)  │
└───────────────────────────────┬─────────────────────────────────────┘
                                │
                                ▼
┌─────────────────────────────────────────────────────────────────────┐
│                           REST API Layer                            │
│                 (Request Handling, Authentication)                  │
└───────────────────────────────┬─────────────────────────────────────┘
                                │
                                ▼
┌─────────────────────────────────────────────────────────────────────┐
│                      ComfyUI Integration Layer                      │
│                (Workflow Management, Job Processing)                │
└───────────────────────────────┬─────────────────────────────────────┘
                                │
                                ▼
┌─────────────────────────────────────────────────────────────────────┐
│                      Core Processing Pipeline                       │
├─────────────────┬─────────────────────────┬─────────────────────────┤
│  FitDiT Model   │  Mask Generator Module  │ Try-On Processor Module │
│   (AI Engine)   │  (Segmentation & Pose)  │    (Image Synthesis)    │
└─────────────────┴─────────────────────────┴─────────────────────────┘
- User uploads person image and garment image
- System validates image formats and dimensions
- FitDiT model processes the person image to generate segmentation and pose information
- Try-On processor combines all inputs to render the garment on the person
- System returns final image and any requested intermediate outputs
- Results are displayed to the user or returned via API
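The validation step in the workflow above can be sketched as follows. This is a minimal illustration using only the Python standard library. The accepted formats (PNG and JPEG), the size limits `MIN_SIDE` and `MAX_SIDE`, and all function names are assumptions made for the example, not the service's documented rules.

```python
import struct
from typing import Optional, Tuple

PNG_MAGIC = b"\x89PNG\r\n\x1a\n"   # 8-byte PNG file signature
JPEG_MAGIC = b"\xff\xd8\xff"       # JPEG start-of-image marker

# Hypothetical limits; the real service may enforce different ones.
MIN_SIDE = 256
MAX_SIDE = 4096


def detect_format(data: bytes) -> Optional[str]:
    """Return 'png' or 'jpeg' based on the leading magic bytes, else None."""
    if data.startswith(PNG_MAGIC):
        return "png"
    if data.startswith(JPEG_MAGIC):
        return "jpeg"
    return None


def png_size(data: bytes) -> Tuple[int, int]:
    """Read (width, height) from the IHDR chunk that follows the PNG signature."""
    if len(data) < 24 or data[12:16] != b"IHDR":
        raise ValueError("truncated or malformed PNG header")
    width, height = struct.unpack(">II", data[16:24])
    return width, height


def check_dimensions(width: int, height: int) -> None:
    """Raise ValueError if either side falls outside the assumed limits."""
    for side in (width, height):
        if not MIN_SIDE <= side <= MAX_SIDE:
            raise ValueError(
                f"image side {side}px is outside {MIN_SIDE}-{MAX_SIDE}px"
            )


def validate_upload(data: bytes) -> str:
    """Validate one uploaded image and return its detected format."""
    fmt = detect_format(data)
    if fmt is None:
        raise ValueError("unsupported image format")
    if fmt == "png":
        check_dimensions(*png_size(data))
    # JPEG dimensions require scanning for an SOF marker; omitted in this sketch.
    return fmt
```

Both the person image and the garment image would pass through `validate_upload` before any model work starts, so malformed uploads fail fast.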
┌─────────────┐    ┌─────────────┐    ┌──────────────────┐
│ Person Image│    │Garment Image│    │  Model Loading   │
└──────┬──────┘    └──────┬──────┘    └────────┬─────────┘
       │                  │                    │
       ▼                  │                    ▼
┌──────────────┐          │           ┌──────────────────┐
│ Pose         │          │           │  FitDiT Model    │
│ Extraction   │◄─────────┼───────────┤  Inference       │
└──────┬───────┘          │           └────────┬─────────┘
       │                  │                    │
       ▼                  │                    ▼
┌──────────────┐          │           ┌──────────────────┐
│ Segmentation │          │           │  Mask Creation   │
│ Mask         │◄─────────┼───────────┤                  │
└──────┬───────┘          │           └────────┬─────────┘
       │                  │                    │
       │                  ▼                    │
       │           ┌──────────────┐            │
       └──────────►│    Try-On    │◄───────────┘
                   │  Processing  │
                   └──────┬───────┘
                          │
                          ▼
                   ┌──────────────┐
                   │ Final Image  │
                   │  Rendering   │
                   └──────────────┘
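One way to read the data flow diagram is as a dependency graph in which each stage can run only after the stages that feed it. The sketch below encodes the arrows from the diagram and derives a valid execution order with the standard library's `graphlib`. The stage identifiers are illustrative names chosen for this example, not names taken from the codebase.

```python
from graphlib import TopologicalSorter  # Python 3.9+

# Each stage maps to the set of stages it depends on, as read from the diagram.
# Stage names are illustrative identifiers, not names from the codebase.
DATA_FLOW = {
    "fitdit_inference": {"model_loading"},
    "pose_extraction": {"person_image", "fitdit_inference"},
    "mask_creation": {"fitdit_inference"},
    "segmentation_mask": {"pose_extraction", "mask_creation"},
    "tryon_processing": {"garment_image", "segmentation_mask", "mask_creation"},
    "final_image_rendering": {"tryon_processing"},
}


def execution_order(graph):
    """Return the stages in an order that respects every dependency."""
    return list(TopologicalSorter(graph).static_order())


if __name__ == "__main__":
    for step, stage in enumerate(execution_order(DATA_FLOW), start=1):
        print(step, stage)
```

Several orders are valid, since the inputs and some intermediate stages do not depend on each other, but final image rendering always comes last because every other stage feeds into it.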
- Endpoint: `/api/v1/tryon`
- Method: POST
- Parameters:
  - `person_image`: Base64-encoded person image
  - `garment_image`: Base64-encoded garment image
  - `options`: JSON object with rendering parameters
- Response: JSON object with result URLs and processing metadata
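A request to this endpoint might be built as shown below. This is a hedged sketch: the host in `BASE_URL`, the choice of a JSON request body, the `resolution` option key in the usage example, and the function name `build_tryon_request` are all assumptions. Authentication, which the REST API layer handles, is left out because its scheme is not described here.

```python
import base64
import json
import urllib.request
from typing import Optional

BASE_URL = "https://tryon.example.com"  # hypothetical host


def build_tryon_request(
    person_bytes: bytes,
    garment_bytes: bytes,
    options: Optional[dict] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for /api/v1/tryon."""
    payload = {
        # Both images are sent as Base64 text, as the parameter list specifies.
        "person_image": base64.b64encode(person_bytes).decode("ascii"),
        "garment_image": base64.b64encode(garment_bytes).decode("ascii"),
        # Rendering parameters; the accepted keys are not documented here.
        "options": options or {},
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/api/v1/tryon",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},  # assumed body encoding
        method="POST",
    )
```

The returned object can be passed to `urllib.request.urlopen` to send it, for example `urllib.request.urlopen(build_tryon_request(person, garment, {"resolution": "768x1024"}))`, where the option key is purely illustrative.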
- Endpoint: `/api/v1/status/{job_id}`
- Method: GET
- Response: JSON object with job status and progress information
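A client would typically poll the status endpoint until the job finishes. The sketch below shows one way to do that. The host in `BASE_URL`, the `status` field name, and the terminal values `"completed"` and `"failed"` are assumptions, since the response schema is only described in general terms above.

```python
import json
import time
import urllib.request
from typing import Callable

BASE_URL = "https://tryon.example.com"  # hypothetical host
TERMINAL_STATES = {"completed", "failed"}  # assumed status values


def fetch_status(job_id: str) -> dict:
    """GET /api/v1/status/{job_id} and decode the JSON response."""
    with urllib.request.urlopen(f"{BASE_URL}/api/v1/status/{job_id}") as resp:
        return json.load(resp)


def wait_for_job(
    job_id: str,
    fetch: Callable[[str], dict] = fetch_status,
    interval: float = 2.0,
    max_polls: int = 60,
) -> dict:
    """Poll until the job reaches a terminal state or max_polls is exhausted."""
    for _ in range(max_polls):
        info = fetch(job_id)
        if info.get("status") in TERMINAL_STATES:
            return info
        time.sleep(interval)  # wait before asking again
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")
```

Passing `fetch` as a parameter keeps the polling loop testable without a running server: a stub that returns canned status dictionaries can stand in for `fetch_status`.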
[Example results table with columns Model Input, Garment Input, and Output; images not available]