FitVTON: Fit-aware Virtual Try-On
via Body-Garment Size Control

Yiqun Ning, Ao Shen, Chenhang He, Lei Zhang

Corresponding author

FitVTON teaser: compared with prompt-driven commercial editing, FitVTON adapts hem position, tightness, and cuff elasticity to each body shape
With garment-body size prompts, commercial editing models produce a “neutral fit” across different body shapes. FitVTON produces faithful fit: hem position, tightness, and cuff elasticity vary across bodies.

Abstract

FitVTON is a fit-aware virtual try-on model that generates authentic garment fitting effects across diverse body shapes. FitVTON encodes garment-body size as structured text prompts (e.g., “long-length upper garment” on a “slim, medium-tall body”) and learns fitting dynamics from physically simulated try-on triplets. A FLUX.1 Kontext flow-matching backbone is fine-tuned with modality-specific LoRA adapters and dual-branch garment/body mask supervision, then rectified on real images in a second stage to bridge the sim-to-real gap.

GarmentCodeVTON Explorer

Drag either orbit to rotate · click a card to bring it to front · use the arrows to switch body. Top orbit: garments · center: SMPL-X body · bottom orbit: simulated try-on for the current body.

SMPL-X body
Body 1 / 10

Method

FitVTON architecture: FLUX.1 Kontext backbone with modality-specific LoRA adapters, dual-branch mask supervision, and two-stage training
Top: Person + garment + Garment-Body Size prompt → fit-aware try-on via FLUX.1 Kontext with dual LoRA adapters and dual-branch mask supervision. Bottom: Stage I fits on simulation triplets; Stage II rectifies textures on real images with the image LoRA only.

Highlights

Results

Fit-oriented protocol on FittingEffect3K (GPT-scored, 1–5; category averages across GB / T-L / SC / LF)

MethodUpper AvgLower AvgDress AvgWhole Avg
CatVTON2.622.091.952.30
OmniTry3.002.152.402.55
Any2AnyTryOn2.922.471.792.57
JCo-MVTON2.962.712.152.74
Nano Banana3.192.452.832.82
FitVTON3.222.992.903.08

Human preference study on FittingEffect3K (20 participants, 100 cases, 2,000 selections; best fit vs. ground truth)

MethodSelections ↑Ratio ↑
FitVTON66633.30%
Nano Banana51725.85%
JCo-MVTON42121.05%
OmniTry1638.15%
Any2AnyTryOn1477.35%
CatVTON864.30%

Resources

BibTeX

@article{ning2026fitvton,
  title   = {FitVTON: Fit-aware Virtual Try-On via Body-Garment Size Control},
  author  = {Ning, Yiqun and Shen, Ao and He, Chenhang and Zhang, Lei},
  journal = {arXiv preprint arXiv:2606.12012},
  year    = {2026}
}