BeautyMaster

👋 join us on

📘 Introduction

The "BeautyMaster" uses the InternLM2 large model, XTuner fine-tuning tool, and LMDeploy deployment tool to provide users with personalized fashion matching suggestions. By uploading full-body photos and wardrobe clothing, combined with seasons, temperature, weather, travel purposes and personal needs, the most perfect dressing plan can be customized.

The highlights of the project are as follows:

Fine-tuning of large models 🔧: By fine-tuning the InternLM2 model with the help of XTuner, the accuracy of personalized recommendations is improved;
Support personalized dressing 👗: Customize personalized fashion matching solutions according to the user's individual needs and occasions;
Intelligent recommendation 🌦️📅: Combine seasons, temperature, weather and other factors to provide users with practical dressing suggestions;
Easy to use 📸: Upload photos and clothes to easily obtain fashion matching solutions and say goodbye to choice difficulties;
Professional guidance 💡🔍: With the help of big data analysis and artificial intelligence technology, create the best fashion look for users;
Good scalability ✨: It can be expanded to facial makeup, hairstyles, bags and accessories in the later stage;
Immersive experience 🎨🌟: In the later stage, the matching and dressing will be applied to the user to provide 3D visual effects.

Welcome Everyone is welcome to pull request and ✨star✨ our project, your ✨star✨ will provide us with the motivation to move forward!!!

💪 Goal

Primary

Train a VLM to have basic aesthetic ability.
Align human aesthetic abilities.
Choose the right match according to aesthetic ability.
The previous results are presented through pictures.

Flow

✊ Todo

Train a VLM to have basic aesthetic ability

Data Processing(WIP)
- Collect fashion wear data.
- Look for a VLM to label ready-made images internVL.
- Make a data annotation tool.
- Correct data form VLM for test and train.

Align human aesthetic abilities

RLHF instruct
DPO DPO
ORPO ORPO(https://github.com/xfactlab/orpo/tree/main)
RLHF-V RLHF-V(https://github.com/RLHF-V/RLHF-V)

Choose the right match according to aesthetic ability.

The trained VLM gives suggestions on what to wear
- internVL

The previous results are presented through pictures

Segment Tool
- segment-anything: Features: it can be used to generate masks for all objects in an image.
- Self Correction for Human ParsingFeatures: An out-of-box human parsing representation extractor.
Virtual Try-on(WIP)
- IDM-VTON: Features: it could keep background of the VToN, Techs: IP-Adapter, TryonNet, GarmentNet.
- COTTON-size-does-matter: Features: it can adjust the size of VToN with a parameter.
- OOTDiffusion: Techs: LDM-based, outfitting UNet, outfitting dropout, Classifier-free guidance.
- OutfitAnyone: Ultra-high quality virtual try-on for Any Clothing and Any Person (Including Anime character pictures in demo).
- StableVITON: Features: it could keep background of the VToN, Techs: LDM-based, zero cross-attention blocks, attention total variation loss and augmentation.
- AnyFit
Visualize the results of the suggestions to the user
- ControlNet
- 4d-dress: A 4D Dataset of Real-world Human Clothing with Semantic Annotations
- champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
- MOSS

📂 Repo structure (WIP)

├── README.md
├── README_cn.md
├── docs
├── scripts
├── beautymaster
│   ├── datasets
│   ├── third_party
│   │   ├── internvl
│   │   ├── IDM-VTON                     
│   │   ├── champ
│   │   ├── archpp
│   │   ├── ControlNet
│   │   └── OOTDiffusion
│   ├── demo
│   ├── src
│   ├── train     
│   └── utils
├── requirements.txt

🛠️ Requirements and Installation

Clone this repository and open BeautyMaster folder

git clone --recursive https://github.com/RayTang88/BeautyMaster.git
cd BeautyMaster

Install required packages

conda create -n beautyMaster python=3.10 -y
conda activate beautyMaster

Install additional packages for training cases

pip install -r requirements.txt
MAX_JOBS=4 pip install -U flash-attn==2.5.8 --no-build-isolation # faster instalation for flash-attn

Datasets

Refer to Data.md

Infer

set env

export MODEL_ROOT="..."
export CODE_ROOT="..."
export DATA_ROOT="..."

Note: Set the root path for downloading models, the root path for cloning code, and the root path for data respectively.

gradio

python $CODE_ROOT/BeautyMaster/beautymaster/demo/app.py

python

python $CODE_ROOT/BeautyMaster/beautymaster/demo/infer.py

🔒 License

Usage and License Notices: The data, code, and checkpoint are intended and licensed for research use only,non-commercial use. They are also restricted to uses that follow the license agreement of InternVL, RLHF,DPO,ORPO,RLHF-V,segment-anything,SCHP,IDM-VTON,ControlNet,4d-dress,champ. The dataset is CC BY NC 4.0 (allowing only non-commercial use) and models trained using the dataset should not be used outside of research purposes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README_en.md

README_en.md

BeautyMaster

📘 Introduction

💪 Goal

✊ Todo

Train a VLM to have basic aesthetic ability

Align human aesthetic abilities

Choose the right match according to aesthetic ability.

The previous results are presented through pictures

📂 Repo structure (WIP)

🛠️ Requirements and Installation

Datasets

Infer

set env

gradio

python

🔒 License

Files

README_en.md

Latest commit

History

README_en.md

File metadata and controls

BeautyMaster

📘 Introduction

💪 Goal

✊ Todo

Train a VLM to have basic aesthetic ability

Align human aesthetic abilities

Choose the right match according to aesthetic ability.

The previous results are presented through pictures

📂 Repo structure (WIP)

🛠️ Requirements and Installation

Datasets

Infer

set env

gradio

python

🔒 License