TinyVAE Training Repository

This repository contains the implementation of a Tiny Variational Autoencoder (VAE) trained using a cyclical annealing schedule. The model is trained on images and saves the encoder and decoder weights periodically.

Repository Structure

├── .gitignore
├── LICENSE
├── decoded_image_epoch_999.png
├── input.jpg
├── requirements.txt
├── tiny_decoder_epoch_1000.pth
├── tiny_encoder_epoch_1000.pth
└── train_flux_tinyvae.py

Installation

Clone the repository:

git clone https://github.com/XmYx/tinyvae-flux
cd tinyvae-flux

Create and activate a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`

Install the required packages:

pip install -r requirements.txt

Training the Model

To start training the TinyVAE model, use the following command. Make sure you have a folder with training images.

python train_flux_tinyvae.py <data_folder> <output_folder>

Replace <data_folder> with the path to your folder containing the images, and <output_folder> with the path to the folder where you want to save the model checkpoints and generated images.

Example

python train_flux_tinyvae.py ./data ./output

Testing the Model

After training, you can use the saved encoder and decoder weights to test the model. Below is an example of how to load the model weights and generate an image from a sample input.

import torch
from torchvision import transforms
from PIL import Image
from train_flux_tinyvae import TinyAutoEncoder, VaeImageProcessor, postprocess

# Load the model
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = TinyAutoEncoder(size_variant='tiny').to(device)
model.encoder.load_state_dict(torch.load('output/tiny_encoder_epoch_1000.pth'))
model.decoder.load_state_dict(torch.load('output/tiny_decoder_epoch_1000.pth'))
model.eval()

# Load the input image
transform = transforms.Compose([
    transforms.Resize((512, 512)),
    transforms.ToTensor(),
])
input_image = Image.open('input.jpg').convert("RGB")
input_tensor = transform(input_image).unsqueeze(0).to(device)

# Process the image
processor = VaeImageProcessor(vae_scale_factor=16, vae_latent_channels=16)
preprocessed = processor.preprocess(input_tensor, width=512, height=512)
encoded_sample = model.encoder(preprocessed)
decoded_sample = model.decoder(encoded_sample)

# Postprocess and save the output image
output_image = postprocess(decoded_sample[0])
output_image.save('output_image.png')

Results

Sample output image after training for 1000 epochs:

Additional Resources

For more information on cyclical annealing schedules in VAE training, check out this article: A Must-Have Training Trick for VAE (Variational Autoencoder).

License

This project is licensed under the MIT License - see the LICENSE file for details.

Feel free to open an issue or a pull request if you have any questions or suggestions!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TinyVAE Training Repository

Repository Structure

Installation

Training the Model

Example

Testing the Model

Results

Additional Resources

License

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
decoded_image_epoch_999.png		decoded_image_epoch_999.png
input.jpg		input.jpg
requirements.txt		requirements.txt
tiny_decoder_epoch_1000.pth		tiny_decoder_epoch_1000.pth
tiny_encoder_epoch_1000.pth		tiny_encoder_epoch_1000.pth
train_flux_tinyvae.py		train_flux_tinyvae.py

License

XmYx/tinyvae-flux

Folders and files

Latest commit

History

Repository files navigation

TinyVAE Training Repository

Repository Structure

Installation

Training the Model

Example

Testing the Model

Results

Additional Resources

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages