Font Classification

This repository contains code for classifying fonts with convolutional neural networks (CNNs). Font classification means identifying the font family or typeface of a given text image. It is useful for applications such as font recognition in images, document analysis, and typography-related projects.

[Image: a few sample images]

Dataset

The dataset consists of text images organized into folders, one folder per font family or typeface. Each image is a sample of text written in a specific font.
There are 10 font classes in total, with approximately 80 sample images per font. The data is located at project_files/data in the repo.

[Image: folder structure]
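To check the class balance described above, the images in each font folder can be counted. The snippet below is a minimal sketch, assuming one subfolder per font under the data directory; the helper name and the list of image extensions are illustrative assumptions, not taken from the repo.

```python
from pathlib import Path

# Assumed image extensions; adjust to whatever the dataset actually uses.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def count_images_per_font(data_dir):
    """Return {font_folder_name: number_of_image_files} for each class folder."""
    counts = {}
    for class_dir in sorted(Path(data_dir).iterdir()):
        if class_dir.is_dir():
            # Non-image files (for example a .ttf file) are skipped.
            counts[class_dir.name] = sum(
                1
                for f in class_dir.iterdir()
                if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
            )
    return counts


# Example call (path taken from the text above):
# print(count_images_per_font("project_files/data"))
```

Roughly equal counts across folders would support using accuracy as a headline metric.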

Generating more data

The dataset also contains a .ttf file for each font, which can be used to generate synthetic images.

The process of generating synthetic data is explained in generating_synthetic_images.ipynb.

After creating the synthetic dataset, we have a total of 10,000 training images and 1,000 validation images, which is enough to train CNN models effectively.
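As a rough illustration of how a .ttf file can be turned into a training image, here is a minimal sketch using Pillow. It is not the notebook's actual code: the function name, canvas size, margin, and font path are all assumptions made for this example.

```python
from PIL import Image, ImageDraw, ImageFont


def render_text_image(text, font, size=(256, 64), margin=8):
    """Draw black text on a white grayscale canvas and return the image."""
    image = Image.new("L", size, color=255)  # "L" = 8-bit grayscale
    draw = ImageDraw.Draw(image)
    draw.text((margin, margin), text, font=font, fill=0)
    return image


# Example usage with a hypothetical font path:
# font = ImageFont.truetype("path/to/SomeFont.ttf", size=32)
# render_text_image("Hello world", font).save("sample.png")
```

Varying the text, font size, and position between calls is one simple way to get many distinct samples per font.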

Models

The model architectures can be found in the models folder. The best-performing models can be found in the saved_models folder of the repo.
The following models were trained on the dataset:

1. LeNet model

The LeNet model is a natural first choice for image classification tasks because of its simplicity and effectiveness. The images are converted to grayscale and then passed through Conv2d and max-pooling layers. Since it is a small model (200k parameters), it is trained from scratch on the given dataset.

2. ResNet18 model

The second model chosen was ResNet18. Transfer learning was used to fine-tune the model with all pretrained weights frozen. The last layer of the model was modified to have 10 output neurons, one per font class. Two versions of the ResNet model (with different hyperparameters and augmentation techniques) were trained and evaluated.

3. EfficientNet model

To experiment with a more modern architecture, efficientnet-b1 was fine-tuned. EfficientNet models are highly efficient thanks to their compound scaling method, achieving good performance with fewer parameters. Since compute was limited, fine-tuning this model was a prudent choice.

Performance Metrics

Since the number of sample images per class was balanced, accuracy was used as the first metric. To make the model generalise better and to keep track of false positives and false negatives, we also logged precision, recall, and F1 score. During training, checkpoints were compared on validation F1 score, and the models with the highest score were saved.
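For reference, the metrics above can be computed from predicted and true labels as in the sketch below. It assumes macro averaging over classes, which the text does not specify, and both function names are illustrative rather than taken from performance_measure.py.

```python
def macro_precision_recall_f1(y_true, y_pred, num_classes):
    """Return macro-averaged (precision, recall, f1) for integer class labels."""
    precisions, recalls, f1_scores = [], [], []
    pairs = list(zip(y_true, y_pred))
    for cls in range(num_classes):
        tp = sum(1 for t, p in pairs if t == cls and p == cls)
        fp = sum(1 for t, p in pairs if t != cls and p == cls)
        fn = sum(1 for t, p in pairs if t == cls and p != cls)
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1_scores.append(f1)
    return (
        sum(precisions) / num_classes,
        sum(recalls) / num_classes,
        sum(f1_scores) / num_classes,
    )


def select_best_epoch(val_f1_per_epoch):
    """Return the index of the epoch with the highest validation F1 (first on ties)."""
    best_epoch, best_f1 = None, float("-inf")
    for epoch, f1 in enumerate(val_f1_per_epoch):
        if f1 > best_f1:
            best_epoch, best_f1 = epoch, f1
    return best_epoch
```

In a real training loop, the point where a new best F1 is found is where the checkpoint would be written to disk.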

Training, Logging

Training was done on a single cloud GPU instance. To monitor training metrics, a subset of the training data was evaluated in eval mode along with the validation data.
Logging and experiment tracking were done using Weights and Biases.

[Image: snapshot of the performance of the top 4 models]

Results

Here is a table summarizing the performance of the top 4 models:

Model          Accuracy   F1 Score
LeNet          0.81       0.83
ResNet18       0.78       0.80
ResNet18-256   0.83       0.85
EfficientNet   0.80       0.79

Installation and Usage

A demo workflow of the complete code is available in font_classification.ipynb.

1. Clone the GitHub repo:

git clone https://github.com/mishra-kunal1/font_classification.git

2. Change into the directory and install the requirements:

cd font_classification
pip install -r requirements.txt

3. Prepare the dataset:

The following command performs the train, validation, and test split and generates synthetic data.

python prepare_dataset.py
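To make the split step concrete, here is a minimal sketch of a reproducible train/validation/test split over a list of file paths. The fractions, the seed, and the function name are assumptions for illustration; prepare_dataset.py may split the data differently.

```python
import random


def split_items(items, val_fraction=0.1, test_fraction=0.1, seed=0):
    """Shuffle items with a fixed seed and return (train, val, test) lists."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # local RNG, global state untouched

    n_test = int(len(shuffled) * test_fraction)
    n_val = int(len(shuffled) * val_fraction)

    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test
```

Splitting each font folder separately with a helper like this keeps the classes balanced across the three subsets.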

4. Training the model:

4.1 To train LeNet from scratch:

python train.py --model lenet

4.2 To fine-tune ResNet18:

Make sure to remove the grayscale transformation in train.py first.

python train.py --model resnet

4.3 To fine-tune EfficientNet-B1:

Make sure to remove the grayscale transformation in train.py first.

python train.py --model enet

4.4 To resume training from the last checkpoint:

python train.py --model resnet --resume yes
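The commands above pass --model and --resume on the command line. How train.py parses them is not shown here; the sketch below is one plausible way to do it with argparse. The model choices are taken from the commands above, while the "no" value and the default for --resume are assumptions.

```python
import argparse


def build_parser():
    """Build a parser for the flags used in the training commands above."""
    parser = argparse.ArgumentParser(
        description="Train a font classifier (illustrative sketch)."
    )
    parser.add_argument(
        "--model",
        choices=["lenet", "resnet", "enet"],
        required=True,
        help="Which architecture to train.",
    )
    parser.add_argument(
        "--resume",
        choices=["yes", "no"],  # "no" and the default are assumptions
        default="no",
        help="Resume from the last checkpoint if set to 'yes'.",
    )
    return parser


# Example: args = build_parser().parse_args(["--model", "resnet", "--resume", "yes"])
```

Restricting --model to a fixed set of choices makes a mistyped model name fail immediately with a clear error.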

5. Evaluating performance on the test data

Important: make sure to add the test_data_path to the main method of inference.py before running the code.
You can also set test_data_path in inference.ipynb to get evaluation results.

python inference.py

