The model converged faster when trained for 50k steps with a batch size of 540 than when trained for only 10k steps with a batch size of 720 (with a single augmentation applied to each online-generated image sample).
The difference could lie in the choice of pretrained decoder or in the augmentation method employed. Under the current circumstances, the next step would be to keep using the Mengzi decoder and go back to the original augmentation method that simultaneously applies several types of visual distortions, so that we can observe any performance differences between the two data augmentation methods.
The experiment on the augmentation method revealed the inefficiency of mixed augmentations: the model converged more slowly when multiple augmentations were applied to the same image sample.
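For reference, the two strategies can be sketched roughly as below. This is a minimal illustration with torchvision; the specific distortion types and parameters used in the real pipeline are assumptions.

```python
import random
from torchvision import transforms

# Pool of individual visual distortions (types and parameters are illustrative).
distortions = [
    transforms.GaussianBlur(kernel_size=3),
    transforms.ColorJitter(brightness=0.3, contrast=0.3),
    transforms.RandomRotation(degrees=3),
    transforms.ElasticTransform(alpha=30.0),
]

def augment_single(img):
    """New method: apply exactly one randomly chosen distortion per sample."""
    return random.choice(distortions)(img)

def augment_mixed(img):
    """Original method: apply several distortions to the same sample at once."""
    return transforms.Compose(distortions)(img)
```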
If the training on Google Cloud achieved the same result as the July 23 variant, then it should be fair to say that the new data augmentation method works as well as the previous one.
It could also indicate that the old, messier dataset works better.
However, it will not provide much insight into the comparison between linear and bicubic interpolation modes, since the datasets are different.
The training on Google Cloud with linear interpolation showed that the dataset and interpolation mode do not greatly affect model convergence.
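If the interpolation comparison is revisited on a single dataset later, the only difference between the two runs is the resize flag. A sketch assuming torchvision resizing (the target size is illustrative, and "linear" is taken to mean bilinear):

```python
from torchvision import transforms
from torchvision.transforms import InterpolationMode

TARGET_SIZE = (384, 384)  # illustrative input resolution

resize_linear = transforms.Resize(TARGET_SIZE, interpolation=InterpolationMode.BILINEAR)
resize_bicubic = transforms.Resize(TARGET_SIZE, interpolation=InterpolationMode.BICUBIC)
```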
So the gap in performance might lie in the new elastic method, the new text sampling technique, or the introduction of attention_mask.
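As a note on the last point, the attention_mask here is simply the padding mask returned by the tokenizer alongside the token ids, so padded label positions can be excluded from attention. A sketch assuming a Hugging Face tokenizer; the Mengzi checkpoint name is an assumption:

```python
from transformers import AutoTokenizer

# Checkpoint name is an assumption; any Mengzi tokenizer exposes the same interface.
tokenizer = AutoTokenizer.from_pretrained("Langboat/mengzi-t5-base")

batch = tokenizer(
    ["示例文本一", "一个更长的示例文本"],
    padding=True,           # pad to the longest sequence in the batch
    return_tensors="pt",
)

# attention_mask marks real tokens (1) versus padding (0).
input_ids = batch["input_ids"]
attention_mask = batch["attention_mask"]
```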