A package for easily using the models from the 'COMICS TEXT+' paper.

gsoykan/comics_ocr

To build locally after cloning:

pip install comics-ocr[cuda] -f https://download.pytorch.org/whl/torch_stable.html

or, for a CPU-only setup:

pip install comics-ocr[cpu]

You can get the necessary model checkpoints and configs from the COMICS TEXT+ repository.
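
Before constructing the model, it can help to confirm that the downloaded config and checkpoint files are where you expect them. The following is a minimal sketch using only the standard library; `find_missing_files` and the paths in `required` are hypothetical names chosen for illustration, not part of `comics_ocr`.

```python
from pathlib import Path


def find_missing_files(paths):
    """Return the entries of `paths` that do not exist as regular files."""
    return [str(p) for p in paths if not Path(p).is_file()]


# Hypothetical locations; replace with wherever you saved the downloaded files.
required = [
    "checkpoints/detector_config.py",
    "checkpoints/detector.pth",
]
missing = find_missing_files(required)
if missing:
    print("Missing model files:", ", ".join(missing))
```

Running a check like this first gives a clearer message than a failure deep inside model loading.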

Usage

# Import library
from comics_ocr import ComicsOCR

# Initialize the model
e2e_ocr_model = ComicsOCR(
    ocr_detector_config="fcenet_r50dcnv2_fpn_1500e_ctw1500_custom/fcenet_r50dcnv2_fpn_1500e_ctw1500_custom.py",
    ocr_detector_checkpoint='fcenet_r50dcnv2_fpn_1500e_ctw1500_custom/best_0_hmean-iou:hmean_epoch_5.pth',
    recog_config='master_custom_dataset.py',
    ocr_recognition_checkpoint='best_0_1-N.E.D_epoch_4.pth',
    det='FCE_CTW_DCNv2',
    recog='MASTER')

# Run the model
img_path = "speech_bubble/0/3/9.jpg"
text, preprocessed_text, sanitized_text = e2e_ocr_model.extract_text(img_path)
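
To process several images, the single call above can be wrapped in a small loop. This is a sketch only: `extract_text_batch` is a hypothetical helper, not part of `comics_ocr`, and it assumes `extract_text` returns the three values shown above for each image path.

```python
def extract_text_batch(model, image_paths):
    """Run model.extract_text on each path and collect the results by path.

    `model` is assumed to be any object whose extract_text(path) returns
    (text, preprocessed_text, sanitized_text), as in the usage example.
    """
    results = {}
    for path in image_paths:
        text, preprocessed_text, sanitized_text = model.extract_text(path)
        results[path] = {
            "text": text,
            "preprocessed_text": preprocessed_text,
            "sanitized_text": sanitized_text,
        }
    return results
```

With the model from the usage example, this would be called as `extract_text_batch(e2e_ocr_model, ["speech_bubble/0/3/9.jpg"])`. Reusing one `ComicsOCR` instance across images means the model is constructed only once.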
